DOE Office of Scientific and Technical Information (OSTI.GOV)
Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.
1994-12-31
Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less
Evolution of the alternative AQP2 gene: Acquisition of a novel protein-coding sequence in dolphins.
Kishida, Takushi; Suzuki, Miwa; Takayama, Asuka
2018-01-01
Taxon-specific de novo protein-coding sequences are thought to be important for taxon-specific environmental adaptation. A recent study revealed that bottlenose dolphins acquired a novel isoform of aquaporin 2 generated by alternative splicing (alternative AQP2), which helps dolphins to live in hyperosmotic seawater. The AQP2 gene consists of four exons, but the alternative AQP2 gene lacks the fourth exon and instead has a longer third exon that includes the original third exon and a part of the original third intron. Here, we show that the latter half of the third exon of the alternative AQP2 arose from a non-protein-coding sequence. Intact ORF of this de novo sequence is shared not by all cetaceans, but only by delphinoids. However, this sequence is conservative in all modern cetaceans, implying that this de novo sequence potentially plays important roles for marine adaptation in cetaceans. Copyright © 2017 Elsevier Inc. All rights reserved.
Third International Meeting on Esterases Reacting with Organophosphorus Compounds
1998-01-01
cassette for negative selection, 884 bp of ACHE including exon 1, 1.6 kb of a Neor gene cassette for positive selection, 5.2 kb of the ACHE Bam HI...fragment including exon 6, and 3 kb of Bluescript. Deletion of exons 2-5 removed 80% of the ACHE coding sequence. The gene targeting vector was...expression due to environmental influences on CYP3A4 and the presence or absence of CYP3A5 which may be under genetic control in man. Plasma
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2015-01-01
Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior
2011-09-23
Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.
Vouille, V; Amiche, M; Nicolas, P
1997-09-01
We cloned the genes of two members of the dermaseptin family, broad-spectrum antimicrobial peptides isolated from the skin of the arboreal frog Phyllomedusa bicolor. The dermaseptin gene Drg2 has a 2-exon coding structure interrupted by a small 137-bp intron, wherein exon 1 encoded a 22-residue hydrophobic signal peptide and the first three amino acids of the acidic propiece; exon 2 contained the 18 additional acidic residues of the propiece plus a typical prohormone processing signal Lys-Arg and a 32-residue dermaseptin progenitor sequence. The dermaseptin genes Drg2 and Drg1g2 have conserved sequences at both untranslated ends and in the first and second coding exons. In contrast, Drg1g2 comprises a third coding exon for a short version of the acidic propiece and a second dermaseptin progenitor sequence. Structural conservation between the two genes suggests that Drg1g2 arose recently from an ancestral Drg2-like gene through amplification of part of the second coding exon and 3'-untranslated region. Analysis of the cDNAs coding precursors for several frog skin peptides of highly different structures and activities demonstrates that the signal peptides and part of the acidic propieces are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The organization of the genes that belong to this family, with the signal peptide and the progenitor sequence on separate exons, permits strikingly different peptides to be directed into the secretory pathway. The recruitment of such a homologous 'secretory' exon by otherwise non-homologous genes may have been an early event in the evolution of amphibian.
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa
2015-01-01
Background Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. Results One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. Conclusions These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity. PMID:26693737
Eisenberger, Tobias; Neuhaus, Christine; Khan, Arif O.; Decker, Christian; Preising, Markus N.; Friedburg, Christoph; Bieg, Anika; Gliem, Martin; Issa, Peter Charbel; Holz, Frank G.; Baig, Shahid M.; Hellenbroich, Yorck; Galvez, Alberto; Platzer, Konrad; Wollnik, Bernd; Laddach, Nadja; Ghaffari, Saeed Reza; Rafati, Maryam; Botzenhart, Elke; Tinschert, Sigrid; Börger, Doris; Bohring, Axel; Schreml, Julia; Körtge-Jung, Stefani; Schell-Apacik, Chayim; Bakur, Khadijah; Al-Aama, Jumana Y.; Neuhann, Teresa; Herkenrath, Peter; Nürnberg, Gudrun; Nürnberg, Peter; Davis, John S.; Gal, Andreas; Bergmann, Carsten; Lorenz, Birgit; Bolz, Hanno J.
2013-01-01
Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover “hidden mutations” such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5′ exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5′-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even truncating mutations may be misleading. PMID:24265693
Eisenberger, Tobias; Neuhaus, Christine; Khan, Arif O; Decker, Christian; Preising, Markus N; Friedburg, Christoph; Bieg, Anika; Gliem, Martin; Charbel Issa, Peter; Holz, Frank G; Baig, Shahid M; Hellenbroich, Yorck; Galvez, Alberto; Platzer, Konrad; Wollnik, Bernd; Laddach, Nadja; Ghaffari, Saeed Reza; Rafati, Maryam; Botzenhart, Elke; Tinschert, Sigrid; Börger, Doris; Bohring, Axel; Schreml, Julia; Körtge-Jung, Stefani; Schell-Apacik, Chayim; Bakur, Khadijah; Al-Aama, Jumana Y; Neuhann, Teresa; Herkenrath, Peter; Nürnberg, Gudrun; Nürnberg, Peter; Davis, John S; Gal, Andreas; Bergmann, Carsten; Lorenz, Birgit; Bolz, Hanno J
2013-01-01
Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover "hidden mutations" such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5' exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5'-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even truncating mutations may be misleading.
A Dual Origin of the Xist Gene from a Protein-Coding Gene and a Set of Transposable Elements
Elisaphenko, Eugeny A.; Kolesnikov, Nikolay N.; Shevchenko, Alexander I.; Rogozin, Igor B.; Nesterova, Tatyana B.; Brockdorff, Neil; Zakian, Suren M.
2008-01-01
X-chromosome inactivation, which occurs in female eutherian mammals is controlled by a complex X-linked locus termed the X-inactivation center (XIC). Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons including those with simple tandem repeats detectable in their structure have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA. PMID:18575625
SEQassembly: A Practical Tools Program for Coding Sequences Splicing
NASA Astrophysics Data System (ADS)
Lee, Hongbin; Yang, Hang; Fu, Lei; Qin, Long; Li, Huili; He, Feng; Wang, Bo; Wu, Xiaoming
CDS (Coding Sequences) is a portion of mRNA sequences, which are composed by a number of exon sequence segments. The construction of CDS sequence is important for profound genetic analysis such as genotyping. A program in MATLAB environment is presented, which can process batch of samples sequences into code segments under the guide of reference exon models, and splice these code segments of same sample source into CDS according to the exon order in queue file. This program is useful in transcriptional polymorphism detection and gene function study.
Pianigiani, Giulia; Licastro, Danilo; Fortugno, Paola; Castiglia, Daniele; Petrovic, Ivana; Pagani, Franco
2018-06-12
MicroRNAs are found throughout the genome and are processed by the microprocessor complex (MPC) from longer precursors. Some precursor miRNAs overlap intron:exon junctions. These Splice site Overlapping microRNAs (SO-miRNAs) are mostly located in coding genes. It has been intimated, in the rarer examples of SO-miRNAs in non-coding RNAs, that the competition between the spliceosome and the MPC modulates alternative splicing. However, the effect of this overlap on coding transcripts is unknown. Unexpectedly, we show that neither Drosha silencing nor SF3b1 silencing changed the inclusion ratio of SO-miRNA exons. Two SO-miRNAs, located in genes that code for basal membrane proteins, are known to inhibit proliferation in primary keratinocytes. These SO-miRNAs were upregulated during differentiation and the host mRNAs were downregulated, but again there was no change in inclusion ratio of the SO-miRNA exons. Interestingly, Drosha silencing increased nascent RNA density, on chromatin, downstream of SO-miRNA exons. Overall our data suggest a novel mechanism for regulating gene expression in which MPC-dependent cleavage of SO-miRNA exons could cause premature transcriptional termination of coding genes rather than affecting alternative splicing. Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Splicing regulation and dysregulation of cholinergic genes expressed at the neuromuscular junction.
Ohno, Kinji; Rahman, Mohammad Alinoor; Nazim, Mohammad; Nasrin, Farhana; Lin, Yingni; Takeda, Jun-Ichi; Masuda, Akio
2017-08-01
We humans have evolved by acquiring diversity of alternative RNA metabolisms including alternative means of splicing and transcribing non-coding genes, and not by acquiring new coding genes. Tissue-specific and developmental stage-specific alternative RNA splicing is achieved by tightly regulated spatiotemporal regulation of expressions and activations of RNA-binding proteins that recognize their cognate splicing cis-elements on nascent RNA transcripts. Genes expressed at the neuromuscular junction are also alternatively spliced. In addition, germline mutations provoke aberrant splicing by compromising binding of RNA-binding proteins, and cause congenital myasthenic syndromes (CMS). We present physiological splicing mechanisms of genes for agrin (AGRN), acetylcholinesterase (ACHE), MuSK (MUSK), acetylcholine receptor (AChR) α1 subunit (CHRNA1), and collagen Q (COLQ) in human, and their aberration in diseases. Splicing isoforms of AChE T , AChE H , and AChE R are generated by hnRNP H/F. Skipping of MUSK exon 10 makes a Wnt-insensitive MuSK isoform, which is unique to human. Skipping of exon 10 is achieved by coordinated binding of hnRNP C, YB-1, and hnRNP L to exon 10. Exon P3A of CHRNA1 is alternatively included to generate a non-functional AChR α1 subunit in human. Molecular dissection of splicing mutations in patients with CMS reveals that exon P3A is alternatively skipped by hnRNP H, polypyrimidine tract-binding protein 1, and hnRNP L. Similarly, analysis of an exonic mutation in COLQ exon 16 in a CMS patient discloses that constitutive splicing of exon 16 requires binding of serine arginine-rich splicing factor 1. Intronic and exonic splicing mutations in CMS enable us to dissect molecular mechanisms underlying alternative and constitutive splicing of genes expressed at the neuromuscular junction. This is an article for the special issue XVth International Symposium on Cholinergic Mechanisms. © 2017 International Society for Neurochemistry.
Turco, Gina; Schnable, James C.; Pedersen, Brent; Freeling, Michael
2013-01-01
Conserved non-coding sequences (CNS) are islands of non-coding sequence that, like protein coding exons, show less divergence in sequence between related species than functionless DNA. Several CNSs have been demonstrated experimentally to function as cis-regulatory regions. However, the specific functions of most CNSs remain unknown. Previous searches for CNS in plants have either anchored on exons and only identified nearby sequences or required years of painstaking manual annotation. Here we present an open source tool that can accurately identify CNSs between any two related species with sequenced genomes, including both those immediately adjacent to exons and distal sequences separated by >12 kb of non-coding sequence. We have used this tool to characterize new motifs, associate CNSs with additional functions, and identify previously undetected genes encoding RNA and protein in the genomes of five grass species. We provide a list of 15,363 orthologous CNSs conserved across all grasses tested. We were also able to identify regulatory sequences present in the common ancestor of grasses that have been lost in one or more extant grass lineages. Lists of orthologous gene pairs and associated CNSs are provided for reference inbred lines of arabidopsis, Japonica rice, foxtail millet, sorghum, brachypodium, and maize. PMID:23874343
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene.
Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil
2007-11-29
Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains.
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene
Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil
2007-01-01
Background Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Results Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. Conclusion The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains. PMID:18047649
Yin, Changchuan
2015-04-01
To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K
2008-01-01
Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis. PMID:18954468
Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K
2008-10-28
The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis.
Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M
2017-01-01
Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
Meher, J K; Meher, P K; Dash, G N; Raval, M K
2012-01-01
The first step in gene identification problem based on genomic signal processing is to convert character strings into numerical sequences. These numerical sequences are then analysed spectrally or using digital filtering techniques for the period-3 peaks, which are present in exons (coding areas) and absent in introns (non-coding areas). In this paper, we have shown that single-indicator sequences can be generated by encoding schemes based on physico-chemical properties. Two new methods are proposed for generating single-indicator sequences based on hydration energy and dipole moments. The proposed methods produce high peak at exon locations and effectively suppress false exons (intron regions having greater peak than exon regions) resulting in high discriminating factor, sensitivity and specificity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Umans, L.; Serneels, L.; Hilliker, C.
1994-08-01
The authors have cloned the mouse gene coding for {alpha}{sub 2}-macroglobulin in overlapping {lambda} clones and have analyzed its structure. The gene contains 36 exons, coding for the 4.8-kb cDNA that we cloned previously. Including putative control elements in the 5{prime} flanking region, the gene covers about 45 kb. A region of 3.8 kb, stretching from 835 bases upstream of the cDNA start site to exon 4, including all intervening sequences, was sequenced completely. The analysis demonstrated that the putative promoter region of the mouse A2M gene differed considerably from the known promoter sequences of the human A2M gene andmore » of the rat acute-phas A2M gene. Comparison of the exon-intron structure of all known genes of the A2M family confirmed that the rat acute phase A2M gene is more closely related to the human gene than to the mouse A2M gene. To generate mice with the A2M gene inactivated, an insertion type of construct containing 7.5 kb of genomic DNA of the mouse strain 129/J, encompassing exons 16 to 19, was synthesized. A hygromycin marker gene was embedded in intron 17. After electroporation, 198 hygromycin-resistant ES cell lines were isolated and analyzed by Southern blotting. Five ES cell lines were obtained with one allele of the mouse A2M gene targeted by this insertion construct, demonstrating that the position and the characteristics of the vector served the intended goal.« less
Murray, R; Pederson, K; Prosser, H; Muller, D; Hutchison, C A; Frelinger, J A
1988-01-01
We have used random oligonucleotide mutagenesis (or saturation mutagenesis) to create a library of point mutations in the alpha 1 protein domain of a Major Histocompatibility Complex (MHC) molecule. This protein domain is critical for T cell and B cell recognition. We altered the MHC class I H-2DP gene sequence such that synthetic mutant alpha 1 exons (270 bp of coding sequence), which contain mutations identified by sequence analysis, can replace the wild type alpha 1 exon. The synthetic exons were constructed from twelve overlapping oligonucleotides which contained an average of 1.3 random point mutations per intact exon. DNA sequence analysis of mutant alpha 1 exons has shown a point mutant distribution that fits a Poisson distribution, and thus emphasizes the utility of this mutagenesis technique to "scan" a large protein sequence for important mutations. We report our use of saturation mutagenesis to scan an entire exon of the H-2DP gene, a cassette strategy to replace the wild type alpha 1 exon with individual mutant alpha 1 exons, and analysis of mutant molecules expressed on the surface of transfected mouse L cells. Images PMID:2903482
Zipper plot: visualizing transcriptional activity of genomic regions.
Avila Cobos, Francisco; Anckaert, Jasper; Volders, Pieter-Jan; Everaert, Celine; Rombaut, Dries; Vandesompele, Jo; De Preter, Katleen; Mestdagh, Pieter
2017-05-02
Reconstructing transcript models from RNA-sequencing (RNA-seq) data and establishing these as independent transcriptional units can be a challenging task. Current state-of-the-art tools for long non-coding RNA (lncRNA) annotation are mainly based on evolutionary constraints, which may result in false negatives due to the overall limited conservation of lncRNAs. To tackle this problem we have developed the Zipper plot, a novel visualization and analysis method that enables users to simultaneously interrogate thousands of human putative transcription start sites (TSSs) in relation to various features that are indicative for transcriptional activity. These include publicly available CAGE-sequencing, ChIP-sequencing and DNase-sequencing datasets. Our method only requires three tab-separated fields (chromosome, genomic coordinate of the TSS and strand) as input and generates a report that includes a detailed summary table, a Zipper plot and several statistics derived from this plot. Using the Zipper plot, we found evidence of transcription for a set of well-characterized lncRNAs and observed that fewer mono-exonic lncRNAs have CAGE peaks overlapping with their TSSs compared to multi-exonic lncRNAs. Using publicly available RNA-seq data, we found more than one hundred cases where junction reads connected protein-coding gene exons with a downstream mono-exonic lncRNA, revealing the need for a careful evaluation of lncRNA 5'-boundaries. Our method is implemented using the statistical programming language R and is freely available as a webtool.
Unusual Intron Conservation near Tissue-Regulated Exons Found by Splicing Microarrays
Sugnet, Charles W; Srinivasan, Karpagam; Clark, Tyson A; O'Brien, Georgeann; Cline, Melissa S; Wang, Hui; Williams, Alan; Kulp, David; Blume, John E; Haussler, David; Ares, Manuel
2006-01-01
Alternative splicing contributes to both gene regulation and protein diversity. To discover broad relationships between regulation of alternative splicing and sequence conservation, we applied a systems approach, using oligonucleotide microarrays designed to capture splicing information across the mouse genome. In a set of 22 adult tissues, we observe differential expression of RNA containing at least two alternative splice junctions for about 40% of the 6,216 alternative events we could detect. Statistical comparisons identify 171 cassette exons whose inclusion or skipping is different in brain relative to other tissues and another 28 exons whose splicing is different in muscle. A subset of these exons is associated with unusual blocks of intron sequence whose conservation in vertebrates rivals that of protein-coding exons. By focusing on sets of exons with similar regulatory patterns, we have identified new sequence motifs implicated in brain and muscle splicing regulation. Of note is a motif that is strikingly similar to the branchpoint consensus but is located downstream of the 5′ splice site of exons included in muscle. Analysis of three paralogous membrane-associated guanylate kinase genes reveals that each contains a paralogous tissue-regulated exon with a similar tissue inclusion pattern. While the intron sequences flanking these exons remain highly conserved among mammalian orthologs, the paralogous flanking intron sequences have diverged considerably, suggesting unusually complex evolution of the regulation of alternative splicing in multigene families. PMID:16424921
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kerr, J.M.; Fisher, L.W.; Termine, J.D.
The authors have isolated and partially sequenced the human bone sialoprotein gene (IBSP). IBSP has been sublocalized by in situ hybridization to chromosome 4q38-q31 and is composed of six small exons (51 to 159 bp) and 1 large exon ([approximately]2.6 kb). The intron/exon junctions defined by sequence analysis are of class O, retaining an intact coding triplet. Sequence analysis of the 5[prime] upstream region revealed a TATAA (nucleotides -30 to-25 from the transcriptional start point) and a CCAAT (nucleotides -56 to-52) box, both in the reverse orientation. Intron 1 contains interesting structural elements composed of polypyrimidine repeats followed by amore » poly(AC)[sub n] tract. Both types of structural elements have been detected in promoter regions of other genes and have been implicated in transcriptional regulation. Several differences between the previously published cDNA sequence and the authors' sequence have been identified, most of which are contained within the untranslated exon 1. Three base revisions in the coding region include a G to T (Gly to Val, amino acid 195), T to C (Val to Ala, amino acid 268), and T to A (Glu to Asp, amino acid 270). In conclusion, the genomic organization and potential regulatory elements of human IBSP have been elucidated. 42 refs., 4 figs., 1 tab.« less
SinEx DB: a database for single exon coding sequences in mammalian genomes.
Jorquera, Roddy; Ortiz, Rodrigo; Ossandon, F; Cárdenas, Juan Pablo; Sepúlveda, Rene; González, Carolina; Holmes, David S
2016-01-01
Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as 'single exon genes' (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in its CDS. Whereas, many public databases of Eukaryotic multi-exon genes are available, there are only two specialized databases for SEGs. The present work addresses the need for a more extensive and diverse database by creating SinEx DB, a publicly available, searchable database of predicted SEGs from 10 completely sequenced mammalian genomes including human. SinEx DB houses the DNA and protein sequence information of these SEGs and includes their functional predictions (KOG) and the relative distribution of these functions within species. The information is stored in a relational database built with My SQL Server 5.1.33 and the complete dataset of SEG sequences and their functional predictions are available for downloading. SinEx DB can be interrogated by: (i) a browsable phylogenetic schema, (ii) carrying out BLAST searches to the in-house SinEx DB of SEGs and (iii) via an advanced search mode in which the database can be searched by key words and any combination of searches by species and predicted functions. SinEx DB provides a rich source of information for advancing our understanding of the evolution and function of SEGs.Database URL: www.sinex.cl. © The Author(s) 2016. Published by Oxford University Press.
Zorc, Minja; Kunej, Tanja
2016-05-01
MicroRNAs (miRNAs) are a class of non-coding RNAs involved in posttranscriptional regulation of target genes. Regulation requires complementarity between target mRNA and the mature miRNA seed region, responsible for their recognition and binding. It has been estimated that each miRNA targets approximately 200 genes, and genetic variability of miRNA genes has been reported to affect phenotypic variability and disease susceptibility in humans, livestock species, and model organisms. Polymorphisms in miRNA genes could therefore represent biomarkers for phenotypic traits in livestock animals. In our previous study, we collected polymorphisms within miRNA genes in chicken. In the present study, we identified miRNA-related genomic overlaps to prioritize genomic regions of interest for further functional studies and biomarker discovery. Overlapping genomic regions in chicken were analyzed using the following bioinformatics tools and databases: miRNA SNiPer, Ensembl, miRBase, NCBI Blast, and QTLdb. Out of 740 known pre-miRNA genes, 263 (35.5 %) contain polymorphisms; among them, 35 contain more than three polymorphisms The most polymorphic miRNA genes in chicken are gga-miR-6662, containing 23 single nucleotide polymorphisms (SNPs) within the pre-miRNA region, including five consecutive SNPs, and gga-miR-6688, containing ten polymorphisms including three consecutive polymorphisms. Several miRNA-related genomic hotspots have been revealed in chicken genome; polymorphic miRNA genes are located within protein-coding and/or non-coding transcription units and quantitative trait loci (QTL) associated with production traits. The present study includes the first description of an exonic miRNA in a chicken genome, an overlap between the miRNA gene and the exon of the protein-coding gene (gga-miR-6578/HADHB), and the first report of a missense polymorphism located within a mature miRNA seed region. Identified miRNA-related genomic hotspots in chicken can serve researchers as a starting point for further functional studies and association studies with poultry production and health traits and the basis for systematic screening of exonic miRNAs and missense/miRNA seed polymorphisms in other genomes.
Evaluating the protein coding potential of exonized transposable element sequences
Piriyapongsa, Jittima; Rutledge, Mark T; Patel, Sanil; Borodovsky, Mark; Jordan, I King
2007-01-01
Background Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences have turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: i-by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and ii-through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. Results We compared the ability of three classes of sequence similarity search methods to detect TE-derived sequences among data sets of experimentally characterized proteins: 1-a profile-based hidden Markov model (HMM) approach, 2-BLAST methods and 3-RepeatMasker. Profile based methods are more sensitive and more selective than the other methods evaluated. However, the application of profile-based search methods to the detection of TE-derived sequences among well-curated experimentally characterized protein data sets did not turn up many more cases than had been previously detected and nowhere near as many cases as recent genome-wide searches have. We observed that the different search methods used were complementary in the sense that they yielded largely non-overlapping sets of hits and differed in their ability to recover known cases of TE-derived CDS. The probabilistic analysis of TE-derived exon sequences indicates that these sequences have low protein coding potential on average. In particular, non-autonomous TEs that do not encode protein sequences, such as Alu elements, are frequently exonized but unlikely to encode protein sequences. Conclusion The exaptation of the numerous TE sequences found in exons as bona fide protein coding sequences may prove to be far less common than has been suggested by the analysis of complete genomes. We hypothesize that many exonized TE sequences actually function as post-transcriptional regulators of gene expression, rather than coding sequences, which may act through a variety of double stranded RNA related regulatory pathways. Indeed, their relatively high copy numbers and similarity to sequences dispersed throughout the genome suggests that exonized TE sequences could serve as master regulators with a wide scope of regulatory influence. Reviewers: This article was reviewed by Itai Yanai, Kateryna D. Makova, Melissa Wilson (nominated by Kateryna D. Makova) and Cedric Feschotte (nominated by John M. Logsdon Jr.). PMID:18036258
Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Fang, Zhide; Deininger, Prescott; Zhang, Kun
2013-08-28
The exonization of transposable elements (TEs) has proven to be a significant mechanism for the creation of novel exons. Existing knowledge of the retention patterns of TE exons in mRNAs were mainly established by the analysis of Expressed Sequence Tag (EST) data and microarray data. This study seeks to validate and extend previous studies on the expression of TE exons by an integrative statistical analysis of high throughput RNA sequencing data. We collected 26 RNA-seq datasets spanning multiple tissues and cancer types. The exon-level digital expressions (indicating retention rates in mRNAs) were quantified by a double normalized measure, called the rescaled RPKM (Reads Per Kilobase of exon model per Million mapped reads). We analyzed the distribution profiles and the variability (across samples and between tissue/disease groups) of TE exon expressions, and compared them with those of other constitutive or cassette exons. We inferred the effects of four genomic factors, including the location, length, cognate TE family and TE nucleotide proportion (RTE, see Methods section) of a TE exon, on the exons' expression level and expression variability. We also investigated the biological implications of an assembly of highly-expressed TE exons. Our analysis confirmed prior studies from the following four aspects. First, with relatively high expression variability, most TE exons in mRNAs, especially those without exact counterparts in the UCSC RefSeq (Reference Sequence) gene tables, demonstrate low but still detectable expression levels in most tissue samples. Second, the TE exons in coding DNA sequences (CDSs) are less highly expressed than those in 3' (5') untranslated regions (UTRs). Third, the exons derived from chronologically ancient repeat elements, such as MIRs, tend to be highly expressed in comparison with those derived from younger TEs. Fourth, the previously observed negative relationship between the lengths of exons and the inclusion levels in transcripts is also true for exonized TEs. Furthermore, our study resulted in several novel findings. They include: (1) for the TE exons with non-zero expression and as shown in most of the studied biological samples, a high TE nucleotide proportion leads to their lower retention rates in mRNAs; (2) the considered genomic features (i.e. a continuous variable such as the exon length or a category indicator such as 3'UTR) influence the expression level and the expression variability (CV) of TE exons in an inverse manner; (3) not only the exons derived from Alu elements but also the exons from the TEs of other families were preferentially established in zinc finger (ZNF) genes.
Interactive web-based identification and visualization of transcript shared sequences.
Azhir, Alaleh; Merino, Louis-Henri; Nauen, David W
2018-05-12
We have developed TraC (Transcript Consensus), a web-based tool for detecting and visualizing shared sequences among two or more mRNA transcripts such as splice variants. Results including exon-exon boundaries are returned in a highly intuitive, data-rich, interactive plot that permits users to explore the similarities and differences of multiple transcript sequences. The online tool (http://labs.pathology.jhu.edu/nauen/trac/) is free to use. The source code is freely available for download (https://github.com/nauenlab/TraC). Copyright © 2018 Elsevier Inc. All rights reserved.
Exon 11 skipping of SCN10A coding for voltage-gated sodium channels in dorsal root ganglia
Schirmeyer, Jana; Szafranski, Karol; Leipold, Enrico; Mawrin, Christian; Platzer, Matthias; Heinemann, Stefan H
2014-01-01
The voltage-gated sodium channel NaV1.8 (encoded by SCN10A) is predominantly expressed in dorsal root ganglia (DRG) and plays a critical role in pain perception. We analyzed SCN10A transcripts isolated from human DRGs using deep sequencing and found a novel splice variant lacking exon 11, which codes for 98 amino acids of the domain I/II linker. Quantitative PCR analysis revealed an abundance of this variant of up to 5–10% in human, while no such variants were detected in mouse or rat. Since no obvious functional differences between channels with and without the exon-11 sequence were detected, it is suggested that SCN10A exon 11 skipping in humans is a tolerated event. PMID:24763188
Multi-step splicing of sphingomyelin synthase linear and circular RNAs.
Filippenkov, Ivan B; Sudarkina, Olga Yu; Limborska, Svetlana A; Dergunova, Lyudmila V
2018-05-15
The SGMS1 gene encodes the enzyme sphingomyelin synthase 1 (SMS1), which is involved in the regulation of lipid metabolism, apoptosis, intracellular vesicular transport and other significant processes. The SGMS1 gene is located on chromosome 10 and has a size of 320 kb. Previously, we showed that dozens of alternative transcripts of the SGMS1 gene are present in various human tissues. In addition to mRNAs that provide synthesis of the SMS1 protein, this gene participates in the synthesis of non-coding transcripts, including circular RNAs (circRNAs), which include exons of the 5'-untranslated region (5'-UTR) and are highly represented in the brain. In this study, using the high-throughput technology RNA-CaptureSeq, many new SGMS1 transcripts were identified, including both intronic unspliced RNAs (premature RNAs) and RNAs formed via alternative splicing. Recursive exons (RS-exons) that can participate in the multi-step splicing of long introns of the gene were also identified. These exons participate in the formation of circRNAs. Thus, multi-step splicing may provide a variety of linear and circular RNAs of eukaryotic genes in tissues. Copyright © 2018 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dyer, K.D.; Handen, J.S.; Rosenberg, H.F.
The Charcot-Leyden crystal (CLC) protein, or eosinophil lysophospholipase, is a characteristic protein of human eosinophils and basophils; recent work has demonstrated that the CLC protein is both structurally and functionally related to the galectin family of {beta}-galactoside binding proteins. The galectins as a group share a number of features in common, including a linear ligand binding site encoded on a single exon. In this work, we demonstrate that the intron-exon structure of the gene encoding CLC is analogous to those encoding the galectins. The coding sequence of the CLC gene is divided into four exons, with the entire {beta}-galactoside bindingmore » site encoded by exon III. We have isolated CLC {beta}-galactoside binding sites from both orangutan (Pongo pygmaeus) and murine (Mus musculus) genomic DNAs, both encoded on single exons, and noted conservation of the amino acids shown to interact directly with the {beta}-galactoside ligand. The most likely interpretation of these results suggests the occurrence of one or more exon duplication and insertion events, resulting in the distribution of this lectin domain to CLC as well as to the multiple galectin genes. 35 refs., 3 figs.« less
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene.
Levy-Lahad, E; Poorkaj, P; Wang, K; Fu, Y H; Oshima, J; Mulligan, J; Schellenberg, G D
1996-06-01
Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23,737 bp. The first 2 exons encode the 5'-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splice acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system.
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Levy-Lahad, E.; Wang, Kai; Fu, Ying Hui
1996-06-01
Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23, 737 bp. The first 2 exons encode the 5{prime}-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splicemore » acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system. 19 refs., 2 figs., 3 tabs.« less
Exome-wide DNA capture and next generation sequencing in domestic and wild species.
Cosart, Ted; Beja-Pereira, Albano; Chen, Shanyuan; Ng, Sarah B; Shendure, Jay; Luikart, Gordon
2011-07-05
Gene-targeted and genome-wide markers are crucial to advance evolutionary biology, agriculture, and biodiversity conservation by improving our understanding of genetic processes underlying adaptation and speciation. Unfortunately, for eukaryotic species with large genomes it remains costly to obtain genome sequences and to develop genome resources such as genome-wide SNPs. A method is needed to allow gene-targeted, next-generation sequencing that is flexible enough to include any gene or number of genes, unlike transcriptome sequencing. Such a method would allow sequencing of many individuals, avoiding ascertainment bias in subsequent population genetic analyses.We demonstrate the usefulness of a recent technology, exon capture, for genome-wide, gene-targeted marker discovery in species with no genome resources. We use coding gene sequences from the domestic cow genome sequence (Bos taurus) to capture (enrich for), and subsequently sequence, thousands of exons of B. taurus, B. indicus, and Bison bison (wild bison). Our capture array has probes for 16,131 exons in 2,570 genes, including 203 candidate genes with known function and of interest for their association with disease and other fitness traits. We successfully sequenced and mapped exon sequences from across the 29 autosomes and X chromosome in the B. taurus genome sequence. Exon capture and high-throughput sequencing identified thousands of putative SNPs spread evenly across all reference chromosomes, in all three individuals, including hundreds of SNPs in our targeted candidate genes. This study shows exon capture can be customized for SNP discovery in many individuals and for non-model species without genomic resources. Our captured exome subset was small enough for affordable next-generation sequencing, and successfully captured exons from a divergent wild species using the domestic cow genome as reference.
Ortuño-Pineda, Carlos; Galindo-Rosales, José Manuel; Calderón-Salinas, José Victor; Villegas-Sepúlveda, Nicolás; Saucedo-Cárdenas, Odila; De Nova-Ocampo, Mónica; Valdés, Jesús
2012-01-01
The splicing of the N exon in the pre-mRNA coding for the RE1-silencing transcription factor (REST) results in a truncated protein that modifies the expression pattern of some of its target genes. A weak 3'ss, three alternative 5'ss (N4-, N50-, and N62-5'ss) and a variety of putative target sites for splicing regulatory proteins are found around the N exon; two GGGG codes (G2-G3) and a poly-Uridine tract (N-PU) are found in front of the N50-5'ss. In this work we analyzed some of the regulatory factors and elements involved in the preferred selection of the N50-5'ss (N50 activation) in the small cell lung cancer cell line H69. Wild type and mutant N exon/β-globin minigenes recapitulated N50 exon splicing in H69 cells, and showed that the N-PU and the G2-G3 elements are required for N50 exon splicing. Biochemical and knockdown experiments identified these elements as U2AF65 and hnRNP H targets, respectively, and that they are also required for N50 exon activation. Compared to normal MRC5 cells, and in keeping with N50 exon activation, U2AF65, hnRNP H and other splicing factors were highly expressed in H69 cells. CLIP experiments revealed that hnRNP H RNA-binding occurs first and is a prerequisite for U2AF65 RNA binding, and EMSA and CLIP experiments suggest that U2AF65-RNA recognition displaces hnRNP H and helps to recruit other splicing factors (at least U1 70K) to the N50-5'ss. Our results evidenced novel hnRNP H and U2AF65 functions: respectively, U2AF65-recruiting to a 5'ss in humans and the hnRNP H-displacing function from two juxtaposed GGGG codes. PMID:22792276
Toyoda, N; Kleinhaus, N; Larsen, P R
1996-06-01
We analyzed the exon-intron structure of the human type 1 deiodinase gene (dio1) and compared it with that of a patient with suspected congenital type 1 deiodinase (D1) deficiency. The hdio1 gene is identical in exon-intron arrangement to the mouse gene, with coding sequences and a selenocysteine insertion sequence (SECIS) element contained in four exons. There were no mutations in the sequences of exons 1-4 of the patient's genomic DNA. Functional studies by transient expression techniques showed no difference in basal promoter activity or T3 responsiveness between the patient's and the normal dio1 gene. A structural abnormality in the dio1 gene is not a likely explanation for this patient's D1-deficient phenotype.
The Exon-Florio National Security Test for Foreign Investment
2006-03-15
Congressional Research Service ˜ The Library of Congress CRS Report for Congress Received through the CRS Web Order Code RL33312 The Exon- Florio ...number. 1. REPORT DATE 15 MAR 2006 2. REPORT TYPE N/A 3. DATES COVERED - 4. TITLE AND SUBTITLE The Exon- Florio National Security Test for...Z39-18 The Exon- Florio National Security Test for Foreign Investment Summary The proposed acquisitions of major operations in six major U.S. ports by
A rare coding variant in TREM2 increases risk for Alzheimer's disease in Han Chinese.
Jiang, Teng; Tan, Lan; Chen, Qi; Tan, Meng-Shan; Zhou, Jun-Shan; Zhu, Xi-Chen; Lu, Huan; Wang, Hui-Fu; Zhang, Ying-Dong; Yu, Jin-Tai
2016-06-01
Two recent studies have identified that a rare coding variant (p.R47H) in exon 2 of triggering receptor expressed on myeloid cells 2 (TREM2) gene is associated with Alzheimer's disease (AD) susceptibility in Caucasians. This association was not successfully replicated in Han Chinese, where this variant was rare or even absent. Previously, we resequenced TREM2 exon 2 to investigate whether additional rare variants conferred risk to AD in our cohort. Although several new variants had been identified, none of them was significantly associated with disease susceptibility. Here, to test whether TREM2 is truly a susceptibility gene of AD in Han Chinese, we extend our previous study by sequencing the other four exons of TREM2 in 988 AD patients and 1,354 healthy controls. We provided the first evidence that a rare coding variant (p.H157Y) in TREM2 exon 3 conferred a considerable risk of AD in our cohort (Pcorrected = 0.02, odds ratio = 11.01, 95% confidence interval: 1.38-88.05). This finding indicates that rare coding variants of TREM2 may play an important role in AD in Han Chinese. Copyright © 2016 Elsevier Inc. All rights reserved.
A mechanism for exon skipping caused by nonsense or missense mutations in BRCA1 and other genes.
Liu, H X; Cartegni, L; Zhang, M Q; Krainer, A R
2001-01-01
Point mutations can generate defective and sometimes harmful proteins. The nonsense-mediated mRNA decay (NMD) pathway minimizes the potential damage caused by nonsense mutations. In-frame nonsense codons located at a minimum distance upstream of the last exon-exon junction are recognized as premature termination codons (PTCs), targeting the mRNA for degradation. Some nonsense mutations cause skipping of one or more exons, presumably during pre-mRNA splicing in the nucleus; this phenomenon is termed nonsense-mediated altered splicing (NAS), and its underlying mechanism is unclear. By analyzing NAS in BRCA1, we show here that inappropriate exon skipping can be reproduced in vitro, and results from disruption of a splicing enhancer in the coding sequence. Enhancers can be disrupted by single nonsense, missense and translationally silent point mutations, without recognition of an open reading frame as such. These results argue against a nuclear reading-frame scanning mechanism for NAS. Coding-region single-nucleotide polymorphisms (cSNPs) within exonic splicing enhancers or silencers may affect the patterns or efficiency of mRNA splicing, which may in turn cause phenotypic variability and variable penetrance of mutations elsewhere in a gene.
Global Identification and Characterization of Transcriptionally Active Regions in the Rice Genome
Stolc, Viktor; Deng, Wei; He, Hang; Korbel, Jan; Chen, Xuewei; Tongprasit, Waraporn; Ronald, Pamela; Chen, Runsheng; Gerstein, Mark; Wang Deng, Xing
2007-01-01
Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza. sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome. PMID:17372628
Kim, Yoonhee; Suktitipat, Bhoom; Yanek, Lisa R.; Faraday, Nauder; Wilson, Alexander F.; Becker, Diane M.; Becker, Lewis C.; Mathias, Rasika A.
2013-01-01
Platelet aggregation is heritable, and genome-wide association studies have detected strong associations with a common intronic variant of the platelet endothelial aggregation receptor1 (PEAR1) gene both in African American and European American individuals. In this study, we used a sequencing approach to identify additional exonic variants in PEAR1 that may also determine variability in platelet aggregation in the GeneSTAR Study. A 0.3 Mb targeted region on chromosome 1q23.1 including the entire PEAR1 gene was Sanger sequenced in 104 subjects (45% male, 49% African American, age = 52±13) selected on the basis of hyper- and hypo- aggregation across three different agonists (collagen, epinephrine, and adenosine diphosphate). Single-variant and multi-variant burden tests for association were performed. Of the 235 variants identified through sequencing, 61 were novel, and three of these were missense variants. More rare variants (MAF<5%) were noted in African Americans compared to European Americans (108 vs. 45). The common intronic GWAS-identified variant (rs12041331) demonstrated the most significant association signal in African Americans (p = 4.020×10−4); no association was seen for additional exonic variants in this group. In contrast, multi-variant burden tests indicated that exonic variants play a more significant role in European Americans (p = 0.0099 for the collective coding variants compared to p = 0.0565 for intronic variant rs12041331). Imputation of the individual exonic variants in the rest of the GeneSTAR European American cohort (N = 1,965) supports the results noted in the sequenced discovery sample: p = 3.56×10−4, 2.27×10−7, 5.20×10−5 for coding synonymous variant rs56260937 and collagen, epinephrine and adenosine diphosphate induced platelet aggregation, respectively. Sequencing approaches confirm that a common intronic variant has the strongest association with platelet aggregation in African Americans, and show that exonic variants play an additional role in platelet aggregation in European Americans. PMID:23704978
Detection of BRCA1 gross rearrangements by droplet digital PCR.
Preobrazhenskaya, Elena V; Bizin, Ilya V; Kuligina, Ekatherina Sh; Shleykina, Alla Yu; Suspitsin, Evgeny N; Zaytseva, Olga A; Anisimova, Elena I; Laptiev, Sergey A; Gorodnova, Tatiana V; Belyaev, Alexey M; Imyanitov, Evgeny N; Sokolenko, Anna P
2017-10-01
Large genomic rearrangements (LGRs) constitute a significant share of pathogenic BRCA1 mutations. Multiplex ligation-dependent probe amplification (MLPA) is a leading method for LGR detection; however, it is entirely based on the use of commercial kits, includes relatively time-consuming hybridization step, and is not convenient for large-scale screening of recurrent LGRs. We developed and validated the droplet digital PCR (ddPCR) assay, which covers the entire coding region of BRCA1 gene and is capable to precisely quantitate the copy number for each exon. 141 breast cancer (BC) patients, who demonstrated evident clinical features of hereditary BC but turned out to be negative for founder BRCA1/2 mutations, were subjected to the LGR analysis. Four patients with LGR were identified, with three cases of exon 8 deletion and one women carrying the deletion of exons 5-7. Excellent concordance with MLPA test was observed. Exon 8 copy number was tested in additional 720 BC and 184 ovarian cancer (OC) high-risk patients, and another four cases with the deletion were revealed; MLPA re-analysis demonstrated that exon 8 loss was a part of a larger genetic alteration in two cases, while the remaining two patients had isolated defect of exon 8. Long-range PCR and next generation sequencing of DNA samples carrying exon 8 deletion revealed two types of recurrent LGRs. Droplet digital PCR is a reliable tool for the detection of large genomic rearrangements.
Evaluation of the phospholamban gene in purebred large-breed dogs with dilated cardiomyopathy.
Stabej, Polona; Leegwater, Peter A; Stokhof, Arnold A; Domanjko-Petric, Aleksandra; van Oost, Bernard A
2005-03-01
To evaluate the role of the phospholamban gene in purebred large-breed dogs with dilated cardiomyopathy (DCM). 6 dogs with DCM, including 2 Doberman Pinschers, 2 Newfoundlands, and 2 Great Danes. All dogs had clinical signs of congestive heart failure, and a diagnosis of DCM was made on the basis of echocardiographic findings. Blood samples were collected from each dog, and genomic DNA was isolated by a salt extraction method. Specific oligonucleotides were designed to amplify the promoter, exon 1, the 5'-part of exon 2 including the complete coding region, and part of intron 1 of the canine phospholamban gene via polymerase chain reaction procedures. These regions were screened for mutations in DNA obtained from the 6 dogs with DCM. No mutations were identified in the promoter, 5' untranslated region, part of intron 1, part of the 3' untranslated region, and the complete coding region of the phospholamban gene in dogs with DCM. Results indicate that mutations in the phospholamban gene are not a frequent cause of DCM in Doberman Pinschers, Newfoundlands, and Great Danes.
MutPred Splice: machine learning-based prediction of exonic variants that disrupt splicing
2014-01-01
We have developed a novel machine-learning approach, MutPred Splice, for the identification of coding region substitutions that disrupt pre-mRNA splicing. Applying MutPred Splice to human disease-causing exonic mutations suggests that 16% of mutations causing inherited disease and 10 to 14% of somatic mutations in cancer may disrupt pre-mRNA splicing. For inherited disease, the main mechanism responsible for the splicing defect is splice site loss, whereas for cancer the predominant mechanism of splicing disruption is predicted to be exon skipping via loss of exonic splicing enhancers or gain of exonic splicing silencer elements. MutPred Splice is available at http://mutdb.org/mutpredsplice. PMID:24451234
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pham-Dinh, D.; Gaspera, D.B.; Dautigny, A.
1995-09-20
Myelin/oligodendrocyte glycoprotein (MOG), a special component of the central nervous system localization on the outermost lamellae of mature myelin, is a member of the immunoglobulin superfamily. We report here the organization of the human MOG gene, which spans approximately 17 kb, and the characterization of six MOG mRNA splicing variants. The intron/exon structure of the human MOG gene confirmed the splicing pattern, supporting the hypothesis that mRNA isoforms could arise by alternative splicing of a single gene. In addition to the eight exons coding for the major MOG isoform, the human MOG gene also contains 3` region, a previously unknownmore » alternatively spliced coding exon, VIA. Alternative utilization of two acceptor splicing sites for exon VIII could produce two different C-termini. The nucleotide sequences presented here may be a useful tool to study further possible involvement if the MOG gene in hereditary neurological disorders. 23 refs., 5 figs.« less
Chograni, Manèl; Rejeb, Imen; Jemaa, Lamia Ben; Châabouni, Myriam; Bouhamed, Habiba Chaabouni
2011-01-01
Nance-Horan Syndrome (NHS) or X-linked cataract-dental syndrome is a disease of unknown gene action mechanism, characterized by congenital cataract, dental anomalies, dysmorphic features and, in some cases, mental retardation. We performed linkage analysis in a Tunisian family with NHS in which affected males and obligate carrier female share a common haplotype in the Xp22.32-p11.21 region that contains the NHS gene. Direct sequencing of NHS coding exons and flanking intronic sequences allowed us to identify the first missense mutation (P551S) and a reported SNP-polymorphism (L1319F) in exon 6, a reported UTR–SNP (c.7422 C>T) and a novel one (c.8239 T>A) in exon 8. Both variations P551S and c.8239 T>A segregate with NHS phenotype in this family. Although truncations, frame-shift and copy number variants have been reported in this gene, no missense mutations have been found to segregate previously. This is the first report of a missense NHS mutation causing NHS phenotype (including cardiac defects). We hypothesize also that the non-reported UTR–SNP of the exon 8 (3′-UTR) is specific to the Tunisian population. PMID:21559051
Chograni, Manèl; Rejeb, Imen; Jemaa, Lamia Ben; Châabouni, Myriam; Bouhamed, Habiba Chaabouni
2011-08-01
Nance-Horan Syndrome (NHS) or X-linked cataract-dental syndrome is a disease of unknown gene action mechanism, characterized by congenital cataract, dental anomalies, dysmorphic features and, in some cases, mental retardation. We performed linkage analysis in a Tunisian family with NHS in which affected males and obligate carrier female share a common haplotype in the Xp22.32-p11.21 region that contains the NHS gene. Direct sequencing of NHS coding exons and flanking intronic sequences allowed us to identify the first missense mutation (P551S) and a reported SNP-polymorphism (L1319F) in exon 6, a reported UTR-SNP (c.7422 C>T) and a novel one (c.8239 T>A) in exon 8. Both variations P551S and c.8239 T>A segregate with NHS phenotype in this family. Although truncations, frame-shift and copy number variants have been reported in this gene, no missense mutations have been found to segregate previously. This is the first report of a missense NHS mutation causing NHS phenotype (including cardiac defects). We hypothesize also that the non-reported UTR-SNP of the exon 8 (3'-UTR) is specific to the Tunisian population.
Duellman, Tyler; Warren, Christopher; Yang, Jay
2014-01-01
Microribonucleic acids (miRNAs) work with exquisite specificity and are able to distinguish a target from a non-target based on a single nucleotide mismatch in the core nucleotide domain. We questioned whether miRNA regulation of gene expression could occur in a single nucleotide polymorphism (SNP)-specific manner, manifesting as a post-transcriptional control of expression of genetic polymorphisms. In our recent study of the functional consequences of matrix metalloproteinase (MMP)-9 SNPs, we discovered that expression of a coding exon SNP in the pro-domain of the protein resulted in a profound decrease in the secreted protein. This missense SNP results in the N38S amino acid change and a loss of an N-glycosylation site. A systematic study demonstrated that the loss of secreted protein was due not to the loss of an N-glycosylation site, but rather an SNP-specific targeting by miR-671-3p and miR-657. Bioinformatics analysis identified 41 SNP-specific miRNA targeting MMP-9 SNPs, mostly in the coding exon and an extension of the analysis to chromosome 20, where the MMP-9 gene is located, suggesting that SNP-specific miRNAs targeting the coding exon are prevalent. This selective post-transcriptional regulation of a target messenger RNA harboring genetic polymorphisms by miRNAs offers an SNP-dependent post-transcriptional regulatory mechanism, allowing for polymorphic-specific differential gene regulation. PMID:24627221
Lavenu, A; Pistoi, S; Pournin, S; Babinet, C; Morello, D
1995-01-01
In vivo, the steady-state level of c-myc mRNA is mainly controlled by posttranscriptional mechanisms. Using a panel of transgenic mice in which various versions of the human c-myc proto-oncogene were under the control of major histocompatibility complex H-2Kb class I regulatory sequences, we have shown that the 5' and the 3' noncoding sequences are dispensable for obtaining a regulated expression of the transgene in adult quiescent tissues, at the start of liver regeneration, and after inhibition of protein synthesis. These results indicated that the coding sequences were sufficient to ensure a regulated c-myc expression. In the present study, we have pursued this analysis with transgenes containing one or the other of the two c-myc coding exons either alone or in association with the c-myc 3' untranslated region. We demonstrate that each of the exons contains determinants which control c-myc mRNA expression. Moreover, we show that in the liver, c-myc exon 2 sequences are able to down-regulate an otherwise stable H-2K mRNA when embedded within it and to induce its transient accumulation after cycloheximide treatment and soon after liver ablation. Finally, the use of transgenes with different coding capacities has allowed us to postulate that the primary mRNA sequence itself and not c-Myc peptides is an important component of c-myc posttranscriptional regulation. PMID:7623834
Intergenic disease-associated regions are abundant in novel transcripts.
Bartonicek, N; Clark, M B; Quek, X C; Torpy, J R; Pritchard, A L; Maag, J L V; Gloss, B S; Crawford, J; Taft, R J; Hayward, N K; Montgomery, G W; Mattick, J S; Mercer, T R; Dinger, M E
2017-12-28
Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. This resource of previously unreported transcripts in disease-associated regions ( http://gwas-captureseq.dingerlab.org ) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets.
Kim, Dong Seon; Hahn, Yoonsoo
2012-11-13
Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution.
RBFOX and PTBP1 proteins regulate the alternative splicing of micro-exons in human brain transcripts
Sanchez-Pulido, Luis; Haerty, Wilfried
2015-01-01
Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein–protein interactions. PMID:25524026
Evaluation of non-coding variation in GLUT1 deficiency.
Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S
2016-12-01
Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.
Novel germline PALB2 truncating mutations in African-American breast cancer patients
Zheng, Yonglan; Zhang, Jing; Niu, Qun; Huo, Dezheng; Olopade, Olufunmilayo I.
2011-01-01
Background It has been demonstrated that PALB2 acts as a bridging molecule between the BRCA1 and BRCA2 proteins and is responsible for facilitating BRCA2-mediated DNA repair. Truncating mutations in the PALB2 gene have been reported to be enriched in Fanconi anemia and breast cancer patients in various populations. Methods We evaluated the contribution of PALB2 germline mutations in 279 African-American breast cancer patients including 29 patients with a strong family history, 29 patients with a moderate family history, 75 patients with a weak family history, and 146 non-familial or sporadic breast cancer cases. Results After direct sequencing of all the coding exons, exon/intron boundaries, 5′UTR and 3′UTR of PALB2, three (1.08%; 3 in 279) novel monoallelic truncating mutations were identified: c.758dupT (exon4), c.1479delC (exon4) and c.3048delT (exon 10); together with 50 sequence variants, 27 of which are novel. None of the truncating mutations were found in 262 controls from the same population. Conclusions PALB2 mutations are present in both familial and non-familial breast cancer among African-Americans. Rare PALB2 mutations account for a small but substantial proportion of breast cancer patients. PMID:21932393
Veenstra, Jan A; Khammassi, Hela
2017-04-01
RYamides are arthropod neuropeptides with unknown function. In 2011 two RYamides were isolated from D. melanogaster as the ligands for the G-protein coupled receptor CG5811. The D. melanogaster gene encoding these neuropeptides is highly unusual, as there are four RYamide encoding exons in the current genome assembly, but an exon encoding a signal peptide is absent. Comparing the D. melanogaster gene structure with those from other species, including D. virilis, suggests that the gene is degenerating. RNAseq data from 1634 short sequence read archives at NCBI containing more than 34 billion spots yielded numerous individual spots that correspond to the RYamide encoding exons, of which a large number include the intron-exon boundary at the start of this exon. Although 72 different sequences have been spliced onto this RYamide encoding exon, none codes for the signal peptide of this gene. Thus, the RNAseq data for this gene reveal only noise and no signal. The very small quantities of peptide recovered during isolation and the absence of credible RNAseq data, indicates that the gene is very little expressed, while the RYamide gene structure in D. melanogaster suggests that it might be evolving into a pseudogene. Yet, the identification of the peptides it encodes clearly shows it is still functional. Using region specific antisera, we could localize numerous neurons and enteroendocrine cells in D. willistoni, D. virilis and D. pseudoobscura, but only two adult abdominal neurons in D. melanogaster. Those two neurons project to and innervate the rectal papillae, suggesting that RYamides may be involved in the regulation of water homeostasis. Copyright © 2017 Elsevier Ltd. All rights reserved.
COOLAIR Antisense RNAs Form Evolutionarily Conserved Elaborate Secondary Structures
Hawkes, Emily J.; Hennelly, Scott P.; Novikova, Irina V.; ...
2016-09-20
There is considerable debate about the functionality of long non-coding RNAs (lncRNAs). Lack of sequence conservation has been used to argue against functional relevance. Here, we investigated antisense lncRNAs, called COOLAIR, at the A. thaliana FLC locus and experimentally determined their secondary structure. The major COOLAIR variants are highly structured, organized by exon. The distally polyadenylated transcript has a complex multi-domain structure, altered by a single non-coding SNP defining a functionally distinct A. thaliana FLC haplotype. The A. thaliana COOLAIR secondary structure was used to predict COOLAIR exons in evolutionarily divergent Brassicaceae species. These predictions were validated through chemical probingmore » and cloning. Despite the relatively low nucleotide sequence identity, the structures, including multi-helix junctions, show remarkable evolutionary conservation. In a number of places, the structure is conserved through covariation of a non-contiguous DNA sequence. This structural conservation supports a functional role for COOLAIR transcripts rather than, or in addition to, antisense transcription.« less
COOLAIR Antisense RNAs Form Evolutionarily Conserved Elaborate Secondary Structures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hawkes, Emily J.; Hennelly, Scott P.; Novikova, Irina V.
There is considerable debate about the functionality of long non-coding RNAs (lncRNAs). Lack of sequence conservation has been used to argue against functional relevance. Here, we investigated antisense lncRNAs, called COOLAIR, at the A. thaliana FLC locus and experimentally determined their secondary structure. The major COOLAIR variants are highly structured, organized by exon. The distally polyadenylated transcript has a complex multi-domain structure, altered by a single non-coding SNP defining a functionally distinct A. thaliana FLC haplotype. The A. thaliana COOLAIR secondary structure was used to predict COOLAIR exons in evolutionarily divergent Brassicaceae species. These predictions were validated through chemical probingmore » and cloning. Despite the relatively low nucleotide sequence identity, the structures, including multi-helix junctions, show remarkable evolutionary conservation. In a number of places, the structure is conserved through covariation of a non-contiguous DNA sequence. This structural conservation supports a functional role for COOLAIR transcripts rather than, or in addition to, antisense transcription.« less
Li, Yang I; Sanchez-Pulido, Luis; Haerty, Wilfried; Ponting, Chris P
2015-01-01
Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein-protein interactions. © 2015 Li et al.; Published by Cold Spring Harbor Laboratory Press.
Glaus, Esther; Lorenz, Birgit; Netzer, Christian; Li, Yün; Schambeck, Maria; Wittmer, Mariana; Feil, Silke; Kirschner-Schwabe, Renate; Rosenberg, Thomas; Cremers, Frans P.M.; Bergen, Arthur A.B.; Barthelmes, Daniel; Baraki, Husnia; Schmid, Fabian; Tanner, Gaby; Fleischhauer, Johannes; Orth, Ulrike; Becker, Christian; Wegscheider, Erika; Nürnberg, Gudrun; Nürnberg, Peter; Bolz, Hanno Jörn; Gal, Andreas; Berger, Wolfgang
2008-01-01
Purpose The goal of this study was to identify mutations in X-chromosomal genes associated with retinitis pigmentosa (RP) in patients from Germany, The Netherlands, Denmark, and Switzerland. Methods In addition to all coding exons of RP2, exons 1 through 15, 9a, ORF15, 15a and 15b of RPGR were screened for mutations. PCR products were amplified from genomic DNA extracted from blood samples and analyzed by direct sequencing. In one family with apparently dominant inheritance of RP, linkage analysis identified an interval on the X chromosome containing RPGR, and mutation screening revealed a pathogenic variant in this gene. Patients of this family were examined clinically and by X-inactivation studies. Results This study included 141 RP families with possible X-chromosomal inheritance. In total, we identified 46 families with pathogenic sequence alterations in RPGR and RP2, of which 17 mutations have not been described previously. Two of the novel mutations represent the most 3’-terminal pathogenic sequence variants in RPGR and RP2 reported to date. In exon ORF15 of RPGR, we found eight novel and 14 known mutations. All lead to a disruption of open reading frame. Of the families with suggested X-chromosomal inheritance, 35% showed mutations in ORF15. In addition, we found five novel mutations in other exons of RPGR and four in RP2. Deletions in ORF15 of RPGR were identified in three families in which female carriers showed variable manifestation of the phenotype. Furthermore, an ORF15 mutation was found in an RP patient who additionally carries a 6.4 kbp deletion downstream of the coding region of exon ORF15. We did not identify mutations in 39 sporadic male cases from Switzerland. Conclusions RPGR mutations were confirmed to be the most frequent cause of RP in families with an X-chromosomal inheritance pattern. We propose a screening strategy to provide molecular diagnostics in these families. PMID:18552978
2012-01-01
Background Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. Results We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Conclusions Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution. PMID:23148531
Lee, Tai-Sung; Ma, Wanlong; Zhang, Xi; Kantarjian, Hagop; Albitar, Maher
2009-01-01
Background The functional relevance of many of the recently detected JAK2 mutations, except V617F and exon 12 mutants, in patients with chronic myeloproliferative neoplasia (MPN) has been significantly overlooked. To explore atomic-level explanations of the possible mutational effects from those overlooked mutants, we performed a set of molecular dynamics simulations on clinically observed mutants, including newly discovered mutations (K539L, R564L, L579F, H587N, S591L, H606Q, V617I, V617F, C618R, L624P, whole exon 14-deletion) and control mutants (V617C, V617Y, K603Q/N667K). Results Simulation results are consistent with all currently available clinical/experimental evidence. The simulation-derived putative interface, not possibly obtained from static models, between the kinase (JH1) and pseudokinase (JH2) domains of JAK2 provides a platform able to explain the mutational effect for all mutants, including presumably benign control mutants, at the atomic level. Conclusion The results and analysis provide structural bases for mutational mechanisms of JAK2, may advance the understanding of JAK2 auto-regulation, and have the potential to lead to therapeutic approaches. Together with recent mutation profiling results demonstrating the breadth of clinically observed JAK2 mutations, our findings suggest that molecular testing/diagnostics of JAK2 should extend beyond V617F and exon 12 mutations, and perhaps should encompass most of the pseudo-kinase domain-coding region. PMID:19744331
Genetic variations of VDR/NR1I1 encoding vitamin D receptor in a Japanese population.
Ukaji, Maho; Saito, Yoshiro; Fukushima-Uesaka, Hiromi; Maekawa, Keiko; Katori, Noriko; Kaniwa, Nahoko; Yoshida, Teruhiko; Nokihara, Hiroshi; Sekine, Ikuo; Kunitoh, Hideo; Ohe, Yuichiro; Yamamoto, Noboru; Tamura, Tomohide; Saijo, Nagahiro; Sawada, Jun-ichi
2007-12-01
The vitamin D receptor (VDR) is a transcriptional factor responsive to 1alpha,25-dihydroxyvitamin D(3) and lithocholic acid, and induces expression of drug metabolizing enzymes CYP3A4, CYP2B6 and CYP2C9. In this study, the promoter regions, 14 exons (including 6 exon 1's) and their flanking introns of VDR were comprehensively screened for genetic variations in 107 Japanese subjects. Sixty-one genetic variations including 25 novel ones were found: 9 in the 5'-flanking region, 2 in the 5'-untranslated region (UTR), 7 in the coding exons (5 synonymous and 2 nonsynonymous variations), 12 in the 3'-UTR, 19 in the introns between the exon 1's, and 12 in introns 2 to 8. Of these, one novel nonsynonymous variation, 154A>G (Met52Val), was detected with an allele frequency of 0.005. The single nucleotide polymorphisms (SNPs) that increase VDR expression or activity, -29649G>A, 2T>C and 1592((*)308)C>A tagging linked variations in the 3'-UTR, were detected at 0.430, 0.636, and 0.318 allele frequencies, respectively. Another SNP, -26930A>G, with reduced VDR transcription was found at a 0.028 frequency. These findings would be useful for association studies on VDR variations in Japanese.
Maia, Rafaela M; Valente, Valeria; Cunha, Marco A V; Sousa, Josane F; Araujo, Daniela D; Silva, Wilson A; Zago, Marco A; Dias-Neto, Emmanuel; Souza, Sandro J; Simpson, Andrew J G; Monesi, Nadia; Ramos, Ricardo G P; Espreafico, Enilza M; Paçó-Larson, Maria L
2007-07-24
The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome. Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA+ northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis. In agreement with the proposal that this locus is co-regulated in response to microorganisms infection, we show here that SP212 is also up-regulated upon injury. Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that the computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data.
Maia, Rafaela M; Valente, Valeria; Cunha, Marco AV; Sousa, Josane F; Araujo, Daniela D; Silva, Wilson A; Zago, Marco A; Dias-Neto, Emmanuel; Souza, Sandro J; Simpson, Andrew JG; Monesi, Nadia; Ramos, Ricardo GP; Espreafico, Enilza M; Paçó-Larson, Maria L
2007-01-01
Background The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome. Results Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA+ northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis. In agreement with the proposal that this locus is co-regulated in response to microorganisms infection, we show here that SP212 is also up-regulated upon injury. Conclusion Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that the computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data. PMID:17650329
RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease.
Xiong, Hui Y; Alipanahi, Babak; Lee, Leo J; Bretschneider, Hannes; Merico, Daniele; Yuen, Ryan K C; Hua, Yimin; Gueroussov, Serge; Najafabadi, Hamed S; Hughes, Timothy R; Morris, Quaid; Barash, Yoseph; Krainer, Adrian R; Jojic, Nebojsa; Scherer, Stephen W; Blencowe, Benjamin J; Frey, Brendan J
2015-01-09
To facilitate precision medicine and whole-genome annotation, we developed a machine-learning technique that scores how strongly genetic variants affect RNA splicing, whose alteration contributes to many diseases. Analysis of more than 650,000 intronic and exonic variants revealed widespread patterns of mutation-driven aberrant splicing. Intronic disease mutations that are more than 30 nucleotides from any splice site alter splicing nine times as often as common variants, and missense exonic disease mutations that have the least impact on protein function are five times as likely as others to alter splicing. We detected tens of thousands of disease-causing mutations, including those involved in cancers and spinal muscular atrophy. Examination of intronic and exonic variants found using whole-genome sequencing of individuals with autism revealed misspliced genes with neurodevelopmental phenotypes. Our approach provides evidence for causal variants and should enable new discoveries in precision medicine. Copyright © 2015, American Association for the Advancement of Science.
Fisher, S E; van Bakel, I; Lloyd, S E; Pearce, S H; Thakker, R V; Craig, I W
1995-10-10
Dent disease, an X-linked familial renal tubular disorder, is a form of Fanconi syndrome associated with proteinuria, hypercalciuria, nephrocalcinosis, kidney stones, and eventual renal failure. We have previously used positional cloning to identify the 3' part of a novel kidney-specific gene (initially termed hClC-K2, but now referred to as CLCN5), which is deleted in patients from one pedigree segregating Dent disease. Mutations that disrupt this gene have been identified in other patients with this disorder. Here we describe the isolation and characterization of the complete open reading frame of the human CLCN5 gene, which is predicted to encode a protein of 746 amino acids, with significant homology to all known members of the ClC family of voltage-gated chloride channels. CLCN5 belongs to a distinct branch of this family, which also includes the recently identified genes CLCN3 and CLCN4. We have shown that the coding region of CLCN5 is organized into 12 exons, spanning 25-30 kb of genomic DNA, and have determined the sequence of each exon-intron boundary. The elucidation of the coding sequence and exon-intron organization of CLCN5 will both expedite the evaluation of structure/function relationships of these ion channels and facilitate the screening of other patients with renal tubular dysfunction for mutations at this locus.
The functional spectrum of low-frequency coding variation.
Marth, Gabor T; Yu, Fuli; Indap, Amit R; Garimella, Kiran; Gravel, Simon; Leong, Wen Fung; Tyler-Smith, Chris; Bainbridge, Matthew; Blackwell, Tom; Zheng-Bradley, Xiangqun; Chen, Yuan; Challis, Danny; Clarke, Laura; Ball, Edward V; Cibulskis, Kristian; Cooper, David N; Fulton, Bob; Hartl, Chris; Koboldt, Dan; Muzny, Donna; Smith, Richard; Sougnez, Carrie; Stewart, Chip; Ward, Alistair; Yu, Jin; Xue, Yali; Altshuler, David; Bustamante, Carlos D; Clark, Andrew G; Daly, Mark; DePristo, Mark; Flicek, Paul; Gabriel, Stacey; Mardis, Elaine; Palotie, Aarno; Gibbs, Richard
2011-09-14
Rare coding variants constitute an important class of human genetic variation, but are underrepresented in current databases that are based on small population samples. Recent studies show that variants altering amino acid sequence and protein function are enriched at low variant allele frequency, 2 to 5%, but because of insufficient sample size it is not clear if the same trend holds for rare variants below 1% allele frequency. The 1000 Genomes Exon Pilot Project has collected deep-coverage exon-capture data in roughly 1,000 human genes, for nearly 700 samples. Although medical whole-exome projects are currently afoot, this is still the deepest reported sampling of a large number of human genes with next-generation technologies. According to the goals of the 1000 Genomes Project, we created effective informatics pipelines to process and analyze the data, and discovered 12,758 exonic SNPs, 70% of them novel, and 74% below 1% allele frequency in the seven population samples we examined. Our analysis confirms that coding variants below 1% allele frequency show increased population-specificity and are enriched for functional variants. This study represents a large step toward detecting and interpreting low frequency coding variation, clearly lays out technical steps for effective analysis of DNA capture data, and articulates functional and population properties of this important class of genetic variation.
Fayaz, Shima; Fard-Esfahani, Pezhman; Fard-Esfahani, Armaghan; Mostafavi, Ehsan; Meshkani, Reza; Mirmiranpour, Hossein; Khaghani, Shahnaz
2012-01-01
Homologous recombination (HR) is the major pathway for repairing double strand breaks (DSBs) in eukaryotes and XRCC2 is an essential component of the HR repair machinery. To evaluate the potential role of mutations in gene repair by HR in individuals susceptible to differentiated thyroid carcinoma (DTC) we used high resolution melting (HRM) analysis, a recently introduced method for detecting mutations, to examine the entire XRCC2 coding region in an Iranian population. HRM analysis was used to screen for mutations in three XRCC2 coding regions in 50 patients and 50 controls. There was no variation in the HRM curves obtained from the analysis of exons 1 and 2 in the case and control groups. In exon 3, an Arg188His polymorphism (rs3218536) was detected as a new melting curve group (OR: 1.46; 95%CI: 0.432–4.969; p = 0.38) compared with the normal melting curve. We also found a new Ser150Arg polymorphism in exon 3 of the control group. These findings suggest that genetic variations in the XRCC2 coding region have no potential effects on susceptibility to DTC. However, further studies with larger populations are required to confirm this conclusion. PMID:22481871
Nowacka-Woszuk, J; Switonski, M
2010-02-01
Numerous mutations of the human androgen receptor (AR) gene cause an intersexual phenotype, called the androgen insensitivity syndrome. The intersexual phenotype is also quite often diagnosed in dogs. The aim of this study was to conduct a comparative analysis of the entire coding sequence (eight exons) of the AR gene in healthy and four intersex dogs, as well as in three other canids (the red fox, arctic fox and Chinese raccoon dog). The coding sequence of the studied species appeared to be conserved (similarity above 97%) and polymorphism was found in exon 1 only. Altogether, 2 SNPs were identified in healthy dogs, 14 in red foxes, 16 in arctic foxes and 6 were found in Chinese raccoon dogs, respectively. Moreover, a variable number of tandem repeats (CAG and CAA), encoding an array of glutamines, was also observed in this exon. The CAA codon numbers were invariable within species, but the CAG repeats were polymorphic. The highest number of the CAG and CAA repeats was found in dogs (from 40 to 42) and the observed variability was similar in intersex and healthy dogs. In the other canids the variability fell within the following ranges: 29-37 (red fox), 37-39 (arctic fox) and 29-32 (Chinese raccoon dog). In addition, a polymorphic microsatellite marker in intron 2 was found in the dog, red fox and Chinese raccoon dog. It was concluded that the polymorphism level of the AR gene in the dog was lower than in the other canids and none of the detected polymorphisms, including variability of the CAG tandem repeats, could be related with the intersexual phenotype of the studied dogs.
Wang, Jinyan; Yang, Yuwen; Jin, Lamei; Ling, Xitie; Liu, Tingli; Chen, Tianzi; Ji, Yinghua; Yu, Wengui; Zhang, Baolong
2018-06-04
Long Noncoding-RNAs (LncRNAs) are known to be involved in some biological processes, but their roles in plant-virus interactions remain largely unexplored. While circular RNAs (circRNAs) have been studied in animals, there has yet to be extensive research on them in a plant system, especially in tomato-tomato yellow leaf curl virus (TYLCV) interaction. In this study, RNA transcripts from the susceptible tomato line JS-CT-9210 either infected with TYLCV or untreated, were sequenced in a pair-end strand-specific manner using ribo-zero rRNA removal library method. A total of 2056 lncRNAs including 1767 long intergenic non-coding RNA (lincRNAs) and 289 long non-coding natural antisense transcripts (lncNATs) were obtained. The expression patterns in lncRNAs were similar in susceptible tomato plants between control check (CK) and TYLCV infected samples. Our analysis suggested that lncRNAs likely played a role in a variety of functions, including plant hormone signaling, protein processing in the endoplasmic reticulum, RNA transport, ribosome function, photosynthesis, glulathione metabolism, and plant-pathogen interactions. Using virus-induced gene silencing (VIGS) analysis, we found that reduced expression of the lncRNA S-slylnc0957 resulted in enhanced resistance to TYLCV in susceptible tomato plants. Moreover, we identified 184 circRNAs candidates using the CircRNA Identifier (CIRI) software, of which 32 circRNAs were specifically expressed in untreated samples and 83 circRNAs in TYLCV samples. Approximately 62% of these circRNAs were derived from exons. We validated the circRNAs by both PCR and Sanger sequencing using divergent primers, and found that most of circRNAs were derived from the exons of protein coding genes. The silencing of these circRNAs parent genes resulted in decreased TYLCV virus accumulation. In this study, we identified novel lncRNAs and circRNAs using bioinformatic approaches and showed that these RNAs function as negative regulators of TYLCV infection. Moreover, the expression patterns of lncRNAs in susceptible tomato plants were different from that of resistant tomato plants, while exonic circRNAs expression positively associated with their respective protein coding genes. This work provides a foundation for elaborating the novel roles of lncRNAs and circRNAs in susceptible tomatoes following TYLCV infection.
ERIC Educational Resources Information Center
Ressler, Kerry J.; Rattiner, Lisa M.; Davis, Michael
2004-01-01
Brain-derived neurotrophic factor (BDNF) has been implicated as a molecular mediator of learning and memory. The BDNF gene contains four differentially regulated promoters that generate four distinct mRNA transcripts, each containing a unique noncoding 5[prime]-exon and a common 3[prime]-coding exon. This study describes novel evidence for the…
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Gretchen H. Roffler; Stephen J. Amish; Seth Smith; Ted Cosart; Marty Kardos; Michael K. Schwartz; Gordon Luikart
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding...
Soukarieh, Omar; Gaildrat, Pascaline; Hamieh, Mohamad; Drouet, Aurélie; Baert-Desurmont, Stéphanie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra
2016-01-01
The identification of a causal mutation is essential for molecular diagnosis and clinical management of many genetic disorders. However, even if next-generation exome sequencing has greatly improved the detection of nucleotide changes, the biological interpretation of most exonic variants remains challenging. Moreover, particular attention is typically given to protein-coding changes often neglecting the potential impact of exonic variants on RNA splicing. Here, we used the exon 10 of MLH1, a gene implicated in hereditary cancer, as a model system to assess the prevalence of RNA splicing mutations among all single-nucleotide variants identified in a given exon. We performed comprehensive minigene assays and analyzed patient’s RNA when available. Our study revealed a staggering number of splicing mutations in MLH1 exon 10 (77% of the 22 analyzed variants), including mutations directly affecting splice sites and, particularly, mutations altering potential splicing regulatory elements (ESRs). We then used this thoroughly characterized dataset, together with experimental data derived from previous studies on BRCA1, BRCA2, CFTR and NF1, to evaluate the predictive power of 3 in silico approaches recently described as promising tools for pinpointing ESR-mutations. Our results indicate that ΔtESRseq and ΔHZEI-based approaches not only discriminate which variants affect splicing, but also predict the direction and severity of the induced splicing defects. In contrast, the ΔΨ-based approach did not show a compelling predictive power. Our data indicates that exonic splicing mutations are more prevalent than currently appreciated and that they can now be predicted by using bioinformatics methods. These findings have implications for all genetically-caused diseases. PMID:26761715
Lyons, Brendan M; McHenry, Monique A; Barrington, David S
2017-07-01
Cytosolic phosphoglucose isomerase (pgiC) is an enzyme essential to glycolysis found universally in eukaryotes, but broad understanding of variation in the gene coding for pgiC is lacking for ferns. We used a substantially expanded representation of the gene for Andean species of the fern genus Polystichum to characterize pgiC in ferns relative to angiosperms, insects, and an amoebozoan; assess the impact of selection versus neutral evolutionary processes on pgiC; and explore evolutionary relationships of selected Andean species. The dataset of complete sequences comprised nine accessions representing seven species and one hybrid from the Andes and Serra do Mar. The aligned sequences of the full data set comprised 3376 base pairs (70% of the entire gene) including 17 exons and 15 introns from two central areas of the gene. The exons are highly conserved relative to angiosperms and retain substantial homology to insect pgiC, but intron length and structure are unique to the ferns. Average intron size is similar to angiosperms; intron number and location in insects are unlike those of the plants we considered. The introns included an array of indels and, in intron 7, an extensive microsatellite array with potential utility in analyzing population-level histories. Bayesian and maximum-parsimony analysis of 129 variable nucleotides in the Andean polystichums revealed that 59 (1.7% of the 3376 total) were phylogenetically informative; most of these united sister accessions. The phylogenetic trees for the Andean polystichums were incongruent with previously published cpDNA trees for the same taxa, likely the result of rapid evolutionary change in the introns and contrasting stability in the exons. The exons code a total of seven amino-acid substitutions. Comparison of non-synonymous to synonymous substitutions did not suggest that the pgiC gene is under selection in the Andes. Variation in pgiC including two additional accessions represented by incomplete sequences provided new insights into reticulate relationships among Andean taxa. Copyright © 2017 Elsevier Inc. All rights reserved.
Guttman, Mitchell; Garber, Manuel; Levin, Joshua Z.; Donaghey, Julie; Robinson, James; Adiconis, Xian; Fan, Lin; Koziol, Magdalena J.; Gnirke, Andreas; Nusbaum, Chad; Rinn, John L.; Lander, Eric S.; Regev, Aviv
2010-01-01
RNA-Seq provides an unbiased way to study a transcriptome, including both coding and non-coding genes. To date, most RNA-Seq studies have critically depended on existing annotations, and thus focused on expression levels and variation in known transcripts. Here, we present Scripture, a method to reconstruct the transcriptome of a mammalian cell using only RNA-Seq reads and the genome sequence. We apply it to mouse embryonic stem cells, neuronal precursor cells, and lung fibroblasts to accurately reconstruct the full-length gene structures for the vast majority of known expressed genes. We identify substantial variation in protein-coding genes, including thousands of novel 5′-start sites, 3′-ends, and internal coding exons. We then determine the gene structures of over a thousand lincRNA and antisense loci. Our results open the way to direct experimental manipulation of thousands of non-coding RNAs, and demonstrate the power of ab initio reconstruction to render a comprehensive picture of mammalian transcriptomes. PMID:20436462
Novel RS1 mutations associated with X-linked juvenile retinoschisis
YI, JUNHUI; LI, SHIQIANG; JIA, XIAOYUN; XIAO, XUESHAN; WANG, PANFENG; GUO, XIANGMING; ZHANG, QINGJIONG
2012-01-01
To identify mutations in the retinoschisin (RS1) gene in families with X-linked retinoschisis (XLRS). Twenty families with XLRS were enrolled in this study. All six coding exons and adjacent intronic regions of RS1 were amplified by polymerase chain reaction (PCR). The nucleotide sequences of the amplicons were determined by Sanger sequencing. Ten hemizygous mutations in RS1 were detected in patients from 14 of the 20 families. Four of the ten mutations were novel, including c:176G>A (p:Cys59Tyr) in exon 3, c:531T>G (p:Tyr177X), c:607C>G (p:Pro203Ala) and c:668G>A (p:Cys223Tyr) in exon 6. These four novel mutations were not present in 176 normal individuals. The remaining six were recurrent mutations, including c:214G>A (p:Glu72Lys), c:304C>T (p:Arg102Trp), c:436G>A (p:Glu146Lys), c:544C>T (p:Arg182Cys), c:599G>A (p:Arg200His) and c:644A>T (p:Glu215Val). Our study expanded the mutation spectrum of RS1 and enriches our understanding of the molecular basis of XLRS. PMID:22245991
Novel RS1 mutations associated with X-linked juvenile retinoschisis.
Yi, Junhui; Li, Shiqiang; Jia, Xiaoyun; Xiao, Xueshan; Wang, Panfeng; Guo, Xiangming; Zhang, Qingjiong
2012-04-01
To identify mutations in the retinoschisin (RS1) gene in families with X-linked retinoschisis (XLRS). Twenty families with XLRS were enrolled in this study. All six coding exons and adjacent intronic regions of RS1 were amplified by polymerase chain reaction (PCR). The nucleotide sequences of the amplicons were determined by Sanger sequencing. Ten hemizygous mutations in RS1 were detected in patients from 14 of the 20 families. Four of the ten mutations were novel, including c:176G>A (p:Cys59Tyr) in exon 3, c:531T>G (p:Tyr177X), c:607C>G (p:Pro203Ala) and c:668G>A (p:Cys223Tyr) in exon 6. These four novel mutations were not present in 176 normal individuals. The remaining six were recurrent mutations, including c:214G>A (p:Glu72Lys), c:304C>T (p:Arg102Trp), c:436G>A (p:Glu146Lys), c:544C>T (p:Arg182Cys), c:599G>A (p:Arg200His) and c:644A>T (p:Glu215Val). Our study expanded the mutation spectrum of RS1 and enriches our understanding of the molecular basis of XLRS.
Ambigapathy, Ganesh; Zheng, Zhaoqing; Li, Wei; Keifer, Joyce
2013-01-01
Brain-derived neurotrophic factor (BDNF) has a diverse functional role and complex pattern of gene expression. Alternative splicing of mRNA transcripts leads to further diversity of mRNAs and protein isoforms. Here, we describe the regulation of BDNF mRNA transcripts in an in vitro model of eyeblink classical conditioning and a unique transcript that forms a functionally distinct truncated BDNF protein isoform. Nine different mRNA transcripts from the BDNF gene of the pond turtle Trachemys scripta elegans (tBDNF) are selectively regulated during classical conditioning: exon I mRNA transcripts show no change, exon II transcripts are downregulated, while exon III transcripts are upregulated. One unique transcript that codes from exon II, tBDNF2a, contains a 40 base pair deletion in the protein coding exon that generates a truncated tBDNF protein. The truncated transcript and protein are expressed in the naïve untrained state and are fully repressed during conditioning when full-length mature tBDNF is expressed, thereby having an alternate pattern of expression in conditioning. Truncated BDNF is not restricted to turtles as a truncated mRNA splice variant has been described for the human BDNF gene. Further studies are required to determine the ubiquity of truncated BDNF alternative splice variants across species and the mechanisms of regulation and function of this newly recognized BDNF protein.
Ambigapathy, Ganesh; Zheng, Zhaoqing; Li, Wei; Keifer, Joyce
2013-01-01
Brain-derived neurotrophic factor (BDNF) has a diverse functional role and complex pattern of gene expression. Alternative splicing of mRNA transcripts leads to further diversity of mRNAs and protein isoforms. Here, we describe the regulation of BDNF mRNA transcripts in an in vitro model of eyeblink classical conditioning and a unique transcript that forms a functionally distinct truncated BDNF protein isoform. Nine different mRNA transcripts from the BDNF gene of the pond turtle Trachemys scripta elegans (tBDNF) are selectively regulated during classical conditioning: exon I mRNA transcripts show no change, exon II transcripts are downregulated, while exon III transcripts are upregulated. One unique transcript that codes from exon II, tBDNF2a, contains a 40 base pair deletion in the protein coding exon that generates a truncated tBDNF protein. The truncated transcript and protein are expressed in the naïve untrained state and are fully repressed during conditioning when full-length mature tBDNF is expressed, thereby having an alternate pattern of expression in conditioning. Truncated BDNF is not restricted to turtles as a truncated mRNA splice variant has been described for the human BDNF gene. Further studies are required to determine the ubiquity of truncated BDNF alternative splice variants across species and the mechanisms of regulation and function of this newly recognized BDNF protein. PMID:23825634
Novel mutations of endothelin-B receptor gene in Pakistani patients with Waardenburg syndrome.
Jabeen, Raheela; Babar, Masroor Ellahi; Ahmad, Jamil; Awan, Ali Raza
2012-01-01
Mutations in EDNRB gene have been reported to cause Waardenburg-Shah syndrome (WS4) in humans. We investigated 17 patients with WS4 for identification of mutations in EDNRB gene using PCR and direct sequencing technique. Four genomic mutations were detected in four patients; a G to C transversion in codon 335 (S335C) in exon 5 and a transition of T to C in codon (S361L) in exon 5, a transition of A to G in codon 277 (L277L) in exon 4, a non coding transversion of T to A at -30 nucleotide position of exon 5. None of these mutations were found in controls. One of the patients harbored two novel mutations (S335C, S361L) in exon 5 and one in Intronic region (-30exon5 A>G). All of the mutations were homozygous and novel except the mutation observed in exon 4. In this study, we have identified 3 novel mutations in EDNRB gene associated with WS4 in Pakistani patients.
ExoLocator--an online view into genetic makeup of vertebrate proteins.
Khoo, Aik Aun; Ogrizek-Tomas, Mario; Bulovic, Ana; Korpar, Matija; Gürler, Ece; Slijepcevic, Ivan; Šikic, Mile; Mihalek, Ivana
2014-01-01
ExoLocator (http://exolocator.eopsf.org) collects in a single place information needed for comparative analysis of protein-coding exons from vertebrate species. The main source of data--the genomic sequences, and the existing exon and homology annotation--is the ENSEMBL database of completed vertebrate genomes. To these, ExoLocator adds the search for ostensibly missing exons in orthologous protein pairs across species, using an extensive computational pipeline to narrow down the search region for the candidate exons and find a suitable template in the other species, as well as state-of-the-art implementations of pairwise alignment algorithms. The resulting complements of exons are organized in a way currently unique to ExoLocator: multiple sequence alignments, both on the nucleotide and on the peptide levels, clearly indicating the exon boundaries. The alignments can be inspected in the web-embedded viewer, downloaded or used on the spot to produce an estimate of conservation within orthologous sets, or functional divergence across paralogues.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rittig, S.; Siggaard, C.; Pedersen, E.B.
1996-01-01
Familial neurohypophyseal diabetes insipidus (FNDI) is an autosomal dominant disorder characterized by progressive postnatal deficiency of arginine vasopressin as a result of mutation in the gene that encodes the hormone. To determine the extent of mutations in the coding region that produce the phenotype, we studied members of 17 unrelated kindreds with the disorder. We sequenced all 3 exons of the gene by using a rapid, direct dye-terminator method and found the causative mutation in each kindred. In four kindreds, the mutations were each identical to mutations described in other affected families. In the other 13 kindreds each mutation wasmore » unique. There were two missense mutations that altered the cleavage region of the signal peptide, seven missense mutations in exon 2, which codes for the conserved portion of the protein, one nonsense mutation in exon 2, and three nonsense mutations in exon 3. These findings, together with the clinical features of FNDI, suggest that each of the mutations exerts an effect by directing the production of a pre-prohormone that cannot be folded, processed, or degraded properly and eventually destroys vasopressinergic neurons. 63 refs., 5 figs., 6 tabs.« less
Rittig, S.; Robertson, G. L.; Siggaard, C.; Kovács, L.; Gregersen, N.; Nyborg, J.; Pedersen, E. B.
1996-01-01
Familial neurohypophyseal diabetes insipidus (FNDI) is an autosomal dominant disorder characterized by progressive postnatal deficiency of arginine vasopressin as a result of mutation in the gene that encodes the hormone. To determine the extent of mutations in the coding region that produce the phenotype, we studied members of 17 unrelated kindreds with the disorder. We sequenced all 3 exons of the gene by using a rapid, direct dye-terminator method and found the causative mutation in each kindred. In four kindreds, the mutations were each identical to mutations described in other affected families. In the other 13 kindreds each mutation was unique. There were two missense mutations that altered the cleavage region of the signal peptide, seven missense mutations in exon 2, which codes for the conserved portion of the protein, one nonsense mutation in exon 2, and three nonsense mutations in exon 3. These findings, together with the clinical features of FNDI, suggest that each of the mutations exerts an effect by directing the production of a pre-prohormone that cannot be folded, processed, or degraded properly and eventually destroys vasopressinergic neurons. Images Figure 3 PMID:8554046
Shapiro, James A
2016-06-08
The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess "Read-Write Genomes" they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification.
Shapiro, James A.
2016-01-01
The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess “Read–Write Genomes” they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification. PMID:27338490
Shabalina, Svetlana A.; Ogurtsov, Aleksey Y.; Spiridonov, Nikolay A.; Koonin, Eugene V.
2014-01-01
Alternative splicing (AS), alternative transcription initiation (ATI) and alternative transcription termination (ATT) create the extraordinary complexity of transcriptomes and make key contributions to the structural and functional diversity of mammalian proteomes. Analysis of mammalian genomic and transcriptomic data shows that contrary to the traditional view, the joint contribution of ATI and ATT to the transcriptome and proteome diversity is quantitatively greater than the contribution of AS. Although the mean numbers of protein-coding constitutive and alternative nucleotides in gene loci are nearly identical, their distribution along the transcripts is highly non-uniform. On average, coding exons in the variable 5′ and 3′ transcript ends that are created by ATI and ATT contain approximately four times more alternative nucleotides than core protein-coding regions that diversify exclusively via AS. Short upstream exons that encompass alternative 5′-untranslated regions and N-termini of proteins evolve under strong nucleotide-level selection whereas in 3′-terminal exons that encode protein C-termini, protein-level selection is significantly stronger. The groups of genes that are subject to ATI and ATT show major differences in biological roles, expression and selection patterns. PMID:24792168
Genome-wide Discovery of Circular RNAs in the Leaf and Seedling Tissues of Arabidopsis Thaliana
Dou, Yongchao; Li, Shengjun; Yang, Weilong; Liu, Kan; Du, Qian; Ren, Guodong; Yu, Bin; Zhang, Chi
2017-01-01
Background: Recently, identification and functional studies of circular RNAs, a type of non-coding RNAs arising from a ligation of 3’ and 5’ ends of a linear RNA molecule, were conducted in mammalian cells with the development of RNA-seq technology. Method: Since compared with animals, studies on circular RNAs in plants are less thorough, a genome-wide identification of circular RNA candidates in Arabidopsis was conducted with our own developed bioinformatics tool to several existing RNA-seq datasets specifically for non-coding RNAs. Results: A total of 164 circular RNA candidates were identified from RNA-seq data, and 4 circular RNA transcripts, including both exonic and intronic circular RNAs, were experimentally validated. Interestingly, our results show that circular RNA transcripts are enriched in the photosynthesis system for the leaf tissue and correlated to the higher expression levels of their parent genes. Sixteen out of all 40 genes that have circular RNA candidates are related to the photosynthesis system, and out of the total 146 exonic circular RNA candidates, 63 are found in chloroplast. PMID:29081691
SURVEY AND SUMMARY: exon-intron organization of genes in the slime mold Physarum polycephalum.
Trzcinska-Danielewicz, J; Fronk, J
2000-09-15
The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon-intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon-intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon-intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3'-ends.
Reggiani, Claudio; Coppens, Sandra; Sekhara, Tayeb; Dimov, Ivan; Pichon, Bruno; Lufin, Nicolas; Addor, Marie-Claude; Belligni, Elga Fabia; Digilio, Maria Cristina; Faletra, Flavio; Ferrero, Giovanni Battista; Gerard, Marion; Isidor, Bertrand; Joss, Shelagh; Niel-Bütschi, Florence; Perrone, Maria Dolores; Petit, Florence; Renieri, Alessandra; Romana, Serge; Topa, Alexandra; Vermeesch, Joris Robert; Lenaerts, Tom; Casimir, Georges; Abramowicz, Marc; Bontempi, Gianluca; Vilain, Catheline; Deconinck, Nicolas; Smits, Guillaume
2017-07-19
Tissue-specific integrative omics has the potential to reveal new genic elements important for developmental disorders. Two pediatric patients with global developmental delay and intellectual disability phenotype underwent array-CGH genetic testing, both showing a partial deletion of the DLG2 gene. From independent human and murine omics datasets, we combined copy number variations, histone modifications, developmental tissue-specific regulation, and protein data to explore the molecular mechanism at play. Integrating genomics, transcriptomics, and epigenomics data, we describe two novel DLG2 promoters and coding first exons expressed in human fetal brain. Their murine conservation and protein-level evidence allowed us to produce new DLG2 gene models for human and mouse. These new genic elements are deleted in 90% of 29 patients (public and in-house) showing partial deletion of the DLG2 gene. The patients' clinical characteristics expand the neurodevelopmental phenotypic spectrum linked to DLG2 gene disruption to cognitive and behavioral categories. While protein-coding genes are regarded as well known, our work shows that integration of multiple omics datasets can unveil novel coding elements. From a clinical perspective, our work demonstrates that two new DLG2 promoters and exons are crucial for the neurodevelopmental phenotypes associated with this gene. In addition, our work brings evidence for the lack of cross-annotation in human versus mouse reference genomes and nucleotide versus protein databases.
Lack of haplotype structuring for two candidate genes for trypanotolerance in cattle.
Álvarez, I; Pérez-Pardal, L; Traoré, A; Fernández, I; Goyache, F
2016-04-01
Bovine trypanotolerance is a heritable trait associated to the ability of the individuals to control parasitaemia and anaemia. The INHBA (BTA4) and TICAM1 (BTA7) genes are strong candidates for trypanotolerance-related traits. The coding sequence of both genes (3951 bp in total) were analysed in a panel including 79 Asian, African and European cattle (Bos taurus and B. indicus) to identify naturally occurring polymorphisms on both genes. In general, the genetic diversity was low. Nineteen of the 33 mutations identified were found just one time. Seventeen different haplotypes were defined for the TICAM1 gene, and 9 and 12 were defined for the exon 1 and the exon 2 of the INHBA gene, respectively. There was no clear separation between cattle groups. The most frequent haplotypes identified in West African taurine samples were also identified in other cattle groups including Asian zebu and European cattle. Phylogenetic trees and principal component analysis confirmed that divergence among the cattle groups analysed was poor, particularly for the INHBA sequences. The European cattle subset had the lowest values of haplotype diversity for both the exon1 (monomorphic) and the exon2 (0.077 ± 0.066) of the INHBA gene. Neutrality tests, in general, did not suggest that the analysed genes were under positive selection. The assessed scenario would be consistent with the identification of recent mutations in evolutionary terms. © 2015 Blackwell Verlag GmbH.
Mutational screening of FGFR1, CER1, and CDON in a large cohort of trigonocephalic patients.
Jehee, Fernanda Sarquis; Alonso, Luis G; Cavalcanti, Denise P; Kim, Chong; Wall, Steven A; Mulliken, John B; Sun, Miao; Jabs, Ethylin Wang; Boyadjiev, Simeon A; Wilkie, Andrew O M; Passos-Bueno, Maria Rita
2006-03-01
Screen the known craniosynostotic related gene, FGFR1 (exon 7), and two new identified potential candidates, CER1 and CDON, in patients with syndromic and nonsyndromic metopic craniosynostosis to determine if they might be causative genes. Using single-strand conformational polymorphisms (SSCPs), denaturing high-performance liquid chromatography, and/or direct sequencing, we analyzed a total of 81 patients for FGFR1 (exon 7), 70 for CER1, and 44 for CDON. Patients were ascertained in the Centro de Estudos do Genoma Humano in São Paulo, Brazil (n = 39), the Craniofacial Unit, Oxford, U.K. (n = 23), and the Johns Hopkins University, Baltimore, Maryland (n = 31). Clinical inclusion criteria included a triangular head and/or forehead, with or without a metopic ridge, and a radiographic documentation of metopic synostosis. Both syndromic and nonsyndromic patients were studied. No sequence alterations were found for FGFR1 (exon 7). Different patterns of SSCP migration for CER1 compatible with the segregation of single nucleotide polymorphisms reported in the region were identified. Seventeen sequence alterations were detected in the coding region of CDON, seven of which are new, but segregation analysis in parents and homology studies did not indicate a pathological role. FGFR1 (exon 7), CER1, and CDON are not related to trigonocephaly in our sample and should not be considered as causative genes for metopic synostosis. Screening of FGFR1 (exon 7) for diagnostic purposes should not be performed in trigonocephalic patients.
Dynamic and Widespread lncRNA Expression in a Sponge and the Origin of Animal Complexity
Gaiti, Federico; Fernandez-Valverde, Selene L.; Nakanishi, Nagayasu; Calcino, Andrew D.; Yanai, Itai; Tanurdzic, Milos; Degnan, Bernard M.
2015-01-01
Long noncoding RNAs (lncRNAs) are important developmental regulators in bilaterian animals. A correlation has been claimed between the lncRNA repertoire expansion and morphological complexity in vertebrate evolution. However, this claim has not been tested by examining morphologically simple animals. Here, we undertake a systematic investigation of lncRNAs in the demosponge Amphimedon queenslandica, a morphologically simple, early-branching metazoan. We combine RNA-Seq data across multiple developmental stages of Amphimedon with a filtering pipeline to conservatively predict 2,935 lncRNAs. These include intronic overlapping lncRNAs, exonic antisense overlapping lncRNAs, long intergenic nonprotein coding RNAs, and precursors for small RNAs. Sponge lncRNAs are remarkably similar to their bilaterian counterparts in being relatively short with few exons and having low primary sequence conservation relative to protein-coding genes. As in bilaterians, a majority of sponge lncRNAs exhibit typical hallmarks of regulatory molecules, including high temporal specificity and dynamic developmental expression. Specific lncRNA expression profiles correlate tightly with conserved protein-coding genes likely involved in a range of developmental and physiological processes, such as the Wnt signaling pathway. Although the majority of Amphimedon lncRNAs appears to be taxonomically restricted with no identifiable orthologs, we find a few cases of conservation between demosponges in lncRNAs that are antisense to coding sequences. Based on the high similarity in the structure, organization, and dynamic expression of sponge lncRNAs to their bilaterian counterparts, we propose that these noncoding RNAs are an ancient feature of the metazoan genome. These results are consistent with lncRNAs regulating the development of animals, regardless of their level of morphological complexity. PMID:25976353
Fernández-Cancio, Mónica; Nistal, Manuel; Gracia, Ricardo; Molina, M Antonia; Tovar, Juan Antonio; Esteban, Cristina; Carrascosa, Antonio; Audí, Laura
2004-01-01
The goal of this study was to perform 5-alpha-reductase type 2 gene (SRD5A2) analysis in a male pseudohermaphrodite (MPH) patient with normal testosterone (T) production and normal androgen receptor (AR) gene coding sequences. A patient of Chinese origin with ambiguous genitalia at 14 months, a 46,XY karyotype, and normal T secretion under human chorionic gonadotropin (hCG) stimulation underwent a gonadectomy at 20 months. Exons 1-8 of the AR gene and exons 1-5 of the SRD5A2 gene were sequenced from peripheral blood DNA. AR gene coding sequences were normal. SRD5A2 gene analysis revealed 2 consecutive mutations in exon 4, each located in a different allele: 1) a T nucleotide deletion, which predicts a frameshift mutation from codon 219, and 2) a missense mutation at codon 227, where the substitution of guanine (CGA) by adenine (CAA) predicts a glutamine replacement of arginine (R227Q). Testes located in the inguinal canal showed a normal morphology for age. The patient was a compound heterozygote for SRD5A2 mutations, carrying 2 mutations in exon 4. The patient showed an R227Q mutation that has been described in an Asian population and MPH patients, along with a novel frameshift mutation, Tdel219. Testis morphology showed that, during early infancy, the 5-alpha-reductase enzyme deficiency may not have affected interstitial or tubular development.
Reengineering a transmembrane protein to treat muscular dystrophy using exon skipping.
Gao, Quan Q; Wyatt, Eugene; Goldstein, Jeff A; LoPresti, Peter; Castillo, Lisa M; Gazda, Alec; Petrossian, Natalie; Earley, Judy U; Hadhazy, Michele; Barefield, David Y; Demonbreun, Alexis R; Bönnemann, Carsten; Wolf, Matthew; McNally, Elizabeth M
2015-11-02
Exon skipping uses antisense oligonucleotides as a treatment for genetic diseases. The antisense oligonucleotides used for exon skipping are designed to bypass premature stop codons in the target RNA and restore reading frame disruption. Exon skipping is currently being tested in humans with dystrophin gene mutations who have Duchenne muscular dystrophy. For Duchenne muscular dystrophy, the rationale for exon skipping derived from observations in patients with naturally occurring dystrophin gene mutations that generated internally deleted but partially functional dystrophin proteins. We have now expanded the potential for exon skipping by testing whether an internal, in-frame truncation of a transmembrane protein γ-sarcoglycan is functional. We generated an internally truncated γ-sarcoglycan protein that we have termed Mini-Gamma by deleting a large portion of the extracellular domain. Mini-Gamma provided functional and pathological benefits to correct the loss of γ-sarcoglycan in a Drosophila model, in heterologous cell expression studies, and in transgenic mice lacking γ-sarcoglycan. We generated a cellular model of human muscle disease and showed that multiple exon skipping could be induced in RNA that encodes a mutant human γ-sarcoglycan. Since Mini-Gamma represents removal of 4 of the 7 coding exons in γ-sarcoglycan, this approach provides a viable strategy to treat the majority of patients with γ-sarcoglycan gene mutations.
SEASTAR: systematic evaluation of alternative transcription start sites in RNA.
Qin, Zhiyi; Stoilov, Peter; Zhang, Xuegong; Xing, Yi
2018-05-04
Alternative first exons diversify the transcriptomes of eukaryotes by producing variants of the 5' Untranslated Regions (5'UTRs) and N-terminal coding sequences. Accurate transcriptome-wide detection of alternative first exons typically requires specialized experimental approaches that are designed to identify the 5' ends of transcripts. We developed a computational pipeline SEASTAR that identifies first exons from RNA-seq data alone then quantifies and compares alternative first exon usage across multiple biological conditions. The exons inferred by SEASTAR coincide with transcription start sites identified directly by CAGE experiments and bear epigenetic hallmarks of active promoters. To determine if differential usage of alternative first exons can yield insights into the mechanism controlling gene expression, we applied SEASTAR to an RNA-seq dataset that tracked the reprogramming of mouse fibroblasts into induced pluripotent stem cells. We observed dynamic temporal changes in the usage of alternative first exons, along with correlated changes in transcription factor expression. Using a combined sequence motif and gene set enrichment analysis we identified N-Myc as a regulator of alternative first exon usage in the pluripotent state. Our results demonstrate that SEASTAR can leverage the available RNA-seq data to gain insights into the control of gene expression and alternative transcript variation in eukaryotic transcriptomes.
Reengineering a transmembrane protein to treat muscular dystrophy using exon skipping
Gao, Quan Q.; Wyatt, Eugene; Goldstein, Jeff A.; LoPresti, Peter; Castillo, Lisa M.; Gazda, Alec; Petrossian, Natalie; Earley, Judy U.; Hadhazy, Michele; Barefield, David Y.; Demonbreun, Alexis R.; Bönnemann, Carsten; Wolf, Matthew; McNally, Elizabeth M.
2015-01-01
Exon skipping uses antisense oligonucleotides as a treatment for genetic diseases. The antisense oligonucleotides used for exon skipping are designed to bypass premature stop codons in the target RNA and restore reading frame disruption. Exon skipping is currently being tested in humans with dystrophin gene mutations who have Duchenne muscular dystrophy. For Duchenne muscular dystrophy, the rationale for exon skipping derived from observations in patients with naturally occurring dystrophin gene mutations that generated internally deleted but partially functional dystrophin proteins. We have now expanded the potential for exon skipping by testing whether an internal, in-frame truncation of a transmembrane protein γ-sarcoglycan is functional. We generated an internally truncated γ-sarcoglycan protein that we have termed Mini-Gamma by deleting a large portion of the extracellular domain. Mini-Gamma provided functional and pathological benefits to correct the loss of γ-sarcoglycan in a Drosophila model, in heterologous cell expression studies, and in transgenic mice lacking γ-sarcoglycan. We generated a cellular model of human muscle disease and showed that multiple exon skipping could be induced in RNA that encodes a mutant human γ-sarcoglycan. Since Mini-Gamma represents removal of 4 of the 7 coding exons in γ-sarcoglycan, this approach provides a viable strategy to treat the majority of patients with γ-sarcoglycan gene mutations. PMID:26457733
An RRM–ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion
Collins, Katherine M.; Kainov, Yaroslav A.; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A.
2017-01-01
Abstract RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1–ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. PMID:28379442
An RRM-ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion.
Collins, Katherine M; Kainov, Yaroslav A; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A; Makeyev, Eugene V; Ramos, Andres
2017-06-20
RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1-ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Expression of exon-8-skipped kindlin-1 does not compensate for defects of Kindler syndrome.
Natsuga, Ken; Nishie, Wataru; Shinkuma, Satoru; Nakamura, Hideki; Matsushima, Yoichiro; Tatsuta, Aya; Komine, Mayumi; Shimizu, Hiroshi
2011-01-01
Kindler syndrome (KS) is a rare, inherited skin disease characterized by blister formation and generalized poikiloderma. Mutations in KIND1, which encodes kindlin-1, are responsible for KS. c.1089del/1089+1del is a recurrent splice-site deletion mutation in KS patients. To elucidate the effects of c.1089del/1089+1del at the mRNA and protein level. Two KS patients with c.1089del/1089+1del were included in this study. Immunofluorescence analysis of KS skin samples using antibodies against the dermo-epidermal junction proteins was performed. Exon-trapping experiments were performed to isolate the mRNA sequences transcribed from genomic DNA harbouring c.1089del/1089+1del. β1 integrin activation in HeLa cells transfected with truncated KIND1 cDNA was analyzed. Immunofluorescence study showed positive expression of kindlin-1 in KS skin with c.1089del/1089+1del mutation. We identified the exon-8-skipped in-frame transcript as the main product among multiple splicing variants derived from that mutation. HeLa cells transfected with KIND1 cDNA without exon 8 showed impaired β1 integrin activation. Exon-8-coding amino acids are located in the FERM F2 domain, which is conserved among species, and the unstructured region between F2 and the pleckstrin homology domain. This study suggests that exon-8-skipped truncated kindlin-1 is functionally defective and does not compensate for the defects of KS, even though kindlin-1 expression in skin is positive. Copyright © 2010 Japanese Society for Investigative Dermatology. Published by Elsevier Ireland Ltd. All rights reserved.
RPS8—a New Informative DNA Marker for Phylogeny of Babesia and Theileria Parasites in China
Tian, Zhan-Cheng; Liu, Guang-Yuan; Yin, Hong; Luo, Jian-Xun; Guan, Gui-Quan; Luo, Jin; Xie, Jun-Ren; Shen, Hui; Tian, Mei-Yuan; Zheng, Jin-feng; Yuan, Xiao-song; Wang, Fang-fang
2013-01-01
Piroplasmosis is a serious debilitating and sometimes fatal disease. Phylogenetic relationships within piroplasmida are complex and remain unclear. We compared the intron–exon structure and DNA sequences of the RPS8 gene from Babesia and Theileria spp. isolates in China. Similar to 18S rDNA, the 40S ribosomal protein S8 gene, RPS8, including both coding and non-coding regions is a useful and novel genetic marker for defining species boundaries and for inferring phylogenies because it tends to have little intra-specific variation but considerable inter-specific difference. However, more samples are needed to verify the usefulness of the RPS8 (coding and non-coding regions) gene as a marker for the phylogenetic position and detection of most Babesia and Theileria species, particularly for some closely related species. PMID:24244571
Novel folliculin (FLCN) mutation and familial spontaneous pneumothorax.
Zhu, J-F; Shen, X-Q; Zhu, F; Tian, L
2017-01-01
Familial spontaneous pneumothorax is one of the characteristics of Birt-Hogg-Dubé syndrome (BHDS), which is an autosomal dominant disease caused by the mutation of folliculin (FLCN). To investigate the mutation of FLCN gene in a familial spontaneous pneumothorax. Prospective case study. Clinical and genetic data of a Chinese family with four patients who presented spontaneous pneumothorax in the absence of skin lesions or renal tumors were collected. CT scan of patient's lung was applied for observation of pneumothorax. DNA sequencing of the coding exons (4-14 exons) of FLCN was performed for all 11 members of the family and 100 unrelated healthy controls. CT scan of patient's lung showed spontaneous pneumothorax. A mutation (c. 510C > G) that leads to a premature stop codon (p. Y170X) was found in the proband using DNA sequencing of coding exons (4-14 exons) of FLCN. This mutation was also observed in the other affected members of the family. A nonsense mutation of FLCN was found in a spontaneous pneumothorax family. Our results expand the mutational spectrum of FLCN in patients with BHDS. © The Author 2016. Published by Oxford University Press on behalf of the Association of Physicians. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
A novel mutation in SCN9A in a child with congenital insensitivity to pain.
Shorer, Zamir; Wajsbrot, Einav; Liran, Tamir-Hostovsky; Levy, Jacov; Parvari, Ruti
2014-01-01
[corrected] Congenital insensitivity to pain (CIP) is a rare condition in which patients have no pain perception and anosmia but are otherwise essentially normal (OMIM 243000). The recent discovery of the genetic defects underlying 3 monogenic pain disorders has provided additional and important insights about some components of human pain. Genetic studies in families demonstrating recessively inherited channelopathy-associated insensitivity to pain have identified nonsense mutations that result in truncation of the voltage-gated sodium channel type IX subunit (SCN9A), a 113.5-kb gene comprising coding 26 exons. Here we describe a patient with CIP with a new mutation in SCN9A not described yet. All exons were sequenced. All 26 coding exons were sequenced and two changes were identified in homozygosity in exon 10: c.1126 A > C causing K376Q and c.1124delG causing p.G375Afs* frame shift. We report a novel, loss-of-function mutation in homozygosity that causes congenital insensitivity to pain and provide a comprehensive clinical description of the patient. This contributes to the clinical and neurophysiological characteristic of the sodium channel Nav1.7 channelopathy and expand our genetic knowledge which might provide more accurate and comprehensive clinical electrophysiological and genetic information. Copyright © 2014 Elsevier Inc. All rights reserved.
Bedeschi, Maria Francesca; Marangi, Giuseppe; Calvello, Maria Rosaria; Ricciardi, Stefania; Leone, Francesca Pia Chiara; Baccarin, Marco; Guerneri, Silvana; Orteschi, Daniela; Murdolo, Marina; Lattante, Serena; Frangella, Silvia; Keena, Beth; Harr, Margaret H; Zackai, Elaine; Zollino, Marcella
2017-11-01
Pitt-Hopkins syndrome is a neurodevelopmental disorder characterized by severe intellectual disability and a distinctive facial gestalt. It is caused by haploinsufficiency of the TCF4 gene. The TCF4 protein has different functional domains, with the NLS (nuclear localization signal) domain coded by exons 7-8 and the bHLH (basic Helix-Loop-Helix) domain coded by exon 18. Several alternatively spliced TCF4 variants have been described, allowing for translation of variable protein isoforms. Typical PTHS patients have impairment of at least the bHLH domain. To which extent impairment of the remaining domains contributes to the final phenotype is not clear. There is recent evidence that certain loss-of-function variants disrupting TCF4 are associated with mild ID, but not with typical PTHS. We describe a frameshift-causing partial gene deletion encompassing exons 4-6 of TCF4 in an adult patient with mild ID and nonspecific facial dysmorphisms but without the typical features of PTHS, and a c.520C > T nonsense variant within exon 8 in a child presenting with a severe phenotype largely mimicking PTHS, but lacking the typical facial dysmorphism. Investigation on mRNA, along with literature review, led us to suggest a preliminary phenotypic map of loss-of-function variants affecting TCF4. An intragenic phenotypic map of loss-of-function variants in TCF4 is suggested here for the first time: variants within exons 1-4 and exons 4-6 give rise to a recurrent phenotype with mild ID not in the spectrum of Pitt-Hopkins syndrome (biallelic preservation of both the NLS and bHLH domains); variants within exons 7-8 cause a severe phenotype resembling PTHS but in absence of the typical facial dysmorphism (impairment limited to the NLS domain); variants within exons 9-19 cause typical Pitt-Hopkins syndrome (impairment of at least the bHLH domain). Understanding the TCF4 molecular syndromology can allow for proper nosology in the current era of whole genomic investigations. Copyright © 2017. Published by Elsevier Masson SAS.
Using the NCBI Genome Databases to Compare the Genes for Human & Chimpanzee Beta Hemoglobin
ERIC Educational Resources Information Center
Offner, Susan
2010-01-01
The beta hemoglobin protein is identical in humans and chimpanzees. In this tutorial, students see that even though the proteins are identical, the genes that code for them are not. There are many more differences in the introns than in the exons, which indicates that coding regions of DNA are more highly conserved than non-coding regions.
Kobayashi, Eri; Shimizu, Ritsuko; Kikuchi, Yuko; Takahashi, Satoru; Yamamoto, Masayuki
2010-01-01
GATA1 is essential for the differentiation of erythroid cells and megakaryocytes. The Gata1 gene is composed of multiple untranslated first exons and five common coding exons. The erythroid first exon (IE exon) is important for Gata1 gene expression in hematopoietic lineages. Because previous IE exon knockdown analyses resulted in embryonic lethality, less is understood about the contribution of the IE exon to adult hematopoiesis. Here, we achieved specific deletion of the floxed IE exon in adulthood using an inducible Cre expression system. In this conditional knock-out mouse line, the Gata1 mRNA level was significantly down-regulated in the megakaryocyte lineage, resulting in thrombocytopenia with a marked proliferation of megakaryocytes. By contrast, in the erythroid lineage, Gata1 mRNA was expressed abundantly utilizing alternative first exons. Especially, the IEb/c and newly identified IEd exons were transcribed at a level comparable with that of the IE exon in control mice. Surprisingly, in the IE-null mouse, these transcripts failed to produce full-length GATA1 protein, but instead yielded GATA1 lacking the N-terminal domain inefficiently. With low level expression of the short form of GATA1, IE-null mice showed severe anemia with skewed erythroid maturation. Notably, the hematological phenotypes of adult IE-null mice substantially differ from those observed in mice harboring conditional ablation of the entire Gata1 gene. The present study demonstrates that the IE exon is instrumental to adult erythropoiesis by regulating the proper level of transcription and selecting the correct transcription start site of the Gata1 gene. PMID:19854837
New genetic variants of LATS1 detected in urinary bladder and colon cancer.
Saadeldin, Mona K; Shawer, Heba; Mostafa, Ahmed; Kassem, Neemat M; Amleh, Asma; Siam, Rania
2014-01-01
LATS1, the large tumor suppressor 1 gene, encodes for a serine/threonine kinase protein and is implicated in cell cycle progression. LATS1 is down-regulated in various human cancers, such as breast cancer, and astrocytoma. Point mutations in LATS1 were reported in human sarcomas. Additionally, loss of heterozygosity of LATS1 chromosomal region predisposes to breast, ovarian, and cervical tumors. In the current study, we investigated LATS1 genetic variations including single nucleotide polymorphisms (SNPs), in 28 Egyptian patients with either urinary bladder or colon cancers. The LATS1 gene was amplified and sequenced and the expression of LATS1 at the RNA level was assessed in 12 urinary bladder cancer samples. We report, the identification of a total of 29 variants including previously identified SNPs within LATS1 coding and non-coding sequences. A total of 18 variants were novel. Majority of the novel variants, 13, were mapped to intronic sequences and un-translated regions of the gene. Four of the five novel variants located in the coding region of the gene, represented missense mutations within the serine/threonine kinase catalytic domain. Interestingly, LATS1 RNA steady state levels was lost in urinary bladder cancerous tissue harboring four specific SNPs (16045 + 41736 + 34614 + 56177) positioned in the 5'UTR, intron 6, and two silent mutations within exon 4 and exon 8, respectively. This study identifies novel single-base-sequence alterations in the LATS1 gene. These newly identified variants could potentially be used as novel diagnostic or prognostic tools in cancer.
The human cytochrome P450 3A locus. Gene evolution by capture of downstream exons.
Finta, C; Zaphiropoulos, P G
2000-12-30
Using a bacterial artificial chromosome (BAC) clone, we have mapped the human cytochrome P450 3A (CYP3A) locus containing the genes encoding for CYP3A4, CYP3A5 and CYP3A7. The genes lie in a head-to-tail orientation in the order of 3A4, 3A7 and 3A5. In both intergenic regions (3A4-3A7 and 3A7-3A5), we have detected several additional cytochrome P450 3A exons, forming two CYP3A pseudogenes. These pseudogenes have the same orientation as the CYP3A genes. To our surprise, a 3A7 mRNA species has been detected in which the exons 2 and 13 of one of the pseudogenes (the one that is downstream of 3A7) are spliced after the 3A7 terminal exon. This results in an mRNA molecule that consists of the 13 3A7 exons and two additional exons at the 3' end. The additional two exons originating from the pseudogene are in an altered reading frame and consequently have the capability to code a completely different amino acid sequence than the canonical CYP3A exons 2 and 13. These findings may represent a generalized evolutionary process with genes having the potential to capture neighboring sequences and use them as functional exons.
Genes and proteins of urea transporters.
Sands, Jeff M; Blount, Mitsi A
2014-01-01
A urea transporter protein in the kidney was first proposed in 1987. The first urea transporter cDNA was cloned in 1993. The SLC14a urea transporter family contains two major subgroups: SLC14a1, the UT-B urea transporter originally isolated from erythrocytes; and SLC14a2, the UT-A group originally isolated from kidney inner medulla. Slc14a1, the human UT-B gene, arises from a single locus located on chromosome 18q12.1-q21.1, which is located close to Slc14a2. Slc14a1 includes 11 exons, with the coding region extending from exon 4 to exon 11, and is approximately 30 kb in length. The Slc14a2 gene is a very large gene with 24 exons, is approximately 300 kb in length, and encodes 6 different isoforms. Slc14a2 contains two promoter elements: promoter I is located in the typical position, upstream of exon 1, and drives the transcription of UT-A1, UT-A1b, UT-A3, UT-A3b, and UT-A4; while promoter II is located within intron 12 and drives the transcription of UT-A2 and UT-A2b. UT-A1 and UT-A3 are located in the inner medullary collecting duct, UT-A2 in the thin descending limb and liver, UT-A5 in testis, UT-A6 in colon, UT-B1 primarily in descending vasa recta and erythrocytes, and UT-B2 in rumen.
Neutral lipid-storage disease with myopathy and extended phenotype with novel PNPLA2 mutation.
Massa, Roberto; Pozzessere, Simone; Rastelli, Emanuele; Serra, Laura; Terracciano, Chiara; Gibellini, Manuela; Bozzali, Marco; Arca, Marcello
2016-04-01
Neutral lipid-storage disease with myopathy is caused by mutations in PNPLA2, which produce skeletal and cardiac myopathy. We report a man with multiorgan neutral lipid storage and unusual multisystem clinical involvement, including cognitive impairment. Quantitative brain MRI with voxel-based morphometry and extended neuropsychological assessment were performed. In parallel, the coding sequences and intron/exon boundaries of the PNPLA2 gene were screened by direct sequencing. Neuropsychological assessment revealed global cognitive impairment, and brain MRI showed reduced gray matter volume in the temporal lobes. Molecular characterization revealed a novel homozygous mutation in exon 5 of PNPLA2 (c.714C>A), resulting in a premature stop codon (p.Cys238*). Some PNPLA2 mutations, such as the one described here, may present with an extended phenotype, including brain involvement. In these cases, complete neuropsychological testing, combined with quantitative brain MRI, may help to characterize and quantify cognitive impairment. © 2016 Wiley Periodicals, Inc.
Yang, Huiqin; Li, Shiqiang; Xiao, Xueshan; Guo, Xiangming; Zhang, Qingjiong
2012-08-01
To screen mutations in the norrin (NDP) gene in 44 unrelated Chinese patients with familial exudative vitreoretinopathy (FEVR, 38 cases) or Norrie disease (6 cases) and to describe the associated phenotypes. Of the 44 patients, mutation in FZD4, LRP5, and TSPAN12 was excluded in 38 patients with FEVR in previous study. Sanger sequencing was used to analyze the 2 coding exons and their adjacent regions of NDP in the 44 patients. Clinical data were presented for patients with mutation. NDP variants in 5 of the 6 patients with Norrie disease were identified, including a novel missense mutation (c.164G>A, p.Cys55Phe) in one patient, two known missense mutations (c.122G>A, p.Arg41Lys; c.220C>T, p.Arg74Cys) in two patients, and a gross deletion encompassing the two coding exons in two patients. Of the 5 patients, 3 had a family history and 2 were singleton cases. No mutation in NDP was detected in the 38 patients with FEVR. NDP mutations are common cause of Norrie disease but might be rare cause for FEVR in Chinese.
Ito, M; Mori, Y; Oiso, Y; Saito, H
1991-01-01
To elucidate the molecular mechanism of familial central diabetes insipidus (FDI), we sequenced the arginine vasopressin-neurophysin II (AVP-NPII) gene in 2 patients belonging to a pedigree that is consistent with an autosomal dominant mode of inheritance. 10 patients with idiopathic central diabetes insipidus (IDI) and 5 normals were also studied. The AVP-NPII gene, locating on chromosome 20, consists of three exons that encode putative signal peptide, AVP, NPII, and glycoprotein. Using polymerase chain reaction, fragments including the promoter region and all coding regions were amplified from genomic DNA and subjected to direct sequencing. Sequences of 10 patients with IDI were identical with those of normals, while in 2 patients with FDI, a single base substitution was detected in one of two alleles of the AVP-NPII gene, indicating they were heterozygotes for this mutation. It was a G----A transition at nucleotide position 1859 in the second exon, resulting in a substitution of Gly for Ser at amino acid position 57 in the NPII moiety. It was speculated that the mutated AVP-NPII precursor or the mutated NPII molecule, through their conformational changes, might be responsible for AVP deficiency. Images PMID:1840604
Douzery, Emmanuel J P; Scornavacca, Celine; Romiguier, Jonathan; Belkhir, Khalid; Galtier, Nicolas; Delsuc, Frédéric; Ranwez, Vincent
2014-07-01
Comparative genomic studies extensively rely on alignments of orthologous sequences. Yet, selecting, gathering, and aligning orthologous exons and protein-coding sequences (CDS) that are relevant for a given evolutionary analysis can be a difficult and time-consuming task. In this context, we developed OrthoMaM, a database of ORTHOlogous MAmmalian Markers describing the evolutionary dynamics of orthologous genes in mammalian genomes using a phylogenetic framework. Since its first release in 2007, OrthoMaM has regularly evolved, not only to include newly available genomes but also to incorporate up-to-date software in its analytic pipeline. This eighth release integrates the 40 complete mammalian genomes available in Ensembl v73 and provides alignments, phylogenies, evolutionary descriptor information, and functional annotations for 13,404 single-copy orthologous CDS and 6,953 long exons. The graphical interface allows to easily explore OrthoMaM to identify markers with specific characteristics (e.g., taxa availability, alignment size, %G+C, evolutionary rate, chromosome location). It hence provides an efficient solution to sample preprocessed markers adapted to user-specific needs. OrthoMaM has proven to be a valuable resource for researchers interested in mammalian phylogenomics, evolutionary genomics, and has served as a source of benchmark empirical data sets in several methodological studies. OrthoMaM is available for browsing, query and complete or filtered downloads at http://www.orthomam.univ-montp2.fr/. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
p53 in pure epithelioid PEComa: an immunohistochemistry study and gene mutation analysis.
Bing, Zhanyong; Yao, Yuan; Pasha, Theresa; Tomaszewski, John E; Zhang, Paul J
2012-04-01
Pure epithelioid PEComa (PEP; so-called epithelioid angiomyolipoma) is rare and is more often associated with aggressive behaviors. The pathogenesis of PEP has been poorly understood. The authors studied p53 expression and gene mutation in PEPs by immunohistochemistry, single-strand conformation polymorphism, and direct sequencing in paraffin material from 8 PEPs. A group of classic angiomyolipomas (AMLs) were also analyzed for comparison. Five PEPs were from kidneys and 1 each from the heart, the liver, and the uterus. PEPs showed much stronger p53 nuclear staining (Allred score 6.4 ± 2.5) than the classic AML (2.3 ± 2.9) (P < .01). There was no p53 single-strand conformation polymorphism identified in either the PEPs or the 8 classic AMLs. p53 mutation analyses by direct sequencing of exons 5 to 9 showed 4 mutations in 3 of 8 PEPs but none in any of the 8 classic AMLs. The mutations included 2 missense mutations in a hepatic PEComa and 2 silent mutations in 2 renal PEPs. Both the missense mutations in the hepatic PEComa involved the exon 5, one involving codon 165, with change from CAG to CAC (coding amino acid changed from glutamine to histidine), and the other involving codon 182, with change from TGC to TAC (coding amino acid changed from cysteine to tyrosine). The finding of stronger p53 expression and mutations in epithelioid angiomyolipomas might have contributed to their less predictable behavior. However, the abnormal p53 expression cannot be entirely explained by p53 mutations in the exons examined in the PEPs.
Soheili, Fariborz; Jalili, Zahra; Rahbar, Mahtab; Khatooni, Zahed; Mashayekhi, Amir; Jafari, Hossein
2018-03-01
The mutations in GATA4 gene induce inherited atrial and ventricular septation defects, which is the most frequent forms of congenital heart defects (CHDs) constituting about half of all cases. We have performed High resolution melting (HRM) mutation scanning of GATA4 coding exons of nonsyndrome 100 patients as a case group including 39 atrial septal defects (ASD), 57 ventricular septal defects (VSD) and four patients with both above defects and 50 healthy individuals as a control group. Our samples are categorized according to their HRM graph. The genome sequencing has been done for 15 control samples and 25 samples of patients whose HRM analysis were similar to healthy subjects for each exon. The PolyPhen-2 and MUpro have been used to determine the causative possibility and structural stability prediction of GATA4 sequence variation. The HRM curve analysis exhibit that 21 patients and 3 normal samples have deviated curves for GATA4 coding exons. Sequencing analysis has revealed 12 nonsynonymous mutations while all of them resulted in stability structure of protein 10 of them are pathogenic and 2 of them are benign. Also we found two nucleotide deletions which one of them was novel and one new indel mutation resulting in frame shift mutation, and 4 synonymous variations or polymorphism in 6 of patients and 3 of normal individuals. Six or about 50% of these nonsynonymous mutations have not been previously reported. Our results show that there is a spectrum of GATA4 mutations resulting in septal defects. © 2018 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mercier, B.; Audrezet, M.P.; Guillermit, H.
Cystic fibrosis transmembrane conductance regulator (CFTR), the gene responsible, when mutated, for cystic fibrosis (CF), spans over 230 kb on the long arm of chromosome 7 and is composed of 27 exons. The most common mutation responsible for CF worldwide is the deletion of a phenylalanine amino acid at codon 508 in the first nucleotide-binding fold and accounts for approximately 70% of CF chromosomes studied. More than 250 other mutations have been reported through the CF Genetic Analysis Consortium. The majority of the mutations previously described lie in the two nucleotide-binding folds. To explore exhaustively other regions of the gene,more » particularly exons coding for transmembrane domains, the authors have initiated a collaborative study between different laboratories to screen 369 non-[Delta]F508 CF chromosomes of seven ethnic European populations (Belgian, French, Breton, Irish, Italian, Yugoslavian, Russian). Among these chromosomes carrying an unidentified mutation, 63 were from Brittany, 50 of various French origin, 45 of Irish origin, 56 of Italian origin, 41 of Belgian origin, 2 of Turkish origin, 38 of Yugoslavian origin, 22 of Russian origin, and 52 of Bulgarian origin. Diagnostic criteria for CF included at least one positive sweat test and pulmonary disease with or without pancreatic disease. Using a denaturing gradient gel electrophoresis (DGGE) assay, they have identified eight novel mutations in exon 17b coding for part of the second transmembrane domain of the CFTR and they describe them in this report. 8 refs., 1 fig., 1 tab.« less
On splice site prediction using weight array models: a comparison of smoothing techniques
NASA Astrophysics Data System (ADS)
Taher, Leila; Meinicke, Peter; Morgenstern, Burkhard
2007-11-01
In most eukaryotic genes, protein-coding exons are separated by non-coding introns which are removed from the primary transcript by a process called "splicing". The positions where introns are cut and exons are spliced together are called "splice sites". Thus, computational prediction of splice sites is crucial for gene finding in eukaryotes. Weight array models are a powerful probabilistic approach to splice site detection. Parameters for these models are usually derived from m-tuple frequencies in trusted training data and subsequently smoothed to avoid zero probabilities. In this study we compare three different ways of parameter estimation for m-tuple frequencies, namely (a) non-smoothed probability estimation, (b) standard pseudo counts and (c) a Gaussian smoothing procedure that we recently developed.
Novel methodologies for spectral classification of exon and intron sequences
NASA Astrophysics Data System (ADS)
Kwan, Hon Keung; Kwan, Benjamin Y. M.; Kwan, Jennifer Y. Y.
2012-12-01
Digital processing of a nucleotide sequence requires it to be mapped to a numerical sequence in which the choice of nucleotide to numeric mapping affects how well its biological properties can be preserved and reflected from nucleotide domain to numerical domain. Digital spectral analysis of nucleotide sequences unfolds a period-3 power spectral value which is more prominent in an exon sequence as compared to that of an intron sequence. The success of a period-3 based exon and intron classification depends on the choice of a threshold value. The main purposes of this article are to introduce novel codes for 1-sequence numerical representations for spectral analysis and compare them to existing codes to determine appropriate representation, and to introduce novel thresholding methods for more accurate period-3 based exon and intron classification of an unknown sequence. The main findings of this study are summarized as follows: Among sixteen 1-sequence numerical representations, the K-Quaternary Code I offers an attractive performance. A windowed 1-sequence numerical representation (with window length of 9, 15, and 24 bases) offers a possible speed gain over non-windowed 4-sequence Voss representation which increases as sequence length increases. A winner threshold value (chosen from the best among two defined threshold values and one other threshold value) offers a top precision for classifying an unknown sequence of specified fixed lengths. An interpolated winner threshold value applicable to an unknown and arbitrary length sequence can be estimated from the winner threshold values of fixed length sequences with a comparable performance. In general, precision increases as sequence length increases. The study contributes an effective spectral analysis of nucleotide sequences to better reveal embedded properties, and has potential applications in improved genome annotation.
Cosart, Ted; Beja-Pereira, Albano; Luikart, Gordon
2014-11-01
The computer program EXONSAMPLER automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next-generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User-adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of EXONSAMPLER to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon-capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected ~10% of the exome (~3 million bp), including 155 candidate genes, and ~16,000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection. © 2014 John Wiley & Sons Ltd.
An improved and validated RNA HLA class I SBT approach for obtaining full length coding sequences.
Gerritsen, K E H; Olieslagers, T I; Groeneweg, M; Voorter, C E M; Tilanus, M G J
2014-11-01
The functional relevance of human leukocyte antigen (HLA) class I allele polymorphism beyond exons 2 and 3 is difficult to address because more than 70% of the HLA class I alleles are defined by exons 2 and 3 sequences only. For routine application on clinical samples we improved and validated the HLA sequence-based typing (SBT) approach based on RNA templates, using either a single locus-specific or two overlapping group-specific polymerase chain reaction (PCR) amplifications, with three forward and three reverse sequencing reactions for full length sequencing. Locus-specific HLA typing with RNA SBT of a reference panel, representing the major antigen groups, showed identical results compared to DNA SBT typing. Alleles encountered with unknown exons in the IMGT/HLA database and three samples, two with Null and one with a Low expressed allele, have been addressed by the group-specific RNA SBT approach to obtain full length coding sequences. This RNA SBT approach has proven its value in our routine full length definition of alleles. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
An UPF3-based nonsense-mediated decay in Paramecium.
Contreras, Julia; Begley, Victoria; Macias, Sandra; Villalobo, Eduardo
2014-12-01
Nonsense-mediated decay recognises mRNAs containing premature termination codons. One of its components, UPF3, is a molecular link bridging through its binding to the exon junction complex nonsense-mediated decay and splicing. In protists UPF3 has not been identified yet. We report that Paramecium tetraurelia bears an UPF3 gene and that it has a role in nonsense-mediated decay. Interestingly, the identified UPF3 has not conserved the essential amino acids required to bind the exon junction complex. Though, our data indicates that this ciliate bears genes coding for core proteins of the exon junction complex. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
The genomic structure: proof of the role of non-coding DNA.
Bouaynaya, Nidhal; Schonfeld, Dan
2006-01-01
We prove that the introns play the role of a decoy in absorbing mutations in the same way hollow uninhabited structures are used by the military to protect important installations. Our approach is based on a probability of error analysis, where errors are mutations which occur in the exon sequences. We derive the optimal exon length distribution, which minimizes the probability of error in the genome. Furthermore, to understand how can Nature generate the optimal distribution, we propose a diffusive random walk model for exon generation throughout evolution. This model results in an alpha stable exon length distribution, which is asymptotically equivalent to the optimal distribution. Experimental results show that both distributions accurately fit the real data. Given that introns also drive biological evolution by increasing the rate of unequal crossover between genes, we conclude that the role of introns is to maintain a genius balance between stability and adaptability in eukaryotic genomes.
The D4 receptor gene and mood disorders: An association study
DOE Office of Scientific and Technical Information (OSTI.GOV)
Macciardi, F.; Cavalini, M.C.; Petronis, A.
1994-09-01
The problem of a gene-disease association is of major relevance in the current research of Psychiatric Disorders, mostly because of the lack of unequivocal results obtained with the linkage approach. However, some points of an association study must also be carefully considered, namely the statistical methodology and the strategy to select a gene to be tested. The gene coding for the D4 receptor (DRD4) might be theoretically relevant as a component of the genetic susceptibility for mood disorders. We now know that DRD4 has at least 2 functional polymorphisms in the coding regions of the gene, in exon 3 andmore » exon 1, thus conferring etiologic relevance to a potentially positive association. In our work, we investigated the DRD4 genotypes of the 3rd and 1st exon for 93 patients with bipolar disorder and 57 patients with major depression, recurrent disorder. Patients have been diagnosed either by traditional DSMIII-R criteria or by clustering their lifetime psychopathological symptomatology. A random control group consisted of 151 subjects. A significant association has been found with DRD4 exon 3 genotypes, revealing an increase of genotypes 2-4 in Bipolar patients (chi-square=23.07, df=12, p=0.02). Even though a definitive confirmation of our finding requires an independent replication of the study, this result emphasizes the importance of DRD4 in mood disorders.« less
Campos, W N; Massaro, J D; Martinelli, A L C; Halliwell, J A; Marsh, S G E; Mendes-Junior, C T; Donadi, E A
2017-10-01
The HFE molecule controls iron uptake from gut, and defects in the molecule have been associated with iron overload, particularly in hereditary hemochromatosis. The HFE gene including both coding and boundary intronic regions were sequenced in 304 Brazilian individuals, encompassing healthy individuals and patients exhibiting hereditary or acquired iron overload. Six sites of variation were detected: (1) H63D C>G in exon 2, (2) IVS2 (+4) T>C in intron 2, (3) a C>G transversion in intron 3, (4) C282Y G>A in exon 4, (5) IVS4 (-44) T>C in intron 4, and (6) a new guanine deletion (G>del) in intron 5, which were used for haplotype inference. Nine HFE alleles were detected and six of these were officially named on the basis of the HLA Nomenclature, defined by the World Health Organization (WHO) Nomenclature Committee for Factors of the HLA System, and published via the IPD-IMGT/HLA website. Four alleles, HFE*001, *002, *003, and *004 exhibited variation within their exon sequences. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Tollefson, Ann E.; Ying, Baoling; Doronin, Konstantin; Sidor, Peter D.; Wold, William S. M.
2007-01-01
A short open reading frame named the “U exon,” located on the adenovirus (Ad) l-strand (for leftward transcription) between the early E3 region and the fiber gene, is conserved in mastadenoviruses. We have observed that Ad5 mutants with large deletions in E3 that infringe on the U exon display a mild growth defect, as well as an aberrant Ad E2 DNA-binding protein (DBP) intranuclear localization pattern and an apparent failure to organize replication centers during late infection. Mutants in which the U exon DNA is reconstructed have a reversed phenotype. Chow et al. (L. T. Chow et al., J. Mol. Biol. 134:265-303, 1979) described mRNAs initiating in the region of the U exon and spliced to downstream sequences in the late DBP mRNA leader and the DBP-coding region. We have cloned this mRNA (as cDNA) from Ad5 late mRNA; the predicted protein is 217 amino acids, initiating in the U exon and continuing in frame in the DBP leader and in the DBP-coding region but in a different reading frame from DBP. Polyclonal and monoclonal antibodies generated against the predicted U exon protein (UXP) showed that UXP is ∼24K in size by immunoblot and is a late protein. At 18 to 24 h postinfection, UXP is strongly associated with nucleoli and is found throughout the nucleus; later, UXP is associated with the periphery of replication centers, suggesting a function relevant to Ad DNA replication or RNA transcription. UXP is expressed by all four species C Ads. When expressed in transient transfections, UXP complements the aberrant DBP localization pattern of UXP-negative Ad5 mutants. Our data indicate that UXP is a previously unrecognized protein derived from a novel late l-strand transcription unit. PMID:17881437
Chen, Yong; Yang, Fuwei; Zheng, Hexin; Zhu, Ganghua; Hu, Peng; Wu, Weijing
2015-12-01
To explore the molecular etiology of two pedigrees affected with type II Waardenburg syndrome (WS2) and to provide genetic diagnosis and counseling. Blood samples were collected from the proband and his family members. Following extraction of genomic DNA, the coding sequences of PAX3, MITF, SOX10 and SNAI2 genes were amplified with PCR and subjected to DNA sequencing to detect potential mutations. A heterozygous deletional mutation c.649_651delAGA in exon 7 of the MITF gene has been identified in all patients from the first family, while no mutation was found in the other WS2 related genes including PAX3, MITF, SOX10 and SNAI2. The heterozygous deletion mutation c.649_651delAGA in exon 7 of the MITF gene probably underlies the disease in the first family. It is expected that other genes may also underlie WS2.
Molecular mechanisms of pathogenesis in hepatocellular carcinoma revealed by RNA‑sequencing.
Liu, Yao; Yang, Zhe; Du, Feng; Yang, Qiao; Hou, Jie; Yan, Xiaohong; Geng, Yi; Zhao, Yaning; Wang, Hua
2017-11-01
The present study aimed to explore the underlying molecular mechanisms of hepatocellular carcinoma (HCC). RNA‑sequencing profiles GSM629264 and GSM629265, from the GSE25599 data set, were downloaded from the Gene Expression Omnibus database and processed by quality evaluation. GSM629264 and GSM629265 were from HCC and adjacent non‑cancerous tissues, respectively. TopHat software was used for alignment analysis, followed by the detection of novel splicing sites. In addition, the Cufflinks software package was used to analyze gene expressions, and the Cuffdiff program was used to screen for differently expressed genes (DEGs) and differentially expressed splicing variants. Gene ontology functional enrichment and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses of DEGs were also performed. Transcription factors (TFs) and microRNAs (miRNAs) that regulate DEGs were identified, and a protein‑protein interaction (PPI) network was constructed. The hub node in the PPI network was obtained, and the TFs and miRNAs that regulated the hub node were further predicted. The quality of the sequencing data met the standards for analysis, and the clean reads were ~65%. Most sequencing reads mapped into coding sequence exons (CDS_exons), whereas other reads mapped into exon 3' untranslated regions (UTR_Exons), 5'UTR_Exons and Introns. Upregulated and downregulated DEGs between HCC and adjacent non‑cancerous tissues were screened. Genes of differentially expressed splicing variants were identified, including vesicle‑associated membrane protein 4, phosphatidylinositol glycan anchor biosynthesis class C, protein disulfide isomerase family A member 4 and growth arrest specific 5. Screened DEGs were enriched in the complement pathway. In the PPI network, ubiquitin C (UBC) was the hub node. UBC was predicted to be regulated by several TFs, including specificity protein 1 (SP1), FBJ murine osteosarcoma viral oncogene homolog (FOS), proto‑oncogene c‑JUN (JUN), FOS‑like antigen 2 (FOSL2) and SWI/SNF‑related, matrix‑associated, actin‑dependent regulator of chromatin, subfamily A, member 4 (SMARCA4), and several miRNAs, including miR‑30 and miR‑181. Results from the present study demonstrated that UBC, SP1, FOS, JUN, FOSL2, SMARCA4, miR‑30 and miR‑181 may participate in the development of HCC.
Perreault-Micale, Cynthia; Frieden, Alexander; Kennedy, Caleb J; Neitzel, Dana; Sullivan, Jessica; Faulkner, Nicole; Hallam, Stephanie; Greger, Valerie
2014-11-01
Loss of function variants in the PCDH15 gene can cause Usher syndrome type 1F, an autosomal recessive disease associated with profound congenital hearing loss, vestibular dysfunction, and retinitis pigmentosa. The Ashkenazi Jewish population has an increased incidence of Usher syndrome type 1F (founder variant p.Arg245X accounts for 75% of alleles), yet the variant spectrum in a panethnic population remains undetermined. We sequenced the coding region and intron-exon borders of PCDH15 using next-generation DNA sequencing technology in approximately 14,000 patients from fertility clinics. More than 600 unique PCDH15 variants (single nucleotide changes and small indels) were identified, including previously described pathogenic variants p.Arg3X, p.Arg245X (five patients), p.Arg643X, p.Arg929X, and p.Arg1106X. Novel truncating variants were also found, including one in the N-terminal extracellular domain (p.Leu877X), but all other novel truncating variants clustered in the exon 33 encoded C-terminal cytoplasmic domain (52 patients, 14 variants). One variant was observed predominantly in African Americans (carrier frequency of 2.3%). The high incidence of truncating exon 33 variants indicates that they are unlikely to cause Usher syndrome type 1F even though many remove a large portion of the gene. They may be tolerated because PCDH15 has several alternate cytoplasmic domain exons and differentially spliced isoforms may function redundantly. Effects of some PCDH15 truncating variants were addressed by deep sequencing of a panethnic population. Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Polymorphism at codon 36 of the p53 gene.
Felix, C A; Brown, D L; Mitsudomi, T; Ikagaki, N; Wong, A; Wasserman, R; Womer, R B; Biegel, J A
1994-01-01
A polymorphism at codon 36 in exon 4 of the p53 gene was identified by single strand conformation polymorphism (SSCP) analysis and direct sequencing of genomic DNA PCR products. The polymorphic allele, present in the heterozygous state in genomic DNAs of four of 100 individuals (4%), changes the codon 36 CCG to CCA, eliminates a FinI restriction site and creates a BccI site. Including this polymorphism there are four known polymorphisms in the p53 coding sequence.
IL-TIF/IL-22: genomic organization and mapping of the human and mouse genes.
Dumoutier, L; Van Roost, E; Ameye, G; Michaux, L; Renauld, J C
2000-12-01
IL-TIF is a new cytokine originally identified as a gene induced by IL-9 in murine T lymphocytes, and showing 22% amino acid identity with IL-10. Here, we report the sequence and organization of the mouse and human IL-TIF genes, which both consist of 6 exons spreading over approximately 6 Kb. The IL-TIF gene is a single copy gene in humans, and is located on chromosome 12q15, at 90 Kb from the IFN gamma gene, and at 27 Kb from the AK155 gene, which codes for another IL-10-related cytokine. In the mouse, the IL-TIF gene is located on chromosome 10, also in the same region as the IFN gamma gene. Although it is a single copy gene in BALB/c and DBA/2 mice, the IL-TIF gene is duplicated in other strains such as C57Bl/6, FVB and 129. The two copies, which show 98% nucleotide identity in the coding region, were named IL-TIF alpha and IL-TIF beta. Beside single nucleotide variations, they differ by a 658 nucleotide deletion in IL-TIF beta, including the first non-coding exon and 603 nucleotides from the promoter. A DNA fragment corresponding to this deletion was sufficient to confer IL-9-regulated expression of a luciferase reporter plasmid, suggesting that the IL-TIF beta gene is either differentially regulated, or not expressed at all.
Mahamdallie, Shazia; Ruark, Elise; Yost, Shawn; Ramsay, Emma; Uddin, Imran; Wylie, Harriett; Elliott, Anna; Strydom, Ann; Renwick, Anthony; Seal, Sheila; Rahman, Nazneen
2017-01-01
Detection of deletions and duplications of whole exons (exon CNVs) is a key requirement of genetic testing. Accurate detection of this variant type has proved very challenging in targeted next-generation sequencing (NGS) data, particularly if only a single exon is involved. Many different NGS exon CNV calling methods have been developed over the last five years. Such methods are usually evaluated using simulated and/or in-house data due to a lack of publicly-available datasets with orthogonally generated results. This hinders tool comparisons, transparency and reproducibility. To provide a community resource for assessment of exon CNV calling methods in targeted NGS data, we here present the ICR96 exon CNV validation series. The dataset includes high-quality sequencing data from a targeted NGS assay (the TruSight Cancer Panel) together with Multiplex Ligation-dependent Probe Amplification (MLPA) results for 96 independent samples. 66 samples contain at least one validated exon CNV and 30 samples have validated negative results for exon CNVs in 26 genes. The dataset includes 46 exon CNVs in BRCA1 , BRCA2 , TP53 , MLH1 , MSH2 , MSH6 , PMS2 , EPCAM or PTEN , giving excellent representation of the cancer predisposition genes most frequently tested in clinical practice. Moreover, the validated exon CNVs include 25 single exon CNVs, the most difficult type of exon CNV to detect. The FASTQ files for the ICR96 exon CNV validation series can be accessed through the European-Genome phenome Archive (EGA) under the accession number EGAS00001002428.
Hypoxia regulates alternative splicing of HIF and non-HIF target genes.
Sena, Johnny A; Wang, Liyi; Heasley, Lynn E; Hu, Cheng-Jun
2014-09-01
Hypoxia is a common characteristic of many solid tumors. The hypoxic microenvironment stabilizes hypoxia-inducible transcription factor 1α (HIF1α) and 2α (HIF2α/EPAS1) to activate gene transcription, which promotes tumor cell survival. The majority of human genes are alternatively spliced, producing RNA isoforms that code for functionally distinct proteins. Thus, an effective hypoxia response requires increased HIF target gene expression as well as proper RNA splicing of these HIF-dependent transcripts. However, it is unclear if and how hypoxia regulates RNA splicing of HIF targets. This study determined the effects of hypoxia on alternative splicing (AS) of HIF and non-HIF target genes in hepatocellular carcinoma cells and characterized the role of HIF in regulating AS of HIF-induced genes. The results indicate that hypoxia generally promotes exon inclusion for hypoxia-induced, but reduces exon inclusion for hypoxia-reduced genes. Mechanistically, HIF activity, but not hypoxia per se is found to be necessary and sufficient to increase exon inclusion of several HIF targets, including pyruvate dehydrogenase kinase 1 (PDK1). PDK1 splicing reporters confirm that transcriptional activation by HIF is sufficient to increase exon inclusion of PDK1 splicing reporter. In contrast, transcriptional activation of a PDK1 minigene by other transcription factors in the absence of endogenous HIF target gene activation fails to alter PDK1 RNA splicing. This study demonstrates a novel function of HIF in regulating RNA splicing of HIF target genes. ©2014 American Association for Cancer Research.
Plant Proteins Are Smaller Because They Are Encoded by Fewer Exons than Animal Proteins.
Ramírez-Sánchez, Obed; Pérez-Rodríguez, Paulino; Delaye, Luis; Tiessen, Axel
2016-12-01
Protein size is an important biochemical feature since longer proteins can harbor more domains and therefore can display more biological functionalities than shorter proteins. We found remarkable differences in protein length, exon structure, and domain count among different phylogenetic lineages. While eukaryotic proteins have an average size of 472 amino acid residues (aa), average protein sizes in plant genomes are smaller than those of animals and fungi. Proteins unique to plants are ∼81aa shorter than plant proteins conserved among other eukaryotic lineages. The smaller average size of plant proteins could neither be explained by endosymbiosis nor subcellular compartmentation nor exon size, but rather due to exon number. Metazoan proteins are encoded on average by ∼10 exons of small size [∼176 nucleotides (nt)]. Streptophyta have on average only ∼5.7 exons of medium size (∼230nt). Multicellular species code for large proteins by increasing the exon number, while most unicellular organisms employ rather larger exons (>400nt). Among subcellular compartments, membrane proteins are the largest (∼520aa), whereas the smallest proteins correspond to the gene ontology group of ribosome (∼240aa). Plant genes are encoded by half the number of exons and also contain fewer domains than animal proteins on average. Interestingly, endosymbiotic proteins that migrated to the plant nucleus became larger than their cyanobacterial orthologs. We thus conclude that plants have proteins larger than bacteria but smaller than animals or fungi. Compared to the average of eukaryotic species, plants have ∼34% more but ∼20% smaller proteins. This suggests that photosynthetic organisms are unique and deserve therefore special attention with regard to the evolutionary forces acting on their genomes and proteomes. Copyright © 2016 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cuppens, H.; Marynen, P.; Cassiman, J.J.
1993-12-01
The authors have previously shown that about 85% of the mutations in 194 Belgian cystic fibrosis alleles could be detected by a reverse dot-blot assay. In the present study, 50 Belgian chromosomes were analyzed for mutations in the cystic fibrosis transmembrane conductance regulator gene by means of direct solid phase automatic sequencing of PCR products of individual exons. Twenty-six disease mutations and 14 polymorphisms were found. Twelve of these mutations and 3 polymorphisms were not described before. With the exception of one mutant allele carrying two mutations, these mutations were the only mutations found in the complete coding region andmore » their exon/intron boundaries. The total sensitivity of mutant CF alleles that could be identified was 98.5%. Given the heterogeneity of these mutations, most of them very rare, CFTR mutation screening still remains rather complex in the population, and population screening, whether desirable or not, does not appear to be technically feasible with the methods currently available. 24 refs., 1 fig., 2 tabs.« less
GENCODE: the reference human genome annotation for The ENCODE Project.
Harrow, Jennifer; Frankish, Adam; Gonzalez, Jose M; Tapanari, Electra; Diekhans, Mark; Kokocinski, Felix; Aken, Bronwen L; Barrell, Daniel; Zadissa, Amonida; Searle, Stephen; Barnes, If; Bignell, Alexandra; Boychenko, Veronika; Hunt, Toby; Kay, Mike; Mukherjee, Gaurab; Rajan, Jeena; Despacio-Reyes, Gloria; Saunders, Gary; Steward, Charles; Harte, Rachel; Lin, Michael; Howald, Cédric; Tanzer, Andrea; Derrien, Thomas; Chrast, Jacqueline; Walters, Nathalie; Balasubramanian, Suganthi; Pei, Baikang; Tress, Michael; Rodriguez, Jose Manuel; Ezkurdia, Iakes; van Baren, Jeltje; Brent, Michael; Haussler, David; Kellis, Manolis; Valencia, Alfonso; Reymond, Alexandre; Gerstein, Mark; Guigó, Roderic; Hubbard, Tim J
2012-09-01
The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.
Koike, Chihiro; Friday, Robert P.; Nakashima, Izumi; Luppi, Patrizia; Fung, John J.; Rao, Abdul S.; Starzl, Thomas E.; Trucco, Massimo
2010-01-01
Background α1,3-galactosyltransferase (α1,3GT) is an enzyme that produces carbohydrate chains termed αGal epitopes found in most mammals, although some species of higher primates, including human, are notable exceptions. The evolutionary origin of the lost α1,3GT enzyme activity is not yet known, although it has been suggested that the promoter activity of this gene in the ancestors of higher primates was inactivated. Methods We used 5′-or 3′-RACE, GenomeWalking, reverse transcriptase polymerase chain reaction (RT-PCR) and dual Luciferase reporter assay for identification of the full-length cDNA, which includes the transcription initiation site and the promoter region of porcine α1,3GT gene. Results The region around exon 1 is guanine and cytosine (GC)-rich (about 70%), comprising a CpG island spanning more than 1.5 kbp. The 5′-flanking region of exon 1 contains multiple transcription factor consensus motifs, including GC-box, SP1, AP2, and GATA-box sites, in the absence of TATA or CAAT-box sequences. The entire gene consists of three 5′ noncoding and six coding region exons spanning more than 52 kbp. Detailed analysis of α1,3GT transcripts revealed two major alternative splicing patterns in the 5′-untranslated region (5′-UTR) and evidence for minor splicing activity that occurs in a tissue-specific manner. Interspecies comparison of 5′-UTR shows minimal homology between porcine and murine sequences except for exon 2, which suggests that the regulatory regions differ among species. Conclusions These observations have important implications for experiments involving genetic manipulation of the α1,3GT gene in transgenic animals in terms of promoter utilization, and particularly in genetically engineering cells for the animal cloning technology by nuclear transfer. PMID:11087141
Novel mutations of TCOF1 gene in European patients with treacher Collins syndrome
2011-01-01
Background Treacher Collins syndrome (TCS) is one of the most severe autosomal dominant congenital disorders of craniofacial development and shows variable phenotypic expression. TCS is extremely rare, occurring with an incidence of 1 in 50.000 live births. The TCS distinguishing characteristics are represented by down slanting palpebral fissures, coloboma of the eyelid, micrognathia, microtia and other deformity of the ears, hypoplastic zygomatic arches, and macrostomia. Conductive hearing loss and cleft palate are often present. TCS results from mutations in the TCOF1 gene located on chromosome 5, which encodes a serine/alanine-rich nucleolar phospho-protein called Treacle. However, alterations in the TCOF1 gene have been implicated in only 81-93% of TCS cases. Methods In this study, the entire coding regions of the TCOF1 gene, including newly described exons 6A and 16A, were sequenced in 46 unrelated subjects suspected of TCS clinical indication. Results Fifteen mutations were reported, including twelve novel and three already described in 14 sporadic patients and in 3 familial cases. Moreover, seven novel polymorphisms were also described. Most of the mutations characterised were microdeletions spanning one or more nucleotides, in addition to an insertion of one nucleotide in exon 18 and a stop mutation. The deletions and the insertion described cause a premature termination of translation, resulting in a truncated protein. Conclusion This study confirms that almost all the TCOF1 pathogenic mutations fall in the coding region and lead to an aberrant protein. PMID:21951868
Novel mutations of TCOF1 gene in European patients with Treacher Collins syndrome.
Conte, Chiara; D'Apice, Maria Rosaria; Rinaldi, Fabrizio; Gambardella, Stefano; Sangiuolo, Federica; Novelli, Giuseppe
2011-09-27
Treacher Collins syndrome (TCS) is one of the most severe autosomal dominant congenital disorders of craniofacial development and shows variable phenotypic expression. TCS is extremely rare, occurring with an incidence of 1 in 50.000 live births. The TCS distinguishing characteristics are represented by down slanting palpebral fissures, coloboma of the eyelid, micrognathia, microtia and other deformity of the ears, hypoplastic zygomatic arches, and macrostomia. Conductive hearing loss and cleft palate are often present. TCS results from mutations in the TCOF1 gene located on chromosome 5, which encodes a serine/alanine-rich nucleolar phospho-protein called Treacle. However, alterations in the TCOF1 gene have been implicated in only 81-93% of TCS cases. In this study, the entire coding regions of the TCOF1 gene, including newly described exons 6A and 16A, were sequenced in 46 unrelated subjects suspected of TCS clinical indication. Fifteen mutations were reported, including twelve novel and three already described in 14 sporadic patients and in 3 familial cases. Moreover, seven novel polymorphisms were also described. Most of the mutations characterised were microdeletions spanning one or more nucleotides, in addition to an insertion of one nucleotide in exon 18 and a stop mutation. The deletions and the insertion described cause a premature termination of translation, resulting in a truncated protein. This study confirms that almost all the TCOF1 pathogenic mutations fall in the coding region and lead to an aberrant protein.
Conservation of CD44 exon v3 functional elements in mammals
Vela, Elena; Hilari, Josep M; Delclaux, María; Fernández-Bellon, Hugo; Isamat, Marcos
2008-01-01
Background The human CD44 gene contains 10 variable exons (v1 to v10) that can be alternatively spliced to generate hundreds of different CD44 protein isoforms. Human CD44 variable exon v3 inclusion in the final mRNA depends on a multisite bipartite splicing enhancer located within the exon itself, which we have recently described, and provides the protein domain responsible for growth factor binding to CD44. Findings We have analyzed the sequence of CD44v3 in 95 mammalian species to report high conservation levels for both its splicing regulatory elements (the 3' splice site and the exonic splicing enhancer), and the functional glycosaminglycan binding site coded by v3. We also report the functional expression of CD44v3 isoforms in peripheral blood cells of different mammalian taxa with both consensus and variant v3 sequences. Conclusion CD44v3 mammalian sequences maintain all functional splicing regulatory elements as well as the GAG binding site with the same relative positions and sequence identity previously described during alternative splicing of human CD44. The sequence within the GAG attachment site, which in turn contains the Y motif of the exonic splicing enhancer, is more conserved relative to the rest of exon. Amplification of CD44v3 sequence from mammalian species but not from birds, fish or reptiles, may lead to classify CD44v3 as an exclusive mammalian gene trait. PMID:18710510
Lack of mutations in the leptin receptor gene in severely obese children.
Dias, Natasha Favoretto; Fernandes, Ariana Ester; Melo, Maria Edna de; Reinhardt, Heidi Lui; Cercato, Cintia; Villares, Sandra Mara Ferreira; Halpern, Alfredo; Mancini, Marcio C
2012-04-01
To analyze the LEPR gene in obese children and to investigate the associations between molecular findings and anthropometric and metabolic features. Thirty-two patients were evaluated regarding anthropometric characteristics, blood pressure, heart rate, serum glucose, insulin, leptin levels, and lipid profile. The molecular study consisted of the amplification and automatic sequencing of the coding region of LEPR in order to investigate new mutations. We identified a high prevalence of metabolic disorders: impaired fasting glucose in 12.5% of the patients, elevated HOMA-IR in 85.7%, low HDL-cholesterol levels in 46.9%, high triglyceride levels in 40.6%, and hypertension in 58.6% of the patients. The molecular study identified 6 already described allelic variants: rs1137100 (exon-2), rs1137101 (exon-4), rs1805134 (exon-7), rs8179183 (exon-12), rs1805096 (exon-18), and the deletion/insertion of the pentanucleotide CTTTA at 3'untranslated region. The frequency of alleles observed in this cohort is similar to that described in the literature, and was not correlated with any clinical feature. The molecular findings in the analysis of the LEPR did not seem to be implicated in the etiology of obesity in these patients.
Decoding of exon splicing patterns in the human RUNX1-RUNX1T1 fusion gene.
Grinev, Vasily V; Migas, Alexandr A; Kirsanava, Aksana D; Mishkova, Olga A; Siomava, Natalia; Ramanouskaya, Tatiana V; Vaitsiankova, Alina V; Ilyushonak, Ilia M; Nazarov, Petr V; Vallar, Laurent; Aleinikova, Olga V
2015-11-01
The t(8;21) translocation is the most widespread genetic defect found in human acute myeloid leukemia. This translocation results in the RUNX1-RUNX1T1 fusion gene that produces a wide variety of alternative transcripts and influences the course of the disease. The rules of combinatorics and splicing of exons in the RUNX1-RUNX1T1 transcripts are not known. To address this issue, we developed an exon graph model of the fusion gene organization and evaluated its local exon combinatorics by the exon combinatorial index (ECI). Here we show that the local exon combinatorics of the RUNX1-RUNX1T1 gene follows a power-law behavior and (i) the vast majority of exons has a low ECI, (ii) only a small part is represented by "exons-hubs" of splicing with very high ECI values, and (iii) it is scale-free and very sensitive to targeted skipping of "exons-hubs". Stochasticity of the splicing machinery and preferred usage of exons in alternative splicing can explain such behavior of the system. Stochasticity may explain up to 12% of the ECI variance and results in a number of non-coding and unproductive transcripts that can be considered as a noise. Half-life of these transcripts is increased due to the deregulation of some key genes of the nonsense-mediated decay system in leukemia cells. On the other hand, preferred usage of exons may explain up to 75% of the ECI variability. Our analysis revealed a set of splicing-related cis-regulatory motifs that can explain "attractiveness" of exons in alternative splicing but only when they are considered together. Cis-regulatory motifs are guides for splicing trans-factors and we observed a leukemia-specific profile of expression of the splicing genes in t(8;21)-positive blasts. Altogether, our results show that alternative splicing of the RUNX1-RUNX1T1 transcripts follows strict rules and that the power-law component of the fusion gene organization confers a high flexibility to this process. Copyright © 2015 Elsevier Ltd. All rights reserved.
Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R
2004-01-01
A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563
Samson, Marie-Laure
2008-01-01
Background The Drosophila gene embryonic lethal abnormal visual system (elav) is the prototype of a gene family present in all metazoans. Its members encode structurally conserved neuronal proteins with three RNA Recognition Motifs (RRM) but they paradoxically act at diverse levels of post-transcriptional regulation. In an attempt to understand the history of this family, we searched for orthologs in eleven completely sequenced genomes, including those of humans, D. melanogaster and C. elegans, for which cDNAs are available. Results We analyzed 23 orthologs/paralogs of elav, and found evidence of gain/loss of gene copy number. For one set of genes, including elav itself, the coding sequences are free of introns and their products most resemble ELAV. The remaining genes show remarkable conservation of their exon organization, and their products most resemble FNE and RBP9, proteins encoded by the two elav paralogs of Drosophila. Remarkably, three of the conserved exon junctions are both close to structural elements, involved respectively in protein-RNA interactions and in the regulation of sub-cellular localization, and in the vicinity of diverse sequence variations. Conclusion The data indicate that the essential elav gene of Drosophila is newly emerged, restricted to dipterans and of retrotransposed origin. We propose that the conserved exon junctions constitute potential sites for sequence/function modifications, and that RRM binding proteins, whose function relies upon plastic RNA-protein interactions, may have played an important role in brain evolution. PMID:18715504
Drosha Promotes Splicing of a Pre-microRNA-like Alternative Exon
Havens, Mallory A.; Reich, Ashley A.; Hastings, Michelle L.
2014-01-01
The ribonuclease III enzyme Drosha has a central role in the biogenesis of microRNA (miRNA) by binding and cleaving hairpin structures in primary RNA transcripts into precursor miRNAs (pre-miRNAs). Many miRNA genes are located within protein-coding host genes and cleaved by Drosha in a manner that is coincident with splicing of introns by the spliceosome. The close proximity of splicing and pre-miRNA biogenesis suggests a potential for co-regulation of miRNA and host gene expression, though this relationship is not completely understood. Here, we describe a cleavage-independent role for Drosha in the splicing of an exon that has a predicted hairpin structure resembling a Drosha substrate. We find that Drosha can cleave the alternatively spliced exon 5 of the eIF4H gene into a pre-miRNA both in vitro and in cells. However, the primary role of Drosha in eIF4H gene expression is to promote the splicing of exon 5. Drosha binds to the exon and enhances splicing in a manner that depends on RNA structure but not on cleavage by Drosha. We conclude that Drosha can function like a splicing enhancer and promote exon inclusion. Our results reveal a new mechanism of alternative splicing regulation involving a cleavage-independent role for Drosha in splicing. PMID:24786770
The sequence, structure and evolutionary features of HOTAIR in mammals
2011-01-01
Background An increasing number of long noncoding RNAs (lncRNAs) have been identified recently. Different from all the others that function in cis to regulate local gene expression, the newly identified HOTAIR is located between HoxC11 and HoxC12 in the human genome and regulates HoxD expression in multiple tissues. Like the well-characterised lncRNA Xist, HOTAIR binds to polycomb proteins to methylate histones at multiple HoxD loci, but unlike Xist, many details of its structure and function, as well as the trans regulation, remain unclear. Moreover, HOTAIR is involved in the aberrant regulation of gene expression in cancer. Results To identify conserved domains in HOTAIR and study the phylogenetic distribution of this lncRNA, we searched the genomes of 10 mammalian and 3 non-mammalian vertebrates for matches to its 6 exons and the two conserved domains within the 1800 bp exon6 using Infernal. There was just one high-scoring hit for each mammal, but many low-scoring hits were found in both mammals and non-mammalian vertebrates. These hits and their flanking genes in four placental mammals and platypus were examined to determine whether HOTAIR contained elements shared by other lncRNAs. Several of the hits were within unknown transcripts or ncRNAs, many were within introns of, or antisense to, protein-coding genes, and conservation of the flanking genes was observed only between human and chimpanzee. Phylogenetic analysis revealed discrete evolutionary dynamics for orthologous sequences of HOTAIR exons. Exon1 at the 5' end and a domain in exon6 near the 3' end, which contain domains that bind to multiple proteins, have evolved faster in primates than in other mammals. Structures were predicted for exon1, two domains of exon6 and the full HOTAIR sequence. The sequence and structure of two fragments, in exon1 and the domain B of exon6 respectively, were identified to robustly occur in predicted structures of exon1, domain B of exon6 and the full HOTAIR in mammals. Conclusions HOTAIR exists in mammals, has poorly conserved sequences and considerably conserved structures, and has evolved faster than nearby HoxC genes. Exons of HOTAIR show distinct evolutionary features, and a 239 bp domain in the 1804 bp exon6 is especially conserved. These features, together with the absence of some exons and sequences in mouse, rat and kangaroo, suggest ab initio generation of HOTAIR in marsupials. Structure prediction identifies two fragments in the 5' end exon1 and the 3' end domain B of exon6, with sequence and structure invariably occurring in various predicted structures of exon1, the domain B of exon6 and the full HOTAIR. PMID:21496275
DOE Office of Scientific and Technical Information (OSTI.GOV)
Klebig, M.L.; Woychik, R.P.; Wilkinson, J.E.
1994-09-01
The lethal yellow (A{sup y/-}) and viable yellow (A{sup vy/-}) mouse agouti mutants have a predominantly yellow pelage and display a complex syndrome that includes obesity, hyperinsulinemia, and insulin resistance, hallmark features of obesity-associated noninsulin-dependent diabetes mellitus (NIDDM) in humans. A new dominant agouti allele, A{sup iapy}, has recently been identified; like the A{sup vy} allele, it is homozygous viable and confers obesity and yellow fur in heterozygotes. The agouti gene was cloned and characterized at the molecular level. The gene is expressed in the skin during hair growth and is predicted to encode a 131 amino acid protein, thatmore » is likely to be a secreted factor. In both Ay/- and A{sup iapy}/- mice, the obesity and other dominant pleiotropic effects are associated with an ectopic expression of agouti in many tissues where the gene product is normally not produced. In Ay, a 170-kb deletion has occurred that causes an upstream promoter to drive the ectopic expression of the wild-type agouti coding exons. In A{sup iapy}, the coding region of the gene is expressed from a cryptic promoter within the LTR of an intracisternal A-particle (IAP), which has integrated within the region just upstream of the first agouti coding exon. Transgenic mice ubiquitously expressing the cloned agouti gene under the influence of the beta-actin and phosphoglycerate kinase promoters display obesity, hyperinsulinemia, and yellow coat color. This demonstrates unequivocally that ectopic expression of agouti is responsible for the yellow obese syndrome.« less
Wang, Xinsheng; Zhao, Xiangzhong; Wang, Xiaoling; Yao, Jian; Zhang, Feifei; Lang, Yanhua; Tuffery-Giraud, Sylvie; Bottillo, Irene; Shao, Leping
2015-01-01
Twenty-six HOGA1 mutations have been reported in primary hyperoxaluria (PH) type 3 (PH3) patients with c.700 + 5G>T accounting for about 50% of the total alleles. However, PH3 has never been described in Asians. A Chinese child with early-onset nephrolithiasis was suspected of having PH. We searched for AGXT, GRHPR and HOGA1 gene mutations in this patient and his parents. All coding regions, including intron-exon boundaries, were analyzed using PCR followed by direct sequence analysis. Two heterozygous mutations not previously described in the literature about HOGA1 were identified (compound heterozygous). One mutation was a successive 2 bp substitution at the last nucleotide of exon 6 and at the first nucleotide of intron 6, respectively (c.834_834 + 1GG>TT), while the other one was a guanine to adenine substitution of the last nucleotide of exon 6 (c.834G>A). Direct sequencing analysis failed to find these mutations in 100 unrelated healthy subjects and the functional role on splicing of both variants found in this study was confirmed by a minigene assay based on the pSPL3 exon trapping vector. In addition, we found a SNP in this family (c.715G>A, p.V239I). There were no mutations detected in AGXT and GRHPR. Two novel HOGA1 mutations were identified in association with PH3. This is the first description and investigation on mutant gene analysis of PH3 in an Asian. © 2015 S. Karger AG, Basel
... exons, the parts of DNA that code for proteins in the body. Researchers like this method because it is faster and cheaper. Learn More More still needs to be done before whole genome sequencing becomes a routine part of medical care. Many ...
The Rise and Fall of the Gene.
ERIC Educational Resources Information Center
Mahadeva, Madhu; Randerson, Sherman
1985-01-01
Summarizes the current state of genetics, highlighting major historical events in the development of the field and discussing topics related to introns ("silent" or noncoding base sequences in eucaryotic genes) and exons (the coding parts of DNA). (JN)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ploos van Amstel, H.; Reitsma, P.H.; van der Logt, C.P.
The human protein S locus on chromosome 3 consists of two protein S genes, PS{alpha} and PS{beta}. Here the authors report the cloning and characterization of both genes. Fifteen exons of the PS{alpha} gene were identified that together code for protein S mRNA as derived from the reported protein S cDNAs. Analysis by primer extension of liver protein S mRNA, however, reveals the presence of two mRNA forms that differ in the length of their 5{prime}-noncoding region. Both transcripts contain a 5{prime}-noncoding region longer than found in the protein S cDNAs. The two products may arise from alternative splicing ofmore » an additional intron in this region or from the usage of two start sites for transcription. The intron-exon organization of the PS{alpha} gene fully supports the hypothesis that the protein S gene is the product of an evolutional assembling process in which gene modules coding for structural/functional protein units also found in other coagulation proteins have been put upstream of the ancestral gene of a steroid hormone binding protein. The PS{beta} gene is identified as a pseudogene. It contains a large variety of detrimental aberrations, viz., the absence of exon I, a splice site mutation, three stop codons, and a frame shift mutation. Overall the two genes PS{alpha} and PS{beta} show between their exonic sequences 96.5% homology. Southern analysis of primate DNA showed that the duplication of the ancestral protein S gene has occurred after the branching of the orangutan from the African apes. A nonsense mutation that is present in the pseudogene of man also could be identified in one of the two protein S genes of both chimpanzee and gorilla. This implicates that silencing of one of the two protein S genes must have taken place before the divergence of the three African apes.« less
Comparative architecture of silks, fibrous proteins and their encoding genes in insects and spiders.
Craig, Catherine L; Riekel, Christian
2002-12-01
The known silk fibroins and fibrous glues are thought to be encoded by members of the same gene family. All silk fibroins sequenced to date contain regions of long-range order (crystalline regions) and/or short-range order (non-crystalline regions). All of the sequenced fibroin silks (Flag or silk from flagelliform gland in spiders; Fhc or heavy chain fibroin silks produced by Lepidoptera larvae) are made up of hierarchically organized, repetitive arrays of amino acids. Fhc fibroin genes are characterized by a similar molecular genetic architecture of two exons and one intron, but the organization and size of these units differs. The Flag, Ser (sericin gene) and BR (Balbiani ring genes; both fibrous proteins) genes are made up of multiple exons and introns. Sequences coding for crystalline and non-crystalline protein domains are integrated in the repetitive regions of Fhc and MA exons, but not in the protein glues Ser1 and BR-1. Genetic 'hot-spots' promote recombination errors in Fhc, MA, and Flag. Codon bias, structural constraint, point mutations, and shortened coding arrays may be alternative means of stabilizing precursor mRNA transcripts. Differential regulation of gene expression and selective splicing of the mRNA transcript may allow rapid adaptation of silk functional properties to different physical environments.
Quantifying the mechanisms of domain gain in animal proteins.
Buljan, Marija; Frankish, Adam; Bateman, Alex
2010-01-01
Protein domains are protein regions that are shared among different proteins and are frequently functionally and structurally independent from the rest of the protein. Novel domain combinations have a major role in evolutionary innovation. However, the relative contributions of the different molecular mechanisms that underlie domain gains in animals are still unknown. By using animal gene phylogenies we were able to identify a set of high confidence domain gain events and by looking at their coding DNA investigate the causative mechanisms. Here we show that the major mechanism for gains of new domains in metazoan proteins is likely to be gene fusion through joining of exons from adjacent genes, possibly mediated by non-allelic homologous recombination. Retroposition and insertion of exons into ancestral introns through intronic recombination are, in contrast to previous expectations, only minor contributors to domain gains and have accounted for less than 1% and 10% of high confidence domain gain events, respectively. Additionally, exonization of previously non-coding regions appears to be an important mechanism for addition of disordered segments to proteins. We observe that gene duplication has preceded domain gain in at least 80% of the gain events. The interplay of gene duplication and domain gain demonstrates an important mechanism for fast neofunctionalization of genes.
Bhasi, Ashwini; Philip, Philge; Manikandan, Vinu; Senapathy, Periannan
2009-01-01
We have developed ExDom, a unique database for the comparative analysis of the exon–intron structures of 96 680 protein domains from seven eukaryotic organisms (Homo sapiens, Mus musculus, Bos taurus, Rattus norvegicus, Danio rerio, Gallus gallus and Arabidopsis thaliana). ExDom provides integrated access to exon-domain data through a sophisticated web interface which has the following analytical capabilities: (i) intergenomic and intragenomic comparative analysis of exon–intron structure of domains; (ii) color-coded graphical display of the domain architecture of proteins correlated with their corresponding exon-intron structures; (iii) graphical analysis of multiple sequence alignments of amino acid and coding nucleotide sequences of homologous protein domains from seven organisms; (iv) comparative graphical display of exon distributions within the tertiary structures of protein domains; and (v) visualization of exon–intron structures of alternative transcripts of a gene correlated to variations in the domain architecture of corresponding protein isoforms. These novel analytical features are highly suited for detailed investigations on the exon–intron structure of domains and make ExDom a powerful tool for exploring several key questions concerning the function, origin and evolution of genes and proteins. ExDom database is freely accessible at: http://66.170.16.154/ExDom/. PMID:18984624
Two distinct promoters drive transcription of the human D1A dopamine receptor gene.
Lee, S H; Minowa, M T; Mouradian, M M
1996-10-11
The human D1A dopamine receptor gene has a GC-rich, TATA-less promoter located upstream of a small, noncoding exon 1, which is separated from the coding exon 2 by a 116-base pair (bp)-long intron. Serial 3'-deletions of the 5'-noncoding region of this gene, including the intron and 5'-end of exon 2, resulted in 80 and 40% decrease in transcriptional activity of the upstream promoter in two D1A-expressing neuroblastoma cell lines, SK-N-MC and NS20Y, respectively. To investigate the function of this region, the intron and 245 bp at the 5'-end of exon 2 were investigated. Transient expression analyses using various chloramphenicol acetyltransferase constructs showed that the transcriptional activity of the intron is higher than that of the upstream promoter by 12-fold in SK-N-MC cells and by 5.5-fold in NS20Y cells in an orientation-dependent manner, indicating that the D1A intron is a strong promoter. Primer extension and ribonuclease protection assays revealed that transcription driven by the intron promoter is initiated at the junction of intron and exon 2 and at a cluster of nucleotides located 50 bp downstream from this junction. The same transcription start sites are utilized by the chloramphenicol acetyltransferase constructs employed in transfections as well as by the D1A gene expressed within the human caudate. The relative abundance of D1A transcripts originating from the upstream promoter compared with those transcribed from the intron promoter is 1.5-2.9 times in SK-N-MC cells and 2 times in the human caudate. Transcript stability studies in SK-N-MC cells revealed that longer D1A mRNA molecules containing exon 1 are degraded 1.8 times faster than shorter transcripts lacking exon 1. Although gel mobility shift assay could not detect DNA-protein interaction at the D1A intron, competitive co-transfection using the intron as competitor confirmed the presence of trans-acting factors at the intron. These data taken together indicate that the human D1A gene has two functional TATA-less promoters, both in D1A expressing cultured neuroblastoma cells and in the human striatum.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tsugu, H.; Horowitz, R.; Gibson, N.
1994-12-01
Sera from approximately 30% of patients with systemic lupus erythematosus (SLE) contain high titers of autoantibodies that bind to the 52-kDa Ro/SSA protein. We previously detected polymorphisms in the 52-kDa Ro/SSA gene (SSA1) with restriction enzymes, one of which is strongly associated with the presence of SLE (P < 0.0005) in African Americans. A higher disease frequency and more severe forms of the disease are commonly noted among these female patients. To determine the location and nature of this polymorphism, we obtained two clones that span 8.5 kb of the 52-kDa Ro/SSA locus including its upstream regulatory region. Six exonsmore » were identified, and their nucleotide sequences plus adjacent noncoding regions were determined. No differences were found between these exons and the coding region of one of the reported cDNAs. The disease-associated polymorphic site suggested by a restriction enzyme map and confirmed by DNA amplification and nucleotide sequencing was present upstream of exon 1. This polymorphism may be a genetic marker for a disease-related variation in the coding region for the protein or in the upstream regulatory region of this gene. Although this RFLP is present in Japanese, it is not associated with lupus in this race. 41 refs., 4 figs., 2 tabs.« less
Genotype-phenotype correlation of xeroderma pigmentosum in a Chinese Han population.
Sun, Z; Zhang, J; Guo, Y; Ni, C; Liang, J; Cheng, R; Li, M; Yao, Z
2015-04-01
Xeroderma pigmentosum (XP) is a rare autosomal recessive disorder characterized by extreme sensitivity to sunlight, freckle-like pigmentation and a greatly increased incidence of skin cancers. Genetic mutation detection and genotype-phenotype analysis of XP are rarely reported in the Chinese Han population. To investigate the mutational spectrum of XP in a Chinese Han population, to discover any genotype-phenotype correlation and, consequently, to propose a simple and effective tool for the molecular diagnosis of XP. This study was carried out on 12 unrelated Chinese families that included 13 patients with clinically suspected XP. Genomic DNA was extracted from peripheral blood samples. Mutation screening was performed by direct sequencing of exons and flanking intron-exon boundaries for the entire coding region of eight XP genes. In 12 patients, direct sequencing of the whole coding region of eight XP genes revealed pathogenic mutations, including seven compound heterozygous mutations, three homozygous mutations and a Japanese founder mutation. Thirteen mutations have not been previously identified. This cohort was composed of four patients with XP-C (XPC), two with XP-G (ERCC5), three with XP-A (XPA) and three with XP-V (POLH). This study identified 13 novel mutations and extended the mutation spectrum of XP in the Chinese Han population. In this cohort, we found that patients with XP-G have no neurological symptoms, and patients with XP-A and XP-V have a high incidence of malignancy. Furthermore, lack of stringent protection against sunlight, late diagnosis and long duration of disease play an important role. © 2014 British Association of Dermatologists.
Suzuki, O. T.; Sertié, A. L.; Der Kaloustian, V. M.; Kok, F.; Carpenter, M.; Murray, J.; Czeizel, A. E.; Kliemann, S. E.; Rosemberg, S.; Monteiro, M.; Olsen, B. R.; Passos-Bueno, M. R.
2002-01-01
Knobloch syndrome (KS) is a rare disease characterized by severe ocular alterations, including vitreoretinal degeneration associated with retinal detachment and occipital scalp defect. The responsible gene, COL18A1, has been mapped to 21q22.3, and, on the basis of the analysis of one family, we have demonstrated that a mutation affecting only one of the three COL18A1 isoforms causes this phenotype. We report here the results of the screening of both the entire coding region and the exon-intron boundaries of the COL18A1 gene (which includes 43 exons), in eight unrelated patients with KS. Besides 20 polymorphic changes, we identified 6 different pathogenic changes in both alleles of five unrelated patients with KS (three compound heterozygotes and two homozygotes). All are truncating mutations leading to deficiency of one or all collagen XVIII isoforms and endostatin. We have verified that, in exon 41, the deletion c3514-3515delCT, found in three unrelated alleles, is embedded in different haplotypes, suggesting that this mutation has occurred more than once. In addition, our results provide evidence of nonallelic genetic heterogeneity in KS. We also show that the longest human isoform (NC11-728) is expressed in several tissues (including the human eye) and that lack of either the short variant or all of the collagen XVIII isoforms causes similar phenotypes but that those patients who lack all forms present more-severe ocular alterations. Despite the small sample size, we found low endostatin plasma levels in those patients with mutations leading to deficiency of all isoforms; in addition, it seems that absence of all collagen XVIII isoforms causes predisposition to epilepsy. PMID:12415512
Characterization and mapping of the mouse NDP (Norrie disease) locus (Ndp).
Battinelli, E M; Boyd, Y; Craig, I W; Breakefield, X O; Chen, Z Y
1996-02-01
Norrie disease is a severe X-linked recessive neurological disorder characterized by congenital blindness with progressive loss of hearing. Over half of Norrie patients also manifest different degrees of mental retardation. The gene for Norrie disease (NDP) has recently been cloned and characterized. With the human NDP cDNA, mouse genomic phage libraries were screened for the homolog of the gene. Comparison between mouse and human genomic DNA blots hybridized with the NDP cDNA, as well as analysis of phage clones, shows that the mouse NDP gene is 29 kb in size (28 kb for the human gene). The organization in the two species is very similar. Both have three exons with similar-sized introns and identical exon-intron boundaries between exon 2 and 3. The mouse open reading frame is 393 bp and, like the human coding sequence, is encoded in exons 2 and 3. The absence of six nucleotides in the second mouse exon results in the encoded protein being two amino acids smaller than its human counterpart. The overall homology between the human and mouse NDP protein is 95% and is particularly high (99%) in exon 3, consistent with the apparent functional importance of this region. Analysis of transcription initiation sites suggests the presence of multiple start sites associated with expression of the mouse NDP gene. Pedigree analysis of an interspecific mouse backcross localizes the mouse NDP gene close to Maoa in the conserved segment, which runs from CYBB to PFC in both human and mouse.
Zhao, Xintao; Wu, Yajie; Chen, Yi; Feng, Xinxing; Song, Ying; Wang, Yilu; Zou, Yubao; Wang, Jizheng; Shao, Yibing; Hui, Rutai; Song, Lei; Wang, Xu
2014-07-01
To identify the casual mutation of a Chinese pedigree with hypertrophic cardiomyopathy (HCM), and to analyze the genotype-phenotype relationship. The coding exons of 26 reported disease genes were sequenced by targeted resequencing in the proband and the identified mutation were detected with bi-directional Sanger sequencing in all family members and 307 healthy controls. The genotype-phenotype correlation was analyzed in the family. A missense mutation (c.2191C > T, p. Pro731Ser) in the 20th exon of MYH7 gene was identified. This mutation was absent in 307 healthy controls and predicted to be pathogenic by PolyPhen-HCM. Totally 13 family members carried this mutation, including 10 patients with HCM and 3 asymptomatic mutation carriers. The proband manifested severe congestive heart failure and 8 patients expressed various clinical manifestations of heart failure, including dyspnea, palpitations, chest pain, amaurosis or syncope. Five patients were diagnosed as HCM at the age of 16 or younger. One family member suffered sudden cardiac death. The Pro731Ser of MYH7 gene mutation is a causal and malignant mutation linked with familiar HCM.
Li, Rui; Liao, Xian-Hua; Ye, Jun-Zhao; Li, Min-Rui; Wu, Yan-Qin; Hu, Xuan; Zhong, Bi-Hui
2017-06-14
To test the hypothesis that K8/K18 variants predispose humans to non-alcoholic fatty liver disease (NAFLD) progression and its metabolic phenotypes. We selected a total of 373 unrelated adult subjects from our Physical Examination Department, including 200 unrelated NAFLD patients and 173 controls of both genders and different ages. Diagnoses of NAFLD were established according to ultrasonic signs of fatty liver. All subjects were tested for population characteristics, lipid profile, liver tests, as well as glucose tests. Genomic DNA was obtained from peripheral blood with a DNeasy Tissue Kit. K8/K18 coding regions were analyzed, including 15 exons and exon-intron boundaries. Among 200 NAFLD patients, 10 (5%) heterozygous carriers of keratin variants were identified. There were 5 amino-acid-altering heterozygous variants and 6 non-coding heterozygous variants. One novel amino-acid-altering heterozygous variant (K18 N193S) and three novel non-coding variants were observed (K8 IVS5-9A→G, K8 IVS6+19G→A, K18 T195T). A total of 9 patients had a single variant and 1 patient had compound variants (K18 N193S+K8 IVS3-15C→G). Only one R341H variant was found in the control group (1 of 173, 0.58%). The frequency of keratin variants in NAFLD patients was significantly higher than that in the control group (5% vs 0.58%, P = 0.015). Notably, the keratin variants were significantly associated with insulin resistance (IR) in NAFLD patients (8.86% in NAFLD patients with IR vs 2.5% in NAFLD patients without IR, P = 0.043). K8/K18 variants are overrepresented in Chinese NAFLD patients and might accelerate liver fat storage through IR.
[Clinical and genetic analysis of a patient with Treacher Collins syndrome in TCOF1 gene].
Li, Hongbo; Zhang, Xu; Li, Zhenyue; Chen, Jing; Lu, Yu; Jia, Jingjie; Yuan, Huijun; Han, Dongyi
2012-05-01
To analyze the clinical and genetic features of a patient with Treacher Collins syndrome (TCS), and identify the mutation in TCOF1 gene. The medical history was taken, and general physical examinations and otological examinations were conducted in this patient. Genomic DNA was extracted from this patient and his parents and complete TCOF1 gene coding exons were amplified by specific PCR primers. Direct sequencing was carried out to identify the mutations. The raw data was analyzed with GeneTool software and molecular biological website. We detected a heterozygous c. 1639 delAG mutation in exon 11 of TCOF1, which resulted in a truncated protein lacking normal function. This mutation is a novel mutation and the second case identified in exon 11 of in TCS. TCS patient reported in this study has unique clinical phenotype. TCOF1 gene mutation is the specific risk factor.
A 5′ Splice Site-Proximal Enhancer Binds SF1 and Activates Exon Bridging of a Microexon
Carlo, Troy; Sierra, Rebecca; Berget, Susan M.
2000-01-01
Internal exon size in vertebrates occurs over a narrow size range. Experimentally, exons shorter than 50 nucleotides are poorly included in mRNA unless accompanied by strengthened splice sites or accessory sequences that act as splicing enhancers, suggesting steric interference between snRNPs and other splicing factors binding simultaneously to the 3′ and 5′ splice sites of microexons. Despite these problems, very small naturally occurring exons exist. Here we studied the factors and mechanism involved in recognizing a constitutively included six-nucleotide exon from the cardiac troponin T gene. Inclusion of this exon is dependent on an enhancer located downstream of the 5′ splice site. This enhancer contains six copies of the simple sequence GGGGCUG. The enhancer activates heterologous microexons and will work when located either upstream or downstream of the target exon, suggesting an ability to bind factors that bridge splicing units. A single copy of this sequence is sufficient for in vivo exon inclusion and is the binding site for the known bridging mammalian splicing factor 1 (SF1). The enhancer and its bound SF1 act to increase recognition of the upstream exon during exon definition, such that competition of in vitro reactions with RNAs containing the GGGGCUG repeated sequence depress splicing of the upstream intron, assembly of the spliceosome on the 3′ splice site of the exon, and cross-linking of SF1. These results suggest a model in which SF1 bridges the small exon during initial assembly, thereby effectively extending the domain of the exon. PMID:10805741
Lex-SVM: exploring the potential of exon expression profiling for disease classification.
Yuan, Xiongying; Zhao, Yi; Liu, Changning; Bu, Dongbo
2011-04-01
Exon expression profiling technologies, including exon arrays and RNA-Seq, measure the abundance of every exon in a gene. Compared with gene expression profiling technologies like 3' array, exon expression profiling technologies could detect alterations in both transcription and alternative splicing, therefore they are expected to be more sensitive in diagnosis. However, exon expression profiling also brings higher dimension, more redundancy, and significant correlation among features. Ignoring the correlation structure among exons of a gene, a popular classification method like L1-SVM selects exons individually from each gene and thus is vulnerable to noise. To overcome this limitation, we present in this paper a new variant of SVM named Lex-SVM to incorporate correlation structure among exons and known splicing patterns to promote classification performance. Specifically, we construct a new norm, ex-norm, including our prior knowledge on exon correlation structure to regularize the coefficients of a linear SVM. Lex-SVM can be solved efficiently using standard linear programming techniques. The advantage of Lex-SVM is that it can select features group-wisely, force features in a subgroup to take equal weihts and exclude the features that contradict the majority in the subgroup. Experimental results suggest that on exon expression profile, Lex-SVM is more accurate than existing methods. Lex-SVM also generates a more compact model and selects genes more consistently in cross-validation. Unlike L1-SVM selecting only one exon in a gene, Lex-SVM assigns equal weights to as many exons in a gene as possible, lending itself easier for further interpretation.
Rumpho, Mary E.; Pochareddy, Sirisha; Worful, Jared M.; Summer, Elizabeth J.; Bhattacharya, Debashish; Pelletreau, Karen N.; Tyler, Mary S.; Lee, Jungho; Manhart, James R.; Soule, Kara M.
2009-01-01
Phosphoribulokinase (PRK), a nuclear-encoded plastid-localized enzyme unique to the photosynthetic carbon reduction (Calvin) cycle, was cloned and characterized from the stramenopile alga Vaucheria litorea. This alga is the source of plastids for the mollusc (sea slug) Elysia chlorotica which enable the animal to survive for months solely by photoautotrophic CO2 fixation. The 1633-bp V. litorea prk gene was cloned and the coding region, found to be interrupted by four introns, encodes a 405-amino acid protein. This protein contains the typical bipartite target sequence expected of nuclear-encoded proteins that are directed to complex (i.e. four membrane-bound) algal plastids. De novo synthesis of PRK and enzyme activity were detected in E. chlorotica in spite of having been starved of V. litorea for several months. Unlike the algal enzyme, PRK in the sea slug did not exhibit redox regulation. Two copies of partial PRK-encoding genes were isolated from both sea slug and aposymbiotic sea slug egg DNA using PCR. Each copy contains the nucleotide region spanning exon 1 and part of exon 2 of V. litorea prk, including the bipartite targeting peptide. However, the larger prk fragment also includes intron 1. The exon and intron sequences of prk in E. chlorotica and V. litorea are nearly identical. These data suggest that PRK is differentially regulated in V. litorea and E. chlorotica and at least a portion of the V. litorea nuclear PRK gene is present in sea slugs that have been starved for several months. PMID:19995736
Implication of LRRC4C and DPP6 in neurodevelopmental disorders
Maussion, Gilles; Cruceanu, Cristiana; Rosenfeld, Jill A.; Bell, Scott C.; Jollant, Fabrice; Szatkiewicz, Jin; Collins, Ryan L.; Hanscom, Carrie; Kolobova, Ilaria; de Champfleur, Nicolas Menjot; Blumenthal, Ian; Chiang, Colby; Ota, Vanessa; Hultman, Christina; O’Dushlaine, Colm; McCarroll, Steve; Alda, Martin; Jacquemont, Sebastien; Ordulu, Zehra; Marshall, Christian R.; Carter, Melissa T.; Shaffer, Lisa G.; Sklar, Pamela; Girirajan, Santhosh; Morton, Cynthia C.; Gusella, James F.; Turecki, Gustavo; Stavropoulos, D. J.; Sullivan, Patrick F.; Scherer, Stephen W.; Talkowski, Michael E.; Ernst, Carl
2018-01-01
We performed whole-genome sequencing on an individual from a family with variable psychiatric phenotypes that had a sensory processing disorder, apraxia, and autism. The proband harbored a maternally inherited balanced translocation (46,XY,t(11;14)(p12;p12)mat) that disrupted LRRC4C, a member of the highly specialized netrin G family of axon guidance molecules. The proband also inherited a paternally derived chromosomal inversion that disrupted DPP6, a potassium channel interacting protein. Copy Number (CN) analysis in 14,077 cases with neurodevelopmental disorders and 8,960 control subjects revealed that 60% of cases with exonic deletions in LRRC4C had a second clinically recognizable syndrome associated with variable clinical phenotypes, including 16p11.2, 1q44, and 2q33.1 CN syndromes, suggesting LRRC4C deletion variants may be modifiers of neurodevelopmental disorders. In vitro, functional assessments modeling patient deletions in LRRC4C suggest a negative regulatory role of these exons found in the untranslated region of LRRC4C, which has a single, terminal coding exon. These data suggest that the proband’s autism may be due to the inheritance of disruptions in both DPP6 and LRRC4C, and may highlight the importance of the netrin G family and potassium channel interacting molecules in neurodevelopmental disorders. PMID:27759917
Agirbasli, Deniz; Hyatt, Tommy; Agirbasli, Mehmet
2018-04-26
This is a case report of a 38-year-old Syrian refugee male with early-onset extensive atherosclerosis. The physical and laboratory examination were remarkable with severe xanthomas in the upper and lower extremities and with low-density lipoprotein cholesterol (LDL-C) 417 mg/dL, total cholesterol 495 mg/dL, high-density lipoprotein cholesterol 30 mg/dL, and triglycerides 242 mg/dL. LDL-C level responded poorly to the high-dose statin treatment. The genetic analysis indicated that the patient had a large homozygous deletion in LDL receptor gene including the exons 7-14. A 12-kb deletion had occurred between the 2 Alu repetitive sequences that were oriented in opposite directions, one in intron 6 and the other in intron 14. This deletion eliminated exons 7-14, which exactly corresponded to the entire exon sequence coding the epidermal growth factor precursor homology domain. This deletion in LDL receptor was previously reported. This rare case of homozygous familial hypercholesterolemia presenting with multiple large and widely distributed xanthomas implicates the need for novel treatment options in familial hypercholesterolemia patients. The case is a Syrian refugee and emphasizes the urgent need to address orphan disease in refugee populations throughout the world. Copyright © 2018 National Lipid Association. Published by Elsevier Inc. All rights reserved.
In situ genetic correction of F8 intron 22 inversion in hemophilia A patient-specific iPSCs.
Wu, Yong; Hu, Zhiqing; Li, Zhuo; Pang, Jialun; Feng, Mai; Hu, Xuyun; Wang, Xiaolin; Lin-Peng, Siyuan; Liu, Bo; Chen, Fangping; Wu, Lingqian; Liang, Desheng
2016-01-08
Nearly half of severe Hemophilia A (HA) cases are caused by F8 intron 22 inversion (Inv22). This 0.6-Mb inversion splits the 186-kb F8 into two parts with opposite transcription directions. The inverted 5' part (141 kb) preserves the first 22 exons that are driven by the intrinsic F8 promoter, leading to a truncated F8 transcript due to the lack of the last 627 bp coding sequence of exons 23-26. Here we describe an in situ genetic correction of Inv22 in patient-specific induced pluripotent stem cells (iPSCs). By using TALENs, the 627 bp sequence plus a polyA signal was precisely targeted at the junction of exon 22 and intron 22 via homologous recombination (HR) with high targeting efficiencies of 62.5% and 52.9%. The gene-corrected iPSCs retained a normal karyotype following removal of drug selection cassette using a Cre-LoxP system. Importantly, both F8 transcription and FVIII secretion were rescued in the candidate cell types for HA gene therapy including endothelial cells (ECs) and mesenchymal stem cells (MSCs) derived from the gene-corrected iPSCs. This is the first report of an efficient in situ genetic correction of the large inversion mutation using a strategy of targeted gene addition.
In situ genetic correction of F8 intron 22 inversion in hemophilia A patient-specific iPSCs
Wu, Yong; Hu, Zhiqing; Li, Zhuo; Pang, Jialun; Feng, Mai; Hu, Xuyun; Wang, Xiaolin; Lin-Peng, Siyuan; Liu, Bo; Chen, Fangping; Wu, Lingqian; Liang, Desheng
2016-01-01
Nearly half of severe Hemophilia A (HA) cases are caused by F8 intron 22 inversion (Inv22). This 0.6-Mb inversion splits the 186-kb F8 into two parts with opposite transcription directions. The inverted 5′ part (141 kb) preserves the first 22 exons that are driven by the intrinsic F8 promoter, leading to a truncated F8 transcript due to the lack of the last 627 bp coding sequence of exons 23–26. Here we describe an in situ genetic correction of Inv22 in patient-specific induced pluripotent stem cells (iPSCs). By using TALENs, the 627 bp sequence plus a polyA signal was precisely targeted at the junction of exon 22 and intron 22 via homologous recombination (HR) with high targeting efficiencies of 62.5% and 52.9%. The gene-corrected iPSCs retained a normal karyotype following removal of drug selection cassette using a Cre-LoxP system. Importantly, both F8 transcription and FVIII secretion were rescued in the candidate cell types for HA gene therapy including endothelial cells (ECs) and mesenchymal stem cells (MSCs) derived from the gene-corrected iPSCs. This is the first report of an efficient in situ genetic correction of the large inversion mutation using a strategy of targeted gene addition. PMID:26743572
Kabir, Firoz; Ullah, Inayat; Ali, Shahbaz; Gottsch, Alexander D.H.; Naeem, Muhammad Asif; Assir, Muhammad Zaman; Khan, Shaheen N.; Akram, Javed; Riazuddin, Sheikh; Ayyagari, Radha; Hejtmancik, J. Fielding
2016-01-01
Purpose This study was undertaken to identify causal mutations responsible for autosomal recessive retinitis pigmentosa (arRP) in consanguineous families. Methods Large consanguineous families were ascertained from the Punjab province of Pakistan. An ophthalmic examination consisting of a fundus evaluation and electroretinography (ERG) was completed, and small aliquots of blood were collected from all participating individuals. Genomic DNA was extracted from white blood cells, and a genome-wide linkage or a locus-specific exclusion analysis was completed with polymorphic short tandem repeats (STRs). Two-point logarithm of odds (LOD) scores were calculated, and all coding exons and exon–intron boundaries of RP1 were sequenced to identify the causal mutation. Results The ophthalmic examination showed that affected individuals in all families manifest cardinal symptoms of RP. Genome-wide scans localized the disease phenotype to chromosome 8q, a region harboring RP1, a gene previously implicated in the pathogenesis of RP. Sanger sequencing identified a homozygous single base deletion in exon 4: c.3697delT (p.S1233Pfs22*), a single base substitution in intron 3: c.787+1G>A (p.I263Nfs8*), a 2 bp duplication in exon 2: c.551_552dupTA (p.Q185Yfs4*) and an 11,117 bp deletion that removes all three coding exons of RP1. These variations segregated with the disease phenotype within the respective families and were not present in ethnically matched control samples. Conclusions These results strongly suggest that these mutations in RP1 are responsible for the retinal phenotype in affected individuals of all four consanguineous families. PMID:27307693
Mutation Scanning in Wheat by Exon Capture and Next-Generation Sequencing.
King, Robert; Bird, Nicholas; Ramirez-Gonzalez, Ricardo; Coghill, Jane A; Patil, Archana; Hassani-Pak, Keywan; Uauy, Cristobal; Phillips, Andrew L
2015-01-01
Targeted Induced Local Lesions in Genomes (TILLING) is a reverse genetics approach to identify novel sequence variation in genomes, with the aims of investigating gene function and/or developing useful alleles for breeding. Despite recent advances in wheat genomics, most current TILLING methods are low to medium in throughput, being based on PCR amplification of the target genes. We performed a pilot-scale evaluation of TILLING in wheat by next-generation sequencing through exon capture. An oligonucleotide-based enrichment array covering ~2 Mbp of wheat coding sequence was used to carry out exon capture and sequencing on three mutagenised lines of wheat containing previously-identified mutations in the TaGA20ox1 homoeologous genes. After testing different mapping algorithms and settings, candidate SNPs were identified by mapping to the IWGSC wheat Chromosome Survey Sequences. Where sequence data for all three homoeologues were found in the reference, mutant calls were unambiguous; however, where the reference lacked one or two of the homoeologues, captured reads from these genes were mis-mapped to other homoeologues, resulting either in dilution of the variant allele frequency or assignment of mutations to the wrong homoeologue. Competitive PCR assays were used to validate the putative SNPs and estimate cut-off levels for SNP filtering. At least 464 high-confidence SNPs were detected across the three mutagenized lines, including the three known alleles in TaGA20ox1, indicating a mutation rate of ~35 SNPs per Mb, similar to that estimated by PCR-based TILLING. This demonstrates the feasibility of using exon capture for genome re-sequencing as a method of mutation detection in polyploid wheat, but accurate mutation calling will require an improved genomic reference with more comprehensive coverage of homoeologues.
ANXA11 mutations prevail in Chinese ALS patients with and without cognitive dementia.
Zhang, Kang; Liu, Qing; Liu, Keqiang; Shen, Dongchao; Tai, Hongfei; Shu, Shi; Ding, Qingyun; Fu, Hanhui; Liu, Shuangwu; Wang, Zhili; Li, Xiaoguang; Liu, Mingsheng; Zhang, Xue; Cui, Liying
2018-06-01
To investigate the genetic contribution of ANXA11 , a gene associated with amyotrophic lateral sclerosis (ALS), in Chinese ALS patients with and without cognitive dementia. Sequencing all the coding exons of ANXA11 and intron-exon boundaries in 18 familial amyotrophic lateral sclerosis (FALS), 353 unrelated sporadic amyotrophic lateral sclerosis (SALS), and 12 Chinese patients with ALS-frontotemporal lobar dementia (ALS-FTD). The transcripts in peripheral blood generated from a splicing mutation were examined by reverse transcriptase PCR. We identified 6 nonsynonymous heterozygous mutations (5 novel and 1 recurrent), 1 splice site mutation, and 1 deletion of 10 amino acids (not accounted in the mutant frequency) in 11 unrelated patients, accounting for a mutant frequency of 5.6% (1/18) in FALS, 2.3% (8/353) in SALS, and 8.3% (1/12) in ALS-FTD. The deletion of 10 amino acids was detected in 1 clinically undetermined male with an ALS family history who had atrophy in hand muscles and myotonic discharges revealed by EMG. The novel p. P36R mutation was identified in 1 FALS index, 1 patient with SALS, and 1 ALS-FTD. The splicing mutation (c.174-2A>G) caused in-frame skipping of the entire exon 6. The rest missense mutations including p.D40G, p.V128M, p.S229R, p.R302C and p.G491R were found in 6 unrelated patients with SALS. The ANXA11 gene is one of the most frequently mutated genes in Chinese patients with SALS. A canonical splice site mutation leading to skipping of the entire exon 6 further supports the loss-of-function mechanism. In addition, the study findings further expand the ANXA11 phenotype, first highlighting its pathogenic role in ALS-FTD.
Friedberg, Felix
2009-05-01
In this paper we examine (restricted to homo sapiens) the products resulting from gene duplication and the subsequent alternative splicing for the members of a multidomain group of proteins which possess the evolutionary conserved calponin homology CH domain, i.e. an "actin binding domain", as a singlet and which, in addition, contain the conserved cysteine rich double Zn finger possessing Lim domain, also as a singlet. Seven genes, resulting from gene duplications, were identified that code for seven group members for which pre-mRNAs appear to have undergone multiple alternative splicing: Mical 1, 2 and 3 are located on chromosomes 6q21, 11p15 and 22q11, respectively. The LMO7 gene is present on chromosome 13q22 and the LIMCH1 gene on chromosome 4p13. Micall1 is mapped to chromosome 22q13 and Micall2 to chromosome 7p22. Translated Gen/Bank ESTs suggest the existence of multiple products alternatively spliced from the pre-mRNAs encoded by these genes. Characteristic indicators of such splicing among the proteins derived from one gene must include containment of some common extensive 100% identical regions. In some instances only one exon might be partly or completely eliminated. Sometimes alternative splicing is also associated with an increased frequency of creation of an exon or part of an exon from an intron. Not only coding regions for the body of the protein but also for its N- or -C ends could be affected by the splicing. If created forms are merely beginning at different starting points but remain identical in sequence thereafter, their existence as products of alternate splicing must be questioned. In the splicings, described in this paper, multiple isoforms rather than a single isoform appear as products during the gene expression.
Grossen, Christine; Keller, Lukas; Biebach, Iris; Croll, Daniel
2014-01-01
The major histocompatibility complex (MHC) is a crucial component of the vertebrate immune system and shows extremely high levels of genetic polymorphism. The extraordinary genetic variation is thought to be ancient polymorphisms maintained by balancing selection. However, introgression from related species was recently proposed as an additional mechanism. Here we provide evidence for introgression at the MHC in Alpine ibex (Capra ibex ibex). At a usually very polymorphic MHC exon involved in pathogen recognition (DRB exon 2), Alpine ibex carried only two alleles. We found that one of these DRB alleles is identical to a DRB allele of domestic goats (Capra aegagrus hircus). We sequenced 2489 bp of the coding and non-coding regions of the DRB gene and found that Alpine ibex homozygous for the goat-type DRB exon 2 allele showed nearly identical sequences (99.8%) to a breed of domestic goats. Using Sanger and RAD sequencing, microsatellite and SNP chip data, we show that the chromosomal region containing the goat-type DRB allele has a signature of recent introgression in Alpine ibex. A region of approximately 750 kb including the DRB locus showed high rates of heterozygosity in individuals carrying one copy of the goat-type DRB allele. These individuals shared SNP alleles both with domestic goats and other Alpine ibex. In a survey of four Alpine ibex populations, we found that the region surrounding the DRB allele shows strong linkage disequilibria, strong sequence clustering and low diversity among haplotypes carrying the goat-type allele. Introgression at the MHC is likely adaptive and introgression critically increased MHC DRB diversity in the genetically impoverished Alpine ibex. Our finding contradicts the long-standing view that genetic variability at the MHC is solely a consequence of ancient trans-species polymorphism. Introgression is likely an underappreciated source of genetic diversity at the MHC and other loci under balancing selection. PMID:24945814
Promoter mutation is a common variant in GJC2-associated Pelizaeus-Merzbacher-like disease.
Meyer, E; Kurian, M A; Morgan, N V; McNeill, A; Pasha, S; Tee, L; Younis, R; Norman, A; van der Knaap, M S; Wassmer, E; Trembath, R C; Brueton, L; Maher, E R
2011-12-01
Pelizaeus-Merzbacher-like disease (PMLD) is a clinically and genetically heterogeneous neurological disorder of cerebral hypomyelination. It is clinically characterised by early onset (usually infantile) nystagmus, impaired motor development, ataxia, choreoathetoid movements, dysarthria and progressive limb spasticity. We undertook autozygosity mapping studies in a large consanguineous family of Pakistani origin in which affected children had progressive lower limb spasticity and features of cerebral hypomyelination on MR brain imaging. SNP microarray and microsatellite marker analysis demonstrated linkage to chromosome 1q42.13-1q42.2. Direct sequencing of the gap junction protein gamma-2 gene, GJC2, identified a promoter region mutation (c.-167A>G) in the non-coding exon 1. The c.-167A>G promoter mutation was identified in a further 4 individuals from two families (who were also of Pakistani origin) with clinical and radiological features of PMLD in whom previous routine diagnostic screening of GJC2 had been reported as negative. A common haplotype was identified at the GJC2 locus in the three mutation-positive families, consistent with a common origin for the mutation and likely founder effect. This promoter mutation has only recently been reported in GJC2-PMLD but it has been postulated to affect the binding of the transcription factor SOX10 and appears to be a prevalent mutation, accounting for ~29% of reported patients with GJC2-PMLD. We propose that diagnostic screening of GJC2 should include sequence analysis of the non-coding exon 1, as well as the coding regions to avoid misdiagnosis or diagnostic delay in suspected PMLD. Copyright © 2011 Elsevier Inc. All rights reserved.
Izuogu, Osagie G; Alhasan, Abd A; Mellough, Carla; Collin, Joseph; Gallon, Richard; Hyslop, Jonathon; Mastrorosa, Francesco K; Ehrmann, Ingrid; Lako, Majlinda; Elliott, David J; Santibanez-Koref, Mauro; Jackson, Michael S
2018-04-20
Circular RNAs (circRNAs) are predominantly derived from protein coding genes, and some can act as microRNA sponges or transcriptional regulators. Changes in circRNA levels have been identified during human development which may be functionally important, but lineage-specific analyses are currently lacking. To address this, we performed RNAseq analysis of human embryonic stem (ES) cells differentiated for 90 days towards 3D laminated retina. A transcriptome-wide increase in circRNA expression, size, and exon count was observed, with circRNA levels reaching a plateau by day 45. Parallel statistical analyses, controlling for sample and locus specific effects, identified 239 circRNAs with expression changes distinct from the transcriptome-wide pattern, but these all also increased in abundance over time. Surprisingly, circRNAs derived from long non-coding RNAs (lncRNAs) were found to account for a significantly larger proportion of transcripts from their loci of origin than circRNAs from coding genes. The most abundant, circRMST:E12-E6, showed a > 100X increase during differentiation accompanied by an isoform switch, and accounts for > 99% of RMST transcripts in many adult tissues. The second most abundant, circFIRRE:E10-E5, accounts for > 98% of FIRRE transcripts in differentiating human ES cells, and is one of 39 FIRRE circRNAs, many of which include multiple unannotated exons. Our results suggest that during human ES cell differentiation, changes in circRNA levels are primarily globally controlled. They also suggest that RMST and FIRRE, genes with established roles in neurogenesis and topological organisation of chromosomal domains respectively, are processed as circular lncRNAs with only minor linear species.
Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro
2008-01-03
The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13 nucleotides long open reading frame and a second exon comprising the remaining of the putative coding region. The expression of these genes is induced at10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.
Chung, H Y; Choi, Y C; Park, H N
2015-05-18
We investigated the phylogenetic relationships between pig breeds, compared the genetic similarity between humans and pigs, and provided basic genetic information on Korean native pigs (KNPs), using genetic variants of the swine leukocyte antigen 3 (SLA-3) gene. Primers were based on sequences from GenBank (accession Nos. AF464010 and AF464009). Polymerase chain reaction analysis amplified approximately 1727 bp of segments, which contained 1086 bp of coding regions and 641 bp of the 3'- and 5'-untranslated regions. Bacterial artificial chromosome clones of miniature pigs were used for sequencing the SLA-3 genomic region, which was 3114 bp in total length, including the coding (1086 bp) and non-coding (2028 bp) regions. Sequence analysis detected 53 single nucleotide polymorphisms (SNPs), based on a minor allele frequency greater than 0.01, which is low compared with other pig breeds, and the results suggest that there is low genetic variability in KNPs. Comparative analysis revealed that humans possess approximately three times more genetic variation than do pigs. Approximately 71% of SNPs in exons 2 and 3 were detected in KNPs, and exon 5 in humans is a highly polymorphic region. Newly identified sequences of SLA-3 using KNPs were submitted to GenBank (accession No. DQ992512-18). Cluster analysis revealed that KNPs were grouped according to three major alleles: SLA-3*0502 (DQ992518), SLA-3*0302 (DQ992513 and DQ992516), and SLA-3*0303 (DQ992512, DQ992514, DQ992515, and DQ992517). Alignments revealed that humans have a relatively close genetic relationship with pigs and chimpanzees. The information provided by this study may be useful in KNP management.
Recognition of Protein-coding Genes Based on Z-curve Algorithms
-Biao Guo, Feng; Lin, Yan; -Ling Chen, Ling
2014-01-01
Recognition of protein-coding genes, a classical bioinformatics issue, is an absolutely needed step for annotating newly sequenced genomes. The Z-curve algorithm, as one of the most effective methods on this issue, has been successfully applied in annotating or re-annotating many genomes, including those of bacteria, archaea and viruses. Two Z-curve based ab initio gene-finding programs have been developed: ZCURVE (for bacteria and archaea) and ZCURVE_V (for viruses and phages). ZCURVE_C (for 57 bacteria) and Zfisher (for any bacterium) are web servers for re-annotation of bacterial and archaeal genomes. The above four tools can be used for genome annotation or re-annotation, either independently or combined with the other gene-finding programs. In addition to recognizing protein-coding genes and exons, Z-curve algorithms are also effective in recognizing promoters and translation start sites. Here, we summarize the applications of Z-curve algorithms in gene finding and genome annotation. PMID:24822027
[Identifying and sequence analysis of HLA-B*2736].
Li, Zhen; Zou, Hong-Yan; Shao, Chao-Peng; Tang, Si; Wang, Da-Ming; Cheng, Liang-Hong
2007-11-01
An unknown HLA-B allele which was similar to HLA-B*270401 was detected by FLOW-SSOPCR-SSP and heterozygous sequence-based typing (SBT) in Chinese Han individual. Its anomalous patterns suggested the possible presence of new allele. Amplifying exon 2-5(include intron 2-4) of the HLA-B*27 allele separately by using allele-specific primers and sequencing in both directions. Identifying the difference between the novel B*27 allele and B*270401. The sequence of novel B*27 from exon 2 to partial exon 5 is 1 815 bp. There are 10 nt changes from B*270401 in exon 3-4, at nt634where A-->C(codon130 AGC-->CGC, 130 S-->R); nt670 where A-->T (codon142 ACC-->TCC, 142 T-->S); nt683 where G-->T (codon146 TGG-->TTG, 146 W-->L); nt698 where A-->T (codon151 GAG-->GTG, 151 E-->V); nt774 where G-->C (codon176 GAG-->GAC, 176 E-->D); nt776 where C-->A (codon177 ACG-->AAG, 177 T-->K); nt781 where C-->G (codon179 CAG-->GAG, 179Q-->E); nt789 where G-->T (codon181 GCG-->GCT) resulting no coding change; nt1438 where C-->T (codon206 GGC-->GGT) resulting no coding change; nt1449 where G-->C (codon210 GGG-->GCG, 210G-->A). In IMGT/HLA database, only three alleles (B*270502/2706/2732) have sequences of introns. The same sequence in intron 2 showed homology between the novel HLA-B*27 allele and B*2706, but their homology could not be supported in intron 3-4. Comparing the sequence of the novel B*27 allele in intron 3 and 4 with B*27 group, it showed there are three mutations at nt106 C-->G, nt179 G-->A, nt536 G-->A and one deletion at nt168 in intron 3 and one mutations at nt82 T-->C in intron 4, but the sequence of the novel B*27 allele in intron 3 and 4 was all the same to B*070201. The sequence was submitted to Gen-Bank and the accession number was DQ915176. The allele has been confirmed as an extension of B*2736 by the WHO Nomenclature committee in November 2006.
Novel exon 1 protein-coding regions N-terminally extend human KCNE3 and KCNE4.
Abbott, Geoffrey W
2016-08-01
The 5 human (h)KCNE β subunits each regulate various cation channels and are linked to inherited cardiac arrhythmias. Reported here are previously undiscovered protein-coding regions in exon 1 of hKCNE3 and hKCNE4 that extend their encoded extracellular domains by 44 and 51 residues, which yields full-length proteins of 147 and 221 residues, respectively. Full-length hKCNE3 and hKCNE4 transcript and protein are expressed in multiple human tissues; for hKCNE4, only the longer protein isoform is detectable. Two-electrode voltage-clamp electrophysiology revealed that, when coexpressed in Xenopus laevis oocytes with various potassium channels, the newly discovered segment preserved conversion of KCNQ1 by hKCNE3 to a constitutively open channel, but prevented its inhibition of Kv4.2 and KCNQ4. hKCNE4 slowing of Kv4.2 inactivation and positive-shifted steady-state inactivation were also preserved in the longer form. In contrast, full-length hKCNE4 inhibition of KCNQ1 was limited to 40% at +40 mV vs. 80% inhibition by the shorter form, and augmentation of KCNQ4 activity by hKCNE4 was entirely abolished by the additional segment. Among the genome databases analyzed, the longer KCNE3 is confined to primates; full-length KCNE4 is widespread in vertebrates but is notably absent from Mus musculus Findings highlight unexpected KCNE gene diversity, raise the possibility of dynamic regulation of KCNE partner modulation via splice variation, and suggest that the longer hKCNE3 and hKCNE4 proteins should be adopted in future mechanistic and genetic screening studies.-Abbott, G. W. Novel exon 1 protein-coding regions N-terminally extend human KCNE3 and KCNE4. © FASEB.
CVD-associated non-coding RNA, ANRIL, modulates expression of atherogenic pathways in VSMC
DOE Office of Scientific and Technical Information (OSTI.GOV)
Congrains, Ada; Kamide, Kei; Katsuya, Tomohiro
Highlights: Black-Right-Pointing-Pointer ANRIL maps in the strongest susceptibility locus for cardiovascular disease. Black-Right-Pointing-Pointer Silencing of ANRIL leads to altered expression of tissue remodeling-related genes. Black-Right-Pointing-Pointer The effects of ANRIL on gene expression are splicing variant specific. Black-Right-Pointing-Pointer ANRIL affects progression of cardiovascular disease by regulating proliferation and apoptosis pathways. -- Abstract: ANRIL is a newly discovered non-coding RNA lying on the strongest genetic susceptibility locus for cardiovascular disease (CVD) in the chromosome 9p21 region. Genome-wide association studies have been linking polymorphisms in this locus with CVD and several other major diseases such as diabetes and cancer. The role of thismore » non-coding RNA in atherosclerosis progression is still poorly understood. In this study, we investigated the implication of ANRIL in the modulation of gene sets directly involved in atherosclerosis. We designed and tested siRNA sequences to selectively target two exons (exon 1 and exon 19) of the transcript and successfully knocked down expression of ANRIL in human aortic vascular smooth muscle cells (HuAoVSMC). We used a pathway-focused RT-PCR array to profile gene expression changes caused by ANRIL knock down. Notably, the genes affected by each of the siRNAs were different, suggesting that different splicing variants of ANRIL might have distinct roles in cell physiology. Our results suggest that ANRIL splicing variants play a role in coordinating tissue remodeling, by modulating the expression of genes involved in cell proliferation, apoptosis, extra-cellular matrix remodeling and inflammatory response to finally impact in the risk of cardiovascular disease and other pathologies.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, O.; Masters, C.; Lewis, M.B.
1994-09-01
In an 8-year-old girl and her father, both of whom have severe type III OI, we have previously used RNA/RNA hybrid analysis to demonstrate a mismatch in the region of {alpha}1(I) mRNA coding for aa 558-861. We used SSCP to further localize the abnormality to a subregion coding for aa 579-679. This region was subcloned and sequenced. Each patient`s cDNA has a deletion of the sequences coding for the last residue of exon 34, and all of exons 35 and 36 (aa 604-639), followed by an insertion of 156 nt from the 3{prime}-end of intron 36. PCR amplification of leukocytemore » DNA from the patients and the clinically normal paternal grandmother yielded two fragments: a 1007 bp fragment predicted from normal genomic sequences and a 445 bp fragment. Subcloning and sequencing of the shorter genomic PCR product confirmed the presence of a 565 bp genomic deletion from the end of exon 34 to the middle of intron 36. The abnormal protein is apparently synthesized and incorporated into helix. The inserted nucleotides are in frame with the collagenous sequence and contain no stop codons. They encode a 52 aa non-collagenous region. The fibroblast procollagen of the patients has both normal and electrophoretically delayed pro{alpha}(I) bands. The electrophoretically delayed procollagen is very sensitive to pepsin or trypsin digestion, as predicted by its non-collagenous sequence, and cannot be visualized as collagen. This unique OI collagen mutation is an excellent candidate for molecular targeting to {open_quotes}turn off{close_quotes} a dominant mutant allele.« less
Characterization of novel RS1 exonic deletions in juvenile X-linked retinoschisis
D’Souza, Leera; Cukras, Catherine; Antolik, Christian; Craig, Candice; He, Hong; Li, Shibo; Hejtmancik, James F.; Sieving, Paul A.; Wang, Xinjing
2013-01-01
Purpose X-linked juvenile retinoschisis (XLRS) is a vitreoretinal dystrophy characterized by schisis (splitting) of the inner layers of the neuroretina. Mutations within the retinoschisis (RS1) gene are responsible for this disease. The mutation spectrum consists of amino acid substitutions, splice site variations, small indels, and larger genomic deletions. Clinically, genomic deletions are rarely reported. Here, we characterize two novel full exonic deletions: one encompassing exon 1 and the other spanning exons 4–5 of the RS1 gene. We also report the clinical findings in these patients with XLRS with two different exonic deletions. Methods Unrelated XLRS men and boys and their mothers (if available) were enrolled for molecular genetics evaluation. The patients also underwent ophthalmologic examination and in some cases electroretinogram (ERG) recording. All the exons and the flanking intronic regions of the RS1 gene were analyzed with direct sequencing. Two patients with exonic deletions were further evaluated with array comparative genomic hybridization to define the scope of the genomic aberrations. After the deleted genomic region was identified, primer walking followed by direct sequencing was used to determine the exact breakpoints. Results Two novel exonic deletions of the RS1 gene were identified: one including exon 1 and the other spanning exons 4 and 5. The exon 1 deletion extends from the 5′ region of the RS1 gene (including the promoter) through intron 1 (c.(−35)-1723_c.51+2664del4472). The exon 4–5 deletion spans introns 3 to intron 5 (c.185–1020_c.522+1844del5764). Conclusions Here we report two novel exonic deletions within the RS1 gene locus. We have also described the clinical presentations and hypothesized the genomic mechanisms underlying these schisis phenotypes. PMID:24227916
Characterization of novel RS1 exonic deletions in juvenile X-linked retinoschisis.
D'Souza, Leera; Cukras, Catherine; Antolik, Christian; Craig, Candice; Lee, Ji-Yun; He, Hong; Li, Shibo; Smaoui, Nizar; Hejtmancik, James F; Sieving, Paul A; Wang, Xinjing
2013-01-01
X-linked juvenile retinoschisis (XLRS) is a vitreoretinal dystrophy characterized by schisis (splitting) of the inner layers of the neuroretina. Mutations within the retinoschisis (RS1) gene are responsible for this disease. The mutation spectrum consists of amino acid substitutions, splice site variations, small indels, and larger genomic deletions. Clinically, genomic deletions are rarely reported. Here, we characterize two novel full exonic deletions: one encompassing exon 1 and the other spanning exons 4-5 of the RS1 gene. We also report the clinical findings in these patients with XLRS with two different exonic deletions. Unrelated XLRS men and boys and their mothers (if available) were enrolled for molecular genetics evaluation. The patients also underwent ophthalmologic examination and in some cases electroretinogram (ERG) recording. All the exons and the flanking intronic regions of the RS1 gene were analyzed with direct sequencing. Two patients with exonic deletions were further evaluated with array comparative genomic hybridization to define the scope of the genomic aberrations. After the deleted genomic region was identified, primer walking followed by direct sequencing was used to determine the exact breakpoints. Two novel exonic deletions of the RS1 gene were identified: one including exon 1 and the other spanning exons 4 and 5. The exon 1 deletion extends from the 5' region of the RS1 gene (including the promoter) through intron 1 (c.(-35)-1723_c.51+2664del4472). The exon 4-5 deletion spans introns 3 to intron 5 (c.185-1020_c.522+1844del5764). Here we report two novel exonic deletions within the RS1 gene locus. We have also described the clinical presentations and hypothesized the genomic mechanisms underlying these schisis phenotypes.
Exon 2-mediated c-myc mRNA decay in vivo is independent of its translation.
Pistoi, S; Roland, J; Babinet, C; Morello, D
1996-01-01
We have previously shown that the steady-state level of c-myc mRNA in vivo is primarily controlled by posttranscriptional regulatory mechanisms. To identify the sequences involved in this process, we constructed a series of H-2/myc transgenic lines in which various regions of the human c-MYC gene were placed under the control of the quasi-ubiquitous H-2K class I regulatory sequences. We demonstrated that the presence of one of the two coding exons, exon 2 or exon 3, is sufficient to confer a level of expression of transgene mRNA similar to that of endogenous c-myc in various adult tissues as well as after partial hepatectomy or after protein synthesis inhibition. We now focus on the molecular mechanisms involved in modulation of expression of mRNAs containing c-myc exon 2 sequences, with special emphasis on the coupling between translation and c-myc mRNA turnover. We have undertaken an analysis of expression, both at the mRNA level and at the protein level, of new transgenic constructs in which the translation is impaired either by disruption of the initiation codon or by addition of stop codons upstream of exon 2. Our results show that the translation of c-myc exon 2 is not required for regulated expression of the transgene in the different situations analyzed, and therefore they indicate that the mRNA destabilizing function of exon 2 is independent of translation by ribosomes. Our investigations also reveal that, in the thymus, some H-2/myc transgenes express high levels of mRNA but low levels of protein. Besides the fact that these results suggest the existence of tissue-specific mechanisms that control c-myc translatability in vivo, they also bring another indication of the uncoupling of c-myc mRNA translation and degradation. PMID:8756668
High Resolution Melting Analysis for JAK2 Exon 14 and Exon 12 Mutations
Rapado, Inmaculada; Grande, Silvia; Albizua, Enriqueta; Ayala, Rosa; Hernández, José-Angel; Gallardo, Miguel; Gilsanz, Florinda; Martinez-Lopez, Joaquin
2009-01-01
JAK2 mutations are important criteria for the diagnosis of Philadelphia chromosome-negative myeloproliferative neoplasms. We aimed to assess JAK2 exon 14 and exon 12 mutations by high-resolution melting (HRM) analysis, which allows variation screening. The exon 14 analysis included 163 patients with polycythemia vera, secondary erythrocytoses, essential thrombocythemia, or secondary thrombocytoses, and 126 healthy subjects. The study of exon 12 included 40 JAK2 V617F-negative patients (nine of which had polycythemia vera, and 31 with splanchnic vein thrombosis) and 30 healthy subjects. HRM analyses of JAK2 exons 14 and 12 gave analytical sensitivities near 1% and both intra- and interday coefficients of variation of less than 1%. For HRM analysis of JAK2 exon 14 in polycythemia vera and essential thrombocythemia, clinical sensitivities were 93.5% and 67.9%, clinical specificities were 98.8% and 97.0%, positive predictive values were 93.5% and 79.2%, and negative predictive values were 98.8% and 94.6, respectively. Correlations were observed between the results from HRM and three commonly used analytical methods. The JAK2 exon 12 HRM results agreed completely with those from sequencing analysis, and the three mutations in exon 12 were detected by both methods. Hence, HRM analysis of exons 14 and 12 in JAK2 shows better diagnostic values than three other routinely used methods against which it was compared. In addition, HRM analysis has the advantage of detecting unknown mutations. PMID:19225136
Wise, C A; Chiang, L C; Paznekas, W A; Sharma, M; Musy, M M; Ashley, J A; Lovett, M; Jabs, E W
1997-04-01
Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development.
Genomic organization of the neurofibromatosis 1 gene (NF1)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Y.; O`Connell, P.; Huntsman Breidenbach, H.
Neurofibromatosis 1 maps to chromosome band 17q11.2, and the NF1 locus has been partially characterized. Even though the full-length NF1 cDNA has been sequenced, the complete genomic structure of the NF1 gene has not been elucidated. The 5{prime} end of NF1 is embedded in a CpG island containing a NotI restriction site, and the remainder of the gene lies in the adjacent 350-kb NotI fragment. In our efforts to develop a comprehensive screen for NF1 mutations, we have isolated genomic DNA clones that together harbor the entire NF1 cDNA sequence. We have identified all intron-exon boundaries of the coding regionmore » and established that it is composed of 59 exons. Furthermore, we have defined the 3{prime}-untranslated region (3{prime}-UTR) of the NF1 gene; it spans approximately 3.5 kb of genomic DNA sequence and is continuous with the stop codon. Oligonucleotide primer pairs synthesized from exon-flanking DNA sequences were used in the polymerase chain reaction with cloned, chromosome 17-specific genomic DNA as template to amplify NF1 exons 1 through 27b and the exon containing the 3{prime}-UTR separately. This information should be useful for implementing a comprehensive NF1 mutation screen using genomic DNA as template. 41 refs., 3 figs., 2 tabs.« less
Laitinen, Eeva-Maria; Tommiska, Johanna; Dunkel, Leo; Sankilampi, Ulla; Vaaralahti, Kirsi; Raivio, Taneli
2010-04-01
To describe a mother with idiopathic hypogonadotropic hypogonadism (IHH) and her monozygotic (MZ) twin boys who all have the same heterozygous fibroblast growth factor receptor-1 (FGFR1) gene mutation. Case report. University hospital. A 28-year-old mother with normosmic IHH gave birth to MZ twin boys after a transfer of a single frozen-thawed embryo. Clinical and biochemical evaluation of IHH. Sequence analysis of the 17 coding exons (exons 2-18) and exon-intron boundaries of FGFR1 from polymerase chain reaction-amplified genomic DNA from peripheral blood leukocytes of the subjects. Phenotypic features of the subjects. All subjects harbored a previously undescribed heterozygous FGFR1 mutation (c.2049-1 G-->C), leading to the skipping of exon 16 and thus a loss of amino acids 684-726 in the tyrosine kinase domain of the receptor. The absence of exon 16 was verified at the cDNA level. The twins manifested with microphallus, cryptorchidism, and deficient postnatal activation of the hypothalamic-pituitary-gonadal axis, findings consistent with IHH. Our report underlines that assisted reproductive techniques enable the inheritance of gene mutations causing infertility. This is the first report on the phenotypic features of MZ twins with an FGFR1 mutation. Copyright 2010 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Yang, Q L; Huang, X Y; Kong, J J; Zhao, S G; Liu, L X; Gun, S B
2016-08-19
Piglet diarrhea is one of the primary factors that affects the benefits of the swine industry. Recent studies have shown that exon 2 of the swine leukocyte antigen-DQA gene is associated with piglet resistance to diarrhea; however, the contributions of additional exon coding regions of this gene remain unclear. Here, we detected and sequenced variants in the exon 3 region and examined their associations with diarrhea infection in 425 suckling piglets using the polymerase chain reaction-single-strand conformational polymorphism and sequencing analysis. The results revealed that exon 3 of the swine leukocyte antigen-DQA gene is highly polymorphic and pivotal to both diarrhea susceptibility and resistance in piglets. We identified 14 genotypes (AA, AB, BB, BC, CC, EE, EF, BE, BF, CF, DD, DH, GG, and GF) and eight alleles (A-H) that were generated by 14 nucleotide variants, eight of which were novel, and three nucleotide deletions. Statistical analyses revealed that the genotypes AB and EF were associated with resistance to diarrheal disease (P < 0.05), and the genotype DD may contribute to diarrhea susceptibility but was unique to Large White pigs (P > 0.05). These results elucidate the genetic and immunological background to piglet diarrhea, and provide useful information for resistance breeding programs.
Identification of an Intronic Splicing Enhancer Essential for the Inclusion of FGFR2 Exon IIIc*S⃞
Seth, Puneet; Miller, Heather B.; Lasda, Erika L.; Pearson, James L.; Garcia-Blanco, Mariano A.
2008-01-01
The ligand specificity of fibroblast growth factor receptor 2 (FGFR2) is determined by the alternative splicing of exons 8 (IIIb) or 9 (IIIc). Exon IIIb is included in epithelial cells, whereas exon IIIc is included in mesenchymal cells. Although a number of cis elements and trans factors have been identified that play a role in exon IIIb inclusion in epithelium, little is known about the activation of exon IIIc in mesenchyme. We report here the identification of a splicing enhancer required for IIIc inclusion. This 24-nucleotide (nt) downstream intronic splicing enhancer (DISE) is located within intron 9 immediately downstream of exon IIIc. DISE was able to activate the inclusion of heterologous exons rat FGFR2 IIIb and human β-globin exon 2 in cell lines from different tissues and species and also in HeLa cell nuclear extracts in vitro. DISE was capable of replacing the intronic activator sequence 1 (IAS1), a known IIIb splicing enhancer and vice versa. This fact, together with the requirement for DISE to be close to the 5′-splice site and the ability of DISE to promote binding of U1 snRNP, suggested that IAS1 and DISE belong to the same class of cis-acting elements. PMID:18256031
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
Eisman, Robert C.; Phelps, Melissa A. S.; Kaufman, Thomas
2015-01-01
The formation of the pericentriolar matrix (PCM) and a fully functional centrosome in syncytial Drosophila melanogaster embryos requires the rapid transport of Cnn during initiation of the centrosome replication cycle. We show a Cnn and Polo kinase interaction is apparently required during embryogenesis and involves the exon 1A-initiating coding exon, suggesting a subset of Cnn splice variants is regulated by Polo kinase. During PCM formation exon 1A Cnn-Long Form proteins likely bind Polo kinase before phosphorylation by Polo for Cnn transport to the centrosome. Loss of either of these interactions in a portion of the total Cnn protein pool is sufficient to remove native Cnn from the pool, thereby altering the normal localization dynamics of Cnn to the PCM. Additionally, Cnn-Short Form proteins are required for polar body formation, a process known to require Polo kinase after the completion of meiosis. Exon 1A Cnn-LF and Cnn-SF proteins, in conjunction with Polo kinase, are required at the completion of meiosis and for the formation of functional centrosomes during early embryogenesis. PMID:26447129
Eisman, Robert C; Phelps, Melissa A S; Kaufman, Thomas
2015-10-01
The formation of the pericentriolar matrix (PCM) and a fully functional centrosome in syncytial Drosophila melanogaster embryos requires the rapid transport of Cnn during initiation of the centrosome replication cycle. We show a Cnn and Polo kinase interaction is apparently required during embryogenesis and involves the exon 1A-initiating coding exon, suggesting a subset of Cnn splice variants is regulated by Polo kinase. During PCM formation exon 1A Cnn-Long Form proteins likely bind Polo kinase before phosphorylation by Polo for Cnn transport to the centrosome. Loss of either of these interactions in a portion of the total Cnn protein pool is sufficient to remove native Cnn from the pool, thereby altering the normal localization dynamics of Cnn to the PCM. Additionally, Cnn-Short Form proteins are required for polar body formation, a process known to require Polo kinase after the completion of meiosis. Exon 1A Cnn-LF and Cnn-SF proteins, in conjunction with Polo kinase, are required at the completion of meiosis and for the formation of functional centrosomes during early embryogenesis. Copyright © 2015 by the Genetics Society of America.
Li, Hongying; Zhang, Kaihui; Xu, Qun; Ma, Lixia; Lv, Xin; Sun, Ruopeng
2015-03-01
Alkaptonuria (AKU) is an autosomal recessive disorder of tyrosine metabolism, which is caused by a defect in the enzyme homogentisate 1,2-dioxygenase (HGD) with subsequent accumulation of homogentisic acid. Presently, more than 100 HGD mutations have been identified as the cause of the inborn error of metabolism across different populations worldwide. However, the HGD mutation is very rarely reported in Asia, especially China. In this study, we present mutational analyses of HGD gene in one Chinese Han child with AKU, which had been identified by gas chromatography-mass spectrometry detection of organic acids in urine samples. PCR and DNA sequencing of the entire coding region as well as exon-intron boundaries of HGD have been performed. Two novel mutations were identified in the HGD gene in this AKU case, a frameshift mutation of c.115delG in exon 3 and the splicing mutation of IVS5+3 A>C, a donor splice site of the exon 5 and exon-intron junction. The identification of these mutations in this study further expands the spectrum of known HGD gene mutations and contributes to prenatal molecular diagnosis of AKU.
Analysis and recognition of 5′ UTR intron splice sites in human pre-mRNA
Eden, E.; Brunak, S.
2004-01-01
Prediction of splice sites in non-coding regions of genes is one of the most challenging aspects of gene structure recognition. We perform a rigorous analysis of such splice sites embedded in human 5′ untranslated regions (UTRs), and investigate correlations between this class of splice sites and other features found in the adjacent exons and introns. By restricting the training of neural network algorithms to ‘pure’ UTRs (not extending partially into protein coding regions), we for the first time investigate the predictive power of the splicing signal proper, in contrast to conventional splice site prediction, which typically relies on the change in sequence at the transition from protein coding to non-coding. By doing so, the algorithms were able to pick up subtler splicing signals that were otherwise masked by ‘coding’ noise, thus enhancing significantly the prediction of 5′ UTR splice sites. For example, the non-coding splice site predicting networks pick up compositional and positional bias in the 3′ ends of non-coding exons and 5′ non-coding intron ends, where cytosine and guanine are over-represented. This compositional bias at the true UTR donor sites is also visible in the synaptic weights of the neural networks trained to identify UTR donor sites. Conventional splice site prediction methods perform poorly in UTRs because the reading frame pattern is absent. The NetUTR method presented here performs 2–3-fold better compared with NetGene2 and GenScan in 5′ UTRs. We also tested the 5′ UTR trained method on protein coding regions, and discovered, surprisingly, that it works quite well (although it cannot compete with NetGene2). This indicates that the local splicing pattern in UTRs and coding regions is largely the same. The NetUTR method is made publicly available at www.cbs.dtu.dk/services/NetUTR. PMID:14960723
Laitinen, Eeva-Maria; Tommiska, Johanna; Virtanen, Helena E; Oehlandt, Heidi; Koivu, Rosanna; Vaaralahti, Kirsi; Toppari, Jorma; Raivio, Taneli
2011-07-20
Mutations in FGFR1, GNRHR, PROK2, PROKR2, TAC3, or TACR3 underlie isolated hypogonadotropic hypogonadism (IHH) with clinically variable phenotypes, and, by causing incomplete intrauterine activation of the hypothalamic-pituitary-gonadal axis, may lead to cryptorchidism. To investigate the role of defects in these genes in the etiology of isolated cryptorchidism, we screened coding exons and exon-intron boundaries of these genes in 54 boys or men from 46 families with a history of cryptorchidism. Control subjects (200) included 120 males. None of the patients carried mutation(s) in FGFR1, PROK2, PROKR2, TAC3 or TACR3. Two of the 46 index subjects with unilateral cryptorchidism were heterozygous carriers of a single GNRHR mutation (Q106R or R262Q), also present in male controls with a similar frequency (3/120; p=0.62). No homozygous or compound heterozygous GNRHR mutations were found. In conclusion, cryptorchidism is not commonly caused by defects in genes involved in IHH. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Alternative Splicing as a Target for Cancer Treatment.
Martinez-Montiel, Nancy; Rosas-Murrieta, Nora Hilda; Anaya Ruiz, Maricruz; Monjaraz-Guzman, Eduardo; Martinez-Contreras, Rebeca
2018-02-11
Alternative splicing is a key mechanism determinant for gene expression in metazoan. During alternative splicing, non-coding sequences are removed to generate different mature messenger RNAs due to a combination of sequence elements and cellular factors that contribute to splicing regulation. A different combination of splicing sites, exonic or intronic sequences, mutually exclusive exons or retained introns could be selected during alternative splicing to generate different mature mRNAs that could in turn produce distinct protein products. Alternative splicing is the main source of protein diversity responsible for 90% of human gene expression, and it has recently become a hallmark for cancer with a full potential as a prognostic and therapeutic tool. Currently, more than 15,000 alternative splicing events have been associated to different aspects of cancer biology, including cell proliferation and invasion, apoptosis resistance and susceptibility to different chemotherapeutic drugs. Here, we present well established and newly discovered splicing events that occur in different cancer-related genes, their modification by several approaches and the current status of key tools developed to target alternative splicing with diagnostic and therapeutic purposes.
Connecting the dots: chromatin and alternative splicing in EMT.
Warns, Jessica A; Davie, James R; Dhasarathy, Archana
2016-02-01
Nature has devised sophisticated cellular machinery to process mRNA transcripts produced by RNA Polymerase II, removing intronic regions and connecting exons together, to produce mature RNAs. This process, known as splicing, is very closely linked to transcription. Alternative splicing, or the ability to produce different combinations of exons that are spliced together from the same genomic template, is a fundamental means of regulating protein complexity. Similar to transcription, both constitutive and alternative splicing can be regulated by chromatin and its associated factors in response to various signal transduction pathways activated by external stimuli. This regulation can vary between different cell types, and interference with these pathways can lead to changes in splicing, often resulting in aberrant cellular states and disease. The epithelial to mesenchymal transition (EMT), which leads to cancer metastasis, is influenced by alternative splicing events of chromatin remodelers and epigenetic factors such as DNA methylation and non-coding RNAs. In this review, we will discuss the role of epigenetic factors including chromatin, chromatin remodelers, DNA methyltransferases, and microRNAs in the context of alternative splicing, and discuss their potential involvement in alternative splicing during the EMT process.
Genetic variations in the MCT1 (SLC16A1) gene in the Chinese population of Singapore.
Lean, Choo Bee; Lee, Edmund Jon Deoon
2009-01-01
MCT1(SLC16A1) is the first member of the monocarboxylate transporter (MCT) and its family is involved in the transportation of metabolically important monocarboxylates such as lactate, pyruvate, acetate and ketone bodies. This study identifies genetic variations in SLC16A1 in the ethnic Chinese group of the Singaporean population (n=95). The promoter, coding region and exon-intron junctions of the SLC16A1 gene encoding the MCT1 transporter were screened for genetic variation in the study population by DNA sequencing. Seven genetic variations of SLC16A1, including 4 novel ones, were found: 2 in the promoter region, 2 in the coding exons (both nonsynonymous variations), 2 in the 3' untranslated region (3'UTR) and 1 in the intron. Of the two mutations detected in the promoter region, the -363-855T>C is a novel mutation. The 1282G>A (Val(428)Ile) is a novel SNP and was found as heterozygotic in 4 subjects. The 1470T>A (Asp(490)Glu) was found to be a common polymorphism in this study. Lastly, IVS3-17A>C in intron 3 and 2258 (755)A>G in 3'UTR are novel mutations found to be common polymorphisms in the local Chinese population. To our knowledge, this is the first report of a comprehensive analysis on the MCT1 gene in any population.
Pandey, Manmohan; Kumar, Ravindra; Srivastava, Prachi; Agarwal, Suyash; Srivastava, Shreya; Nagpure, Naresh S; Jena, Joy K; Kushwaha, Basdeo
2018-03-16
Mining and characterization of Simple Sequence Repeat (SSR) markers from whole genomes provide valuable information about biological significance of SSR distribution and also facilitate development of markers for genetic analysis. Whole genome sequencing (WGS)-SSR Annotation Tool (WGSSAT) is a graphical user interface pipeline developed using Java Netbeans and Perl scripts which facilitates in simplifying the process of SSR mining and characterization. WGSSAT takes input in FASTA format and automates the prediction of genes, noncoding RNA (ncRNA), core genes, repeats and SSRs from whole genomes followed by mapping of the predicted SSRs onto a genome (classified according to genes, ncRNA, repeats, exonic, intronic, and core gene region) along with primer identification and mining of cross-species markers. The program also generates a detailed statistical report along with visualization of mapped SSRs, genes, core genes, and RNAs. The features of WGSSAT were demonstrated using Takifugu rubripes data. This yielded a total of 139 057 SSR, out of which 113 703 SSR primer pairs were uniquely amplified in silico onto a T. rubripes (fugu) genome. Out of 113 703 mined SSRs, 81 463 were from coding region (including 4286 exonic and 77 177 intronic), 7 from RNA, 267 from core genes of fugu, whereas 105 641 SSR and 601 SSR primer pairs were uniquely mapped onto the medaka genome. WGSSAT is tested under Ubuntu Linux. The source code, documentation, user manual, example dataset and scripts are available online at https://sourceforge.net/projects/wgssat-nbfgr.
The frequency of previously undetectable deletions involving 3' Exons of the PMS2 gene.
Vaughn, Cecily P; Baker, Christine L; Samowitz, Wade S; Swensen, Jeffrey J
2013-01-01
Lynch syndrome is characterized by mutations in one of four mismatch repair genes, MLH1, MSH2, MSH6, or PMS2. Clinical mutation analysis of these genes includes sequencing of exonic regions and deletion/duplication analysis. However, detection of deletions and duplications in PMS2 has previously been confined to Exons 1-11 due to gene conversion between PMS2 and the pseudogene PMS2CL in the remaining 3' exons (Exons 12-15). We have recently described an MLPA-based method that permits detection of deletions of PMS2 Exons 12-15; however, the frequency of such deletions has not yet been determined. To address this question, we tested for 3' deletions in 58 samples that were reported to be negative for PMS2 mutations using previously available methods. All samples were from individuals whose tumors exhibited loss of PMS2 immunohistochemical staining without concomitant loss of MLH1 immunostaining. We identified seven samples in this cohort with deletions in the 3' region of PMS2, including three previously reported samples with deletions of Exons 13-15 (two samples) and Exons 14-15. Also detected were deletions of Exons 12-15, Exon 13, and Exon 14 (two samples). Breakpoint analysis of the intragenic deletions suggests they occurred through Alu-mediated recombination. Our results indicate that ∼12% of samples suspected of harboring a PMS2 mutation based on immunohistochemical staining, for which mutations have not yet been identified, would benefit from testing using the new methodology. Copyright © 2012 Wiley Periodicals, Inc.
Large exon size does not limit splicing in vivo.
Chen, I T; Chasin, L A
1994-03-01
Exon sizes in vertebrate genes are, with a few exceptions, limited to less than 300 bases. It has been proposed that this limitation may derive from the exon definition model of splice site recognition. In this model, a downstream donor site enhances splicing at the upstream acceptor site of the same exon. This enhancement may require contact between factors bound to each end of the exon; an exon size limitation would promote such contact. To test the idea that proximity was required for exon definition, we inserted random DNA fragments from Escherichia coli into a central exon in a three-exon dihydrofolate reductase minigene and tested whether the expanded exons were efficiently spliced. DNA from a plasmid library of expanded minigenes was used to transfect a CHO cell deletion mutant lacking the dhfr locus. PCR analysis of DNA isolated from the pooled stable cotransfectant populations displayed a range of DNA insert sizes from 50 to 1,500 nucleotides. A parallel analysis of the RNA from this population by reverse transcription followed by PCR showed a similar size distribution. Central exons as large as 1,400 bases could be spliced into mRNA. We also tested individual plasmid clones containing exon inserts of defined sizes. The largest exon included in mRNA was 1,200 bases in length, well above the 300-base limit implied by the survey of naturally occurring exons. We conclude that a limitation in exon size is not part of the exon definition mechanism.
[Numeric alterations in the dys gene and their association with clinical features].
Mampel, Alejandra; Echeverría, María Inés; Vargas, Ana Lía; Roque, María
2011-01-01
The Duchenne/Becker muscular dystrophy is a hereditary miopathy with a recessive sex-linked pattern. The related gene is called DYS and the coded protein plays a crucial role in the anchorage between the cytoskeleton and the cellular membrane in muscle cells. Different clinical manifestations are observed depending on the impact of the genetic alteration on the protein. The global register of mutations reveals an enhanced frequency for deletions/duplications of one or more exons affecting the DYS gene. In the present work, numeric alterations have been studied in the 79 exons of the DYS gene. The study has been performed on 59 individuals, including 31 independent cases and 28 cases with a familial link. The applied methodology was Multiplex Ligation Dependent Probe Amplification (MLPA). In the 31 independent cases clinical data were established: i.e. the clinical score, the Raven test percentiles, and the creatininphosphokinase (CPK) blood values. Our results reveal a 61.3% frequency of numeric alterations affecting the DYS gene in our population, provoking all of them a reading frame shift. The rate for de novo mutations was identified as 35.2%. Alterations involving a specific region of one exon were observed with high frequency, affecting a specific region. A significant association was found between numeric alterations and a low percentile for the Raven test. These data contribute to the local knowledge of genetic alterations and their phenotypic impact for the Duchenne/Becker disease.
Lu, Jiamiao; Williams, James A.; Luke, Jeremy; Zhang, Feijie; Chu, Kirk; Kay, Mark A.
2017-01-01
We previously developed a mini-intronic plasmid (MIP) expression system in which the essential bacterial elements for plasmid replication and selection are placed within an engineered intron contained within a universal 5′ UTR noncoding exon. Like minicircle DNA plasmids (devoid of bacterial backbone sequences), MIP plasmids overcome transcriptional silencing of the transgene. However, in addition MIP plasmids increase transgene expression by 2 and often >10 times higher than minicircle vectors in vivo and in vitro. Based on these findings, we examined the effects of the MIP intronic sequences in a recombinant adeno-associated virus (AAV) vector system. Recombinant AAV vectors containing an intron with a bacterial replication origin and bacterial selectable marker increased transgene expression by 40 to 100 times in vivo when compared with conventional AAV vectors. Therefore, inclusion of this noncoding exon/intron sequence upstream of the coding region can substantially enhance AAV-mediated gene expression in vivo. PMID:27903072
DOE Office of Scientific and Technical Information (OSTI.GOV)
Steinlein, O.; Weiland, S.; Stoodt, J.
1996-03-01
The human neuronal nicotinic acetylcholine receptor {alpha}4 subunit gene (CHRNA4) is located in the candidate region for three different phenotypes: benign familial neonatal convulsions, autosomal dominant nocturnal frontal lobe epilepsy, and low-voltage EEG. Recently, a missense mutation in transmembrane domain 2 of CHRNA4 was found to be associated with autosomal dominant nocturnal frontal lobe epilepsy in one extended pedigree. We have determined the genomic organization of CHRNA4, which consists of six exons distributed over approximately 17 kb of genomic DNA. The nucleotide sequence obtained from the genomic regions adjacent to the exon boundaries enabled us to develop a set ofmore » primer pairs for PCR amplification of the complete coding region. The sequence analysis provides the basis for a comprehensive mutation screening of CHRNA4 in the above-mentioned phenotypes and possibly in other types of idopathic epilepsies. 29 refs., 3 figs., 1 tab.« less
Nicolas, Francisco Esteban; Moxon, Simon; de Haro, Juan P.; Calo, Silvia; Grigoriev, Igor V.; Torres-Martínez, Santiago; Moulton, Vincent; Ruiz-Vázquez, Rosa M.; Dalmay, Tamas
2010-01-01
Endogenous short RNAs (esRNAs) play diverse roles in eukaryotes and usually are produced from double-stranded RNA (dsRNA) by Dicer. esRNAs are grouped into different classes based on biogenesis and function but not all classes are present in all three eukaryotic kingdoms. The esRNA register of fungi is poorly described compared to other eukaryotes and it is not clear what esRNA classes are present in this kingdom and whether they regulate the expression of protein coding genes. However, evidence that some dicer mutant fungi display altered phenotypes suggests that esRNAs play an important role in fungi. Here, we show that the basal fungus Mucor circinelloides produces new classes of esRNAs that map to exons and regulate the expression of many protein coding genes. The largest class of these exonic-siRNAs (ex-siRNAs) are generated by RNA-dependent RNA Polymerase 1 (RdRP1) and dicer-like 2 (DCL2) and target the mRNAs of protein coding genes from which they were produced. Our results expand the range of esRNAs in eukaryotes and reveal a new role for esRNAs in fungi. PMID:20427422
Lenglet, Marion; Robriquet, Florence; Schwarz, Klaus; Camps, Carme; Couturier, Anne; Hoogewijs, David; Buffet, Alexandre; Knight, Samantha Jl; Gad, Sophie; Couvé, Sophie; Chesnel, Franck; Pacault, Mathilde; Lindenbaum, Pierre; Job, Sylvie; Dumont, Solenne; Besnard, Thomas; Cornec, Marine; Dreau, Helene; Pentony, Melissa; Kvikstad, Erika; Deveaux, Sophie; Burnichon, Nelly; Ferlicot, Sophie; Vilaine, Mathias; Mazzella, Jean-Michaël; Airaud, Fabrice; Garrec, Céline; Heidet, Laurence; Irtan, Sabine; Mantadakis, Elpis; Bouchireb, Karim; Debatin, Klaus-Michael; Redon, Richard; Bezieau, Stéphane; Bressac-de Paillerets, Brigitte; Teh, Bin Tean; Girodon, François; Randi, Maria-Luigia; Putti, Maria Caterina; Bours, Vincent; Van Wijk, Richard; Göthert, Joachim R; Kattamis, Antonis; Janin, Nicolas; Bento, Celeste; Taylor, Jenny C; Arlot-Bonnemains, Yannick; Richard, Stéphane; Gimenez-Roqueplo, Anne-Paule; Cario, Holger; Gardie, Betty
2018-06-11
Chuvash polycythemia is an autosomal recessive form of erythrocytosis associated with a homozygous p.Arg200Trp mutation in the von Hippel-Lindau (VHL) gene. Since this discovery, additional VHL mutations have been identified in patients with congenital erythrocytosis, in a homozygous or compound-heterozygous state. VHL is a major tumor suppressor gene, mutations in which were first described in patients presenting with von Hippel-Lindau disease, which is characterized by the development of highly vascularized tumors. Here, we identified a new VHL cryptic-exon (termed E1') deep in intron 1 that is naturally expressed in many tissues. More importantly, we identified mutations in E1' in seven families with erythrocytosis (one homozygous case and six compound-heterozygous cases with a mutation in E1' in addition to a mutation in VHL coding sequences) and in one large family with typical VHL disease but without any alteration in the other VHL exons. In this study we have shown that the mutations induced a dysregulation of the VHL splicing with excessive retention of E1' and are associated with a downregulation of VHL protein expression. In addition, we have demonstrated a pathogenic role for synonymous mutations in VHL-Exon 2 that alter splicing through E2-skipping in five families with erythrocytosis or VHL disease. In all the studied cases, the mutations differentially impact splicing, correlating with phenotype severity. This study demonstrates that cryptic-exon-retention or exon-skipping are new VHL alterations and reveals a novel complex splicing regulation of the VHL gene. These findings open new avenues for diagnosis and research into the VHL-related-hypoxia-signaling pathway. Copyright © 2018 American Society of Hematology.
NASA Technical Reports Server (NTRS)
Donoho, Greg; Brenneman, Mark A.; Cui, Tracy X.; Donoviel, Dorit; Vogel, Hannes; Goodwin, Edwin H.; Chen, David J.; Hasty, Paul
2003-01-01
The Brca2 tumor-suppressor gene contributes to genomic stability, at least in part by a role in homologous recombinational repair. BRCA2 protein is presumed to function in homologous recombination through interactions with RAD51. Both exons 11 and 27 of Brca2 code for domains that interact with RAD51; exon 11 encodes eight BRC motifs, whereas exon 27 encodes a single, distinct interaction domain. Deletion of all RAD51-interacting domains causes embryonic lethality in mice. A less severe phenotype is seen with BRAC2 truncations that preserve some, but not all, of the BRC motifs. These mice can survive beyond weaning, but are runted and infertile, and die very young from cancer. Cells from such mice show hypersensitivity to some genotoxic agents and chromosomal instability. Here, we have analyzed mice and cells with a deletion of only the RAD51-interacting region encoded by exon 27. Mice homozygous for this mutation (called brca2(lex1)) have a shorter life span than that of control littermates, possibly because of early onsets of cancer and sepsis. No other phenotype was observed in these animals; therefore, the brca2(lex1) mutation is less severe than truncations that delete some BRC motifs. However, at the cellular level, the brca2(lex1) mutation causes reduced viability, hypersensitivity to the DNA interstrand crosslinking agent mitomycin C, and gross chromosomal instability, much like more severe truncations. Thus, the extreme carboxy-terminal region encoded by exon 27 is important for BRCA2 function, probably because it is required for a fully functional interaction between BRCA2 and RAD51. Copyright 2003 Wiley-Liss, Inc.
Kaer, Kristel; Branovets, Jelena; Hallikma, Anni; Nigumann, Pilvi; Speek, Mart
2011-01-01
Background Transcriptional interference has been recently recognized as an unexpectedly complex and mostly negative regulation of genes. Despite a relatively few studies that emerged in recent years, it has been demonstrated that a readthrough transcription derived from one gene can influence the transcription of another overlapping or nested gene. However, the molecular effects resulting from this interaction are largely unknown. Methodology/Principal Findings Using in silico chromosome walking, we searched for prematurely terminated transcripts bearing signatures of intron retention or exonization of intronic sequence at their 3′ ends upstream to human L1 retrotransposons, protein-coding and noncoding nested genes. We demonstrate that transcriptional interference induced by intronic L1s (or other repeated DNAs) and nested genes could be characterized by intron retention, forced exonization and cryptic polyadenylation. These molecular effects were revealed from the analysis of endogenous transcripts derived from different cell lines and tissues and confirmed by the expression of three minigenes in cell culture. While intron retention and exonization were comparably observed in introns upstream to L1s, forced exonization was preferentially detected in nested genes. Transcriptional interference induced by L1 or nested genes was dependent on the presence or absence of cryptic splice sites, affected the inclusion or exclusion of the upstream exon and the use of cryptic polyadenylation signals. Conclusions/Significance Our results suggest that transcriptional interference induced by intronic L1s and nested genes could influence the transcription of the large number of genes in normal as well as in tumor tissues. Therefore, this type of interference could have a major impact on the regulation of the host gene expression. PMID:22022525
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.
Pauciullo, Alfredo; Erhardt, Georg
2015-01-01
In the present paper, we report for the first time the characterization of llama (Lama glama) caseins at transcriptomic and genetic level. A total of 288 casein clones transcripts were analysed from two lactating llamas. The most represented mRNA populations were those correctly assembled (85.07%) and they encoded for mature proteins of 215, 217, 187 and 162 amino acids respectively for the CSN1S1, CSN2, CSN1S2 and CSN3 genes. The exonic subdivision evidenced a structure made of 21, 9, 17 and 6 exons for the αs1-, β-, αs2- and κ-casein genes respectively. Exon skipping and duplication events were evidenced. Two variants A and B were identified in the αs1-casein gene as result of the alternative out-splicing of the exon 18. An additional exon coding for a novel esapeptide was found to be cryptic in the κ-casein gene, whereas one extra exon was found in the αs2-casein gene by the comparison with the Camelus dromedaries sequence. A total of 28 putative phosphorylated motifs highlighted a complex heterogeneity and a potential variable degree of post-translational modifications. Ninety-six polymorphic sites were found through the comparison of the lama casein cDNAs with the homologous camel sequences, whereas the first description and characterization of the 5’- and 3’-regulatory regions allowed to identify the main putative consensus sequences involved in the casein genes expression, thus opening the way to new investigations -so far- never achieved in this species. PMID:25923814
Pauciullo, Alfredo; Erhardt, Georg
2015-01-01
In the present paper, we report for the first time the characterization of llama (Lama glama) caseins at transcriptomic and genetic level. A total of 288 casein clones transcripts were analysed from two lactating llamas. The most represented mRNA populations were those correctly assembled (85.07%) and they encoded for mature proteins of 215, 217, 187 and 162 amino acids respectively for the CSN1S1, CSN2, CSN1S2 and CSN3 genes. The exonic subdivision evidenced a structure made of 21, 9, 17 and 6 exons for the αs1-, β-, αs2- and κ-casein genes respectively. Exon skipping and duplication events were evidenced. Two variants A and B were identified in the αs1-casein gene as result of the alternative out-splicing of the exon 18. An additional exon coding for a novel esapeptide was found to be cryptic in the κ-casein gene, whereas one extra exon was found in the αs2-casein gene by the comparison with the Camelus dromedaries sequence. A total of 28 putative phosphorylated motifs highlighted a complex heterogeneity and a potential variable degree of post-translational modifications. Ninety-six polymorphic sites were found through the comparison of the lama casein cDNAs with the homologous camel sequences, whereas the first description and characterization of the 5'- and 3'-regulatory regions allowed to identify the main putative consensus sequences involved in the casein genes expression, thus opening the way to new investigations -so far- never achieved in this species.
Probing the Boundaries of Orthology: The Unanticipated Rapid Evolution of Drosophila centrosomin
Eisman, Robert C.; Kaufman, Thomas C.
2013-01-01
The rapid evolution of essential developmental genes and their protein products is both intriguing and problematic. The rapid evolution of gene products with simple protein folds and a lack of well-characterized functional domains typically result in a low discovery rate of orthologous genes. Additionally, in the absence of orthologs it is difficult to study the processes and mechanisms underlying rapid evolution. In this study, we have investigated the rapid evolution of centrosomin (cnn), an essential gene encoding centrosomal protein isoforms required during syncytial development in Drosophila melanogaster. Until recently the rapid divergence of cnn made identification of orthologs difficult and questionable because Cnn violates many of the assumptions underlying models for protein evolution. To overcome these limitations, we have identified a group of insect orthologs and present conserved features likely to be required for the functions attributed to cnn in D. melanogaster. We also show that the rapid divergence of Cnn isoforms is apparently due to frequent coding sequence indels and an accelerated rate of intronic additions and eliminations. These changes appear to be buffered by multi-exon and multi-reading frame maximum potential ORFs, simple protein folds, and the splicing machinery. These buffering features also occur in other genes in Drosophila and may help prevent potentially deleterious mutations due to indels in genes with large coding exons and exon-dense regions separated by small introns. This work promises to be useful for future investigations of cnn and potentially other rapidly evolving genes and proteins. PMID:23749319
PTEN/MMAC1 Mutations in Hepatocellular Carcinomas: Somatic Inactivation of Both Alleles in Tumors
Kawamura, Naoki; Nagai, Hisaki; Bando, Koichi; Koyama, Masaaki; Matsumoto, Satoshi; Tajiri, Takashi; Onda, Masahiko; Fujimoto, Jiro; Ueki, Takahiro; Konishi, Noboru; Shiba, Tadayoshi
1999-01-01
Allelic loss of loci on chromosome 10q occurs frequently in hepatocellular carcinomas. Somatic mutations of the PTEN/MMAC1 gene on this chromosome at 10q23 were recently identified in sporadic cancers of the uterus, brain, prostate and breast. To investigate the potential role of PTEN/MMAC1 gene in the genesis of hepatocellular carcinomas, we examined 96 tumors for allelic loss on 10q and also for subtle mutations anywhere within the coding region of PTEN/MMAC1 gene. Allelic loss was identified in 25 of the 89 (27%) tumors that were informative for polymorphic markers in the region. Somatic mutations were identified in five of those tumors: three frameshift mutations, a 1‐bp insertion at codon 83–84 in exon 4 and two 4‐bp deletions, both at codon 318–319 in exon 8; two C‐to‐G transversion mutation, both at ‐9 bp from the initiation codon in the 5’non‐coding region of exon 1. No missense mutation was observed in this panel of tumors. In most of the informative tumors carrying intragenic mutations of one allele, we were able to detect loss of heterozygosity as well. These findings suggest that two alleles of the PTEN/MMAC1 gene may be inactivated by a combination of intragenic point mutation on one allele and loss of chromosomal material on the other allele in some of these tumors. PMID:10363579
Lücke, S; Xu, G L; Palfi, Z; Cross, M; Bellofatto, V; Bindereif, A
1996-01-01
In trypanosomes mRNAs are generated through trans splicing. The spliced leader (SL) RNA, which donates the 5'-terminal mini-exon to each of the protein coding exons, plays a central role in the trans splicing process. We have established in vivo assays to study in detail trans splicing, cap4 modification, and RNP assembly of the SL RNA in the trypanosomatid species Leptomonas seymouri. First, we found that extensive sequences within the mini-exon are required for SL RNA function in vivo, although a conserved length of 39 nt is not essential. In contrast, the intron sequence appears to be surprisingly tolerant to mutation; only the stem-loop II structure is indispensable. The asymmetry of the sequence requirements in the stem I region suggests that this domain may exist in different functional conformations. Second, distinct mini-exon sequences outside the modification site are important for efficient cap4 formation. Third, all SL RNA mutations tested allowed core RNP assembly, suggesting flexible requirements for core protein binding. In sum, the results of our mutational analysis provide evidence for a discrete domain structure of the SL RNA and help to explain the strong phylogenetic conservation of the mini-exon sequence and of the overall SL RNA secondary structure; they also suggest that there may be certain differences between trans splicing in nematodes and trypanosomes. This approach provides a basis for studying RNA-RNA interactions in the trans spliceosome. Images PMID:8861965
Yan, Xukun; Zhang, Tianyu; Wang, Zhengmin; Jiang, Yi; Chen, Yan; Wang, Hongyan; Ma, Duan; Wang, Lei; Li, Huawei
2011-12-20
Waardenburg syndrome type II (WS2) is associated with syndromic deafness. A subset of WS2, WS2A, accounting for approximately 15% of patients, is attributed to mutations in the microphthalmia-associated transcription factor (MITF) gene. We examined the genetic basis of WS2 in a large Chinese family. All 9 exons of the MITF gene, the single coding exon (exon 2) of the most common hereditary deafness gene GJB2 and the mitochondrial DNA (mtDNA) 12S rRNA were sequenced. A novel heterozygous mutation c.[742_743delAAinsT;746_747delCA] in exon 8 of the MITF gene co-segregates with WS2 in the family. The MITF mutation results in a premature termination codon and a truncated MITF protein with only 247 of the 419 wild type amino acids. The deaf proband had this MITF gene heterozygous mutation as well as a c.[109G>A]+[235delC] compound heterozygous pathogenic mutation in the GJB2 gene. No pathogenic mutation was found in mtDNA 12S rRNA in this family. Thus, a novel compound heterozygous mutation, c.[742_743delAAinsT;746_747delCA] in MITF exon 8 was the key genetic reason for WS2 in this family, and a digenic effect of MITF and GJB2 genes may contribute to deafness of the proband. Copyright © 2011. Published by Elsevier Ltd.
Is a Genome a Codeword of an Error-Correcting Code?
Kleinschmidt, João H.; Silva-Filho, Márcio C.; Bim, Edson; Herai, Roberto H.; Yamagishi, Michel E. B.; Palazzo, Reginaldo
2012-01-01
Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction. PMID:22649495
Hu, Dong Gui; McKinnon, Ross A; Hulin, Julie-Ann; Mackenzie, Peter I; Meech, Robyn
2016-12-27
Nearly 20 different transcripts of the human androgen receptor (AR) are reported with two currently listed as Refseq isoforms in the NCBI database. Isoform 1 encodes wild-type AR (type 1 AR) and isoform 2 encodes the variant AR45 (type 2 AR). Both variants contain eight exons: they share common exons 2-8 but differ in exon 1 with the canonical exon 1 in isoform 1 and the variant exon 1b in isoform 2. Splicing of exon 1 or exon 1b is reported to be mutually exclusive. In this study, we identified a novel exon 1b (1b/TAG) that contains an additional TAG trinucleotide upstream of exon 1b. Moreover, we identified AR transcripts in both normal and cancerous breast and prostate cells that contained either exon 1b or 1b/TAG spliced between the canonical exon 1 and exon 2, generating nine-exon AR transcripts that we have named isoforms 3a and 3b. The proteins encoded by these new AR variants could regulate androgen-responsive reporters in breast and prostate cancer cells under androgen-depleted conditions. Analysis of type 3 AR-GFP fusion proteins showed partial nuclear localization in PC3 cells under androgen-depleted conditions, supporting androgen-independent activation of the AR. Type 3 AR proteins inhibited androgen-induced growth of LNCaP cells. Microarray analysis identified a small set of type 3a AR target genes in LNCaP cells, including genes known to modulate growth and proliferation of prostate cancer ( PCGEM1 , PEG3 , EPHA3 , and EFNB2 ) or other types of human cancers ( TOX3 , ST8SIA4 , and SLITRK3 ), and genes that are diagnostic/prognostic biomarkers of prostate cancer ( GRINA3 , and BCHE ).
Zhang, Xiaoli; Colleoni, Christophe; Ratushna, Vlada; Sirghie-Colleoni, Mirella; James, Martha G; Myers, Alan M
2004-04-01
Mutations in the maize gene sugary2 ( su2 ) affect starch structure and its resultant physiochemical properties in useful ways, although the gene has not been characterized previously at the molecular level. This study tested the hypothesis that su2 codes for starch synthase IIa (SSIIa). Two independent mutations of the su2 locus, su2-2279 and su2-5178 , were identified in a Mutator -active maize population. The nucleotide sequence of the genomic locus that codes for SSIIa was compared between wild type plants and those homozygous for either novel mutation. Plants bearing su2-2279 invariably contained a Mutator transposon in exon 3 of the SSIIa gene, and su2-5178 mutants always contained a small retrotransposon-like insertion in exon 10. Six allelic su2 (-) mutations conditioned loss or reduction in abundance of the SSIIa protein detected by immunoblot. These data indicate that su2 codes for SSIIa and that deficiency in this isoform is ultimately responsible for the altered physiochemical properties of su2 (-) mutant starches. A specific starch synthase isoform among several identified in soluble endosperm extracts was absent in su2-2279 or su2-5178 mutants, indicating that SSIIa is active in the soluble phase during kernel development. The immediate structural effect of the su2 (-) mutations was shown to be increased abundance of short glucan chains in amylopectin and a proportional decrease in intermediate length chains, similar to the effects of SSII deficiency in other species.
Role of interleukin-15 receptor alpha polymorphisms in normal weight obese syndrome.
Di Renzo, L; Gloria-Bottini, F; Saccucci, P; Bigioni, M; Abenavoli, L; Gasbarrini, G; De Lorenzo, A
2009-01-01
Previous published studies have identified a class of women, Normal Weight Obese women (NWO) with normal BMI and high fat content. An important role of Interleukin-15 (IL-15) has been documented in facilitating muscle proliferation and promoting fat depletion. Indeed the presence of three types of IL-15 receptor subunits in fat tissue suggests a direct effect on adipose tissue. We studied three single nucleotide polymorphisms (SNP) of IL-15R-alpha receptor gene and investigated their relationship with NWO phenotype. We considered two classes of women according to their BMI and percent fat mass (percent FAT), class 1: including 72 overweight-obese women (high BMI-high fat mass) and class 2: including 36 NWO (normal BMI, high fat mass). Three sites of Interleukin-15 receptor subunit á gene were examined, located respectively in exon4, exon5 intron-exon border and exon7. Genotyping of the identified polymorphisms was performed by restriction fragment length polymorphism. Haplotype frequency estimation was performed by using the Mendel-University of Chicago program. Odds ratio analyses were calculated by EPISTAT program. Highly significant differences were observed for exon 7- exon5 intron-exon border and exon 4-exon 7 haplotype distribution between class 1 and class 2 women. These results strongly support the hypothesis that genetic variability of the IL-15 receptor has an important role in body fat composition. Our data underscore previous findings that suggest a potential role of IL-15 cytokine in NWO syndrome.
A role for exon sequences in alternative splicing of the human fibronectin gene.
Mardon, H J; Sebastio, G; Baralle, F E
1987-01-01
Exon EDIIIA of the fibronectin (Fn) gene is alternatively spliced via pathways which either skip or include the whole exon in the messenger RNA (mRNA). We have investigated the role of EDIIIA exon sequences in the human Fn gene in determining alternative splicing of this exon during transient expression of alpha globin/Fn minigene hybrids in HeLa cells. We demonstrate that a DNA sequence of 81bp within the central region of exon EDIIIA is required for alternative splicing during processing of the primary transcript to generate both EDIIIA+ and EDIIIA- mRNA's. Furthermore, alternative splicing of EDIIIA only occurs when this sequence is present in the correct orientation since when it is in antisense orientation splicing always occurs via exon-skipping generating EDIIIA- mRNA. Images PMID:3671064
Short intronic repeat sequences facilitate circular RNA production.
Liang, Dongming; Wilusz, Jeremy E
2014-10-15
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
Agouti sequence polymorphisms in coyotes, wolves and dogs suggest hybridization.
Schmutz, Sheila M; Berryere, Thomas G; Barta, Jodi L; Reddick, Kimberley D; Schmutz, Josef K
2007-01-01
Domestic dogs have been shown to have multiple alleles of the Agouti Signal Peptide (ASIP) in exon 4 and we wished to determine the level of polymorphism in the common wild canids of Canada, wolves and coyotes, in comparison. All Canadian coyotes and most wolves have banded hairs. The ASIP coding sequence of the wolf did not vary from the domestic dog but one variant was detected in exon 4 of coyotes that did not alter the arginine at this position. Two other differences were found in the sequence flanking exon 4 of coyotes compared with the 45 dogs and 1 wolf. The coyotes also demonstrated a relatively common polymorphism in the 3' UTR sequence that could be used for population studies. One of the ASIP alleles (R96C) in domestic dogs causes a solid black coat color in homozygotes. Although some wolves are melanistic, this phenotype does not appear to be caused by this same mutation. However, one wolf, potentially a dog-wolf hybrid or descendant thereof, was heterozygous for this allele. Likewise 2 coyotes, potentially dog-coyote or wolf-coyote hybrid descendants, were heterozygous for the several polymorphisms in and flanking exon 4. We could conclude that these were coyote-dog hybrids because both were heterozygous for 2 mutations causing fawn coat color in dogs.
A rare male patient with classic Rett syndrome caused by MeCP2_e1 mutation.
Tokaji, Narumi; Ito, Hiromichi; Kohmoto, Tomohiro; Naruto, Takuya; Takahashi, Rizu; Goji, Aya; Mori, Tatsuo; Toda, Yoshihiro; Saito, Masako; Tange, Shoichiro; Masuda, Kiyoshi; Kagami, Shoji; Imoto, Issei
2018-03-01
Rett syndrome (RTT) is a severe neurodevelopmental disorder typically affecting females. It is mainly caused by loss-of-function mutations that affect the coding sequence of exon 3 or 4 of methyl-CpG-binding protein 2 (MECP2). Severe neonatal encephalopathy resulting in death before the age of 2 years is the most common phenotype observed in males affected by a pathogenic MECP2 variant. Mutations in MECP2 exon 1 affecting the MeCP2_e1 isoform are relatively rare causes of RTT in females, and only one case of a male patient with MECP2-related severe neonatal encephalopathy caused by a mutation in MECP2 exon 1 has been reported. This is the first reported case of a male with classic RTT caused by a 5-bp duplication in the open-reading frame of MECP2 exon 1 (NM_001110792.1:c.23_27dup) that introduced a premature stop codon [p.(Ser10Argfs*36)] in the MeCP2_e1 isoform, which has been reported in one female patient with classic RTT. Therefore, both males and females displaying at least some type of MeCP2_e1 mutation may exhibit the classic RTT phenotype. © 2018 Wiley Periodicals, Inc.
Liu, Hong Yan; Huang, Jia; Wang, Rui Li; Wang, Yue; Guo, Liang Jie; Li, Tao; Wu, Dong; Wang, Hong Dan; Guo, Qian Nan; Dong, Dao Quan
2016-11-01
Familial exudative vitreoretinopathy (FEVR) is a hereditary ocular disorder characterized by a failure of peripheral retinal vascularization. In this report, we describe a novel missense mutation of the Norrie disease gene (NDP) in a Chinese family with X-linked FEVR. Ophthalmologic evaluation was performed on four male patients and seven unaffected individuals after informed consent was obtained. Venous blood was collected from the 11 members of this family, and genomic DNA was extracted using standard methods. The coding exons 2 and 3 and their corresponding exon-intron junctions of NDP were amplified by polymerase chain reaction and then subjected to direct DNA sequencing. A novel missense mutation (c.310A>C) in exon 3, leading to a lysine-to-glutamine substitution at position 104 (p.Lys104Gln), was identified in all four patients with X-linked FEVR. Three unaffected female individuals (III2, IV3, and IV11) were found to be carriers of the mutation. This mutation was not detected in other unaffected individuals. The mutation c.310A>C (p.Lys104Gln) in exon 3 of NDP is associated with FEVR in the studied family. This result further enriches the mutation spectrum of FEVR. Copyright © 2016. Published by Elsevier Taiwan LLC.
PTESFinder: a computational method to identify post-transcriptional exon shuffling (PTES) events.
Izuogu, Osagie G; Alhasan, Abd A; Alafghani, Hani M; Santibanez-Koref, Mauro; Elliott, David J; Elliot, David J; Jackson, Michael S
2016-01-13
Transcripts, which have been subject to Post-transcriptional exon shuffling (PTES), have an exon order inconsistent with the underlying genomic sequence. These have been identified in a wide variety of tissues and cell types from many eukaryotes, and are now known to be mostly circular, cytoplasmic, and non-coding. Although there is no uniformly ascribed function, several have been shown to be involved in gene regulation. Accurate identification of these transcripts can, however, be difficult due to artefacts from a wide variety of sources. Here, we present a computational method, PTESFinder, to identify these transcripts from high throughput RNAseq data. Uniquely, it systematically excludes potential artefacts emanating from pseudogenes, segmental duplications, and template switching, and outputs both PTES and canonical exon junction counts to facilitate comparative analyses. In comparison with four existing methods, PTESFinder achieves highest specificity and comparable sensitivity at a variety of read depths. PTESFinder also identifies between 13 % and 41.6 % more structures, compared to publicly available methods recently used to identify human circular RNAs. With high sensitivity and specificity, user-adjustable filters that target known sources of false positives, and tailored output to facilitate comparison of transcript levels, PTESFinder will facilitate the discovery and analysis of these poorly understood transcripts.
Geiss, K T; Abbas, G M; Makaroff, C A
1994-04-01
The mitochondrial gene coding for subunit 4 of the NADH dehydrogenase complex I (nad4) has been isolated and characterized from lettuce, Lactuca sativa. Analysis of nad4 genes in a number of plants by Southern hybridization had previously suggested that the intron content varied between species. Characterization of the lettuce gene confirms this observation. Lettuce nad4 contains two exons and one group IIA intron, whereas previously sequenced nad4 genes from turnip and wheat contain three group IIA introns. Northern analysis identified a transcript of 1600 nucleotides, which represents the mature nad4 mRNA and a primary transcript of 3200 nucleotides. Sequence analysis of lettuce and turnip nad4 cDNAs was used to confirm the intron/exon border sequences and to examine RNA editing patterns. Editing is observed at the 5' and 3' ends of the lettuce transcript, but is absent from sequences that correspond to exons two, three and the 5' end of exon four in turnip and wheat. In contrast, turnip transcripts are highly edited in this region, suggesting that homologous recombination of an edited and spliced cDNA intermediate was involved in the loss of introns two and three from an ancestral lettuce nad4 gene.
Lunova, Mariia; Guldiken, Nurdan; Lienau, Tim C.; Stickel, Felix; Omary, M. Bishr
2012-01-01
Background Keratins 8 and 18 (K8/K18) are intermediate filament proteins that protect the liver from various forms of injury. Exonic K8/K18 variants associate with adverse outcome in acute liver failure and with liver fibrosis progression in patients with chronic hepatitis C infection or primary biliary cirrhosis. Given the association of K8/K18 variants with end-stage liver disease and progression in several chronic liver disorders, we studied the importance of keratin variants in patients with hemochromatosis. Methods The entire K8/K18 exonic regions were analyzed in 162 hemochromatosis patients carrying homozygous C282Y HFE (hemochromatosis gene) mutations. 234 liver-healthy subjects were used as controls. Exonic regions were PCR-amplified and analyzed using denaturing high-performance liquid chromatography and DNA sequencing. Previously-generated transgenic mice overexpressing K8 G62C were studied for their susceptibility to iron overload. Susceptibility to iron toxicity of primary hepatocytes that express K8 wild-type and G62C was also assessed. Results We identified amino-acid-altering keratin heterozygous variants in 10 of 162 hemochromatosis patients (6.2%) and non-coding heterozygous variants in 6 additional patients (3.7%). Two novel K8 variants (Q169E/R275W) were found. K8 R341H was the most common amino-acid altering variant (4 patients), and exclusively associated with an intronic KRT8 IVS7+10delC deletion. Intronic, but not amino-acid-altering variants associated with the development of liver fibrosis. In mice, or ex vivo, the K8 G62C variant did not affect iron-accumulation in response to iron-rich diet or the extent of iron-induced hepatocellular injury. Conclusion In patients with hemochromatosis, intronic but not exonic K8/K18 variants associate with liver fibrosis development. PMID:22412904
Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R
1997-04-28
We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.
Parvari, R; Shen, J; Hershkovitz, E; Chen, Y T; Moses, S W
1998-04-01
Glycogen storage disease type III (GSD III) is an autosomal recessive disease caused by the deficiency of glycogen debranching enzyme (AGL). We report the finding of two new mutations in a GSD IIIa Ashkenazi Jewish patient. Both mutations are insertion of an adenine into a stretch of 8 adenines towards the 3' end of the coding region, one at position 3904 (3904insA) in exon 30, the second at position 4214 (4214insA) in exon 32. The mutations cause frameshifts and premature terminations of the glycogen debranching enzyme, the first causing a frameshift at amino acid 1304, the second causing a frameshift at amino acid 1408 of the total of 1532. These mutations demonstrate the importance of the 125 amino acids at the carboxy-terminus of the debrancher enzyme for its activity and support the suggestion that the putative glycogen binding domain is located in the carboxy-terminus of the AGL. The mutations cause distinctive single-strand conformation polymorphism (SSCP) patterns enabling easy detection.
Analysis of alterative cleavage and polyadenylation by 3′ region extraction and deep sequencing
Hoque, Mainul; Ji, Zhe; Zheng, Dinghai; Luo, Wenting; Li, Wencheng; You, Bei; Park, Ji Yeon; Yehia, Ghassan; Tian, Bin
2012-01-01
Alternative cleavage and polyadenylation (APA) leads to mRNA isoforms with different coding sequences (CDS) and/or 3′ untranslated regions (3′UTRs). Using 3′ Region Extraction And Deep Sequencing (3′READS), a method which addresses the internal priming and oligo(A) tail issues that commonly plague polyA site (pA) identification, we comprehensively mapped pAs in the mouse genome, thoroughly annotating 3′ ends of genes and revealing over five thousand pAs (~8% of total) flanked by A-rich sequences, which have hitherto been overlooked. About 79% of mRNA genes and 66% of long non-coding RNA (lncRNA) genes have APA; but these two gene types have distinct usage patterns for pAs in introns and upstream exons. Promoter-distal pAs become relatively more abundant during embryonic development and cell differentiation, a trend affecting pAs in both 3′-most exons and upstream regions. Upregulated isoforms generally have stronger pAs, suggesting global modulation of the 3′ end processing activity in development and differentiation. PMID:23241633
Characterization of an Equine α-S2-Casein Variant Due to a 1.3 kb Deletion Spanning Two Coding Exons
Brinkmann, Julia; Koudelka, Tomas; Keppler, Julia K.; Tholey, Andreas; Schwarz, Karin; Thaller, Georg; Tetens, Jens
2015-01-01
The production and consumption of mare’s milk in Europe has gained importance, mainly based on positive health effects and a lower allergenic potential as compared to cows’ milk. The allergenicity of milk is to a certain extent affected by different genetic variants. In classical dairy species, much research has been conducted into the genetic variability of milk proteins, but the knowledge in horses is scarce. Here, we characterize two major forms of equine αS2-casein arising from genomic 1.3 kb in-frame deletion involving two coding exons, one of which represents an equid specific duplication. Findings at the DNA-level have been verified by cDNA sequencing from horse milk of mares with different genotypes. At the protein-level, we were able to show by SDS-page and in-gel digestion with subsequent LC-MS analysis that both proteins are actually expressed. The comparison with published sequences of other equids revealed that the deletion has probably occurred before the ancestor of present-day asses and zebras diverged from the horse lineage. PMID:26444874
Daher, Tamas; Tur, Mehmet Kemal; Brobeil, Alexander; Etschmann, Benjamin; Witte, Biruta; Engenhart-Cabillic, Rita; Krombach, Gabriele; Blau, Wolfgang; Grimminger, Friedrich; Seeger, Werner; Klussmann, Jens Peter; Bräuninger, Andreas; Gattenlöhner, Stefan
2018-06-01
In head and neck squamous cell carcinoma (HNSCC), the occurrence of concurrent lung malignancies poses a significant diagnostic challenge because metastatic HNSCC is difficult to discern from second primary lung squamous cell carcinoma (SCC). However, this differentiation is crucial because the recommended treatments for metastatic HNSCC and second primary lung SCC differ profoundly. We analyzed the origin of lung tumors in 32 patients with HNSCC using human papillomavirus (HPV) typing and targeted next generation sequencing of all coding exons of tumor protein 53 (TP53). Lung tumors were clearly identified as HNSCC metastases or second primary tumors in 29 patients, thus revealing that 16 patients had received incorrect diagnoses based on clinical and morphological data alone. The HPV typing and mutation analysis of all TP53 coding exons is a valuable diagnostic tool in patients with HNSCC and concurrent lung SCC, which can help to ensure that patients receive the most suitable treatment. © 2018 Wiley Periodicals, Inc.
Exon dosage analysis of parkin gene in Chinese sporadic Parkinson's disease.
Guo, Ji-Feng; Dong, Xiao-Li; Xu, Qian; Li, Nan; Yan, Xin-Xiang; Xia, Kun; Tang, Bei-Sha
2015-09-14
Parkin gene mutations are by far the most common mutations in both familial Parkinson's disease (PD) and sporadic PD. Approximately, 50% of parkin mutations is exon dosage mutations (i.e., deletions and duplications of entire exons). Here, we first established a MLPA assay for quick detection of parkin exon rearrangements. Then, we studied parkin exon dosage mutations in 755 Chinese sporadic PDdisease patients using the established MLPA assay. We found that there were 25 (3.3%) patients with exon dosage alterations including deletions and duplications, 20 (11.4%) patients with exon rearrangements in 178 early-onset patients, and 5 (0.86%) patients with exon rearrangement mutations in 579 later-onset patients. The percentage of individuals with parkin dosage mutations is more than 33% when the age at onset is less than 30 years old, but less than 7% when the age at onset is more than 30. In these mutations, deletion is the main mutational style, especially in exon 2-5. Our results indicated that exon dosage mutations in parkin gene might be the main cause for sporadic PD, especially in EOP. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Reddy, Sushma; Kimball, Rebecca T; Pandey, Akanksha; Hosner, Peter A; Braun, Michael J; Hackett, Shannon J; Han, Kin-Lan; Harshman, John; Huddleston, Christopher J; Kingston, Sarah; Marks, Ben D; Miglia, Kathleen J; Moore, William S; Sheldon, Frederick H; Witt, Christopher C; Yuri, Tamaki; Braun, Edward L
2017-09-01
Phylogenomics, the use of large-scale data matrices in phylogenetic analyses, has been viewed as the ultimate solution to the problem of resolving difficult nodes in the tree of life. However, it has become clear that analyses of these large genomic data sets can also result in conflicting estimates of phylogeny. Here, we use the early divergences in Neoaves, the largest clade of extant birds, as a "model system" to understand the basis for incongruence among phylogenomic trees. We were motivated by the observation that trees from two recent avian phylogenomic studies exhibit conflicts. Those studies used different strategies: 1) collecting many characters [$\\sim$ 42 mega base pairs (Mbp) of sequence data] from 48 birds, sometimes including only one taxon for each major clade; and 2) collecting fewer characters ($\\sim$ 0.4 Mbp) from 198 birds, selected to subdivide long branches. However, the studies also used different data types: the taxon-poor data matrix comprised 68% non-coding sequences whereas coding exons dominated the taxon-rich data matrix. This difference raises the question of whether the primary reason for incongruence is the number of sites, the number of taxa, or the data type. To test among these alternative hypotheses we assembled a novel, large-scale data matrix comprising 90% non-coding sequences from 235 bird species. Although increased taxon sampling appeared to have a positive impact on phylogenetic analyses the most important variable was data type. Indeed, by analyzing different subsets of the taxa in our data matrix we found that increased taxon sampling actually resulted in increased congruence with the tree from the previous taxon-poor study (which had a majority of non-coding data) instead of the taxon-rich study (which largely used coding data). We suggest that the observed differences in the estimates of topology for these studies reflect data-type effects due to violations of the models used in phylogenetic analyses, some of which may be difficult to detect. If incongruence among trees estimated using phylogenomic methods largely reflects problems with model fit developing more "biologically-realistic" models is likely to be critical for efforts to reconstruct the tree of life. [Birds; coding exons; GTR model; model fit; Neoaves; non-coding DNA; phylogenomics; taxon sampling.]. © The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Graveley, Brenton R.
2008-01-01
Summary Drosophila Dscam encodes 38,016 distinct axon guidance receptors through the mutually exclusive alternative splicing of 95 variable exons. Importantly, known mechanisms that ensure the mutually exclusive splicing of pairs of exons cannot explain this phenomenon in Dscam. I have identified two classes of conserved elements in the Dscam exon 6 cluster, which contains 48 alternative exons—the docking site, located in the intron downstream of constitutive exon 5, and the selector sequences, which are located upstream of each exon 6 variant. Strikingly, each selector sequence is complementary to a portion of the docking site, and this pairing juxtaposes one, and only one, alternative exon to the upstream constitutive exon. The mutually exclusive nature of the docking site:selector sequence interactions suggests that the formation of these competing RNA structures is a central component of the mechanism guaranteeing that only one exon 6 variant is included in each Dscam mRNA. PMID:16213213
Germ-line and somatic EPHA2 coding variants in lens aging and cataract.
Bennett, Thomas M; M'Hamdi, Oussama; Hejtmancik, J Fielding; Shiels, Alan
2017-01-01
Rare germ-line mutations in the coding regions of the human EPHA2 gene (EPHA2) have been associated with inherited forms of pediatric cataract, whereas, frequent, non-coding, single nucleotide variants (SNVs) have been associated with age-related cataract. Here we sought to determine if germ-line EPHA2 coding SNVs were associated with age-related cataract in a case-control DNA panel (> 50 years) and if somatic EPHA2 coding SNVs were associated with lens aging and/or cataract in a post-mortem lens DNA panel (> 48 years). Micro-fluidic PCR amplification followed by targeted amplicon (exon) next-generation (deep) sequencing of EPHA2 (17-exons) afforded high read-depth coverage (1000x) for > 82% of reads in the cataract case-control panel (161 cases, 64 controls) and > 70% of reads in the post-mortem lens panel (35 clear lens pairs, 22 cataract lens pairs). Novel and reference (known) missense SNVs in EPHA2 that were predicted in silico to be functionally damaging were found in both cases and controls from the age-related cataract panel at variant allele frequencies (VAFs) consistent with germ-line transmission (VAF > 20%). Similarly, both novel and reference missense SNVs in EPHA2 were found in the post-mortem lens panel at VAFs consistent with a somatic origin (VAF > 3%). The majority of SNVs found in the cataract case-control panel and post-mortem lens panel were transitions and many occurred at di-pyrimidine sites that are susceptible to ultraviolet (UV) radiation induced mutation. These data suggest that novel germ-line (blood) and somatic (lens) coding SNVs in EPHA2 that are predicted to be functionally deleterious occur in adults over 50 years of age. However, both types of EPHA2 coding variants were present at comparable levels in individuals with or without age-related cataract making simple genotype-phenotype correlations inconclusive.
Germ-line and somatic EPHA2 coding variants in lens aging and cataract
Bennett, Thomas M.; M’Hamdi, Oussama; Hejtmancik, J. Fielding
2017-01-01
Rare germ-line mutations in the coding regions of the human EPHA2 gene (EPHA2) have been associated with inherited forms of pediatric cataract, whereas, frequent, non-coding, single nucleotide variants (SNVs) have been associated with age-related cataract. Here we sought to determine if germ-line EPHA2 coding SNVs were associated with age-related cataract in a case-control DNA panel (> 50 years) and if somatic EPHA2 coding SNVs were associated with lens aging and/or cataract in a post-mortem lens DNA panel (> 48 years). Micro-fluidic PCR amplification followed by targeted amplicon (exon) next-generation (deep) sequencing of EPHA2 (17-exons) afforded high read-depth coverage (1000x) for > 82% of reads in the cataract case-control panel (161 cases, 64 controls) and > 70% of reads in the post-mortem lens panel (35 clear lens pairs, 22 cataract lens pairs). Novel and reference (known) missense SNVs in EPHA2 that were predicted in silico to be functionally damaging were found in both cases and controls from the age-related cataract panel at variant allele frequencies (VAFs) consistent with germ-line transmission (VAF > 20%). Similarly, both novel and reference missense SNVs in EPHA2 were found in the post-mortem lens panel at VAFs consistent with a somatic origin (VAF > 3%). The majority of SNVs found in the cataract case-control panel and post-mortem lens panel were transitions and many occurred at di-pyrimidine sites that are susceptible to ultraviolet (UV) radiation induced mutation. These data suggest that novel germ-line (blood) and somatic (lens) coding SNVs in EPHA2 that are predicted to be functionally deleterious occur in adults over 50 years of age. However, both types of EPHA2 coding variants were present at comparable levels in individuals with or without age-related cataract making simple genotype-phenotype correlations inconclusive. PMID:29267365
Ullah, Inayat; Kabir, Firoz; Iqbal, Muhammad; Gottsch, Clare Brooks S.; Naeem, Muhammad Asif; Assir, Muhammad Zaman; Khan, Shaheen N.; Akram, Javed; Riazuddin, Sheikh; Ayyagari, Radha; Hejtmancik, J. Fielding
2016-01-01
Purpose To identify pathogenic mutations responsible for autosomal recessive retinitis pigmentosa (arRP) in consanguineous familial cases. Methods Seven large familial cases with multiple individuals diagnosed with retinitis pigmentosa were included in the study. Affected individuals in these families underwent ophthalmic examinations to document the symptoms and confirm the initial diagnosis. Blood samples were collected from all participating members, and genomic DNA was extracted. An exclusion analysis with microsatellite markers spanning the TULP1 locus on chromosome 6p was performed, and two-point logarithm of odds (LOD) scores were calculated. All coding exons along with the exon–intron boundaries of TULP1 were sequenced bidirectionally. We constructed a single nucleotide polymorphism (SNP) haplotype for the four familial cases harboring the K489R allele and estimated the likelihood of a founder effect. Results The ophthalmic examinations of the affected individuals in these familial cases were suggestive of RP. Exclusion analyses confirmed linkage to chromosome 6p harboring TULP1 with positive two-point LOD scores. Subsequent Sanger sequencing identified the single base pair substitution in exon14, c.1466A>G (p.K489R), in four families. Additionally, we identified a two-base deletion in exon 4, c.286_287delGA (p.E96Gfs77*); a homozygous splice site variant in intron 14, c.1495+4A>C; and a novel missense variation in exon 15, c.1561C>T (p.P521S). All mutations segregated with the disease phenotype in the respective families and were absent in ethnically matched control chromosomes. Haplotype analysis suggested (p<10−6) that affected individuals inherited the causal mutation from a common ancestor. Conclusions Pathogenic mutations in TULP1 are responsible for the RP phenotype in seven familial cases with a common ancestral mutation responsible for the disease phenotype in four of the seven families. PMID:27440997
Palti, Y.; Rodriguez, M.F.; Gahr, S.A.; Purcell, M.K.; Rexroad, C. E.; Wiens, G.D.
2010-01-01
Induction of innate immune pathways is critical for early anti-microbial defense but there is limited understanding of how teleosts recognize microbial molecules and activate these pathways. In mammals, Toll-like receptors (TLR) 1 and 2 form a heterodimer involved in recognizing peptidoglycans and lipoproteins of microbial origin. Herein, we identify and describe the rainbow trout (Oncorhynchus mykiss) TLR1 gene ortholog and its mRNA expression. Two TLR1 loci were identified from a rainbow trout bacterial artificial chromosome (BAC) library using DNA sequencing and genetic linkage analyses. Full length cDNA clone and direct sequencing of four BACs revealed an intact omTLR1 open reading frame (ORF) located on chromosome 14 and a second locus on chromosome 25 that contains a TLR1 pseudogene. The duplicated trout loci exhibit conserved synteny with other fish genomes that extends beyond the TLR1 gene sequences. The omTLR1 gene includes a single large coding exon similar to all other described TLR1 genes, but unlike other teleosts it also has a 5??? UTR exon and intron preceding the large coding exon. The omTLR1 ORF is predicted to encode an 808 amino-acid protein with 69% similarity to the Fugu TLR1 and a conserved pattern of predicted leucine-rich repeats (LRR). Phylogenetic analysis grouped omTLR1 with other fish TLR1 genes on a separate branch from the avian TLR1 and mammalian TLR1, 6 and 10. omTLR1 expression levels in rainbow trout anterior kidney leukocytes were not affected by the human TLR2/6 and TLR2/1 agonists diacylated lipoprotein (Pam2CSK4) and triacylated lipoprotein (Pam3CSK4). However, due to the lack of TLR6 and 10 genes in teleost genomes and up-regulation of TLR1 mRNA in response to LPS and bacterial infection in other fish species we hypothesize an important role for omTLR1 in anti-microbial immunity. Therefore, the identification of a TLR2 ortholog in rainbow trout and the development of assays to measure ligand binding and downstream signaling are critical for future elucidation of omTLR1 functions.
Palti, Yniv; Rodriguez, M. Fernanda; Gahr, Scott A.; Purcell, Maureen K.; Rexroad, Caird E.; Wiens, Gregory D.
2010-01-01
Induction of innate immune pathways is critical for early anti-microbial defense but there is limited understanding of how teleosts recognize microbial molecules and activate these pathways. In mammals, Toll-like receptors (TLR) 1 and 2 form a heterodimer involved in recognizing peptidoglycans and lipoproteins of microbial origin. Herein, we identify and describe the rainbow trout (Oncorhynchus mykiss) TLR1 gene ortholog and its mRNA expression. Two TLR1 loci were identified from a rainbow trout bacterial artificial chromosome (BAC) library using DNA sequencing and genetic linkage analyses. Full length cDNA clone and direct sequencing of four BACs revealed an intact omTLR1 open reading frame (ORF) located on chromosome 14 and a second locus on chromosome 25 that contains a TLR1 pseudogene. The duplicated trout loci exhibit conserved synteny with other fish genomes that extends beyond the TLR1 gene sequences. The omTLR1 gene includes a single large coding exon similar to all other described TLR1 genes, but unlike other teleosts it also has a 5' UTR exon and intron preceding the large coding exon. The omTLR1 ORF is predicted to encode an 808 amino-acid protein with 69% similarity to the Fugu TLR1 and a conserved pattern of predicted leucine-rich repeats (LRR). Phylogenetic analysis grouped omTLR1 with other fish TLR1 genes on a separate branch from the avian TLR1 and mammalian TLR1, 6 and 10. omTLR1 expression levels in rainbow trout anterior kidney leukocytes were not affected by the human TLR2/6 and TLR2/1 agonists diacylated lipoprotein (Pam2CSK4) and triacylated lipoprotein (Pam3CSK4). However, due to the lack of TLR6 and 10 genes in teleost genomes and up-regulation of TLR1 mRNA in response to LPS and bacterial infection in other fish species we hypothesize an important role for omTLR1 in anti-microbial immunity. Therefore, the identification of a TLR2 ortholog in rainbow trout and the development of assays to measure ligand binding and downstream signaling are critical for future elucidation of omTLR1 functions.
Swalla, B J; Just, M A; Pederson, E L; Jeffery, W R
1999-04-01
The Manx gene is required for the development of the tail and other chordate features in the ascidian tadpole larva. To determine the structure of the Manx gene, we isolated and sequenced genomic clones from the tailed ascidian Molgula oculata. The Manx gene contains 9 exons and encodes both major and minor Manx mRNAs, which differ in the length of their 5' untranslated regions. The coding region of the single-copy bobcat gene, which encodes a DEAD-box RNA helicase, is embedded within the first Manx intron. The organization of the bobcat and Manx transcription units was determined by comparing genomic and cDNA clones. The Manx-bobcat gene locus has an unusual organization in which a non-coding first exon is alternatively spliced at the 5' end of two different mRNAs. The bobcat and Manx genes are expressed coordinately during oogenesis and embryogenesis, but not during spermatogenesis, in which bobcat mRNA accumulates independently of Manx mRNA. Similar to Manx, zygotic bobcat transcripts accumulate in the embryonic primordia responsible for generating chordate features, including the dorsal neural tube and notochord, are downregulated during embryogenesis in the tailless species Molgula occulta and are upregulated in M. occulta X M. oculata hybrids, which restore these chordate features. Antisense experiments indicate that zygotic bobcat expression is required for development of the same suite of chordate features as Manx. The results show that the Manx-bobcat gene complex has a role in the development of chordate features in ascidian tadpole larvae.
Analysis of CHRNA7 rare variants in autism spectrum disorder susceptibility.
Bacchelli, Elena; Battaglia, Agatino; Cameli, Cinzia; Lomartire, Silvia; Tancredi, Raffaella; Thomson, Susanne; Sutcliffe, James S; Maestrini, Elena
2015-04-01
Chromosome 15q13.3 recurrent microdeletions are causally associated with a wide range of phenotypes, including autism spectrum disorder (ASD), seizures, intellectual disability, and other psychiatric conditions. Whether the reciprocal microduplication is pathogenic is less certain. CHRNA7, encoding for the alpha7 subunit of the neuronal nicotinic acetylcholine receptor, is considered the likely culprit gene in mediating neurological phenotypes in 15q13.3 deletion cases. To assess if CHRNA7 rare variants confer risk to ASD, we performed copy number variant analysis and Sanger sequencing of the CHRNA7 coding sequence in a sample of 135 ASD cases. Sequence variation in this gene remains largely unexplored, given the existence of a fusion gene, CHRFAM7A, which includes a nearly identical partial duplication of CHRNA7. Hence, attempts to sequence coding exons must distinguish between CHRNA7 and CHRFAM7A, making next-generation sequencing approaches unreliable for this purpose. A CHRNA7 microduplication was detected in a patient with autism and moderate cognitive impairment; while no rare damaging variants were identified in the coding region, we detected rare variants in the promoter region, previously described to functionally reduce transcription. This study represents the first sequence variant analysis of CHRNA7 in a sample of idiopathic autism. © 2015 Wiley Periodicals, Inc.
Applications of statistical physics and information theory to the analysis of DNA sequences
NASA Astrophysics Data System (ADS)
Grosse, Ivo
2000-10-01
DNA carries the genetic information of most living organisms, and the of genome projects is to uncover that genetic information. One basic task in the analysis of DNA sequences is the recognition of protein coding genes. Powerful computer programs for gene recognition have been developed, but most of them are based on statistical patterns that vary from species to species. In this thesis I address the question if there exist universal statistical patterns that are different in coding and noncoding DNA of all living species, regardless of their phylogenetic origin. In search for such species-independent patterns I study the mutual information function of genomic DNA sequences, and find that it shows persistent period-three oscillations. To understand the biological origin of the observed period-three oscillations, I compare the mutual information function of genomic DNA sequences to the mutual information function of stochastic model sequences. I find that the pseudo-exon model is able to reproduce the mutual information function of genomic DNA sequences. Moreover, I find that a generalization of the pseudo-exon model can connect the existence and the functional form of long-range correlations to the presence and the length distributions of coding and noncoding regions. Based on these theoretical studies I am able to find an information-theoretical quantity, the average mutual information (AMI), whose probability distributions are significantly different in coding and noncoding DNA, while they are almost identical in all studied species. These findings show that there exist universal statistical patterns that are different in coding and noncoding DNA of all studied species, and they suggest that the AMI may be used to identify genes in different living species, irrespective of their taxonomic origin.
Epithelial Plasticity in Castration-Resistant Prostate Cancer: Biology of the Lethal Phenotype
2011-07-01
Sorg BS, Albrecht T et al. Alternative inclusion of fibroblast growth factor receptor 2 exon IIIc in Dunning prostate tumors reveals unexpected... exon 658IIIc in Dunning prostate tumors reveals unexpected epithelial 659mesenchymal plasticity. Proc Natl Acad Sci U S A 2006;103: 66014116–21. 24...variant isoforms (such as, for example FGFR2 that includes or excludes either exon IIIc or exon IIIb), or CD133, or any combination of two or more
Mumm, Steven; Wenkert, Deborah; Zhang, Xiafang; McAlister, William H; Mier, Richard J; Whyte, Michael P
2007-02-01
Autosomal dominant OPK and BOS feature widespread foci of osteosclerotic trabeculae without or with skin lesions, respectively. Occasionally, a larger area of dense bone in OPK or BOS resembles MEL, a sporadic sclerosing disorder primarily involving cortical bone. Others, finding deactivating germline LEMD3 mutations in OPK or BOS, concluded such defects explain all three conditions. We found germline LEMD3 mutations in OPK and BOS but not in sporadic MEL. In 2004, others discovered that heterozygous, loss-of-function, germline mutations in the LEMD3 gene (LEMD3 or MAN1) cause both osteopoikilosis (OPK) and Buschke-Ollendorff syndrome (BOS). OPK is an autosomal dominant, usually benign, skeletal dysplasia featuring multiple, small, especially metaphyseal, oval or round, dense trabecular foci distributed symmetrically throughout the skeleton. BOS combines OPK with connective tissue nevi comprised of collagen and elastin. In some OPK and BOS families, an individual may have relatively large, asymmetric areas of dense cortical bone interpreted as melorheostosis (MEL). MEL, however, classically refers to a sporadic, troublesome skeletal dysostosis featuring large, asymmetric, "flowing hyperostosis" of long bone cortices often with overlying, constricting soft tissue abnormalities. However, a heterozygous germline mutation in LEMD3 was offered to explain MEL. We studied 11 unrelated individuals with sclerosing bone disorders where LEMD3 mutation was a potential etiology: familial OPK (1), familial BOS (2), previously reported familial OPK with MEL (1), sporadic MEL (3), sporadic MEL with mixed-sclerosing-bone dystrophy (1), and patients with other unusual sclerosing bone disorders (3). All coding exons and adjacent mRNA splice sites for LEMD3 were amplified by PCR and sequenced using genomic DNA from leukocytes. We did not study lesional tissue from bone or skin. In the OPK family, a heterozygous nonsense mutation (c.1433T>A, p.L478X) was discovered in exon 1. In the two BOS families, a heterozygous nonsense mutation (exon 1, c.1323C>A, p.Y441X) and a heterozygous frame-shift mutation (exon 1, c.332_333insTC) were identified. In the individual with MEL and familial OPK, a heterozygous nonsense mutation (c.1963C>T, p.R655X) was detected in exon 7. However, no LEMD3 mutation was found for any other patient, including all four with sporadic MEL. We confirm that OPK and BOS individuals, including those with MEL-like lesions, have heterozygous, deactivating, germline LEMD3 mutations. However, MEL remains of unknown etiology.
Hu, Dong Gui; McKinnon, Ross A.; Hulin, Julie-Ann; Mackenzie, Peter I.; Meech, Robyn
2016-01-01
Nearly 20 different transcripts of the human androgen receptor (AR) are reported with two currently listed as Refseq isoforms in the NCBI database. Isoform 1 encodes wild-type AR (type 1 AR) and isoform 2 encodes the variant AR45 (type 2 AR). Both variants contain eight exons: they share common exons 2–8 but differ in exon 1 with the canonical exon 1 in isoform 1 and the variant exon 1b in isoform 2. Splicing of exon 1 or exon 1b is reported to be mutually exclusive. In this study, we identified a novel exon 1b (1b/TAG) that contains an additional TAG trinucleotide upstream of exon 1b. Moreover, we identified AR transcripts in both normal and cancerous breast and prostate cells that contained either exon 1b or 1b/TAG spliced between the canonical exon 1 and exon 2, generating nine-exon AR transcripts that we have named isoforms 3a and 3b. The proteins encoded by these new AR variants could regulate androgen-responsive reporters in breast and prostate cancer cells under androgen-depleted conditions. Analysis of type 3 AR-GFP fusion proteins showed partial nuclear localization in PC3 cells under androgen-depleted conditions, supporting androgen-independent activation of the AR. Type 3 AR proteins inhibited androgen-induced growth of LNCaP cells. Microarray analysis identified a small set of type 3a AR target genes in LNCaP cells, including genes known to modulate growth and proliferation of prostate cancer (PCGEM1, PEG3, EPHA3, and EFNB2) or other types of human cancers (TOX3, ST8SIA4, and SLITRK3), and genes that are diagnostic/prognostic biomarkers of prostate cancer (GRINA3, and BCHE). PMID:28035996
Connecting the dots: chromatin and alternative splicing in EMT
Warns, Jessica A.; Davie, James R.; Dhasarathy, Archana
2015-01-01
Nature has devised sophisticated cellular machinery to process mRNA transcripts produced by RNA Polymerase II, removing intronic regions and connecting exons together, to produce mature RNAs. This process, known as splicing, is very closely linked to transcription. Alternative splicing, or the ability to produce different combinations of exons that are spliced together from the same genomic template, is a fundamental means of regulating protein complexity. Similar to transcription, both constitutive and alternative splicing can be regulated by chromatin and its associated factors in response to various signal transduction pathways activated by external stimuli. This regulation can vary between different cell types, and interference with these pathways can lead to changes in splicing, often resulting in aberrant cellular states and disease. The epithelial to mesenchymal transition (EMT), which leads to cancer metastasis, is influenced by alternative splicing events of chromatin remodelers and epigenetic factors such as DNA methylation and non-coding RNAs. In this review, we will discuss the role of epigenetic factors including chromatin, chromatin remodelers, DNA methyltransferases and microRNAs in the context of alternative splicing, and discuss their potential involvement in alternative splicing during the EMT process. PMID:26291837
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ainsworth, P.J.; Coulter-Mackie, M.B.
1992-10-01
The B1 variant form of Tay-Sachs disease is enzymologically unique in that the causative mutation(s) appear to affect the active site in the [alpha] subunit of [beta]-hexosaminidase A without altering its ability to associate with the [beta] subunit. Most previously reported B1 variant mutations were found in exon 5 within codon 178. The coding sequence of the [alpha] subunit gene of a patient with the B1 variant form was examined with a combination of reverse transcription of mRNA to cDNA, PCR, and dideoxy sequencing. A double mutation in exon 6 has been identified: a G[sub 574][yields]C transversion causing a val[submore » 192][yields]leu change and a G[sub 598][yields] A transition resulting in a val[sub 200][yields]met alteration. The amplified cDNAs were otherwise normal throughout their sequence. The 574 and 598 alterations have been confirmed by amplification directly from genomic DNA from the patient and her mother. Transient-expression studies of the two exon 6 mutations (singly or together) in COS-1 cells show that the G[sub 574][yields]C change is sufficient to cause the loss of enzyme activity. The biochemical phenotype of the 574 alteration in transfection studies is consistent with that expected for a B1 variant mutation. As such, this mutation differs from previously reported B1 variant mutations, all of which occur in exon 5. 31 refs., 2 figs., 2 tabs.« less
Chang, Cheng; Shen, Wen-Kai; Wang, Tzu-Ting; Lin, Ying-Hsi; Hsu, Err-Lieh; Dai, Shu-Mei
2009-04-01
To identify pertinent mutations associated with knockdown resistance to permethrin, the entire coding sequence of the voltage-gated sodium channel gene Aa-para was sequenced and analyzed from a Per-R strain with 190-fold resistance to permethrin and two susceptible strains of Aedes aegypti. The longest transcript, a 6441bp open reading frame, encodes 2147 amino acid residues with an estimated molecular mass of 241kDa. A total of 33 exons were found in the Aa-para gene over 293kb of genomic DNA. Three previously unreported optional exons were identified. The first two exons, m and n, were located within the intracellular domain I/II, and the third, f', was found within the II/III linkers. The two mutually exclusive exons, d and l, were the only alternative exons in all the cDNA clones sequenced in this study. The most distinct finding was a novel amino acid substitution mutation, D1794Y, located within the extracellular linker between IVS5 and IVS6, which is concurrent with the known V1023G mutation in Aa-para of the Per-R strain. The high frequency and coexistence of the two mutations in the Per-R strain suggest that they might exert a synergistic effect to provide the knockdown resistance to permethrin. Furthermore, both cDNA and genomic DNA data from the same individual mosquitoes have demonstrated that RNA editing was not involved in amino acid substitutions of the Per-R strain.
Kuzuoka, M; Takahashi, T; Guron, C; Raghow, R
1994-05-01
Detailed molecular organization of the coding and upstream regulatory regions of the murine homeodomain-containing gene, Msx-1, is reported. The protein-encoding portion of the gene is contained in two exons, 590 and 1214 bp in length, separated by a 2107-bp intron; the homeodomain is located in the second exon. The two-exon organization of the murine Msx-1 gene resembles a number of other homeodomain-containing genes. The 5'-(GTAAGT) and 3'-(CCCTAG) splicing junctions and the mRNA polyadenylation signal (UAUAA) of the murine Msx-1 gene are also characteristic of other vertebrate genes. By nuclease protection and primer extension assays, the start of transcription of the Msx-1 gene was located 256 bp upstream of the first AUG. Computer analysis of the promoter proximal 1280-bp sequence revealed a number of potentially important cis-regulatory sequences; these include the recognition elements for Ap-1, Ap-2, Ap-3, Sp-1, a possible binding site for RAR:RXR, and a number of TCF-1 consensus motifs. Importantly, a perfect reverse complement of (C/G)TTAATTG, which was recently shown to be an optimal binding sequence for the homeodomain of Msx-1 protein (K.M. Catron, N. Iler, and C. Abate (1993) Mol. Cell. Biol. 13:2354-2365), was also located in the murine Msx-1 promoter. Binding of bacterially expressed Msx-1 homeodomain polypeptide to Msx-1-specific oligonucleotide was experimentally demonstrated, raising a distinct possibility of autoregulation of this developmentally regulated gene.
Patterson, Emily J; Wilk, Melissa; Langlo, Christopher S; Kasilian, Melissa; Ring, Michael; Hufnagel, Robert B; Dubis, Adam M; Tee, James J; Kalitzeos, Angelos; Gardner, Jessica C; Ahmed, Zubair M; Sisk, Robert A; Larsen, Michael; Sjoberg, Stacy; Connor, Thomas B; Dubra, Alfredo; Neitz, Jay; Hardcastle, Alison J; Neitz, Maureen; Michaelides, Michel; Carroll, Joseph
2016-07-01
Mutations in the coding sequence of the L and M opsin genes are often associated with X-linked cone dysfunction (such as Bornholm Eye Disease, BED), though the exact color vision phenotype associated with these disorders is variable. We examined individuals with L/M opsin gene mutations to clarify the link between color vision deficiency and cone dysfunction. We recruited 17 males for imaging. The thickness and integrity of the photoreceptor layers were evaluated using spectral-domain optical coherence tomography. Cone density was measured using high-resolution images of the cone mosaic obtained with adaptive optics scanning light ophthalmoscopy. The L/M opsin gene array was characterized in 16 subjects, including at least one subject from each family. There were six subjects with the LVAVA haplotype encoded by exon 3, seven with LIAVA, two with the Cys203Arg mutation encoded by exon 4, and two with a novel insertion in exon 2. Foveal cone structure and retinal thickness was disrupted to a variable degree, even among related individuals with the same L/M array. Our findings provide a direct link between disruption of the cone mosaic and L/M opsin variants. We hypothesize that, in addition to large phenotypic differences between different L/M opsin variants, the ratio of expression of first versus downstream genes in the L/M array contributes to phenotypic diversity. While the L/M opsin mutations underlie the cone dysfunction in all of the subjects tested, the color vision defect can be caused either by the same mutation or a gene rearrangement at the same locus.
Badr, Eman; ElHefnawi, Mahmoud; Heath, Lenwood S
2016-01-01
Alternative splicing is a vital process for regulating gene expression and promoting proteomic diversity. It plays a key role in tissue-specific expressed genes. This specificity is mainly regulated by splicing factors that bind to specific sequences called splicing regulatory elements (SREs). Here, we report a genome-wide analysis to study alternative splicing on multiple tissues, including brain, heart, liver, and muscle. We propose a pipeline to identify differential exons across tissues and hence tissue-specific SREs. In our pipeline, we utilize the DEXSeq package along with our previously reported algorithms. Utilizing the publicly available RNA-Seq data set from the Human BodyMap project, we identified 28,100 differentially used exons across the four tissues. We identified tissue-specific exonic splicing enhancers that overlap with various previously published experimental and computational databases. A complicated exonic enhancer regulatory network was revealed, where multiple exonic enhancers were found across multiple tissues while some were found only in specific tissues. Putative combinatorial exonic enhancers and silencers were discovered as well, which may be responsible for exon inclusion or exclusion across tissues. Some of the exonic enhancers are found to be co-occurring with multiple exonic silencers and vice versa, which demonstrates a complicated relationship between tissue-specific exonic enhancers and silencers.
Castrignanò, Tiziana; Canali, Alessandro; Grillo, Giorgio; Liuni, Sabino; Mignone, Flavio; Pesole, Graziano
2004-01-01
The identification and characterization of genome tracts that are highly conserved across species during evolution may contribute significantly to the functional annotation of whole-genome sequences. Indeed, such sequences are likely to correspond to known or unknown coding exons or regulatory motifs. Here, we present a web server implementing a previously developed algorithm that, by comparing user-submitted genome sequences, is able to identify statistically significant conserved blocks and assess their coding or noncoding nature through the measure of a coding potential score. The web tool, available at http://www.caspur.it/CSTminer/, is dynamically interconnected with the Ensembl genome resources and produces a graphical output showing a map of detected conserved sequences and annotated gene features. PMID:15215464
CTCF, a Novel Regulator of Alternative Splicing | Center for Cancer Research
Alternative splicing, or the inclusion of different patterns of exons from the same gene, plays an important role in expanding the coding possibilities of a limited genome. The immune system is an ideal system to study this since alternative splicing is used to generate an almost unlimited number of antibodies against any pathogen we might encounter.
Origin of sphinx, a young chimeric RNA gene in Drosophila melanogaster
Wang, Wen; Brunet, Frédéric G.; Nevo, Eviatar; Long, Manyuan
2002-01-01
Non-protein-coding RNA genes play an important role in various biological processes. How new RNA genes originated and whether this process is controlled by similar evolutionary mechanisms for the origin of protein-coding genes remains unclear. A young chimeric RNA gene that we term sphinx (spx) provides the first insight into the early stage of evolution of RNA genes. spx originated as an insertion of a retroposed sequence of the ATP synthase chain F gene at the cytological region 60DB since the divergence of Drosophila melanogaster from its sibling species 2–3 million years ago. This retrosequence, which is located at 102F on the fourth chromosome, recruited a nearby exon and intron, thereby evolving a chimeric gene structure. This molecular process suggests that the mechanism of exon shuffling, which can generate protein-coding genes, also plays a role in the origin of RNA genes. The subsequent evolutionary process of spx has been associated with a high nucleotide substitution rate, possibly driven by a continuous positive Darwinian selection for a novel function, as is shown in its sex- and development-specific alternative splicing. To test whether spx has adapted to different environments, we investigated its population genetic structure in the unique “Evolution Canyon” in Israel, revealing a similar haplotype structure in spx, and thus similar evolutionary forces operating on spx between environments. PMID:11904380
Rozhdestvensky, Timofey S; Robeck, Thomas; Galiveti, Chenna R; Raabe, Carsten A; Seeger, Birte; Wolters, Anna; Gubar, Leonid V; Brosius, Jürgen; Skryabin, Boris V
2016-02-05
Prader-Willi syndrome (PWS) is a neurogenetic disorder caused by loss of paternally expressed genes on chromosome 15q11-q13. The PWS-critical region (PWScr) contains an array of non-protein coding IPW-A exons hosting intronic SNORD116 snoRNA genes. Deletion of PWScr is associated with PWS in humans and growth retardation in mice exhibiting ~15% postnatal lethality in C57BL/6 background. Here we analysed a knock-in mouse containing a 5'HPRT-LoxP-Neo(R) cassette (5'LoxP) inserted upstream of the PWScr. When the insertion was inherited maternally in a paternal PWScr-deletion mouse model (PWScr(p-/m5'LoxP)), we observed compensation of growth retardation and postnatal lethality. Genomic methylation pattern and expression of protein-coding genes remained unaltered at the PWS-locus of PWScr(p-/m5'LoxP) mice. Interestingly, ubiquitous Snord116 and IPW-A exon transcription from the originally silent maternal chromosome was detected. In situ hybridization indicated that PWScr(p-/m5'LoxP) mice expressed Snord116 in brain areas similar to wild type animals. Our results suggest that the lack of PWScr RNA expression in certain brain areas could be a primary cause of the growth retardation phenotype in mice. We propose that activation of disease-associated genes on imprinted regions could lead to general therapeutic strategies in associated diseases.
Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing
Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng
2017-01-01
ABSTRACT Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure. PMID:28277933
Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing.
Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng
2017-10-03
Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure.
Liskova, Petra; Tuft, Stephen J.; Gwilliam, Rhian; Ebenezer, Neil D.; Jirsova, Katerina; Prescott, Quincy; Martincova, Radka; Pretorius, Marike; Sinclair, Neil; Boase, David L.; Jeffrey, Margaret J.; Deloukas, Panos; Hardcastle, Alison J.; Filipec, Martin; Bhattacharya, Shomi S.
2009-01-01
We describe the search for mutations in six unrelated Czech and four unrelated British families with posterior polymorphous corneal dystrophy (PPCD); a relatively rare eye disorder. Coding exons and intron/exon boundaries of all three genes (VSX1, COL8A2, and ZEB1/TCF8) previously reported to be implicated in the pathogenesis of this disorder were screened by DNA sequencing. Four novel pathogenic mutations were identified in four families; two deletions, one nonsense, and one duplication within exon 7 in the ZEB1 gene located at 10p11.2. We also genotyped the Czech patients to test for a founder haplotype and lack of disease segregation with the 20p11.2 locus we previously described. Although a systematic clinical examination was not performed, our investigation does not support an association between ZEB1 changes and self reported non-ocular anomalies. In the remaining six families no disease causing mutations were identified thereby indicating that as yet unidentified gene(s) are likely to be responsible for PPCD. PMID:17437275
Exon Shuffling and Origin of Scorpion Venom Biodiversity
Wang, Xueli; Gao, Bin; Zhu, Shunyi
2016-01-01
Scorpion venom is a complex combinatorial library of peptides and proteins with multiple biological functions. A combination of transcriptomic and proteomic techniques has revealed its enormous molecular diversity, as identified by the presence of a large number of ion channel-targeted neurotoxins with different folds, membrane-active antimicrobial peptides, proteases, and protease inhibitors. Although the biodiversity of scorpion venom has long been known, how it arises remains unsolved. In this work, we analyzed the exon-intron structures of an array of scorpion venom protein-encoding genes and unexpectedly found that nearly all of these genes possess a phase-1 intron (one intron located between the first and second nucleotides of a codon) near the cleavage site of a signal sequence despite their mature peptides remarkably differ. This observation matches a theory of exon shuffling in the origin of new genes and suggests that recruitment of different folds into scorpion venom might be achieved via shuffling between body protein-coding genes and ancestral venom gland-specific genes that presumably contributed tissue-specific regulatory elements and secretory signal sequences. PMID:28035955
Exon Shuffling and Origin of Scorpion Venom Biodiversity.
Wang, Xueli; Gao, Bin; Zhu, Shunyi
2016-12-26
Scorpion venom is a complex combinatorial library of peptides and proteins with multiple biological functions. A combination of transcriptomic and proteomic techniques has revealed its enormous molecular diversity, as identified by the presence of a large number of ion channel-targeted neurotoxins with different folds, membrane-active antimicrobial peptides, proteases, and protease inhibitors. Although the biodiversity of scorpion venom has long been known, how it arises remains unsolved. In this work, we analyzed the exon-intron structures of an array of scorpion venom protein-encoding genes and unexpectedly found that nearly all of these genes possess a phase-1 intron (one intron located between the first and second nucleotides of a codon) near the cleavage site of a signal sequence despite their mature peptides remarkably differ. This observation matches a theory of exon shuffling in the origin of new genes and suggests that recruitment of different folds into scorpion venom might be achieved via shuffling between body protein-coding genes and ancestral venom gland-specific genes that presumably contributed tissue-specific regulatory elements and secretory signal sequences.
Hotspots in PTPN11 Gene Among Indian Children With Noonan Syndrome.
Narayanan, Dhanya Lakshmi; Pandey, Himani; Moirangthem, Amita; Mandal, Kausik; Gupta, Rekha; Puri, Ratna Dua; Patil, S J; Phadke, Shubha R
2017-08-15
To test for PTPN11 mutations in clinically diagnosed cases of Noonan syndrome. 17 individuals with clinical diagnosis of Noonan syndrome were included in the study. Sanger sequencing of all the 15 exons of PTPN11 was done. A genotype-phenotype correlation was attempted. Mutation in PTPN11 was detected in 11 out of 17 (64.7%) patients with Noonan syndrome; 72% had mutation in exon 3 and 27 % had mutation in exon 13. PTPN11 mutation accounts for 64.7% of cases with clinical features of Noonan syndrome in India. Majority of the mutations are in exon 3 and exon 13 of PTPN11, making them the hotspots in Indian population.
Expression Profiling Smackdown: Human Transcriptome Array HTA 2.0 vs. RNA-Seq
Palermo, Meghann; Driscoll, Heather; Tighe, Scott; Dragon, Julie; Bond, Jeff; Shukla, Arti; Vangala, Mahesh; Vincent, James; Hunter, Tim
2014-01-01
The advent of both microarray and massively parallel sequencing have revolutionized high-throughput analysis of the human transcriptome. Due to limitations in microarray technology, detecting and quantifying coding transcript isoforms, in addition to non-coding transcripts, has been challenging. As a result, RNA-Seq has been the preferred method for characterizing the full human transcriptome, until now. A new high-resolution array from Affymetrix, GeneChip Human Transcriptome Array 2.0 (HTA 2.0), has been designed to interrogate all transcript isoforms in the human transcriptome with >6 million probes targeting coding transcripts, exon-exon splice junctions, and non-coding transcripts. Here we compare expression results from GeneChip HTA 2.0 and RNA-Seq data using identical RNA extractions from three samples each of healthy human mesothelial cells in culture, LP9-C1, and healthy mesothelial cells treated with asbestos, LP9-A1. For GeneChip HTA 2.0 sample preparation, we chose to compare two target preparation methods, NuGEN Ovation Pico WTA V2 with the Encore Biotin Module versus Affymetrix's GeneChip WT PLUS with the WT Terminal Labeling Kit, on identical RNA extractions from both untreated and treated samples. These same RNA extractions were used for the RNA-Seq library preparation. All analyses were performed in Partek Genomics Suite 6.6. Expression profiles for control and asbestos-treated mesothelial cells prepared with NuGEN versus Affymetrix target preparation methods (GeneChip HTA 2.0) are compared to each other as well as to RNA-Seq results.
Phylogenomic analyses data of the avian phylogenomics project.
Jarvis, Erich D; Mirarab, Siavash; Aberer, Andre J; Li, Bo; Houde, Peter; Li, Cai; Ho, Simon Y W; Faircloth, Brant C; Nabholz, Benoit; Howard, Jason T; Suh, Alexander; Weber, Claudia C; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Narula, Nitish; Liu, Liang; Burt, Dave; Ellegren, Hans; Edwards, Scott V; Stamatakis, Alexandros; Mindell, David P; Cracraft, Joel; Braun, Edward L; Warnow, Tandy; Jun, Wang; Gilbert, M Thomas Pius; Zhang, Guojie
2015-01-01
Determining the evolutionary relationships among the major lineages of extant birds has been one of the biggest challenges in systematic biology. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders. We used these genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomic analyses. Here we present the datasets associated with the phylogenomic analyses, which include sequence alignment files consisting of nucleotides, amino acids, indels, and transposable elements, as well as tree files containing gene trees and species trees. Inferring an accurate phylogeny required generating: 1) A well annotated data set across species based on genome synteny; 2) Alignments with unaligned or incorrectly overaligned sequences filtered out; and 3) Diverse data sets, including genes and their inferred trees, indels, and transposable elements. Our total evidence nucleotide tree (TENT) data set (consisting of exons, introns, and UCEs) gave what we consider our most reliable species tree when using the concatenation-based ExaML algorithm or when using statistical binning with the coalescence-based MP-EST algorithm (which we refer to as MP-EST*). Other data sets, such as the coding sequence of some exons, revealed other properties of genome evolution, namely convergence. The Avian Phylogenomics Project is the largest vertebrate phylogenomics project to date that we are aware of. The sequence, alignment, and tree data are expected to accelerate analyses in phylogenomics and other related areas.
Screening for mutations in two exons of FANCG gene in Pakistani population.
Aymun, Ujala; Iram, Saima; Aftab, Iram; Khaliq, Saba; Nadir, Ali; Nisar, Ahmed; Mohsin, Shahida
2017-06-01
Fanconi anemia is a rare autosomal recessive disorder of genetic instability. It is both molecularly and clinically, a heterogeneous disorder. Its incidence is 1 in 129,000 births and relatively high in some ethnic groups. Sixteen genes have been identified among them mutations in FANCG gene are most common after FANCA and FANCC gene mutations. To study mutations in exon 3 and 4 of FANCG gene in Pakistani population. Thirty five patients with positive Diepoxybutane test were included in the study. DNA was extracted and amplified for exons 3 and 4. Thereafter Sequencing was done and analyzed for the presence of mutations. No mutation was detected in exon 3 whereas a carrier of known mutation c.307+1 G>T was found in exon 4 of the FANCG gene. Absence of any mutation in exon 3 and only one heterozygous mutation in exon 4 of FANCG gene points to a different spectrum of FA gene pool in Pakistan that needs extensive research in this area.
A Novel Nonsense Mutation in Exon 5 of KIND1 Gene in an Iranian Family with Kindler Syndrome.
Heidari, Mohammad Mehdi; Khatami, Mehri; Kargar, Saeed; Azari, Mojdeh; Hoseinzadeh, Hassan; Fallah, Hamedeh
2016-06-01
Kindler syndrome (KS) is an autosomal recessive skin disease characterized by actual blistering, photosensitivity and a progressive poikiloderma. The disorder results from rare mutations in the KIND1 gene. This gene contains 15 exons and expresses two kindlin-1 isoforms. The aim of this investigation was to analyze mutations in the exons 1 to 15 of KIND1 gene in an Iranian family clinically affected with Kindler syndrome. The mutations analysis of 15 coding exons of KIND1 gene was performed with PCR-SSCP and direct sequencing in 14 subjects from one Iranian family clinically affected with Kindler syndrome. We identified eight new nucleotide changes in KIND1 in this family. These changes were found in g.3892delA, g.3951T>C, g.3962T>G, g.4190G>T, g.7497G>A, g.11076T>C, g.11102C>T and g.13177C>T positions. Among them, the g.13177C>T mutation resulting in the formation of a premature stop codon (Q226X) was detected only in seven affected family individuals as homozygous but was not present in 100 unrelated healthy controls. This study suggests that nonsense mutation may lead to incomplete and non-functional protein products and is pathogenic and has meaningful implications for the diagnosis of patients with Kindler syndrome.
Lenarduzzi, S; Morgutti, M; Crovella, S; Coiana, A; Rosatelli, M C
2014-11-14
Cystic fibrosis (CF) is a common recessive genetic disease caused by mutations in the gene encoding for the cystic fibrosis transmembrane conductance regulator (CFTR) protein. More than 1800 different mutations have been described to date. Here, we report 3 novel mutations in CFTR in 3 Italian CF patients. To detect and identify 36 frequent mutations in Caucasians, we used the INNO-LiPA CFTR19 and INNO-LiPA CFTR17+Tn Update kits (Innogenetics; Ghent, Belgium). Our first analysis did not reveal both of the responsible mutations; thus, direct sequencing of the CFTR gene coding region was performed. The 3 patients were compound heterozygous. In one allele, the F508del (c.1521_1523delCTT, p.PHE508del) mutation in exon 11 was observed in each case. For the second allele, in patient No.1, direct sequencing revealed an 11-base pair deletion (GAGGCGATACT) in exon 14 (c.2236_2246del; pGlu746Alafs*29). In patient No. 2, direct sequencing revealed a nonsense mutation at nucleotide 3892 (c.3892G>T) in exon 24. In patient No. 3, direct sequencing revealed a deletion of cytosine in exon 27 (c.4296delC; p.Asn1432Lysfs*16). These 3 novel mutations indicate the production of a truncated protein, which consequently results in a non-functional polypeptide.
Soreq, Lilach; Guffanti, Alessandro; Salomonis, Nathan; Simchovitz, Alon; Israel, Zvi; Bergman, Hagai; Soreq, Hermona
2014-01-01
The continuously prolonged human lifespan is accompanied by increase in neurodegenerative diseases incidence, calling for the development of inexpensive blood-based diagnostics. Analyzing blood cell transcripts by RNA-Seq is a robust means to identify novel biomarkers that rapidly becomes a commonplace. However, there is lack of tools to discover novel exons, junctions and splicing events and to precisely and sensitively assess differential splicing through RNA-Seq data analysis and across RNA-Seq platforms. Here, we present a new and comprehensive computational workflow for whole-transcriptome RNA-Seq analysis, using an updated version of the software AltAnalyze, to identify both known and novel high-confidence alternative splicing events, and to integrate them with both protein-domains and microRNA binding annotations. We applied the novel workflow on RNA-Seq data from Parkinson's disease (PD) patients' leukocytes pre- and post- Deep Brain Stimulation (DBS) treatment and compared to healthy controls. Disease-mediated changes included decreased usage of alternative promoters and N-termini, 5′-end variations and mutually-exclusive exons. The PD regulated FUS and HNRNP A/B included prion-like domains regulated regions. We also present here a workflow to identify and analyze long non-coding RNAs (lncRNAs) via RNA-Seq data. We identified reduced lncRNA expression and selective PD-induced changes in 13 of over 6,000 detected leukocyte lncRNAs, four of which were inversely altered post-DBS. These included the U1 spliceosomal lncRNA and RP11-462G22.1, each entailing sequence complementarity to numerous microRNAs. Analysis of RNA-Seq from PD and unaffected controls brains revealed over 7,000 brain-expressed lncRNAs, of which 3,495 were co-expressed in the leukocytes including U1, which showed both leukocyte and brain increases. Furthermore, qRT-PCR validations confirmed these co-increases in PD leukocytes and two brain regions, the amygdala and substantia-nigra, compared to controls. This novel workflow allows deep multi-level inspection of RNA-Seq datasets and provides a comprehensive new resource for understanding disease transcriptome modifications in PD and other neurodegenerative diseases. PMID:24651478
Lin, Yani; Liu, Enbin; Sun, Qi; Ma, Jiao; Li, QingHua; Cao, Zeng; Wang, Jun; Jia, Yujiao; Zhang, Hongju; Song, Zhen; Ai, Xiaofei; Shi, Lihui; Feng, Xiaofang; Li, Chenwei; Wang, Jianxiang; Ru, Kun
2015-07-01
To evaluate the mutation frequency of JAK2 V617F, JAK2 exon 12, MPL exon 10, and CALR exon 9 and the value of the combined tests in the diagnosis of BCR-ABL1-negative myeloproliferative neoplasms (MPNs). In the current study, mutations of JAK2 V617F, JAK2 exon 12, MPL exon 10, and CALR exon 9 were analyzed in 929 Chinese patients with BCR-ABL1-negative MPN, including 234 cases of polycythemia vera (PV), 428 ETs, 187 PMFs, and 80 unclassifiable MPNs (MPN-Us). Our result showed that the positive rate of any of four mutations in patients with PV, ET, PMF, and MPN-U was 89.3%, 83.4%, 87.2%, and 77.5%, respectively, which significantly improved the diagnostic rate, especially in ET and PMF. Meanwhile, we also found that the patients without any of four mutations were younger than those with one or more mutations. Unexpectedly, the coexistence of JAK2 V617F and CALR exon 9 was identified in six (0.6%) patients, and JAK2 V617F and MPL exon 10 were present simultaneously in two (0.2%) patients. In addition, we also identified several novel mutation types in CALR exon 9. The combined genetic tests of JAK2 V617F, JAK2 exon 12, MPL exon 10, and CALR exon 9 help improve the diagnostic rate for BCR-ABL1-negative MPN. Copyright© by the American Society for Clinical Pathology.
Alekseyenko, Alexander V.; Kim, Namshin; Lee, Christopher J.
2007-01-01
Association of alternative splicing (AS) with accelerated rates of exon evolution in some organisms has recently aroused widespread interest in its role in evolution of eukaryotic gene structure. Previous studies were limited to analysis of exon creation or lost events in mouse and/or human only. Our multigenome approach provides a way for (1) distinguishing creation and loss events on the large scale; (2) uncovering details of the evolutionary mechanisms involved; (3) estimating the corresponding rates over a wide range of evolutionary times and organisms; and (4) assessing the impact of AS on those evolutionary rates. We use previously unpublished independent analyses of alternative splicing in five species (human, mouse, dog, cow, and zebrafish) from the ASAP database combined with genomewide multiple alignment of 17 genomes to analyze exon creation and loss of both constitutively and alternatively spliced exons in mammals, fish, and birds. Our analysis provides a comprehensive database of exon creation and loss events over 360 million years of vertebrate evolution, including tens of thousands of alternative and constitutive exons. We find that exon inclusion level is inversely related to the rate of exon creation. In addition, we provide a detailed in-depth analysis of mechanisms of exon creation and loss, which suggests that a large fraction of nonrepetitive created exons are results of ab initio creation from purely intronic sequences. Our data indicate an important role for alternative splicing in creation of new exons and provide a useful novel database resource for future genome evolution research. PMID:17369312
Accurate clinical detection of exon copy number variants in a targeted NGS panel using DECoN.
Fowler, Anna; Mahamdallie, Shazia; Ruark, Elise; Seal, Sheila; Ramsay, Emma; Clarke, Matthew; Uddin, Imran; Wylie, Harriet; Strydom, Ann; Lunter, Gerton; Rahman, Nazneen
2016-11-25
Background: Targeted next generation sequencing (NGS) panels are increasingly being used in clinical genomics to increase capacity, throughput and affordability of gene testing. Identifying whole exon deletions or duplications (termed exon copy number variants, 'exon CNVs') in exon-targeted NGS panels has proved challenging, particularly for single exon CNVs. Methods: We developed a tool for the Detection of Exon Copy Number variants (DECoN), which is optimised for analysis of exon-targeted NGS panels in the clinical setting. We evaluated DECoN performance using 96 samples with independently validated exon CNV data. We performed simulations to evaluate DECoN detection performance of single exon CNVs and to evaluate performance using different coverage levels and sample numbers. Finally, we implemented DECoN in a clinical laboratory that tests BRCA1 and BRCA2 with the TruSight Cancer Panel (TSCP). We used DECoN to analyse 1,919 samples, validating exon CNV detections by multiplex ligation-dependent probe amplification (MLPA). Results: In the evaluation set, DECoN achieved 100% sensitivity and 99% specificity for BRCA exon CNVs, including identification of 8 single exon CNVs. DECoN also identified 14/15 exon CNVs in 8 other genes. Simulations of all possible BRCA single exon CNVs gave a mean sensitivity of 98% for deletions and 95% for duplications. DECoN performance remained excellent with different levels of coverage and sample numbers; sensitivity and specificity was >98% with the typical NGS run parameters. In the clinical pipeline, DECoN automatically analyses pools of 48 samples at a time, taking 24 minutes per pool, on average. DECoN detected 24 BRCA exon CNVs, of which 23 were confirmed by MLPA, giving a false discovery rate of 4%. Specificity was 99.7%. Conclusions: DECoN is a fast, accurate, exon CNV detection tool readily implementable in research and clinical NGS pipelines. It has high sensitivity and specificity and acceptable false discovery rate. DECoN is freely available at www.icr.ac.uk/decon.
Abascal, Federico; Ezkurdia, Iakes; Rodriguez-Rivas, Juan; Rodriguez, Jose Manuel; del Pozo, Angela; Vázquez, Jesús; Valencia, Alfonso; Tress, Michael L.
2015-01-01
Alternative splicing of messenger RNA can generate a wide variety of mature RNA transcripts, and these transcripts may produce protein isoforms with diverse cellular functions. While there is much supporting evidence for the expression of alternative transcripts, the same is not true for the alternatively spliced protein products. Large-scale mass spectroscopy experiments have identified evidence of alternative splicing at the protein level, but with conflicting results. Here we carried out a rigorous analysis of the peptide evidence from eight large-scale proteomics experiments to assess the scale of alternative splicing that is detectable by high-resolution mass spectroscopy. We find fewer splice events than would be expected: we identified peptides for almost 64% of human protein coding genes, but detected just 282 splice events. This data suggests that most genes have a single dominant isoform at the protein level. Many of the alternative isoforms that we could identify were only subtly different from the main splice isoform. Very few of the splice events identified at the protein level disrupted functional domains, in stark contrast to the two thirds of splice events annotated in the human genome that would lead to the loss or damage of functional domains. The most striking result was that more than 20% of the splice isoforms we identified were generated by substituting one homologous exon for another. This is significantly more than would be expected from the frequency of these events in the genome. These homologous exon substitution events were remarkably conserved—all the homologous exons we identified evolved over 460 million years ago—and eight of the fourteen tissue-specific splice isoforms we identified were generated from homologous exons. The combination of proteomics evidence, ancient origin and tissue-specific splicing indicates that isoforms generated from homologous exons may have important cellular roles. PMID:26061177
Genetics of Type III Bartter Syndrome in Spain, Proposed Diagnostic Algorithm
García Castaño, Alejandro; Pérez de Nanclares, Gustavo; Madariaga, Leire; Aguirre, Mireia; Madrid, Alvaro; Nadal, Inmaculada; Navarro, Mercedes; Lucas, Elena; Fijo, Julia; Espino, Mar; Espitaletta, Zilac; Castaño, Luis; Ariceta, Gema
2013-01-01
The p.Ala204Thr mutation (exon 7) of the CLCNKB gene is a "founder" mutation that causes most of type III Bartter syndrome cases in Spain. We performed genetic analysis of the CLCNKB gene, which encodes for the chloride channel protein ClC-Kb, in a cohort of 26 affected patients from 23 families. The diagnostic algorithm was: first, detection of the p.Ala204Thr mutation; second, detecting large deletions or duplications by Multiplex Ligation-dependent Probe Amplification and Quantitative Multiplex PCR of Short Fluorescent Fragments; and third, sequencing of the coding and flanking regions of the whole CLCNKB gene. In our genetic diagnosis, 20 families presented with the p.Ala204Thr mutation. Of those, 15 patients (15 families) were homozygous (57.7% of overall patients). Another 8 patients (5 families) were compound heterozygous for the founder mutation together with a second one. Thus, 3 patients (2 siblings) presented with the c. -19-?_2053+? del deletion (comprising the entire gene); one patient carried the p.Val170Met mutation (exon 6); and 4 patients (3 siblings) presented with the novel p.Glu442Gly mutation (exon 14). On the other hand, another two patients carried two novel mutations in compound heterozygosis: one presented the p.Ile398_Thr401del mutation (exon 12) associated with the c. -19-?_2053+? del deletion, and the other one carried the c.1756+1G>A splice-site mutation (exon 16) as well as the already described p.Ala210Val change (exon 7). One case turned out to be negative in our genetic screening. In addition, 51 relatives were found to be heterozygous carriers of the described CLCNKB mutations. In conclusion, different mutations cause type III Bartter syndrome in Spain. The high prevalence of the p.Ala204Thr in Spanish families thus justifies an initial screen for this mutation. However, should it not be detected further investigation of the CLCNKB gene is warranted in clinically diagnosed families. PMID:24058621
Genetics of type III Bartter syndrome in Spain, proposed diagnostic algorithm.
García Castaño, Alejandro; Pérez de Nanclares, Gustavo; Madariaga, Leire; Aguirre, Mireia; Madrid, Alvaro; Nadal, Inmaculada; Navarro, Mercedes; Lucas, Elena; Fijo, Julia; Espino, Mar; Espitaletta, Zilac; Castaño, Luis; Ariceta, Gema
2013-01-01
The p.Ala204Thr mutation (exon 7) of the CLCNKB gene is a "founder" mutation that causes most of type III Bartter syndrome cases in Spain. We performed genetic analysis of the CLCNKB gene, which encodes for the chloride channel protein ClC-Kb, in a cohort of 26 affected patients from 23 families. The diagnostic algorithm was: first, detection of the p.Ala204Thr mutation; second, detecting large deletions or duplications by Multiplex Ligation-dependent Probe Amplification and Quantitative Multiplex PCR of Short Fluorescent Fragments; and third, sequencing of the coding and flanking regions of the whole CLCNKB gene. In our genetic diagnosis, 20 families presented with the p.Ala204Thr mutation. Of those, 15 patients (15 families) were homozygous (57.7% of overall patients). Another 8 patients (5 families) were compound heterozygous for the founder mutation together with a second one. Thus, 3 patients (2 siblings) presented with the c. -19-?_2053+? del deletion (comprising the entire gene); one patient carried the p.Val170Met mutation (exon 6); and 4 patients (3 siblings) presented with the novel p.Glu442Gly mutation (exon 14). On the other hand, another two patients carried two novel mutations in compound heterozygosis: one presented the p.Ile398_Thr401del mutation (exon 12) associated with the c. -19-?_2053+? del deletion, and the other one carried the c.1756+1G>A splice-site mutation (exon 16) as well as the already described p.Ala210Val change (exon 7). One case turned out to be negative in our genetic screening. In addition, 51 relatives were found to be heterozygous carriers of the described CLCNKB mutations. In conclusion, different mutations cause type III Bartter syndrome in Spain. The high prevalence of the p.Ala204Thr in Spanish families thus justifies an initial screen for this mutation. However, should it not be detected further investigation of the CLCNKB gene is warranted in clinically diagnosed families.
Mamatha, Gandra; Umashankar, Vetrivel; Kasinathan, Nachiappan; Krishnan, Tandava; Sathyabaarathi, Ravichandran; Karthiyayini, Thirumalai; Amali, John; Rao, Chetan
2011-01-01
Purpose Bietti crystalline dystrophy (BCD) is an autosomal recessive disease characterized by intraretinal deposits of multiple small crystals, with or without associated crystal deposits in the cornea. The disease is caused by mutation in the cytochrome p450, family 4, subfamily v, polypeptide 2 (CYP4V2) gene. Choroidal neovascularization (CNV) is a rare event in BCD. We report two cases of BCD associated with CNV. CYP4V2 and exon 5 of tissue inhibitor of metalloproteinase 3 (TIMP3) were screened in both cases. A patient with BCD, but without CNV, was also screened to identify pathogenic variations. Methods Three BCD families of Asian Indian origin were recruited after a comprehensive ophthalmic examination. Genomic DNA was isolated from blood leukocytes, and coding exons and flanking introns of CYP4V2 and exon 5 of TIMP3 were amplified via polymerase chain reaction (PCR) and were sequenced. Family segregation, control screening, and bioinformatics tools were used to assess the pathogenicity of the novel variations. Results Of the three BCD patients, two had parafoveal CNV. The patient with BCD, but without CNV had novel single base-pair duplication (c.1062_1063dupA). This mutation results in a structurally defective and unstable protein with impaired protein function. Four novel benign variations (three in exons and one in an intron) were observed in the cohort. Screening of exon 5 of TIMP3 did not reveal any variation in these families. Conclusions A novel mutation was found in a patient with BCD but without CNV, while patients with BCD and CNV did not show any pathogenic variation. The modifier role of TIMP3 in the pathogenesis of CNV in BCD was partly ruled out, as no variation was observed in exon 5 of the gene. A larger BCD cohort with CNV needs to be studied and screened to understand the genetics of CNV in BCD. PMID:21850171
Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures
Stark, Alexander; Lin, Michael F.; Kheradpour, Pouya; Pedersen, Jakob S.; Parts, Leopold; Carlson, Joseph W.; Crosby, Madeline A.; Rasmussen, Matthew D.; Roy, Sushmita; Deoras, Ameya N.; Ruby, J. Graham; Brennecke, Julius; Hodges, Emily; Hinrichs, Angie S.; Caspi, Anat; Paten, Benedict; Park, Seung-Won; Han, Mira V.; Maeder, Morgan L.; Polansky, Benjamin J.; Robson, Bryanne E.; Aerts, Stein; van Helden, Jacques; Hassan, Bassem; Gilbert, Donald G.; Eastman, Deborah A.; Rice, Michael; Weir, Michael; Hahn, Matthew W.; Park, Yongkyu; Dewey, Colin N.; Pachter, Lior; Kent, W. James; Haussler, David; Lai, Eric C.; Bartel, David P.; Hannon, Gregory J.; Kaufman, Thomas C.; Eisen, Michael B.; Clark, Andrew G.; Smith, Douglas; Celniker, Susan E.; Gelbart, William M.; Kellis, Manolis
2008-01-01
Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or ‘evolutionary signatures’, dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies. PMID:17994088
The structure of the human interferon alpha/beta receptor gene.
Lutfalla, G; Gardiner, K; Proudhon, D; Vielh, E; Uzé, G
1992-02-05
Using the cDNA coding for the human interferon alpha/beta receptor (IFNAR), the IFNAR gene has been physically mapped relative to the other loci of the chromosome 21q22.1 region. 32,906 base pairs covering the IFNAR gene have been cloned and sequenced. Primer extension and solution hybridization-ribonuclease protection have been used to determine that the transcription of the gene is initiated in a broad region of 20 base pairs. Some aspects of the polymorphism of the gene, including noncoding sequences, have been analyzed; some are allelic differences in the coding sequence that induce amino acid variations in the resulting protein. The exon structure of the IFNAR gene and of that of the available genes for the receptors of the cytokine/growth hormone/prolactin/interferon receptor family have been compared with the predictions for the secondary structure of those receptors. From this analysis, we postulate a common origin and propose an hypothesis for the divergence from the immunoglobulin superfamily.
Shakoor, Nadia; Nair, Ramesh; Crasta, Oswald; Morris, Geoffrey; Feltus, Alex; Kresovich, Stephen
2014-01-23
Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specific probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e.g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community.
2014-01-01
Background Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. Results This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specific probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e.g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Conclusions Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community. PMID:24456189
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solera, J.; Magallon, M.; Martin-Villar, J.
1992-02-01
DNA from a patient with severe hemophilia B was evaluated by RFLP analysis, producing results which suggested the existence of a partial deletion within the factor IX gene. The deletion was further localized and characterized by PCR amplification and sequencing. The altered allele has a 4,442-bp deletion which removes both the donor splice site located at the 5[prime] end of intron d and the two last coding nucleotides located at the 3[prime] end of exon IV in the normal factor IX gene; this fragment has been inserted in inverted orientation. Two homologous sequences have been discovered at the ends ofmore » the deleted DNA fragment.« less
Identification of new mutations in primary hyperoxaluria type 1 (PH1).
von Schnakenburg, C; Rumsby, G
1998-01-01
Primary hyperoxaluria type 1 (PH1) is caused by deficiency of the hepatic peroxisomal enzyme alanine:glyoxylate aminotransferase (AGT). The AGXT gene, which codes for the 392 amino acid protein, has been mapped to chromosome 2q37.3. In order to identify new mutations in the AGXT gene we studied 79 PH1 patients using single strand conformation polymorphism analysis. In addition to a cluster of new mutations in exon 7 we report five novel mutations in exons 2, 4, 5, 9 and 10. These are T444C, G640A, G690A, 1008-1010delGCG and G1171A. These five new mutations contribute to our knowledge of the AGXT gene. Their possible consequences for PH1 phenotype and enzyme activity are discussed.
Wang, Chun-Chi; Chang, Jan-Gowth; Chen, Yen-Ling; Jong, Yuh-Jyh; Wu, Shou-Mei
2010-07-01
In this study, we established the first method for simultaneous evaluation of nine exons in the survival motor neuron (SMN) genes for full-scale genotyping. This method was used not only to quantify the copy numbers of highly homogenous telomeric SMN (SMN1)/centromeric SMN genes in exons 7 and 8 but also to determine intragenic mutations in all nine exons for complete diagnosis of spinal muscular atrophy (SMA). Additionally, we utilized the "universal fluorescent PCR" for simultaneously fluorescent labeling of eleven gene fragments (nine exons in SMN and two internal standards). Such technique is very beneficial for multi-exon analysis due to only requirement of one universal fluorescent primer which could fluorescently amplify all gene fragments. Of all 262 detected individuals, three subjects possessing different ratios of SMN1/centromeric SMN in the two exons were determined as gene conversion, and we also detected three interesting intragenic mutations (c.1 -39A>G, c.22_23insA in exon 1, c.84C>T in exon 2a) which were associated with the SMA patients owning one copy of SMN1 including two mutations never reported previously. This high-resolved method provided better potential technique for genotyping and identifying SMA, carrier and normal controls in large population.
NASA Technical Reports Server (NTRS)
Chang, Dong Kyung; Metzgar, David; Wills, Christopher; Boland, C. Richard
2003-01-01
All "minor" components of the human DNA mismatch repair (MMR) system-MSH3, MSH6, PMS2, and the recently discovered MLH3-contain mononucleotide microsatellites in their coding sequences. This intriguing finding contrasts with the situation found in the major components of the DNA MMR system-MSH2 and MLH1-and, in fact, most human genes. Although eukaryotic genomes are rich in microsatellites, non-triplet microsatellites are rare in coding regions. The recurring presence of exonal mononucleotide repeat sequences within a single family of human genes would therefore be considered exceptional.
Critical Infrastructures: Background, Policy, and Implementation
2006-04-18
initially approved. The current review process for such purchases, implemented under authority of the Exon- Florio provision of the Defense Production Act (50...legislative proposals “broaden” coverage of Exon- Florio to include the purchase of assets associated with critical infrastructure. As currently written...Exon- Florio covers persons engaged in interstate commerce. While this covers a broad range of “persons,” extending beyond those that might be part
USDA-ARS?s Scientific Manuscript database
KCC3 and KCC1 are potassium chloride transporters with partially overlapping function, and KCC3 knockout mice exhibit hypertension. Two KCC3 isoforms differ by alternate promoters and first coding exons: KCC3a is widely expressed, and KCC3b is highly expressed in kidney proximal convoluted tubule. W...
Nucleotide sequences of two genomic DNAs encoding peroxidase of Arabidopsis thaliana.
Intapruk, C; Higashimura, N; Yamamoto, K; Okada, N; Shinmyo, A; Takano, M
1991-02-15
The peroxidase (EC 1.11.1.7)-encoding gene of Arabidopsis thaliana was screened from a genomic library using a cDNA encoding a neutral isozyme of horseradish, Armoracia rusticana, peroxidase (HRP) as a probe, and two positive clones were isolated. From the comparison with the sequences of the HRP-encoding genes, we concluded that two clones contained peroxidase-encoding genes, and they were named prxCa and prxEa. Both genes consisted of four exons and three introns; the introns had consensus nucleotides, GT and AG, at the 5' and 3' ends, respectively. The lengths of each putative exon of the prxEa gene were the same as those of the HRP-basic-isozyme-encoding gene, prxC3, and coded for 349 amino acids (aa) with a sequence homology of 89% to that encoded by prxC3. The prxCa gene was very close to the HRP-neutral-isozyme-encoding gene, prxC1b, and coded for 354 aa with 91% homology to that encoded by prxC1b. The aa sequence homology was 64% between the two peroxidases encoded by prxCa and prxEa.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Feder, J.N.; Jan, L.Y.; Jan, Y.N.
The Drosophila hairy gene encodes a basic helix- loop-helix protein that functions in at least two steps during Drosophila development: (1) during embryogenesis, when it partakes in the establishment of segments, and (2) during the larval stage, when it functions negatively in determining the pattern of sensory bristles on the adult fly. In the rat, a structurally homologous gene (RHL) behaves as an immediate-early gene in its response to growth factors and can, like that in Drosophila, suppress neuronal differentiation events. Here, the authors report the genomic cloning of the human hairy gene homolog (HRY). The coding region of themore » gene is contained within four exons. The predicted amino acid sequence reveals only four amino acid differences between the human and rat genes. Analysis of the DNA sequence 5[prime] to the coding region reveals a putatitve untranslated exon. To increase the value of the HRY gene as a genetic marker and to assess its potential involvement in genetic disorders, they sublocalized the locus to chromosome 3q28-q29 by fluorescence in situ hybridization. 34 refs., 4 figs., 1 tab.« less
Prenatal diagnosis for a Chinese family with a de novo DMD gene mutation
Li, Tao; Zhang, Zhao-jing; Ma, Xin; Lv, Xue; Xiao, Hai; Guo, Qian-nan; Liu, Hong-yan; Wang, Hong-dan; Wu, Dong; Lou, Gui-yu; Wang, Xin; Zhang, Chao-yang; Liao, Shi-xiu
2017-01-01
Abstract Background: Patients with Duchenne muscular dystrophy (DMD) usually have severe and fatal symptoms. At present, there is no effective treatment for DMD, thus it is very important to avoid the birth of children with DMD by effective prenatal diagnosis. We identified a de novo DMD gene mutation in a Chinese family, and make a prenatal diagnosis. Methods: First, multiplex ligation-dependent probe amplification (MLPA) was applied to analyze DMD gene exon deletion/duplication in all family members. The coding sequences of 79 exons in DMD gene were analyzed by Sanger sequencing in the patient; and then according to DMD gene exon mutation in the patient, DMD gene sequencing was performed in the family members. On the basis of results above, the pathogenic mutation in DMD gene was identified. Results: MLPA showed no DMD gene exon deletion/duplication in all family members. Sanger sequencing revealed c.2767_2767delT [p.Ser923LeufsX26] mutation in DMD gene of the patient. Heterozygous deletion mutation (T/-) at this locus was observed in the pregnant woman and her mother and younger sister. The analyses of amniotic fluid samples indicated negative Y chromosome sex-determining gene, no DMD gene exon deletion/duplication, no mutations at c.2767 locus, and the inherited maternal X chromosome different from that of the patient. Conclusion: The pathogenic mutation in DMD gene, c.2767_2767delT [p.Ser923LeufsX26], identified in this family is a de novo mutation. On the basis of specific conditions, it is necessary to select suitable methods to make prenatal diagnosis more effective, accurate, and economic. PMID:29390271
Mutations in GNA11 in Uveal Melanoma
Van Raamsdonk, Catherine D.; Griewank, Klaus G.; Crosby, Michelle B.; Garrido, Maria C.; Vemula, Swapna; Wiesner, Thomas; Obenauf, Anna C.; Wackernagel, Werner; Green, Gary; Bouvier, Nancy; Sozen, M. Mert; Baimukanova, Gail; Roy, Ritu; Heguy, Adriana; Dolgalev, Igor; Khanin, Raya; Busam, Klaus; Speicher, Michael R.; O’Brien, Joan; Bastian, Boris C.
2011-01-01
BACKGROUND Uveal melanoma is the most common intraocular cancer. There are no effective therapies for metastatic disease. Mutations in GNAQ, the gene encoding an alpha subunit of heterotrimeric G proteins, are found in 40% of uveal melanomas. METHODS We sequenced exon 5 of GNAQ and GNA11, a paralogue of GNAQ, in 713 melanocytic neoplasms of different types (186 uveal melanomas, 139 blue nevi, 106 other nevi, and 282 other melanomas). We sequenced exon 4 of GNAQ and GNA11 in 453 of these samples and in all coding exons of GNAQ and GNA11 in 97 uveal melanomas and 45 blue nevi. RESULTS We found somatic mutations in exon 5 (affecting Q209) and in exon 4 (affecting R183) in both GNA11 and GNAQ, in a mutually exclusive pattern. Mutations affecting Q209 in GNA11 were present in 7% of blue nevi, 32% of primary uveal melanomas, and 57% of uveal melanoma metastases. In contrast, we observed Q209 mutations in GNAQ in 55% of blue nevi, 45% of uveal melanomas, and 22% of uveal melanoma metastases. Mutations affecting R183 in either GNAQ or GNA11 were less prevalent (2% of blue nevi and 6% of uveal melanomas) than the Q209 mutations. Mutations in GNA11 induced spontaneously metastasizing tumors in a mouse model and activated the mitogen-activated protein kinase pathway. CONCLUSIONS Of the uveal melanomas we analyzed, 83% had somatic mutations in GNAQ or GNA11. Constitutive activation of the pathway involving these two genes appears to be a major contributor to the development of uveal melanoma. (Funded by the National Institutes of Health and others.) PMID:21083380
Vysokovsky, A; Saxena, R; Landau, M; Zivelin, A; Eskaraev, R; Rosenberg, N; Seligsohn, U; Inbal, A
2004-10-01
Hereditary factor (F)XIII deficiency is a rare bleeding disorder mostly due to mutations in FXIII A subunit. We studied the molecular basis of FXIII deficiency in patients from 10 unrelated families originating from Israel, India and Tunisia. Exons 2-15 of genomic DNA consisting of coding regions and intron/exon boundaries were amplified and sequenced. Structural analysis of the mutations was undertaken by computer modeling. Seven novel mutations were identified in the FXIIIA gene. The propositus from the Ethiopian-Jewish family was found to be a compound heterozygote for two novel mutations: a 10-bp deletion in exon 12 at nucleotides 1652-1661 (followed by 22 altered amino acids and termination codon) and Ala318Val mutation. The propositus of the Tunisian family was homozygous for C insertion after nucleotide 863 within a stretch of six cytosines of exon 7. This insertion results in generation of eight altered amino acids followed by a termination codon downstream. The propositus from Indian-Jewish origin was found to be homozygous for G to T substitution at IVS 11 [+1] resulting in skipping of exons 10 and 11. In addition to the Ala318Val mutation, three of the novel mutations identified are missense mutations: Arg260Leu, Thr398Asn and Gly210Arg each occurring in a homozygous state in an Israeli-Arab and two Indian families, respectively. Structure-function correlation analysis by computer modeling of the new missense mutations predicted that Gly210Arg will cause protein misfolding, Ala318Val and Thr398Asn will interfere with the catalytic process or protein stability, and Arg260Leu will impair dimerization.
Fang, Wenfeng; Yan, Yue; Hu, Zhihuang; Hong, Shaodong; Wu, Xuan; Qin, Tao; Liang, Wenhua; Zhang, Li
2014-01-01
Backgrounds It has been extensively proved that the efficacy of epidermal growth factor receptor-tyrosine kinase inhibitors (EGFR-TKIs) is superior to that of cytotoxic chemotherapy in advanced non-small cell lung cancer (NSCLC) patients harboring sensitive EGFR mutations. However, the question of whether the efficacy of EGFR-TKIs differs between exon 19 deletion and exon 21 L858R mutation has not been yet statistically answered. Methods Subgroup data on hazard ratio (HR) for progression-free survival (PFS) of correlative studies were extracted and synthesized based on random-effect model. Comparison of outcomes between specific mutations was estimated through indirect and direct methods, respectively. Results A total of 13 studies of advanced NSCLC patients with either 19 or 21 exon alteration receiving first-line EGFR-TKIs were included. Based on the data from six clinical trials for indirect meta-analysis, the pooled HRTKI/chemotherapy for PFS were 0.28 (95% CI 0.20–0.38, P<0.001) in patients with 19 exon deletion and 0.47 (95% CI 0.35–0.64, P<0.001) in those with exon 21 L858R mutation. Indirect comparison revealed that the patients with exon 19 deletion had longer PFS than those with exon 21 L858R mutation (HR19 exon deletion/exon 21 L858R mutation = 0.59, 95% CI 0.38–0.92; P = 0.019). Additionally, direct meta-analysis showed similar result (HR19 exon deletion/exon 21 L858R mutation = 0.75, 95% CI 0.65 to 0.85; P<0.001) by incorporating another seven studies. Conclusions For advanced NSCLC patients, exon 19 deletion might be associated with longer PFS compared to L858 mutation at exon 21 after first-line EGFR-TKIs. PMID:25222496
Dunnick, Wesley A; Shi, Jian; Holden, Victoria; Fontaine, Clinton; Collins, John T
2011-01-01
Germline transcription precedes class switch recombination (CSR). The promoter regions and I exons of these germline transcripts include binding sites for activation- and cytokine-induced transcription factors, and the promoter regions/I exons are essential for CSR. Therefore, it is a strong hypothesis that the promoter/I exons regions are responsible for much of cytokine-regulated, gene-specific CSR. We tested this hypothesis by swapping the germline promoter and I exons for the murine γ1 and γ2a H chain genes in a transgene of the entire H chain C-region locus. We found that the promoter/I exon for γ1 germline transcripts can direct robust IL-4-induced recombination to the γ2a gene. In contrast, the promoter/I exon for the γ2a germline transcripts works poorly in the context of the γ1 H chain gene, resulting in expression of γ1 H chains that is <1% the wild-type level. Nevertheless, the small amount of recombination to the chimeric γ1 gene is induced by IFN-γ. These results suggest that cytokine regulation of CSR, but not the magnitude of CSR, is regulated by the promoter/I exons.
Dai, Hanjun; Zhang, Xiaohui; Zhao, Xin; Deng, Ting; Dong, Bing; Wang, Jingzhao; Li, Yang
2008-01-01
Usher syndrome type II (USH2) is the most common form of Usher syndrome, an autosomal recessive disorder characterized by moderate to severe hearing loss, postpuberal onset of retinitis pigmentosa (RP), and normal vestibular function. Mutations in the USH2A gene have been shown to be responsible for most cases of USH2. To further elucidate the role of USH2A in USH2, mutation screening was undertaken in three Chinese families with USH2. Three unrelated Chinese families, consisting of six patients and 10 unaffected relatives, were examined clinically, and 100 normal Chinese individuals served as controls. Genomic DNA was extracted from the venous blood of all participants. The coding region (exons 2-72), including the intron-exon boundary of USH2A, was amplified by polymerase chain reaction (PCR). The PCR products amplified from the three probands were analyzed using direct sequencing to screen sequence variants. Whenever substitutions were identified in a patient, restriction fragment length polymorphism analysis, or single strand conformation polymorphism analysis was performed on all available family members and the control group. Fundus examination revealed typical fundus features of RP, including narrowing of the vessels, bone-speckle pigmentation, and waxy optic discs. The ERG wave amplitudes of three probands were undetectable. Audiometric tests indicated moderate to severe sensorineural hearing impairment. Vestibular function was normal. Five novel mutations (one small insertion, one small deletion, one nonsense, one missense, and one splice site) were detected in three families after sequence analysis of USH2A. Of the five mutations, four were located in exons 22-72, specific to the long isoform of USH2A. The mutations found in our study broaden the spectrum of USH2A mutations. Our results further indicate that the long isoform of USH2A may harbor even more mutations of the USH2A gene.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eipers, P.G.
1992-01-01
The gene for the human p58[sup clk[minus]1] protein kinase, a cell division control-related gene, has been mapped by somatic cell hybrid analyses, in situ localization with the chromosomal gene, and nested polymerase chain reaction amplification of microdissected chromosomes. These studies indicate that the expressed p58[sup clk[minus]1] chromosomal gene maps to 1p36, while a highly related p58[sup clk[minus]1] sequence of unknown nature maps to chromosome 15. Assignment of a p34[sup cdc2]-related gene to 1p36 region, including neuroblastoma, ductal carcinoma of the breast, malignant melanoma, Merkel cell carcinoma and endocrine neoplasia among others. Aberrant expression of this protein kinase negatively regulates normalmore » cellular growth. The p58[sup clk[minus]1] protein contains a central domain of 299 amino acids that is 46% identical to human p34[sup cdc2], the master mitotic protein kinase. This dissertation details the complete structure of the p58[sup clk[minus]1] chromosomal gene, including its putative promoter region, transcriptional start sites, exonic sequences, and intron/exon boundary sequences. The gene is 10 kb in size and contains 12 exons and 11 introns. Interestingly, the rather large 2.0 kb 3[prime] untranslated region is interrupted by an intron that separates a region containing numerous AUUUA destabilization motifs from the coding region. Furthermore, the expression of this gene in normal human tissues, as well as several human tumor cell samples and lines, is examined. The origin of multiple human transcripts from the same chromosomal gene, and the possible differential stability of these various transcripts, is discussed with regard to the transcriptional and post-transcriptional regulation of this gene. This is the first report of the chromosomal gene structure of a member of the p34[sup cdc2] supergene family.« less
Aliferis, K; Hellé, S; Gyapay, G; Duchatelet, S; Stoetzel, C; Mandel, J L; Dollfus, H
2012-03-01
Early onset retinal degeneration associated with obesity can present a diagnostic challenge in paediatric ophthalmology practice. Clinical overlap between Bardet-Biedl syndrome (BBS) and Alström syndrome has been described, although the two entities are genetically distinct. To date, 16 genes are known to be associated with BBS (BBS1-16) and only one gene has been identified for Alström syndrome (ALMS1). In collaboration with the French National Center for Sequencing (CNS, Evry), all coding exons and flanking introns were sequenced for 27 ciliopathy genes (BBS1-12, MGC1203, TTC21b, AHI1, NPHP2-8 (NPHP6=BBS14), MKS1(BBS13), MKS3, C2ORF86, SDCCAG8, ALMS1) in 96 patients referred with a clinical diagnosis of BBS. ALMS1 gene analysis included sequencing of all coding exons. BBS known gene mutations were found in 44 patients (36 with two mutations and 8 heterozygous). ALMS1 mutations were found in four cases. The rate of ALMS1 mutations among patients suspected of having BBS was 4.2%. Clinically, all four patients presented early-onset severe retinal degeneration with congenital nystagmus associated with obesity. The difficult early differential diagnosis between the two syndromes is outlined. One mutation had already been reported (c.11310delAGAG/p.R3770fsX) and three were novel (c.2293C > T/p.Q765X, c.6823insA/p.R2275fsX, c.9046delA/p.N3016fsX). Ciliopathy genes sequencing can be very helpful in providing a timely diagnosis in this group of patients, hence appropriate genetic counselling for families and adequate medical follow-up for affected children.
Saha, Jayita; Giri, Kalyan
2017-04-20
Compelling evidences anticipated the well acclamation of involvement of exogenous and endogenous polyamines (PAs) in conferring salt tolerance in plants. Intracellular PA's anabolism and catabolism should have contributed to maintain endogenous PAs homeostasis to induce stress signal networks. In this report, the evolutionary study has been conducted to reveal the phylogenetic relationship of genes encoding enzymes of the anabolic and catabolic pathway of PAs among the five plant lineages including green algae, moss, lycophyte, dicot and monocot along with their respective exon-intron structural patterns. Our results indicated that natural selection pressure had considerable influence on the ancestral PA metabolic pathway coding genes of land plants. PA metabolic genes have undergone gradual evolution by duplication and diversification process leading to subsequent structural modification through exon-intron gain and loss events to acquire specific function under environmental stress conditions. We have illuminated on the potential regulation of both the pathways by investigating the real-time expression analyses of PA metabolic pathway related enzyme coding genes at the transcriptional level in root and shoot tissues of two indica rice varieties, namely IR 36 (salt sensitive) and Nonabokra (salt-tolerant) in response to salinity in presence or absence of exogenous spermidine (Spd) treatment. Additionally, we have performed tissue specific quantification of the intracellular PAs and tried to draw probable connection between the PA metabolic pathway activation and endogenous PAs accumulation. Our results successfully enlighten the fact that how exogenous Spd in presence or absence of salt stress adjust the intracellular PA pathways to equilibrate the cellular PAs that would have been attributed to plant salt tolerance. Copyright © 2017 Elsevier B.V. All rights reserved.
(Epi)genotype-Phenotype Analysis in 69 Japanese Patients With Pseudohypoparathyroidism Type I
Sano, Shinichiro; Nakamura, Akie; Matsubara, Keiko; Nagasaki, Keisuke; Fukami, Maki; Kagami, Masayo
2018-01-01
Context: Pseudohypoparathyroidism type I (PHP-I) is divided into PHP-Ia with Albright hereditary osteodystrophy and PHP-Ib, which usually shows no Albright hereditary osteodystrophy features. Although PHP-Ia and PHP-Ib are typically caused by genetic defects involving α subunit of the stimulatory G protein (Gsα)–coding GNAS exons and methylation defects of the GNAS differentially methylated regions (DMRs) on the maternal allele, respectively, detailed phenotypic characteristics still remains to be examined. Objective: To clarify phenotypic characteristics according to underlying (epi)genetic causes. Patients and Methods: We performed (epi)genotype-phenotype analysis in 69 Japanese patients with PHP-I; that is, 28 patients with genetic defects involving Gsα-coding GNAS exons (group 1) consisting of 12 patients with missense variants (subgroup A) and 16 patients with null variants (subgroup B), as well as 41 patients with methylation defects (group 2) consisting of 21 patients with broad methylation defects of the GNAS-DMRs (subgroup C) and 20 patients with an isolated A/B-DMR methylation defect accompanied by the common STX16 microdeletion (subgroup D). Results: Although (epi)genotype-phenotype findings were grossly similar to those reported previously, several important findings were identified, including younger age at hypocalcemic symptoms and higher frequencies of hyperphosphatemia in subgroup C than in subgroup D, development of brachydactyly in four patients of subgroup C, predominant manifestation of subcutaneous ossification in subgroup B, higher frequency of thyrotropin resistance in group 1 than in group 2, and relatively low thyrotropin values in four patients with low T4 values and relatively low luteinizing hormone/follicle-stimulating hormone values in five adult females with ovarian dysfunction. Conclusion: The results imply the presence of clinical findings characteristic of each underlying cause and provide useful information on the imprinting status of Gsα. PMID:29379892
Suzuki, Takashi; Brown, Judy J.; Swift, Larry L.
2016-01-01
Microsomal triglyceride transfer protein (MTP) is essential for the assembly of triglyceride-rich apolipoprotein B-containing lipoproteins. Previous studies in our laboratory identified a novel splice variant of MTP in mice that we named MTP-B. MTP-B has a unique first exon (1B) located 2.7 kB upstream of the first exon (1A) for canonical MTP (MTP-A). The two mature isoforms, though nearly identical in sequence and function, have different tissue expression patterns. In this study we report the identification of a second MTP splice variant (MTP-C), which contains both exons 1B and 1A. MTP-C is expressed in all the tissues we tested. In cells transfected with MTP-C, protein expression was less than 15% of that found when the cells were transfected with MTP-A or MTP-B. In silico analysis of the 5’-UTR of MTP-C revealed seven ATGs upstream of the start site for MTP-A, which is the only viable start site in frame with the main coding sequence. One of those ATGs was located in the 5’-UTR for MTP-A. We generated reporter constructs in which the 5’-UTRs of MTP-A or MTP-C were inserted between an SV40 promoter and the coding sequence of the luciferase gene and transfected these constructs into HEK 293 cells. Luciferase activity was significantly reduced by the MTP-C 5’-UTR, but not by the MTP-A 5’-UTR. We conclude that alternative splicing plays a key role in regulating MTP expression by introducing unique 5’-UTRs, which contain elements that alter translation efficiency, enabling the cell to optimize MTP levels and activity. PMID:26771188
Gonzalez, Luis Miguel; Bonay, Pedro; Benitez, Laura; Ferrer, Elizabeth; Harrison, Leslie J S; Parkhouse, R Michael E; Garate, Teresa
2007-02-01
Two clones from an activated Taenia saginata oncosphere cDNA library, Ts45W and Ts45S, were isolated and sequenced. Both of these genes belong to the Taenia ovis 45W gene family. The Ts45W and Ts45S cDNAs are 997- and 1,004-bp-long, each corresponding to 255 amino acids and with theoretical molecular masses of 27.8 and 27.7 kDa, respectively. Southern blot profiles obtained with Ts45W cDNA as a probe suggest that these two genes are members of a multigene family with tandem organization. The full genomic sequence was determined for the Ts45W gene and a new family member, the Ts45W/2 gene. The genomic sequences of the T. saginata Ts45W and Ts45W/2 genes were at least 2.2 kb in length with four exons separated by three introns. Exons 1 and 4 coded for hydrophobic domains, while, importantly, exons 2 and 3 coded for fibronectin homologous domains. These domains are presumably responsible for the demonstrated cell adhesion and, perhaps, the protective nature of this family of molecules and the acronym TAF (Taenia adhesion family) is proposed for this group of genes. We hypothesize that these TAF proteins and another T. saginata-protective antigen, HP6, have evolved the dual functions of facilitating tissue invasion and stimulating protective immunity to first ensure primary infection and subsequently to establish a concomitant protective immunity to protect the host from death or debilitation through superinfection by subsequent infections and thus help ensure parasite survival.
Screening for microsatellite instability target genes in colorectal cancers
Vilkki, S; Launonen, V; Karhu, A; Sistonen, P; Vastrik, I; Aaltonen, L
2002-01-01
Background: Defects in the DNA repair system lead to genetic instability because replication errors are not corrected. This type of genetic instability is a key event in the malignant progression of HNPCC and a subset of sporadic colon cancers and mutation rates are particularly high at short repetitive sequences. Somatic deletions of coding mononucleotide repeats have been detected, for example, in the TGFßRII and BAX genes, and recently many novel target genes for microsatellite instability (MSI) have been proposed. Novel target genes are likely to be discovered in the future. More data should be created on background mutation rates in MSI tumours to evaluate mutation rates observed in the candidate target genes. Methods: Mutation rates in 14 neutral intronic repeats were evaluated in MSI tumours. Bioinformatic searches combined with keywords related to cancer and tumour suppressor or CRC related gene homology were used to find new candidate MSI target genes. By comparison of mutation frequencies observed in intronic mononucleotide repeats versus exonic coding repeats of potential MSI target genes, the significance of the exonic mutations was estimated. Results: As expected, the length of an intronic mononucleotide repeat correlated positively with the number of slippages for both G/C and A/T repeats (p=0.0020 and p=0.0012, respectively). BRCA1, CtBP1, and Rb1 associated CtIP and other candidates were found in a bioinformatic search combined with keywords related to cancer. Sequencing showed a significantly increased mutation rate in the exonic A9 repeat of CtIP (25/109=22.9%) as compared with similar intronic repeats (p≤0.001). Conclusions: We propose a new candidate MSI target gene CtIP to be evaluated in further studies. PMID:12414815
Yao, Q; Fischer, K P; Tyrrell, D L; Gutfreund, K S
2015-04-01
Programmed death ligand-1 (PD-L1) plays an important role in the attenuation of adaptive immune responses in higher vertebrates. Here, we describe the identification of the Pekin duck PD-L1 orthologue (duPD-L1) and its gene structure. The duPD-L1 cDNA encodes a 311-amino acid protein that has an amino acid identity of 78% and 42% with chicken and human PD-L1, respectively. Mapping of the duPD-L1 cDNA with duck genomic sequences revealed an exonic structure of its coding sequence similar to those of other vertebrates but lacked a noncoding exon 1. Homology modelling of the duPD-L1 extracellular domain was compatible with the tandem IgV-like and IgC-like IgSF domain structure of human PD-L1 (PDB ID: 3BIS). Residues known to be important for receptor binding of human PD-L1 were mostly conserved in duPD-L1 within the N-terminus and the G sheet, and partially conserved within the F sheet but not within sheets C and C'. DuPD-L1 mRNA was constitutively expressed in all tissues examined with highest expression levels in lung and spleen and very low levels of expression in muscle, kidney and brain. Mitogen stimulation of duck peripheral blood mononuclear cells transiently increased duPD-L1 mRNA expression. Our observations demonstrate evolutionary conservation of the exonic structure of its coding sequence, the extracellular domain structure and residues implicated in receptor binding, but the role of the longer cytoplasmic tail in avian PD-L1 proteins remains to be determined. © 2014 John Wiley & Sons Ltd.
Barley whole exome capture: a tool for genomic research in the genus Hordeum and beyond
Mascher, Martin; Richmond, Todd A; Gerhardt, Daniel J; Himmelbach, Axel; Clissold, Leah; Sampath, Dharanya; Ayling, Sarah; Steuernagel, Burkhard; Pfeifer, Matthias; D'Ascenzo, Mark; Akhunov, Eduard D; Hedley, Pete E; Gonzales, Ana M; Morrell, Peter L; Kilian, Benjamin; Blattner, Frank R; Scholz, Uwe; Mayer, Klaus FX; Flavell, Andrew J; Muehlbauer, Gary J; Waugh, Robbie; Jeddeloh, Jeffrey A; Stein, Nils
2013-01-01
Advanced resources for genome-assisted research in barley (Hordeum vulgare) including a whole-genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole-genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA-coding exome reduces barley genomic complexity more than 50-fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in-solution hybridization-based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full-length cDNAs and de novo assembled RNA-Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA-coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping-by-sequencing and genetic diversity analyzes. PMID:23889683
A Mutation of the Prdm9 Mouse Hybrid Sterility Gene Carried by a Transgene.
Mihola, O; Trachtulec, Z
2017-01-01
PRDM9 is a protein with histone-3-methyltransferase activity, which specifies the sites of meiotic recombination in mammals. Deficiency of the Prdm9 gene in the laboratory mouse results in complete arrest of the meiotic prophase of both sexes. Moreover, the combination of certain PRDM9 alleles from different mouse subspecies causes hybrid sterility, e.g., the male-specific meiotic arrest found in the (PWD/Ph × C57BL/6J)F1 animals. The fertility of all these mice can be rescued using a Prdm9-containing transgene. Here we characterized a transgene made from the clone RP24-346I22 that was expected to encompass the entire Prdm9 gene. Both (PWD/Ph × C57BL/6J)F1 intersubspecific hybrid males and Prdm9-deficient laboratory mice of both sexes carrying this transgene remained sterile, suggesting that Prdm9 inactivation occurred in the Tg(RP24-346I22) transgenics. Indeed, comparative qRT-PCR analysis of testicular RNAs from transgene-positive versus negative animals revealed similar expression levels of Prdm9 mRNAs from the exons encoding the C-terminal part of the protein but elevated expression from the regions coding for the N-terminus of PRDM9, indicating that the transgenic carries a new null Prdm9 allele. Two naturally occurring alternative Prdm9 mRNA isoforms were overexpressed in Tg(RP24-346I22), one formed via splicing to a 3'-terminal exon consisting of short interspersed element B2 and one isoform including an alternative internal exon of 28 base pairs. However, the overexpression of these alternative transcripts was apparently insufficient for Prdm9 function or for increasing the fertility of the hybrid males.
Mechanisms and Regulation of Alternative Pre-mRNA Splicing
Lee, Yeon
2015-01-01
Precursor messenger RNA (pre-mRNA) splicing is a critical step in the posttranscriptional regulation of gene expression, providing significant expansion of the functional proteome of eukaryotic organisms with limited gene numbers. Split eukaryotic genes contain intervening sequences or introns disrupting protein-coding exons, and intron removal occurs by repeated assembly of a large and highly dynamic ribonucleoprotein complex termed the spliceosome, which is composed of five small nuclear ribonucleoprotein particles, U1, U2, U4/U6, and U5. Biochemical studies over the past 10 years have allowed the isolation as well as compositional, functional, and structural analysis of splicing complexes at distinct stages along the spliceosome cycle. The average human gene contains eight exons and seven introns, producing an average of three or more alternatively spliced mRNA isoforms. Recent high-throughput sequencing studies indicate that 100% of human genes produce at least two alternative mRNA isoforms. Mechanisms of alternative splicing include RNA–protein interactions of splicing factors with regulatory sites termed silencers or enhancers, RNA–RNA base-pairing interactions, or chromatin-based effects that can change or determine splicing patterns. Disease-causing mutations can often occur in splice sites near intron borders or in exonic or intronic RNA regulatory silencer or enhancer elements, as well as in genes that encode splicing factors. Together, these studies provide mechanistic insights into how spliceosome assembly, dynamics, and catalysis occur; how alternative splicing is regulated and evolves; and how splicing can be disrupted by cis- and trans-acting mutations leading to disease states. These findings make the spliceosome an attractive new target for small-molecule, antisense, and genome-editing therapeutic interventions. PMID:25784052
Patterson, Emily J.; Wilk, Melissa; Langlo, Christopher S.; Kasilian, Melissa; Ring, Michael; Hufnagel, Robert B.; Dubis, Adam M.; Tee, James J.; Kalitzeos, Angelos; Gardner, Jessica C.; Ahmed, Zubair M.; Sisk, Robert A.; Larsen, Michael; Sjoberg, Stacy; Connor, Thomas B.; Dubra, Alfredo; Neitz, Jay; Hardcastle, Alison J.; Neitz, Maureen; Michaelides, Michel; Carroll, Joseph
2016-01-01
Purpose Mutations in the coding sequence of the L and M opsin genes are often associated with X-linked cone dysfunction (such as Bornholm Eye Disease, BED), though the exact color vision phenotype associated with these disorders is variable. We examined individuals with L/M opsin gene mutations to clarify the link between color vision deficiency and cone dysfunction. Methods We recruited 17 males for imaging. The thickness and integrity of the photoreceptor layers were evaluated using spectral-domain optical coherence tomography. Cone density was measured using high-resolution images of the cone mosaic obtained with adaptive optics scanning light ophthalmoscopy. The L/M opsin gene array was characterized in 16 subjects, including at least one subject from each family. Results There were six subjects with the LVAVA haplotype encoded by exon 3, seven with LIAVA, two with the Cys203Arg mutation encoded by exon 4, and two with a novel insertion in exon 2. Foveal cone structure and retinal thickness was disrupted to a variable degree, even among related individuals with the same L/M array. Conclusions Our findings provide a direct link between disruption of the cone mosaic and L/M opsin variants. We hypothesize that, in addition to large phenotypic differences between different L/M opsin variants, the ratio of expression of first versus downstream genes in the L/M array contributes to phenotypic diversity. While the L/M opsin mutations underlie the cone dysfunction in all of the subjects tested, the color vision defect can be caused either by the same mutation or a gene rearrangement at the same locus. PMID:27447086
Rozhdestvensky, Timofey S.; Robeck, Thomas; Galiveti, Chenna R.; Raabe, Carsten A.; Seeger, Birte; Wolters, Anna; Gubar, Leonid V.; Brosius, Jürgen; Skryabin, Boris V.
2016-01-01
Prader-Willi syndrome (PWS) is a neurogenetic disorder caused by loss of paternally expressed genes on chromosome 15q11-q13. The PWS-critical region (PWScr) contains an array of non-protein coding IPW-A exons hosting intronic SNORD116 snoRNA genes. Deletion of PWScr is associated with PWS in humans and growth retardation in mice exhibiting ~15% postnatal lethality in C57BL/6 background. Here we analysed a knock-in mouse containing a 5′HPRT-LoxP-NeoR cassette (5′LoxP) inserted upstream of the PWScr. When the insertion was inherited maternally in a paternal PWScr-deletion mouse model (PWScrp−/m5′LoxP), we observed compensation of growth retardation and postnatal lethality. Genomic methylation pattern and expression of protein-coding genes remained unaltered at the PWS-locus of PWScrp−/m5′LoxP mice. Interestingly, ubiquitous Snord116 and IPW-A exon transcription from the originally silent maternal chromosome was detected. In situ hybridization indicated that PWScrp−/m5′LoxP mice expressed Snord116 in brain areas similar to wild type animals. Our results suggest that the lack of PWScr RNA expression in certain brain areas could be a primary cause of the growth retardation phenotype in mice. We propose that activation of disease-associated genes on imprinted regions could lead to general therapeutic strategies in associated diseases. PMID:26848093
Dimitrova, Desislava; Ruscito, Ilary; Olek, Sven; Richter, Rolf; Hellwag, Alexander; Türbachova, Ivana; Woopen, Hannah; Baron, Udo; Braicu, Elena Ioana; Sehouli, Jalid
2016-09-01
Germline mutations in BRCA1 gene have been reported in up to 20 % of epithelial ovarian cancer (EOC) patients. Distinct clinical characteristics have been attributed to this special EOC population. We hypothesized that mutations in different BRCA1 gene exons may differently affect the clinical course of the disease. The aim of this study was to analyze, in a large cohort of primary EOCs, the clinical impact of mutations in BRCA1 gene exon 11, the largest exon of the gene sequence encoding the 60 % of BRCA1 protein. Two hundred sixty-three primary EOC patients, treated between 2000 and 2008 at Charité University Hospital of Berlin, were included. Patients' blood samples were obtained from the Tumor Ovarian Cancer (TOC) Network ( www.toc-network.de ). Direct sequencing of BRCA1 gene exon 11 was performed for each patient to detect mutations. Based on their BRCA1 exon 11 mutational status, patients were compared regarding clinico-pathological variables and survival. Mutations in BRCA1 exon 11 were found in 18 out of 263 patients (6.8 %). Further 10/263 (3.8 %) cases showed variants of uncertain significance (VUS). All exon 11 BRCA1-positive tumors (100 %) were Type 2 ovarian carcinomas (p = 0.05). Age at diagnosis was significantly younger in Type 2 exon 11 mutated patients (p = 0.01). On multivariate analysis, BRCA1 exon 11 mutational status was not found to be an independent predictive factor for optimal cytoreduction, platinum response, or survival. Mutations in BRCA1 gene exon 11 seem to predispose women to exclusively develop a Type 2 ovarian cancer at younger age. Exon 11 BRCA1-mutated EOC patients showed distinct clinico-pathological features but similar clinical outcome with respect to sporadic EOC patients.
Kongchum, Pawapol; Hallerman, Eric M; Hulata, Gideon; David, Lior; Palti, Yniv
2011-01-01
Induction of innate immune pathways is critical for early host defense, but there is limited understanding of how teleost fishes recognize pathogen molecules and activate these pathways. In mammals, cells of the innate immune system detect pathogenic molecular structures using pattern recognition receptors (PRRs). TLR9 functions as a PRR that recognizes CpG motifs in bacterial and viral DNA and requires adaptor molecules MyD88 and TRAF6 for signal transduction. Here we report full-length cDNA isolation, structural characterization and tissue mRNA expression analysis of the common carp (cc) TLR9, MyD88 and TRAF6 gene orthologs. The ccTLR9 open-reading frame (ORF) is predicted to encode a 1064-amino acid (aa) protein. We found that MyD88 and TRAF6 genes are duplicated in common carp. This is the first report of TRAF6 duplication in a vertebrate genome and stronger evidence in support of MyD88 duplication is provided. The ccMyD88a and b ORFs are predicted to encode 288-aa and 284-aa peptides, respectively. They share 91% aa sequence identity between paralogs. The ccTRAF6a and b ORFs are both predicted to encode 543-aa peptides sharing 95% aa sequence identity between paralogs. The ccTLR9 gene is contained in a single large exon. The ccMyD88a and ccMyD88b coding sequences span five exons. The TRAF6b gene spans six exons. PCR amplification to obtain the entire coding sequence of ccTRAF6a gene was not successful. The 2104-bp fragment amplified covers the 3' end of the gene and it contains a partial sequence of one exon and three complete exons. The predicated protein domains of the ccTLR9, ccMyD88 and ccTRAF6 are conserved and resemble orthologs from other vertebrates. Real-time quantitative PCR assays of the ccTLR9, MyD88a and b, and TRAF6a and b gene transcripts in healthy common carp indicated that mRNA expression varied between tissues. Differential expression of duplicate copies were found for ccMyD88 and ccTRAF6 in white and red muscle tissues, suggesting that paralogs may have evolved and attained a new function. The genomic information we describe in this paper provides evidence of sequence and structural conservation of immune response genes in common carp. Published by Elsevier Ltd.
The landscape of cancer genes and mutational processes in breast cancer
Stephens, Philip J.; Tarpey, Patrick S.; Davies, Helen; Loo, Peter Van; Greenman, Chris; Wedge, David C.; Nik-Zainal, Serena; Martin, Sancha; Varela, Ignacio; Bignell, Graham R.; Yates, Lucy R.; Papaemmanuil, Elli; Beare, David; Butler, Adam; Cheverton, Angela; Gamble, John; Hinton, Jonathan; Jia, Mingming; Jayakumar, Alagu; Jones, David; Latimer, Calli; Lau, King Wai; McLaren, Stuart; McBride, David J.; Menzies, Andrew; Mudie, Laura; Raine, Keiran; Rad, Roland; Chapman, Michael Spencer; Teague, Jon; Easton, Douglas; Langerød, Anita; OSBREAC; Lee, Ming Ta Michael; Shen, Chen-Yang; Tee, Benita Tan Kiat; Huimin, Bernice Wong; Broeks, Annegien; Vargas, Ana Cristina; Turashvili, Gulisa; Martens, John; Fatima, Aquila; Miron, Penelope; Chin, Suet-Feung; Thomas, Gilles; Boyault, Sandrine; Mariani, Odette; Lakhani, Sunil R.; van de Vijver, Marc; van ’t Veer, Laura; Foekens, John; Desmedt, Christine; Sotiriou, Christos; Tutt, Andrew; Caldas, Carlos; Reis-Filho, Jorge S.; Aparicio, Samuel A. J. R.; Salomon, Anne Vincent; Børresen-Dale, Anne-Lise; Richardson, Andrea L.; Campbell, Peter J.; Futreal, P. Andrew; Stratton, Michael R.
2012-01-01
All cancers carry somatic mutations in their genomes. A subset, known as driver mutations, confer clonal selective advantage on cancer cells and are causally implicated in oncogenesis1, and the remainder are passenger mutations. The driver mutations and mutational processes operative in breast cancer have not yet been comprehensively explored. Here we examine the genomes of 100 tumours for somatic copy number changes and mutations in the coding exons of protein-coding genes. The number of somatic mutations varied markedly between individual tumours. We found strong correlations between mutation number, age at which cancer was diagnosed and cancer histological grade, and observed multiple mutational signatures, including one present in about ten per cent of tumours characterized by numerous mutations of cytosine at TpC dinucleotides. Driver mutations were identified in several new cancer genes including AKT2, ARID1B, CASP8, CDKN1B, MAP3K1, MAP3K13, NCOR1, SMARCD1 and TBX3. Among the 100 tumours, we found driver mutations in at least 40 cancer genes and 73 different combinations of mutated cancer genes. The results highlight the substantial genetic diversity underlying this common disease. PMID:22722201
Genetic variations of the SLCO1B1 gene in the Chinese, Malay and Indian populations of Singapore.
Ho, Woon Fei; Koo, Seok Hwee; Yee, Jie Yin; Lee, Edmund Jon Deoon
2008-01-01
OATP1B1 is a liver-specific transporter that mediates the uptake of various endogenous and exogenous compounds including many clinically used drugs from blood into hepatocytes. This study aims to identify genetic variations of SLCO1B1 gene in three distinct ethnic groups of the Singaporean population (n=288). The coding region of the gene encoding the transporter protein was screened for genetic variations in the study population by denaturing high-performance liquid chromatography and DNA sequencing. Twenty-five genetic variations of SLCO1B1, including 10 novel ones, were found: 13 in the coding exons (9 nonsynonymous and 4 synonymous variations), 6 in the introns, and 6 in the 3' untranslated region. Four novel nonsynonymous variations: 633A>G (Ile211Met), 875C>T (Ala292Val), 1837T>C (Cys613Arg), and 1877T>A (Leu626Stop) were detected as heterozygotes. Among the novel nonsynonymous variations, 633A>G, 1837T>C, and 1877T>A were predicted to be functionally significant. These data would provide fundamental and useful information for pharmacogenetic studies on drugs that are substrates of OATP1B1 in Asians.
Kuroyanagi, Hidehito; Watanabe, Yohei; Suzuki, Yutaka; Hagiwara, Masatoshi
2013-01-01
A large fraction of protein-coding genes in metazoans undergo alternative pre-mRNA splicing in tissue- or cell-type-specific manners. Recent genome-wide approaches have identified many putative-binding sites for some of tissue-specific trans-acting splicing regulators. However, the mechanisms of splicing regulation in vivo remain largely unknown. To elucidate the modes of splicing regulation by the neuron-specific CELF family RNA-binding protein UNC-75 in Caenorhabditis elegans, we performed deep sequencing of poly(A)+ RNAs from the unc-75(+)- and unc-75-mutant worms and identified more than 20 cassette and mutually exclusive exons repressed or activated by UNC-75. Motif searches revealed that (G/U)UGUUGUG stretches are enriched in the upstream and downstream introns of the UNC-75-repressed and -activated exons, respectively. Recombinant UNC-75 protein specifically binds to RNA fragments carrying the (G/U)UGUUGUG stretches in vitro. Bi-chromatic fluorescence alternative splicing reporters revealed that the UNC-75-target exons are regulated in tissue-specific and (G/U)UGUUGUG element-dependent manners in vivo. The unc-75 mutation affected the splicing reporter expression specifically in the nervous system. These results indicate that UNC-75 regulates alternative splicing of its target exons in neuron-specific and position-dependent manners through the (G/U)UGUUGUG elements in C. elegans. This study thus reveals the repertoire of target events for the CELF family in the living organism. PMID:23416545
De novo insertion of an intron into the mammalian sex determining gene, SRY
O’Neill, Rachel J. Waugh; Brennan, Francine E.; Delbridge, Margaret L.; Crozier, Ross H.; Graves, Jennifer A. Marshall
1998-01-01
Two theories have been proposed to explain the evolution of introns within eukaryotic genes. The introns early theory, or “exon theory of genes,” proposes that introns are ancient and that recombination within introns provided new exon structure, and thus new genes. The introns late theory, or “insertional theory of introns,” proposes that ancient genes existed as uninterrupted exons and that introns have been introduced during the course of evolution. There is still controversy as to how intron–exon structure evolved and whether the majority of introns are ancient or novel. Although there is extensive evidence in support of the introns early theory, phylogenetic comparisons of several genes indicate recent gain and loss of introns within these genes. However, no example has been shown of a protein coding gene, intronless in its ancestral form, which has acquired an intron in a derived form. The mammalian sex determining gene, SRY, is intronless in all mammals studied to date, as is the gene from which it recently evolved. However, we report here comparisons of genomic and cDNA sequences that now provide evidence of a de novo insertion of an intron into the SRY gene of dasyurid marsupials. This recently (approximately 45 million years ago) inserted sequence is not homologous with known transposable elements. Our data demonstrate that introns may be inserted as spliced units within a developmentally crucial gene without disrupting its function. PMID:9465071
Pastor, André F.; Moura, Laís Rodrigues; Neto, José W.D.; Nascimento, Eduardo J.M.; Calzavara-Silva, Carlos E.; Gomes, Ana Lisa V.; da Silva, Ana Maria; Cordeiro, Marli T.; Braga-Neto, Ulisses; Crovella, Sergio; Gil, Laura H.V.G.; Marques, Ernesto T.A.; Acioli-Santos, Bartolomeu
2013-01-01
Four genetic polymorphisms located at the promoter (C-257T) and coding regions of CFH gene (exon 2 G257A, exon 14 A2089G and exon 19 G2881T) were investigated in 121 dengue patients (DENV-3) in order to assess the relationship between allele/haplotypes variants and clinical outcomes. A statistical value was found between the CFH-257T allele (TT/TC genotypes) and reduced susceptibility to severe dengue (SD). Statistical associations indicate that individuals bearing a T allele presented significantly higher protein levels in plasma. The –257T variant is located within a NF-κB binding site, suggesting that this variant might have effect on the ability of the CFH gene to respond to signals via the NF-κB pathway. The G257A allelic variant showed significant protection against severe dengue. When CFH haplotypes effect was considered, the ancestral CG/CG promoter-exon 2 SNP genotype showed significant risk to SD either in a general comparison (ancestral × all variant genotypes), as well as in individual genotypes comparison (ancestral × each variant genotype), where the most prevalent effect was observed in the CG/CG × CA/TG comparison. These findings support the involvement of –257T, 257A allele variants and haplotypes on severe dengue phenotype protection, related with high basal CFH expression. PMID:23747994
Structure and genomic organization of the human B1 receptor gene for kinins (BDKRB1).
Bachvarov, D R; Hess, J F; Menke, J G; Larrivée, J F; Marceau, F
1996-05-01
Two subtypes of mammalian bradykinin receptors, B1 and B2 (BDKRB1 and BDKRB2), have been defined based on their pharmacological properties. The B1 type kinin receptors have weak affinity for intact BK or Lys-BK but strong affinity for kinin metabolites without the C-terminal arginine (e.g., des-Arg9-BK and Lys-des-Arg9-BK, also called des-Arg10-kallidin), which are generated by kininase I. The B1 receptor expression is up-regulated following tissue injury and inflammation (hyperemia, exudation, hyperalgesia, etc.). In the present study, we have cloned and sequenced the gene encoding human B1 receptor from a human genomic library. The human B1 receptor gene contains three exons separated by two introns. The first and the second exon are noncoding, while the coding region and the 3'-flanking region are located entirely on the third exon. The exon-intron arrangement of the human B1 receptor gene shows significant similarity with the genes encoding the B2 receptor subtype in human, mouse, and rat. Sequence analysis of the 5'-flanking region revealed the presence of a consensus TATA box and of numerous candidate transcription factor binding sequences. Primer extension experiments have shown the existence of multiple transcription initiation sites situated downstream and upstream from the consensus TATA box. Genomic Southern blot analysis indicated that the human B1 receptor is encoded by a single-copy gene.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kaler, S.G.; Gahl, W.A.
1994-09-01
Menkes disease is an X linked recessive disorder of copper metabolism produced by abnormalities in a gene that encodes a copper transporting ATPase. The clinical spectrum of Menkes disease includes a range of neurological severity from the classical type to the occipital horn syndrome (OHS) in which slightly subnormal intelligence or signs of autonomic dysfunction are the only neurologic abnormalities. We previously documented a distinctive, less severe Menkes phenotype associated with a +3 intronic splice donor mutation at the 3{prime} end of the gene in which exon skipping occurred but some normally spliced message was also detectable. We now reportmore » a similar splicing mutation in a patient with a typical OHS phenotype an A to G transition at the 2 exonic position of a splice donor site in the middle of the Menkes coding sequence. Some normally sized transcripts are evident by RT-PCR of lymphoblast mRNA from this individual, as well as 2 truncated fragments generated by exon skipping and activation of a cryptic splice acceptor site, respectively. The predicted effect of the mutation on the gene product involves a serine to glycine substitution in a noncritical region of the Menkes ATPase from the patient`s normally sized message, and premature termination due to translational frameshift in both truncated transcripts. The mutation eliminates a Dde 1 restriction site in the gene which provided a method to rapidly screen other family members, and revealed that the patient`s mother is a non-carrier. The mutational base change was not present in 25 normal X chromosomes studied. Preliminary analysis of the Menkes locus in 5 other Menkes disease families indicates aberrant mRNA splicing in 2. Our findings confirm allelism at the Menkes locus, indicate that splice mutations are relatively common mutational event in Menkes disease, and suggest that splice mutations in which some normal splicing is preserved may underlie milder Menkes disease variants, including OHS.« less
PrimerZ: streamlined primer design for promoters, exons and human SNPs.
Tsai, Ming-Fang; Lin, Yi-Jung; Cheng, Yu-Chang; Lee, Kuo-Hsi; Huang, Cheng-Chih; Chen, Yuan-Tsong; Yao, Adam
2007-07-01
PrimerZ (http://genepipe.ngc.sinica.edu.tw/primerz/) is a web application dedicated primarily to primer design for genes and human SNPs. PrimerZ accepts genes by gene name or Ensembl accession code, and SNPs by dbSNP rs or AFFY_Probe IDs. The promoter and exon sequence information of all gene transcripts fetched from the Ensembl database (http://www.ensembl.org) are processed before being passed on to Primer3 (http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi) for individual primer design. All results returned from Primer 3 are organized and integrated in a specially designed web page for easy browsing. Besides the web page presentation, csv text file export is also provided for enhanced user convenience. PrimerZ automates highly standard but tedious gene primer design to improve the success rate of PCR experiments. More than 2000 primers have been designed with PrimerZ at our institute since 2004 and the success rate is over 70%. The addition of several new features has made PrimerZ even more useful to the research community in facilitating primer design for promoters, exons and SNPs.
Peng, Ting; Wang, Li; Zhou, Shu-Feng; Li, Xiaotian
2010-12-01
A number of mutations in GATA4 and NKX2.5 have been identified to be causative for a subset of familial congenital heart defects (CHDs) and a small number of sporadic CHDs. In this study, we evaluated common GATA4 and NKX2.5 mutations in 135 Chinese pediatric patients with non-familial congenital heart defects. Two novel mutations in the coding region of GATA4 were identified, namely, 487C >T (Pro163Ser) in exon 1 in a child with tetralogy of Fallot and 1220C >A (Pro407Gln) in exon 6 in a pediatric patient with outlet membranous ventricular septal defect. We also found 848C >A (Pro283Gln) in exon 2 of the NKX2.5 gene in a pediatric patient with ventricular septal defect, patent ductus arteriosus and aortic isthmus stenosis. None of the mutations was detected in healthy control subjects (n = 114). This study suggests that GATA4 and NKX2.5 missense mutations may be associated with congenital heart defects in pediatric Chinese patients. Further clinical studies with large samples are warranted.
Zhao, Zhanqi; Chu, Chan-Ching; Chang, Mei-Yun; Chang, Hao-Tai; Hsu, Yeong-Long
2018-06-01
Methylmalonic acidemia (MMA) is an autosomal recessive disease of organic acidemia. We report a 26-year-old male who presented with metabolic acidosis, acute renal failure required hemodialysis and acute respiratory failure required mechanical ventilation support. Progressive hypotonia of muscles made weaning from mechanical ventilator difficult. High level of serum methylmalonic acid and the mut genotype sequences confirmed the diagnosis of this adult-onset MMA. Two mut genotype sequences were found by analyzing all coding exons and exon-intron junctions. One genotype was well documented (Exon 6 Mutation, c. 1280G>A. p. G427D, heterozygous). The other mut genotype sequence had never been reported elsewhere (Intron 6 Novel, c. 1333-13_c. 1333-8delTTTTTC, heterozygous). Diet modification, medication, regular hemodialysis and physical rehabilitation. Weaning strategy adjusted with help of electrical impedance tomography. The muscle power of the patient gradually recovered. Extubation of the patient was successful and he was discharged without oxygen required. This case gives us the lesson that MMA can be newly diagnosed in adult patient. A new mut genotype sequence was discovered. The use of electrical impedance tomography to select a suitable method for inspiratory muscle training was possible and useful.
Bahi-Buisson, Nadia; Girard, Benoit; Gautier, Agnes; Nectoux, Juliette; Fichou, Yann; Saillour, Yoann; Poirier, Karine; Chelly, Jamel; Bienvenu, Thierry
2010-01-05
We report a 2-year-old girl with early onset seizures variant of Rett syndrome with a deletion at Xp22 detected by multiplex ligation-dependent probe amplification (MLPA) technique. This patient presented with tonic seizures at 7 days of life. Subsequently, she developed infantile spasms at three months and finally refractory myoclonic epilepsy. She demonstrated severe encephalopathy with hypotonia, deceleration of head growth, with eye gaze but limited eye pursuit, no language, limited hand use, and intermittent hand stereotypies. This combination of clinical features, suggestive of early onset variant of Rett syndrome led us to screen the CDKL5 gene. In a first step, screening of the whole coding sequence of the CDKL5 gene revealed no point mutations. In a second step, we searched gross rearrangements by MLPA and identified a microdeletion affecting both the promoter and exon 1 in CDKL5. Subsequent analysis on a Nimblegen HD2 microarray confirmed a deletion of approximately 300 kb at Xp22, including the BEND2, SCML2, and CDKL5 genes. In conclusion, our report suggests that searching for large rearrangements in CDKL5 should be considered in girls with early onset seizures and Rett-like features. (c) 2009 Wiley-Liss, Inc.
Chromosomal localization and cDNA cloning of the human DBP and TEF genes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Khatib, Z.A.; Inaba, T.; Valentine, M.
1994-09-15
The authors have isolated cDNA and genomic clones and determined the human chromosome positions of two genes encoding transcription factors expressed in the liver and the pituitary gland: albumin D-site-binding protein (DBP) and thyrotroph embryonic factor (TEF). Both proteins have been identified as members of the PAR (proline and acidic amino acid-rich) subfamily of bZIP transcription factors in the rat, but human homologues have not been characterized. Using a fluorescence in situ hybridization technique, the DBP locus was assigned to chromosome 19q13, and TEF to chromosome 22q13. Each assignment was confirmed by means of human chromosome segregation in somatic cellmore » hybrids. Coding sequences of DBP and TEF, extending beyond the bZIP domain to the PAR region, were highly conserved in both human-human and interspecies comparisons. Conservation of the exon-intron boundaries of each bZIP domain-encoding exon suggested derivation from a common ancestral gene. DBP and TEF mRNAs were expressed in all tissues and cell lines examined, including brain, lung, liver, spleen, and kidney. Knowledge of the human chromosome locations of these PAR proteins will facilitate studies to assess their involvement in carcinogenesis and other fundamental biological processes. 37 refs., 5 figs., 1 tab.« less
Tsuchiya, Takayoshi; Shibata, Minoru; Numabe, Hironao; Jinno, Tomoko; Nakabayashi, Kazuhiko; Nishimura, Gen; Nagai, Toshiro; Ogata, Tsutomu; Fukami, Maki
2014-02-01
Haploinsufficiency of SHOX on the short arm pseudoautosomal region (PAR1) leads to Leri-Weill dyschondrosteosis (LWD), and nullizygosity of SHOX results in Langer mesomelic dysplasia (LMD). Molecular defects of LWD/LMD include various microdeletions in PAR1 that involve exons and/or the putative upstream or downstream enhancer regions of SHOX, as well as several intragenic mutations. Here, we report on a Japanese male infant with mild manifestations of LMD and hitherto unreported microdeletions in PAR1. Clinical analysis revealed mesomelic short stature with various radiological findings indicative of LMD. Molecular analyses identified compound heterozygous deletions, that is, a maternally inherited ∼46 kb deletion involving the upstream region and exons 1-5 of SHOX, and a paternally inherited ∼500 kb deletion started from a position ∼300 kb downstream from SHOX. In silico analysis revealed that the downstream deletion did not affect the known putative enhancer regions of SHOX, although it encompassed several non-coding elements which were well conserved among various species with SHOX orthologs. These results provide the possibility of the presence of a novel enhancer for SHOX in the genomic region ∼300 to ∼800 kb downstream of the start codon. © 2013 Wiley Periodicals, Inc.
Schwaiger, F W; Weyers, E; Epplen, C; Brün, J; Ruff, G; Crawford, A; Epplen, J T
1993-09-01
Twenty-one different caprine and 13 ovine MHC-DRB exon 2 sequences were determined including part of the adjacent introns containing simple repetitive (gt)n(ga)m elements. The positions for highly polymorphic DRB amino acids vary slightly among ungulates and other mammals. From man and mouse to ungulates the basic (gt)n(ga)m structure is fixed in evolution for 7 x 10(7) years whereas ample variations exist in the tandem (gt)n and (ga)m dinucleotides and especially their "degenerated" derivatives. Phylogenetic trees for the alpha-helices and beta-pleated sheets of the ungulate DRB sequences suggest different evolutionary histories. In hoofed animals as well as in humans DRB beta-sheet encoding sequences and adjacent intronic repeats can be assembled into virtually identical groups suggesting coevolution of noncoding as well as coding DNA. In contrast alpha-helices and C-terminal parts of the first DRB domain evolve distinctly. In the absence of a defined mechanism causing specific, site-directed mutations, double-recombination or gene-conversion-like events would readily explain this fact. The role of the intronic simple (gt)n(ga)m repeat is discussed with respect to these genetic exchange mechanisms during evolution.
Peng, Tao; Xue, Chenghai; Bi, Jianning; Li, Tingting; Wang, Xiaowo; Zhang, Xuegong; Li, Yanda
2008-04-26
Alternative splicing expands transcriptome diversity and plays an important role in regulation of gene expression. Previous studies focus on the regulation of a single cassette exon, but recent experiments indicate that multiple cassette exons within a gene may interact with each other. This interaction can increase the potential to generate various transcripts and adds an extra layer of complexity to gene regulation. Several cases of exon interaction have been discovered. However, the extent to which the cassette exons coordinate with each other remains unknown. Based on EST data, we employed a metric of correlation coefficients to describe the interaction between two adjacent cassette exons and then categorized these exon pairs into three different groups by their interaction (correlation) patterns. Sequence analysis demonstrates that strongly-correlated groups are more conserved and contain a higher proportion of pairs with reading frame preservation in a combinatorial manner. Multiple genome comparison further indicates that different groups of correlated pairs have different evolutionary courses: (1) The vast majority of positively-correlated pairs are old, (2) most of the weakly-correlated pairs are relatively young, and (3) negatively-correlated pairs are a mixture of old and young events. We performed a large-scale analysis of interactions between adjacent cassette exons. Compared with weakly-correlated pairs, the strongly-correlated pairs, including both the positively and negatively correlated ones, show more evidence that they are under delicate splicing control and tend to be functionally important. Additionally, the positively-correlated pairs bear strong resemblance to constitutive exons, which suggests that they may evolve from ancient constitutive exons, while negatively and weakly correlated pairs are more likely to contain newly emerging exons.
Mutation analysis of the Fanconi Anemia Gene FACC
DOE Office of Scientific and Technical Information (OSTI.GOV)
Verlander, P.C.; Lin, J.D.; Udono, M.U.
1994-04-01
Fanconi anemia (FA) is a genetically heterogeneous autosomal recessive disorder characterized by a unique hypersensitivity of cells to DNA cross-linking agents; a gene for complementation group C (FACC) has recently been cloned. The authors have amplified FACC exons with their flanking intron sequences from genomic DNA from 174 racially and ethnically diverse families in the International Fanconi Anemia Registry and have screened for mutations by using SSCP analysis. They have identified eight different variants in 32 families; three were detected in exon 1, one in exon 4, one in intron 4, two in exon 6, and one in exon 14.more » Two of the eight variants, in seven families, did not segregate with the disease allele in multiplex families, suggesting that these variants represented benign polymorphisms. Disease-associated mutations in FACC were detected in a total of 25 (14.4%) of 174 families screened. The most frequent mutations were IVS4 + 4 A [yields] T (intron 4; 12 families) and 322delG (exon 1; 9 families). Other, less common mutations include Q13X in exon 1, R185X and D195V in exon 6, and L554P in exon 14. The polymorphisms were S26F in exon 1 and G139E in exon 4. All patients in the study with 322delG, Q13X, R185X, and D195V are of northern or eastern European or southern Italian ancestry, and 18 of 19 have a mild form of the disease, while the 2 patients with L554P, both from the same family, have a severe phenotype. All 19 patients with IVS4 + 4 A [yields] T have Jewish ancestry and have a severe phenotype. 19 refs., 1 fig., 3 tabs.« less
Yuan, Kejun; Wang, Changjun; Xin, Li; Zhang, Anning; Ai, Chengxiang
2013-07-25
A farnesyl diphosphate synthase gene (FPPS2), which contains 11 introns and 12 exons, was isolated from the apple cultivar "White Winter Pearmain". When it was compared to our previously reported FPPS1, its each intron size was different, its each exon size was the same as that of FPPS1 gene, 30 nucleotide differences were found in its coding sequence. Based on these nucleotide differences, specific primers were designed to perform expression analysis; the results showed that it expressed in both fruit and leaf, its expression level was obviously lower than that of FPPS1 gene in fruit which was stored at 4°C for 5 weeks. This is the first report concerning two FPPS genes and their expression comparison in apples. Copyright © 2013 Elsevier B.V. All rights reserved.
O'Leary, Debra A.; Sharif, Orzala; Anderson, Paul; Tu, Buu; Welch, Genevieve; Zhou, Yingyao; Caldwell, Jeremy S.; Engels, Ingo H.; Brinker, Achim
2009-01-01
One therapeutic approach to Duchenne Muscular Dystrophy (DMD) recently entering clinical trials aims to convert DMD phenotypes to that of a milder disease variant, Becker Muscular Dystrophy (BMD), by employing antisense oligonucleotides (AONs) targeting splice sites, to induce exon skipping and restore partial dystrophin function. In order to search for small molecule and genetic modulators of AON-dependent and independent exon skipping, we screened ∼10,000 known small molecule drugs, >17,000 cDNA clones, and >2,000 kinase- targeted siRNAs against a 5.6 kb luciferase minigene construct, encompassing exon 71 to exon 73 of human dystrophin. As a result, we identified several enhancers of exon skipping, acting on both the reporter construct as well as endogenous dystrophin in mdx cells. Multiple mechanisms of action were identified, including histone deacetylase inhibition, tubulin modulation and pre-mRNA processing. Among others, the nucleolar protein NOL8 and staufen RNA binding protein homolog 2 (Stau2) were found to induce endogenous exon skipping in mdx cells in an AON-dependent fashion. An unexpected but recurrent theme observed in our screening efforts was the apparent link between the inhibition of cell cycle progression and the induction of exon skipping. PMID:20020055
O'Leary, Debra A; Sharif, Orzala; Anderson, Paul; Tu, Buu; Welch, Genevieve; Zhou, Yingyao; Caldwell, Jeremy S; Engels, Ingo H; Brinker, Achim
2009-12-17
One therapeutic approach to Duchenne Muscular Dystrophy (DMD) recently entering clinical trials aims to convert DMD phenotypes to that of a milder disease variant, Becker Muscular Dystrophy (BMD), by employing antisense oligonucleotides (AONs) targeting splice sites, to induce exon skipping and restore partial dystrophin function. In order to search for small molecule and genetic modulators of AON-dependent and independent exon skipping, we screened approximately 10,000 known small molecule drugs, >17,000 cDNA clones, and >2,000 kinase- targeted siRNAs against a 5.6 kb luciferase minigene construct, encompassing exon 71 to exon 73 of human dystrophin. As a result, we identified several enhancers of exon skipping, acting on both the reporter construct as well as endogenous dystrophin in mdx cells. Multiple mechanisms of action were identified, including histone deacetylase inhibition, tubulin modulation and pre-mRNA processing. Among others, the nucleolar protein NOL8 and staufen RNA binding protein homolog 2 (Stau2) were found to induce endogenous exon skipping in mdx cells in an AON-dependent fashion. An unexpected but recurrent theme observed in our screening efforts was the apparent link between the inhibition of cell cycle progression and the induction of exon skipping.
Early-onset seizures due to mosaic exonic deletions of CDKL5 in a male and two females.
Bartnik, Magdalena; Derwińska, Katarzyna; Gos, Monika; Obersztyn, Ewa; Kołodziejska, Katarzyna E; Erez, Ayelet; Szpecht-Potocka, Agnieszka; Fang, Ping; Terczyńska, Iwona; Mierzewska, Hanna; Lohr, Naomi J; Bellus, Gary A; Reimschisel, Tyler; Bocian, Ewa; Mazurczak, Tadeusz; Cheung, Sau Wai; Stankiewicz, Paweł
2011-05-01
Mutations in the CDKL5 gene have been associated with an X-linked dominant early infantile epileptic encephalopathy-2. The clinical presentation is usually of severe encephalopathy with refractory seizures and Rett syndrome (RTT)-like phenotype. We attempted to assess the role of mosaic intragenic copy number variation in CDKL5. We have used comparative genomic hybridization with a custom-designed clinical oligonucleotide array targeting exons of selected disease and candidate genes, including CDKL5. We have identified mosaic exonic deletions of CDKL5 in one male and two females with developmental delay and medically intractable seizures. These three mosaic changes represent 60% of all deletions detected in 12,000 patients analyzed by array comparative genomic hybridization and involving the exonic portion of CDKL5. We report the first case of an exonic deletion of CDKL5 in a male and emphasize the importance of underappreciated mosaic exonic copy number variation in patients with early-onset seizures and RTT-like features of both genders.
Hereditary Angioedema Nationwide Study in Slovenia Reveals Four Novel Mutations in SERPING1 Gene
Rijavec, Matija; Korošec, Peter; Šilar, Mira; Zidarn, Mihaela; Miljković, Jovan; Košnik, Mitja
2013-01-01
Hereditary angioedema (HAE) is a rare autosomal dominant disease characterized by swelling of the face, lips, tongue, larynx, genitalia, or extremities, with abdominal pain caused by intra-abdominal edema. HAE is caused by mutations affecting the C1 inhibitor gene, SERPING1, resulting in low levels of C1 inhibitor (Type I HAE) or normal levels of ineffective C1 inhibitor (Type II HAE). A nationwide survey identified nine unrelated families with HAE in Slovenia, among whom 17 individuals from eight families were recruited for genetic analyses. A diagnosis of HAE was established in the presence of clinical and laboratory criteria (low C1 inhibitor antigenic levels and/or function), followed up by a positive family history. Genetic studies were carried out using PCR and sequencing to detect SERPING1 mutations in promoter, noncoding exon 1, the 7 coding exons, and exon-intron boundaries. Multiplex ligation-dependent probe amplification was performed in order to search for large deletions/duplications in SERPING1 gene. A mutation responsible for HAE was identified in patients from seven families with the disease. In HAE type I families, one previously reported substitution (Gln67Stop, c.265C>T) and four novel mutations were identified. The new mutations included two missense substitutions, Ser128Phe (c.449C>T), and Glu429Lys (c.1351G>A), together with two frameshift mutations, indel (c.49delGinsTT) and deletion (c.593_594delCT). Both families with HAE type II harbored the two well-known substitutions affecting the arginyl residue at the reactive center in exon 8, Arg444Cys (c.1396C>T) and Arg444His (c.1397G>A), respectively. In one patient only the homozygous variant g.566T>C (c.-21T>C) was identified. Our study identified four novel mutations in the Slovenian HAE population, highlighting the heterogeneity of mutations in the SERPING1 gene causing C1 inhibitor deficiency and HAE. In a single patient with HAE a homozygous variant g.566T>C (c.-21T>C) might be responsible for the disease. PMID:23437219
Hereditary angioedema nationwide study in Slovenia reveals four novel mutations in SERPING1 gene.
Rijavec, Matija; Korošec, Peter; Šilar, Mira; Zidarn, Mihaela; Miljković, Jovan; Košnik, Mitja
2013-01-01
Hereditary angioedema (HAE) is a rare autosomal dominant disease characterized by swelling of the face, lips, tongue, larynx, genitalia, or extremities, with abdominal pain caused by intra-abdominal edema. HAE is caused by mutations affecting the C1 inhibitor gene, SERPING1, resulting in low levels of C1 inhibitor (Type I HAE) or normal levels of ineffective C1 inhibitor (Type II HAE). A nationwide survey identified nine unrelated families with HAE in Slovenia, among whom 17 individuals from eight families were recruited for genetic analyses. A diagnosis of HAE was established in the presence of clinical and laboratory criteria (low C1 inhibitor antigenic levels and/or function), followed up by a positive family history. Genetic studies were carried out using PCR and sequencing to detect SERPING1 mutations in promoter, noncoding exon 1, the 7 coding exons, and exon-intron boundaries. Multiplex ligation-dependent probe amplification was performed in order to search for large deletions/duplications in SERPING1 gene. A mutation responsible for HAE was identified in patients from seven families with the disease. In HAE type I families, one previously reported substitution (Gln67Stop, c.265C>T) and four novel mutations were identified. The new mutations included two missense substitutions, Ser128Phe (c.449C>T), and Glu429Lys (c.1351G>A), together with two frameshift mutations, indel (c.49delGinsTT) and deletion (c.593_594delCT). Both families with HAE type II harbored the two well-known substitutions affecting the arginyl residue at the reactive center in exon 8, Arg444Cys (c.1396C>T) and Arg444His (c.1397G>A), respectively. In one patient only the homozygous variant g.566T>C (c.-21T>C) was identified. Our study identified four novel mutations in the Slovenian HAE population, highlighting the heterogeneity of mutations in the SERPING1 gene causing C1 inhibitor deficiency and HAE. In a single patient with HAE a homozygous variant g.566T>C (c.-21T>C) might be responsible for the disease.
Aoki, Koh; Yano, Kentaro; Suzuki, Ayako; Kawamura, Shingo; Sakurai, Nozomu; Suda, Kunihiro; Kurabayashi, Atsushi; Suzuki, Tatsuya; Tsugane, Taneaki; Watanabe, Manabu; Ooga, Kazuhide; Torii, Maiko; Narita, Takanori; Shin-I, Tadasu; Kohara, Yuji; Yamamoto, Naoki; Takahashi, Hideki; Watanabe, Yuichiro; Egusa, Mayumi; Kodama, Motoichiro; Ichinose, Yuki; Kikuchi, Mari; Fukushima, Sumire; Okabe, Akiko; Arie, Tsutomu; Sato, Yuko; Yazawa, Katsumi; Satoh, Shinobu; Omura, Toshikazu; Ezura, Hiroshi; Shibata, Daisuke
2010-03-30
The Solanaceae family includes several economically important vegetable crops. The tomato (Solanum lycopersicum) is regarded as a model plant of the Solanaceae family. Recently, a number of tomato resources have been developed in parallel with the ongoing tomato genome sequencing project. In particular, a miniature cultivar, Micro-Tom, is regarded as a model system in tomato genomics, and a number of genomics resources in the Micro-Tom-background, such as ESTs and mutagenized lines, have been established by an international alliance. To accelerate the progress in tomato genomics, we developed a collection of fully-sequenced 13,227 Micro-Tom full-length cDNAs. By checking redundant sequences, coding sequences, and chimeric sequences, a set of 11,502 non-redundant full-length cDNAs (nrFLcDNAs) was generated. Analysis of untranslated regions demonstrated that tomato has longer 5'- and 3'-untranslated regions than most other plants but rice. Classification of functions of proteins predicted from the coding sequences demonstrated that nrFLcDNAs covered a broad range of functions. A comparison of nrFLcDNAs with genes of sixteen plants facilitated the identification of tomato genes that are not found in other plants, most of which did not have known protein domains. Mapping of the nrFLcDNAs onto currently available tomato genome sequences facilitated prediction of exon-intron structure. Introns of tomato genes were longer than those of Arabidopsis and rice. According to a comparison of exon sequences between the nrFLcDNAs and the tomato genome sequences, the frequency of nucleotide mismatch in exons between Micro-Tom and the genome-sequencing cultivar (Heinz 1706) was estimated to be 0.061%. The collection of Micro-Tom nrFLcDNAs generated in this study will serve as a valuable genomic tool for plant biologists to bridge the gap between basic and applied studies. The nrFLcDNA sequences will help annotation of the tomato whole-genome sequence and aid in tomato functional genomics and molecular breeding. Full-length cDNA sequences and their annotations are provided in the database KaFTom http://www.pgb.kazusa.or.jp/kaftom/ via the website of the National Bioresource Project Tomato http://tomato.nbrp.jp.
Sano, Shinichiro; Iwata, Hiromi; Matsubara, Keiko; Fukami, Maki; Kagami, Masayo; Ogata, Tsutomu
2015-01-01
Pseudohypoparathyroidism (PHP) is associated with compromised signal transductions via PTH receptor (PTH-R) and other G-protein-coupled receptors including GHRH-R. To date, while GH deficiency (GHD) has been reported in multiple patients with PHP-Ia caused by mutations on the maternally expressed GNAS coding regions and in two patients with sporadic form of PHP-Ib accompanied by broad methylation defects of maternally derived GNAS differentially methylated regions (DMRs), it has not been identified in a patient with an autosomal dominant form of PHP-Ib (AD-PHP-Ib) accompanied by an STX16 microdeletion and an isolated loss of methylation (LOM) at exon A/B-DMR. We studied 5 4/12-year-old monozygotic twins with short stature (both -3.4 SD) and GHD (peak GH values, <6.0 μg/L after arginine and clonidine stimulations). Molecular studies revealed maternally derived STX16 microdeletions and isolated LOMs at exon A/B-DMR in the twins, confirming the diagnosis of AD-PHP-Ib. GNAS mutation was not identified, and neither mutation nor copy number variation was detected in GH1, POU1F1, PROP1, GHRHR, LHX3, LHX4, and HESX1 in the twins. The results, in conjunction with the previous finding that GNAS shows maternal expression in the pituitary, suggest that GHD of the twins is primarily ascribed to compromised GHRH-R signaling caused by AD-PTH-Ib. Thus, resistance to multiple hormones including GHRH should be considered in AD-PHP-Ib.
Baker, C S; Vant, M D; Dalebout, M L; Lento, G M; O'Brien, S J; Yuhki, N
2006-05-01
The molecular diversity and phylogenetic relationships of two class II genes of the baleen whale major histocompatibility complex were investigated and compared to toothed whales and out-groups. Amplification of the DQB exon 2 provided sequences showing high within-species and between-species nucleotide diversity and uninterrupted reading frames consistent with functional class II loci found in related mammals (e.g., ruminants). Cloning of amplified products indicated gene duplication in the humpback whale and triplication in the southern right whale, with average nucleotide diversity of 5.9 and 6.3%, respectively, for alleles of each species. Significantly higher nonsynonymous divergence at sites coding for peptide binding (32% for humpback and 40% for southern right) suggested that these loci were subject to positive (overdominant) selection. A population survey of humpback whales detected 23 alleles, differing by up to 21% of their inferred amino acid sequences. Amplification of the DRB exon 2 resulted in two groups of sequences. One was most similar to the DRB3 of the cow and present in all whales screened to date, including toothed whales. The second was most similar to the DRB2 of the cow and was found only in the bowhead and right whales. Both loci showed low diversity among species and apparent loss of function or altered function including interruption of reading frames. Finally, comparison of inferred protein sequence of the DRB3-like locus suggested convergence with the DQB, perhaps resulting from intergenic conversion or recombination.
Divergent transcription is associated with promoters of transcriptional regulators
2013-01-01
Background Divergent transcription is a wide-spread phenomenon in mammals. For instance, short bidirectional transcripts are a hallmark of active promoters, while longer transcripts can be detected antisense from active genes in conditions where the RNA degradation machinery is inhibited. Moreover, many described long non-coding RNAs (lncRNAs) are transcribed antisense from coding gene promoters. However, the general significance of divergent lncRNA/mRNA gene pair transcription is still poorly understood. Here, we used strand-specific RNA-seq with high sequencing depth to thoroughly identify antisense transcripts from coding gene promoters in primary mouse tissues. Results We found that a substantial fraction of coding-gene promoters sustain divergent transcription of long non-coding RNA (lncRNA)/mRNA gene pairs. Strikingly, upstream antisense transcription is significantly associated with genes related to transcriptional regulation and development. Their promoters share several characteristics with those of transcriptional developmental genes, including very large CpG islands, high degree of conservation and epigenetic regulation in ES cells. In-depth analysis revealed a unique GC skew profile at these promoter regions, while the associated coding genes were found to have large first exons, two genomic features that might enforce bidirectional transcription. Finally, genes associated with antisense transcription harbor specific H3K79me2 epigenetic marking and RNA polymerase II enrichment profiles linked to an intensified rate of early transcriptional elongation. Conclusions We concluded that promoters of a class of transcription regulators are characterized by a specialized transcriptional control mechanism, which is directly coupled to relaxed bidirectional transcription. PMID:24365181
nGASP - the nematode genome annotation assessment project
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coghlan, A; Fiedler, T J; McKay, S J
2008-12-19
While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner'more » algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders. While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner' algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders.« less
Omi, T; Brenig, B; Spilar Kramer, S; Iwamoto, S; Stranzinger, G; Neuenschwander, S
2005-04-01
The peroxisome proliferator-activated receptor-gamma (PPAR-gamma) is a member of the steroid/thyroid/retinoid receptor superfamily, and is primarily expressed in fat tissue. To date, two major PPAR-gamma isoforms have been identified in pig, PPAR-gamma1 and PPAR-gamma2. Porcine PPAR-gamma1a consists of two leader exons, designated A1 and A2, followed by six exons containing the open reading frame. Here, we report the isolation and characterization of three novel PPAR-gamma1 transcripts. PPAR-gamma1b is derived from exon A1, with exon A2 spliced out. PPAR-gamma1c and PPAR-gamma1d are derived from the new exon, A', containing exon A2 (gamma1c) or without exon A2 (gamma1d). Based on PCR analysis of PAC clones that included sequences from the 5'-untranslated region of the PPAR-gamma gene, the new A' exon is located between the known exons A1 and A2. We also isolated the human homologue to exon A', as well as the two new PPAR-gamma1c and -gamma1d splice variants, from human adipose tissue. Studies of the expression of porcine PPAR-gamma by real time reverse transcription-polymerase chain reaction analysis show that transcripts derived from exon A1 were not expressed at significantly different levels in visceral fat (lamina subserosa) or subcutaneous fat (back fat, inner and outer layer). In contrast, exon A'-derived transcripts were expressed at progressively higher levels in the inner and outer layers of subcutaneous fat than in visceral fat. The same expression pattern was also observed for PPAR-gamma2. We hypothesize that there are three promoters, which differentially regulate PPAR-gamma1 and PPAR-gamma2 gene expression, depending on the specific localization of the fat tissue.
Li, Jun; Hakata, Yoshiyuki; Takeda, Eri; Liu, Qingping; Iwatani, Yasumasa; Kozak, Christine A.; Miyazawa, Masaaki
2012-01-01
Mouse apolipoprotein B mRNA-editing enzyme catalytic polypeptide-like editing complex 3 (mA3), an intracellular antiviral factor, has 2 allelic variations that are linked with different susceptibilities to beta- and gammaretrovirus infections among various mouse strains. In virus-resistant C57BL/6 (B6) mice, mA3 transcripts are more abundant than those in susceptible BALB/c mice both in the spleen and bone marrow. These strains of mice also express mA3 transcripts with different splicing patterns: B6 mice preferentially express exon 5-deficient (Δ5) mA3 mRNA, while BALB/c mice produce exon 5-containing full-length mA3 mRNA as the major transcript. Although the protein product of the Δ5 mRNA exerts stronger antiretroviral activities than the full-length protein, how exon 5 affects mA3 antiviral activity, as well as the genetic mechanisms regulating exon 5 inclusion into the mA3 transcripts, remains largely uncharacterized. Here we show that mA3 exon 5 is indeed a functional element that influences protein synthesis at a post-transcriptional level. We further employed in vitro splicing assays using genomic DNA clones to identify two critical polymorphisms affecting the inclusion of exon 5 into mA3 transcripts: the number of TCCT repeats upstream of exon 5 and the single nucleotide polymorphism within exon 5 located 12 bases upstream of the exon 5/intron 5 boundary. Distribution of the above polymorphisms among different Mus species indicates that the inclusion of exon 5 into mA3 mRNA is a relatively recent event in the evolution of mice. The widespread geographic distribution of this exon 5-including genetic variant suggests that in some Mus populations the cost of maintaining an effective but mutagenic enzyme may outweigh its antiviral function. PMID:22275865
Piya, Anil; Kaur, Jasmeet; Rice, Alan M; Bose, Himangshu S
2017-01-01
Cholesterol transport into the mitochondria is required for synthesis of the first steroid, pregnenolone. Cholesterol is transported by the steroidogenic acute regulatory protein (STAR), which acts at the outer mitochondrial membrane prior to its import. Mutations in the STAR protein result in lipoid congenital adrenal hyperplasia (CAH). Although the STAR protein consists of seven exons, biochemical analysis in nonsteroidogenic COS-1 cells showed that the first two were not essential for pregnenolone synthesis. Here, we present a patient with ambiguous genitalia, salt-lossing crisis within two weeks after birth and low cortisol levels. Sequence analysis of the STAR , including the exon-intron boundaries, showed the complete deletion of exon 1 as well as more than 50 nucleotides upstream of STAR promoter. Mitochondrial protein import with the translated protein through synthesis cassette of the mutant STAR lacking exon 1 showed protein translation, but it is less likely to have synthesized without a promoter in our patient. Thus, a full-length STAR gene is necessary for physiological mitochondrial cholesterol transport in vivo . STAR exon 1 deletion caused lipoid CAH.Exon 1 substitution does not affect biochemical activity.StAR promoter is responsible for gonadal development.
Ankyrin-G isoform imbalance and interneuronopathy link epilepsy and bipolar disorder.
Lopez, A Y; Wang, X; Xu, M; Maheshwari, A; Curry, D; Lam, S; Adesina, A M; Noebels, J L; Sun, Q-Q; Cooper, E C
2017-10-01
ANK3, encoding the adaptor protein Ankyrin-G (AnkG), has been implicated in bipolar disorder by genome-wide association studies. ANK3 has multiple alternative first exons, and a bipolar disorder-associated ANK3 variant has been shown to reduce the expression of exon 1b. Here we identify mechanisms through which reduced ANK3 exon 1b isoform expression disrupts neuronal excitation-inhibition balance. We find that parvalbumin (PV) interneurons and principal cells differentially express ANK3 first exon subtypes. PV interneurons express only isoforms containing exon 1b, whereas excitatory principal cells express exon 1e alone or both 1e and 1b. In transgenic mice deficient for exon 1b, PV interneurons lack voltage-gated sodium channels at their axonal initial segments and have increased firing thresholds and diminished action potential dynamic range. These mice exhibit an Ank3 gene dosage-dependent phenotype including behavior changes modeling bipolar disorder, epilepsy and sudden death. Thus ANK3's important association with human bipolar susceptibility may arise from imbalance between AnkG function in interneurons and principal cells and resultant excessive circuit sensitivity and output. AnkG isoform imbalance is a novel molecular endophenotype and potential therapeutic target.
Ankyrin-G isoform imbalance and interneuronopathy link epilepsy and bipolar disorder
Lopez, Angel Y.; Wang, Xinjun; Xu, Mingxuan; Maheshwari, Atul; Curry, Daniel; Lam, Sandi; Adesina, Adekunle M.; Noebels, Jeffrey L.; Sun, Qian-Quan; Cooper, Edward C.
2016-01-01
ANK3, encoding the adaptor protein Ankyrin-G, has been implicated in bipolar disorder by genome wide association studies. ANK3 has multiple alternative first exons, and a bipolar disorder-associated ANK3 variant has been shown to reduce expression of exon 1b. Here we identify mechanisms through which reduced ANK3 exon 1b isoform expression disrupts neuronal excitation-inhibition balance. We find that parvalbumin interneurons and principal cells differentially express ANK3 first exon subtypes. Parvalbumin interneurons express only isoforms containing exon 1b, whereas excitatory principal cells express exon 1e alone, or both 1e and 1b. In transgenic mice deficient for exon 1b, parvalbumin interneurons lack voltage-gated sodium channels at their axonal initial segments and have increased firing thresholds and diminished action potential dynamic range. These mice exhibit an Ank3 gene dosage-dependent phenotype including behavior changes modeling bipolar disorder, epilepsy, and sudden death. Thus, ANK3’s important association with human bipolar susceptibility may arise from imbalance between ankyrin-G function in interneurons and principal cells and resultant excessive circuit sensitivity and output. Ankyrin-G isoform imbalance is a novel molecular endophenotype and potential therapeutic target. PMID:27956739
Scharner, J; Figeac, N; Ellis, J A; Zammit, P S
2015-06-01
Exon skipping, as a therapy to restore a reading frame or switch protein isoforms, is under clinical trial. We hypothesised that removing an in-frame exon containing a mutation could also improve pathogenic phenotypes. Our model is laminopathies: incurable tissue-specific degenerative diseases associated with LMNA mutations. LMNA encodes A-type lamins, that together with B-type lamins, form the nuclear lamina. Lamins contain an alpha-helical central rod domain composed of multiple heptad repeats. Eliminating LMNA exon 3 or 5 removes six heptad repeats, so shortens, but should not otherwise significantly alter, the alpha-helix. Human Lamin A or Lamin C with a deletion corresponding to amino acids encoded by exon 5 (Lamin A/C-Δ5) localised normally in murine lmna-null cells, rescuing both nuclear shape and endogenous Lamin B1/emerin distribution. However, Lamin A carrying pathogenic mutations in exon 3 or 5, or Lamin A/C-Δ3, did not. Furthermore, Lamin A/C-Δ5 was not deleterious to wild-type cells, unlike the other Lamin A mutants including Lamin A/C-Δ3. Thus Lamin A/C-Δ5 function as effectively as wild-type Lamin A/C and better than mutant versions. Antisense oligonucleotides skipped LMNA exon 5 in human cells, demonstrating the possibility of treating certain laminopathies with this approach. This proof-of-concept is the first to report the therapeutic potential of exon skipping for diseases arising from missense mutations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shakoor, N; Nair, R; Crasta, O
2014-01-23
Background: Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. Results: This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specificmore » probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e. g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Conclusions: Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community.« less
Mayr, B; Reifinger, M; Alton, K; Schaffner, G
1998-06-01
Twenty feline neoplasms were sequenced in the region from exons 5 to 8 for the presence of tumour suppressor gene p53 mutations. In a spindle cell sarcoma of the bladder, a missense mutation (codon 164 AAG-->GAG, lysine-->glutamic acid) in exon 5 was detected. In a pleomorphic sarcoma, a 23 bp deletion involving the splicing junction between intron 5 and exon 6 was observed. In a fibrosarcoma, a 6 bp deletion of p53 covering 2 bp of exon 7 and 4 bp of intron 7, including the splicing junction, was found. The study demonstrates three new p53 mutations in different types of sarcomas in cats.
Liu, Jun; Bhadra, Malini; Sinnakannu, Joanna Rajeswary; Yue, Wan Lin; Tan, Cheryl Weiqi; Rigo, Frank; Ong, S.Tiong; Roca, Xavier
2017-01-01
Many tyrosine kinase-driven cancers, including chronic myeloid leukemia (CML), are characterized by high response rates to specific tyrosine kinase inhibitors (TKIs) like imatinib. In East Asians, primary imatinib resistance is caused by a deletion polymorphism in Intron 2 of the BIM gene, whose product is required for TKI-induced apoptosis. The deletion biases BIM splicing from exon 4 to exon 3, generating splice isoforms lacking the exon 4-encoded pro-apoptotic BH3 domain, which impairs the ability of TKIs to induce apoptosis. We sought to identify splice-switching antisense oligonucleotides (ASOs) that block exon 3 but enhance exon 4 splicing, and thereby resensitize BIM deletion-containing cancers to imatinib. First, we mapped multiple cis-acting splicing elements around BIM exon 3 by minigene mutations, and found an exonic splicing enhancer acting via SRSF1. Second, by a systematic ASO walk, we isolated ASOs that corrected the aberrant BIM splicing. Eight of 67 ASOs increased exon 4 levels in BIM deletion-containing cells, and restored imatinib-induced apoptosis and TKI sensitivity. This proof-of-principle study proves that resistant CML cells by BIM deletion polymorphism can be resensitized to imatinib via splice-switching BIM ASOs. Future optimizations might yield a therapeutic ASO as precision-medicine adjuvant treatment for BIM-polymorphism-associated TKI-resistant CML and other cancers. PMID:29100409
Liu, Jun; Bhadra, Malini; Sinnakannu, Joanna Rajeswary; Yue, Wan Lin; Tan, Cheryl Weiqi; Rigo, Frank; Ong, S Tiong; Roca, Xavier
2017-09-29
Many tyrosine kinase-driven cancers, including chronic myeloid leukemia (CML), are characterized by high response rates to specific tyrosine kinase inhibitors (TKIs) like imatinib. In East Asians, primary imatinib resistance is caused by a deletion polymorphism in Intron 2 of the BIM gene, whose product is required for TKI-induced apoptosis. The deletion biases BIM splicing from exon 4 to exon 3, generating splice isoforms lacking the exon 4-encoded pro-apoptotic BH3 domain, which impairs the ability of TKIs to induce apoptosis. We sought to identify splice-switching antisense oligonucleotides (ASOs) that block exon 3 but enhance exon 4 splicing, and thereby resensitize BIM deletion-containing cancers to imatinib. First, we mapped multiple cis -acting splicing elements around BIM exon 3 by minigene mutations, and found an exonic splicing enhancer acting via SRSF1. Second, by a systematic ASO walk, we isolated ASOs that corrected the aberrant BIM splicing. Eight of 67 ASOs increased exon 4 levels in BIM deletion-containing cells, and restored imatinib-induced apoptosis and TKI sensitivity. This proof-of-principle study proves that resistant CML cells by BIM deletion polymorphism can be resensitized to imatinib via splice-switching BIM ASOs. Future optimizations might yield a therapeutic ASO as precision-medicine adjuvant treatment for BIM -polymorphism-associated TKI-resistant CML and other cancers.
USDA-ARS?s Scientific Manuscript database
The actions of prolactin (PRL) are mediated by both long (LF) and short isoforms (SF) of the PRL receptor (PRLR). Here, we report on a genetic and functional analysis of the porcine PRLR (pPRLR) SF. Three single nucleotide polymorphisms (SNPs) within exon 11 of the pPRLR-SF give rise to four amino a...
Aller, E; Jaijo, T; Beneyto, M; Nájera, C; Oltra, S; Ayuso, C; Baiget, M; Carballo, M; Antiñolo, G; Valverde, D; Moreno, F; Vilela, C; Collado, D; Pérez‐Garrigues, H; Navea, A; Millán, J M
2006-01-01
Mutations in USH2A gene have been shown to be responsible for Usher syndrome type II, an autosomal recessive disorder characterised by hearing loss and retinitis pigmentosa. USH2A was firstly described as consisting of 21 exons, but 52 novel exons at the 3' end of the gene were recently identified. In this report, a mutation analysis of the new 52 exons of USH2A gene was carried out in 32 unrelated patients in which both disease‐causing mutations could not be found after the screening of the first 21 exons of the USH2A gene. On analysing the new 52 exons, fourteen novel mutations were identified in 14 out of the 32 cases studied, including 7 missense, 5 frameshift, 1 duplication and a putative splice-site mutation. PMID:17085681
Thauvin-Robinet, Christel; Franco, Brunella; Saugier-Veber, Pascale; Aral, Bernard; Gigot, Nadège; Donzel, Anne; Van Maldergem, Lionel; Bieth, Eric; Layet, Valérie; Mathieu, Michèle; Teebi, Ahmad; Lespinasse, James; Callier, Patrick; Mugneret, Francine; Masurel-Paulet, Alice; Gautier, Elodie; Huet, Frédéric; Teyssier, Jean-Raymond; Tosi, Mario; Frébourg, Thierry; Faivre, Laurence
2009-02-01
Oral-facial-digital type I syndrome (OFDI) is characterised by an X-linked dominant mode of inheritance with lethality in males. Clinical features include facial dysmorphism with oral, dental and distal abnormalities, polycystic kidney disease and central nervous system malformations. Considerable allelic heterogeneity has been reported within the OFD1 gene, but DNA bi-directional sequencing of the exons and intron-exon boundaries of the OFD1 gene remains negative in more than 20% of cases. We hypothesized that genomic rearrangements could account for the majority of the remaining undiagnosed cases. Thus, we took advantage of two independent available series of patients with OFDI syndrome and negative DNA bi-directional sequencing of the exons and intron-exon boundaries of the OFD1 gene from two different European labs: 13/36 cases from the French lab; 13/95 from the Italian lab. All patients were screened by a semiquantitative fluorescent multiplex method (QFMPSF) and relative quantification by real-time PCR (qPCR). Six OFD1 genomic deletions (exon 5, exons 1-8, exons 1-14, exons 10-11, exons 13-23 and exon 17) were identified, accounting for 5% of OFDI patients and for 23% of patients with negative mutation screening by DNA sequencing. The association of DNA direct sequencing, QFMPSF and qPCR detects OFD1 alteration in up to 85% of patients with a phenotype suggestive of OFDI syndrome. Given the average percentage of large genomic rearrangements (5%), we suggest that dosage methods should be performed in addition to DNA direct sequencing analysis to exclude the involvement of the OFD1 transcript when there are genetic counselling issues. (c) 2008 Wiley-Liss, Inc.
Hashemikhabir, Seyedsasan; Budak, Gungor; Janga, Sarath Chandra
2016-01-01
Survival analysis in biomedical sciences is generally performed by correlating the levels of cellular components with patients’ clinical features as a common practice in prognostic biomarker discovery. While the common and primary focus of such analysis in cancer genomics so far has been to identify the potential prognostic genes, alternative splicing – a posttranscriptional regulatory mechanism that affects the functional form of a protein due to inclusion or exclusion of individual exons giving rise to alternative protein products, has increasingly gained attention due to the prevalence of splicing aberrations in cancer transcriptomes. Hence, uncovering the potential prognostic exons can not only help in rationally designing exon-specific therapeutics but also increase specificity toward more personalized treatment options. To address this gap and to provide a platform for rational identification of prognostic exons from cancer transcriptomes, we developed ExSurv (https://exsurv.soic.iupui.edu), a web-based platform for predicting the survival contribution of all annotated exons in the human genome using RNA sequencing-based expression profiles for cancer samples from four cancer types available from The Cancer Genome Atlas. ExSurv enables users to search for a gene of interest and shows survival probabilities for all the exons associated with a gene and found to be significant at the chosen threshold. ExSurv also includes raw expression values across the cancer cohort as well as the survival plots for prognostic exons. Our analysis of the resulting prognostic exons across four cancer types revealed that most of the survival-associated exons are unique to a cancer type with few processes such as cell adhesion, carboxylic, fatty acid metabolism, and regulation of T-cell signaling common across cancer types, possibly suggesting significant differences in the posttranscriptional regulatory pathways contributing to prognosis. PMID:27528797
Diniz, Erik Trovão; Jorge, Alexander A L; Arnhold, Ivo J P; Rosenbloom, Arlan L; Bandeira, Francisco
2008-11-01
To date, about sixty different mutations within GH receptor (GHR) gene have been described in patients with GH insensitivity syndrome (GHI). In this report, we described a novel nonsense mutation of GHR. The patient was evaluated at the age of 6 yr, for short stature associated to clinical phenotype of GHI. GH, IGF-1, and GHBP levels were determined. The PCR products from exons 2-10 were sequenced. The patient had high GH (26 microg/L), low IGF-1 (22.5 ng/ml) and undetectable GHBP levels. The sequencing of GHR exon 5 disclosed adenine duplication at nucleotide 338 of GHR coding sequence (c.338dupA) in homozygous state. We described a novel mutation that causes a truncated GHR and a loss of receptor function due to the lack of amino acids comprising the transmembrane and intracellular regions of GHR protein, leading to GHI.
Circular RNAs and hereditary bone diseases.
Zhai, Naixiang; Lu, Yanqin; Wang, Yanzhou; Ren, Xiuzhi; Han, Jinxiang
2018-02-01
Circular RNA (circRNA) is a non-linear form of RNA derived from exonic, intronic, and exon-intron gene regions. circRNAs are characterized by covalent closed loops, highly stable nuclease resistance, and specific expression in species and developmental stages. CircRNA molecules have been identified as playing roles in the regulation of cell transcription, transcriptional expression after translation, interactions with microRNAs, and protein coding. A high stability and tissue- and disease-specific expression allow circRNAs to serve as potential biomarkers both for diseases and prognosis. CircRNAs function in bone remodeling by directly participating in bone-related signaling pathways and by forming the circRNA-miRNA-mRNA axis. Studies have seldom reported on the low incidence of circRNAs in genetic bone disorders. The current study reviews the characteristics of circRNAs and recent research on their role in rare hereditary bone diseases.
Permanent Neonatal Diabetes Caused by Creation of an Ectopic Splice Site within the INS Gene
Gastaldo, Elena; Harries, Lorna W.; Rubio-Cabezas, Oscar; Castaño, Luis
2012-01-01
Background The aim of this study was to characterize the genetic etiology in a patient who presented with permanent neonatal diabetes at 2 months of age. Methodology/Principal Findings Regulatory elements and coding exons 2 and 3 of the INS gene were amplified and sequenced from genomic and complementary DNA samples. A novel heterozygous INS mutation within the terminal intron of the gene was identified in the proband and her affected father. This mutation introduces an ectopic splice site leading to the insertion of 29 nucleotides from the intronic sequence into the mature mRNA, which results in a longer and abnormal transcript. Conclusions/Significance This study highlights the importance of routinely sequencing the exon-intron boundaries and the need to carry out additional studies to confirm the pathogenicity of any identified intronic genetic variants. PMID:22235272
Lourenco-Jaramillo, Diana Lelidett; Sifuentes-Rincón, Ana María; Parra-Bracamonte, Gaspar Manuel; de la Rosa-Reyna, Xochitl Fabiola; Segura-Cabrera, Aldo; Arellano-Vera, Williams
2012-01-01
DNA from four cattle breeds was used to re-sequence all of the exons and 56% of the introns of the bovine tyrosine hydroxylase (TH) gene and 97% and 13% of the bovine dopamine β-hydroxylase (DBH) coding and non-coding sequences, respectively. Two novel single nucleotide polymorphisms (SNPs) and a microsatellite motif were found in the TH sequences. The DBH sequences contained 62 nucleotide changes, including eight non-synonymous SNPs (nsSNPs) that are of particular interest because they may alter protein function and therefore affect the phenotype. These DBH nsSNPs resulted in amino acid substitutions that were predicted to destabilize the protein structure. Six SNPs (one from TH and five from DBH non-synonymous SNPs) were genotyped in 140 animals; all of them were polymorphic and had a minor allele frequency of > 9%. There were significant differences in the intra- and inter-population haplotype distributions. The haplotype differences between Brahman cattle and the three B. t. taurus breeds (Charolais, Holstein and Lidia) were interesting from a behavioural point of view because of the differences in temperament between these breeds. PMID:22888292
Quek, Richard; Farid, Mohamad; Kanjanapan, Yada; Lim, Cindy; Tan, Iain Beehuat; Kesavan, Sittampalam; Lim, Tony Kiat Hon; Oon, Lynette Lin-Ean; Goh, Brian Kp; Chan, Weng Hoong; Teo, Melissa; Chung, Alexander Yf; Ong, Hock Soo; Wong, Wai Keong; Tan, Patrick; Yip, Desmond
2017-06-01
Benefit of adjuvant imatinib therapy following curative resection in patients with intermediate-risk gastrointestinal stromal tumor (GIST) is unclear. GIST-specific exon mutations, in particular exon 11 deletions, have been shown to be prognostic. We hypothesize that specific KIT mutations may improve risk stratification in patients with intermediate-risk GIST, identifying a subgroup of patients who may benefit from adjuvant therapy. In total, 142 GIST patients with complete clinicopathologic and mutational data from two sites were included. Risk classification was based on the modified National Institute of Health (NIH) criteria. In this cohort, 74% (n = 105) of patients harbored a KIT mutation; 61% (n = 86) were found in exon 11 of which nearly 70% were KIT exon 11 deletions (n = 60). A total of 18% (n = 25) of cases were classified as having intermediate-risk disease. Univariate analysis confirmed tumor size, mitotic index, nongastric origin, presence of tumor rupture and modified NIH criteria were adversely prognostic for relapse-free survival (RFS). Among KIT/PDGFRA mutants, KIT exon 11 deletions had a significantly worse prognosis (hazard ratio 2.31; 95% confidence interval, 1.30-4.10; P = 0.003). Multivariate analysis confirmed KIT exon 11 deletion (P = 0.003) and clinical risk classification (P < 0.001) as independent adverse prognostic factors for RFS. Intermediate-risk patients harboring KIT exon 11 deletions had RFS outcomes similar to high-risk patients. The presence of KIT exon 11 deletion mutation in patients with intermediate-risk GIST is associated with an inferior clinical outcome with RFS similar to high-risk patients. © 2016 John Wiley & Sons Australia, Ltd.
Hypoparathyroidism-retardation-Dysmorphism (HRD) syndrome--a review.
Hershkovitz, Eli; Parvari, Ruti; Diaz, George A; Gorodischer, Rafael
2004-12-01
Hypoparathyroidism, retardation, and dysmorphism (HRD) is a newly recognized genetic syndrome, described in patients of Arab origin. The syndrome consists of permanent congenital hypoparathyroidism, severe prenatal and postnatal growth retardation, and profound global developmental delay. The patients are susceptible to severe infections including life-threatening pneumococcal infections especially during infancy. The main dysmorphic features are microcephaly, deep-set eyes or microphthalmia, ear abnormalities, depressed nasal bridge, thin upper lip, hooked small nose, micrognathia, and small hands and feet. A single 12-bp deletion (del52-55) in the second coding exon of the tubulin cofactor E (TCFE) gene, located on the long arm of chromosome 1, is the cause of HRD among Arab patients. Early recognition and therapy of hypocalcemia is important as is daily antibiotic prophylaxis against pneumococcal infections.
NASA Astrophysics Data System (ADS)
Ma, Ruiqin; He, Feng; Wen, Haishen; Li, Jifang; Shi, Bao; Shi, Dan; Liu, Miao; Mu, Weijie; Zhang, Yuanqing; Hu, Jian; Han, Weiguo; Zhang, Jianan; Wang, Qingqing; Yuan, Yuren; Liu, Qun
2012-03-01
As a specific gene of fish, cytochrome P450c17-II ( CYP17-II) gene plays a key role in the growth, development an reproduction level of fish. In this study, the single-stranded conformational polymorphism (SSCP) technique was used to characterize polymorphisms within the coding region of CYP17-II gene in a population of 75 male Japanese flounder ( Paralichthys olivaceus). Three single nucleotide polymorphisms (SNPs) were identified in CYP17-II gene of Japanese flounder. They were c.G594A (p.G188R), c.G939A and c.G1502A (p.G490D). SNP1 (c.G594A), located in exon 4 of CYP17-II gene, was significantly associated with gonadosomatic index (GSI). Individuals with genotype GG of SNP1 had significantly lower GSI ( P < 0.05) than those with genotype AA or AG. SNP2 (c.G939A) located at the CpG island of CYP17-II gene. The mutation changed the methylation of exon 6. Individuals with genotype AA of SNP2 had significantly lower serum testosterone (T) level and hepatosomatic index (HSI) compared to those with genotype GG. The results suggested that SNP2 could influence the reproductive endocrine of male Japanese flounder. However, the SNP3 (c.G1502A) located in exon 9 did not affect the four measured reproductive traits. This study showed that CYP17-II gene could be a potentially useful candidate gene for the research of genetic breeding and physiological aspects of Japanese flounder.
Pastor, André F; Rodrigues Moura, Laís; Neto, José W D; Nascimento, Eduardo J M; Calzavara-Silva, Carlos E; Gomes, Ana Lisa V; Silva, Ana Maria da; Cordeiro, Marli T; Braga-Neto, Ulisses; Crovella, Sergio; Gil, Laura H V G; Marques, Ernesto T A; Acioli-Santos, Bartolomeu
2013-09-01
Four genetic polymorphisms located at the promoter (C-257T) and coding regions of CFH gene (exon 2 G257A, exon 14 A2089G and exon 19 G2881T) were investigated in 121 dengue patients (DENV-3) in order to assess the relationship between allele/haplotypes variants and clinical outcomes. A statistical value was found between the CFH-257T allele (TT/TC genotypes) and reduced susceptibility to severe dengue (SD). Statistical associations indicate that individuals bearing a T allele presented significantly higher protein levels in plasma. The -257T variant is located within a NF-κB binding site, suggesting that this variant might have effect on the ability of the CFH gene to respond to signals via the NF-κB pathway. The G257A allelic variant showed significant protection against severe dengue. When CFH haplotypes effect was considered, the ancestral CG/CG promoter-exon 2 SNP genotype showed significant risk to SD either in a general comparison (ancestral × all variant genotypes), as well as in individual genotypes comparison (ancestral × each variant genotype), where the most prevalent effect was observed in the CG/CG × CA/TG comparison. These findings support the involvement of -257T, 257A allele variants and haplotypes on severe dengue phenotype protection, related with high basal CFH expression. Copyright © 2013 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Callén, E; Tischkowitz, M D; Creus, A; Marcos, R; Bueren, J A; Casado, J A; Mathew, C G; Surrallés, J
2004-01-01
Fanconi anaemia is an autosomal recessive disease characterized by chromosome fragility, multiple congenital abnormalities, progressive bone marrow failure and a high predisposition to develop malignancies. Most of the Fanconi anaemia patients belong to complementation group FA-A due to mutations in the FANCA gene. This gene contains 43 exons along a 4.3-kb coding sequence with a very heterogeneous mutational spectrum that makes the mutation screening of FANCA a difficult task. In addition, as the FANCA gene is rich in Alu sequences, it was reported that Alu-mediated recombination led to large intragenic deletions that cannot be detected in heterozygous state by conventional PCR, SSCP analysis, or DNA sequencing. To overcome this problem, a method based on quantitative fluorescent multiplex PCR was proposed to detect intragenic deletions in FANCA involving the most frequently deleted exons (exons 5, 11, 17, 21 and 31). Here we apply the proposed method to detect intragenic deletions in 25 Spanish FA-A patients previously assigned to complementation group FA-A by FANCA cDNA retroviral transduction. A total of eight heterozygous deletions involving from one to more than 26 exons were detected. Thus, one third of the patients carried a large intragenic deletion that would have not been detected by conventional methods. These results are in agreement with previously published data and indicate that large intragenic deletions are one of the most frequent mutations leading to Fanconi anaemia. Consequently, this technology should be applied in future studies on FANCA to improve the mutation detection rate. Copyright 2003 S. Karger AG, Basel
Glucocorticoid receptor represses brain-derived neurotrophic factor expression in neuron-like cells.
Chen, Hui; Lombès, Marc; Le Menuet, Damien
2017-04-12
Brain-derived neurotrophic factor (BDNF) is involved in many functions such as neuronal growth, survival, synaptic plasticity and memorization. Altered expression levels are associated with many pathological situations such as depression, epilepsy, Alzheimer's, Huntington's and Parkinson's diseases. Glucocorticoid receptor (GR) is also crucial for neuron functions, via binding of glucocorticoid hormones (GCs). GR actions largely overlap those of BDNF. It has been proposed that GR could be a regulator of BDNF expression, however the molecular mechanisms involved have not been clearly defined yet. Herein, we analyzed the effect of a GC agonist dexamethasone (DEX) on BDNF expression in mouse neuronal primary cultures and in the newly characterized, mouse hippocampal BZ cell line established by targeted oncogenesis. Mouse Bdnf gene exhibits a complex genomic structure with 8 untranslated exons (I to VIII) splicing onto one common and unique coding exon IX. We found that DEX significantly downregulated total BDNF mRNA expression by around 30%. Expression of the highly expressed exon IV and VI containing transcripts was also reduced by DEX. The GR antagonist RU486 abolished this effect, which is consistent with specific GR-mediated action. Transient transfection assays allowed us to define a short 275 bp region within exon IV promoter responsible for GR-mediated Bdnf repression. Chromatin immunoprecipitation experiments demonstrated GR recruitment onto this fragment, through unidentified transcription factor tethering. Altogether, GR downregulates Bdnf expression through direct binding to Bdnf regulatory sequences. These findings bring new insights into the crosstalk between GR and BDNF signaling pathways both playing a major role in physiology and pathology of the central nervous system.
A splice variant in the ACSL5 gene relates migraine with fatty acid activation in mitochondria
Matesanz, Fuencisla; Fedetz, María; Barrionuevo, Cristina; Karaky, Mohamad; Catalá-Rabasa, Antonio; Potenciano, Victor; Bello-Morales, Raquel; López-Guerrero, Jose-Antonio; Alcina, Antonio
2016-01-01
Genome-wide association studies (GWAS) in migraine are providing the molecular basis of this heterogeneous disease, but the understanding of its aetiology is still incomplete. Although some biomarkers have currently been accepted for migraine, large amount of studies for identifying new ones is needed. The migraine-associated variant rs12355831:A>G (P=2 × 10−6), described in a GWAS of the International Headache Genetic Consortium, is localized in a non-coding sequence with unknown function. We sought to identify the causal variant and the genetic mechanism involved in the migraine risk. To this end, we integrated data of RNA sequences from the Genetic European Variation in Health and Disease (GEUVADIS) and genotypes from 1000 GENOMES of 344 lymphoblastoid cell lines (LCLs), to determine the expression quantitative trait loci (eQTLs) in the region. We found that the migraine-associated variant belongs to a linkage disequilibrium block associated with the expression of an acyl-coenzyme A synthetase 5 (ACSL5) transcript lacking exon 20 (ACSL5-Δ20). We showed by exon-skipping assay a direct causality of rs2256368-G in the exon 20 skipping of approximately 20 to 40% of ACSL5 RNA molecules. In conclusion, we identified the functional variant (rs2256368:A>G) affecting ACSL5 exon 20 skipping, as a causal factor linked to the migraine-associated rs12355831:A>G, suggesting that the activation of long-chain fatty acids by the spliced ACSL5-Δ20 molecules, a mitochondrial located enzyme, is involved in migraine pathology. PMID:27189022
Non-exomic and synonymous variants in ABCA4 are an important cause of Stargardt disease
Braun, Terry A.; Mullins, Robert F.; Wagner, Alex H.; Andorf, Jeaneen L.; Johnston, Rebecca M.; Bakall, Benjamin B.; Deluca, Adam P.; Fishman, Gerald A.; Lam, Byron L.; Weleber, Richard G.; Cideciyan, Artur V.; Jacobson, Samuel G.; Sheffield, Val C.; Tucker, Budd A.; Stone, Edwin M.
2013-01-01
Mutations in ABCA4 cause Stargardt disease and other blinding autosomal recessive retinal disorders. However, sequencing of the complete coding sequence in patients with clinical features of Stargardt disease sometimes fails to detect one or both mutations. For example, among 208 individuals with clear clinical evidence of ABCA4 disease ascertained at a single institution, 28 had only one disease-causing allele identified in the exons and splice junctions of the primary retinal transcript of the gene. Haplotype analysis of these 28 probands revealed 3 haplotypes shared among ten families, suggesting that 18 of the 28 missing alleles were rare enough to be present only once in the cohort. We hypothesized that mutations near rare alternate splice junctions in ABCA4 might cause disease by increasing the probability of mis-splicing at these sites. Next-generation sequencing of RNA extracted from human donor eyes revealed more than a dozen alternate exons that are occasionally incorporated into the ABCA4 transcript in normal human retina. We sequenced the genomic DNA containing 15 of these minor exons in the 28 one-allele subjects and observed five instances of two different variations in the splice signals of exon 36.1 that were not present in normal individuals (P < 10−6). Analysis of RNA obtained from the keratinocytes of patients with these mutations revealed the predicted alternate transcript. This study illustrates the utility of RNA sequence analysis of human donor tissue and patient-derived cell lines to identify mutations that would be undetectable by exome sequencing. PMID:23918662
Sarkar, Debina; Oghabian, Ali; Bodiyabadu, Pasani K; Joseph, Wayne R; Leung, Euphemia Y; Finlay, Graeme J; Baguley, Bruce C; Askarian-Amiri, Marjan E
2017-06-27
The long non-coding RNA ANRIL , antisense to the CDKN2B locus, is transcribed from a gene that encompasses multiple disease-associated polymorphisms. Despite the identification of multiple isoforms of ANRIL , expression of certain transcripts has been found to be tissue-specific and the characterisation of ANRIL transcripts remains incomplete. Several functions have been associated with ANRIL . In our judgement, studies on ANRIL functionality are premature pending a more complete appreciation of the profusion of isoforms. We found differential expression of ANRIL exons, which indicates that multiple isoforms exist in melanoma cells. In addition to linear isoforms, we identified circular forms of ANRIL ( circANRIL ). Further characterisation of circANR IL in two patient-derived metastatic melanoma cell lines (NZM7 and NZM37) revealed the existence of a rich assortment of circular isoforms. Moreover, in the two melanoma cell lines investigated, the complements of circANRIL isoforms were almost completely different. Novel exons were also discovered. We also found the family of linear ANRIL was enriched in the nucleus, whilst the circular isoforms were enriched in the cytoplasm and they differed markedly in stability. With respect to the variable processing of circANRIL species, bioinformatic analysis indicated that intronic Arthrobacter luteus (Alu) restriction endonuclease inverted repeats and exon skipping were not involved in selection of back-spliced exon junctions. Based on our findings, we hypothesise that " ANRIL " has wholly distinct dual sets of functions in melanoma. This reveals the dynamic nature of the locus and constitutes a basis for investigating the functions of ANRIL in melanoma.
Efficient exon skipping of SGCG mutations mediated by phosphorodiamidate morpholino oligomers.
Wyatt, Eugene J; Demonbreun, Alexis R; Kim, Ellis Y; Puckelwartz, Megan J; Vo, Andy H; Dellefave-Castillo, Lisa M; Gao, Quan Q; Vainzof, Mariz; Pavanello, Rita C M; Zatz, Mayana; McNally, Elizabeth M
2018-05-03
Exon skipping uses chemically modified antisense oligonucleotides to modulate RNA splicing. Therapeutically, exon skipping can bypass mutations and restore reading frame disruption by generating internally truncated, functional proteins to rescue the loss of native gene expression. Limb-girdle muscular dystrophy type 2C is caused by autosomal recessive mutations in the SGCG gene, which encodes the dystrophin-associated protein γ-sarcoglycan. The most common SGCG mutations disrupt the transcript reading frame abrogating γ-sarcoglycan protein expression. In order to treat most SGCG gene mutations, it is necessary to skip 4 exons in order to restore the SGCG transcript reading frame, creating an internally truncated protein referred to as Mini-Gamma. Using direct reprogramming of human cells with MyoD, myogenic cells were tested with 2 antisense oligonucleotide chemistries, 2'-O-methyl phosphorothioate oligonucleotides and vivo-phosphorodiamidate morpholino oligomers, to induce exon skipping. Treatment with vivo-phosphorodiamidate morpholino oligomers demonstrated efficient skipping of the targeted exons and corrected the mutant reading frame, resulting in the expression of a functional Mini-Gamma protein. Antisense-induced exon skipping of SGCG occurred in normal cells and those with multiple distinct SGCG mutations, including the most common 521ΔT mutation. These findings demonstrate a multiexon-skipping strategy applicable to the majority of limb-girdle muscular dystrophy 2C patients.
PANAGOPOULOS, IOANNIS; GORUNOVA, LUDMILA; BJERKEHAGEN, BODIL; LOBMAIER, INGVILD; HEIM, SVERRE
2015-01-01
Lipomas are the most common soft tissue tumors in adults. They often carry chromosome aberrations involving 12q13~15 leading to rearrangements of the HMGA2 gene in 12q14.3, with breakpoints occurring within or outside of the gene. Here, we present eleven lipomas and one osteochondrolipoma with a novel recurrent chromosome aberration, t(12;18) (q14~15;q12~21). Molecular studies on eight of the tumors showed that full-length HMGA2 transcript was expressed in three and a chimeric HMGA2 transcript in five of them. In three lipomas and in the osteochondrolipoma, exons 1–3 of HMGA2 were fused to a sequence of SETBP1 on 18q12.3 or an intragenic sequence from 18q12.3 circa 10 kbp distal to SETBP1. In another lipoma, exons 1–4 of HMGA2 were fused to an intronic sequence of GRIP1 which maps to chromosome band 12q14.3, distal to HMGA2. The ensuing HMGA2 fusion transcripts code for putative proteins which contain amino acid residues of HMGA2 corresponding to exons 1–3 (or exons 1–4 in one case) followed by amino acid residues corresponding to the fused sequences. Thus, the pattern is similar to the rearrangements of HMGA2 found in other lipomas, i.e., disruption of the HMGA2 locus leaves intact exons 1–3 which encode the AT-hooks domains and separates them from the 3′-terminal part of the gene. The fact that the examined osteochondrolipoma had a t(12;18) and a HMGA2-SETBP1 fusion identical to the findings in the much more common ordinary lipomas, underscores the close developmental relationship between the two tumor types. PMID:26202160
The developmental transcriptome of Drosophila melanogaster
DOE Office of Scientific and Technical Information (OSTI.GOV)
University of Connecticut; Graveley, Brenton R.; Brooks, Angela N.
Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, predictionmore » and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development. Drosophila melanogaster is an important non-mammalian model system that has had a critical role in basic biological discoveries, such as identifying chromosomes as the carriers of genetic information and uncovering the role of genes in development. Because it shares a substantial genic content with humans, Drosophila is increasingly used as a translational model for human development, homeostasis and disease. High-quality maps are needed for all functional genomic elements. Previous studies demonstrated that a rich collection of genes is deployed during the life cycle of the fly. Although expression profiling using microarrays has revealed the expression of, 13,000 annotated genes, it is difficult to map splice junctions and individual base modifications generated by RNA editing using such approaches. Single-base resolution is essential to define precisely the elements that comprise the Drosophila transcriptome. Estimates of the number of transcript isoforms are less accurate than estimates of the number of genes. Whereas, 20% of Drosophila genes are annotated as encoding alternatively spliced premRNAs, splice-junction microarray experiments indicate that this number is at least 40% (ref. 7). Determining the diversity of mRNAs generated by alternative promoters, alternative splicing and RNA editing will substantially increase the inferred protein repertoire. Non-coding RNA genes (ncRNAs) including short interfering RNAs (siRNAs) and microRNAS (miRNAs) (reviewed in ref. 10), and longer ncRNAs such as bxd (ref. 11) and rox (ref. 12), have important roles in gene regulation, whereas others such as small nucleolar RNAs (snoRNAs)and small nuclear RNAs (snRNAs) are important components of macromolecular machines such as the ribosome and spliceosome. The transcription and processing of these ncRNAs must also be fully documented and mapped. As part of the modENCODE project to annotate the functional elements of the D. melanogaster and Caenorhabditis elegans genomes, we used RNA-Seq and tiling microarrays to sample the Drosophila transcriptome at unprecedented depth throughout development from early embryo to ageing male and female adults. We report on a high-resolution view of the discovery, structure and dynamic expression of the D. melanogaster transcriptome.« less
Poon, Kok Siong; Sng, Andrew Anjian; Ho, Cindy Weili; Koay, Evelyn Siew-Chuan
2015-01-01
Loss-of-function mutations in the phosphate regulating gene with homologies to endopeptidases on the X-chromosome (PHEX) have been causally associated with X-linked hypophosphatemic rickets (XLHR). The early diagnosis of XLHR in infants is challenging when it is based solely on clinical features and biochemical findings. We report a 7-month-old boy with a family history of hypophosphatemic rickets., who demonstrated early clinical evidence of rickets, although serial biochemical findings could not definitively confirm rickets. A sequencing assay targeting the PHEX gene was first performed on the mother’s DNA to screen for mutations in the 5′UTR, 22 coding exons, and the exon-intron junctions. Targeted mutation analysis and mRNA studies were subsequently performed on the boys’ DNA to investigate the pathogenicity of the identified mutation. Genetic screening of the PHEX gene revealed a novel mutation, c.1080-2A>C, at the splice acceptor site in intron 9. The detection of an aberrant mRNA transcript with skipped (loss of) exon 10 establishes its pathogenicity and confirms the diagnosis of XLHR in this infant. Genetic testing of the PHEX gene resulted in early diagnosis of XLHR, thus enabling initiation of therapy and prevention of progressive rachitic changes in the infant. PMID:26904698
Kapahnke, Marcel; Banning, Antje; Tikkanen, Ritva
2016-12-14
The clustered regularly interspaced short palindromic repeats (CRISPR)-associated sequence 9 (CRISPR/Cas9) system is widely used for genome editing purposes as it facilitates an efficient knockout of a specific gene in, e.g. cultured cells. Targeted double-strand breaks are introduced to the target sequence of the guide RNAs, which activates the cellular DNA repair mechanism for non-homologous-end-joining, resulting in unprecise repair and introduction of small deletions or insertions. Due to this, sequence alterations in the coding region of the target gene frequently cause frame-shift mutations, facilitating degradation of the mRNA. We here show that such CRISPR/Cas9-mediated alterations in the target exon may also result in altered splicing of the respective pre-mRNA, most likely due to mutations of splice-regulatory sequences. Using the human FLOT-1 gene as an example, we demonstrate that such altered splicing products also give rise to aberrant protein products. These may potentially function as dominant-negative proteins and thus interfere with the interpretation of the data generated with these cell lines. Since most researchers only control the consequences of CRISPR knockout at genomic and protein level, our data should encourage to also check the alterations at the mRNA level.
Pleiotropic biological activities of alternatively spliced TMPRSS2/ERG fusion gene transcripts
Wang, Jianghua; Cai, Yi; Yu, Wendong; Ren, Chengxi; Spencer, David M.; Ittmann, Michael
2008-01-01
TMPRSS2/ERG gene fusions are found in the majority of prostate cancers; however, there is significant heterogeneity in the 5′ region of the alternatively spliced fusion gene transcripts. We have found that there is also significant heterogeneity within the coding exons as well. There is variable inclusion of a 72-bp exon and other novel alternatively spliced isoforms. To assess the biological significance of these alternatively spliced transcripts, we expressed various transcripts in primary prostatic epithelial cells and in an immortalized prostatic epithelial cell line, PNT1a. The fusion gene transcripts promoted proliferation, invasion and motility with variable activities that depended on the structure of the 5′ region encoding the TMPRSS2/ERG fusion and the presence of the 72-bp exon. Cotransfection of different isoforms further enhanced biological activity, mimicking the situation in vivo, in which multiple isoforms are expressed. Finally, knockdown of the fusion gene in VCaP cells resulted in inhibition of proliferation in vitro and tumor progression in an in vivo orthotopic mice model. Our results indicate that TMPRSS2/ERG fusion isoforms have variable biological activities promoting tumor initiation and progression and are consistent with our previous clinical observations indicating that certain TMPRSS2/ERG fusion isoforms are significantly correlated with more aggressive disease. PMID:18922926
Nakada, Satoko; Minato, Hiroshi; Nojima, Takayuki
2016-07-01
Investigations on the NAB2-STAT6 fusion gene in solitary fibrous tumors (SFTs) and hemangiopericytomas (HPCs) have increased since its discovery in 2013. Although several SFTs reported without NAB2-STAT6 fusion gene analysis, we reviewed 546 SFTs/HPCs with NAB2-STAT6 fusion gene analysis in this study and investigated differences between the gene variants. In total, 452 cases tested positive for the NAB2-STAT6 fusion gene, with more than 40 variants being detected. The most frequent of these were NAB2 exon 6-STAT6 exon 16/17/18 and NAB2 exon 4-STAT6 exon 2/3, with the former occurring most frequently in SFTs in meninges, soft tissues, and head and neck; the latter predominated in SFTs in the pleura and lung. There was no difference between the histology of SFTs and fusion gene variants. A follow-up analysis of SFTs showed that 51 of 202 cases had a recurrence, with 18 of 53 meningeal SFTs having a local recurrence and/or metastasis within 0-19 years. In meninges and soft tissue, SFTs with the NAB2 exon 6-STAT6 exon 16/17/18 tended to recur more frequently than SFTs with the NAB2 exon 4-STAT6 exon 2/3. Clinicopathological data, including yearly follow-ups, are required for meningeal SFTs/HPCs to define the correlation of variants of NAB2-STAT6 fusion gene.
Diane Dietrich; Casey Crooks
2009-01-01
A pyranose 2-oxidase gene from the brown-rot basidiomycete Gloeophyllum trabeum was isolated using homology-based degenerate PCR. The gene structure was determined and compared to that of several pyranose 2-oxidases cloned from white-rot fungi. The G. trabeum pyranose 2-oxidase gene consists of 16 coding exons with canonical promoter CAAT and TATA elements in the 5âUTR...
Takeda, Masayuki; Sakai, Kazuko; Hayashi, Hidetoshi; Tanaka, Kaoru; Tanizaki, Junko; Takahama, Takayuki; Haratani, Koji; Nishio, Kazuto; Nakagawa, Kazuhiko
2018-04-20
Unlike common epidermal growth factor receptor gene ( EGFR ) mutations that confer sensitivity to tyrosine kinase inhibitors (TKIs) in non-small cell lung cancer (NSCLC), mutations in exon 20 of either EGFR or the human EGFR2 gene ( HER2 ) are associated with insensitivity to EGFR-TKIs, with treatment options for patients with such mutations being limited. Clinical characteristics, outcome of EGFR-TKI or nivolumab treatment, and the presence of coexisting mutations were reviewed for NSCLC patients with exon-20 mutations of EGFR or HER2 as detected by routine application of an amplicon-based next-generation sequencing panel. Between July 2013 and June 2017, 206 patients with pathologically confirmed lung cancer were screened for genetic alterations including HER2 and EGFR mutations. Ten patients harbored HER2 exon-20 insertions (one of whom also carried an exon-19 deletion of EGFR ), and 12 patients harbored EGFR exon-20 mutations. Five of the 13 patients with EGFR mutations were treated with EGFR-TKIs, two of whom manifested a partial response, two stable disease, and one progressive disease. Among the seven patients treated with nivolumab, one patient manifested a partial response, three stable disease, and three progressive disease, with most (86%) of these patients discontinuing treatment as a result of disease progression within 4 months. The H1047R mutation of PIK3CA detected in one patient was the only actionable mutation coexisting with the exon-20 mutations of EGFR or HER2 . Potentially actionable mutations thus rarely coexist with exon-20 mutations of EGFR or HER2 , and EGFR-TKIs and nivolumab show limited efficacy in patients with such exon-20 mutations.
A novel first exon directs hormone-sensitive transcription of the pig prolactin receptor
USDA-ARS?s Scientific Manuscript database
Endocrine, paracrine, and autocrine prolactin (PRL) acts through its receptor (PRLR) to confer a wide range of biological functions, including its established role during lactation.We have identified a novel first exon of the porcine PRLR that gives rise to three different mRNA transcripts. Transcri...
Kramer, Marianne C.; Liang, Dongming; Tatomer, Deirdre C.; Gold, Beth; March, Zachary M.; Cherry, Sara; Wilusz, Jeremy E.
2015-01-01
Thousands of eukaryotic protein-coding genes are noncanonically spliced to produce circular RNAs. Bioinformatics has indicated that long introns generally flank exons that circularize in Drosophila, but the underlying mechanisms by which these circular RNAs are generated are largely unknown. Here, using extensive mutagenesis of expression plasmids and RNAi screening, we reveal that circularization of the Drosophila laccase2 gene is regulated by both intronic repeats and trans-acting splicing factors. Analogous to what has been observed in humans and mice, base-pairing between highly complementary transposable elements facilitates backsplicing. Long flanking repeats (∼400 nucleotides [nt]) promote circularization cotranscriptionally, whereas pre-mRNAs containing minimal repeats (<40 nt) generate circular RNAs predominately after 3′ end processing. Unlike the previously characterized Muscleblind (Mbl) circular RNA, which requires the Mbl protein for its biogenesis, we found that Laccase2 circular RNA levels are not controlled by Mbl or the Laccase2 gene product but rather by multiple hnRNP (heterogeneous nuclear ribonucleoprotein) and SR (serine–arginine) proteins acting in a combinatorial manner. hnRNP and SR proteins also regulate the expression of other Drosophila circular RNAs, including Plexin A (PlexA), suggesting a common strategy for regulating backsplicing. Furthermore, the laccase2 flanking introns support efficient circularization of diverse exons in Drosophila and human cells, providing a new tool for exploring the functional consequences of circular RNA expression across eukaryotes. PMID:26450910
Rau, Frédérique; Lainé, Jeanne; Ramanoudjame, Laetitita; Ferry, Arnaud; Arandel, Ludovic; Delalande, Olivier; Jollet, Arnaud; Dingli, Florent; Lee, Kuang-Yung; Peccate, Cécile; Lorain, Stéphanie; Kabashi, Edor; Athanasopoulos, Takis; Koo, Taeyoung; Loew, Damarys; Swanson, Maurice S; Le Rumeur, Elisabeth; Dickson, George; Allamand, Valérie; Marie, Joëlle; Furling, Denis
2015-05-28
Myotonic Dystrophy type 1 (DM1) is a dominant neuromuscular disease caused by nuclear-retained RNAs containing expanded CUG repeats. These toxic RNAs alter the activities of RNA splicing factors resulting in alternative splicing misregulation and muscular dysfunction. Here we show that the abnormal splicing of DMD exon 78 found in dystrophic muscles of DM1 patients is due to the functional loss of MBNL1 and leads to the re-expression of an embryonic dystrophin in place of the adult isoform. Forced expression of embryonic dystrophin in zebrafish using an exon-skipping approach severely impairs the mobility and muscle architecture. Moreover, reproducing Dmd exon 78 missplicing switch in mice induces muscle fibre remodelling and ultrastructural abnormalities including ringed fibres, sarcoplasmic masses or Z-band disorganization, which are characteristic features of dystrophic DM1 skeletal muscles. Thus, we propose that splicing misregulation of DMD exon 78 compromises muscle fibre maintenance and contributes to the progressive dystrophic process in DM1.
Rau, Frédérique; Lainé, Jeanne; Ramanoudjame, Laetitita; Ferry, Arnaud; Arandel, Ludovic; Delalande, Olivier; Jollet, Arnaud; Dingli, Florent; Lee, Kuang-Yung; Peccate, Cécile; Lorain, Stéphanie; Kabashi, Edor; Athanasopoulos, Takis; Koo, Taeyoung; Loew, Damarys; Swanson, Maurice S.; Le Rumeur, Elisabeth; Dickson, George; Allamand, Valérie; Marie, Joëlle; Furling, Denis
2015-01-01
Myotonic Dystrophy type 1 (DM1) is a dominant neuromuscular disease caused by nuclear-retained RNAs containing expanded CUG repeats. These toxic RNAs alter the activities of RNA splicing factors resulting in alternative splicing misregulation and muscular dysfunction. Here we show that the abnormal splicing of DMD exon 78 found in dystrophic muscles of DM1 patients is due to the functional loss of MBNL1 and leads to the re-expression of an embryonic dystrophin in place of the adult isoform. Forced expression of embryonic dystrophin in zebrafish using an exon-skipping approach severely impairs the mobility and muscle architecture. Moreover, reproducing Dmd exon 78 missplicing switch in mice induces muscle fibre remodelling and ultrastructural abnormalities including ringed fibres, sarcoplasmic masses or Z-band disorganization, which are characteristic features of dystrophic DM1 skeletal muscles. Thus, we propose that splicing misregulation of DMD exon 78 compromises muscle fibre maintenance and contributes to the progressive dystrophic process in DM1. PMID:26018658
Piya, Anil; Kaur, Jasmeet; Rice, Alan M
2017-01-01
Summary Cholesterol transport into the mitochondria is required for synthesis of the first steroid, pregnenolone. Cholesterol is transported by the steroidogenic acute regulatory protein (STAR), which acts at the outer mitochondrial membrane prior to its import. Mutations in the STAR protein result in lipoid congenital adrenal hyperplasia (CAH). Although the STAR protein consists of seven exons, biochemical analysis in nonsteroidogenic COS-1 cells showed that the first two were not essential for pregnenolone synthesis. Here, we present a patient with ambiguous genitalia, salt-lossing crisis within two weeks after birth and low cortisol levels. Sequence analysis of the STAR, including the exon–intron boundaries, showed the complete deletion of exon 1 as well as more than 50 nucleotides upstream of STAR promoter. Mitochondrial protein import with the translated protein through synthesis cassette of the mutant STAR lacking exon 1 showed protein translation, but it is less likely to have synthesized without a promoter in our patient. Thus, a full-length STAR gene is necessary for physiological mitochondrial cholesterol transport in vivo. Learning points: STAR exon 1 deletion caused lipoid CAH. Exon 1 substitution does not affect biochemical activity. StAR promoter is responsible for gonadal development. PMID:28458886
NASA Astrophysics Data System (ADS)
Sekar, Nishu; Kulkarni, Rucha; Ozalkar, Sharvari; Prabhu, Yogamaya D.; Renu, Kaviyarasi; Ramgir, Shalaka S.; Abilash, V. G.
2017-11-01
Polycystic ovarian syndrome is the most common heterogenous endocrine disorder in women. Follicle stimulating hormone receptor is associated with normal development as well as maturation of follicles and triggers estrogen production in granulosa cells of the ovary. Inactivating mutation in FSHR gene correlated with reduction of ovarian function in women is due to damage to receptor function. This study aims to investigate whether inactivating mutations, in follicle stimulating hormone receptor gene is related to polycystic ovarian morphology in women with PCOS. Genomic DNA isolated from 15 subjects from Sandhya Hospital, Vellore (10 patients with PCOS and 5 healthy controls) was taken for this study. Patient data included a clinical report, hormonal levels, and ovarian morphological details. DNA isolation was followed by DNA amplification by polymerase chain reaction using Exon 10 A and Exon 10 B primers. The PCR-RFLP analysis was performed using Dde1 restriction enzyme. Here we discuss inactivating mutation found in Exon 10 of FSHR gene in patients with PCOS.The absence of inactivating mutation was observed through PCR-RFLP study on Exon 10A and Exon 10B.
Synthesis, antimicrobial activity and gene structure of a novel member of the dermaseptin B family.
Fleury, Y; Vouille, V; Beven, L; Amiche, M; Wróblewski, H; Delfour, A; Nicolas, P
1998-03-09
Dermaseptins are a family of cationic (Lys-rich) antimicrobial peptides that are abundant in the skin secretions of the arboreal frogs Phyllomedusa bicolor and P. sauvagii. In vitro, these peptides are microbicidal against a wide variety of microorganisms including Gram-positive and Gram-negative bacteria, yeasts, protozoa and fungi. To date, 6 dermaseptin B mature peptides, 24-34 residues long, 2 dermaseptin B cDNAs and 2 gene sequences have been identified in P. bicolor. To assess dermaseptin related genes further, we screened a P. bicolor genomic library with 32P-labeled cDNAs coding either for prepro-dermaseptins B1 or B2 (adenoregulin). A gene sequence was identified that coded a novel dermaseptin B, termed Drg3, which exhibits 23-42% amino acids identities with other members of the family. Analysis of the cDNAs coding precursors for several opioid and antimicrobial peptides originating from the skin of various amphibian species revealed that the 25-residue preproregion of these preproforms are all encoded by conserved nucleotides encompassed by the first coding exon of the Drg3 gene. Synthetic dermaseptin Drg3 exhibited a bactericidal activity towards several species of mollicutes (wall-less eubacteria), firmicutes (Gram-positive eubacteria), and gracilicutes (Gram-negative eubacteria), with minimal inhibitory concentrations (MICs) ranging from 6.25 to 100 microM. Experiments performed on Acholeplasma laidlawii cells revealed that this peptide is membranotropic and that if efficiently depolarizes the plasma membrane.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mebarki, F.; Forest, M.G.; Josso, N.
The androgen insensivity syndrome (AIS) is a recessive X-linked disorder resulting from a deficient function of the androgen receptor (AR). The human AR gene has 3 functional domains: N-terminal encoded by exon 1, DNA-binding domain encoded by exons 2 and 3, and androgen-binding domain encoded by exons 4 to 8. In order to characterize the molecular defects of the AR gene in AIS, the entire coding regions and the intronic bording sequences of the AR gene were amplified by PCR before automatic direct sequencing in 45 patients. Twenty seven different point mutations were found in 32 unrelated AIS patients: 18more » with a complete form (CAIS), 14 with a partial form (PAIS); 18 of these mutations are novel mutations, not published to date. Only 3 mutations were repeatedly found: R804H in 3 families; M780I in 3 families and R774C in 2 families. For 26 patients out of the 32 found to have a mutation, maternal DNA was collected and sequenced: 6 de novo mutations were detected (i.e. 23% of the cases). Finally, no mutation was detected in 13 patients (29%): 7 with CAIS and 6 familial severe PAIS. The latter all presented with perineal hypospadias, micropenis, 4 out of 6 being raised as girl. Diagnosis of AIS in these 13 families in whom no mutation was detected is supported by the following criteria: clinical data, familial history (2 or 3 index cases in the same family), familial segregation of the polymorphic CAG repeat of the AR gene. Mutations in intronic regions or the promoter of the AR gene could not explain all cases of AIS without mutations in the AR coding regions, because AR binding (performed in 9 out of 13) was normal in 6, suggesting the synthesis of an AR protein. This situation led us to speculate that another X-linked factor associated with the AR could be implicated in some cases of AIS.« less
Johnson, Alexander A. T.
2017-01-01
Iron (Fe) uptake in graminaceous plant species occurs via the release and uptake of Fe-chelating compounds known as mugineic acid family phytosiderophores (MAs). In the MAs biosynthetic pathway, nicotianamine aminotransferase (NAAT) and deoxymugineic acid synthase (DMAS) enzymes catalyse the formation of 2’-deoxymugineic acid (DMA) from nicotianamine (NA). Here we describe the identification and characterisation of six TaNAAT and three TaDMAS1 genes in bread wheat (Triticum aestivum L.). The coding sequences of all six TaNAAT homeologs consist of seven exons with ≥88.0% nucleotide sequence identity and most sequence variation present in the first exon. The coding sequences of the three TaDMAS1 homeologs consist of three exons with ≥97.8% nucleotide sequence identity. Phylogenetic analysis revealed that the TaNAAT and TaDMAS1 proteins are most closely related to the HvNAAT and HvDMAS1 proteins of barley and that there are two distinct groups of TaNAAT proteins—TaNAAT1 and TaNAAT2 –that correspond to the HvNAATA and HvNAATB proteins, respectively. Quantitative reverse transcription-PCR analysis revealed that the TaNAAT2 genes are expressed at highest levels in anther tissues whilst the TaNAAT1 and TaDMAS1 genes are expressed at highest levels in root tissues of bread wheat. Furthermore, the TaNAAT1, TaNAAT2 and TaDMAS1 genes were differentially regulated by plant Fe status and their expression was significantly upregulated in root tissues from day five onwards during a seven-day Fe deficiency treatment. The identification and characterization of the TaNAAT1, TaNAAT2 and TaDMAS1 genes provides a valuable genetic resource for improving bread wheat growth on Fe deficient soils and enhancing grain Fe nutrition. PMID:28475636
Haut, Donald D.; Pintel, D. J.
1998-01-01
Alternative splicing of pre-mRNAs plays a critical role in maximizing the coding capacity of the small parvovirus genome. The small-intron region of minute virus of mice (MVM) pre-mRNAs undergoes an unusual pattern of overlapping alternative splicing—using two donors (D1 and D2) and two acceptors (A1 and A2) within a region of 120 nucleotides—that determines the steady-state ratios of the various viral mRNAs. In this report, we show that the determinants that govern excision of the small intron are complex and are also required for efficient definition of the upstream exon. For the MVM small intron in its natural context, the two donors appear to compete for the splicing machinery: the position of D1 favors its usage, while the primary sequence of D2 must be more like the consensus sequence than is D1 to be used efficiently. We have genetically defined the branch points that are used for generation of the major and minor spliced forms and show that recognition of components of the small-intron acceptors is likely to be the dominant determinant in alternative small-intron excision. We have also identified a G-rich intronic enhancer sequence within the small intron that is essential for splicing of the minor form (D2 to A2) but not the major form (D1 to A1) of MVM mRNAs and is required for efficient definition of the upstream NS2-specific exon. In its natural context, the small intron appears to be excised by a mechanism consistent with intron definition. When the MVM small intron is expanded, various parameters of its excision are altered, indicating that critical cis-acting signals are context dependent. Relative use of the donors and acceptors is altered, and the upstream NS2-specific exon is no longer efficiently defined. The fact that definition of the upstream NS2-specific exon can be achieved by the MVM small intron in its natural context, but not when it is expanded, suggests that the multiple determinants that govern definition and excision of the small intron are required, in concert, for upstream exon definition. Our data are consistent with a model in which alternative splicing of the MVM P4-generated pre-mRNAs is governed by a hybrid of intron- and exon-defining mechanisms. PMID:9499034
Polymorphism of BMP4 gene in Indian goat breeds differing in prolificacy.
Sharma, Rekha; Ahlawat, Sonika; Maitra, A; Roy, Manoranjan; Mandakmale, S; Tantia, M S
2013-12-10
Bone morphogenetic proteins (BMPs) are members of the TGF-β (transforming growth factor-beta) superfamily, of which BMP4 is the most important due to its crucial role in follicular growth and differentiation, cumulus expansion and ovulation. Reproduction is a crucial trait in goat breeding and based on the important role of BMP4 gene in reproduction it was considered as a possible candidate gene for the prolificacy of goats. The objective of the present study was to detect polymorphism in intronic, exonic and 3' un-translated regions of BMP4 gene in Indian goats. Nine different goat breeds (Barbari, Beetal, Black Bengal, Malabari, Jakhrana (Twinning>40%), Osmanabadi, Sangamneri (Twinning 20-30%), Sirohi and Ganjam (Twinning<10%)) differing in prolificacy and geographic distribution were employed for polymorphism scanning. Cattle sequence (AC_000167.1) was used to design primers for the amplification of a targeted region followed by direct DNA sequencing to identify the genetic variations. Single nucleotide polymorphisms (SNPs) were not detected in exon 3, the intronic region and the 3' flanking region. A SNP (G1534A) was identified in exon 2. It was a non-synonymous mutation resulting in an arginine to lysine change in a corresponding protein sequence. G to A transition at the 1534 locus revealed two genotypes GG and GA in the nine investigated goat breeds. The GG genotype was predominant with a genotype frequency of 0.98. The GA genotype was present in the Black Bengal as well as Jakhrana breed with a genotype frequency of 0.02. A microsatellite was identified in the 3' flanking region, only 20 nucleotides downstream from the termination site of the coding region, as a short sequence with more than nineteen continuous and repeated CA dinucleotides. Since the gene is highly evolutionarily conserved, identification of a non-synonymous SNP (G1534A) in the coding region gains further importance. To our knowledge, this is the first report of a mutation in the coding region of the caprine BMP4 gene. But whether the reproduction trait of goat is associated with the BMP4 polymorphism, needs to be further defined by association studies in more populations so as to delineate an effect on it. © 2013 Elsevier B.V. All rights reserved.
An insight into the sialome of the horse fly, Tabanus bromius
Ribeiro, José M.C.; Kazimirova, Maria; Takac, Peter; Andersen, John F.; Francischetti, Ivo M.B.
2015-01-01
Blood feeding animals face their host's defenses against tissue injury and blood loss while attempting to feed. One adaptation to surmount these barriers involves the evolution of a salivary potion that disarms their host's inflammatory and anti-hemostatic processes. The composition of the peptide moiety of this potion, or sialome (from the Greek sialo=saliva), can be deducted in part by proper interpretation of the blood feeder' sialotranscriptome. In this work we disclose the sialome of the blood feeding adult female Tabanus bromius. Following assembly of over 75 million Illumina reads (101 nt long) 16,683 contigs were obtained from which 4,078 coding sequences were extracted. From these, 320 were assigned as coding for putative secreted proteins. These 320 contigs mapped 85% of the reads. The antigen-5 proteins family was studied in detail, indicating three Tabanus specific clades with and without disintegrin domains, as well as with and without leukotriene binding domains. Defensins were also detailed; a clade of salivary tabanid peptides was found lacking the propeptide domain ending in the KR dipeptide signaling furin cleavage. Novel protein families were also disclosed. Viral transcripts were identified closely matching the Kotonkan virus capsid proteins. Full length Mariner transposases were also identified. A total of 3,043 coding sequences and their protein products were deposited in Genbank. Hyperlinked excel spreadsheets containing the coding sequences and their annotation are available at http://exon.niaid.nih.gov/transcriptome/T_bromius/Tbromius-web.xlsx (hyperlinked excel spreadsheet, 11 MB) and http://exon.niaid.nih.gov/transcriptome/T_bromius/Tbromius-SA.zip (Standalone excel with all local links, 360 MB). These sequences provide for a platform from which further proteomic studies may be designed to identify salivary proteins from T. bromius that are of pharmacological interest or used as immunological markers of host exposure. PMID:26369729
Allelic combinations of promoter and exon 2 in DQB1 in dogs and wolves.
Berggren, Karin T; Seddon, Jennifer M
2008-07-01
Polymorphism of PBRs of the major histocompatibility complex (MHC) genes is well recognized, but the polymorphism also extends to proximal promoter regions. Examining DQB1 variability in dogs and wolves, we identified 7 promoter variants and 13 exon 2 alleles among 89 dogs, including a previously unknown DQB1 exon 2 allele, and 8 promoter variants and 9 exon 2 alleles among 85 wolves. As expected from previous studies and from a close chromosomal location, strong linkage disequilibrium was demonstrated in both wolves and dogs by having significantly fewer promoter/exon 2 combinations than expected from simulations of randomized data sets. Interestingly, we noticed weaker haplotypic associations in dogs than in wolves. Dogs had twice as many promoter/exon 2 combinations as wolves and an almost 2-fold difference in the number of exon 2 alleles per promoter variant. This difference was not caused by an admixture of breeds in our group of dogs because the high ratio of observed to expected number of haplotypes persisted within a single dog breed, the German Shepherd. Ewens-Watterson tests indicated that both the promoter and exon 2 are under the balancing selection, and both regions appear to be more recently derived in the dog than in the wolf. Hence, although reasons for the differences are unknown, they may relate to altered selection pressure on patterns of expression. Deviations from normal MHC expression patterns have been associated with autoimmune diseases, which occur frequently in several dog breeds. Further knowledge about these deviations may help us understand the source of such diseases.
JAK2 Exon 14 Deletion in Patients with Chronic Myeloproliferative Neoplasms
Ma, Wanlong; Kantarjian, Hagop; Zhang, Xi; Wang, Xiuqiang; Zhang, Zhong; Yeh, Chen-Hsiung; O'Brien, Susan; Giles, Francis; Bruey, Jean Marie; Albitar, Maher
2010-01-01
Background The JAK2 V617F mutation in exon 14 is the most common mutation in chronic myeloproliferative neoplasms (MPNs); deletion of the entire exon 14 is rarely detected. In our previous study of >10,000 samples from patients with suspected MPNs tested for JAK2 mutations by reverse transcription-PCR (RT-PCR) with direct sequencing, complete deletion of exon 14 (Δexon14) constituted <1% of JAK2 mutations. This appears to be an alternative splicing mutation, not detectable with DNA-based testing. Methodology/Principal Findings We investigated the possibility that MPN patients may express the JAK2 Δexon14 at low levels (<15% of total transcript) not routinely detectable by RT-PCR with direct sequencing. Using a sensitive RT-PCR–based fluorescent fragment analysis method to quantify JAK2 Δexon14 mRNA expression relative to wild-type, we tested 61 patients with confirmed MPNs, 183 with suspected MPNs (93 V617F-positive, 90 V617F-negative), and 46 healthy control subjects. The Δexon14 variant was detected in 9 of the 61 (15%) confirmed MPN patients, accounting for 3.96% to 33.85% (mean = 12.04%) of total JAK2 transcript. This variant was also detected in 51 of the 183 patients with suspected MPNs (27%), including 20 of the 93 (22%) with V617F (mean [range] expression = 5.41% [2.13%–26.22%]) and 31 of the 90 (34%) without V617F (mean [range] expression = 3.88% [2.08%–12.22%]). Immunoprecipitation studies demonstrated that patients expressing Δexon14 mRNA expressed a corresponding truncated JAK2 protein. The Δexon14 variant was not detected in the 46 control subjects. Conclusions/Significance These data suggest that expression of the JAK2 Δexon14 splice variant, leading to a truncated JAK2 protein, is common in patients with MPNs. This alternatively spliced transcript appears to be more frequent in MPN patients without V617F mutation, in whom it might contribute to leukemogenesis. This mutation is missed if DNA rather than RNA is used for testing. PMID:20730051
Huntingtin gene evolution in Chordata and its peculiar features in the ascidian Ciona genus
Gissi, Carmela; Pesole, Graziano; Cattaneo, Elena; Tartari, Marzia
2006-01-01
Background To gain insight into the evolutionary features of the huntingtin (htt) gene in Chordata, we have sequenced and characterized the full-length htt mRNA in the ascidian Ciona intestinalis, a basal chordate emerging as new invertebrate model organism. Moreover, taking advantage of the availability of genomic and EST sequences, the htt gene structure of a number of chordate species, including the cogeneric ascidian Ciona savignyi, and the vertebrates Xenopus and Gallus was reconstructed. Results The C. intestinalis htt transcript exhibits some peculiar features, such as spliced leader trans-splicing in the 98 nt-long 5' untranslated region (UTR), an alternative splicing in the coding region, eight alternative polyadenylation sites, and no similarities of both 5' and 3'UTRs compared to homologs of the cogeneric C. savignyi. The predicted protein is 2946 amino acids long, shorter than its vertebrate homologs, and lacks the polyQ and the polyP stretches found in the the N-terminal regions of mammalian homologs. The exon-intron organization of the htt gene is almost identical among vertebrates, and significantly conserved between Ciona and vertebrates, allowing us to hypothesize an ancestral chordate gene consisting of at least 40 coding exons. Conclusion During chordate diversification, events of gain/loss, sliding, phase changes, and expansion of introns occurred in both vertebrate and ascidian lineages predominantly in the 5'-half of the htt gene, where there is also evidence of lineage-specific evolutionary dynamics in vertebrates. On the contrary, the 3'-half of the gene is highly conserved in all chordates at the level of both gene structure and protein sequence. Between the two Ciona species, a fast evolutionary rate and/or an early divergence time is suggested by the absence of significant similarity between UTRs, protein divergence comparable to that observed between mammals and fishes, and different distribution of repetitive elements. PMID:17092333
A case report and literature review of Fanconi Anemia (FA) diagnosed by genetic testing.
Solomon, Ponnumony John; Margaret, Priya; Rajendran, Ramya; Ramalingam, Revathy; Menezes, Godfred A; Shirley, Alph S; Lee, Seung Jun; Seong, Moon-Woo; Park, Sung Sup; Seol, Dodam; Seo, Soo Hyun
2015-05-08
Fanconi anemia (FA) is a genetically heterogeneous rare autosomal recessive disorder characterized by congenital malformations, hematological problems and predisposition to malignancies. The genes that have been found to be mutated in FA patients are called FANC. To date 16 distinct FANC genes have been reported. Among these, mutations in FANCA are the most frequent among FA patients worldwide which account for 60- 65%. In this study, a nine years old male child was brought to our hospital one year ago for opinion and advice. He was the third child born to consanguineous parents. The mutation analyses were performed for proband, parents, elder sibling and the relatives [maternal aunt and maternal aunt's son (cousin)]. Molecular genetic testing [targeted next-generation sequencing (MiSeq, Illumina method)] was performed by mutation analysis in 15 genes involved. Entire coding exons and their flanking regions of the genes were analysed. Sanger sequencing [(ABI 3730 analyzer by Applied Biosystems)] was performed using primers specific for 43 coding exons of the FANCA gene. A novel splice site mutation, c.3066 + 1G > T, (IVS31 + 1G > T), homozygote was detected by sequencing in the patient. The above sequence variant was identified in heterozygous state in his parents. Further, the above sequence variant was not identified in other family members (elder sibling, maternal aunt and cousin). It is concluded that genetic study should be done if possible in all the cases of suspected FA, including siblings, parents and close blood relatives. It will help us to plan appropriate treatment and also to select suitable donor for hematopoietic stem cell transplantation and to plan for genetic counseling. In addition to the case report, the main focus of this manuscript was to review literature on role of FANCA gene in FA since large number of FANCA mutations and polymorphisms have been identified.
Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes1
Rombauts, Stephane; Florquin, Kobe; Lescot, Magali; Marchal, Kathleen; Rouzé, Pierre; Van de Peer, Yves
2003-01-01
The identification of promoters and their regulatory elements is one of the major challenges in bioinformatics and integrates comparative, structural, and functional genomics. Many different approaches have been developed to detect conserved motifs in a set of genes that are either coregulated or orthologous. However, although recent approaches seem promising, in general, unambiguous identification of regulatory elements is not straightforward. The delineation of promoters is even harder, due to its complex nature, and in silico promoter prediction is still in its infancy. Here, we review the different approaches that have been developed for identifying promoters and their regulatory elements. We discuss the detection of cis-acting regulatory elements using word-counting or probabilistic methods (so-called “search by signal” methods) and the delineation of promoters by considering both sequence content and structural features (“search by content” methods). As an example of search by content, we explored in greater detail the association of promoters with CpG islands. However, due to differences in sequence content, the parameters used to detect CpG islands in humans and other vertebrates cannot be used for plants. Therefore, a preliminary attempt was made to define parameters that could possibly define CpG and CpNpG islands in Arabidopsis, by exploring the compositional landscape around the transcriptional start site. To this end, a data set of more than 5,000 gene sequences was built, including the promoter region, the 5′-untranslated region, and the first introns and coding exons. Preliminary analysis shows that promoter location based on the detection of potential CpG/CpNpG islands in the Arabidopsis genome is not straightforward. Nevertheless, because the landscape of CpG/CpNpG islands differs considerably between promoters and introns on the one side and exons (whether coding or not) on the other, more sophisticated approaches can probably be developed for the successful detection of “putative” CpG and CpNpG islands in plants. PMID:12857799
2015-10-01
a promising target for precision therapy , but the mechanisms leading to hypermutation, optimal methods to measure hypermutation status in the ...1 was largely completed in Year 1 and is summarized below. We published a manuscript in Nature Communications based on the work accomplished in Aim...multiplexing 24 samples per lane on a HiSeq2500. The BROCA assay uses the Agilent SureSelect enrichment system to capture the coding exons and
Age of heart disease presentation and dysmorphic nuclei in patients with LMNA mutations
Core, Jason Q.; Mehrabi, Mehrsa; Robinson, Zachery R.; Ochs, Alexander R.; McCarthy, Linda A.; Zaragoza, Michael V.
2017-01-01
Nuclear shape defects are a distinguishing characteristic in laminopathies, cancers, and other pathologies. Correlating these defects to the symptoms, mechanisms, and progression of disease requires unbiased, quantitative, and high-throughput means of quantifying nuclear morphology. To accomplish this, we developed a method of automatically segmenting fluorescently stained nuclei in 2D microscopy images and then classifying them as normal or dysmorphic based on three geometric features of the nucleus using a package of Matlab codes. As a test case, cultured skin-fibroblast nuclei of individuals possessing LMNA splice-site mutation (c.357-2A>G), LMNA nonsense mutation (c.736 C>T, pQ246X) in exon 4, LMNA missense mutation (c.1003C>T, pR335W) in exon 6, Hutchinson-Gilford Progeria Syndrome, and no LMNA mutations were analyzed. For each cell type, the percentage of dysmorphic nuclei, and other morphological features such as average nuclear area and average eccentricity were obtained. Compared to blind observers, our procedure implemented in Matlab codes possessed similar accuracy to manual counting of dysmorphic nuclei while being significantly more consistent. The automatic quantification of nuclear defects revealed a correlation between in vitro results and age of patients for initial symptom onset. Our results demonstrate the method’s utility in experimental studies of diseases affecting nuclear shape through automated, unbiased, and accurate identification of dysmorphic nuclei. PMID:29149195
Age of heart disease presentation and dysmorphic nuclei in patients with LMNA mutations.
Core, Jason Q; Mehrabi, Mehrsa; Robinson, Zachery R; Ochs, Alexander R; McCarthy, Linda A; Zaragoza, Michael V; Grosberg, Anna
2017-01-01
Nuclear shape defects are a distinguishing characteristic in laminopathies, cancers, and other pathologies. Correlating these defects to the symptoms, mechanisms, and progression of disease requires unbiased, quantitative, and high-throughput means of quantifying nuclear morphology. To accomplish this, we developed a method of automatically segmenting fluorescently stained nuclei in 2D microscopy images and then classifying them as normal or dysmorphic based on three geometric features of the nucleus using a package of Matlab codes. As a test case, cultured skin-fibroblast nuclei of individuals possessing LMNA splice-site mutation (c.357-2A>G), LMNA nonsense mutation (c.736 C>T, pQ246X) in exon 4, LMNA missense mutation (c.1003C>T, pR335W) in exon 6, Hutchinson-Gilford Progeria Syndrome, and no LMNA mutations were analyzed. For each cell type, the percentage of dysmorphic nuclei, and other morphological features such as average nuclear area and average eccentricity were obtained. Compared to blind observers, our procedure implemented in Matlab codes possessed similar accuracy to manual counting of dysmorphic nuclei while being significantly more consistent. The automatic quantification of nuclear defects revealed a correlation between in vitro results and age of patients for initial symptom onset. Our results demonstrate the method's utility in experimental studies of diseases affecting nuclear shape through automated, unbiased, and accurate identification of dysmorphic nuclei.
Nettore, I C; Desiderio, S; De Nisco, E; Cacace, V; Albano, L; Improda, N; Ungaro, P; Salerno, M; Colao, A; Macchia, P E
2018-06-01
Congenital hypothyroidism is a frequent disease occurring with an incidence of about 1/1500 newborns/year. In about 75% of the cases, CH is caused by alterations in thyroid morphogenesis, defined "thyroid dysgenesis" (TD). TD is generally a sporadic disease but in about 5% of the cases a genetic origin has been demonstrated. Previous studies indicate that Dnajc17 as a candidate modifier gene for hypothyroidism, since it is expressed in the thyroid bud, interacts with NKX2.1 and PAX8 and it has been associated to the hypothyroid phenotype in mice carrying a single Nkx2.1 and Pax8 genes (double heterozygous knock-out). The work evaluates the possible involvement of DNAJC17 in the pathogenesis of TD. High-resolution DNA melting analysis (HRM) and direct sequencing have been used to screen for mutations in the DNAJC17 coding sequence in 89 patients with TD. Two mutations have been identified in the coding sequence of DNAJC17 gene, one in exon 5 (c.350A>C; rs79709714) and one in exon 9 (c.610G>C; rs117485355). The last one is a rare variant, while the rs79709714 is a polymorphism. Both are present in databases and the frequency of the alleles is not different between TD patients and controls. DNAJC17 mutations are not frequently present in patients with TD.
Van, K; Onoda, S; Kim, M Y; Kim, K D; Lee, S-H
2008-03-01
The Waxy (Wx) gene product controls the formation of a straight chain polymer of amylose in the starch pathway. Dominance/recessiveness of the Wx allele is associated with amylose content, leading to non-waxy/waxy phenotypes. For a total of 113 foxtail millet accessions, agronomic traits and the molecular differences of the Wx gene were surveyed to evaluate genetic diversities. Molecular types were associated with phenotypes determined by four specific primer sets (non-waxy, Type I; low amylose, Type VI; waxy, Type IV or V). Additionally, the insertion of transposable element in waxy was confirmed by ex1/TSI2R, TSI2F/ex2, ex2int2/TSI7R and TSI7F/ex4r. Seventeen single nucleotide polymorphims (SNPs) were observed from non-coding regions, while three SNPs from coding regions were non-synonymous. Interestingly, the phenotype of No. 88 was still non-waxy, although seven nucleotides (AATTGGT) insertion at 2,993 bp led to 78 amino acids shorter. The rapid decline of r (2) in the sequenced region (exon 1-intron 1-exon 2) suggested a low level of linkage disequilibrium and limited haplotype structure. K (s) values and estimation of evolutionary events indicate early divergence of S. italica among cereal crops. This study suggested the Wx gene was one of the targets in the selection process during domestication.
Four novel mutations in the lactase gene (LCT) underlying congenital lactase deficiency (CLD).
Torniainen, Suvi; Freddara, Roberta; Routi, Taina; Gijsbers, Carolien; Catassi, Carlo; Höglund, Pia; Savilahti, Erkki; Järvelä, Irma
2009-01-22
Congenital lactase deficiency (CLD) is a severe gastrointestinal disorder of newborns. The diagnosis is challenging and based on clinical symptoms and low lactase activity in intestinal biopsy specimens. The disease is enriched in Finland but is also present in other parts of the world. Mutations encoding the lactase (LCT) gene have recently been shown to underlie CLD. The purpose of this study was to identify new mutations underlying CLD in patients with different ethnic origins, and to increase awareness of this disease so that the patients could be sought out and treated correctly. Disaccharidase activities in intestinal biopsy specimens were assayed and the coding region of LCT was sequenced from five patients from Europe with clinical features compatible with CLD. In the analysis and prediction of mutations the following programs: ClustalW, Blosum62, PolyPhen, SIFT and Panther PSEC were used. Four novel mutations in the LCT gene were identified. A single nucleotide substitution leading to an amino acid change S688P in exon 7 and E1612X in exon 12 were present in a patient of Italian origin. Five base deletion V565fsX567 leading to a stop codon in exon 6 was found in one and a substitution R1587H in exon 12 from another Finnish patient. Both Finnish patients were heterozygous for the Finnish founder mutation Y1390X. The previously reported mutation G1363S was found in a homozygous state in two siblings of Turkish origin. This is the first report of CLD mutations in patients living outside Finland. It seems that disease is more common than previously thought. All mutations in the LCT gene lead to a similar phenotype despite the location and/or type of mutation.
Teasdale, Luisa C; Köhler, Frank; Murray, Kevin D; O'Hara, Tim; Moussalli, Adnan
2016-09-01
The qualification of orthology is a significant challenge when developing large, multiloci phylogenetic data sets from assembled transcripts. Transcriptome assemblies have various attributes, such as fragmentation, frameshifts and mis-indexing, which pose problems to automated methods of orthology assessment. Here, we identify a set of orthologous single-copy genes from transcriptome assemblies for the land snails and slugs (Eupulmonata) using a thorough approach to orthology determination involving manual alignment curation, gene tree assessment and sequencing from genomic DNA. We qualified the orthology of 500 nuclear, protein-coding genes from the transcriptome assemblies of 21 eupulmonate species to produce the most complete phylogenetic data matrix for a major molluscan lineage to date, both in terms of taxon and character completeness. Exon capture targeting 490 of the 500 genes (those with at least one exon >120 bp) from 22 species of Australian Camaenidae successfully captured sequences of 2825 exons (representing all targeted genes), with only a 3.7% reduction in the data matrix due to the presence of putative paralogs or pseudogenes. The automated pipeline Agalma retrieved the majority of the manually qualified 500 single-copy gene set and identified a further 375 putative single-copy genes, although it failed to account for fragmented transcripts resulting in lower data matrix completeness when considering the original 500 genes. This could potentially explain the minor inconsistencies we observed in the supported topologies for the 21 eupulmonate species between the manually curated and 'Agalma-equivalent' data set (sharing 458 genes). Overall, our study confirms the utility of the 500 gene set to resolve phylogenetic relationships at a range of evolutionary depths and highlights the importance of addressing fragmentation at the homolog alignment stage for probe design. © 2016 John Wiley & Sons Ltd.
X-linked Alport syndrome: An SSCP-based mutation survey over all 51 exons of the COL4A5 gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Renieri, A.; Bruttini, M.; Galli, L.
1996-06-01
The COL4A5 gene encodes the {alpha}5 (type IV) collagen chain and is defective in X-linked Alport syndrome (AS). Here, we report the first systematic analysis of all 51 exons of COL4A5 gene in a series of 201 Italian AS patients. We have previously reported nine major rearrangements, as well as 18 small mutations identified in the same patient series by SSCP analysis of several exons. After systematic analysis of all 51 exons of COL4A5, we have now identified 30 different mutations: 10 glycine substitutions in the triple helical domain of the protein, 9 frameshift mutations, 4 in-frame deletions, 1 startmore » codon, 1 nonsense, and 5 splice-site mutations. These mutations were either unique or found in two unrelated families, thus excluding the presence of a common mutation in the coding part of the gene. Overall, mutations were detected in only 45% of individuals with a certain or likely diagnosis of X-linked AS. This finding suggests that mutations in noncoding segments of COL4A5 account for a high number of X-linked AS cases. An alternative hypothesis is the presence of locus heterogeneity, even within the X-linked form of the disease. A genotype/phenotype comparison enabled us to better substantiate a significant correlation between the degree of predicted disruption of the {alpha}5 chain and the severity of phenotype in affected male individuals. Our study has significant implications in the diagnosis and follow-up of AS patients. 44 refs., 3 figs., 4 tabs.« less
A novel CYP27B1 mutation causes a feline vitamin D-dependent rickets type IA.
Grahn, Robert A; Ellis, Melanie R; Grahn, Jennifer C; Lyons, Leslie A
2012-08-01
A 12-week-old domestic cat presented at a local veterinary clinic with hypocalcemia and skeletal abnormalities suggestive of rickets. Osteomalacia (rickets) is a disease caused by impaired bone mineralization leading to an increased prevalence of fractures and deformity. Described in a variety of species, rickets is most commonly caused by vitamin D or calcium deficiencies owing to both environmental and or genetic abnormalities. Vitamin D-dependent rickets type 1A (VDDR-1A) is a result of the enzymatic pathway defect caused by mutations in the 25-hydroxyvitamin D(3)-1-alpha-hydroxylase gene [cytochrome P27 B1 (CYP27B1)]. Calcitriol, the active form of vitamin D(3), regulates calcium homeostasis, which requires sufficient dietary calcium availability and correct hormonal function for proper bone growth and maintenance. Patient calcitriol concentrations were low while calcidiol levels were normal suggestive of VDDR-1A. The entire DNA coding sequencing of CYP27B1 was evaluated. The affected cat was wild type for previously identified VDDR-1A causative mutations. However, six novel mutations were identified, one of which was a nonsense mutation at G637T in exon 4. The exon 4 G637T nonsense mutation results in a premature protein truncation, changing a glutamic acid to a stop codon, E213X, likely causing the clinical presentation of rickets. The previously documented genetic mutation resulting in feline VDDR-1A rickets, as well as the case presented in this research, result from novel exon 4 CYP27B1 mutations, thus exon 4 should be the initial focus of future sequencing efforts.
Distribution of MICA alleles and haplotypes associated with HLA in the Korean population.
Pyo, Chul-Woo; Hur, Seong-Suk; Kim, Yang-Kyum; Choi, Hee-Baeg; Kim, Tae-Yoon; Kim, Tai-Gyu
2003-03-01
The MICA (MHC class I chain-related gene A) is a polymorphic gene located 46 kb centromeric of the HLA-B gene, and is preferentially expressed in epithelial cells and intestinal mucosa. The MICA gene, similar to human leukocyte antigen (HLA) class I, displays a high degree of genetic polymorphism in exons 2, 3, 4, and 5, amounting to 54 alleles. In this study, we investigated the polymorphisms at exons coding for extracellular domains (exons 2, 3, and 4), and the GCT repeat polymorphism at the transmembrane (exon 5) of MICA in 199 unrelated healthy Koreans. Eight alleles were observed in the Korean population, with allele frequencies for MICA*010, MICA*00201, MICA*027, MICA*004, MICA*012, MICA*00801, MICA*00901, and MICA*00701 being 18.3%, 17.8%, 13.6%, 12.3%, 11.1%, 10.8%, 10.6%, and 3.3%, respectively. Strong linkage disequilibria were also observed between the MICA and HLA-B gene-MICA*00201-B58, MICA*004-B44, MICA*00701-B27, MICA*00801-B60, MICA*00901-B51, MICA*010-B62, MICA*012-B54, and MICA*027-B61. In the analysis of the haplotypes of HLA class I genes (HLA-A, B, and C) and the MICA, the most common haplotype was MICA*004-A33-B44-Cw*07, followed by MICA*00201-A2-B58-Cw*0302 and MICA*012-A2-B54-Cw*0102. The MICA null haplotype might be identified in the HLA-B48 homozygous individual. These results will provide an understanding of the role of MICA in transplantation, disease association, and population analyses in Koreans.
Mutations in the Norrie disease gene.
Schuback, D E; Chen, Z Y; Craig, I W; Breakefield, X O; Sims, K B
1995-01-01
We report our experience to date in mutation identification in the Norrie disease (ND) gene. We carried out mutational analysis in 26 kindreds in an attempt to identify regions presumed critical to protein function and potentially correlated with generation of the disease phenotype. All coding exons, as well as noncoding regions of exons 1 and 2, 636 nucleotides in the noncoding region of exon 3, and 197 nucleotides of 5' flanking sequence, were analyzed for single-strand conformation polymorphisms (SSCP) by polymerase chain reaction (PCR) amplification of genomic DNA. DNA fragments that showed altered SSCP band mobilities were sequenced to locate the specific mutations. In addition to three previously described submicroscopic deletions encompassing the entire ND gene, we have now identified 6 intragenic deletions, 8 missense (seven point mutations, one 9-bp deletion), 6 nonsense (three point mutations, three single bp deletions/frameshift) and one 10-bp insertion, creating an expanded repeat in the 5' noncoding region of exon 1. Thus, mutations have been identified in a total of 24 of 26 (92%) of the kindreds we have studied to date. With the exception of two different mutations, each found in two apparently unrelated kindreds, these mutations are unique and expand the genotype database. Localization of the majority of point mutations at or near cysteine residues, potentially critical in protein tertiary structure, supports a previous protein model for norrin as member of a cystine knot growth factor family (Meitinger et al., 1993). Genotype-phenotype correlations were not evident with the limited clinical data available, except in the cases of larger submicroscopic deletions associated with a more severe neurologic syndrome.(ABSTRACT TRUNCATED AT 250 WORDS)
Li, Xiaoxin; Ma, Xiang; Tao, Yong
2007-06-07
To describe the clinical phenotype of X linked juvenile retinoschisis (XLRS) in 12 Chinese families with 11 different mutations in the XLRS1 (RS1) gene. Complete ophthalmic examinations were carried out in 29 affected males (12 probands), 38 heterozygous females carriers, and 100 controls. The coding regions of the RS1 gene that encodes retinoschisin were amplified by polymerase chain reaction and directly sequenced. Of the 29 male participants, 28 (96.6%) displayed typical foveal schisis. Eleven different RS1 mutations were identified in 12 families; four of these mutations, two frameshift mutations (26 del T of exon 1 and 488 del G of exon 5), and two missense mutations (Asp145His and Arg156Gly) of exon 5, had not been previously described. One non-disease-related polymorphism (NSP): 576C to T (Pro192Pro) change was also newly reported herein. We compared genotypes and observed more severe clinical features in families with the following mutations: frameshift mutation (26 del T) of exon 1, the splice donor site mutation (IVS1+2T to C),or Arg102Gln, Arg209His, and Arg213Gln mutations. Severe XLRS phenotypes are associated with the frameshift mutation 26 del T, splice donor site mutation (IVS1+2T to C), and Arg102Gln, Asp145His, Arg209His, and Arg213Gln mutations. The wide variability in the phenotype in Chinese patients with XLRS and different mutations in the RS1 gene is described. Identification of mutations in the RS1 gene and expanded information on clinical manifestations will facilitate early diagnosis, appropriate early therapy, and genetic counseling regarding the prognosis of XLRS.
Ratnam, Kavitha; Birch, David G.; Sundquist, Sanna M.; Lucero, Anna S.; Zhang, Yuhua; Meltzer, Meira; Smaoui, Nizar; Roorda, Austin
2011-01-01
Purpose. To evaluate macular cone structure in patients with X-linked retinoschisis (XLRS) caused by mutations in exon 6 of the RS1 gene. Methods. High-resolution macular images were obtained with adaptive optics scanning laser ophthalmoscopy (AOSLO) and spectral domain optical coherence tomography (SD-OCT) in two patients with XLRS and 27 age-similar healthy subjects. Retinal structure was correlated with best-corrected visual acuity, kinetic and static perimetry, fundus-guided microperimetry, full-field electroretinography (ERG), and multifocal ERG. The six coding exons and the flanking intronic regions of the RS1 gene were sequenced in each patient. Results. Two unrelated males, ages 14 and 29, with visual acuity ranging from 20/32 to 20/63, had macular schisis with small relative central scotomas in each eye. The mixed scotopic ERG b-wave was reduced more than the a-wave. SD-OCT showed schisis cavities in the outer and inner nuclear and plexiform layers. Cone spacing was increased within the largest foveal schisis cavities but was normal elsewhere. In each patient, a mutation in exon 6 of the RS1 gene was identified and was predicted to change the amino acid sequence in the discoidin domain of the retinoschisin protein. Conclusions. AOSLO images of two patients with molecularly characterized XLRS revealed increased cone spacing and abnormal packing in the macula of each patient, but cone coverage and function were near normal outside the central foveal schisis cavities. Although cone density is reduced, the preservation of wave-guiding cones at the fovea and eccentric macular regions has prognostic and therapeutic implications for XLRS patients with foveal schisis. (Clinical Trials.gov number, NCT00254605.) PMID:22110067
Ma, Xiang; Tao, Yong
2007-01-01
Purpose To describe the clinical phenotype of X linked juvenile retinoschisis (XLRS) in 12 Chinese families with 11 different mutations in the XLRS1 (RS1) gene. Methods Complete ophthalmic examinations were carried out in 29 affected males (12 probands), 38 heterozygous females carriers, and 100 controls. The coding regions of the RS1 gene that encodes retinoschisin were amplified by polymerase chain reaction and directly sequenced. Results Of the 29 male participants, 28 (96.6%) displayed typical foveal schisis. Eleven different RS1 mutations were identified in 12 families; four of these mutations, two frameshift mutations (26 del T of exon 1 and 488 del G of exon 5), and two missense mutations (Asp145His and Arg156Gly) of exon 5, had not been previously described. One non-disease-related polymorphism (NSP): 576C to T (Pro192Pro) change was also newly reported herein. We compared genotypes and observed more severe clinical features in families with the following mutations: frameshift mutation (26 del T) of exon 1, the splice donor site mutation (IVS1+2T to C),or Arg102Gln, Arg209His, and Arg213Gln mutations. Conclusions Severe XLRS phenotypes are associated with the frameshift mutation 26 del T, splice donor site mutation (IVS1+2T to C), and Arg102Gln, Asp145His, Arg209His, and Arg213Gln mutations. The wide variability in the phenotype in Chinese patients with XLRS and different mutations in the RS1 gene is described. Identification of mutations in the RS1 gene and expanded information on clinical manifestations will facilitate early diagnosis, appropriate early therapy, and genetic counseling regarding the prognosis of XLRS. PMID:17615541
Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M
2015-05-15
The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.
GeneBuilder: interactive in silico prediction of gene structure.
Milanesi, L; D'Angelo, D; Rogozin, I B
1999-01-01
Prediction of gene structure in newly sequenced DNA becomes very important in large genome sequencing projects. This problem is complicated due to the exon-intron structure of eukaryotic genes and because gene expression is regulated by many different short nucleotide domains. In order to be able to analyse the full gene structure in different organisms, it is necessary to combine information about potential functional signals (promoter region, splice sites, start and stop codons, 3' untranslated region) together with the statistical properties of coding sequences (coding potential), information about homologous proteins, ESTs and repeated elements. We have developed the GeneBuilder system which is based on prediction of functional signals and coding regions by different approaches in combination with similarity searches in proteins and EST databases. The potential gene structure models are obtained by using a dynamic programming method. The program permits the use of several parameters for gene structure prediction and refinement. During gene model construction, selecting different exon homology levels with a protein sequence selected from a list of homologous proteins can improve the accuracy of the gene structure prediction. In the case of low homology, GeneBuilder is still able to predict the gene structure. The GeneBuilder system has been tested by using the standard set (Burset and Guigo, Genomics, 34, 353-367, 1996) and the performances are: 0.89 sensitivity and 0.91 specificity at the nucleotide level. The total correlation coefficient is 0.88. The GeneBuilder system is implemented as a part of the WebGene a the URL: http://www.itba.mi. cnr.it/webgene and TRADAT (TRAncription Database and Analysis Tools) launcher URL: http://www.itba.mi.cnr.it/tradat.
Role of LRRK2 and SNCA in autosomal dominant Parkinson's disease in Turkey.
Kessler, Christoph; Atasu, Burcu; Hanagasi, Hasmet; Simón-Sánchez, Javier; Hauser, Ann-Kathrin; Pak, Meltem; Bilgic, Basar; Erginel-Unaltuna, Nihan; Gurvit, Hakan; Gasser, Thomas; Lohmann, Ebba
2018-03-01
Mutations in the LRRK2 and alpha-synuclein (SNCA) genes are well-established causes of autosomal dominant Parkinson's disease (PD). However, their frequency differs widely between ethnic groups. Only three studies have screened all coding regions of LRRK2 and SNCA in European samples so far. In Turkey, the role of LRRK2 in Parkinson's disease has been studied fragmentarily, and the incidence of SNCA copy number variations is unknown. The purpose of this study is to determine the frequency of LRRK2 and SNCA mutations in autosomal dominant PD in Turkey. We performed Sanger sequencing of all coding LRRK2 and SNCA exons in a sample of 91 patients with Parkinsonism. Copy number variations in SNCA, PRKN, PINK1, DJ1 and ATP13A2 were assessed using the MLPA method. All patients had a positive family history compatible with autosomal dominant inheritance. Known mutations in LRRK2 and SNCA were found in 3.3% of cases: one patient harbored the LRRK2 G2019S mutation, and two patients carried a SNCA gene duplication. Furthermore, we found a heterozygous deletion of PRKN exon 2 in one patient, and four rare coding variants of unknown significance (LRRK2: A211V, R1067Q, T2494I; SNCA: T72T). Genetic testing in one affected family identified the LRRK2 R1067Q variant as a possibly pathogenic substitution. Point mutations in LRRK2 and SNCA are a rare cause of autosomal dominant PD in Turkey. However, copy number variations should be considered. The unclassified variants, especially LRRK2 R1067Q, demand further investigation. Copyright © 2017. Published by Elsevier Ltd.
Novel pathogenic mutations and skin biopsy analysis in Knobloch syndrome
Suzuki, Oscar; Kague, Erika; Bagatini, Kelly; Tu, Hongmin; Heljasvaara, Ritva; Carvalhaes, Lorenza; Gava, Elisandra; de Oliveira, Gisele; Godoi, Paulo; Oliva, Glaucius; Kitten, Gregory; Pihlajaniemi, Taina; Passos-Bueno, Maria-Rita
2009-01-01
Purpose To facilitate future diagnosis of Knobloch syndrome (KS) and better understand its etiology, we sought to identify not yet described COL18A1 mutations in KS patients. In addition, we tested whether mutations in this gene lead to absence of the COL18A1 gene product and attempted to better characterize the functional effect of a previously reported missense mutation. Methods Direct sequencing of COL18A1 exons was performed in KS patients from four unrelated pedigrees. We used immunofluorescent histochemistry in skin biopsies to evaluate the presence of type XVIII collagen in four KS patients carrying two already described mutations: c.3277C>T, a nonsense mutation, and c.3601G>A, a missense mutation. Furthermore, we determined the binding properties of the mutated endostatin domain p.A1381T (c.3601G>A) to extracellular matrix proteins using ELISA and surface plasmon resonance assays. Results We identified four novel mutations in COL18A1, including a large deletion involving exon 41. Skin biopsies from KS patients revealed lack of type XVIII collagen in epithelial basement membranes and blood vessels. We also found a reduced affinity of p.A1381T endostatin to some extracellular matrix components. Conclusions COL18A1 mutations involved in Knobloch syndrome have a distribution bias toward the coding exons of the C-terminal end. Large deletions must also be considered when point mutations are not identified in patients with characteristic KS phenotype. We report, for the first time, lack of type XVIII collagen in KS patients by immunofluorescent histochemistry in skin biopsy samples. As a final point, we suggest the employment of this technique as a preliminary and complementary test for diagnosis of KS in cases when mutation screening either does not detect mutations or reveals mutations of uncertain effect, such as the p.A1381T change. PMID:19390655
Novel pathogenic mutations and skin biopsy analysis in Knobloch syndrome.
Suzuki, Oscar; Kague, Erika; Bagatini, Kelly; Tu, Hongmin; Heljasvaara, Ritva; Carvalhaes, Lorenza; Gava, Elisandra; de Oliveira, Gisele; Godoi, Paulo; Oliva, Glaucius; Kitten, Gregory; Pihlajaniemi, Taina; Passos-Bueno, Maria-Rita
2009-01-01
To facilitate future diagnosis of Knobloch syndrome (KS) and better understand its etiology, we sought to identify not yet described COL18A1 mutations in KS patients. In addition, we tested whether mutations in this gene lead to absence of the COL18A1 gene product and attempted to better characterize the functional effect of a previously reported missense mutation. Direct sequencing of COL18A1 exons was performed in KS patients from four unrelated pedigrees. We used immunofluorescent histochemistry in skin biopsies to evaluate the presence of type XVIII collagen in four KS patients carrying two already described mutations: c.3277C>T, a nonsense mutation, and c.3601G>A, a missense mutation. Furthermore, we determined the binding properties of the mutated endostatin domain p.A1381T (c.3601G>A) to extracellular matrix proteins using ELISA and surface plasmon resonance assays. We identified four novel mutations in COL18A1, including a large deletion involving exon 41. Skin biopsies from KS patients revealed lack of type XVIII collagen in epithelial basement membranes and blood vessels. We also found a reduced affinity of p.A1381T endostatin to some extracellular matrix components. COL18A1 mutations involved in Knobloch syndrome have a distribution bias toward the coding exons of the C-terminal end. Large deletions must also be considered when point mutations are not identified in patients with characteristic KS phenotype. We report, for the first time, lack of type XVIII collagen in KS patients by immunofluorescent histochemistry in skin biopsy samples. As a final point, we suggest the employment of this technique as a preliminary and complementary test for diagnosis of KS in cases when mutation screening either does not detect mutations or reveals mutations of uncertain effect, such as the p.A1381T change.
Truncating mutations in the last exon of NOTCH3 cause lateral meningocele syndrome.
Gripp, Karen W; Robbins, Katherine M; Sobreira, Nara L; Witmer, P Dane; Bird, Lynne M; Avela, Kristiina; Makitie, Outi; Alves, Daniela; Hogue, Jacob S; Zackai, Elaine H; Doheny, Kimberly F; Stabley, Deborah L; Sol-Church, Katia
2015-02-01
Lateral meningocele syndrome (LMS, OMIM%130720), also known as Lehman syndrome, is a very rare skeletal disorder with facial anomalies, hypotonia and meningocele-related neurologic dysfunction. The characteristic lateral meningoceles represent the severe end of the dural ectasia spectrum and are typically most severe in the lower spine. Facial features of LMS include hypertelorism and telecanthus, high arched eyebrows, ptosis, midfacial hypoplasia, micrognathia, high and narrow palate, low-set ears and a hypotonic appearance. Hyperextensibility, hernias and scoliosis reflect a connective tissue abnormality, and aortic dilation, a high-pitched nasal voice, wormian bones and osteolysis may be present. Lateral meningocele syndrome has phenotypic overlap with Hajdu-Cheney syndrome. We performed exome resequencing in five unrelated individuals with LMS and identified heterozygous truncating NOTCH3 mutations. In an additional unrelated individual Sanger sequencing revealed a deleterious variant in the same exon 33. In total, five novel de novo NOTCH3 mutations were identified in six unrelated patients. One had a 26 bp deletion (c.6461_6486del, p.G2154fsTer78), two carried the same single base pair insertion (c.6692_93insC, p.P2231fsTer11), and three individuals had a nonsense point mutation at c.6247A > T (pK2083*), c.6663C > G (p.Y2221*) or c.6732C > A, (p.Y2244*). All mutations cluster into the last coding exon, resulting in premature termination of the protein and truncation of the negative regulatory proline-glutamate-serine-threonine rich PEST domain. Our results suggest that mutant mRNA products escape nonsense mediated decay. The truncated NOTCH3 may cause gain-of-function through decreased clearance of the active intracellular product, resembling NOTCH2 mutations in the clinically related Hajdu-Cheney syndrome and contrasting the NOTCH3 missense mutations causing CADASIL. © 2014 Wiley Periodicals, Inc.
Chillón, Isabel; Pyle, Anna M.
2016-01-01
LincRNA-p21 is a long intergenic non-coding RNA (lincRNA) involved in the p53-mediated stress response. We sequenced the human lincRNA-p21 (hLincRNA-p21) and found that it has a single exon that includes inverted repeat Alu elements (IRAlus). Sense and antisense Alu elements fold independently of one another into a secondary structure that is conserved in lincRNA-p21 among primates. Moreover, the structures formed by IRAlus are involved in the localization of hLincRNA-p21 in the nucleus, where hLincRNA-p21 colocalizes with paraspeckles. Our results underscore the importance of IRAlus structures for the function of hLincRNA-p21 during the stress response. PMID:27378782
Mutation Screening of 1,237 Cancer Genes across Six Model Cell Lines of Basal-Like Breast Cancer.
Olsson, Eleonor; Winter, Christof; George, Anthony; Chen, Yilun; Törngren, Therese; Bendahl, Pär-Ola; Borg, Åke; Gruvberger-Saal, Sofia K; Saal, Lao H
2015-01-01
Basal-like breast cancer is an aggressive subtype generally characterized as poor prognosis and lacking the expression of the three most important clinical biomarkers, estrogen receptor, progesterone receptor, and HER2. Cell lines serve as useful model systems to study cancer biology in vitro and in vivo. We performed mutational profiling of six basal-like breast cancer cell lines (HCC38, HCC1143, HCC1187, HCC1395, HCC1954, and HCC1937) and their matched normal lymphocyte DNA using targeted capture and next-generation sequencing of 1,237 cancer-associated genes, including all exons, UTRs and upstream flanking regions. In total, 658 somatic variants were identified, of which 378 were non-silent (average 63 per cell line, range 37-146) and 315 were novel (not present in the Catalogue of Somatic Mutations in Cancer database; COSMIC). 125 novel mutations were confirmed by Sanger sequencing (59 exonic, 48 3'UTR and 10 5'UTR, 1 splicing), with a validation rate of 94% of high confidence variants. Of 36 mutations previously reported for these cell lines but not detected in our exome data, 36% could not be detected by Sanger sequencing. The base replacements C/G>A/T, C/G>G/C, C/G>T/A and A/T>G/C were significantly more frequent in the coding regions compared to the non-coding regions (OR 3.2, 95% CI 2.0-5.3, P<0.0001; OR 4.3, 95% CI 2.9-6.6, P<0.0001; OR 2.4, 95% CI 1.8-3.1, P<0.0001; OR 1.8, 95% CI 1.2-2.7, P = 0.024, respectively). The single nucleotide variants within the context of T[C]T/A[G]A and T[C]A/T[G]A were more frequent in the coding than in the non-coding regions (OR 3.7, 95% CI 2.2-6.1, P<0.0001; OR 3.8, 95% CI 2.0-7.2, P = 0.001, respectively). Copy number estimations were derived from the targeted regions and correlated well to Affymetrix SNP array copy number data (Pearson correlation 0.82 to 0.96 for all compared cell lines; P<0.0001). These mutation calls across 1,237 cancer-associated genes and identification of novel variants will aid in the design and interpretation of biological experiments using these six basal-like breast cancer cell lines.
Exonal deletion of SLC24A4 causes hypomaturation amelogenesis imperfecta.
Seymen, F; Lee, K-E; Tran Le, C G; Yildirim, M; Gencay, K; Lee, Z H; Kim, J-W
2014-04-01
Amelogenesis imperfecta is a heterogeneous group of genetic conditions affecting enamel formation. Recently, mutations in solute carrier family 24 member 4 (SLC24A4) have been identified to cause autosomal recessive hypomaturation amelogenesis imperfecta. We recruited a consanguineous family with hypomaturation amelogenesis imperfecta with generalized brown discoloration. Sequencing of the candidate genes identified a 10-kb deletion, including exons 15, 16, and most of the last exon of the SLC24A4 gene. Interestingly, this deletion was caused by homologous recombination between two 354-bp-long homologous sequences located in intron 14 and the 3' UTR. This is the first report of exonal deletion in SLC24A4 providing confirmatory evidence that the function of SLC24A4 in calcium transport has a crucial role in the maturation stage of amelogenesis.
RNA-Seq Profiling Reveals Novel Hepatic Gene Expression Pattern in Aflatoxin B1 Treated Rats
Merrick, B. Alex; Phadke, Dhiral P.; Auerbach, Scott S.; Mav, Deepak; Stiegelmeyer, Suzy M.; Shah, Ruchir R.; Tice, Raymond R.
2013-01-01
Deep sequencing was used to investigate the subchronic effects of 1 ppm aflatoxin B1 (AFB1), a potent hepatocarcinogen, on the male rat liver transcriptome prior to onset of histopathological lesions or tumors. We hypothesized RNA-Seq would reveal more differentially expressed genes (DEG) than microarray analysis, including low copy and novel transcripts related to AFB1’s carcinogenic activity compared to feed controls (CTRL). Paired-end reads were mapped to the rat genome (Rn4) with TopHat and further analyzed by DESeq and Cufflinks-Cuffdiff pipelines to identify differentially expressed transcripts, new exons and unannotated transcripts. PCA and cluster analysis of DEGs showed clear separation between AFB1 and CTRL treatments and concordance among group replicates. qPCR of eight high and medium DEGs and three low DEGs showed good comparability among RNA-Seq and microarray transcripts. DESeq analysis identified 1,026 differentially expressed transcripts at greater than two-fold change (p<0.005) compared to 626 transcripts by microarray due to base pair resolution of transcripts by RNA-Seq, probe placement within transcripts or an absence of probes to detect novel transcripts, splice variants and exons. Pathway analysis among DEGs revealed signaling of Ahr, Nrf2, GSH, xenobiotic, cell cycle, extracellular matrix, and cell differentiation networks consistent with pathways leading to AFB1 carcinogenesis, including almost 200 upregulated transcripts controlled by E2f1-related pathways related to kinetochore structure, mitotic spindle assembly and tissue remodeling. We report 49 novel, differentially-expressed transcripts including confirmation by PCR-cloning of two unique, unannotated, hepatic AFB1-responsive transcripts (HAfT’s) on chromosomes 1.q55 and 15.q11, overexpressed by 10 to 25-fold. Several potentially novel exons were found and exon refinements were made including AFB1 exon-specific induction of homologous family members, Ugt1a6 and Ugt1a7c. We find the rat transcriptome contains many previously unidentified, AFB1-responsive exons and transcripts supporting RNA-Seq’s capabilities to provide new insights into AFB1-mediated gene expression leading to hepatocellular carcinoma. PMID:23630614
RNA-Seq profiling reveals novel hepatic gene expression pattern in aflatoxin B1 treated rats.
Merrick, B Alex; Phadke, Dhiral P; Auerbach, Scott S; Mav, Deepak; Stiegelmeyer, Suzy M; Shah, Ruchir R; Tice, Raymond R
2013-01-01
Deep sequencing was used to investigate the subchronic effects of 1 ppm aflatoxin B1 (AFB1), a potent hepatocarcinogen, on the male rat liver transcriptome prior to onset of histopathological lesions or tumors. We hypothesized RNA-Seq would reveal more differentially expressed genes (DEG) than microarray analysis, including low copy and novel transcripts related to AFB1's carcinogenic activity compared to feed controls (CTRL). Paired-end reads were mapped to the rat genome (Rn4) with TopHat and further analyzed by DESeq and Cufflinks-Cuffdiff pipelines to identify differentially expressed transcripts, new exons and unannotated transcripts. PCA and cluster analysis of DEGs showed clear separation between AFB1 and CTRL treatments and concordance among group replicates. qPCR of eight high and medium DEGs and three low DEGs showed good comparability among RNA-Seq and microarray transcripts. DESeq analysis identified 1,026 differentially expressed transcripts at greater than two-fold change (p<0.005) compared to 626 transcripts by microarray due to base pair resolution of transcripts by RNA-Seq, probe placement within transcripts or an absence of probes to detect novel transcripts, splice variants and exons. Pathway analysis among DEGs revealed signaling of Ahr, Nrf2, GSH, xenobiotic, cell cycle, extracellular matrix, and cell differentiation networks consistent with pathways leading to AFB1 carcinogenesis, including almost 200 upregulated transcripts controlled by E2f1-related pathways related to kinetochore structure, mitotic spindle assembly and tissue remodeling. We report 49 novel, differentially-expressed transcripts including confirmation by PCR-cloning of two unique, unannotated, hepatic AFB1-responsive transcripts (HAfT's) on chromosomes 1.q55 and 15.q11, overexpressed by 10 to 25-fold. Several potentially novel exons were found and exon refinements were made including AFB1 exon-specific induction of homologous family members, Ugt1a6 and Ugt1a7c. We find the rat transcriptome contains many previously unidentified, AFB1-responsive exons and transcripts supporting RNA-Seq's capabilities to provide new insights into AFB1-mediated gene expression leading to hepatocellular carcinoma.
Huang, Yulei; Goldberg, Michel; Le, Thuan; Qiang, Ran; Warner, Douglas; Witkowska, Halina Ewa; Liu, Haichuan; Zhu, Li; Denbesten, Pamela; Li, Wu
2012-01-01
Amelogenins containing exons 8 and 9 are alternatively spliced variants of amelogenin. Some amelogenin spliced variants have been found to promote pulp regeneration following pulp exposure. The function of the amelogenin spliced variants with the exons 8 and 9 remains unknown. In this study, we synthesized recombinant leucine rich amelogenin peptide (LRAP, A-4), LRAP plus exons 8 and 9 peptide (LRAP 8, 9) or exons 8 and 9 peptide (P89), to determine their effects on odontoblasts. In vivo analyses were completed following the insertion of agarose beads containing LRAP or LRAP 8, 9 into exposed cavity preparations of rat molars. After 8, 15 or 30 days' exposure, the pulp tissues were analyzed for changes in histomorphometry and cell proliferation by PCNA stainings. In vitro analyses included the effects of the addition of the recombinant proteins or peptide on cell proliferation, differentiation and adhesion of postnatal human dental pulp cells (DPCs). These studies showed that in vivo LRAP 8, 9 enhanced the reparative dentin formation as compared to LRAP. In vitro LRAP 8, 9 promoted DPC proliferation and differentiation to a greater extent than LRAP. These data suggest that amelogenin exons 8 and 9 may be useful in amelogenin-mediated pulp repair. Copyright © 2012 S. Karger AG, Basel.
Zouheir Habbal, Mohammad; Bou-Assi, Tarek; Zhu, Jun; Owen, Renius; Chehab, Farid F
2014-01-01
Alkaptonuria is often diagnosed clinically with episodes of dark urine, biochemically by the accumulation of peripheral homogentisic acid and molecularly by the presence of mutations in the homogentisate 1,2-dioxygenase gene (HGD). Alkaptonuria is invariably associated with HGD mutations, which consist of single nucleotide variants and small insertions/deletions. Surprisingly, the presence of deletions beyond a few nucleotides among over 150 reported deleterious mutations has not been described, raising the suspicion that this gene might be protected against the detrimental mechanisms of gene rearrangements. The quest for an HGD mutation in a proband with AKU revealed with a SNP array five large regions of homozygosity (5-16 Mb), one of which includes the HGD gene. A homozygous deletion of 649 bp deletion that encompasses the 72 nucleotides of exon 2 and surrounding DNA sequences in flanking introns of the HGD gene was unveiled in a proband with AKU. The nature of this deletion suggests that this in-frame deletion could generate a protein without exon 2. Thus, we modeled the tertiary structure of the mutant protein structure to determine the effect of exon 2 deletion. While the two β-pleated sheets encoded by exon 2 were missing in the mutant structure, other β-pleated sheets are largely unaffected by the deletion. However, nine novel α-helical coils substituted the eight coils present in the native HGD crystal structure. Thus, this deletion results in a deleterious enzyme, which is consistent with the proband's phenotype. Screening for mutations in the HGD gene, particularly in the Middle East, ought to include this exon 2 deletion in order to determine its frequency and uncover its origin.
Habbal, Mohammad Zouheir; Bou-Assi, Tarek; Zhu, Jun; Owen, Renius; Chehab, Farid F.
2014-01-01
Alkaptonuria is often diagnosed clinically with episodes of dark urine, biochemically by the accumulation of peripheral homogentisic acid and molecularly by the presence of mutations in the homogentisate 1,2-dioxygenase gene (HGD). Alkaptonuria is invariably associated with HGD mutations, which consist of single nucleotide variants and small insertions/deletions. Surprisingly, the presence of deletions beyond a few nucleotides among over 150 reported deleterious mutations has not been described, raising the suspicion that this gene might be protected against the detrimental mechanisms of gene rearrangements. The quest for an HGD mutation in a proband with AKU revealed with a SNP array five large regions of homozygosity (5–16 Mb), one of which includes the HGD gene. A homozygous deletion of 649 bp deletion that encompasses the 72 nucleotides of exon 2 and surrounding DNA sequences in flanking introns of the HGD gene was unveiled in a proband with AKU. The nature of this deletion suggests that this in-frame deletion could generate a protein without exon 2. Thus, we modeled the tertiary structure of the mutant protein structure to determine the effect of exon 2 deletion. While the two β-pleated sheets encoded by exon 2 were missing in the mutant structure, other β-pleated sheets are largely unaffected by the deletion. However, nine novel α-helical coils substituted the eight coils present in the native HGD crystal structure. Thus, this deletion results in a deleterious enzyme, which is consistent with the proband’s phenotype. Screening for mutations in the HGD gene, particularly in the Middle East, ought to include this exon 2 deletion in order to determine its frequency and uncover its origin. PMID:25233259
Ren, H; Stiles, G L
1994-01-01
The human A1 adenosine receptor gene contains six exons with exons 1, 2, 3, 4, and part of 5 representing 5' untranslated regions. Reverse transcription-PCR with exon-specific primers showed two distinct transcripts containing either exons 3, 5, and 6 or exons 4, 5, and 6, with exons 3 and 4 being mutually exclusive. No mature mRNAs containing exons 1 and 2 have been detected. All human tissues that express any A1 receptors contain mRNA with exons 4, 5, and 6. Tissues which express high levels of A1 receptors contain mRNA with exons 3, 5, and 6. Exon 4 contains two upstream ATG codons whereas exon 3 contains none. COS cells transfected with expression vectors containing exon 4 (exons 1-6, 3-6, or Ex4-6) express much lower levels of A1 receptors than vectors without exon 4 (exons 3, 5, and 6). Mutation of upstream ATG codons in exon 4 leads to 3- to 7-fold increased A1 receptor expression, up to the level seen with the construct containing exons 3, 5, and 6. Thus, in human tissues "basal" levels of A1 receptors can be expressed by use of mRNA containing exons 4, 5, and 6, but when high levels are needed, alternative transcripts with exons 3, 5, and 6 are produced. Images PMID:8197148
Darbani, Behrooz; Noeparvar, Shahin; Borg, Søren
2016-01-01
RNA circularization made by head-to-tail back-splicing events is involved in the regulation of gene expression from transcriptional to post-translational levels. By exploiting RNA-Seq data and down-stream analysis, we shed light on the importance of circular RNAs in plants. The results introduce circular RNAs as novel interactors in the regulation of gene expression in plants and imply the comprehensiveness of this regulatory pathway by identifying circular RNAs for a diverse set of genes. These genes are involved in several aspects of cellular metabolism as hormonal signaling, intracellular protein sorting, carbohydrate metabolism and cell-wall biogenesis, respiration, amino acid biosynthesis, transcription and translation, and protein ubiquitination. Additionally, these parental loci of circular RNAs, from both nuclear and mitochondrial genomes, encode for different transcript classes including protein coding transcripts, microRNA, rRNA, and long non-coding/microprotein coding RNAs. The results shed light on the mitochondrial exonic circular RNAs and imply the importance of circular RNAs for regulation of mitochondrial genes. Importantly, we introduce circular RNAs in barley and elucidate their cellular-level alterations across tissues and in response to micronutrients iron and zinc. In further support of circular RNAs' functional roles in plants, we report several cases where fluctuations of circRNAs do not correlate with the levels of their parental-loci encoded linear transcripts. PMID:27375638
Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.
Sanford, Jeremy R; Wang, Xin; Mort, Matthew; Vanduyn, Natalia; Cooper, David N; Mooney, Sean D; Edenberg, Howard J; Liu, Yunlong
2009-03-01
Metazoan genes are encrypted with at least two superimposed codes: the genetic code to specify the primary structure of proteins and the splicing code to expand their proteomic output via alternative splicing. Here, we define the specificity of a central regulator of pre-mRNA splicing, the conserved, essential splicing factor SFRS1. Cross-linking immunoprecipitation and high-throughput sequencing (CLIP-seq) identified 23,632 binding sites for SFRS1 in the transcriptome of cultured human embryonic kidney cells. SFRS1 was found to engage many different classes of functionally distinct transcripts including mRNA, miRNA, snoRNAs, ncRNAs, and conserved intergenic transcripts of unknown function. The majority of these diverse transcripts share a purine-rich consensus motif corresponding to the canonical SFRS1 binding site. The consensus site was not only enriched in exons cross-linked to SFRS1 in vivo, but was also enriched in close proximity to splice sites. mRNAs encoding RNA processing factors were significantly overrepresented, suggesting that SFRS1 may broadly influence the post-transcriptional control of gene expression in vivo. Finally, a search for the SFRS1 consensus motif within the Human Gene Mutation Database identified 181 mutations in 82 different genes that disrupt predicted SFRS1 binding sites. This comprehensive analysis substantially expands the known roles of human SR proteins in the regulation of a diverse array of RNA transcripts.
Liu, Feiling; Guo, Dianhao; Yuan, Zhuting; Chen, Chen; Xiao, Huamei
2017-11-20
Long non-coding RNA (lncRNA) is a class of noncoding RNA >200 bp in length that has essential roles in regulating a variety of biological processes. Here, we constructed a computational pipeline to identify lncRNA genes in the diamondback moth (Plutella xylostella), a major insect pest of cruciferous vegetables. In total, 3,324 lncRNAs corresponding to 2,475 loci were identified from 13 RNA-Seq datasets, including samples from parasitized, insecticide-resistant strains and different developmental stages. The identified P. xylostella lncRNAs had shorter transcripts and fewer exons than protein-coding genes. Seven out of nine randomly selected lncRNAs were validated by strand-specific RT-PCR. In total, 54-172 lncRNAs were specifically expressed in the insecticide resistant strains, among which one lncRNA was located adjacent to the sodium channel gene. In addition, 63-135 lncRNAs were specifically expressed in different developmental stages, among which three lncRNAs overlapped or were located adjacent to the metamorphosis-associated genes. These lncRNAs were either strongly or weakly co-expressed with their overlapping or neighboring mRNA genes. In summary, we identified thousands of lncRNAs and presented evidence that lncRNAs might have key roles in conferring insecticide resistance and regulating the metamorphosis development in P. xylostella.
A global view of the nonprotein-coding transcriptome in Plasmodium falciparum
Raabe, Carsten A.; Sanchez, Cecilia P.; Randau, Gerrit; Robeck, Thomas; Skryabin, Boris V.; Chinni, Suresh V.; Kube, Michael; Reinhardt, Richard; Ng, Guey Hooi; Manickam, Ravichandran; Kuryshev, Vladimir Y.; Lanzer, Michael; Brosius, Juergen; Tang, Thean Hock; Rozhdestvensky, Timofey S.
2010-01-01
Nonprotein-coding RNAs (npcRNAs) represent an important class of regulatory molecules that act in many cellular pathways. Here, we describe the experimental identification and validation of the small npcRNA transcriptome of the human malaria parasite Plasmodium falciparum. We identified 630 novel npcRNA candidates. Based on sequence and structural motifs, 43 of them belong to the C/D and H/ACA-box subclasses of small nucleolar RNAs (snoRNAs) and small Cajal body-specific RNAs (scaRNAs). We further observed the exonization of a functional H/ACA snoRNA gene, which might contribute to the regulation of ribosomal protein L7a gene expression. Some of the small npcRNA candidates are from telomeric and subtelomeric repetitive regions, suggesting their potential involvement in maintaining telomeric integrity and subtelomeric gene silencing. We also detected 328 cis-encoded antisense npcRNAs (asRNAs) complementary to P. falciparum protein-coding genes of a wide range of biochemical pathways, including determinants of virulence and pathology. All cis-encoded asRNA genes tested exhibit lifecycle-specific expression profiles. For all but one of the respective sense–antisense pairs, we deduced concordant patterns of expression. Our findings have important implications for a better understanding of gene regulatory mechanisms in P. falciparum, revealing an extended and sophisticated npcRNA network that may control the expression of housekeeping genes and virulence factors. PMID:19864253
A global view of the nonprotein-coding transcriptome in Plasmodium falciparum.
Raabe, Carsten A; Sanchez, Cecilia P; Randau, Gerrit; Robeck, Thomas; Skryabin, Boris V; Chinni, Suresh V; Kube, Michael; Reinhardt, Richard; Ng, Guey Hooi; Manickam, Ravichandran; Kuryshev, Vladimir Y; Lanzer, Michael; Brosius, Juergen; Tang, Thean Hock; Rozhdestvensky, Timofey S
2010-01-01
Nonprotein-coding RNAs (npcRNAs) represent an important class of regulatory molecules that act in many cellular pathways. Here, we describe the experimental identification and validation of the small npcRNA transcriptome of the human malaria parasite Plasmodium falciparum. We identified 630 novel npcRNA candidates. Based on sequence and structural motifs, 43 of them belong to the C/D and H/ACA-box subclasses of small nucleolar RNAs (snoRNAs) and small Cajal body-specific RNAs (scaRNAs). We further observed the exonization of a functional H/ACA snoRNA gene, which might contribute to the regulation of ribosomal protein L7a gene expression. Some of the small npcRNA candidates are from telomeric and subtelomeric repetitive regions, suggesting their potential involvement in maintaining telomeric integrity and subtelomeric gene silencing. We also detected 328 cis-encoded antisense npcRNAs (asRNAs) complementary to P. falciparum protein-coding genes of a wide range of biochemical pathways, including determinants of virulence and pathology. All cis-encoded asRNA genes tested exhibit lifecycle-specific expression profiles. For all but one of the respective sense-antisense pairs, we deduced concordant patterns of expression. Our findings have important implications for a better understanding of gene regulatory mechanisms in P. falciparum, revealing an extended and sophisticated npcRNA network that may control the expression of housekeeping genes and virulence factors.
Comparative genomic analysis of the false killer whale (Pseudorca crassidens) LMBR1 locus.
Kim, Dae-Won; Choi, Sang-Haeng; Kim, Ryong Nam; Kim, Sun-Hong; Paik, Sang-Gi; Nam, Seong-Hyeuk; Kim, Dong-Wook; Kim, Aeri; Kang, Aram; Park, Hong-Seog
2010-09-01
The sequencing and comparative genomic analysis of LMBR1 loci in mammals or other species, including human, would be very important in understanding evolutionary genetic changes underlying the evolution of limb development. In this regard, comparative genomic annotation of the false killer whale LMBR1 locus could shed new light on the evolution of limb development. We sequenced two false killer whale BAC clones, corresponding to 156 kb and 144 kb, respectively, harboring the tightly linked RNF32, LMBR1, and NOM1 genes. Our annotation of the false killer whale LMBR1 gene showed that it consists of 17 exons (1473 bp), in contrast to 18 exons (1596 bp) in human, and it displays 93.1% and 95.6% nucleotide and amino acid sequence similarity, respectively, compared with the human gene. In particular, we discovered that exon 10, deleted in the false killer whale LMBR1 gene, is present only in primates, and this fact strongly implies that exon 10 might be crucial in determining primate-specific limb development. ZRS and TFBS sequences have been well conserved across 11 species, suggesting that these regions could be involved in an important function of limb development and limb patterning. The neighboring gene RNF32 showed several lineage-conserved exons, such as exons 2 through 9 conserved in eutherian mammals, exons 3 through 9 conserved in mammals, and exons 5 through 9 conserved in vertebrates. The other neighboring gene, NOM1, had undergone a substitution (ATG→GTA) at the start codon, giving rise to a 36 bp shorter N-terminal sequence compared with the human sequence. Our comparative analysis of the false killer whale LMBR1 genomic locus provides important clues regarding the genetic regions that may play crucial roles in limb development and patterning.
MPL mutation profile in JAK2 mutation-negative patients with myeloproliferative disorders.
Ma, Wanlong; Zhang, Xi; Wang, Xiuqiang; Zhang, Zhong; Yeh, Chen-Hsiung; Uyeji, Jennifer; Albitar, Maher
2011-03-01
Mutations in the thrombopoietin receptor gene (myeloproliferative leukemia, MPL) have been reported in patients with JAK2 V617F-negative chronic myeloproliferative disorders (MPDs). We evaluated the prevalence of MPL mutations relative to JAK2 mutations in patients with suspected MPDs. A total of 2790 patient samples submitted for JAK2 mutation analysis were tested using real-time polymerase chain reaction and bidirectional sequencing of plasma RNA. JAK2 V617F-negative samples were tested for JAK2 exons 12 to 14 mutations, and those with negative results were then tested for mutations in MPL exons 10 and 11. Of the 2790 patients, 529 (18.96%) had V617F, 12 (0.43%) had small insertions or deletions in exon 12, and 7 (0.25%) had other JAK2 mutations in exons 12 to 14. Of the 2242 JAK2 mutation-negative patients, 68 (3.03%) had MPL mutations. W515L was the predominant MPL mutation (n=46; 68%), and 10 (15%) patients had other W515 variants. The remaining MPL mutations (n=12, 17%) were detected at other locations in exons 10 and 11 and included 3 insertion/deletion mutations. The S505N mutation, associated with familial MPD, was detected in 3 patients. Overall, for every 100 V617F mutations in patients with suspected MPDs, there were 12.9 MPL mutations, 2.3 JAK2 exon 12 mutations, and 1.3 JAK2 exons 13 to 14 mutations. These findings suggest that MPL mutation screening should be performed before JAK2 exons 12 to 14 testing in JAK2 V617F-negative patients with suspected MPDs.
Gene analysis of steroid 5 alpha-reductase 1 in hyperandrogenic women.
Eminović, Izet; Komel, Radovan; Prezelj, Janez; Karamehić, Jasenko; Gavrankapetanović, Faris; Heljić, Becir
2005-08-01
To examine the gene encoding for 5alpha-reductase type 1 in hyperandrogenic women, and assess the association of its eventual mutations or polymorphisms with the development of the hyperandrogenic female pattern. Sixteen hyperandrogenic women were included in the study. Single-stranded conformation polymorphism analysis (SSCP) and DNA sequencing were performed after polymerase chain reaction amplification of each of the 5 exons of the SRD5A1 gene in both hyperandrogenic and control group (16 participants). Sequence analysis identified the existence of many polymorphisms; in codon 24 of exon 1, GGC (Gly) into GAC (Asp); in codon 30 of exon 1, CGG (Arg) into CGC (Arg); in exon 3 codon 169, ACA to ACG (both encoding for threonine); in exon 5, AGA to AGG (both encoding for arginine, codon 260); and T/C polymorphism in intron 2. Polymorphisms were found in both groups. Polymorphisms of SRD5A1 gene were the same in both hyperandrogenic and healthy women, indicating no significant associations of genetic polymorphisms/variations of SRD5A1 gene with clinical manifestations of hyperandrogenic disorders in women.
Characterisation of CDKL5 Transcript Isoforms in Human and Mouse
Dando, Owen; Landsberger, Nicoletta; Kilstrup-Nielsen, Charlotte; Kind, Peter C.; Bailey, Mark E. S.; Cobb, Stuart R.
2016-01-01
Mutations in the X-linked Cyclin-Dependent Kinase-Like 5 gene (CDKL5) cause early onset infantile spasms and subsequent severe developmental delay in affected children. Deleterious mutations have been reported to occur throughout the CDKL5 coding region. Several studies point to a complex CDKL5 gene structure in terms of exon usage and transcript expression. Improvements in molecular diagnosis and more extensive research into the neurobiology of CDKL5 and pathophysiology of CDKL5 disorders necessitate an updated analysis of the gene. In this study, we have analysed human and mouse CDKL5 transcript patterns both bioinformatically and experimentally. We have characterised the predominant brain isoform of CDKL5, a 9.7 kb transcript comprised of 18 exons with a large 6.6 kb 3’-untranslated region (UTR), which we name hCDKL5_1. In addition we describe new exonic regions and a range of novel splice and UTR isoforms. This has enabled the description of an updated gene model in both species and a standardised nomenclature system for CDKL5 transcripts. Profiling revealed tissue- and brain development stage-specific differences in expression between transcript isoforms. These findings provide an essential backdrop for the diagnosis of CDKL5-related disorders, for investigations into the basic biology of this gene and its protein products, and for the rational design of gene-based and molecular therapies for these disorders. PMID:27315173
Characterisation of CDKL5 Transcript Isoforms in Human and Mouse.
Hector, Ralph D; Dando, Owen; Landsberger, Nicoletta; Kilstrup-Nielsen, Charlotte; Kind, Peter C; Bailey, Mark E S; Cobb, Stuart R
2016-01-01
Mutations in the X-linked Cyclin-Dependent Kinase-Like 5 gene (CDKL5) cause early onset infantile spasms and subsequent severe developmental delay in affected children. Deleterious mutations have been reported to occur throughout the CDKL5 coding region. Several studies point to a complex CDKL5 gene structure in terms of exon usage and transcript expression. Improvements in molecular diagnosis and more extensive research into the neurobiology of CDKL5 and pathophysiology of CDKL5 disorders necessitate an updated analysis of the gene. In this study, we have analysed human and mouse CDKL5 transcript patterns both bioinformatically and experimentally. We have characterised the predominant brain isoform of CDKL5, a 9.7 kb transcript comprised of 18 exons with a large 6.6 kb 3'-untranslated region (UTR), which we name hCDKL5_1. In addition we describe new exonic regions and a range of novel splice and UTR isoforms. This has enabled the description of an updated gene model in both species and a standardised nomenclature system for CDKL5 transcripts. Profiling revealed tissue- and brain development stage-specific differences in expression between transcript isoforms. These findings provide an essential backdrop for the diagnosis of CDKL5-related disorders, for investigations into the basic biology of this gene and its protein products, and for the rational design of gene-based and molecular therapies for these disorders.
Sanger sequencing as a first-line approach for molecular diagnosis of Andersen-Tawil syndrome.
Totomoch-Serra, Armando; Marquez, Manlio F; Cervantes-Barragán, David E
2017-01-01
In 1977, Frederick Sanger developed a new method for DNA sequencing based on the chain termination method, now known as the Sanger sequencing method (SSM). Recently, massive parallel sequencing, better known as next-generation sequencing (NGS), is replacing the SSM for detecting mutations in cardiovascular diseases with a genetic background. The present opinion article wants to remark that "targeted" SSM is still effective as a first-line approach for the molecular diagnosis of some specific conditions, as is the case for Andersen-Tawil syndrome (ATS). ATS is described as a rare multisystemic autosomal dominant channelopathy syndrome caused mainly by a heterozygous mutation in the KCNJ2 gene . KCJN2 has particular characteristics that make it attractive for "directed" SSM. KCNJ2 has a sequence of 17,510 base pairs (bp), and a short coding region with two exons (exon 1=166 bp and exon 2=5220 bp), half of the mutations are located in the C-terminal cytosolic domain, a mutational hotspot has been described in residue Arg218, and this gene explains the phenotype in 60% of ATS cases that fulfill all the clinical criteria of the disease. In order to increase the diagnosis of ATS we urge cardiologists to search for facial and muscular abnormalities in subjects with frequent ventricular arrhythmias (especially bigeminy) and prominent U waves on the electrocardiogram.
Sanger sequencing as a first-line approach for molecular diagnosis of Andersen-Tawil syndrome
Totomoch-Serra, Armando; Marquez, Manlio F.; Cervantes-Barragán, David E.
2017-01-01
In 1977, Frederick Sanger developed a new method for DNA sequencing based on the chain termination method, now known as the Sanger sequencing method (SSM). Recently, massive parallel sequencing, better known as next-generation sequencing (NGS), is replacing the SSM for detecting mutations in cardiovascular diseases with a genetic background. The present opinion article wants to remark that “targeted” SSM is still effective as a first-line approach for the molecular diagnosis of some specific conditions, as is the case for Andersen-Tawil syndrome (ATS). ATS is described as a rare multisystemic autosomal dominant channelopathy syndrome caused mainly by a heterozygous mutation in the KCNJ2 gene . KCJN2 has particular characteristics that make it attractive for “directed” SSM. KCNJ2 has a sequence of 17,510 base pairs (bp), and a short coding region with two exons (exon 1=166 bp and exon 2=5220 bp), half of the mutations are located in the C-terminal cytosolic domain, a mutational hotspot has been described in residue Arg218, and this gene explains the phenotype in 60% of ATS cases that fulfill all the clinical criteria of the disease. In order to increase the diagnosis of ATS we urge cardiologists to search for facial and muscular abnormalities in subjects with frequent ventricular arrhythmias (especially bigeminy) and prominent U waves on the electrocardiogram. PMID:29093808
Fukami, Maki; Naiki, Yasuhiro; Muroya, Koji; Hamajima, Takashi; Soneda, Shun; Horikawa, Reiko; Jinno, Tomoko; Katsumi, Momori; Nakamura, Akie; Asakura, Yumi; Adachi, Masanori; Ogata, Tsutomu; Kanzaki, Susumu
2015-09-01
Pseudoautosomal region 1 (PAR1) contains SHOX, in addition to seven highly conserved non-coding DNA elements (CNEs) with cis-regulatory activity. Microdeletions involving SHOX exons 1-6a and/or the CNEs result in idiopathic short stature (ISS) and Leri-Weill dyschondrosteosis (LWD). Here, we report six rare copy-number variations (CNVs) in PAR1 identified through copy-number analyzes of 245 ISS/LWD patients and 15 unaffected individuals. The six CNVs consisted of three microduplications encompassing SHOX and some of the CNEs, two microduplications in the SHOX 3'-region affecting one or four of the downstream CNEs, and a microdeletion involving SHOX exon 6b and its neighboring CNE. The amplified DNA fragments of two SHOX-containing duplications were detected at chromosomal regions adjacent to the original positions. The breakpoints of a SHOX-containing duplication resided within Alu repeats. A microduplication encompassing four downstream CNEs was identified in an unaffected father-daughter pair, whereas the other five CNVs were detected in ISS patients. These results suggest that microduplications involving SHOX cause ISS by disrupting the cis-regulatory machinery of this gene and that at least some of microduplications in PAR1 arise from Alu-mediated non-allelic homologous recombination. The pathogenicity of other rare PAR1-linked CNVs, such as CNE-containing microduplications and exon 6b-flanking microdeletions, merits further investigation.
Hu, H M; Chuang, C K; Lee, M J; Tseng, T C; Tang, T K
2000-11-01
We previously reported two novel testis-specific serine/threonine kinases, Aie1 (mouse) and AIE2 (human), that share high amino acid identities with the kinase domains of fly aurora and yeast Ipl1. Here, we report the entire intron-exon organization of the Aie1 gene and analyze the expression patterns of Aie1 mRNA during testis development. The mouse Aie1 gene spans approximately 14 kb and contains seven exons. The sequences of the exon-intron boundaries of the Aie1 gene conform to the consensus sequences (GT/AG) of the splicing donor and acceptor sites of most eukaryotic genes. Comparative genomic sequencing revealed that the gene structure is highly conserved between mouse Aie1 and human AIE2. However, much less homology was found in the sequence outside the kinase-coding domains. The Aie1 locus was mapped to mouse chromosome 7A2-A3 by fluorescent in situ hybridization. Northern blot analysis indicates that Aie1 mRNA likely is expressed at a low level on day 14 and reaches its plateau on day 21 in the developing postnatal testis. RNA in situ hybridization indicated that the expression of the Aie1 transcript was restricted to meiotically active germ cells, with the highest levels detected in spermatocytes at the late pachytene stage. These findings suggest that Aie1 plays a role in spermatogenesis.
Drögemüller, Cord; Philipp, Ute; Haase, Bianca; Günzel-Apel, Anne-Rose; Leeb, Tosso
2007-01-01
Coat color dilution in several breeds of dog is characterized by a specific pigmentation phenotype and sometimes accompanied by hair loss and recurrent skin inflammation, the so-called color dilution alopecia or black hair follicular dysplasia. Coat color dilution (d) is inherited as a Mendelian autosomal recessive trait. In a previous study, MLPH polymorphisms showed perfect cosegregation with the dilute phenotype within breeds. However, different dilute haplotypes were found in different breeds, and no single polymorphism was identified in the coding sequence that was likely to be causative for the dilute phenotype. We resequenced the 5'-region of the canine MLPH gene and identified a strong candidate single nucleotide polymorphism within the nontranslated exon 1, which showed perfect association to the dilute phenotype in 65 dilute dogs from 7 different breeds. The A/G polymorphism is located at the last nucleotide of exon 1 and the mutant A-allele is predicted to reduce splicing efficiency 8-fold. An MLPH mRNA expression study using quantitative reverse transcriptase-polymerase chain reaction confirmed that dd animals had only about approximately 25% of the MLPH transcript compared with DD animals. These results provide preliminary evidence that the reported regulatory MLPH mutation might represent a causal mutation for coat color dilution in dogs.
Lisboa, Bianca Cristina Garcia; Machado, Tamara da Rocha; Pimenta, Daniel Carvalho; Han, Sang Won
2007-02-01
Human cytidine deaminase (HCD) catalyzes the deamination of cytidine or deoxycytidine to uridine or deoxyuridine, respectively. The genomic sequence of HCD is formed by 31 kb with 4 exons and several alternative splicing signals, but an alternative form of HCD has yet to be reported. Here we describe the cloning and characterization of a small form of HCD, HSCD, and it is likely to be a product of alternative splicing of HCD. The alignment of DNA sequences shows that the HSCD matches HCD in 2 parts, except for a deletion of 170 bp. Based on the HCD genome organization, exons 1 and 4 should be joined and all sequences of introns and exons 2 and 3 should be deleted by splicing. This alternative splicing shifted the translation of the reading frame from the point of splicing. The estimated molecular mass is 9.8 kDa, and this value was confirmed by Western blot and mass spectroscopy after expressing the gene fused with glutathionine-S-transferase in the pGEX vector. The deletion and shift of the reading frame caused a loss of HCD activity, which was confirmed by enzyme assay and also with NIH3T3 cells modified to express HSCD and challenged against cytosine arabinoside. In this work we describe the identification and characterization of HSCD, which is the product of alternative splicing of the HCD gene.
An Enhancer Near ISL1 and an Ultraconserved Exon of PCBP2 areDerived from a Retroposon
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bejerano, Gill; Lowe, Craig; Ahituv, Nadav
2005-11-27
Hundreds of highly conserved distal cis-regulatory elementshave been characterized to date in vertebrate genomes1. Many thousandsmore are predicted based on comparative genomics2,3. Yet, in starkcontrast to the genes they regulate, virtually none of these regions canbe traced using sequence similarity in invertebrates, leaving theirevolutionary origin obscure. Here we show that a class of conserved,primarily non-coding regions in tetrapods originated from a novel shortinterspersed repetitive element (SINE) retroposon family that was activein Sarcopterygii (lobe-finned fishes and terrestrial vertebrates) in theSilurian at least 410 Mya4, and, remarkably, appears to be recentlyactive in the "living fossil" Indonesian coelacanth, Latimeriamenadoensis. We show that onemore » copy is a distal enhancer, located 500kbfrom the neuro-developmental gene ISL1. Several others represent new,possibly regulatory, alternatively spliced exons in the middle ofpre-existing Sarcopterygian genes. One of these is the>200bpultraconserved region5, 100 percent identical in mammals, and 80 percentidentical to the coelacanth SINE, that contains a 31aa alternativelyspliced exon of the mRNA processing gene PCBP26. These add to a growinglist of examples7 in which relics of transposable elements have acquireda function that serves their host, a process termed "exaptation"8, andprovide an origin for at least some of the highly-conservedvertebrate-specific genomic sequences recently discovered usingcomparative genomics.« less
Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martin, L.H.; Calabi, F.; Lefebvre, F.A.
1987-12-01
The CD1 human antigens are a family of at least three components, CD1a, CD1b, and CD1c, that are characteristic of the cortical stage of thymocyte maturation. CD1a was originally named HTA1 or T6 and thought to be the human equivalent of mouse Tla. The genes coding for all three have not been identified by transfection into mouse cells. The transfectants express the surface antigens that can then be recognized by the corresponding cluster of monoclonal antibodies used to define the three members of CD1. The full sequence of the genomic DNA is described for all three. The intron-exon structure ofmore » CD1a is deduced by comparison with a near-full-length cDNA clone. Similar structures are proposed for the other two, largely based on sequence homology. An unusually long 5'-untranslated exon (280 bases long) is highly conserved between the three genes, suggesting an important but unknown function. CD1c has a duplicated form of this exon that is thought to be spliced out. The major homology between the three antigens is in the ..beta../sub 2/-microglobulin-binding-domain. The general relatedness to major histocompatibility complex class I and class II molecules is significant but low, with no section of higher homology to mouse Tla.« less
Hereditary cancer genes are highly susceptible to splicing mutations
Soemedi, Rachel; Maguire, Samantha; Murray, Michael F.; Monaghan, Sean F.
2018-01-01
Substitutions that disrupt pre-mRNA splicing are a common cause of genetic disease. On average, 13.4% of all hereditary disease alleles are classified as splicing mutations mapping to the canonical 5′ and 3′ splice sites. However, splicing mutations present in exons and deeper intronic positions are vastly underreported. A recent re-analysis of coding mutations in exon 10 of the Lynch Syndrome gene, MLH1, revealed an extremely high rate (77%) of mutations that lead to defective splicing. This finding is confirmed by extending the sampling to five other exons in the MLH1 gene. Further analysis suggests a more general phenomenon of defective splicing driving Lynch Syndrome. Of the 36 mutations tested, 11 disrupted splicing. Furthermore, analyzing past reports suggest that MLH1 mutations in canonical splice sites also occupy a much higher fraction (36%) of total mutations than expected. When performing a comprehensive analysis of splicing mutations in human disease genes, we found that three main causal genes of Lynch Syndrome, MLH1, MSH2, and PMS2, belonged to a class of 86 disease genes which are enriched for splicing mutations. Other cancer genes were also enriched in the 86 susceptible genes. The enrichment of splicing mutations in hereditary cancers strongly argues for additional priority in interpreting clinical sequencing data in relation to cancer and splicing. PMID:29505604
Wise, Carol A.; Chiang, Lydia C.; Paznekas, William A.; Sharma, Mridula; Musy, Maurice M.; Ashley, Jennifer A.; Lovett, Michael; Jabs, Ethylin W.
1997-01-01
Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development. PMID:9096354
Gruber, Barry L.; Couto, Ana Rita; Armas, Jácome Bruges; Brown, Matthew A.; Finzel, Kathleen; Terkeltaub, Robert A.
2015-01-01
This report describes a 32-year-old woman presenting since childhood with progressive calcium pyrophosphate disease (CPPD), characterized by severe arthropathy and chondrocalcinosis involving multiple peripheral joints and intervertebral disks. Because ANKH mutations have been previously described in familial CPPD, the proband’s DNA was assessed at this locus by direct sequencing of promoter and coding regions and revealed 3 sequence variants in ANKH. Sequences of exon 1 revealed a novel isolated nonsynonymous mutation (c.13 C>T), altering amino acid in codon 5 from proline to serine (CCG>TCG). Sequencing of parental DNA revealed an identical mutation in the proband’s father but not the mother. Subsequent clinical evaluation demonstrated extensive chondrocalcinosis and degenerative arthropathy in the proband’s father. In summary, we report a novel mutation, not previously described, in ANKH exon 1, wherein serine replaces proline, in a case of early-onset severe CPPD associated with metabolic abnormalities, with similar findings in the proband’s father. PMID:22647861
Gruber, Barry L; Couto, Ana Rita; Armas, Jácome Bruges; Brown, Matthew A; Finzel, Kathleen; Terkeltaub, Robert A
2012-06-01
This report describes a 32-year-old woman presenting since childhood with progressive calcium pyrophosphate disease (CPPD), characterized by severe arthropathy and chondrocalcinosis involving multiple peripheral joints and intervertebral disks. Because ANKH mutations have been previously described in familial CPPD, the proband's DNA was assessed at this locus by direct sequencing of promoter and coding regions and revealed 3 sequence variants in ANKH. Sequences of exon 1 revealed a novel isolated nonsynonymous mutation (c.13 C>T), altering amino acid in codon 5 from proline to serine (CCG>TCG). Sequencing of parental DNA revealed an identical mutation in the proband's father but not the mother. Subsequent clinical evaluation demonstrated extensive chondrocalcinosis and degenerative arthropathy in the proband's father. In summary, we report a novel mutation, not previously described, in ANKH exon 1, wherein serine replaces proline, in a case of early-onset severe CPPD associated with metabolic abnormalities, with similar findings in the proband's father.
Gonzalez, Francisco; Loidi, Lourdes; Abalo-Lojo, Jose M
2017-01-01
Ankyloblepharon-ectodermal dysplasia-cleft lip/palate (AEC) syndrome is a disorder resulting from anomalous embryonic development of ectodermal tissues. There is evidence that AEC syndrome is caused by mutations in the TP63 gene, which encodes the p63 protein. This is an important regulatory protein involved in epidermal proliferation and differentiation. Genome sequencing was performed in DNA from peripheral blood leukocytes of a newborn with AEC syndrome and her parents. Variants were searched in all coding exons and intron-exon boundaries of the TP63 gene. A heterozygous missense variant (NM_003722.4:c.1063G>C (p.Asp355His) was found in the newborn patient. No variants were found in either of the parents. We identified a previously unreported variant in TP63 gene which seems to be involved in the somatic malformations found in the AEC syndrome. The absence of this variant in both parents suggests that the variant appeared de novo.
Novel mutations in the STK11 gene in Thai patients with Peutz-Jeghers syndrome
Ausavarat, Surasawadee; Leoyklang, Petcharat; Vejchapipat, Paisarn; Chongsrisawat, Voranush; Suphapeetiporn, Kanya; Shotelersuk, Vorasuk
2009-01-01
Peutz-Jeghers syndrome (PJS), a rare autosomal dominant inherited disorder, is characterized by hamartomatous gastrointestinal polyps and mucocutaneous pigmentation. Patients with this syndrome have a predisposition to a variety of cancers in multiple organs. Mutations in the serine/threonine kinase 11 (STK11) gene have been identified as a major cause of PJS. Here we present the clinical and molecular findings of two unrelated Thai individuals with PJS. Mutation analysis by Polymerase Chain Reaction-sequencing of the entire coding region of STK11 revealed two potentially pathogenic mutations. One harbored a single nucleotide deletion (c.182delG) in exon 1 resulting in a frameshift leading to premature termination at codon 63 (p.Gly61AlafsX63). The other carried an in-frame 9-base-pair (bp) deletion in exon 7, c.907_915del9 (p.Ile303_Gln305del). Both deletions were de novo and have never been previously described. This study has expanded the genotypic spectrum of the STK11 gene. PMID:19908348
Defesche, J C; Lansberg, P J; Reymer, P W; Lamping, R J; Kastelein, J J
1993-02-01
Familial hypercholesterolaemia (FH) is the most common genetic metabolic disorder, affecting about 1 in 500 persons in the general population. With novel techniques, it is possible to identify the molecular defects underlying FH in the gene coding for the low-density lipoprotein (LDL) receptor, thereby confirming the diagnosis of FH. In this study we present a large family with a specific mutation in exon 9 of the LDL-receptor gene (an Afrikaner mutation) and we demonstrate that by a large-scale case-finding study in this family, carriers of such a mutation can be detected. Of 63 family members, 13 were shown to be at risk for cardiovascular disease as judged by their lipoprotein profile or the presence of the Afrikaner mutation. Two persons were detected, affected with a dyslipidaemia other than FH. Medical management was initiated in order to reduce the high cardiovascular risk associated with this disorder.
Structural and molecular basis of mismatch correction and ribavirin excision from coronavirus RNA.
Ferron, François; Subissi, Lorenzo; Silveira De Morais, Ana Theresa; Le, Nhung Thi Tuyet; Sevajol, Marion; Gluais, Laure; Decroly, Etienne; Vonrhein, Clemens; Bricogne, Gérard; Canard, Bruno; Imbert, Isabelle
2018-01-09
Coronaviruses (CoVs) stand out among RNA viruses because of their unusually large genomes (∼30 kb) associated with low mutation rates. CoVs code for nsp14, a bifunctional enzyme carrying RNA cap guanine N7-methyltransferase (MTase) and 3'-5' exoribonuclease (ExoN) activities. ExoN excises nucleotide mismatches at the RNA 3'-end in vitro, and its inactivation in vivo jeopardizes viral genetic stability. Here, we demonstrate for severe acute respiratory syndrome (SARS)-CoV an RNA synthesis and proofreading pathway through association of nsp14 with the low-fidelity nsp12 viral RNA polymerase. Through this pathway, the antiviral compound ribavirin 5'-monophosphate is significantly incorporated but also readily excised from RNA, which may explain its limited efficacy in vivo. The crystal structure at 3.38 Å resolution of SARS-CoV nsp14 in complex with its cofactor nsp10 adds to the uniqueness of CoVs among RNA viruses: The MTase domain presents a new fold that differs sharply from the canonical Rossmann fold.
Echigoya, Yusuke; Mouly, Vincent; Garcia, Luis; Yokota, Toshifumi; Duddy, William
2015-01-01
The use of antisense ‘splice-switching’ oligonucleotides to induce exon skipping represents a potential therapeutic approach to various human genetic diseases. It has achieved greatest maturity in exon skipping of the dystrophin transcript in Duchenne muscular dystrophy (DMD), for which several clinical trials are completed or ongoing, and a large body of data exists describing tested oligonucleotides and their efficacy. The rational design of an exon skipping oligonucleotide involves the choice of an antisense sequence, usually between 15 and 32 nucleotides, targeting the exon that is to be skipped. Although parameters describing the target site can be computationally estimated and several have been identified to correlate with efficacy, methods to predict efficacy are limited. Here, an in silico pre-screening approach is proposed, based on predictive statistical modelling. Previous DMD data were compiled together and, for each oligonucleotide, some 60 descriptors were considered. Statistical modelling approaches were applied to derive algorithms that predict exon skipping for a given target site. We confirmed (1) the binding energetics of the oligonucleotide to the RNA, and (2) the distance in bases of the target site from the splice acceptor site, as the two most predictive parameters, and we included these and several other parameters (while discounting many) into an in silico screening process, based on their capacity to predict high or low efficacy in either phosphorodiamidate morpholino oligomers (89% correctly predicted) and/or 2’O Methyl RNA oligonucleotides (76% correctly predicted). Predictions correlated strongly with in vitro testing for sixteen de novo PMO sequences targeting various positions on DMD exons 44 (R2 0.89) and 53 (R2 0.89), one of which represents a potential novel candidate for clinical trials. We provide these algorithms together with a computational tool that facilitates screening to predict exon skipping efficacy at each position of a target exon. PMID:25816009
Mucha, L; Leidig-Bruckner, G; Frank-Raue, K; Bruckner, Th; Kroiss, M; Raue, F
2017-10-01
We describe phaeochromocytoma (phaeo) penetrance in multiple endocrine neoplasia type 2 (MEN2) according to RET protooncogene-specific mutations and report changes in phaeo diagnosis and management from 1968 to 2015. This retrospective chart review included 309 MEN2 patients from one specialized ambulatory care centre. Phaeo patients were categorized by diagnosis date: early, 1968-1996, n=40, and recent, 1997-2015, n=45. Phaeochromocytoma was diagnosed in 85/309 patients with RET mutations in the following exons (phaeos/all carriers, %): exon 11 (56/120, 46.6%); exon 16 (7/17, 41.2%), exon 10 (14/47, 29.8%), and exon 13-15 (2/116, 1.7%). Age at phaeo diagnosis differed according to affected exon: 21.9±1.5 years, exon 16; 34.1±11.6 years, exon 11; and 41.8±8.8 years, exon 10. Age-related phaeo penetrance differed among five amino acid substitutions at codon 634 and was highest for Cys634Arg and Cys634Tyr. Age at diagnosis was 34.4±11.6 years in the early and recent groups. Phaeochromocytoma and medullary thyroid carcinoma (MTC) were diagnosed synchronously in 21/40 (early) vs 8/45 (recent) and metachronously in 19/40 vs 37/45 cases. Diagnostic methods significantly changed from clinical (22/40 vs 4/45) to biochemical and/or imaging based (14/40 vs 35/45). Phaeochromocytoma diameter at diagnosis was 4.6 vs 2.6 cm. Phaeochromocytoma penetrance and age of diagnosis are highly correlated with MTC aggressiveness based on RET mutation status, with higher penetrance and younger age of diagnosis associated with more aggressive MTC. Penetrance steadily increases with age. At-risk patients require lifelong follow-up. © 2017 John Wiley & Sons Ltd.
Histone Code Modulation by Oncogenic PWWP-Domain Protein in Breast Cancers
2010-06-01
athanogene 4 * DDHD2 DDHD domain containing 2 * PPAPDC1B phosphatidic acid phosphatase type 2 domain containing 1B * WHSC1L1 Wolf-Hirschhorn syndrome...from alternative splicing of exon 10. The WHSC1L1 long isoform encodes a 1437 amino acid protein containing 2 PWWP domains, 2 PHD-type zinc finger...motifs, a TANG2 domain, an AWS domain and a SET domain. The short isoform encodes a 645 amino acid protein containing a PWWP domain only. Our western
Chen, L P; E, G X; Zhao, Y J; Na, R S; Zhao, Z Q; Zhang, J H; Ma, Y H; Sun, Y W; Zhong, T; Zhang, H P; Huang, Y F
2015-06-18
DRA encodes the alpha chain of the DR heterodimer, is closely linked to DRB and is considered almost monomorphic in major histocompatibility complex region. In this study, we identified the exon 2 of DRA to evaluate the immunogenetic diversity of Chinese south indigenous goat. Two single nucleotide polymorphisms in an untranslated region and one synonymous substitution in coding region were identified. These data suggest that high immunodiversity in native Chinese population.
Splicing of designer exons informs a biophysical model for exon definition
Arias, Mauricio A.; Chasin, Lawrence A.
2015-01-01
Pre-mRNA molecules in humans contain mostly short internal exons flanked by longer introns. To explain the removal of such introns, exon recognition instead of intron recognition has been proposed. We studied this exon definition using designer exons (DEs) made up of three prototype modules of our own design: an exonic splicing enhancer (ESE), an exonic splicing silencer (ESS), and a Reference Sequence (R) predicted to be neither. Each DE was examined as the central exon in a three-exon minigene. DEs made of R modules showed a sharp size dependence, with exons shorter than 14 nt and longer than 174 nt splicing poorly. Changing the strengths of the splice sites improved longer exon splicing but worsened shorter exon splicing, effectively displacing the curve to the right. For the ESE we found, unexpectedly, that its enhancement efficiency was independent of its position within the exon. For the ESS we found a step-wise positional increase in its effects; it was most effective at the 3′ end of the exon. To apply these results quantitatively, we developed a biophysical model for exon definition of internal exons undergoing cotranscriptional splicing. This model features commitment to inclusion before the downstream exon is synthesized and competition between skipping and inclusion fates afterward. Collision of both exon ends to form an exon definition complex was incorporated to account for the effect of size; ESE/ESS effects were modeled on the basis of stabilization/destabilization. This model accurately predicted the outcome of independent experiments on more complex DEs that combined ESEs and ESSs. PMID:25492963
Grzes, M; Nowacka-Woszuk, J; Szczerbal, I; Czerwinska, J; Gracz, J; Switonski, M
2009-01-01
The gene encoding myostatin (MSTN), due to its crucial function for growth of skeletal muscle mass, is an important candidate for muscularity. In this study we analyzed the nucleotide sequence and FISH localization of this gene in 4 canids, including 3 farm species. The nucleotide sequence of the MSTN coding fragment turned out to be highly conserved, since its identity among the studied species was very high and varied between 99.4 and 99.7%. Only 1, widely spread, silent single nucleotide polymorphism (SNP) was found in exon 1 of the Chinese raccoon dog. The MSTN gene was localized close to the centromere in one-armed chromosomes of the dog (37q11) and bi-armed chromosomes of the red fox (16p11) and arctic fox (10q11), with an exception of the Chinese raccoon dog chromosome (2q14-q21). This chromosome is orthologous to 3 canine chromosomes and thus the MSTN was found more interstitially. Our results are in agreement with the hypothesis that karyotypes of the canids evolved mainly through centric fusion/fission events, while tandem fusions occurred rarely. (c) 2009 S. Karger AG, Basel.
A computational search for box C/D snoRNA genes in the Drosophila melanogaster genome.
Accardo, M C; Giordano, E; Riccardo, S; Digilio, F A; Iazzetti, G; Calogero, R A; Furia, M
2004-12-12
In eukaryotes, the family of non-coding RNA genes includes a number of genes encoding small nucleolar RNAs (mainly C/D and H/ACA snoRNAs), which act as guides in the maturation or post-transcriptional modifications of target RNA molecules. Since in Drosophila melanogaster (Dm) only few examples of snoRNAs have been identified so far by cDNA libraries screening, integration of the molecular data with in silico identification of these types of genes could throw light on their organization in the Dm genome. We have performed a computational screening of the Dm genome for C/D snoRNA genes, followed by experimental validation of the putative candidates. Few of the 26 confirmed snoRNAs had been recognized by cDNA library analysis. Organization of the Dm genome was also found to be more variegated than previously suspected, with snoRNA genes nested in both the introns and exons of protein-coding genes. This finding suggests that the presence of additional mechanisms of snoRNA biogenesis based on the alternative production of overlapping mRNA/snoRNA molecules. Additional information is available at http://www.bioinformatica.unito.it/bioinformatics/snoRNAs.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gehrmann, Thies; Pelkmans, Jordi F.; Lugones, Luis G.
Recent genome-wide studies have demonstrated that fungi possess the machinery to alternatively splice pre-mRNA. However, there has not been a systematic categorization of the functional impact of alternative splicing in a fungus. We investigate alternative splicing and its functional consequences in the model mushroom forming fungus Schizophyllum commune. Alternative splicing was demonstrated for 2,285 out of 12,988 expressed genes, resulting in 20% additional transcripts. Intron retentions were the most common alternative splicing events, accounting for 33% of all splicing events, and 43% of the events in coding regions. On the other hand, exon skipping events were rare in coding regionsmore » (1%) but enriched in UTRs where they accounted for 57% of the events. Specific functional groups, including transcription factors, contained alternatively spliced genes. Alternatively spliced transcripts were regulated differently throughout development in 19% of the 2,285 alternatively spliced genes. Notably, 69% of alternatively spliced genes have predicted alternative functionality by loss or gain of functional domains, or by acquiring alternative subcellular locations. S. commune exhibits more alternative splicing than any other studied fungus. Finally, taken together, alternative splicing increases the complexity of the S. commune proteome considerably and provides it with a rich repertoire of alternative functionality that is exploited dynamically.« less
Nanoscale studies link amyloid maturity with polyglutamine diseases onset
NASA Astrophysics Data System (ADS)
Ruggeri, F. S.; Vieweg, S.; Cendrowska, U.; Longo, G.; Chiki, A.; Lashuel, H. A.; Dietler, G.
2016-08-01
The presence of expanded poly-glutamine (polyQ) repeats in proteins is directly linked to the pathogenesis of several neurodegenerative diseases, including Huntington’s disease. However, the molecular and structural basis underlying the increased toxicity of aggregates formed by proteins containing expanded polyQ repeats remain poorly understood, in part due to the size and morphological heterogeneity of the aggregates they form in vitro. To address this knowledge gap and technical limitations, we investigated the structural, mechanical and morphological properties of fibrillar aggregates at the single molecule and nanometer scale using the first exon of the Huntingtin protein as a model system (Exon1). Our findings demonstrate a direct correlation of the morphological and mechanical properties of Exon1 aggregates with their structural organization at the single aggregate and nanometric scale and provide novel insights into the molecular and structural basis of Huntingtin Exon1 aggregation and toxicity.
The mini-exon genes of three Phytomonas isolates that differ in plant tissue tropism.
Sturm, N R; Fernandes, O; Campbell, D A
1995-08-01
The tandem mini-exon gene repeat is an ideal diagnostic target for trypanosomatids because it includes sequences that are conserved absolutely coupled with regions of extreme variability. We have exploited these features and the polymerase chain reaction to differentiate Phytomonas strains isolated from phloem, fruit or latex of various host plants. While the transcribed regions are nearly identical, the intergenic sequences are variable in size and content (130-332 base pairs). The mini-exon genes of these phytomonads can therefore be distinguished from each other and from the corresponding genes in insect trypanosomes, with which they are oft confused.
Hutcheson, Kelly A; Paluru, Prasuna C; Bernstein, Steven L; Koh, Jamie; Rappaport, Eric F; Leach, Richard A; Young, Terri L
2005-07-14
Retinopathy of prematurity (ROP) is a leading cause of visual loss in the pediatric population. Mutations in the Norrie disease gene (NDP) are associated with heritable retinal vascular disorders, and have been found in a small subset of patients with severe retinopathy of prematurity. Varying rates of progression to threshold disease in different races may have a genetic basis, as recent studies suggest that the incidence of NDP mutations may vary in different groups. African Americans, for example, are less likely to develop severe degrees of ROP. We screened a large cohort of ethnically diverse patients for mutations in the entire NDP. A total of 143 subjects of different ethnic backgrounds were enrolled in the study. Fifty-four patients had severe ROP (Stage 3 or worse). Of these, 38 were threshold in at least one eye (with a mean gestational age of 26.1 weeks and mean birth weight of 788.4 g). There were 36 patients with mild or no ROP, 31 parents with no history of retinal disease or prematurity, and 22 wild type (normal) controls. There were 70 African American subjects, 55 Caucasians, and 18 of other races. Severe ROP was noted in 29 African American subjects, 17 Caucasians, and 8 of other races. Seven polymerase chain reaction primer pairs spanning the NDP were optimized for denaturing high performance liquid chromatography and direct sequencing. Three primer pairs covered the coding region, and the remaining four spanned the 3' and 5' untranslated regions (UTR). Six of 54 (11%) infants with severe ROP had polymorphisms in the NDP. Five of the infants were African American, and one was Caucasian. Two parents were heterozygous for the same polymorphism as their child. One parent-child pair had a single base pair (bp) insertion in the 3' UTR region. Another parent-child pair had two mutations: a 14 bp deletion in the 5' UTR region of exon 1 and a single nucleotide polymorphism in the 5' UTR region of exon 2. No coding region sequence changes were found. No polymorphisms were observed in infants with mild or no ROP, or in the wild type controls. Of the six sequence alterations found, five were novel nucleotide changes: One in the 5' UTR region of exon 2, and four in the 3' UTR region of exon 3. The extent of NDP polymorphisms in this large, racially diverse group of infants is moderate. NDP polymorphisms may play a role in the pathogenesis of ROP, but do not appear to be a major causative factor.
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-01-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield. PMID:25333064
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-09-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield.
Premraj, Avinash; Nautiyal, Binita; Aleyas, Abi G; Rasool, Thaha Jamal
2015-10-01
Interleukin-26 (IL-26) is a member of the IL-10 family of cytokines. Though conserved across vertebrates, the IL-26 gene is functionally inactivated in a few mammals like rat, mouse and horse. We report here the identification, isolation and cloning of the cDNA of IL-26 from the dromedary camel. The camel cDNA contains a 516 bp open reading frame encoding a 171 amino acid precursor protein, including a 21 amino acid signal peptide. Sequence analysis revealed high similarity with other mammalian IL-26 homologs and the conservation of IL-10 cytokine family domain structure including key amino acid residues. We also report the identification and cloning of four novel transcript variants produced by alternative splicing at the Exon 3-Exon 4 regions of the gene. Three of the alternative splice variants had premature termination codons and are predicted to code for truncated proteins. The transcript variant 4 (Tv4) having an insertion of an extra 120 bp nucleotides in the ORF was predicted to encode a full length protein product with 40 extra amino acid residues. The mRNA transcripts of all the variants were identified in lymph node, where as fewer variants were observed in other tissues like blood, liver and kidney. The expression of Tv2 and Tv3 were found to be up regulated in mitogen induced camel peripheral blood mononuclear cells. IL-26-Tv2 expression was also induced in camel fibroblast cells infected with Camel pox virus in-vitro. The identification of the transcript variants of IL-26 from the dromedary camel is the first report of alternative splicing for IL-26 in a species in which the gene has not been inactivated. Copyright © 2015 Elsevier Ltd. All rights reserved.
Clinical and genetic features of cervical dystonia in a large multicenter cohort
Vemula, Satya R.; Xiao, Jianfeng; Thompson, Misty M.; Perlmutter, Joel S.; Wright, Laura J.; Jinnah, H.A.; Rosen, Ami R.; Hedera, Peter; Comella, Cynthia L.; Weissbach, Anne; Junker, Johanna; Jankovic, Joseph; Barbano, Richard L.; Reich, Stephen G.; Rodriguez, Ramon L.; Berman, Brian D.; Chouinard, Sylvain; Severt, Lawrence; Agarwal, Pinky; Stover, Natividad P.
2016-01-01
Objective: To characterize the clinical and genetic features of cervical dystonia (CD). Methods: Participants enrolled in the Dystonia Coalition biorepository (NCT01373424) with initial manifestation as CD were included in this study (n = 1,000). Data intake included demographics, family history, and the Global Dystonia Rating Scale. Participants were screened for sequence variants (SVs) in GNAL, THAP1, and Exon 5 of TOR1A. Results: The majority of participants were Caucasian (95%) and female (75%). The mean age at onset and disease duration were 45.5 ± 13.6 and 14.6 ± 11.8 years, respectively. At the time of assessment, 68.5% had involvement limited to the neck, shoulder(s), and proximal arm(s), whereas 47.4% had dystonia limited to the neck. The remaining 31.5% of the individuals exhibited more extensive anatomical spread. A head tremor was noted in 62% of the patients. Head tremor and laryngeal dystonia were more common in females. Psychiatric comorbidities, mainly depression and anxiety, were reported by 32% of the participants and were more common in females. Family histories of dystonia, parkinsonian disorder, and tremor were present in 14%, 11%, and 29% of the patients, respectively. Pathogenic or likely pathogenic SVs in THAP1, TOR1A, and GNAL were identified in 8 participants (0.8%). Two individuals harbored novel missense SVs in Exon 5 of TOR1A. Synonymous and noncoding SVs in THAP1 and GNAL were identified in 4% of the cohort. Conclusions: Head tremor, laryngeal dystonia, and psychiatric comorbidities are more common in female participants with CD. Coding and noncoding variants in GNAL, THAP1, and TOR1A make small contributions to the pathogenesis of CD. PMID:27123488
Roth, Melissa S; Cokus, Shawn J; Gallaher, Sean D; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A; Merchant, Sabeeha S; Pellegrini, Matteo; Niyogi, Krishna K
2017-05-23
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis , because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase ( BKT ), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.
Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; ...
2017-05-08
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Cost-effective sequencing of full-length cDNA clones powered by a de novo-reference hybrid assembly.
Kuroshu, Reginaldo M; Watanabe, Junichi; Sugano, Sumio; Morishita, Shinichi; Suzuki, Yutaka; Kasahara, Masahiro
2010-05-07
Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. We developed a program, MuSICA 2, that assembles millions of short (36-nucleotide) reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence approximately 800 human full-length cDNA clones. MuSICA 2 performs a hybrid assembly in which an external de novo assembler is run first and the result is then improved by reference alignment of shotgun reads. We compared the MuSICA 2 assembly with 200 pooled full-length cDNA clones finished independently by the conventional primer-walking using Sanger sequencers. The exon-intron structure of the coding sequence was correct for more than 95% of the clones with coding sequence annotation when we excluded cDNA clones insufficiently represented in the shotgun library due to PCR failure (42 out of 200 clones excluded), and the nucleotide-level accuracy of coding sequences of those correct clones was over 99.99%. We also applied MuSICA 2 to full-length cDNA clones from Toxoplasma gondii, to confirm that its ability was competent even for non-human species. The entire sequencing and shotgun assembly takes less than 1 week and the consumables cost only approximately US$3 per clone, demonstrating a significant advantage over previous approaches.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A.; Merchant, Sabeeha S.; Pellegrini, Matteo
2017-01-01
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production. PMID:28484037
Another face of the Treacher Collins syndrome (TCOF1) gene: identification of additional exons.
So, Rolando B; Gonzales, Bianca; Henning, Dale; Dixon, Jill; Dixon, Michael J; Valdez, Benigno C
2004-03-17
Treacher Collins syndrome (TCS) is characterized by an abnormality in craniofacial development during early embryogenesis. TCS is caused by mutations in the gene TCOF1, which encodes the nucleolar phosphoprotein treacle. Genetic and proteomic characterizations of TCS/treacle are based on the previously reported 26 exons of TCOF1. Here, we report the identification of 231-nucleotide (nt) exon 6A (between exons 6 and 7) and 108-nt exon 16A (between exons 16 and 17). Isoforms with exon 6A are up to 3.7-fold more abundant than alternatively spliced variants without exon 6A, but only minor isoforms contain exon 16A. Exon 6A encodes a peptide sequence containing basic and acidic domains similar to 10 other exons of TCOF1. Unlike the other exons, exon 6A encodes a nuclear localization signal (NLS) which does not, however, alter the nucleolar localization of full-length treacle. The discovery of exons 6A and 16A is relevant to mutational analysis of the TCOF1 gene in TCS patients, and to functional analysis of its gene product.
Kramer, Marianne C; Liang, Dongming; Tatomer, Deirdre C; Gold, Beth; March, Zachary M; Cherry, Sara; Wilusz, Jeremy E
2015-10-15
Thousands of eukaryotic protein-coding genes are noncanonically spliced to produce circular RNAs. Bioinformatics has indicated that long introns generally flank exons that circularize in Drosophila, but the underlying mechanisms by which these circular RNAs are generated are largely unknown. Here, using extensive mutagenesis of expression plasmids and RNAi screening, we reveal that circularization of the Drosophila laccase2 gene is regulated by both intronic repeats and trans-acting splicing factors. Analogous to what has been observed in humans and mice, base-pairing between highly complementary transposable elements facilitates backsplicing. Long flanking repeats (∼ 400 nucleotides [nt]) promote circularization cotranscriptionally, whereas pre-mRNAs containing minimal repeats (<40 nt) generate circular RNAs predominately after 3' end processing. Unlike the previously characterized Muscleblind (Mbl) circular RNA, which requires the Mbl protein for its biogenesis, we found that Laccase2 circular RNA levels are not controlled by Mbl or the Laccase2 gene product but rather by multiple hnRNP (heterogeneous nuclear ribonucleoprotein) and SR (serine-arginine) proteins acting in a combinatorial manner. hnRNP and SR proteins also regulate the expression of other Drosophila circular RNAs, including Plexin A (PlexA), suggesting a common strategy for regulating backsplicing. Furthermore, the laccase2 flanking introns support efficient circularization of diverse exons in Drosophila and human cells, providing a new tool for exploring the functional consequences of circular RNA expression across eukaryotes. © 2015 Kramer et al.; Published by Cold Spring Harbor Laboratory Press.
Mutation analysis in the long isoform of USH2A in American patients with Usher Syndrome type II.
Yan, Denise; Ouyang, Xiaomei; Patterson, D Michael; Du, Li Lin; Jacobson, Samuel G; Liu, Xue-Zhong
2009-12-01
Usher syndrome type II (USH2) is an autosomal recessive disorder characterized by moderate to severe hearing impairment and progressive visual loss due to retinitis pigmentosa (RP). To identify novel mutations and determine the frequency of USH2A mutations as a cause of USH2, we have carried out mutation screening of all 72 coding exons and exon-intron splice sites of the USH2A gene. A total of 20 USH2 American probands of European descent were analyzed using single strand conformational polymorphism (SSCP) and direct sequencing methods. Ten different USH2A mutations were identified in 55% of the probands, five of which were novel mutations. The detected mutations include three missense, three frameshifts and four nonsense mutations, with c.2299delG/p.E767fs mutation, accounting for 38.9% of the pathological alleles. Two cases were homozygotes, two cases were compound heterozygotes and one case had complex allele with three variants. In seven probands, only one USH2A mutation was detected and no pathological mutation was found in the remaining eight individuals. Altogether, our data support the fact that c.2299delG/p.E767fs is indeed the most common USH2A mutation found in USH2 patients of European Caucasian background. Thus, if screening for mutations in USH2A is considered, it is reasonable to screen for the c.2299delG mutation first.
Li, Chenhong; Riethoven, Jean-Jack M; Naylor, Gavin J P
2012-09-01
Recent innovations in next-generation sequencing have lowered the cost of genome projects. Nevertheless, sequencing entire genomes for all representatives in a study remains expensive and unnecessary for most studies in ecology, evolution and conservation. It is still more cost-effective and efficient to target and sequence single-copy nuclear gene markers for such studies. Many tools have been developed for identifying nuclear markers, but most of these have focused on particular taxonomic groups. We have built a searchable database, EvolMarkers, for developing single-copy coding sequence (CDS) and exon-primed-intron-crossing (EPIC) markers that is designed to work across a broad range of phylogenetic divergences. The database is made up of single-copy CDS derived from BLAST searches of a variety of metazoan genomes. Users can search the database for different types of markers (CDS or EPIC) that are common to different sets of input species with different divergence characteristics. EvolMarkers can be applied to any taxonomic group for which genome data are available for two or more species. We included 82 genomes in the first version of EvolMarkers and have found the methods to be effective across Placozoa, Cnidaria, Arthropod, Nematoda, Annelida, Mollusca, Echinodermata, Hemichordata, Chordata and plants. We demonstrate the effectiveness of searching for CDS markers within annelids and show how to find potentially useful intronic markers within the lizard Anolis. © 2012 Blackwell Publishing Ltd.
Mutation screening of Chinese Treacher Collins syndrome patients identified novel TCOF1 mutations.
Chen, Ying; Guo, Luo; Li, Chen-Long; Shan, Jing; Xu, Hai-Song; Li, Jie-Ying; Sun, Shan; Hao, Shao-Juan; Jin, Lei; Chai, Gang; Zhang, Tian-Yu
2018-04-01
Treacher Collins syndrome (TCS) (OMIM 154500) is a rare congenital craniofacial disorder with an autosomal dominant manner of inheritance in most cases. To date, three pathogenic genes (TCOF1, POLR1D and POLR1C) have been identified. In this study, we conducted mutational analysis on Chinese TCS patients to reveal a mutational spectrum of known causative genes and show phenotype-genotype data to provide more information for gene counselling and future studies on the pathogenesis of TCS. Twenty-two TCS patients were recruited from two tertiary referral centres, and Sanger sequencing for the coding exons and exon-intron boundaries of TCOF1, POLR1D and POLR1C was performed. For patients without small variants, further copy number variations (CNVs) analysis was conducted using high-density SNP array platforms. The Sanger sequencing overall mutation detection rate was as high as 86.3% (19/22) for our cohort. Fifteen TCOF1 pathogenic variants, including ten novel mutations, were identified in nineteen patients. No causative mutations in POLR1D and POLR1C genes and no CNVs mutations were detected. A suspected autosomal dominant inheritance case that implies germinal mosaicism was described. Our study confirmed that TCOF1 was the main disease-causing gene for the Chinese TCS population and revealed its mutation spectrum. We also addressed the need for more studies of mosaicism in TCS cases, which could explain the mechanism of autosomal dominant inheritance in TCS cases and benefit the prevention of TCS.
Livebearing or egg-laying mammals: 27 decisive nucleotides of FAM168.
Pramanik, Subrata; Kutzner, Arne; Heese, Klaus
2017-05-23
In the present study, we determine comprehensive molecular phylogenetic relationships of the novel myelin-associated neurite-outgrowth inhibitor (MANI) gene across the entire eukaryotic lineage. Combined computational genomic and proteomic sequence analyses revealed MANI as one of the two members of the novel family with sequence similarity 168 member (FAM168) genes, consisting of FAM168A and FAM168B, having distinct genetic differences that illustrate diversification in its biological function and genetic taxonomy across the phylogenetic tree. Phylogenetic analyses based on coding sequences of these FAM168 genes revealed that they are paralogs and that the earliest emergence of these genes occurred in jawed vertebrates such as Callorhinchus milii. Surprisingly, these two genes are absent in other chordates that have a notochord at some stage in their lives, such as branchiostoma and tunicates. In the context of phylogenetic relationships among eukaryotic species, our results demonstrate the presence of FAM168 orthologs in vertebrates ranging from Callorhinchus milii to Homo sapiens, displaying distinct taxonomic clusters, comprised of fish, amphibians, reptiles, birds, and mammals. Analyses of individual FAM168 exons in our sample provide new insights into the molecular relationships between FAM168A and FAM168B (MANI) on the one hand and livebearing and egg-laying mammals on the other hand, demonstrating that a distinctive intermediate exon 4, comprised of 27 nucleotides, appears suddenly only in FAM168A and there in the livebearing mammals only but is absent from all other species including the egg-laying mammals.
Bouhouche, A; Benomar, A; Bouslam, N; Chkili, T; Yahyaoui, M
2006-05-01
Mutilating sensory neuropathy with spastic paraplegia is a very rare disease with both autosomal dominant and recessive modes of inheritance. We previously mapped the locus of the autosomal recessive form to a 25 cM interval between markers D5S2048 and D5S648 on chromosome 5p. In this candidate interval, the Cct5 gene encoding the epsilon subunit of the cytosolic chaperonin-containing t-complex peptide-1 (CCT) was the most obvious candidate gene since mutation in the Cct4 gene encoding the CCT delta subunit has been reported to be associated with autosomal recessive mutilating sensory neuropathy in mutilated foot (mf) rat mutant. A consanguineous Moroccan family with four patients displaying mutilating sensory neuropathy associated with spastic paraplegia was investigated. To identify the disease causing gene, the 11 coding exons of the Cct5 gene were screened for mutations by direct sequencing in all family members including the four patients, parents, and six at risk relatives. Sequence analysis of the Cct5 gene revealed a missense A492G mutation in exon 4 that results in the substitution of a highly conserved histidine for arginine amino acid 147. Interestingly, R147 was absent in 384 control matched chromosomes tested. This is the first disease causing mutation that has been identified in the human CCT subunit genes; the mf rat mutant could serve as an animal model for studying these chaperonopathies.
Investigation of the role of TCF4 rare sequence variants in schizophrenia.
Basmanav, F Buket; Forstner, Andreas J; Fier, Heide; Herms, Stefan; Meier, Sandra; Degenhardt, Franziska; Hoffmann, Per; Barth, Sandra; Fricker, Nadine; Strohmaier, Jana; Witt, Stephanie H; Ludwig, Michael; Schmael, Christine; Moebus, Susanne; Maier, Wolfgang; Mössner, Rainald; Rujescu, Dan; Rietschel, Marcella; Lange, Christoph; Nöthen, Markus M; Cichon, Sven
2015-07-01
Transcription factor 4 (TCF4) is one of the most robust of all reported schizophrenia risk loci and is supported by several genetic and functional lines of evidence. While numerous studies have implicated common genetic variation at TCF4 in schizophrenia risk, the role of rare, small-sized variants at this locus-such as single nucleotide variants and short indels which are below the resolution of chip-based arrays requires further exploration. The aim of the present study was to investigate the association between rare TCF4 sequence variants and schizophrenia. Exon-targeted resequencing was performed in 190 German schizophrenia patients. Six rare variants at the coding exons and flanking sequences of the TCF4 gene were identified, including two missense variants and one splice site variant. These six variants were then pooled with nine additional rare variants identified in 379 European participants of the 1000 Genomes Project, and all 15 variants were genotyped in an independent German sample (n = 1,808 patients; n = 2,261 controls). These data were then analyzed using six statistical methods developed for the association analysis of rare variants. No significant association (P < 0.05) was found. However, the results from our association and power analyses suggest that further research into the possible involvement of rare TCF4 sequence variants in schizophrenia risk is warranted by the assessment of larger cohorts with higher statistical power to identify rare variant associations. © 2015 Wiley Periodicals, Inc.
Nakamura, Haruhiko; Koizumi, Hirotaka; Kimura, Hiroyuki; Marushima, Hideki; Saji, Hisashi; Takagi, Masayuki
2016-09-01
Epidermal growth factor receptor (EGFR) mutation rates in adenocarcinoma in situ (AIS) and minimally invasive adenocarcinoma (MIA) were studied using both DNA analysis and mutation-specific immunohistochemistry. The peptide nucleic acid-locked nucleic acid polymerase chain reaction clamp method was used to detect mutations in exons 18, 19, 20, and 21 of the EGFR gene in DNA samples extracted from paraffin-embedded tissue sections. Simultaneously, immunohistochemical analysis with two EGFR mutation-specific monoclonal antibodies was used to identify proteins resulting from an in-frame deletion in exon 19 (E746_A750del) and a point mutation replacing leucine with arginine at codon 858 of exon 21 (L858R). Forty-three tumors (22 AIS and 21 MIA) were examined. The EGFR mutation rate in AIS detected by DNA analysis was 27.3% (L858R, 5/22; exon 19 deletion,1/22), whereas that detected in MIA was 42.9% (L858R,4/21; exon 19 deletion,5/21). Mutations detected by immunohistochemical analysis included 22.7% (L858R, 4/22; exon 19 deletion, 1/22) in AIS and 42.9% (L858R, 4/21; exon 19 deletion, 5/21) in MIA. Although some results were contradictory, concordant results were obtained using both assays in 38 of 43 cases (88.4%). DNA and immunohistochemical analyses revealed similar EGFR mutation rates in both MIA and AIS, suggesting that mutation-specific monoclonal antibodies are useful to confirm DNA assay results. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Sarika, H L; Papathoma, A; Garofalaki, M; Vasileiou, V; Vlassopoulou, B; Anastasiou, E; Alevizaki, M
2012-12-01
Genetic screening for ret mutation has become routine practice in the evaluation of medullary thyroid carcinoma (MTC). Approximately 25% of these tumours are familial, and they occur as components of the multiple endocrine neoplasia type 2 syndromes (MEN 2A and 2B) or familial MTC. In familial cases, the majority of mutations are found in exons 10, 11, 13, 14 or 15 of the ret gene. A rare mutation involving exon 8 (G533C) has recently been reported in familial cases of MTC in Brazil and Greece; some of these cases were originally thought to be sporadic. The aim of this study was to re-evaluate a series of sporadic cases of MTC, with negative family history, and screen them for germline mutations in exon 8. Genomic DNA was extracted from peripheral lymphocytes in 129 unrelated individuals who had previously been characterized as 'sporadic' based on the negative family history and negative screening for ret gene mutations. Samples were analysed in Applied Biosystems 7500 real-time PCR and confirmed by sequencing. The G533C exon 8 mutation was identified in 10 of 129 patients with sporadic MTC. Asymptomatic gene carriers were subsequently identified in other family members. In our study, we found that 7·75% patients with apparently sporadic MTC do carry G533C mutation involving exon 8 of ret. We feel that there is now a need to include exon 8 mutation screening in all patients diagnosed as sporadic MTC, in Greece. © 2012 Blackwell Publishing Ltd.
Wang, Chun-Chi; Jong, Yuh-Jyh; Chang, Jan-Gowth; Chen, Yen-Ling; Wu, Shou-Mei
2010-07-01
We have developed a capillary electrophoresis (CE) method with universal fluorescent multiplex PCR to simultaneously detect the SMN1 and SMN2 genes in exons 7 and 8. Spinal muscular atrophy (SMA) is a very frequent inherited disease caused by the absence of the SMN1 gene in approximately 94% of patients. Those patients have deletion of the SMN1 gene or gene conversion between SMN1 and SMN2. However, most methods only focus on the analysis of whole gene deletion, and ignore gene conversion. Simultaneous quantification of SMN1 and SMN2 in exons 7 and 8 is a good strategy for estimating SMN1 deletion or SMN1 to SMN2 gene conversion. This study established a CE separation allowing differentiation of all copy ratios of SMN1 to SMN2 in exons 7 and 8. Among 212 detected individuals, there were 23 SMA patients, 45 carriers, and 144 normal subjects. Three individuals had different ratios of SMN1 to SMN2 in two exons, including an SMA patient having two SMN2 copies in exon 7 but one SMN1 copy in exon 8. This method could provide more information about SMN1 deletion or SMN1 to SMN2 gene conversion for SMA genotyping and diagnosis.
Homozygous and hemizygous CNV detection from exome sequencing data in a Mendelian disease cohort
Gambin, Tomasz; Akdemir, Zeynep C.; Yuan, Bo; Gu, Shen; Chiang, Theodore; Carvalho, Claudia M.B.; Shaw, Chad; Jhangiani, Shalini; Boone, Philip M.; Eldomery, Mohammad K.; Karaca, Ender; Bayram, Yavuz; Stray-Pedersen, Asbjørg; Muzny, Donna; Charng, Wu-Lin; Bahrambeigi, Vahid; Belmont, John W.; Boerwinkle, Eric; Beaudet, Arthur L.; Gibbs, Richard A.
2017-01-01
Abstract We developed an algorithm, HMZDelFinder, that uses whole exome sequencing (WES) data to identify rare and intragenic homozygous and hemizygous (HMZ) deletions that may represent complete loss-of-function of the indicated gene. HMZDelFinder was applied to 4866 samples in the Baylor–Hopkins Center for Mendelian Genomics (BHCMG) cohort and detected 773 HMZ deletion calls (567 homozygous or 206 hemizygous) with an estimated sensitivity of 86.5% (82% for single-exonic and 88% for multi-exonic calls) and precision of 78% (53% single-exonic and 96% for multi-exonic calls). Out of 773 HMZDelFinder-detected deletion calls, 82 were subjected to array comparative genomic hybridization (aCGH) and/or breakpoint PCR and 64 were confirmed. These include 18 single-exon deletions out of which 8 were exclusively detected by HMZDelFinder and not by any of seven other CNV detection tools examined. Further investigation of the 64 validated deletion calls revealed at least 15 pathogenic HMZ deletions. Of those, 7 accounted for 17–50% of pathogenic CNVs in different disease cohorts where 7.1–11% of the molecular diagnosis solved rate was attributed to CNVs. In summary, we present an algorithm to detect rare, intragenic, single-exon deletion CNVs using WES data; this tool can be useful for disease gene discovery efforts and clinical WES analyses. PMID:27980096
DOE Office of Scientific and Technical Information (OSTI.GOV)
Asai, Hirohide; Hirano, Makito; Kiriyama, Takao
Intranuclear events due to mutations in the Parkin gene remain elusive in autosomal recessive juvenile parkinsonism (ARJP). We identified a mutant PARKIN protein in fibroblast cultures from a pair of siblings with ARJP who were homozygous for the exon 4-deleted Parkin gene. Disease was mild in one patient and debilitating in the other. The detected mutant, encoded by a transcript lacking exon 3 as well as exon 4, is an in-frame deletion that removes 121 aa, resulting in a 344-aa protein (PaDel3,4). Cell culture and transfection studies revealed negative correlations between expression levels of PaDel3,4 and those of cell cyclemore » proteins, including cyclin E, CDK2, ppRb, and E2F-1, and demonstrated that GFP-PaDel3,4 entered nucleus and ubiquitinated cyclin E as a part of SCF{sup hSel-10} ligase complex in the patient cells. In addition, nuclear localization signal-tagged PaDel3,4 expressed in the transfected patient cells most effectively ubiquitinated cyclin E and reduced DNA damage, protecting cells from oxidative stress. Antisense-oligonucleotide treatment promoted skipping of exon 3 and thus generated PaDel3,4, increasing cell survival. Collectively, we propose that naturally- and experimentally-induced exon skipping at least partly restores the mutant Parkin gene deficit, providing a molecular basis for the development of therapeutic exon skipping.« less
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2017-04-01
Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .
Medical Sequencing at the extremes of Human Body Mass
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ahituv, Nadav; Kavaslar, Nihan; Schackwitz, Wendy
2006-09-01
Body weight is a quantitative trait with significantheritability in humans. To identify potential genetic contributors tothis phenotype, we resequenced the coding exons and splice junctions of58 genes in 379 obese and 378 lean individuals. Our 96Mb survey included21 genes associated with monogenic forms of obesity in humans or mice, aswell as 37 genes that function in body weight-related pathways. We foundthat the monogenic obesity-associated gene group was enriched for rarenonsynonymous variants unique to the obese (n=46) versus lean (n=26)populations. Computational analysis further predicted a significantlygreater fraction of deleterious variants within the obese cohort.Consistent with the complex inheritance of body weight,more » we did notobserve obvious familial segregation in the majority of the 28 availablekindreds. Taken together, these data suggest that multiple rare alleleswith variable penetrance contribute to obesity in the population andprovide a deep medical sequencing based approach to detectthem.« less
Decelerated genome evolution in modern vertebrates revealed by analysis of multiple lancelet genomes
Huang, Shengfeng; Chen, Zelin; Yan, Xinyu; Yu, Ting; Huang, Guangrui; Yan, Qingyu; Pontarotti, Pierre Antoine; Zhao, Hongchen; Li, Jie; Yang, Ping; Wang, Ruihua; Li, Rui; Tao, Xin; Deng, Ting; Wang, Yiquan; Li, Guang; Zhang, Qiujin; Zhou, Sisi; You, Leiming; Yuan, Shaochun; Fu, Yonggui; Wu, Fenfang; Dong, Meiling; Chen, Shangwu; Xu, Anlong
2014-01-01
Vertebrates diverged from other chordates ~500 Myr ago and experienced successful innovations and adaptations, but the genomic basis underlying vertebrate origins are not fully understood. Here we suggest, through comparison with multiple lancelet (amphioxus) genomes, that ancient vertebrates experienced high rates of protein evolution, genome rearrangement and domain shuffling and that these rates greatly slowed down after the divergence of jawed and jawless vertebrates. Compared with lancelets, modern vertebrates retain, at least relatively, less protein diversity, fewer nucleotide polymorphisms, domain combinations and conserved non-coding elements (CNE). Modern vertebrates also lost substantial transposable element (TE) diversity, whereas lancelets preserve high TE diversity that includes even the long-sought RAG transposon. Lancelets also exhibit rapid gene turnover, pervasive transcription, fastest exon shuffling in metazoans and substantial TE methylation not observed in other invertebrates. These new lancelet genome sequences provide new insights into the chordate ancestral state and the vertebrate evolution. PMID:25523484
Huang, Shengfeng; Chen, Zelin; Yan, Xinyu; Yu, Ting; Huang, Guangrui; Yan, Qingyu; Pontarotti, Pierre Antoine; Zhao, Hongchen; Li, Jie; Yang, Ping; Wang, Ruihua; Li, Rui; Tao, Xin; Deng, Ting; Wang, Yiquan; Li, Guang; Zhang, Qiujin; Zhou, Sisi; You, Leiming; Yuan, Shaochun; Fu, Yonggui; Wu, Fenfang; Dong, Meiling; Chen, Shangwu; Xu, Anlong
2014-12-19
Vertebrates diverged from other chordates ~500 Myr ago and experienced successful innovations and adaptations, but the genomic basis underlying vertebrate origins are not fully understood. Here we suggest, through comparison with multiple lancelet (amphioxus) genomes, that ancient vertebrates experienced high rates of protein evolution, genome rearrangement and domain shuffling and that these rates greatly slowed down after the divergence of jawed and jawless vertebrates. Compared with lancelets, modern vertebrates retain, at least relatively, less protein diversity, fewer nucleotide polymorphisms, domain combinations and conserved non-coding elements (CNE). Modern vertebrates also lost substantial transposable element (TE) diversity, whereas lancelets preserve high TE diversity that includes even the long-sought RAG transposon. Lancelets also exhibit rapid gene turnover, pervasive transcription, fastest exon shuffling in metazoans and substantial TE methylation not observed in other invertebrates. These new lancelet genome sequences provide new insights into the chordate ancestral state and the vertebrate evolution.
Kris, M. G.; Camidge, D. R.; Giaccone, G.; Hida, T.; Li, B. T.; O'Connell, J.; Taylor, I.; Zhang, H.; Arcila, M. E.; Goldberg, Z.; Jänne, P. A.
2015-01-01
Background HER2 mutations and amplifications have been identified as oncogenic drivers in lung cancers. Dacomitinib, an irreversible inhibitor of HER2, EGFR (HER1), and HER4 tyrosine kinases, has demonstrated activity in cell-line models with HER2 exon 20 insertions or amplifications. Here, we studied dacomitinib in patients with HER2-mutant or amplified lung cancers. Patients and methods As a prespecified cohort of a phase II study, we included patients with stage IIIB/IV lung cancers with HER2 mutations or amplification. We gave oral dacomitinib at 30–45 mg daily in 28-day cycles. End points included partial response rate, overall survival, and toxicity. Results We enrolled 30 patients with HER2-mutant (n = 26, all in exon 20 including 25 insertions and 1 missense mutation) or HER2-amplified lung cancers (n = 4). Three of 26 patients with tumors harboring HER2 exon 20 mutations [12%; 95% confidence interval (CI) 2% to 30%] had partial responses lasting 3+, 11, and 14 months. No partial responses occurred in four patients with tumors with HER2 amplifications. The median overall survival was 9 months from the start of dacomitinib (95% CI 7–21 months) for patients with HER2 mutations and ranged from 5 to 22 months with amplifications. Treatment-related toxicities included diarrhea (90%; grade 3/4: 20%/3%), dermatitis (73%; grade 3/4: 3%/0%), and fatigue (57%; grade 3/4: 3%/0%). One patient died on study likely due to an interaction of dacomitinib with mirtazapine. Conclusions Dacomitinib produced objective responses in patients with lung cancers with specific HER2 exon 20 insertions. This observation validates HER2 exon 20 insertions as actionable targets and justifies further study of HER2-targeted agents in specific HER2-driven lung cancers. ClinicalTrials.gov NCT00818441. PMID:25899785
Global characterization of copy number variants in epilepsy patients from whole genome sequencing
Meloche, Caroline; Andrade, Danielle M.; Lafreniere, Ron G.; Gravel, Micheline; Spiegelman, Dan; Dionne-Laporte, Alexandre; Boelman, Cyrus; Hamdan, Fadi F.; Michaud, Jacques L.; Rouleau, Guy; Minassian, Berge A.; Bourque, Guillaume; Cossette, Patrick
2018-01-01
Epilepsy will affect nearly 3% of people at some point during their lifetime. Previous copy number variants (CNVs) studies of epilepsy have used array-based technology and were restricted to the detection of large or exonic events. In contrast, whole-genome sequencing (WGS) has the potential to more comprehensively profile CNVs but existing analytic methods suffer from limited accuracy. We show that this is in part due to the non-uniformity of read coverage, even after intra-sample normalization. To improve on this, we developed PopSV, an algorithm that uses multiple samples to control for technical variation and enables the robust detection of CNVs. Using WGS and PopSV, we performed a comprehensive characterization of CNVs in 198 individuals affected with epilepsy and 301 controls. For both large and small variants, we found an enrichment of rare exonic events in epilepsy patients, especially in genes with predicted loss-of-function intolerance. Notably, this genome-wide survey also revealed an enrichment of rare non-coding CNVs near previously known epilepsy genes. This enrichment was strongest for non-coding CNVs located within 100 Kbp of an epilepsy gene and in regions associated with changes in the gene expression, such as expression QTLs or DNase I hypersensitive sites. Finally, we report on 21 potentially damaging events that could be associated with known or new candidate epilepsy genes. Our results suggest that comprehensive sequence-based profiling of CNVs could help explain a larger fraction of epilepsy cases. PMID:29649218
Population genetic implications from sequence variation in four Y chromosome genes.
Shen, P; Wang, F; Underhill, P A; Franco, C; Yang, W H; Roxas, A; Sung, R; Lin, A A; Hyman, R W; Vollrath, D; Davis, R W; Cavalli-Sforza, L L; Oefner, P J
2000-06-20
Some insight into human evolution has been gained from the sequencing of four Y chromosome genes. Primary genomic sequencing determined gene SMCY to be composed of 27 exons that comprise 4,620 bp of coding sequence. The unfinished sequencing of the 5' portion of gene UTY1 was completed by primer walking, and a total of 20 exons were found. By using denaturing HPLC, these two genes, as well as DBY and DFFRY, were screened for polymorphic sites in 53-72 representatives of the five continents. A total of 98 variants were found, yielding nucleotide diversity estimates of 2.45 x 10(-5), 5. 07 x 10(-5), and 8.54 x 10(-5) for the coding regions of SMCY, DFFRY, and UTY1, respectively, with no variant having been observed in DBY. In agreement with most autosomal genes, diversity estimates for the noncoding regions were about 2- to 3-fold higher and ranged from 9. 16 x 10(-5) to 14.2 x 10(-5) for the four genes. Analysis of the frequencies of derived alleles for all four genes showed that they more closely fit the expectation of a Luria-Delbrück distribution than a distribution expected under a constant population size model, providing evidence for exponential population growth. Pairwise nucleotide mismatch distributions date the occurrence of population expansion to approximately 28,000 years ago. This estimate is in accord with the spread of Aurignacian technology and the disappearance of the Neanderthals.
Structure of the coding region and mRNA variants of the apyrase gene from pea (Pisum sativum)
NASA Technical Reports Server (NTRS)
Shibata, K.; Abe, S.; Davies, E.
2001-01-01
Partial amino acid sequences of a 49 kDa apyrase (ATP diphosphohydrolase, EC 3.6.1.5) from the cytoskeletal fraction of etiolated pea stems were used to derive oligonucleotide DNA primers to generate a cDNA fragment of pea apyrase mRNA by RT-PCR and these primers were used to screen a pea stem cDNA library. Two almost identical cDNAs differing in just 6 nucleotides within the coding regions were found, and these cDNA sequences were used to clone genomic fragments by PCR. Two nearly identical gene fragments containing 8 exons and 7 introns were obtained. One of them (H-type) encoded the mRNA sequence described by Hsieh et al. (1996) (DDBJ/EMBL/GenBank Z32743), while the other (S-type) differed by the same 6 nucleotides as the mRNAs, suggesting that these genes may be alleles. The six nucleotide differences between these two alleles were found solely in the first exon, and these mutation sites had two types of consensus sequences. These mRNAs were found with varying lengths of 3' untranslated regions (3'-UTR). There are some similarities between the 3'-UTR of these mRNAs and those of actin and actin binding proteins in plants. The putative roles of the 3'-UTR and alternative polyadenylation sites are discussed in relation to their possible role in targeting the mRNAs to different subcellular compartments.
Mutations in the Promoter Region of the Aldolase B Gene that cause Hereditary Fructose Intolerance
Coffee, Erin M.; Tolan, Dean R.
2010-01-01
SUMMARY Hereditary fructose intolerance (HFI) is a potentially fatal inherited metabolic disease caused by a deficiency of aldolase B activity in the liver and kidney. Over 40 disease-causing mutations are known in the protein-coding region of ALDOB. Mutations upstream of the protein-coding portion of ALDOB are reported here for the first time. DNA sequence analysis of 61 HFI patients revealed single base mutations in the promoter, intronic enhancer, and the first exon, which is entirely untranslated. One mutation, g.–132G>A, is located within the promoter at an evolutionarily conserved nucleotide within a transcription factor-binding site. A second mutation, IVS1+1G>C, is at the donor splice site of the first exon. In vitro electrophoretic mobility shift assays show a decrease in nuclear extract-protein binding at the g.–132G>A mutant site. The promoter mutation results in decreased transcription using luciferase reporter plasmids. Analysis of cDNA from cells transfected with plasmids harboring the IVS1+1G>C mutation results in aberrant splicing leading to complete retention of the first intron (~ 5 kb). The IVS1+1G>C splicing mutation results in loss of luciferase activity from a reporter plasmid. These novel mutations in ALDOB represent 2% of alleles in American HFI patients, with IVS1+1G>C representing a significantly higher allele frequency (6%) among HFI patients of Hispanic and African-American ethnicity. PMID:20882353
René, P; Lenne, F; Ventura, M A; Bertagna, X; de Keyzer, Y
2000-01-04
In the pituitary, vasopressin triggers ACTH release through a specific receptor subtype, termed V3 or V1b. We cloned the V3 cDNA and showed that its expression was almost exclusive to pituitary corticotrophs and some corticotroph tumors. To study the determinants of this tissue specificity, we have now cloned the gene for the human (h) V3 receptor and characterized its structure. It is composed of two exons, spanning 10kb, with the coding region interrupted between transmembrane domains 6 and 7. We established that the transcription initiation site is located 498 nucleotides upstream of the initiator codon and showed that two polyadenylation sites may be used, while the most frequent is the most downstream. Sequence analysis of the promoter region showed no TATA box but identified consensus binding motifs for Sp1, CREB, and half sites of the estrogen receptor binding site. However comparison with another corticotroph-specific gene, proopiomelanocortin, did not identify common regulatory elements in the two promoters except for a short GC-rich region. Unexpectedly, hV3 gene analysis revealed that a formerly cloned 'artifactual' hV3 cDNA indeed corresponded to a spliced antisense transcript, overlapping the 5' part of the coding sequence in exon 1 and the promoter region. This transcript, hV3rev, was detected in normal pituitary and in many corticotroph tumors expressing hV3 sense mRNA and may therefore play a role in hV3 gene expression.
Isolated familial somatotropinomas: clinical features and analysis of the MEN1 gene.
De Menis, Ernesto; Prezant, Toni R
2002-01-01
Isolated familial somatotropinomas (IFS) rarely occurs in the absence of multiple endocrine neoplasia type I (MEN1) or the Carney complex. In the present study we report two Italian siblings affected by GH-secreting adenomas. There was no history of parental consanguinity. The sister presented at 18 years of age with secondary amenorrhea and acromegalic features and one of her two brothers presented with gigantism at the same age. Endocrinological investigations confirmed GH hypersecretion in both cases. Although a pituitary microadenoma was detected in both patients, transsphenoidal surgery was not successful. The sister received conventional radiotherapy and acromegaly is now considered controlled; the brother is being treated with octreotide LAR 30 mg monthly and the disease is considered clinically active. Patients, their parents and the unaffected brother underwent extensive evaluation, and no features of MEN1 or Carney complex were found. Analysis of polymorphic microsatellite markers from chromosome 11q13 (D11S599, D11S4945, D11S4939, D11S4938 and D11S987) showed that the acromegalic siblings had inherited different maternal chromosomes and shared the paternal chromosome. No pathogenic MEN1 sequence changes were detected by sequencing or dideoxy fingerprinting of the coding sequence (exons 2-10) and exon/intron junctions. Although mutations in the promoter, introns or untranslated regions of the MEN1 gene cannot be excluded, germline mutations within the coding region of this gene do not appear responsible for IFS in this family.
NASA Astrophysics Data System (ADS)
Liu, Meng; Liu, Yuan; Hui, Min; Song, Chengwen; Cui, Zhaoxia
2017-03-01
Clip domain serine proteases (cSPs) and their homologs (SPHs) play an important role in various biological processes that are essential components of extracellular signaling cascades, especially in the innate immune responses of invertebrates. Here, polymorphisms of PtcSP and PtSPH from the swimming crab Portunus trituberculatus were investigated to explore their association with resistance/susceptibility to Vibrio alginolyticus. Polymorphic loci were identified using Clustal X, and characterized with SPSS 16.0 software, and then the significance of genotype and allele frequencies between resistant and susceptible stocks was determined by a χ 2 test. A total of 109 and 77 single nucleotide polymorphisms (SNPs) were identified in the genomic fragments of PtcSP and PtSPH, respectively. Notably, nearly half of PtSPH polymorphisms were found in the non-coding exon 1. Fourteen SNPs investigated were significantly associated with susceptibility/resistance to V. alginolyticus ( P <0.05). Among them, eight SNPs were observed in introns, and one synonymous, four non-synonymous SNPs and one ins-del were found in coding exons. In addition, five simple sequence repeats (SSRs) were detected in intron 3 of PtcSP. Although there was no statistically significant difference of allele frequencies, the SSRs showed different polymorphic alleles on the basis of the repeat number between resistant and susceptible stocks. After further validation, polymorphisms investigated here might be applied to select potential molecular markers of P. trituberculatus with resistance to V. alginolyticus.
Montandon, P E; Vasserot, A; Stutz, E
1986-01-01
We retrieved a 1.6 kbp intron separating two exons of the psb C gene which codes for the 44 kDa reaction center protein of photosystem II. This intron is 3 to 4 times the size of all previously sequenced Euglena gracilis chloroplast introns. It contains an open reading frame of 458 codons potentially coding for a basic protein of 54 kDa of yet unknown function. The intron boundaries follow consensus sequences established for chloroplast introns related to class II and nuclear pre-mRNA introns. Its 3'-terminal segment has structural features similar to class II mitochondrial introns with an invariant base A as possible branch point for lariat formation.
Novel numerical and graphical representation of DNA sequences and proteins.
Randić, M; Novic, M; Vikić-Topić, D; Plavsić, D
2006-12-01
We have introduced novel numerical and graphical representations of DNA, which offer a simple and unique characterization of DNA sequences. The numerical representation of a DNA sequence is given as a sequence of real numbers derived from a unique graphical representation of the standard genetic code. There is no loss of information on the primary structure of a DNA sequence associated with this numerical representation. The novel representations are illustrated with the coding sequences of the first exon of beta-globin gene of half a dozen species in addition to human. The method can be extended to proteins as is exemplified by humanin, a 24-aa peptide that has recently been identified as a specific inhibitor of neuronal cell death induced by familial Alzheimer's disease mutant genes.
Mo, Fan; Hong, Xu; Gao, Feng; Du, Lin; Wang, Jun; Omenn, Gilbert S; Lin, Biaoyang
2008-12-16
Alternative splicing is an important gene regulation mechanism. It is estimated that about 74% of multi-exon human genes have alternative splicing. High throughput tandem (MS/MS) mass spectrometry provides valuable information for rapidly identifying potentially novel alternatively-spliced protein products from experimental datasets. However, the ability to identify alternative splicing events through tandem mass spectrometry depends on the database against which the spectra are searched. We wrote scripts in perl, Bioperl, mysql and Ensembl API and built a theoretical exon-exon junction protein database to account for all possible combinations of exons for a gene while keeping the frame of translation (i.e., keeping only in-phase exon-exon combinations) from the Ensembl Core Database. Using our liver cancer MS/MS dataset, we identified a total of 488 non-redundant peptides that represent putative exon skipping events. Our exon-exon junction database provides the scientific community with an efficient means to identify novel alternatively spliced (exon skipping) protein isoforms using mass spectrometry data. This database will be useful in annotating genome structures using rapidly accumulating proteomics data.
Huang, J M; Wang, Z Y; Ju, Z H; Wang, C F; Li, Q L; Sun, T; Hou, Q L; Hang, S Q; Hou, M H; Zhong, J F
2011-12-21
Bovine lactoferrin (bLF) is a member of the transferrin family; it plays an important role in the innate immune response. We identified novel splice variants of the bLF gene in mastitis-infected and healthy cows. Reverse transcription-polymerase chain reaction (RT-PCR) and clone sequencing analysis were used to screen the splice variants of the bLF gene in the mammary gland, spleen and liver tissues. One main transcript corresponding to the bLF reference sequence was found in three tissues in both healthy and mastitis-infected cows. Quantitative real-time PCR analysis showed that the expression levels of the LF gene's main transcript were not significantly different in tissues from healthy versus mastitis-infected cows. However, the new splice variant, LF-AS2, which has the exon-skipping alternative splicing pattern, was only identified in mammary glands infected with Staphylococcus aureus. Sequencing analysis showed that the new splice variant was 251 bp in length, including exon 1, part of exon 2, part of exon 16, and exon 17. We conclude that bLF may play a role in resistance to mastitis through alternative splicing mechanisms.
A novel deletion of SNURF/SNRPN exon 1 in a patient with Prader-Willi-like phenotype.
Cao, Yang; AlHumaidi, Susan S; Faqeih, Eissa A; Pitel, Beth A; Lundquist, Patrick; Aypar, Umut
2017-08-01
Here we report the smallest deletion involving SNURF/SNRPN that causes major symptoms of Prader-Willi syndrome (PWS), including hypotonia, dysmorphic features, intellectual disability, and obesity. A female patient with the aforementioned and additional features was referred to the Mayo Clinic Cytogenetics laboratory for genetic testing. Chromosomal microarray analysis and subsequent Sanger sequencing identified a de novo 6.4 kb deletion at 15q11.2, containing exon 1 of the SNURF gene and exon 1 of the shortest isoform of the SNRPN gene. SNURF/SNRPN exon 1, which is methylated on the silent maternal allele, is associated with acetylated histones on the expressed paternal allele. This region also overlaps with the PWS-imprinting center (IC). Subsequent molecular methylation analysis was performed using methylation-specific MLPA (MS-MLPA), which characterized that the deletion of SNURF/SNRPN exon 1 was paternal in origin, consistent with the PWS-like phenotype. Since SNURF/SNRPN gene and the PWS-IC are known to regulate snoRNAs, it is likely that the PWS-like phenotype observed in patients with paternal SNURF/SNRPN deletion is due to the disrupted expression of SNORD116 snoRNAs. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Genome-wide association between DNA methylation and alternative splicing in an invertebrate
2012-01-01
Background Gene bodies are the most evolutionarily conserved targets of DNA methylation in eukaryotes. However, the regulatory functions of gene body DNA methylation remain largely unknown. DNA methylation in insects appears to be primarily confined to exons. Two recent studies in Apis mellifera (honeybee) and Nasonia vitripennis (jewel wasp) analyzed transcription and DNA methylation data for one gene in each species to demonstrate that exon-specific DNA methylation may be associated with alternative splicing events. In this study we investigated the relationship between DNA methylation, alternative splicing, and cross-species gene conservation on a genome-wide scale using genome-wide transcription and DNA methylation data. Results We generated RNA deep sequencing data (RNA-seq) to measure genome-wide mRNA expression at the exon- and gene-level. We produced a de novo transcriptome from this RNA-seq data and computationally predicted splice variants for the honeybee genome. We found that exons that are included in transcription are higher methylated than exons that are skipped during transcription. We detected enrichment for alternative splicing among methylated genes compared to unmethylated genes using fisher’s exact test. We performed a statistical analysis to reveal that the presence of DNA methylation or alternative splicing are both factors associated with a longer gene length and a greater number of exons in genes. In concordance with this observation, a conservation analysis using BLAST revealed that each of these factors is also associated with higher cross-species gene conservation. Conclusions This study constitutes the first genome-wide analysis exhibiting a positive relationship between exon-level DNA methylation and mRNA expression in the honeybee. Our finding that methylated genes are enriched for alternative splicing suggests that, in invertebrates, exon-level DNA methylation may play a role in the construction of splice variants by positively influencing exon inclusion during transcription. The results from our cross-species homology analysis suggest that DNA methylation and alternative splicing are genetic mechanisms whose utilization could contribute to a longer gene length and a slower rate of gene evolution. PMID:22978521
Exclusion of alternative exon 33 of CaV1.2 calcium channels in heart is proarrhythmogenic
Li, Guang; Wang, Juejin; Liao, Ping; Bartels, Peter; Zhang, Hengyu; Yu, Dejie; Liang, Mui Cheng; Poh, Kian Keong; Yu, Chye Yun; Jiang, Fengli; Yong, Tan Fong; Wong, Yuk Peng; Hu, Zhenyu; Huang, Hua; Zhang, Guangqin; Galupo, Mary Joyce; Bian, Jin-Song; Ponniah, Sathivel; Trasti, Scott Lee; Foo, Roger; Hoppe, Uta C.; Herzig, Stefan; Soong, Tuck Wah
2017-01-01
Alternative splicing changes the CaV1.2 calcium channel electrophysiological property, but the in vivo significance of such altered channel function is lacking. Structure–function studies of heterologously expressed CaV1.2 channels could not recapitulate channel function in the native milieu of the cardiomyocyte. To address this gap in knowledge, we investigated the role of alternative exon 33 of the CaV1.2 calcium channel in heart function. Exclusion of exon 33 in CaV1.2 channels has been reported to shift the activation potential −10.4 mV to the hyperpolarized direction, and increased expression of CaV1.2Δ33 channels was observed in rat myocardial infarcted hearts. However, how a change in CaV1.2 channel electrophysiological property, due to alternative splicing, might affect cardiac function in vivo is unknown. To address these questions, we generated mCacna1c exon 33−/−-null mice. These mice contained CaV1.2Δ33 channels with a gain-of-function that included conduction of larger currents that reflects a shift in voltage dependence and a modest increase in single-channel open probability. This altered channel property underscored the development of ventricular arrhythmia, which is reflected in significantly more deaths of exon 33−/− mice from β-adrenergic stimulation. In vivo telemetric recordings also confirmed increased frequencies in premature ventricular contractions, tachycardia, and lengthened QT interval. Taken together, the significant decrease or absence of exon 33-containing CaV1.2 channels is potentially proarrhythmic in the heart. Of clinical relevance, human ischemic and dilated cardiomyopathy hearts showed increased inclusion of exon 33. However, the possible role that inclusion of exon 33 in CaV1.2 channels may play in the pathogenesis of human heart failure remains unclear. PMID:28490495
Gaucher disease: molecular heterogeneity and phenotype-genotype correlations.
Theophilus, B; Latham, T; Grabowski, G A; Smith, F I
1989-08-01
Gaucher disease (GD) is the most prevalent lysosomal storage disease. This autosomal recessive trait results from the defective activity of acid beta-glucosidase (beta-Glc). Four different exonic point mutations have been identified as causal alleles for GD. To facilitate screening for these alleles, assays were developed using allele-specific oligonucleotide hybridization to amplified genomic DNA sequences. Specifically, intron bases flanking exons 5, 9, and 10 were determined, and conditions for PCR amplification of these exons were obtained. Two different procedures were developed to distinguish signals obtained from the structural beta-Glc gene exons and those from the pseudogene. These procedures were used to determine the distribution of all known GD alleles in a population of 44 affected patients of varying phenotypes and ethnicity. The high frequency of one of the exon 9 mutations in Ashkenazi Jewish GD type 1 patients was confirmed, and, in addition, this mutation was present in ethnically diverse non-Jewish type 1 GD patients. Homozygotes (N = 5) for this allele were midly affected older individuals, and this mutant allele was not found in any patient with neuronopathic disease. The exon 10 mutation was confirmed as the predominant allele in types 2 and 3 GD. However, several type 1 GD patients, including one of Ashkenazi-Jewish heritage, also were heterozygous for this allele. The presence of this allele in type 1 patients did not correlate with the severity of clinical symptoms. The second exon 9 mutation and the exon 5 mutation were rare, since they occurred only heterozygously either in one type 2 GD patient or in two related Ashkenazi-Jewish GD patients, respectively. Although most GD patients (38 of 44) had at least one of the known mutant alleles, 57% were heterozygotes for only one of these mutations. Fourteen percent of patients were negative for all mutations. A total of 73% of GD patients had at least one unknown allele. The varying clinical phenotypes and ethnic origins of these incompletely characterized patients suggest that multiple other GD alleles exist.