Meher, J K; Meher, P K; Dash, G N; Raval, M K
2012-01-01
The first step in gene identification problem based on genomic signal processing is to convert character strings into numerical sequences. These numerical sequences are then analysed spectrally or using digital filtering techniques for the period-3 peaks, which are present in exons (coding areas) and absent in introns (non-coding areas). In this paper, we have shown that single-indicator sequences can be generated by encoding schemes based on physico-chemical properties. Two new methods are proposed for generating single-indicator sequences based on hydration energy and dipole moments. The proposed methods produce high peak at exon locations and effectively suppress false exons (intron regions having greater peak than exon regions) resulting in high discriminating factor, sensitivity and specificity.
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene.
Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil
2007-11-29
Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains.
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene
Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil
2007-01-01
Background Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Results Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. Conclusion The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains. PMID:18047649
A mechanism for exon skipping caused by nonsense or missense mutations in BRCA1 and other genes.
Liu, H X; Cartegni, L; Zhang, M Q; Krainer, A R
2001-01-01
Point mutations can generate defective and sometimes harmful proteins. The nonsense-mediated mRNA decay (NMD) pathway minimizes the potential damage caused by nonsense mutations. In-frame nonsense codons located at a minimum distance upstream of the last exon-exon junction are recognized as premature termination codons (PTCs), targeting the mRNA for degradation. Some nonsense mutations cause skipping of one or more exons, presumably during pre-mRNA splicing in the nucleus; this phenomenon is termed nonsense-mediated altered splicing (NAS), and its underlying mechanism is unclear. By analyzing NAS in BRCA1, we show here that inappropriate exon skipping can be reproduced in vitro, and results from disruption of a splicing enhancer in the coding sequence. Enhancers can be disrupted by single nonsense, missense and translationally silent point mutations, without recognition of an open reading frame as such. These results argue against a nuclear reading-frame scanning mechanism for NAS. Coding-region single-nucleotide polymorphisms (cSNPs) within exonic splicing enhancers or silencers may affect the patterns or efficiency of mRNA splicing, which may in turn cause phenotypic variability and variable penetrance of mutations elsewhere in a gene.
Duellman, Tyler; Warren, Christopher; Yang, Jay
2014-01-01
Microribonucleic acids (miRNAs) work with exquisite specificity and are able to distinguish a target from a non-target based on a single nucleotide mismatch in the core nucleotide domain. We questioned whether miRNA regulation of gene expression could occur in a single nucleotide polymorphism (SNP)-specific manner, manifesting as a post-transcriptional control of expression of genetic polymorphisms. In our recent study of the functional consequences of matrix metalloproteinase (MMP)-9 SNPs, we discovered that expression of a coding exon SNP in the pro-domain of the protein resulted in a profound decrease in the secreted protein. This missense SNP results in the N38S amino acid change and a loss of an N-glycosylation site. A systematic study demonstrated that the loss of secreted protein was due not to the loss of an N-glycosylation site, but rather an SNP-specific targeting by miR-671-3p and miR-657. Bioinformatics analysis identified 41 SNP-specific miRNA targeting MMP-9 SNPs, mostly in the coding exon and an extension of the analysis to chromosome 20, where the MMP-9 gene is located, suggesting that SNP-specific miRNAs targeting the coding exon are prevalent. This selective post-transcriptional regulation of a target messenger RNA harboring genetic polymorphisms by miRNAs offers an SNP-dependent post-transcriptional regulatory mechanism, allowing for polymorphic-specific differential gene regulation. PMID:24627221
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pham-Dinh, D.; Gaspera, D.B.; Dautigny, A.
1995-09-20
Myelin/oligodendrocyte glycoprotein (MOG), a special component of the central nervous system localization on the outermost lamellae of mature myelin, is a member of the immunoglobulin superfamily. We report here the organization of the human MOG gene, which spans approximately 17 kb, and the characterization of six MOG mRNA splicing variants. The intron/exon structure of the human MOG gene confirmed the splicing pattern, supporting the hypothesis that mRNA isoforms could arise by alternative splicing of a single gene. In addition to the eight exons coding for the major MOG isoform, the human MOG gene also contains 3` region, a previously unknownmore » alternatively spliced coding exon, VIA. Alternative utilization of two acceptor splicing sites for exon VIII could produce two different C-termini. The nucleotide sequences presented here may be a useful tool to study further possible involvement if the MOG gene in hereditary neurological disorders. 23 refs., 5 figs.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dyer, K.D.; Handen, J.S.; Rosenberg, H.F.
The Charcot-Leyden crystal (CLC) protein, or eosinophil lysophospholipase, is a characteristic protein of human eosinophils and basophils; recent work has demonstrated that the CLC protein is both structurally and functionally related to the galectin family of {beta}-galactoside binding proteins. The galectins as a group share a number of features in common, including a linear ligand binding site encoded on a single exon. In this work, we demonstrate that the intron-exon structure of the gene encoding CLC is analogous to those encoding the galectins. The coding sequence of the CLC gene is divided into four exons, with the entire {beta}-galactoside bindingmore » site encoded by exon III. We have isolated CLC {beta}-galactoside binding sites from both orangutan (Pongo pygmaeus) and murine (Mus musculus) genomic DNAs, both encoded on single exons, and noted conservation of the amino acids shown to interact directly with the {beta}-galactoside ligand. The most likely interpretation of these results suggests the occurrence of one or more exon duplication and insertion events, resulting in the distribution of this lectin domain to CLC as well as to the multiple galectin genes. 35 refs., 3 figs.« less
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2015-01-01
Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
ExoLocator--an online view into genetic makeup of vertebrate proteins.
Khoo, Aik Aun; Ogrizek-Tomas, Mario; Bulovic, Ana; Korpar, Matija; Gürler, Ece; Slijepcevic, Ivan; Šikic, Mile; Mihalek, Ivana
2014-01-01
ExoLocator (http://exolocator.eopsf.org) collects in a single place information needed for comparative analysis of protein-coding exons from vertebrate species. The main source of data--the genomic sequences, and the existing exon and homology annotation--is the ENSEMBL database of completed vertebrate genomes. To these, ExoLocator adds the search for ostensibly missing exons in orthologous protein pairs across species, using an extensive computational pipeline to narrow down the search region for the candidate exons and find a suitable template in the other species, as well as state-of-the-art implementations of pairwise alignment algorithms. The resulting complements of exons are organized in a way currently unique to ExoLocator: multiple sequence alignments, both on the nucleotide and on the peptide levels, clearly indicating the exon boundaries. The alignments can be inspected in the web-embedded viewer, downloaded or used on the spot to produce an estimate of conservation within orthologous sets, or functional divergence across paralogues.
Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior
2011-09-23
Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.
Eisenberger, Tobias; Neuhaus, Christine; Khan, Arif O.; Decker, Christian; Preising, Markus N.; Friedburg, Christoph; Bieg, Anika; Gliem, Martin; Issa, Peter Charbel; Holz, Frank G.; Baig, Shahid M.; Hellenbroich, Yorck; Galvez, Alberto; Platzer, Konrad; Wollnik, Bernd; Laddach, Nadja; Ghaffari, Saeed Reza; Rafati, Maryam; Botzenhart, Elke; Tinschert, Sigrid; Börger, Doris; Bohring, Axel; Schreml, Julia; Körtge-Jung, Stefani; Schell-Apacik, Chayim; Bakur, Khadijah; Al-Aama, Jumana Y.; Neuhann, Teresa; Herkenrath, Peter; Nürnberg, Gudrun; Nürnberg, Peter; Davis, John S.; Gal, Andreas; Bergmann, Carsten; Lorenz, Birgit; Bolz, Hanno J.
2013-01-01
Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover “hidden mutations” such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5′ exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5′-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even truncating mutations may be misleading. PMID:24265693
Eisenberger, Tobias; Neuhaus, Christine; Khan, Arif O; Decker, Christian; Preising, Markus N; Friedburg, Christoph; Bieg, Anika; Gliem, Martin; Charbel Issa, Peter; Holz, Frank G; Baig, Shahid M; Hellenbroich, Yorck; Galvez, Alberto; Platzer, Konrad; Wollnik, Bernd; Laddach, Nadja; Ghaffari, Saeed Reza; Rafati, Maryam; Botzenhart, Elke; Tinschert, Sigrid; Börger, Doris; Bohring, Axel; Schreml, Julia; Körtge-Jung, Stefani; Schell-Apacik, Chayim; Bakur, Khadijah; Al-Aama, Jumana Y; Neuhann, Teresa; Herkenrath, Peter; Nürnberg, Gudrun; Nürnberg, Peter; Davis, John S; Gal, Andreas; Bergmann, Carsten; Lorenz, Birgit; Bolz, Hanno J
2013-01-01
Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover "hidden mutations" such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5' exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5'-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even truncating mutations may be misleading.
Vouille, V; Amiche, M; Nicolas, P
1997-09-01
We cloned the genes of two members of the dermaseptin family, broad-spectrum antimicrobial peptides isolated from the skin of the arboreal frog Phyllomedusa bicolor. The dermaseptin gene Drg2 has a 2-exon coding structure interrupted by a small 137-bp intron, wherein exon 1 encoded a 22-residue hydrophobic signal peptide and the first three amino acids of the acidic propiece; exon 2 contained the 18 additional acidic residues of the propiece plus a typical prohormone processing signal Lys-Arg and a 32-residue dermaseptin progenitor sequence. The dermaseptin genes Drg2 and Drg1g2 have conserved sequences at both untranslated ends and in the first and second coding exons. In contrast, Drg1g2 comprises a third coding exon for a short version of the acidic propiece and a second dermaseptin progenitor sequence. Structural conservation between the two genes suggests that Drg1g2 arose recently from an ancestral Drg2-like gene through amplification of part of the second coding exon and 3'-untranslated region. Analysis of the cDNAs coding precursors for several frog skin peptides of highly different structures and activities demonstrates that the signal peptides and part of the acidic propieces are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The organization of the genes that belong to this family, with the signal peptide and the progenitor sequence on separate exons, permits strikingly different peptides to be directed into the secretory pathway. The recruitment of such a homologous 'secretory' exon by otherwise non-homologous genes may have been an early event in the evolution of amphibian.
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa
2015-01-01
Background Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. Results One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. Conclusions These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity. PMID:26693737
SinEx DB: a database for single exon coding sequences in mammalian genomes.
Jorquera, Roddy; Ortiz, Rodrigo; Ossandon, F; Cárdenas, Juan Pablo; Sepúlveda, Rene; González, Carolina; Holmes, David S
2016-01-01
Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as 'single exon genes' (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in its CDS. Whereas, many public databases of Eukaryotic multi-exon genes are available, there are only two specialized databases for SEGs. The present work addresses the need for a more extensive and diverse database by creating SinEx DB, a publicly available, searchable database of predicted SEGs from 10 completely sequenced mammalian genomes including human. SinEx DB houses the DNA and protein sequence information of these SEGs and includes their functional predictions (KOG) and the relative distribution of these functions within species. The information is stored in a relational database built with My SQL Server 5.1.33 and the complete dataset of SEG sequences and their functional predictions are available for downloading. SinEx DB can be interrogated by: (i) a browsable phylogenetic schema, (ii) carrying out BLAST searches to the in-house SinEx DB of SEGs and (iii) via an advanced search mode in which the database can be searched by key words and any combination of searches by species and predicted functions. SinEx DB provides a rich source of information for advancing our understanding of the evolution and function of SEGs.Database URL: www.sinex.cl. © The Author(s) 2016. Published by Oxford University Press.
SEQassembly: A Practical Tools Program for Coding Sequences Splicing
NASA Astrophysics Data System (ADS)
Lee, Hongbin; Yang, Hang; Fu, Lei; Qin, Long; Li, Huili; He, Feng; Wang, Bo; Wu, Xiaoming
CDS (Coding Sequences) is a portion of mRNA sequences, which are composed by a number of exon sequence segments. The construction of CDS sequence is important for profound genetic analysis such as genotyping. A program in MATLAB environment is presented, which can process batch of samples sequences into code segments under the guide of reference exon models, and splice these code segments of same sample source into CDS according to the exon order in queue file. This program is useful in transcriptional polymorphism detection and gene function study.
An improved and validated RNA HLA class I SBT approach for obtaining full length coding sequences.
Gerritsen, K E H; Olieslagers, T I; Groeneweg, M; Voorter, C E M; Tilanus, M G J
2014-11-01
The functional relevance of human leukocyte antigen (HLA) class I allele polymorphism beyond exons 2 and 3 is difficult to address because more than 70% of the HLA class I alleles are defined by exons 2 and 3 sequences only. For routine application on clinical samples we improved and validated the HLA sequence-based typing (SBT) approach based on RNA templates, using either a single locus-specific or two overlapping group-specific polymerase chain reaction (PCR) amplifications, with three forward and three reverse sequencing reactions for full length sequencing. Locus-specific HLA typing with RNA SBT of a reference panel, representing the major antigen groups, showed identical results compared to DNA SBT typing. Alleles encountered with unknown exons in the IMGT/HLA database and three samples, two with Null and one with a Low expressed allele, have been addressed by the group-specific RNA SBT approach to obtain full length coding sequences. This RNA SBT approach has proven its value in our routine full length definition of alleles. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Laitinen, Eeva-Maria; Tommiska, Johanna; Dunkel, Leo; Sankilampi, Ulla; Vaaralahti, Kirsi; Raivio, Taneli
2010-04-01
To describe a mother with idiopathic hypogonadotropic hypogonadism (IHH) and her monozygotic (MZ) twin boys who all have the same heterozygous fibroblast growth factor receptor-1 (FGFR1) gene mutation. Case report. University hospital. A 28-year-old mother with normosmic IHH gave birth to MZ twin boys after a transfer of a single frozen-thawed embryo. Clinical and biochemical evaluation of IHH. Sequence analysis of the 17 coding exons (exons 2-18) and exon-intron boundaries of FGFR1 from polymerase chain reaction-amplified genomic DNA from peripheral blood leukocytes of the subjects. Phenotypic features of the subjects. All subjects harbored a previously undescribed heterozygous FGFR1 mutation (c.2049-1 G-->C), leading to the skipping of exon 16 and thus a loss of amino acids 684-726 in the tyrosine kinase domain of the receptor. The absence of exon 16 was verified at the cDNA level. The twins manifested with microphallus, cryptorchidism, and deficient postnatal activation of the hypothalamic-pituitary-gonadal axis, findings consistent with IHH. Our report underlines that assisted reproductive techniques enable the inheritance of gene mutations causing infertility. This is the first report on the phenotypic features of MZ twins with an FGFR1 mutation. Copyright 2010 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene.
Levy-Lahad, E; Poorkaj, P; Wang, K; Fu, Y H; Oshima, J; Mulligan, J; Schellenberg, G D
1996-06-01
Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23,737 bp. The first 2 exons encode the 5'-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splice acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system.
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Levy-Lahad, E.; Wang, Kai; Fu, Ying Hui
1996-06-01
Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23, 737 bp. The first 2 exons encode the 5{prime}-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splicemore » acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system. 19 refs., 2 figs., 3 tabs.« less
Pianigiani, Giulia; Licastro, Danilo; Fortugno, Paola; Castiglia, Daniele; Petrovic, Ivana; Pagani, Franco
2018-06-12
MicroRNAs are found throughout the genome and are processed by the microprocessor complex (MPC) from longer precursors. Some precursor miRNAs overlap intron:exon junctions. These Splice site Overlapping microRNAs (SO-miRNAs) are mostly located in coding genes. It has been intimated, in the rarer examples of SO-miRNAs in non-coding RNAs, that the competition between the spliceosome and the MPC modulates alternative splicing. However, the effect of this overlap on coding transcripts is unknown. Unexpectedly, we show that neither Drosha silencing nor SF3b1 silencing changed the inclusion ratio of SO-miRNA exons. Two SO-miRNAs, located in genes that code for basal membrane proteins, are known to inhibit proliferation in primary keratinocytes. These SO-miRNAs were upregulated during differentiation and the host mRNAs were downregulated, but again there was no change in inclusion ratio of the SO-miRNA exons. Interestingly, Drosha silencing increased nascent RNA density, on chromatin, downstream of SO-miRNA exons. Overall our data suggest a novel mechanism for regulating gene expression in which MPC-dependent cleavage of SO-miRNA exons could cause premature transcriptional termination of coding genes rather than affecting alternative splicing. Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Kabir, Firoz; Ullah, Inayat; Ali, Shahbaz; Gottsch, Alexander D.H.; Naeem, Muhammad Asif; Assir, Muhammad Zaman; Khan, Shaheen N.; Akram, Javed; Riazuddin, Sheikh; Ayyagari, Radha; Hejtmancik, J. Fielding
2016-01-01
Purpose This study was undertaken to identify causal mutations responsible for autosomal recessive retinitis pigmentosa (arRP) in consanguineous families. Methods Large consanguineous families were ascertained from the Punjab province of Pakistan. An ophthalmic examination consisting of a fundus evaluation and electroretinography (ERG) was completed, and small aliquots of blood were collected from all participating individuals. Genomic DNA was extracted from white blood cells, and a genome-wide linkage or a locus-specific exclusion analysis was completed with polymorphic short tandem repeats (STRs). Two-point logarithm of odds (LOD) scores were calculated, and all coding exons and exon–intron boundaries of RP1 were sequenced to identify the causal mutation. Results The ophthalmic examination showed that affected individuals in all families manifest cardinal symptoms of RP. Genome-wide scans localized the disease phenotype to chromosome 8q, a region harboring RP1, a gene previously implicated in the pathogenesis of RP. Sanger sequencing identified a homozygous single base deletion in exon 4: c.3697delT (p.S1233Pfs22*), a single base substitution in intron 3: c.787+1G>A (p.I263Nfs8*), a 2 bp duplication in exon 2: c.551_552dupTA (p.Q185Yfs4*) and an 11,117 bp deletion that removes all three coding exons of RP1. These variations segregated with the disease phenotype within the respective families and were not present in ethnically matched control samples. Conclusions These results strongly suggest that these mutations in RP1 are responsible for the retinal phenotype in affected individuals of all four consanguineous families. PMID:27307693
IL-TIF/IL-22: genomic organization and mapping of the human and mouse genes.
Dumoutier, L; Van Roost, E; Ameye, G; Michaux, L; Renauld, J C
2000-12-01
IL-TIF is a new cytokine originally identified as a gene induced by IL-9 in murine T lymphocytes, and showing 22% amino acid identity with IL-10. Here, we report the sequence and organization of the mouse and human IL-TIF genes, which both consist of 6 exons spreading over approximately 6 Kb. The IL-TIF gene is a single copy gene in humans, and is located on chromosome 12q15, at 90 Kb from the IFN gamma gene, and at 27 Kb from the AK155 gene, which codes for another IL-10-related cytokine. In the mouse, the IL-TIF gene is located on chromosome 10, also in the same region as the IFN gamma gene. Although it is a single copy gene in BALB/c and DBA/2 mice, the IL-TIF gene is duplicated in other strains such as C57Bl/6, FVB and 129. The two copies, which show 98% nucleotide identity in the coding region, were named IL-TIF alpha and IL-TIF beta. Beside single nucleotide variations, they differ by a 658 nucleotide deletion in IL-TIF beta, including the first non-coding exon and 603 nucleotides from the promoter. A DNA fragment corresponding to this deletion was sufficient to confer IL-9-regulated expression of a luciferase reporter plasmid, suggesting that the IL-TIF beta gene is either differentially regulated, or not expressed at all.
Mutational screening of FGFR1, CER1, and CDON in a large cohort of trigonocephalic patients.
Jehee, Fernanda Sarquis; Alonso, Luis G; Cavalcanti, Denise P; Kim, Chong; Wall, Steven A; Mulliken, John B; Sun, Miao; Jabs, Ethylin Wang; Boyadjiev, Simeon A; Wilkie, Andrew O M; Passos-Bueno, Maria Rita
2006-03-01
Screen the known craniosynostotic related gene, FGFR1 (exon 7), and two new identified potential candidates, CER1 and CDON, in patients with syndromic and nonsyndromic metopic craniosynostosis to determine if they might be causative genes. Using single-strand conformational polymorphisms (SSCPs), denaturing high-performance liquid chromatography, and/or direct sequencing, we analyzed a total of 81 patients for FGFR1 (exon 7), 70 for CER1, and 44 for CDON. Patients were ascertained in the Centro de Estudos do Genoma Humano in São Paulo, Brazil (n = 39), the Craniofacial Unit, Oxford, U.K. (n = 23), and the Johns Hopkins University, Baltimore, Maryland (n = 31). Clinical inclusion criteria included a triangular head and/or forehead, with or without a metopic ridge, and a radiographic documentation of metopic synostosis. Both syndromic and nonsyndromic patients were studied. No sequence alterations were found for FGFR1 (exon 7). Different patterns of SSCP migration for CER1 compatible with the segregation of single nucleotide polymorphisms reported in the region were identified. Seventeen sequence alterations were detected in the coding region of CDON, seven of which are new, but segregation analysis in parents and homology studies did not indicate a pathological role. FGFR1 (exon 7), CER1, and CDON are not related to trigonocephaly in our sample and should not be considered as causative genes for metopic synostosis. Screening of FGFR1 (exon 7) for diagnostic purposes should not be performed in trigonocephalic patients.
Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M
2017-01-01
Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
Evolution of the alternative AQP2 gene: Acquisition of a novel protein-coding sequence in dolphins.
Kishida, Takushi; Suzuki, Miwa; Takayama, Asuka
2018-01-01
Taxon-specific de novo protein-coding sequences are thought to be important for taxon-specific environmental adaptation. A recent study revealed that bottlenose dolphins acquired a novel isoform of aquaporin 2 generated by alternative splicing (alternative AQP2), which helps dolphins to live in hyperosmotic seawater. The AQP2 gene consists of four exons, but the alternative AQP2 gene lacks the fourth exon and instead has a longer third exon that includes the original third exon and a part of the original third intron. Here, we show that the latter half of the third exon of the alternative AQP2 arose from a non-protein-coding sequence. Intact ORF of this de novo sequence is shared not by all cetaceans, but only by delphinoids. However, this sequence is conservative in all modern cetaceans, implying that this de novo sequence potentially plays important roles for marine adaptation in cetaceans. Copyright © 2017 Elsevier Inc. All rights reserved.
Ito, M; Mori, Y; Oiso, Y; Saito, H
1991-01-01
To elucidate the molecular mechanism of familial central diabetes insipidus (FDI), we sequenced the arginine vasopressin-neurophysin II (AVP-NPII) gene in 2 patients belonging to a pedigree that is consistent with an autosomal dominant mode of inheritance. 10 patients with idiopathic central diabetes insipidus (IDI) and 5 normals were also studied. The AVP-NPII gene, locating on chromosome 20, consists of three exons that encode putative signal peptide, AVP, NPII, and glycoprotein. Using polymerase chain reaction, fragments including the promoter region and all coding regions were amplified from genomic DNA and subjected to direct sequencing. Sequences of 10 patients with IDI were identical with those of normals, while in 2 patients with FDI, a single base substitution was detected in one of two alleles of the AVP-NPII gene, indicating they were heterozygotes for this mutation. It was a G----A transition at nucleotide position 1859 in the second exon, resulting in a substitution of Gly for Ser at amino acid position 57 in the NPII moiety. It was speculated that the mutated AVP-NPII precursor or the mutated NPII molecule, through their conformational changes, might be responsible for AVP deficiency. Images PMID:1840604
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.
1994-12-31
Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less
Teasdale, Luisa C; Köhler, Frank; Murray, Kevin D; O'Hara, Tim; Moussalli, Adnan
2016-09-01
The qualification of orthology is a significant challenge when developing large, multiloci phylogenetic data sets from assembled transcripts. Transcriptome assemblies have various attributes, such as fragmentation, frameshifts and mis-indexing, which pose problems to automated methods of orthology assessment. Here, we identify a set of orthologous single-copy genes from transcriptome assemblies for the land snails and slugs (Eupulmonata) using a thorough approach to orthology determination involving manual alignment curation, gene tree assessment and sequencing from genomic DNA. We qualified the orthology of 500 nuclear, protein-coding genes from the transcriptome assemblies of 21 eupulmonate species to produce the most complete phylogenetic data matrix for a major molluscan lineage to date, both in terms of taxon and character completeness. Exon capture targeting 490 of the 500 genes (those with at least one exon >120 bp) from 22 species of Australian Camaenidae successfully captured sequences of 2825 exons (representing all targeted genes), with only a 3.7% reduction in the data matrix due to the presence of putative paralogs or pseudogenes. The automated pipeline Agalma retrieved the majority of the manually qualified 500 single-copy gene set and identified a further 375 putative single-copy genes, although it failed to account for fragmented transcripts resulting in lower data matrix completeness when considering the original 500 genes. This could potentially explain the minor inconsistencies we observed in the supported topologies for the 21 eupulmonate species between the manually curated and 'Agalma-equivalent' data set (sharing 458 genes). Overall, our study confirms the utility of the 500 gene set to resolve phylogenetic relationships at a range of evolutionary depths and highlights the importance of addressing fragmentation at the homolog alignment stage for probe design. © 2016 John Wiley & Sons Ltd.
p53 in pure epithelioid PEComa: an immunohistochemistry study and gene mutation analysis.
Bing, Zhanyong; Yao, Yuan; Pasha, Theresa; Tomaszewski, John E; Zhang, Paul J
2012-04-01
Pure epithelioid PEComa (PEP; so-called epithelioid angiomyolipoma) is rare and is more often associated with aggressive behaviors. The pathogenesis of PEP has been poorly understood. The authors studied p53 expression and gene mutation in PEPs by immunohistochemistry, single-strand conformation polymorphism, and direct sequencing in paraffin material from 8 PEPs. A group of classic angiomyolipomas (AMLs) were also analyzed for comparison. Five PEPs were from kidneys and 1 each from the heart, the liver, and the uterus. PEPs showed much stronger p53 nuclear staining (Allred score 6.4 ± 2.5) than the classic AML (2.3 ± 2.9) (P < .01). There was no p53 single-strand conformation polymorphism identified in either the PEPs or the 8 classic AMLs. p53 mutation analyses by direct sequencing of exons 5 to 9 showed 4 mutations in 3 of 8 PEPs but none in any of the 8 classic AMLs. The mutations included 2 missense mutations in a hepatic PEComa and 2 silent mutations in 2 renal PEPs. Both the missense mutations in the hepatic PEComa involved the exon 5, one involving codon 165, with change from CAG to CAC (coding amino acid changed from glutamine to histidine), and the other involving codon 182, with change from TGC to TAC (coding amino acid changed from cysteine to tyrosine). The finding of stronger p53 expression and mutations in epithelioid angiomyolipomas might have contributed to their less predictable behavior. However, the abnormal p53 expression cannot be entirely explained by p53 mutations in the exons examined in the PEPs.
Intergenic disease-associated regions are abundant in novel transcripts.
Bartonicek, N; Clark, M B; Quek, X C; Torpy, J R; Pritchard, A L; Maag, J L V; Gloss, B S; Crawford, J; Taft, R J; Hayward, N K; Montgomery, G W; Mattick, J S; Mercer, T R; Dinger, M E
2017-12-28
Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. This resource of previously unreported transcripts in disease-associated regions ( http://gwas-captureseq.dingerlab.org ) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets.
COOLAIR Antisense RNAs Form Evolutionarily Conserved Elaborate Secondary Structures
Hawkes, Emily J.; Hennelly, Scott P.; Novikova, Irina V.; ...
2016-09-20
There is considerable debate about the functionality of long non-coding RNAs (lncRNAs). Lack of sequence conservation has been used to argue against functional relevance. Here, we investigated antisense lncRNAs, called COOLAIR, at the A. thaliana FLC locus and experimentally determined their secondary structure. The major COOLAIR variants are highly structured, organized by exon. The distally polyadenylated transcript has a complex multi-domain structure, altered by a single non-coding SNP defining a functionally distinct A. thaliana FLC haplotype. The A. thaliana COOLAIR secondary structure was used to predict COOLAIR exons in evolutionarily divergent Brassicaceae species. These predictions were validated through chemical probingmore » and cloning. Despite the relatively low nucleotide sequence identity, the structures, including multi-helix junctions, show remarkable evolutionary conservation. In a number of places, the structure is conserved through covariation of a non-contiguous DNA sequence. This structural conservation supports a functional role for COOLAIR transcripts rather than, or in addition to, antisense transcription.« less
COOLAIR Antisense RNAs Form Evolutionarily Conserved Elaborate Secondary Structures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hawkes, Emily J.; Hennelly, Scott P.; Novikova, Irina V.
There is considerable debate about the functionality of long non-coding RNAs (lncRNAs). Lack of sequence conservation has been used to argue against functional relevance. Here, we investigated antisense lncRNAs, called COOLAIR, at the A. thaliana FLC locus and experimentally determined their secondary structure. The major COOLAIR variants are highly structured, organized by exon. The distally polyadenylated transcript has a complex multi-domain structure, altered by a single non-coding SNP defining a functionally distinct A. thaliana FLC haplotype. The A. thaliana COOLAIR secondary structure was used to predict COOLAIR exons in evolutionarily divergent Brassicaceae species. These predictions were validated through chemical probingmore » and cloning. Despite the relatively low nucleotide sequence identity, the structures, including multi-helix junctions, show remarkable evolutionary conservation. In a number of places, the structure is conserved through covariation of a non-contiguous DNA sequence. This structural conservation supports a functional role for COOLAIR transcripts rather than, or in addition to, antisense transcription.« less
Kim, Yoonhee; Suktitipat, Bhoom; Yanek, Lisa R.; Faraday, Nauder; Wilson, Alexander F.; Becker, Diane M.; Becker, Lewis C.; Mathias, Rasika A.
2013-01-01
Platelet aggregation is heritable, and genome-wide association studies have detected strong associations with a common intronic variant of the platelet endothelial aggregation receptor1 (PEAR1) gene both in African American and European American individuals. In this study, we used a sequencing approach to identify additional exonic variants in PEAR1 that may also determine variability in platelet aggregation in the GeneSTAR Study. A 0.3 Mb targeted region on chromosome 1q23.1 including the entire PEAR1 gene was Sanger sequenced in 104 subjects (45% male, 49% African American, age = 52±13) selected on the basis of hyper- and hypo- aggregation across three different agonists (collagen, epinephrine, and adenosine diphosphate). Single-variant and multi-variant burden tests for association were performed. Of the 235 variants identified through sequencing, 61 were novel, and three of these were missense variants. More rare variants (MAF<5%) were noted in African Americans compared to European Americans (108 vs. 45). The common intronic GWAS-identified variant (rs12041331) demonstrated the most significant association signal in African Americans (p = 4.020×10−4); no association was seen for additional exonic variants in this group. In contrast, multi-variant burden tests indicated that exonic variants play a more significant role in European Americans (p = 0.0099 for the collective coding variants compared to p = 0.0565 for intronic variant rs12041331). Imputation of the individual exonic variants in the rest of the GeneSTAR European American cohort (N = 1,965) supports the results noted in the sequenced discovery sample: p = 3.56×10−4, 2.27×10−7, 5.20×10−5 for coding synonymous variant rs56260937 and collagen, epinephrine and adenosine diphosphate induced platelet aggregation, respectively. Sequencing approaches confirm that a common intronic variant has the strongest association with platelet aggregation in African Americans, and show that exonic variants play an additional role in platelet aggregation in European Americans. PMID:23704978
Murray, R; Pederson, K; Prosser, H; Muller, D; Hutchison, C A; Frelinger, J A
1988-01-01
We have used random oligonucleotide mutagenesis (or saturation mutagenesis) to create a library of point mutations in the alpha 1 protein domain of a Major Histocompatibility Complex (MHC) molecule. This protein domain is critical for T cell and B cell recognition. We altered the MHC class I H-2DP gene sequence such that synthetic mutant alpha 1 exons (270 bp of coding sequence), which contain mutations identified by sequence analysis, can replace the wild type alpha 1 exon. The synthetic exons were constructed from twelve overlapping oligonucleotides which contained an average of 1.3 random point mutations per intact exon. DNA sequence analysis of mutant alpha 1 exons has shown a point mutant distribution that fits a Poisson distribution, and thus emphasizes the utility of this mutagenesis technique to "scan" a large protein sequence for important mutations. We report our use of saturation mutagenesis to scan an entire exon of the H-2DP gene, a cassette strategy to replace the wild type alpha 1 exon with individual mutant alpha 1 exons, and analysis of mutant molecules expressed on the surface of transfected mouse L cells. Images PMID:2903482
NASA Technical Reports Server (NTRS)
Chang, Dong Kyung; Metzgar, David; Wills, Christopher; Boland, C. Richard
2003-01-01
All "minor" components of the human DNA mismatch repair (MMR) system-MSH3, MSH6, PMS2, and the recently discovered MLH3-contain mononucleotide microsatellites in their coding sequences. This intriguing finding contrasts with the situation found in the major components of the DNA MMR system-MSH2 and MLH1-and, in fact, most human genes. Although eukaryotic genomes are rich in microsatellites, non-triplet microsatellites are rare in coding regions. The recurring presence of exonal mononucleotide repeat sequences within a single family of human genes would therefore be considered exceptional.
USDA-ARS?s Scientific Manuscript database
The actions of prolactin (PRL) are mediated by both long (LF) and short isoforms (SF) of the PRL receptor (PRLR). Here, we report on a genetic and functional analysis of the porcine PRLR (pPRLR) SF. Three single nucleotide polymorphisms (SNPs) within exon 11 of the pPRLR-SF give rise to four amino a...
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
Evaluating the protein coding potential of exonized transposable element sequences
Piriyapongsa, Jittima; Rutledge, Mark T; Patel, Sanil; Borodovsky, Mark; Jordan, I King
2007-01-01
Background Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences have turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: i-by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and ii-through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. Results We compared the ability of three classes of sequence similarity search methods to detect TE-derived sequences among data sets of experimentally characterized proteins: 1-a profile-based hidden Markov model (HMM) approach, 2-BLAST methods and 3-RepeatMasker. Profile based methods are more sensitive and more selective than the other methods evaluated. However, the application of profile-based search methods to the detection of TE-derived sequences among well-curated experimentally characterized protein data sets did not turn up many more cases than had been previously detected and nowhere near as many cases as recent genome-wide searches have. We observed that the different search methods used were complementary in the sense that they yielded largely non-overlapping sets of hits and differed in their ability to recover known cases of TE-derived CDS. The probabilistic analysis of TE-derived exon sequences indicates that these sequences have low protein coding potential on average. In particular, non-autonomous TEs that do not encode protein sequences, such as Alu elements, are frequently exonized but unlikely to encode protein sequences. Conclusion The exaptation of the numerous TE sequences found in exons as bona fide protein coding sequences may prove to be far less common than has been suggested by the analysis of complete genomes. We hypothesize that many exonized TE sequences actually function as post-transcriptional regulators of gene expression, rather than coding sequences, which may act through a variety of double stranded RNA related regulatory pathways. Indeed, their relatively high copy numbers and similarity to sequences dispersed throughout the genome suggests that exonized TE sequences could serve as master regulators with a wide scope of regulatory influence. Reviewers: This article was reviewed by Itai Yanai, Kateryna D. Makova, Melissa Wilson (nominated by Kateryna D. Makova) and Cedric Feschotte (nominated by John M. Logsdon Jr.). PMID:18036258
Yang, Q L; Huang, X Y; Kong, J J; Zhao, S G; Liu, L X; Gun, S B
2016-08-19
Piglet diarrhea is one of the primary factors that affects the benefits of the swine industry. Recent studies have shown that exon 2 of the swine leukocyte antigen-DQA gene is associated with piglet resistance to diarrhea; however, the contributions of additional exon coding regions of this gene remain unclear. Here, we detected and sequenced variants in the exon 3 region and examined their associations with diarrhea infection in 425 suckling piglets using the polymerase chain reaction-single-strand conformational polymorphism and sequencing analysis. The results revealed that exon 3 of the swine leukocyte antigen-DQA gene is highly polymorphic and pivotal to both diarrhea susceptibility and resistance in piglets. We identified 14 genotypes (AA, AB, BB, BC, CC, EE, EF, BE, BF, CF, DD, DH, GG, and GF) and eight alleles (A-H) that were generated by 14 nucleotide variants, eight of which were novel, and three nucleotide deletions. Statistical analyses revealed that the genotypes AB and EF were associated with resistance to diarrheal disease (P < 0.05), and the genotype DD may contribute to diarrhea susceptibility but was unique to Large White pigs (P > 0.05). These results elucidate the genetic and immunological background to piglet diarrhea, and provide useful information for resistance breeding programs.
Identification of new mutations in primary hyperoxaluria type 1 (PH1).
von Schnakenburg, C; Rumsby, G
1998-01-01
Primary hyperoxaluria type 1 (PH1) is caused by deficiency of the hepatic peroxisomal enzyme alanine:glyoxylate aminotransferase (AGT). The AGXT gene, which codes for the 392 amino acid protein, has been mapped to chromosome 2q37.3. In order to identify new mutations in the AGXT gene we studied 79 PH1 patients using single strand conformation polymorphism analysis. In addition to a cluster of new mutations in exon 7 we report five novel mutations in exons 2, 4, 5, 9 and 10. These are T444C, G640A, G690A, 1008-1010delGCG and G1171A. These five new mutations contribute to our knowledge of the AGXT gene. Their possible consequences for PH1 phenotype and enzyme activity are discussed.
Exon 11 skipping of SCN10A coding for voltage-gated sodium channels in dorsal root ganglia
Schirmeyer, Jana; Szafranski, Karol; Leipold, Enrico; Mawrin, Christian; Platzer, Matthias; Heinemann, Stefan H
2014-01-01
The voltage-gated sodium channel NaV1.8 (encoded by SCN10A) is predominantly expressed in dorsal root ganglia (DRG) and plays a critical role in pain perception. We analyzed SCN10A transcripts isolated from human DRGs using deep sequencing and found a novel splice variant lacking exon 11, which codes for 98 amino acids of the domain I/II linker. Quantitative PCR analysis revealed an abundance of this variant of up to 5–10% in human, while no such variants were detected in mouse or rat. Since no obvious functional differences between channels with and without the exon-11 sequence were detected, it is suggested that SCN10A exon 11 skipping in humans is a tolerated event. PMID:24763188
Zorc, Minja; Kunej, Tanja
2016-05-01
MicroRNAs (miRNAs) are a class of non-coding RNAs involved in posttranscriptional regulation of target genes. Regulation requires complementarity between target mRNA and the mature miRNA seed region, responsible for their recognition and binding. It has been estimated that each miRNA targets approximately 200 genes, and genetic variability of miRNA genes has been reported to affect phenotypic variability and disease susceptibility in humans, livestock species, and model organisms. Polymorphisms in miRNA genes could therefore represent biomarkers for phenotypic traits in livestock animals. In our previous study, we collected polymorphisms within miRNA genes in chicken. In the present study, we identified miRNA-related genomic overlaps to prioritize genomic regions of interest for further functional studies and biomarker discovery. Overlapping genomic regions in chicken were analyzed using the following bioinformatics tools and databases: miRNA SNiPer, Ensembl, miRBase, NCBI Blast, and QTLdb. Out of 740 known pre-miRNA genes, 263 (35.5 %) contain polymorphisms; among them, 35 contain more than three polymorphisms The most polymorphic miRNA genes in chicken are gga-miR-6662, containing 23 single nucleotide polymorphisms (SNPs) within the pre-miRNA region, including five consecutive SNPs, and gga-miR-6688, containing ten polymorphisms including three consecutive polymorphisms. Several miRNA-related genomic hotspots have been revealed in chicken genome; polymorphic miRNA genes are located within protein-coding and/or non-coding transcription units and quantitative trait loci (QTL) associated with production traits. The present study includes the first description of an exonic miRNA in a chicken genome, an overlap between the miRNA gene and the exon of the protein-coding gene (gga-miR-6578/HADHB), and the first report of a missense polymorphism located within a mature miRNA seed region. Identified miRNA-related genomic hotspots in chicken can serve researchers as a starting point for further functional studies and association studies with poultry production and health traits and the basis for systematic screening of exonic miRNAs and missense/miRNA seed polymorphisms in other genomes.
Parvari, R; Shen, J; Hershkovitz, E; Chen, Y T; Moses, S W
1998-04-01
Glycogen storage disease type III (GSD III) is an autosomal recessive disease caused by the deficiency of glycogen debranching enzyme (AGL). We report the finding of two new mutations in a GSD IIIa Ashkenazi Jewish patient. Both mutations are insertion of an adenine into a stretch of 8 adenines towards the 3' end of the coding region, one at position 3904 (3904insA) in exon 30, the second at position 4214 (4214insA) in exon 32. The mutations cause frameshifts and premature terminations of the glycogen debranching enzyme, the first causing a frameshift at amino acid 1304, the second causing a frameshift at amino acid 1408 of the total of 1532. These mutations demonstrate the importance of the 125 amino acids at the carboxy-terminus of the debrancher enzyme for its activity and support the suggestion that the putative glycogen binding domain is located in the carboxy-terminus of the AGL. The mutations cause distinctive single-strand conformation polymorphism (SSCP) patterns enabling easy detection.
Soukarieh, Omar; Gaildrat, Pascaline; Hamieh, Mohamad; Drouet, Aurélie; Baert-Desurmont, Stéphanie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra
2016-01-01
The identification of a causal mutation is essential for molecular diagnosis and clinical management of many genetic disorders. However, even if next-generation exome sequencing has greatly improved the detection of nucleotide changes, the biological interpretation of most exonic variants remains challenging. Moreover, particular attention is typically given to protein-coding changes often neglecting the potential impact of exonic variants on RNA splicing. Here, we used the exon 10 of MLH1, a gene implicated in hereditary cancer, as a model system to assess the prevalence of RNA splicing mutations among all single-nucleotide variants identified in a given exon. We performed comprehensive minigene assays and analyzed patient’s RNA when available. Our study revealed a staggering number of splicing mutations in MLH1 exon 10 (77% of the 22 analyzed variants), including mutations directly affecting splice sites and, particularly, mutations altering potential splicing regulatory elements (ESRs). We then used this thoroughly characterized dataset, together with experimental data derived from previous studies on BRCA1, BRCA2, CFTR and NF1, to evaluate the predictive power of 3 in silico approaches recently described as promising tools for pinpointing ESR-mutations. Our results indicate that ΔtESRseq and ΔHZEI-based approaches not only discriminate which variants affect splicing, but also predict the direction and severity of the induced splicing defects. In contrast, the ΔΨ-based approach did not show a compelling predictive power. Our data indicates that exonic splicing mutations are more prevalent than currently appreciated and that they can now be predicted by using bioinformatics methods. These findings have implications for all genetically-caused diseases. PMID:26761715
Ortuño-Pineda, Carlos; Galindo-Rosales, José Manuel; Calderón-Salinas, José Victor; Villegas-Sepúlveda, Nicolás; Saucedo-Cárdenas, Odila; De Nova-Ocampo, Mónica; Valdés, Jesús
2012-01-01
The splicing of the N exon in the pre-mRNA coding for the RE1-silencing transcription factor (REST) results in a truncated protein that modifies the expression pattern of some of its target genes. A weak 3'ss, three alternative 5'ss (N4-, N50-, and N62-5'ss) and a variety of putative target sites for splicing regulatory proteins are found around the N exon; two GGGG codes (G2-G3) and a poly-Uridine tract (N-PU) are found in front of the N50-5'ss. In this work we analyzed some of the regulatory factors and elements involved in the preferred selection of the N50-5'ss (N50 activation) in the small cell lung cancer cell line H69. Wild type and mutant N exon/β-globin minigenes recapitulated N50 exon splicing in H69 cells, and showed that the N-PU and the G2-G3 elements are required for N50 exon splicing. Biochemical and knockdown experiments identified these elements as U2AF65 and hnRNP H targets, respectively, and that they are also required for N50 exon activation. Compared to normal MRC5 cells, and in keeping with N50 exon activation, U2AF65, hnRNP H and other splicing factors were highly expressed in H69 cells. CLIP experiments revealed that hnRNP H RNA-binding occurs first and is a prerequisite for U2AF65 RNA binding, and EMSA and CLIP experiments suggest that U2AF65-RNA recognition displaces hnRNP H and helps to recruit other splicing factors (at least U1 70K) to the N50-5'ss. Our results evidenced novel hnRNP H and U2AF65 functions: respectively, U2AF65-recruiting to a 5'ss in humans and the hnRNP H-displacing function from two juxtaposed GGGG codes. PMID:22792276
[Preimplantation genetic diagnosis of Duchenne muscular dystrophy by single cell triplex PCR].
Wu, Yue-Li; Wu, Ling-Qian; Li, Yan-Ping; Liu, Dong-E; Zeng, Qiao; Zhu, Hai-Yan; Pan, Qian; Liang, De-Sheng; Hu, Hao; Long, Zhi-Gao; Li, Juan; Dai, He-Ping; Xia, Kun; Xia, Jia-Hui
2007-04-01
To detect two exons of Duchenne muscular dystrophy (DMD) gene and a gender discrimination locus amelogenin gene by single cell triplex PCR, and to evaluate the possibility of this technique for preimplantation genetic diagnosis (PGD) in DMD family with DMD deletion mutation. Single lymphocytes from a normal male, a normal female, two DMD patients (exon 8 and 47 deleted, respectively) and single blastomeres from the couples treated by the in vitro fertilization pre-embryo transfer (IVF-ET) and without family history of DMD were obtained. Exons 8 and 47 of DMD gene were amplified by a triplex PCR assay, the amelogenin gene on X and Y chromosomes were co-amplified to analyze the correlation between embryo gender and deletion status. In the normal single lymphocytes, the amplification rate of exons 8 and 47 of DMD and amelogenin gene were 93.8%, 93.8%, and 95.3% respectively. The false positive rate was 3.3%. In the exon 8 deleted DMD patient, the amplification rate of exon 47 of DMD and amelogenin gene was 95.8%, and the false positive rate was 3.3%. In the exon 47 deleted DMD patient, the amplification rate of exon 8 of DMD and amelogenin gene was 95.8%, and the false positive rate was 0. In the single blastomeres, the amplification rate of exons 8 and 47 of DMD and amelogenin gene was 82.5%, 80.0% and 77.5%, respectively, and the false positive rate was 0. The single cell triplex PCR protocol for the detection of DMD and amelogenin gene is highly sensitive, specific and reliable, and can be used for PGD in those DMD families with DMD deletion mutation.
Shapiro, James A
2016-06-08
The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess "Read-Write Genomes" they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification.
Shapiro, James A.
2016-01-01
The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess “Read–Write Genomes” they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification. PMID:27338490
New genetic variants of LATS1 detected in urinary bladder and colon cancer.
Saadeldin, Mona K; Shawer, Heba; Mostafa, Ahmed; Kassem, Neemat M; Amleh, Asma; Siam, Rania
2014-01-01
LATS1, the large tumor suppressor 1 gene, encodes for a serine/threonine kinase protein and is implicated in cell cycle progression. LATS1 is down-regulated in various human cancers, such as breast cancer, and astrocytoma. Point mutations in LATS1 were reported in human sarcomas. Additionally, loss of heterozygosity of LATS1 chromosomal region predisposes to breast, ovarian, and cervical tumors. In the current study, we investigated LATS1 genetic variations including single nucleotide polymorphisms (SNPs), in 28 Egyptian patients with either urinary bladder or colon cancers. The LATS1 gene was amplified and sequenced and the expression of LATS1 at the RNA level was assessed in 12 urinary bladder cancer samples. We report, the identification of a total of 29 variants including previously identified SNPs within LATS1 coding and non-coding sequences. A total of 18 variants were novel. Majority of the novel variants, 13, were mapped to intronic sequences and un-translated regions of the gene. Four of the five novel variants located in the coding region of the gene, represented missense mutations within the serine/threonine kinase catalytic domain. Interestingly, LATS1 RNA steady state levels was lost in urinary bladder cancerous tissue harboring four specific SNPs (16045 + 41736 + 34614 + 56177) positioned in the 5'UTR, intron 6, and two silent mutations within exon 4 and exon 8, respectively. This study identifies novel single-base-sequence alterations in the LATS1 gene. These newly identified variants could potentially be used as novel diagnostic or prognostic tools in cancer.
NASA Astrophysics Data System (ADS)
Ma, Ruiqin; He, Feng; Wen, Haishen; Li, Jifang; Shi, Bao; Shi, Dan; Liu, Miao; Mu, Weijie; Zhang, Yuanqing; Hu, Jian; Han, Weiguo; Zhang, Jianan; Wang, Qingqing; Yuan, Yuren; Liu, Qun
2012-03-01
As a specific gene of fish, cytochrome P450c17-II ( CYP17-II) gene plays a key role in the growth, development an reproduction level of fish. In this study, the single-stranded conformational polymorphism (SSCP) technique was used to characterize polymorphisms within the coding region of CYP17-II gene in a population of 75 male Japanese flounder ( Paralichthys olivaceus). Three single nucleotide polymorphisms (SNPs) were identified in CYP17-II gene of Japanese flounder. They were c.G594A (p.G188R), c.G939A and c.G1502A (p.G490D). SNP1 (c.G594A), located in exon 4 of CYP17-II gene, was significantly associated with gonadosomatic index (GSI). Individuals with genotype GG of SNP1 had significantly lower GSI ( P < 0.05) than those with genotype AA or AG. SNP2 (c.G939A) located at the CpG island of CYP17-II gene. The mutation changed the methylation of exon 6. Individuals with genotype AA of SNP2 had significantly lower serum testosterone (T) level and hepatosomatic index (HSI) compared to those with genotype GG. The results suggested that SNP2 could influence the reproductive endocrine of male Japanese flounder. However, the SNP3 (c.G1502A) located in exon 9 did not affect the four measured reproductive traits. This study showed that CYP17-II gene could be a potentially useful candidate gene for the research of genetic breeding and physiological aspects of Japanese flounder.
Toyoda, N; Kleinhaus, N; Larsen, P R
1996-06-01
We analyzed the exon-intron structure of the human type 1 deiodinase gene (dio1) and compared it with that of a patient with suspected congenital type 1 deiodinase (D1) deficiency. The hdio1 gene is identical in exon-intron arrangement to the mouse gene, with coding sequences and a selenocysteine insertion sequence (SECIS) element contained in four exons. There were no mutations in the sequences of exons 1-4 of the patient's genomic DNA. Functional studies by transient expression techniques showed no difference in basal promoter activity or T3 responsiveness between the patient's and the normal dio1 gene. A structural abnormality in the dio1 gene is not a likely explanation for this patient's D1-deficient phenotype.
The Exon-Florio National Security Test for Foreign Investment
2006-03-15
Congressional Research Service ˜ The Library of Congress CRS Report for Congress Received through the CRS Web Order Code RL33312 The Exon- Florio ...number. 1. REPORT DATE 15 MAR 2006 2. REPORT TYPE N/A 3. DATES COVERED - 4. TITLE AND SUBTITLE The Exon- Florio National Security Test for...Z39-18 The Exon- Florio National Security Test for Foreign Investment Summary The proposed acquisitions of major operations in six major U.S. ports by
Van, K; Onoda, S; Kim, M Y; Kim, K D; Lee, S-H
2008-03-01
The Waxy (Wx) gene product controls the formation of a straight chain polymer of amylose in the starch pathway. Dominance/recessiveness of the Wx allele is associated with amylose content, leading to non-waxy/waxy phenotypes. For a total of 113 foxtail millet accessions, agronomic traits and the molecular differences of the Wx gene were surveyed to evaluate genetic diversities. Molecular types were associated with phenotypes determined by four specific primer sets (non-waxy, Type I; low amylose, Type VI; waxy, Type IV or V). Additionally, the insertion of transposable element in waxy was confirmed by ex1/TSI2R, TSI2F/ex2, ex2int2/TSI7R and TSI7F/ex4r. Seventeen single nucleotide polymorphims (SNPs) were observed from non-coding regions, while three SNPs from coding regions were non-synonymous. Interestingly, the phenotype of No. 88 was still non-waxy, although seven nucleotides (AATTGGT) insertion at 2,993 bp led to 78 amino acids shorter. The rapid decline of r (2) in the sequenced region (exon 1-intron 1-exon 2) suggested a low level of linkage disequilibrium and limited haplotype structure. K (s) values and estimation of evolutionary events indicate early divergence of S. italica among cereal crops. This study suggested the Wx gene was one of the targets in the selection process during domestication.
Short intronic repeat sequences facilitate circular RNA production.
Liang, Dongming; Wilusz, Jeremy E
2014-10-15
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
A rare coding variant in TREM2 increases risk for Alzheimer's disease in Han Chinese.
Jiang, Teng; Tan, Lan; Chen, Qi; Tan, Meng-Shan; Zhou, Jun-Shan; Zhu, Xi-Chen; Lu, Huan; Wang, Hui-Fu; Zhang, Ying-Dong; Yu, Jin-Tai
2016-06-01
Two recent studies have identified that a rare coding variant (p.R47H) in exon 2 of triggering receptor expressed on myeloid cells 2 (TREM2) gene is associated with Alzheimer's disease (AD) susceptibility in Caucasians. This association was not successfully replicated in Han Chinese, where this variant was rare or even absent. Previously, we resequenced TREM2 exon 2 to investigate whether additional rare variants conferred risk to AD in our cohort. Although several new variants had been identified, none of them was significantly associated with disease susceptibility. Here, to test whether TREM2 is truly a susceptibility gene of AD in Han Chinese, we extend our previous study by sequencing the other four exons of TREM2 in 988 AD patients and 1,354 healthy controls. We provided the first evidence that a rare coding variant (p.H157Y) in TREM2 exon 3 conferred a considerable risk of AD in our cohort (Pcorrected = 0.02, odds ratio = 11.01, 95% confidence interval: 1.38-88.05). This finding indicates that rare coding variants of TREM2 may play an important role in AD in Han Chinese. Copyright © 2016 Elsevier Inc. All rights reserved.
Accurate clinical detection of exon copy number variants in a targeted NGS panel using DECoN.
Fowler, Anna; Mahamdallie, Shazia; Ruark, Elise; Seal, Sheila; Ramsay, Emma; Clarke, Matthew; Uddin, Imran; Wylie, Harriet; Strydom, Ann; Lunter, Gerton; Rahman, Nazneen
2016-11-25
Background: Targeted next generation sequencing (NGS) panels are increasingly being used in clinical genomics to increase capacity, throughput and affordability of gene testing. Identifying whole exon deletions or duplications (termed exon copy number variants, 'exon CNVs') in exon-targeted NGS panels has proved challenging, particularly for single exon CNVs. Methods: We developed a tool for the Detection of Exon Copy Number variants (DECoN), which is optimised for analysis of exon-targeted NGS panels in the clinical setting. We evaluated DECoN performance using 96 samples with independently validated exon CNV data. We performed simulations to evaluate DECoN detection performance of single exon CNVs and to evaluate performance using different coverage levels and sample numbers. Finally, we implemented DECoN in a clinical laboratory that tests BRCA1 and BRCA2 with the TruSight Cancer Panel (TSCP). We used DECoN to analyse 1,919 samples, validating exon CNV detections by multiplex ligation-dependent probe amplification (MLPA). Results: In the evaluation set, DECoN achieved 100% sensitivity and 99% specificity for BRCA exon CNVs, including identification of 8 single exon CNVs. DECoN also identified 14/15 exon CNVs in 8 other genes. Simulations of all possible BRCA single exon CNVs gave a mean sensitivity of 98% for deletions and 95% for duplications. DECoN performance remained excellent with different levels of coverage and sample numbers; sensitivity and specificity was >98% with the typical NGS run parameters. In the clinical pipeline, DECoN automatically analyses pools of 48 samples at a time, taking 24 minutes per pool, on average. DECoN detected 24 BRCA exon CNVs, of which 23 were confirmed by MLPA, giving a false discovery rate of 4%. Specificity was 99.7%. Conclusions: DECoN is a fast, accurate, exon CNV detection tool readily implementable in research and clinical NGS pipelines. It has high sensitivity and specificity and acceptable false discovery rate. DECoN is freely available at www.icr.ac.uk/decon.
Lim, Byung Chan; Lee, Seungbok; Shin, Jong-Yeon; Kim, Jong-Il; Hwang, Hee; Kim, Ki Joong; Hwang, Yong Seung; Seo, Jeong-Sun; Chae, Jong Hee
2011-11-01
Duchenne muscular dystrophy or Becker muscular dystrophy might be a suitable candidate disease for application of next-generation sequencing in the genetic diagnosis because the complex mutational spectrum and the large size of the dystrophin gene require two or more analytical methods and have a high cost. The authors tested whether large deletions/duplications or small mutations, such as point mutations or short insertions/deletions of the dystrophin gene, could be predicted accurately in a single platform using next-generation sequencing technology. A custom solution-based target enrichment kit was designed to capture whole genomic regions of the dystrophin gene and other muscular-dystrophy-related genes. A multiplexing strategy, wherein four differently bar-coded samples were captured and sequenced together in a single lane of the Illumina Genome Analyser, was applied. The study subjects were 25 16 with deficient dystrophin expression without a large deletion/duplication and 9 with a known large deletion/duplication. Nearly 100% of the exonic region of the dystrophin gene was covered by at least eight reads with a mean read depth of 107. Pathogenic small mutations were identified in 15 of the 16 patients without a large deletion/duplication. Using these 16 patients as the standard, the authors' method accurately predicted the deleted or duplicated exons in the 9 patients with known mutations. Inclusion of non-coding regions and paired-end sequence analysis enabled accurate identification by increasing the read depth and providing information about the breakpoint junction. The current method has an advantage for the genetic diagnosis of Duchenne muscular dystrophy and Becker muscular dystrophy wherein a comprehensive mutational search may be feasible using a single platform.
Yan, Xukun; Zhang, Tianyu; Wang, Zhengmin; Jiang, Yi; Chen, Yan; Wang, Hongyan; Ma, Duan; Wang, Lei; Li, Huawei
2011-12-20
Waardenburg syndrome type II (WS2) is associated with syndromic deafness. A subset of WS2, WS2A, accounting for approximately 15% of patients, is attributed to mutations in the microphthalmia-associated transcription factor (MITF) gene. We examined the genetic basis of WS2 in a large Chinese family. All 9 exons of the MITF gene, the single coding exon (exon 2) of the most common hereditary deafness gene GJB2 and the mitochondrial DNA (mtDNA) 12S rRNA were sequenced. A novel heterozygous mutation c.[742_743delAAinsT;746_747delCA] in exon 8 of the MITF gene co-segregates with WS2 in the family. The MITF mutation results in a premature termination codon and a truncated MITF protein with only 247 of the 419 wild type amino acids. The deaf proband had this MITF gene heterozygous mutation as well as a c.[109G>A]+[235delC] compound heterozygous pathogenic mutation in the GJB2 gene. No pathogenic mutation was found in mtDNA 12S rRNA in this family. Thus, a novel compound heterozygous mutation, c.[742_743delAAinsT;746_747delCA] in MITF exon 8 was the key genetic reason for WS2 in this family, and a digenic effect of MITF and GJB2 genes may contribute to deafness of the proband. Copyright © 2011. Published by Elsevier Ltd.
Drögemüller, Cord; Philipp, Ute; Haase, Bianca; Günzel-Apel, Anne-Rose; Leeb, Tosso
2007-01-01
Coat color dilution in several breeds of dog is characterized by a specific pigmentation phenotype and sometimes accompanied by hair loss and recurrent skin inflammation, the so-called color dilution alopecia or black hair follicular dysplasia. Coat color dilution (d) is inherited as a Mendelian autosomal recessive trait. In a previous study, MLPH polymorphisms showed perfect cosegregation with the dilute phenotype within breeds. However, different dilute haplotypes were found in different breeds, and no single polymorphism was identified in the coding sequence that was likely to be causative for the dilute phenotype. We resequenced the 5'-region of the canine MLPH gene and identified a strong candidate single nucleotide polymorphism within the nontranslated exon 1, which showed perfect association to the dilute phenotype in 65 dilute dogs from 7 different breeds. The A/G polymorphism is located at the last nucleotide of exon 1 and the mutant A-allele is predicted to reduce splicing efficiency 8-fold. An MLPH mRNA expression study using quantitative reverse transcriptase-polymerase chain reaction confirmed that dd animals had only about approximately 25% of the MLPH transcript compared with DD animals. These results provide preliminary evidence that the reported regulatory MLPH mutation might represent a causal mutation for coat color dilution in dogs.
Chen, L P; E, G X; Zhao, Y J; Na, R S; Zhao, Z Q; Zhang, J H; Ma, Y H; Sun, Y W; Zhong, T; Zhang, H P; Huang, Y F
2015-06-18
DRA encodes the alpha chain of the DR heterodimer, is closely linked to DRB and is considered almost monomorphic in major histocompatibility complex region. In this study, we identified the exon 2 of DRA to evaluate the immunogenetic diversity of Chinese south indigenous goat. Two single nucleotide polymorphisms in an untranslated region and one synonymous substitution in coding region were identified. These data suggest that high immunodiversity in native Chinese population.
Polymorphism at codon 36 of the p53 gene.
Felix, C A; Brown, D L; Mitsudomi, T; Ikagaki, N; Wong, A; Wasserman, R; Womer, R B; Biegel, J A
1994-01-01
A polymorphism at codon 36 in exon 4 of the p53 gene was identified by single strand conformation polymorphism (SSCP) analysis and direct sequencing of genomic DNA PCR products. The polymorphic allele, present in the heterozygous state in genomic DNAs of four of 100 individuals (4%), changes the codon 36 CCG to CCA, eliminates a FinI restriction site and creates a BccI site. Including this polymorphism there are four known polymorphisms in the p53 coding sequence.
NASA Technical Reports Server (NTRS)
Donoho, Greg; Brenneman, Mark A.; Cui, Tracy X.; Donoviel, Dorit; Vogel, Hannes; Goodwin, Edwin H.; Chen, David J.; Hasty, Paul
2003-01-01
The Brca2 tumor-suppressor gene contributes to genomic stability, at least in part by a role in homologous recombinational repair. BRCA2 protein is presumed to function in homologous recombination through interactions with RAD51. Both exons 11 and 27 of Brca2 code for domains that interact with RAD51; exon 11 encodes eight BRC motifs, whereas exon 27 encodes a single, distinct interaction domain. Deletion of all RAD51-interacting domains causes embryonic lethality in mice. A less severe phenotype is seen with BRAC2 truncations that preserve some, but not all, of the BRC motifs. These mice can survive beyond weaning, but are runted and infertile, and die very young from cancer. Cells from such mice show hypersensitivity to some genotoxic agents and chromosomal instability. Here, we have analyzed mice and cells with a deletion of only the RAD51-interacting region encoded by exon 27. Mice homozygous for this mutation (called brca2(lex1)) have a shorter life span than that of control littermates, possibly because of early onsets of cancer and sepsis. No other phenotype was observed in these animals; therefore, the brca2(lex1) mutation is less severe than truncations that delete some BRC motifs. However, at the cellular level, the brca2(lex1) mutation causes reduced viability, hypersensitivity to the DNA interstrand crosslinking agent mitomycin C, and gross chromosomal instability, much like more severe truncations. Thus, the extreme carboxy-terminal region encoded by exon 27 is important for BRCA2 function, probably because it is required for a fully functional interaction between BRCA2 and RAD51. Copyright 2003 Wiley-Liss, Inc.
Mutations in the Norrie disease gene.
Schuback, D E; Chen, Z Y; Craig, I W; Breakefield, X O; Sims, K B
1995-01-01
We report our experience to date in mutation identification in the Norrie disease (ND) gene. We carried out mutational analysis in 26 kindreds in an attempt to identify regions presumed critical to protein function and potentially correlated with generation of the disease phenotype. All coding exons, as well as noncoding regions of exons 1 and 2, 636 nucleotides in the noncoding region of exon 3, and 197 nucleotides of 5' flanking sequence, were analyzed for single-strand conformation polymorphisms (SSCP) by polymerase chain reaction (PCR) amplification of genomic DNA. DNA fragments that showed altered SSCP band mobilities were sequenced to locate the specific mutations. In addition to three previously described submicroscopic deletions encompassing the entire ND gene, we have now identified 6 intragenic deletions, 8 missense (seven point mutations, one 9-bp deletion), 6 nonsense (three point mutations, three single bp deletions/frameshift) and one 10-bp insertion, creating an expanded repeat in the 5' noncoding region of exon 1. Thus, mutations have been identified in a total of 24 of 26 (92%) of the kindreds we have studied to date. With the exception of two different mutations, each found in two apparently unrelated kindreds, these mutations are unique and expand the genotype database. Localization of the majority of point mutations at or near cysteine residues, potentially critical in protein tertiary structure, supports a previous protein model for norrin as member of a cystine knot growth factor family (Meitinger et al., 1993). Genotype-phenotype correlations were not evident with the limited clinical data available, except in the cases of larger submicroscopic deletions associated with a more severe neurologic syndrome.(ABSTRACT TRUNCATED AT 250 WORDS)
A Dual Origin of the Xist Gene from a Protein-Coding Gene and a Set of Transposable Elements
Elisaphenko, Eugeny A.; Kolesnikov, Nikolay N.; Shevchenko, Alexander I.; Rogozin, Igor B.; Nesterova, Tatyana B.; Brockdorff, Neil; Zakian, Suren M.
2008-01-01
X-chromosome inactivation, which occurs in female eutherian mammals is controlled by a complex X-linked locus termed the X-inactivation center (XIC). Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons including those with simple tandem repeats detectable in their structure have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA. PMID:18575625
Genetic variations of VDR/NR1I1 encoding vitamin D receptor in a Japanese population.
Ukaji, Maho; Saito, Yoshiro; Fukushima-Uesaka, Hiromi; Maekawa, Keiko; Katori, Noriko; Kaniwa, Nahoko; Yoshida, Teruhiko; Nokihara, Hiroshi; Sekine, Ikuo; Kunitoh, Hideo; Ohe, Yuichiro; Yamamoto, Noboru; Tamura, Tomohide; Saijo, Nagahiro; Sawada, Jun-ichi
2007-12-01
The vitamin D receptor (VDR) is a transcriptional factor responsive to 1alpha,25-dihydroxyvitamin D(3) and lithocholic acid, and induces expression of drug metabolizing enzymes CYP3A4, CYP2B6 and CYP2C9. In this study, the promoter regions, 14 exons (including 6 exon 1's) and their flanking introns of VDR were comprehensively screened for genetic variations in 107 Japanese subjects. Sixty-one genetic variations including 25 novel ones were found: 9 in the 5'-flanking region, 2 in the 5'-untranslated region (UTR), 7 in the coding exons (5 synonymous and 2 nonsynonymous variations), 12 in the 3'-UTR, 19 in the introns between the exon 1's, and 12 in introns 2 to 8. Of these, one novel nonsynonymous variation, 154A>G (Met52Val), was detected with an allele frequency of 0.005. The single nucleotide polymorphisms (SNPs) that increase VDR expression or activity, -29649G>A, 2T>C and 1592((*)308)C>A tagging linked variations in the 3'-UTR, were detected at 0.430, 0.636, and 0.318 allele frequencies, respectively. Another SNP, -26930A>G, with reduced VDR transcription was found at a 0.028 frequency. These findings would be useful for association studies on VDR variations in Japanese.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ainsworth, P.J.; Coulter-Mackie, M.B.
1992-10-01
The B1 variant form of Tay-Sachs disease is enzymologically unique in that the causative mutation(s) appear to affect the active site in the [alpha] subunit of [beta]-hexosaminidase A without altering its ability to associate with the [beta] subunit. Most previously reported B1 variant mutations were found in exon 5 within codon 178. The coding sequence of the [alpha] subunit gene of a patient with the B1 variant form was examined with a combination of reverse transcription of mRNA to cDNA, PCR, and dideoxy sequencing. A double mutation in exon 6 has been identified: a G[sub 574][yields]C transversion causing a val[submore » 192][yields]leu change and a G[sub 598][yields] A transition resulting in a val[sub 200][yields]met alteration. The amplified cDNAs were otherwise normal throughout their sequence. The 574 and 598 alterations have been confirmed by amplification directly from genomic DNA from the patient and her mother. Transient-expression studies of the two exon 6 mutations (singly or together) in COS-1 cells show that the G[sub 574][yields]C change is sufficient to cause the loss of enzyme activity. The biochemical phenotype of the 574 alteration in transfection studies is consistent with that expected for a B1 variant mutation. As such, this mutation differs from previously reported B1 variant mutations, all of which occur in exon 5. 31 refs., 2 figs., 2 tabs.« less
Genes and proteins of urea transporters.
Sands, Jeff M; Blount, Mitsi A
2014-01-01
A urea transporter protein in the kidney was first proposed in 1987. The first urea transporter cDNA was cloned in 1993. The SLC14a urea transporter family contains two major subgroups: SLC14a1, the UT-B urea transporter originally isolated from erythrocytes; and SLC14a2, the UT-A group originally isolated from kidney inner medulla. Slc14a1, the human UT-B gene, arises from a single locus located on chromosome 18q12.1-q21.1, which is located close to Slc14a2. Slc14a1 includes 11 exons, with the coding region extending from exon 4 to exon 11, and is approximately 30 kb in length. The Slc14a2 gene is a very large gene with 24 exons, is approximately 300 kb in length, and encodes 6 different isoforms. Slc14a2 contains two promoter elements: promoter I is located in the typical position, upstream of exon 1, and drives the transcription of UT-A1, UT-A1b, UT-A3, UT-A3b, and UT-A4; while promoter II is located within intron 12 and drives the transcription of UT-A2 and UT-A2b. UT-A1 and UT-A3 are located in the inner medullary collecting duct, UT-A2 in the thin descending limb and liver, UT-A5 in testis, UT-A6 in colon, UT-B1 primarily in descending vasa recta and erythrocytes, and UT-B2 in rumen.
MutPred Splice: machine learning-based prediction of exonic variants that disrupt splicing
2014-01-01
We have developed a novel machine-learning approach, MutPred Splice, for the identification of coding region substitutions that disrupt pre-mRNA splicing. Applying MutPred Splice to human disease-causing exonic mutations suggests that 16% of mutations causing inherited disease and 10 to 14% of somatic mutations in cancer may disrupt pre-mRNA splicing. For inherited disease, the main mechanism responsible for the splicing defect is splice site loss, whereas for cancer the predominant mechanism of splicing disruption is predicted to be exon skipping via loss of exonic splicing enhancers or gain of exonic splicing silencer elements. MutPred Splice is available at http://mutdb.org/mutpredsplice. PMID:24451234
Douzery, Emmanuel J P; Scornavacca, Celine; Romiguier, Jonathan; Belkhir, Khalid; Galtier, Nicolas; Delsuc, Frédéric; Ranwez, Vincent
2014-07-01
Comparative genomic studies extensively rely on alignments of orthologous sequences. Yet, selecting, gathering, and aligning orthologous exons and protein-coding sequences (CDS) that are relevant for a given evolutionary analysis can be a difficult and time-consuming task. In this context, we developed OrthoMaM, a database of ORTHOlogous MAmmalian Markers describing the evolutionary dynamics of orthologous genes in mammalian genomes using a phylogenetic framework. Since its first release in 2007, OrthoMaM has regularly evolved, not only to include newly available genomes but also to incorporate up-to-date software in its analytic pipeline. This eighth release integrates the 40 complete mammalian genomes available in Ensembl v73 and provides alignments, phylogenies, evolutionary descriptor information, and functional annotations for 13,404 single-copy orthologous CDS and 6,953 long exons. The graphical interface allows to easily explore OrthoMaM to identify markers with specific characteristics (e.g., taxa availability, alignment size, %G+C, evolutionary rate, chromosome location). It hence provides an efficient solution to sample preprocessed markers adapted to user-specific needs. OrthoMaM has proven to be a valuable resource for researchers interested in mammalian phylogenomics, evolutionary genomics, and has served as a source of benchmark empirical data sets in several methodological studies. OrthoMaM is available for browsing, query and complete or filtered downloads at http://www.orthomam.univ-montp2.fr/. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Lavenu, A; Pistoi, S; Pournin, S; Babinet, C; Morello, D
1995-01-01
In vivo, the steady-state level of c-myc mRNA is mainly controlled by posttranscriptional mechanisms. Using a panel of transgenic mice in which various versions of the human c-myc proto-oncogene were under the control of major histocompatibility complex H-2Kb class I regulatory sequences, we have shown that the 5' and the 3' noncoding sequences are dispensable for obtaining a regulated expression of the transgene in adult quiescent tissues, at the start of liver regeneration, and after inhibition of protein synthesis. These results indicated that the coding sequences were sufficient to ensure a regulated c-myc expression. In the present study, we have pursued this analysis with transgenes containing one or the other of the two c-myc coding exons either alone or in association with the c-myc 3' untranslated region. We demonstrate that each of the exons contains determinants which control c-myc mRNA expression. Moreover, we show that in the liver, c-myc exon 2 sequences are able to down-regulate an otherwise stable H-2K mRNA when embedded within it and to induce its transient accumulation after cycloheximide treatment and soon after liver ablation. Finally, the use of transgenes with different coding capacities has allowed us to postulate that the primary mRNA sequence itself and not c-Myc peptides is an important component of c-myc posttranscriptional regulation. PMID:7623834
Kim, Dong Seon; Hahn, Yoonsoo
2012-11-13
Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution.
Turco, Gina; Schnable, James C.; Pedersen, Brent; Freeling, Michael
2013-01-01
Conserved non-coding sequences (CNS) are islands of non-coding sequence that, like protein coding exons, show less divergence in sequence between related species than functionless DNA. Several CNSs have been demonstrated experimentally to function as cis-regulatory regions. However, the specific functions of most CNSs remain unknown. Previous searches for CNS in plants have either anchored on exons and only identified nearby sequences or required years of painstaking manual annotation. Here we present an open source tool that can accurately identify CNSs between any two related species with sequenced genomes, including both those immediately adjacent to exons and distal sequences separated by >12 kb of non-coding sequence. We have used this tool to characterize new motifs, associate CNSs with additional functions, and identify previously undetected genes encoding RNA and protein in the genomes of five grass species. We provide a list of 15,363 orthologous CNSs conserved across all grasses tested. We were also able to identify regulatory sequences present in the common ancestor of grasses that have been lost in one or more extant grass lineages. Lists of orthologous gene pairs and associated CNSs are provided for reference inbred lines of arabidopsis, Japonica rice, foxtail millet, sorghum, brachypodium, and maize. PMID:23874343
RBFOX and PTBP1 proteins regulate the alternative splicing of micro-exons in human brain transcripts
Sanchez-Pulido, Luis; Haerty, Wilfried
2015-01-01
Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein–protein interactions. PMID:25524026
Germ-line and somatic EPHA2 coding variants in lens aging and cataract.
Bennett, Thomas M; M'Hamdi, Oussama; Hejtmancik, J Fielding; Shiels, Alan
2017-01-01
Rare germ-line mutations in the coding regions of the human EPHA2 gene (EPHA2) have been associated with inherited forms of pediatric cataract, whereas, frequent, non-coding, single nucleotide variants (SNVs) have been associated with age-related cataract. Here we sought to determine if germ-line EPHA2 coding SNVs were associated with age-related cataract in a case-control DNA panel (> 50 years) and if somatic EPHA2 coding SNVs were associated with lens aging and/or cataract in a post-mortem lens DNA panel (> 48 years). Micro-fluidic PCR amplification followed by targeted amplicon (exon) next-generation (deep) sequencing of EPHA2 (17-exons) afforded high read-depth coverage (1000x) for > 82% of reads in the cataract case-control panel (161 cases, 64 controls) and > 70% of reads in the post-mortem lens panel (35 clear lens pairs, 22 cataract lens pairs). Novel and reference (known) missense SNVs in EPHA2 that were predicted in silico to be functionally damaging were found in both cases and controls from the age-related cataract panel at variant allele frequencies (VAFs) consistent with germ-line transmission (VAF > 20%). Similarly, both novel and reference missense SNVs in EPHA2 were found in the post-mortem lens panel at VAFs consistent with a somatic origin (VAF > 3%). The majority of SNVs found in the cataract case-control panel and post-mortem lens panel were transitions and many occurred at di-pyrimidine sites that are susceptible to ultraviolet (UV) radiation induced mutation. These data suggest that novel germ-line (blood) and somatic (lens) coding SNVs in EPHA2 that are predicted to be functionally deleterious occur in adults over 50 years of age. However, both types of EPHA2 coding variants were present at comparable levels in individuals with or without age-related cataract making simple genotype-phenotype correlations inconclusive.
Germ-line and somatic EPHA2 coding variants in lens aging and cataract
Bennett, Thomas M.; M’Hamdi, Oussama; Hejtmancik, J. Fielding
2017-01-01
Rare germ-line mutations in the coding regions of the human EPHA2 gene (EPHA2) have been associated with inherited forms of pediatric cataract, whereas, frequent, non-coding, single nucleotide variants (SNVs) have been associated with age-related cataract. Here we sought to determine if germ-line EPHA2 coding SNVs were associated with age-related cataract in a case-control DNA panel (> 50 years) and if somatic EPHA2 coding SNVs were associated with lens aging and/or cataract in a post-mortem lens DNA panel (> 48 years). Micro-fluidic PCR amplification followed by targeted amplicon (exon) next-generation (deep) sequencing of EPHA2 (17-exons) afforded high read-depth coverage (1000x) for > 82% of reads in the cataract case-control panel (161 cases, 64 controls) and > 70% of reads in the post-mortem lens panel (35 clear lens pairs, 22 cataract lens pairs). Novel and reference (known) missense SNVs in EPHA2 that were predicted in silico to be functionally damaging were found in both cases and controls from the age-related cataract panel at variant allele frequencies (VAFs) consistent with germ-line transmission (VAF > 20%). Similarly, both novel and reference missense SNVs in EPHA2 were found in the post-mortem lens panel at VAFs consistent with a somatic origin (VAF > 3%). The majority of SNVs found in the cataract case-control panel and post-mortem lens panel were transitions and many occurred at di-pyrimidine sites that are susceptible to ultraviolet (UV) radiation induced mutation. These data suggest that novel germ-line (blood) and somatic (lens) coding SNVs in EPHA2 that are predicted to be functionally deleterious occur in adults over 50 years of age. However, both types of EPHA2 coding variants were present at comparable levels in individuals with or without age-related cataract making simple genotype-phenotype correlations inconclusive. PMID:29267365
Genetic Variation Linked to Lung Cancer Survival in White Smokers | Center for Cancer Research
CCR investigators have discovered evidence that links lung cancer survival with genetic variations (called single nucleotide polymorphisms) in the MBL2 gene, a key player in innate immunity. The variations in the gene, which codes for a protein called the mannose-binding lectin, occur in its promoter region, where the RNA polymerase molecule binds to start transcription, and in the first exon that is responsible for the correct structure of MBL. The findings appear in the September 19, 2007, issue of the Journal of the National Cancer Institute.
Mahamdallie, Shazia; Ruark, Elise; Yost, Shawn; Ramsay, Emma; Uddin, Imran; Wylie, Harriett; Elliott, Anna; Strydom, Ann; Renwick, Anthony; Seal, Sheila; Rahman, Nazneen
2017-01-01
Detection of deletions and duplications of whole exons (exon CNVs) is a key requirement of genetic testing. Accurate detection of this variant type has proved very challenging in targeted next-generation sequencing (NGS) data, particularly if only a single exon is involved. Many different NGS exon CNV calling methods have been developed over the last five years. Such methods are usually evaluated using simulated and/or in-house data due to a lack of publicly-available datasets with orthogonally generated results. This hinders tool comparisons, transparency and reproducibility. To provide a community resource for assessment of exon CNV calling methods in targeted NGS data, we here present the ICR96 exon CNV validation series. The dataset includes high-quality sequencing data from a targeted NGS assay (the TruSight Cancer Panel) together with Multiplex Ligation-dependent Probe Amplification (MLPA) results for 96 independent samples. 66 samples contain at least one validated exon CNV and 30 samples have validated negative results for exon CNVs in 26 genes. The dataset includes 46 exon CNVs in BRCA1 , BRCA2 , TP53 , MLH1 , MSH2 , MSH6 , PMS2 , EPCAM or PTEN , giving excellent representation of the cancer predisposition genes most frequently tested in clinical practice. Moreover, the validated exon CNVs include 25 single exon CNVs, the most difficult type of exon CNV to detect. The FASTQ files for the ICR96 exon CNV validation series can be accessed through the European-Genome phenome Archive (EGA) under the accession number EGAS00001002428.
Third International Meeting on Esterases Reacting with Organophosphorus Compounds
1998-01-01
cassette for negative selection, 884 bp of ACHE including exon 1, 1.6 kb of a Neor gene cassette for positive selection, 5.2 kb of the ACHE Bam HI...fragment including exon 6, and 3 kb of Bluescript. Deletion of exons 2-5 removed 80% of the ACHE coding sequence. The gene targeting vector was...expression due to environmental influences on CYP3A4 and the presence or absence of CYP3A5 which may be under genetic control in man. Plasma
Structure and genomic organization of the human B1 receptor gene for kinins (BDKRB1).
Bachvarov, D R; Hess, J F; Menke, J G; Larrivée, J F; Marceau, F
1996-05-01
Two subtypes of mammalian bradykinin receptors, B1 and B2 (BDKRB1 and BDKRB2), have been defined based on their pharmacological properties. The B1 type kinin receptors have weak affinity for intact BK or Lys-BK but strong affinity for kinin metabolites without the C-terminal arginine (e.g., des-Arg9-BK and Lys-des-Arg9-BK, also called des-Arg10-kallidin), which are generated by kininase I. The B1 receptor expression is up-regulated following tissue injury and inflammation (hyperemia, exudation, hyperalgesia, etc.). In the present study, we have cloned and sequenced the gene encoding human B1 receptor from a human genomic library. The human B1 receptor gene contains three exons separated by two introns. The first and the second exon are noncoding, while the coding region and the 3'-flanking region are located entirely on the third exon. The exon-intron arrangement of the human B1 receptor gene shows significant similarity with the genes encoding the B2 receptor subtype in human, mouse, and rat. Sequence analysis of the 5'-flanking region revealed the presence of a consensus TATA box and of numerous candidate transcription factor binding sequences. Primer extension experiments have shown the existence of multiple transcription initiation sites situated downstream and upstream from the consensus TATA box. Genomic Southern blot analysis indicated that the human B1 receptor is encoded by a single-copy gene.
Splicing regulation and dysregulation of cholinergic genes expressed at the neuromuscular junction.
Ohno, Kinji; Rahman, Mohammad Alinoor; Nazim, Mohammad; Nasrin, Farhana; Lin, Yingni; Takeda, Jun-Ichi; Masuda, Akio
2017-08-01
We humans have evolved by acquiring diversity of alternative RNA metabolisms including alternative means of splicing and transcribing non-coding genes, and not by acquiring new coding genes. Tissue-specific and developmental stage-specific alternative RNA splicing is achieved by tightly regulated spatiotemporal regulation of expressions and activations of RNA-binding proteins that recognize their cognate splicing cis-elements on nascent RNA transcripts. Genes expressed at the neuromuscular junction are also alternatively spliced. In addition, germline mutations provoke aberrant splicing by compromising binding of RNA-binding proteins, and cause congenital myasthenic syndromes (CMS). We present physiological splicing mechanisms of genes for agrin (AGRN), acetylcholinesterase (ACHE), MuSK (MUSK), acetylcholine receptor (AChR) α1 subunit (CHRNA1), and collagen Q (COLQ) in human, and their aberration in diseases. Splicing isoforms of AChE T , AChE H , and AChE R are generated by hnRNP H/F. Skipping of MUSK exon 10 makes a Wnt-insensitive MuSK isoform, which is unique to human. Skipping of exon 10 is achieved by coordinated binding of hnRNP C, YB-1, and hnRNP L to exon 10. Exon P3A of CHRNA1 is alternatively included to generate a non-functional AChR α1 subunit in human. Molecular dissection of splicing mutations in patients with CMS reveals that exon P3A is alternatively skipped by hnRNP H, polypyrimidine tract-binding protein 1, and hnRNP L. Similarly, analysis of an exonic mutation in COLQ exon 16 in a CMS patient discloses that constitutive splicing of exon 16 requires binding of serine arginine-rich splicing factor 1. Intronic and exonic splicing mutations in CMS enable us to dissect molecular mechanisms underlying alternative and constitutive splicing of genes expressed at the neuromuscular junction. This is an article for the special issue XVth International Symposium on Cholinergic Mechanisms. © 2017 International Society for Neurochemistry.
Li, Yang I; Sanchez-Pulido, Luis; Haerty, Wilfried; Ponting, Chris P
2015-01-01
Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein-protein interactions. © 2015 Li et al.; Published by Cold Spring Harbor Laboratory Press.
Yin, Changchuan
2015-04-01
To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
2012-01-01
Background Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. Results We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Conclusions Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution. PMID:23148531
Abascal, Federico; Ezkurdia, Iakes; Rodriguez-Rivas, Juan; Rodriguez, Jose Manuel; del Pozo, Angela; Vázquez, Jesús; Valencia, Alfonso; Tress, Michael L.
2015-01-01
Alternative splicing of messenger RNA can generate a wide variety of mature RNA transcripts, and these transcripts may produce protein isoforms with diverse cellular functions. While there is much supporting evidence for the expression of alternative transcripts, the same is not true for the alternatively spliced protein products. Large-scale mass spectroscopy experiments have identified evidence of alternative splicing at the protein level, but with conflicting results. Here we carried out a rigorous analysis of the peptide evidence from eight large-scale proteomics experiments to assess the scale of alternative splicing that is detectable by high-resolution mass spectroscopy. We find fewer splice events than would be expected: we identified peptides for almost 64% of human protein coding genes, but detected just 282 splice events. This data suggests that most genes have a single dominant isoform at the protein level. Many of the alternative isoforms that we could identify were only subtly different from the main splice isoform. Very few of the splice events identified at the protein level disrupted functional domains, in stark contrast to the two thirds of splice events annotated in the human genome that would lead to the loss or damage of functional domains. The most striking result was that more than 20% of the splice isoforms we identified were generated by substituting one homologous exon for another. This is significantly more than would be expected from the frequency of these events in the genome. These homologous exon substitution events were remarkably conserved—all the homologous exons we identified evolved over 460 million years ago—and eight of the fourteen tissue-specific splice isoforms we identified were generated from homologous exons. The combination of proteomics evidence, ancient origin and tissue-specific splicing indicates that isoforms generated from homologous exons may have important cellular roles. PMID:26061177
Mamatha, Gandra; Umashankar, Vetrivel; Kasinathan, Nachiappan; Krishnan, Tandava; Sathyabaarathi, Ravichandran; Karthiyayini, Thirumalai; Amali, John; Rao, Chetan
2011-01-01
Purpose Bietti crystalline dystrophy (BCD) is an autosomal recessive disease characterized by intraretinal deposits of multiple small crystals, with or without associated crystal deposits in the cornea. The disease is caused by mutation in the cytochrome p450, family 4, subfamily v, polypeptide 2 (CYP4V2) gene. Choroidal neovascularization (CNV) is a rare event in BCD. We report two cases of BCD associated with CNV. CYP4V2 and exon 5 of tissue inhibitor of metalloproteinase 3 (TIMP3) were screened in both cases. A patient with BCD, but without CNV, was also screened to identify pathogenic variations. Methods Three BCD families of Asian Indian origin were recruited after a comprehensive ophthalmic examination. Genomic DNA was isolated from blood leukocytes, and coding exons and flanking introns of CYP4V2 and exon 5 of TIMP3 were amplified via polymerase chain reaction (PCR) and were sequenced. Family segregation, control screening, and bioinformatics tools were used to assess the pathogenicity of the novel variations. Results Of the three BCD patients, two had parafoveal CNV. The patient with BCD, but without CNV had novel single base-pair duplication (c.1062_1063dupA). This mutation results in a structurally defective and unstable protein with impaired protein function. Four novel benign variations (three in exons and one in an intron) were observed in the cohort. Screening of exon 5 of TIMP3 did not reveal any variation in these families. Conclusions A novel mutation was found in a patient with BCD but without CNV, while patients with BCD and CNV did not show any pathogenic variation. The modifier role of TIMP3 in the pathogenesis of CNV in BCD was partly ruled out, as no variation was observed in exon 5 of the gene. A larger BCD cohort with CNV needs to be studied and screened to understand the genetics of CNV in BCD. PMID:21850171
Boonstra, Pieter A; Ter Elst, Arja; Tibbesma, Marco; Bosman, Lisette J; Mathijssen, Ron; Atrafi, Florence; van Coevorden, Frits; Steeghs, Neeltje; Farag, Sheima; Gelderblom, Hans; van der Graaf, Winette T A; Desar, Ingrid M E; Maier, Jacqueline; Overbosch, Jelle; Suurmeijer, Albert J H; Gietema, Jourik; Schuuring, Ed; Reyners, Anna K L
2018-03-02
Gastrointestinal stromal tumors (GISTs) are characterized by oncogenic KIT mutations that cluster in two exon 11 hotspots. The aim of this study was to develop a single, sensitive, quantitative digital droplet PCR (ddPCR) assay for the detection of common exon 11 mutations in both GIST tumor tissue and in circulating tumor DNA (ctDNA) isolated from GIST patients' plasma. A ddPCR assay was designed using two probes that cover both hotspots. Available archival FFPE tumor tissue from 27 consecutive patients with known KIT exon 11 mutations and 9 randomly selected patients without exon 11 mutations were tested. Plasma samples were prospectively collected in a multicenter bio-databank from December 2014. ctDNA was analyzed of 22 patients with an exon 11 mutation and a baseline plasma sample. The ddPCR assay detected the exon 11 mutation in 21 of 22 tumors with exon 11 mutations covered by the assay. Mutations in ctDNA were detected at baseline in 13 of 14 metastasized patients, but in only 1 of 8 patients with localized disease. In serial plasma samples from 11 patients with metastasized GIST, a decrease in mutant droplets was detected during treatment. According to RECIST 1.1, 10 patients had radiological treatment response and one patient stable disease. A single ddPCR assay for the detection of multiple exon 11 mutations in ctDNA is a feasible, promising tool for monitoring treatment response in patients with metastasized GIST and should be further evaluated in a larger cohort.
Ullah, Inayat; Kabir, Firoz; Iqbal, Muhammad; Gottsch, Clare Brooks S.; Naeem, Muhammad Asif; Assir, Muhammad Zaman; Khan, Shaheen N.; Akram, Javed; Riazuddin, Sheikh; Ayyagari, Radha; Hejtmancik, J. Fielding
2016-01-01
Purpose To identify pathogenic mutations responsible for autosomal recessive retinitis pigmentosa (arRP) in consanguineous familial cases. Methods Seven large familial cases with multiple individuals diagnosed with retinitis pigmentosa were included in the study. Affected individuals in these families underwent ophthalmic examinations to document the symptoms and confirm the initial diagnosis. Blood samples were collected from all participating members, and genomic DNA was extracted. An exclusion analysis with microsatellite markers spanning the TULP1 locus on chromosome 6p was performed, and two-point logarithm of odds (LOD) scores were calculated. All coding exons along with the exon–intron boundaries of TULP1 were sequenced bidirectionally. We constructed a single nucleotide polymorphism (SNP) haplotype for the four familial cases harboring the K489R allele and estimated the likelihood of a founder effect. Results The ophthalmic examinations of the affected individuals in these familial cases were suggestive of RP. Exclusion analyses confirmed linkage to chromosome 6p harboring TULP1 with positive two-point LOD scores. Subsequent Sanger sequencing identified the single base pair substitution in exon14, c.1466A>G (p.K489R), in four families. Additionally, we identified a two-base deletion in exon 4, c.286_287delGA (p.E96Gfs77*); a homozygous splice site variant in intron 14, c.1495+4A>C; and a novel missense variation in exon 15, c.1561C>T (p.P521S). All mutations segregated with the disease phenotype in the respective families and were absent in ethnically matched control chromosomes. Haplotype analysis suggested (p<10−6) that affected individuals inherited the causal mutation from a common ancestor. Conclusions Pathogenic mutations in TULP1 are responsible for the RP phenotype in seven familial cases with a common ancestral mutation responsible for the disease phenotype in four of the seven families. PMID:27440997
Polymorphisms within the canine MLPH gene are associated with dilute coat color in dogs
Philipp, Ute; Hamann, Henning; Mecklenburg, Lars; Nishino, Seiji; Mignot, Emmanuel; Günzel-Apel, Anne-Rose; Schmutz, Sheila M; Leeb, Tosso
2005-01-01
Background Pinschers and other dogs with coat color dilution show a characteristic pigmentation phenotype. The fur colors are a lighter shade, e.g. silvery grey (blue) instead of black and a sandy color (Isabella fawn) instead of red or brown. In some dogs the coat color dilution is sometimes accompanied by hair loss and recurrent skin inflammation, the so called color dilution alopecia (CDA) or black hair follicular dysplasia (BHFD). In humans and mice a comparable pigmentation phenotype without any documented hair loss is caused by mutations within the melanophilin gene (MLPH). Results We sequenced the canine MLPH gene and performed a mutation analysis of the MLPH exons in 6 Doberman Pinschers and 5 German Pinschers. A total of 48 sequence variations was identified within and between the breeds. Three families of dogs showed co-segregation for at least one polymorphism in an MLPH exon and the dilute phenotype. No single polymorphism was identified in the coding sequences or at splice sites that is likely to be causative for the dilute phenotype of all dogs examined. In 18 German Pinschers a mutation in exon 7 (R199H) was consistently associated with the dilute phenotype. However, as this mutation was present in homozygous state in four dogs of other breeds with wildtype pigmentation, it seems unlikely that this mutation is truly causative for coat color dilution. In Doberman Pinschers as well as in Large Munsterlanders with BHFD, a set of single nucleotide polymorphisms (SNPs) around exon 2 was identified that show a highly significant association to the dilute phenotype. Conclusion This study provides evidence that coat color dilution is caused by one or more mutations within or near the MLPH gene in several dog breeds. The data on polymorphisms that are strongly associated with the dilute phenotype will allow the genetic testing of Pinschers to facilitate the breeding of dogs with defined coat colors and to select against Large Munsterlanders carrying BHFD. PMID:15960853
Polymorphisms within the canine MLPH gene are associated with dilute coat color in dogs.
Philipp, Ute; Hamann, Henning; Mecklenburg, Lars; Nishino, Seiji; Mignot, Emmanuel; Günzel-Apel, Anne-Rose; Schmutz, Sheila M; Leeb, Tosso
2005-06-16
Pinschers and other dogs with coat color dilution show a characteristic pigmentation phenotype. The fur colors are a lighter shade, e.g. silvery grey (blue) instead of black and a sandy color (Isabella fawn) instead of red or brown. In some dogs the coat color dilution is sometimes accompanied by hair loss and recurrent skin inflammation, the so called color dilution alopecia (CDA) or black hair follicular dysplasia (BHFD). In humans and mice a comparable pigmentation phenotype without any documented hair loss is caused by mutations within the melanophilin gene (MLPH). We sequenced the canine MLPH gene and performed a mutation analysis of the MLPH exons in 6 Doberman Pinschers and 5 German Pinschers. A total of 48 sequence variations was identified within and between the breeds. Three families of dogs showed co-segregation for at least one polymorphism in an MLPH exon and the dilute phenotype. No single polymorphism was identified in the coding sequences or at splice sites that is likely to be causative for the dilute phenotype of all dogs examined. In 18 German Pinschers a mutation in exon 7 (R199H) was consistently associated with the dilute phenotype. However, as this mutation was present in homozygous state in four dogs of other breeds with wildtype pigmentation, it seems unlikely that this mutation is truly causative for coat color dilution. In Doberman Pinschers as well as in Large Munsterlanders with BHFD, a set of single nucleotide polymorphisms (SNPs) around exon 2 was identified that show a highly significant association to the dilute phenotype. This study provides evidence that coat color dilution is caused by one or more mutations within or near the MLPH gene in several dog breeds. The data on polymorphisms that are strongly associated with the dilute phenotype will allow the genetic testing of Pinschers to facilitate the breeding of dogs with defined coat colors and to select against Large Munsterlanders carrying BHFD.
Novel mutations in the STK11 gene in Thai patients with Peutz-Jeghers syndrome
Ausavarat, Surasawadee; Leoyklang, Petcharat; Vejchapipat, Paisarn; Chongsrisawat, Voranush; Suphapeetiporn, Kanya; Shotelersuk, Vorasuk
2009-01-01
Peutz-Jeghers syndrome (PJS), a rare autosomal dominant inherited disorder, is characterized by hamartomatous gastrointestinal polyps and mucocutaneous pigmentation. Patients with this syndrome have a predisposition to a variety of cancers in multiple organs. Mutations in the serine/threonine kinase 11 (STK11) gene have been identified as a major cause of PJS. Here we present the clinical and molecular findings of two unrelated Thai individuals with PJS. Mutation analysis by Polymerase Chain Reaction-sequencing of the entire coding region of STK11 revealed two potentially pathogenic mutations. One harbored a single nucleotide deletion (c.182delG) in exon 1 resulting in a frameshift leading to premature termination at codon 63 (p.Gly61AlafsX63). The other carried an in-frame 9-base-pair (bp) deletion in exon 7, c.907_915del9 (p.Ile303_Gln305del). Both deletions were de novo and have never been previously described. This study has expanded the genotypic spectrum of the STK11 gene. PMID:19908348
Kapahnke, Marcel; Banning, Antje; Tikkanen, Ritva
2016-12-14
The clustered regularly interspaced short palindromic repeats (CRISPR)-associated sequence 9 (CRISPR/Cas9) system is widely used for genome editing purposes as it facilitates an efficient knockout of a specific gene in, e.g. cultured cells. Targeted double-strand breaks are introduced to the target sequence of the guide RNAs, which activates the cellular DNA repair mechanism for non-homologous-end-joining, resulting in unprecise repair and introduction of small deletions or insertions. Due to this, sequence alterations in the coding region of the target gene frequently cause frame-shift mutations, facilitating degradation of the mRNA. We here show that such CRISPR/Cas9-mediated alterations in the target exon may also result in altered splicing of the respective pre-mRNA, most likely due to mutations of splice-regulatory sequences. Using the human FLOT-1 gene as an example, we demonstrate that such altered splicing products also give rise to aberrant protein products. These may potentially function as dominant-negative proteins and thus interfere with the interpretation of the data generated with these cell lines. Since most researchers only control the consequences of CRISPR knockout at genomic and protein level, our data should encourage to also check the alterations at the mRNA level.
The functional spectrum of low-frequency coding variation.
Marth, Gabor T; Yu, Fuli; Indap, Amit R; Garimella, Kiran; Gravel, Simon; Leong, Wen Fung; Tyler-Smith, Chris; Bainbridge, Matthew; Blackwell, Tom; Zheng-Bradley, Xiangqun; Chen, Yuan; Challis, Danny; Clarke, Laura; Ball, Edward V; Cibulskis, Kristian; Cooper, David N; Fulton, Bob; Hartl, Chris; Koboldt, Dan; Muzny, Donna; Smith, Richard; Sougnez, Carrie; Stewart, Chip; Ward, Alistair; Yu, Jin; Xue, Yali; Altshuler, David; Bustamante, Carlos D; Clark, Andrew G; Daly, Mark; DePristo, Mark; Flicek, Paul; Gabriel, Stacey; Mardis, Elaine; Palotie, Aarno; Gibbs, Richard
2011-09-14
Rare coding variants constitute an important class of human genetic variation, but are underrepresented in current databases that are based on small population samples. Recent studies show that variants altering amino acid sequence and protein function are enriched at low variant allele frequency, 2 to 5%, but because of insufficient sample size it is not clear if the same trend holds for rare variants below 1% allele frequency. The 1000 Genomes Exon Pilot Project has collected deep-coverage exon-capture data in roughly 1,000 human genes, for nearly 700 samples. Although medical whole-exome projects are currently afoot, this is still the deepest reported sampling of a large number of human genes with next-generation technologies. According to the goals of the 1000 Genomes Project, we created effective informatics pipelines to process and analyze the data, and discovered 12,758 exonic SNPs, 70% of them novel, and 74% below 1% allele frequency in the seven population samples we examined. Our analysis confirms that coding variants below 1% allele frequency show increased population-specificity and are enriched for functional variants. This study represents a large step toward detecting and interpreting low frequency coding variation, clearly lays out technical steps for effective analysis of DNA capture data, and articulates functional and population properties of this important class of genetic variation.
Fayaz, Shima; Fard-Esfahani, Pezhman; Fard-Esfahani, Armaghan; Mostafavi, Ehsan; Meshkani, Reza; Mirmiranpour, Hossein; Khaghani, Shahnaz
2012-01-01
Homologous recombination (HR) is the major pathway for repairing double strand breaks (DSBs) in eukaryotes and XRCC2 is an essential component of the HR repair machinery. To evaluate the potential role of mutations in gene repair by HR in individuals susceptible to differentiated thyroid carcinoma (DTC) we used high resolution melting (HRM) analysis, a recently introduced method for detecting mutations, to examine the entire XRCC2 coding region in an Iranian population. HRM analysis was used to screen for mutations in three XRCC2 coding regions in 50 patients and 50 controls. There was no variation in the HRM curves obtained from the analysis of exons 1 and 2 in the case and control groups. In exon 3, an Arg188His polymorphism (rs3218536) was detected as a new melting curve group (OR: 1.46; 95%CI: 0.432–4.969; p = 0.38) compared with the normal melting curve. We also found a new Ser150Arg polymorphism in exon 3 of the control group. These findings suggest that genetic variations in the XRCC2 coding region have no potential effects on susceptibility to DTC. However, further studies with larger populations are required to confirm this conclusion. PMID:22481871
ERIC Educational Resources Information Center
Ressler, Kerry J.; Rattiner, Lisa M.; Davis, Michael
2004-01-01
Brain-derived neurotrophic factor (BDNF) has been implicated as a molecular mediator of learning and memory. The BDNF gene contains four differentially regulated promoters that generate four distinct mRNA transcripts, each containing a unique noncoding 5[prime]-exon and a common 3[prime]-coding exon. This study describes novel evidence for the…
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Gretchen H. Roffler; Stephen J. Amish; Seth Smith; Ted Cosart; Marty Kardos; Michael K. Schwartz; Gordon Luikart
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding...
Homozygous and hemizygous CNV detection from exome sequencing data in a Mendelian disease cohort
Gambin, Tomasz; Akdemir, Zeynep C.; Yuan, Bo; Gu, Shen; Chiang, Theodore; Carvalho, Claudia M.B.; Shaw, Chad; Jhangiani, Shalini; Boone, Philip M.; Eldomery, Mohammad K.; Karaca, Ender; Bayram, Yavuz; Stray-Pedersen, Asbjørg; Muzny, Donna; Charng, Wu-Lin; Bahrambeigi, Vahid; Belmont, John W.; Boerwinkle, Eric; Beaudet, Arthur L.; Gibbs, Richard A.
2017-01-01
Abstract We developed an algorithm, HMZDelFinder, that uses whole exome sequencing (WES) data to identify rare and intragenic homozygous and hemizygous (HMZ) deletions that may represent complete loss-of-function of the indicated gene. HMZDelFinder was applied to 4866 samples in the Baylor–Hopkins Center for Mendelian Genomics (BHCMG) cohort and detected 773 HMZ deletion calls (567 homozygous or 206 hemizygous) with an estimated sensitivity of 86.5% (82% for single-exonic and 88% for multi-exonic calls) and precision of 78% (53% single-exonic and 96% for multi-exonic calls). Out of 773 HMZDelFinder-detected deletion calls, 82 were subjected to array comparative genomic hybridization (aCGH) and/or breakpoint PCR and 64 were confirmed. These include 18 single-exon deletions out of which 8 were exclusively detected by HMZDelFinder and not by any of seven other CNV detection tools examined. Further investigation of the 64 validated deletion calls revealed at least 15 pathogenic HMZ deletions. Of those, 7 accounted for 17–50% of pathogenic CNVs in different disease cohorts where 7.1–11% of the molecular diagnosis solved rate was attributed to CNVs. In summary, we present an algorithm to detect rare, intragenic, single-exon deletion CNVs using WES data; this tool can be useful for disease gene discovery efforts and clinical WES analyses. PMID:27980096
Single-cut genome editing restores dystrophin expression in a new mouse model of muscular dystrophy
Amoasii, Leonela; Long, Chengzu; Li, Hui; Mireault, Alex A.; Shelton, John M.; Sanchez-Ortiz, Efrain; McAnally, John R.; Bhattacharyya, Samadrita; Schmidt, Florian; Grimm, Dirk; Hauschka, Stephen D.; Bassel-Duby, Rhonda; Olson, Eric N.
2017-01-01
Duchenne muscular dystrophy (DMD) is a severe, progressive muscle disease caused by mutations in the dystrophin gene. The majority of DMD mutations are deletions that prematurely terminate the dystrophin protein. Deletions of exon 50 of the dystrophin gene are among the most common single exon deletions causing DMD. Such mutations can be corrected by skipping exon 51, thereby restoring the dystrophin reading frame. Using clustered regularly interspaced short palindromic repeats/CRISPR-associated 9 (CRISPR/Cas9), we generated a DMD mouse model by deleting exon 50. These ΔEx50 mice displayed severe muscle dysfunction, which was corrected by systemic delivery of adeno-associated virus encoding CRISPR/Cas9 genome editing components. We optimized the method for dystrophin reading frame correction using a single guide RNA that created reframing mutations and allowed skipping of exon 51. In conjunction with muscle-specific expression of Cas9, this approach restored up to 90% of dystrophin protein expression throughout skeletal muscles and the heart of ΔEx50 mice. This method of permanently bypassing DMD mutations using a single cut in genomic DNA represents a step toward clinical correction of DMD mutations and potentially those of other neuromuscular disorders. PMID:29187645
Ala397Asp mutation of myosin VIIA gene segregating in a Spanish family with type-Ib Usher syndrome.
Espinós, C; Millán, J M; Sánchez, F; Beneyto, M; Nájera, C
1998-06-01
In the current study, 12 Spanish families affected by type-I Usher syndrome, that was previously linked to chromosome 11q, were screened for the presence of mutations in the N-terminal coding portion of the motor domain of the myosin VIIA gene by single-strand conformation polymorphism analysis of the first 14 exons. A mutation (Ala397Asp) segregating with the disease was identified, and several polymorphisms were also detected. It is presumed that the other USHIB mutations in these families could be located in the unscreened regions of the gene.
2004-10-01
digestion and cloned into pLoxpNeo upstream of the PGK-neomycin cassette. A 1.3 kb fragment with the first coding exon was amplified by Pfx polymerase ...introducing a XhoI site on the 5’-end of the amplification product. The PCR fragment was cloned blunt into the HinDIII(blunt) site 5’ of the single...functionality of each of the loxP sites was tested in AM-1 cells (Invitrogen) that express Cre recombinase . Step 2: Gene targeting in embryonic stem
Hutcheson, Kelly A; Paluru, Prasuna C; Bernstein, Steven L; Koh, Jamie; Rappaport, Eric F; Leach, Richard A; Young, Terri L
2005-07-14
Retinopathy of prematurity (ROP) is a leading cause of visual loss in the pediatric population. Mutations in the Norrie disease gene (NDP) are associated with heritable retinal vascular disorders, and have been found in a small subset of patients with severe retinopathy of prematurity. Varying rates of progression to threshold disease in different races may have a genetic basis, as recent studies suggest that the incidence of NDP mutations may vary in different groups. African Americans, for example, are less likely to develop severe degrees of ROP. We screened a large cohort of ethnically diverse patients for mutations in the entire NDP. A total of 143 subjects of different ethnic backgrounds were enrolled in the study. Fifty-four patients had severe ROP (Stage 3 or worse). Of these, 38 were threshold in at least one eye (with a mean gestational age of 26.1 weeks and mean birth weight of 788.4 g). There were 36 patients with mild or no ROP, 31 parents with no history of retinal disease or prematurity, and 22 wild type (normal) controls. There were 70 African American subjects, 55 Caucasians, and 18 of other races. Severe ROP was noted in 29 African American subjects, 17 Caucasians, and 8 of other races. Seven polymerase chain reaction primer pairs spanning the NDP were optimized for denaturing high performance liquid chromatography and direct sequencing. Three primer pairs covered the coding region, and the remaining four spanned the 3' and 5' untranslated regions (UTR). Six of 54 (11%) infants with severe ROP had polymorphisms in the NDP. Five of the infants were African American, and one was Caucasian. Two parents were heterozygous for the same polymorphism as their child. One parent-child pair had a single base pair (bp) insertion in the 3' UTR region. Another parent-child pair had two mutations: a 14 bp deletion in the 5' UTR region of exon 1 and a single nucleotide polymorphism in the 5' UTR region of exon 2. No coding region sequence changes were found. No polymorphisms were observed in infants with mild or no ROP, or in the wild type controls. Of the six sequence alterations found, five were novel nucleotide changes: One in the 5' UTR region of exon 2, and four in the 3' UTR region of exon 3. The extent of NDP polymorphisms in this large, racially diverse group of infants is moderate. NDP polymorphisms may play a role in the pathogenesis of ROP, but do not appear to be a major causative factor.
Ambigapathy, Ganesh; Zheng, Zhaoqing; Li, Wei; Keifer, Joyce
2013-01-01
Brain-derived neurotrophic factor (BDNF) has a diverse functional role and complex pattern of gene expression. Alternative splicing of mRNA transcripts leads to further diversity of mRNAs and protein isoforms. Here, we describe the regulation of BDNF mRNA transcripts in an in vitro model of eyeblink classical conditioning and a unique transcript that forms a functionally distinct truncated BDNF protein isoform. Nine different mRNA transcripts from the BDNF gene of the pond turtle Trachemys scripta elegans (tBDNF) are selectively regulated during classical conditioning: exon I mRNA transcripts show no change, exon II transcripts are downregulated, while exon III transcripts are upregulated. One unique transcript that codes from exon II, tBDNF2a, contains a 40 base pair deletion in the protein coding exon that generates a truncated tBDNF protein. The truncated transcript and protein are expressed in the naïve untrained state and are fully repressed during conditioning when full-length mature tBDNF is expressed, thereby having an alternate pattern of expression in conditioning. Truncated BDNF is not restricted to turtles as a truncated mRNA splice variant has been described for the human BDNF gene. Further studies are required to determine the ubiquity of truncated BDNF alternative splice variants across species and the mechanisms of regulation and function of this newly recognized BDNF protein.
Ambigapathy, Ganesh; Zheng, Zhaoqing; Li, Wei; Keifer, Joyce
2013-01-01
Brain-derived neurotrophic factor (BDNF) has a diverse functional role and complex pattern of gene expression. Alternative splicing of mRNA transcripts leads to further diversity of mRNAs and protein isoforms. Here, we describe the regulation of BDNF mRNA transcripts in an in vitro model of eyeblink classical conditioning and a unique transcript that forms a functionally distinct truncated BDNF protein isoform. Nine different mRNA transcripts from the BDNF gene of the pond turtle Trachemys scripta elegans (tBDNF) are selectively regulated during classical conditioning: exon I mRNA transcripts show no change, exon II transcripts are downregulated, while exon III transcripts are upregulated. One unique transcript that codes from exon II, tBDNF2a, contains a 40 base pair deletion in the protein coding exon that generates a truncated tBDNF protein. The truncated transcript and protein are expressed in the naïve untrained state and are fully repressed during conditioning when full-length mature tBDNF is expressed, thereby having an alternate pattern of expression in conditioning. Truncated BDNF is not restricted to turtles as a truncated mRNA splice variant has been described for the human BDNF gene. Further studies are required to determine the ubiquity of truncated BDNF alternative splice variants across species and the mechanisms of regulation and function of this newly recognized BDNF protein. PMID:23825634
Novel mutations of endothelin-B receptor gene in Pakistani patients with Waardenburg syndrome.
Jabeen, Raheela; Babar, Masroor Ellahi; Ahmad, Jamil; Awan, Ali Raza
2012-01-01
Mutations in EDNRB gene have been reported to cause Waardenburg-Shah syndrome (WS4) in humans. We investigated 17 patients with WS4 for identification of mutations in EDNRB gene using PCR and direct sequencing technique. Four genomic mutations were detected in four patients; a G to C transversion in codon 335 (S335C) in exon 5 and a transition of T to C in codon (S361L) in exon 5, a transition of A to G in codon 277 (L277L) in exon 4, a non coding transversion of T to A at -30 nucleotide position of exon 5. None of these mutations were found in controls. One of the patients harbored two novel mutations (S335C, S361L) in exon 5 and one in Intronic region (-30exon5 A>G). All of the mutations were homozygous and novel except the mutation observed in exon 4. In this study, we have identified 3 novel mutations in EDNRB gene associated with WS4 in Pakistani patients.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rittig, S.; Siggaard, C.; Pedersen, E.B.
1996-01-01
Familial neurohypophyseal diabetes insipidus (FNDI) is an autosomal dominant disorder characterized by progressive postnatal deficiency of arginine vasopressin as a result of mutation in the gene that encodes the hormone. To determine the extent of mutations in the coding region that produce the phenotype, we studied members of 17 unrelated kindreds with the disorder. We sequenced all 3 exons of the gene by using a rapid, direct dye-terminator method and found the causative mutation in each kindred. In four kindreds, the mutations were each identical to mutations described in other affected families. In the other 13 kindreds each mutation wasmore » unique. There were two missense mutations that altered the cleavage region of the signal peptide, seven missense mutations in exon 2, which codes for the conserved portion of the protein, one nonsense mutation in exon 2, and three nonsense mutations in exon 3. These findings, together with the clinical features of FNDI, suggest that each of the mutations exerts an effect by directing the production of a pre-prohormone that cannot be folded, processed, or degraded properly and eventually destroys vasopressinergic neurons. 63 refs., 5 figs., 6 tabs.« less
Rittig, S.; Robertson, G. L.; Siggaard, C.; Kovács, L.; Gregersen, N.; Nyborg, J.; Pedersen, E. B.
1996-01-01
Familial neurohypophyseal diabetes insipidus (FNDI) is an autosomal dominant disorder characterized by progressive postnatal deficiency of arginine vasopressin as a result of mutation in the gene that encodes the hormone. To determine the extent of mutations in the coding region that produce the phenotype, we studied members of 17 unrelated kindreds with the disorder. We sequenced all 3 exons of the gene by using a rapid, direct dye-terminator method and found the causative mutation in each kindred. In four kindreds, the mutations were each identical to mutations described in other affected families. In the other 13 kindreds each mutation was unique. There were two missense mutations that altered the cleavage region of the signal peptide, seven missense mutations in exon 2, which codes for the conserved portion of the protein, one nonsense mutation in exon 2, and three nonsense mutations in exon 3. These findings, together with the clinical features of FNDI, suggest that each of the mutations exerts an effect by directing the production of a pre-prohormone that cannot be folded, processed, or degraded properly and eventually destroys vasopressinergic neurons. Images Figure 3 PMID:8554046
Non-exomic and synonymous variants in ABCA4 are an important cause of Stargardt disease
Braun, Terry A.; Mullins, Robert F.; Wagner, Alex H.; Andorf, Jeaneen L.; Johnston, Rebecca M.; Bakall, Benjamin B.; Deluca, Adam P.; Fishman, Gerald A.; Lam, Byron L.; Weleber, Richard G.; Cideciyan, Artur V.; Jacobson, Samuel G.; Sheffield, Val C.; Tucker, Budd A.; Stone, Edwin M.
2013-01-01
Mutations in ABCA4 cause Stargardt disease and other blinding autosomal recessive retinal disorders. However, sequencing of the complete coding sequence in patients with clinical features of Stargardt disease sometimes fails to detect one or both mutations. For example, among 208 individuals with clear clinical evidence of ABCA4 disease ascertained at a single institution, 28 had only one disease-causing allele identified in the exons and splice junctions of the primary retinal transcript of the gene. Haplotype analysis of these 28 probands revealed 3 haplotypes shared among ten families, suggesting that 18 of the 28 missing alleles were rare enough to be present only once in the cohort. We hypothesized that mutations near rare alternate splice junctions in ABCA4 might cause disease by increasing the probability of mis-splicing at these sites. Next-generation sequencing of RNA extracted from human donor eyes revealed more than a dozen alternate exons that are occasionally incorporated into the ABCA4 transcript in normal human retina. We sequenced the genomic DNA containing 15 of these minor exons in the 28 one-allele subjects and observed five instances of two different variations in the splice signals of exon 36.1 that were not present in normal individuals (P < 10−6). Analysis of RNA obtained from the keratinocytes of patients with these mutations revealed the predicted alternate transcript. This study illustrates the utility of RNA sequence analysis of human donor tissue and patient-derived cell lines to identify mutations that would be undetectable by exome sequencing. PMID:23918662
Shabalina, Svetlana A.; Ogurtsov, Aleksey Y.; Spiridonov, Nikolay A.; Koonin, Eugene V.
2014-01-01
Alternative splicing (AS), alternative transcription initiation (ATI) and alternative transcription termination (ATT) create the extraordinary complexity of transcriptomes and make key contributions to the structural and functional diversity of mammalian proteomes. Analysis of mammalian genomic and transcriptomic data shows that contrary to the traditional view, the joint contribution of ATI and ATT to the transcriptome and proteome diversity is quantitatively greater than the contribution of AS. Although the mean numbers of protein-coding constitutive and alternative nucleotides in gene loci are nearly identical, their distribution along the transcripts is highly non-uniform. On average, coding exons in the variable 5′ and 3′ transcript ends that are created by ATI and ATT contain approximately four times more alternative nucleotides than core protein-coding regions that diversify exclusively via AS. Short upstream exons that encompass alternative 5′-untranslated regions and N-termini of proteins evolve under strong nucleotide-level selection whereas in 3′-terminal exons that encode protein C-termini, protein-level selection is significantly stronger. The groups of genes that are subject to ATI and ATT show major differences in biological roles, expression and selection patterns. PMID:24792168
Arcot Sadagopan, Karthikeyan; Battista, Robert; Keep, Rosanne B; Capasso, Jenina E; Levin, Alex V
2015-06-01
Leber congenital amaurosis (LCA) is most often an autosomal recessive disorder. We report a father and son with autosomal dominant LCA due to a mutation in the CRX gene. DNA screening using an allele specific assay of 90 of the most common LCA-causing variations in the coding sequences of AIPL1, CEP290, CRB1, CRX, GUCY2D, RDH12 and RPE65 was performed on the father. Automated DNA sequencing of his son examining exon 3 of the CRX gene was subsequently performed. Both father and son have a heterozygous single base pair deletion of an adenine at codon 153 in the coding sequence of the CRX gene resulting in a frameshift mutation. Mutations involving the CRX gene may demonstrate an autosomal dominant inheritance pattern for LCA.
Retterer, Kyle; Scuffins, Julie; Schmidt, Daniel; Lewis, Rachel; Pineda-Alvarez, Daniel; Stafford, Amanda; Schmidt, Lindsay; Warren, Stephanie; Gibellini, Federica; Kondakova, Anastasia; Blair, Amanda; Bale, Sherri; Matyakhina, Ludmila; Meck, Jeanne; Aradhya, Swaroop; Haverfield, Eden
2015-08-01
Detection of copy-number variation (CNV) is important for investigating many genetic disorders. Testing a large clinical cohort by array comparative genomic hybridization provides a deep perspective on the spectrum of pathogenic CNV. In this context, we describe a bioinformatics approach to extract CNV information from whole-exome sequencing and demonstrate its utility in clinical testing. Exon-focused arrays and whole-genome chromosomal microarray analysis were used to test 14,228 and 14,000 individuals, respectively. Based on these results, we developed an algorithm to detect deletions/duplications in whole-exome sequencing data and a novel whole-exome array. In the exon array cohort, we observed a positive detection rate of 2.4% (25 duplications, 318 deletions), of which 39% involved one or two exons. Chromosomal microarray analysis identified 3,345 CNVs affecting single genes (18%). We demonstrate that our whole-exome sequencing algorithm resolves CNVs of three or more exons. These results demonstrate the clinical utility of single-exon resolution in CNV assays. Our whole-exome sequencing algorithm approaches this resolution but is complemented by a whole-exome array to unambiguously identify intragenic CNVs and single-exon changes. These data illustrate the next advancements in CNV analysis through whole-exome sequencing and whole-exome array.Genet Med 17 8, 623-629.
Homozygous and hemizygous CNV detection from exome sequencing data in a Mendelian disease cohort.
Gambin, Tomasz; Akdemir, Zeynep C; Yuan, Bo; Gu, Shen; Chiang, Theodore; Carvalho, Claudia M B; Shaw, Chad; Jhangiani, Shalini; Boone, Philip M; Eldomery, Mohammad K; Karaca, Ender; Bayram, Yavuz; Stray-Pedersen, Asbjørg; Muzny, Donna; Charng, Wu-Lin; Bahrambeigi, Vahid; Belmont, John W; Boerwinkle, Eric; Beaudet, Arthur L; Gibbs, Richard A; Lupski, James R
2017-02-28
We developed an algorithm, HMZDelFinder, that uses whole exome sequencing (WES) data to identify rare and intragenic homozygous and hemizygous (HMZ) deletions that may represent complete loss-of-function of the indicated gene. HMZDelFinder was applied to 4866 samples in the Baylor-Hopkins Center for Mendelian Genomics (BHCMG) cohort and detected 773 HMZ deletion calls (567 homozygous or 206 hemizygous) with an estimated sensitivity of 86.5% (82% for single-exonic and 88% for multi-exonic calls) and precision of 78% (53% single-exonic and 96% for multi-exonic calls). Out of 773 HMZDelFinder-detected deletion calls, 82 were subjected to array comparative genomic hybridization (aCGH) and/or breakpoint PCR and 64 were confirmed. These include 18 single-exon deletions out of which 8 were exclusively detected by HMZDelFinder and not by any of seven other CNV detection tools examined. Further investigation of the 64 validated deletion calls revealed at least 15 pathogenic HMZ deletions. Of those, 7 accounted for 17-50% of pathogenic CNVs in different disease cohorts where 7.1-11% of the molecular diagnosis solved rate was attributed to CNVs. In summary, we present an algorithm to detect rare, intragenic, single-exon deletion CNVs using WES data; this tool can be useful for disease gene discovery efforts and clinical WES analyses. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
SURVEY AND SUMMARY: exon-intron organization of genes in the slime mold Physarum polycephalum.
Trzcinska-Danielewicz, J; Fronk, J
2000-09-15
The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon-intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon-intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon-intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3'-ends.
Huang, X Y; Yang, Q L; Yuan, J H; Gun, S B
2015-09-08
In this study, 290 Chinese native Yantai black pig piglets were investigated to identify gene polymorphisms, for haplotype reconstruction, and to determine the association between piglet diarrhea and swine leukocyte antigen (SLA) class II DQA exons 2, 3, and 4 by polymerase chain reaction-single stranded conformational polymorphism and cloning sequencing. The results showed that the 5, 8, and 7 genotypes were identified from SLA-DQA exon 2, 3, and 4, respectively, based on the single-stranded conformational polymorphism banding patterns and found a novel allele D in exon 2 and 2 novel mutational sites of allele C (c.4828T>C) and allele F (c.4617T>C) in exon 3. Polymorphism information content testing showed that exon 2 was moderately polymorphic and that exons-3 and -4 loci were highly polymorphic. The piglet diarrhea scores for genotypes AB (1.40 ± 0.14) and AC (1.54 ± 0.17) in exon 2, AA (1.22 ± 0.32), BC (1.72 ± 0.13), DD (1.67 ± 0.35), and CF (1.22 ± 0.45) in exon 3, and AD (2.35 ± 0.25) in exon 4 were significantly higher than those for the other genotypes (P ≤ 0.05) in DQA exons. There were 14 reconstructed haplotypes in the 3 exons from 290 individuals and Hap12 may be the diarrhea-resistant gene. Haplotype distribution was extremely uneven, and the SLA-DQA gene showed genetic linkage. In this study, we identified molecular genetic markers and provided a theoretical foundation for future pig anti-disease resistance breeding.
Nettore, I C; Desiderio, S; De Nisco, E; Cacace, V; Albano, L; Improda, N; Ungaro, P; Salerno, M; Colao, A; Macchia, P E
2018-06-01
Congenital hypothyroidism is a frequent disease occurring with an incidence of about 1/1500 newborns/year. In about 75% of the cases, CH is caused by alterations in thyroid morphogenesis, defined "thyroid dysgenesis" (TD). TD is generally a sporadic disease but in about 5% of the cases a genetic origin has been demonstrated. Previous studies indicate that Dnajc17 as a candidate modifier gene for hypothyroidism, since it is expressed in the thyroid bud, interacts with NKX2.1 and PAX8 and it has been associated to the hypothyroid phenotype in mice carrying a single Nkx2.1 and Pax8 genes (double heterozygous knock-out). The work evaluates the possible involvement of DNAJC17 in the pathogenesis of TD. High-resolution DNA melting analysis (HRM) and direct sequencing have been used to screen for mutations in the DNAJC17 coding sequence in 89 patients with TD. Two mutations have been identified in the coding sequence of DNAJC17 gene, one in exon 5 (c.350A>C; rs79709714) and one in exon 9 (c.610G>C; rs117485355). The last one is a rare variant, while the rs79709714 is a polymorphism. Both are present in databases and the frequency of the alleles is not different between TD patients and controls. DNAJC17 mutations are not frequently present in patients with TD.
Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K
2008-01-01
Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis. PMID:18954468
Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K
2008-10-28
The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis.
Reggiani, Claudio; Coppens, Sandra; Sekhara, Tayeb; Dimov, Ivan; Pichon, Bruno; Lufin, Nicolas; Addor, Marie-Claude; Belligni, Elga Fabia; Digilio, Maria Cristina; Faletra, Flavio; Ferrero, Giovanni Battista; Gerard, Marion; Isidor, Bertrand; Joss, Shelagh; Niel-Bütschi, Florence; Perrone, Maria Dolores; Petit, Florence; Renieri, Alessandra; Romana, Serge; Topa, Alexandra; Vermeesch, Joris Robert; Lenaerts, Tom; Casimir, Georges; Abramowicz, Marc; Bontempi, Gianluca; Vilain, Catheline; Deconinck, Nicolas; Smits, Guillaume
2017-07-19
Tissue-specific integrative omics has the potential to reveal new genic elements important for developmental disorders. Two pediatric patients with global developmental delay and intellectual disability phenotype underwent array-CGH genetic testing, both showing a partial deletion of the DLG2 gene. From independent human and murine omics datasets, we combined copy number variations, histone modifications, developmental tissue-specific regulation, and protein data to explore the molecular mechanism at play. Integrating genomics, transcriptomics, and epigenomics data, we describe two novel DLG2 promoters and coding first exons expressed in human fetal brain. Their murine conservation and protein-level evidence allowed us to produce new DLG2 gene models for human and mouse. These new genic elements are deleted in 90% of 29 patients (public and in-house) showing partial deletion of the DLG2 gene. The patients' clinical characteristics expand the neurodevelopmental phenotypic spectrum linked to DLG2 gene disruption to cognitive and behavioral categories. While protein-coding genes are regarded as well known, our work shows that integration of multiple omics datasets can unveil novel coding elements. From a clinical perspective, our work demonstrates that two new DLG2 promoters and exons are crucial for the neurodevelopmental phenotypes associated with this gene. In addition, our work brings evidence for the lack of cross-annotation in human versus mouse reference genomes and nucleotide versus protein databases.
Zhang, Li; Tang, Jun-Ling; Liang, Shang-Zheng
2008-06-01
Muscle segment homeobox gene (MSX)1 has been proposed as a gene in which mutations may contribute to nonsyndromic cleft lip with or without cleft palate (NSCL/P). To study MSX1 polymorphisms in NSCL/ P by means of polymerase chain reaction-single-strand conformation polymorphism (PCR-SSCP), and investigate the association of MSX1 exons 1 polymorphisms with NSCL/P. DNA were extracted from blood samples from NSCL/P and unrelated normal subjects. Genome DNA from peripheral leukocyte with these blood samples were extracted, which was used as template to amplify desired gene fragment of MSX1 exons 1 by means of polymerase chain reaction (PCR). The PCR products were examined by single-strand conformation polymorphism (SSCP). The MSX1 exons 1 polymorphisms were examined by sequencing if mutations were found. MSX1 genes of exon 1 mutation was not been found in the NSCL/P and unrelated normal subjects by SSCP. No correlation between MSX1 exon 1 and NSCL/P was found. MSX1 exon 1 may not be a key gene (susceptibility gene) in NSCL/P.
Implication of LRRC4C and DPP6 in neurodevelopmental disorders
Maussion, Gilles; Cruceanu, Cristiana; Rosenfeld, Jill A.; Bell, Scott C.; Jollant, Fabrice; Szatkiewicz, Jin; Collins, Ryan L.; Hanscom, Carrie; Kolobova, Ilaria; de Champfleur, Nicolas Menjot; Blumenthal, Ian; Chiang, Colby; Ota, Vanessa; Hultman, Christina; O’Dushlaine, Colm; McCarroll, Steve; Alda, Martin; Jacquemont, Sebastien; Ordulu, Zehra; Marshall, Christian R.; Carter, Melissa T.; Shaffer, Lisa G.; Sklar, Pamela; Girirajan, Santhosh; Morton, Cynthia C.; Gusella, James F.; Turecki, Gustavo; Stavropoulos, D. J.; Sullivan, Patrick F.; Scherer, Stephen W.; Talkowski, Michael E.; Ernst, Carl
2018-01-01
We performed whole-genome sequencing on an individual from a family with variable psychiatric phenotypes that had a sensory processing disorder, apraxia, and autism. The proband harbored a maternally inherited balanced translocation (46,XY,t(11;14)(p12;p12)mat) that disrupted LRRC4C, a member of the highly specialized netrin G family of axon guidance molecules. The proband also inherited a paternally derived chromosomal inversion that disrupted DPP6, a potassium channel interacting protein. Copy Number (CN) analysis in 14,077 cases with neurodevelopmental disorders and 8,960 control subjects revealed that 60% of cases with exonic deletions in LRRC4C had a second clinically recognizable syndrome associated with variable clinical phenotypes, including 16p11.2, 1q44, and 2q33.1 CN syndromes, suggesting LRRC4C deletion variants may be modifiers of neurodevelopmental disorders. In vitro, functional assessments modeling patient deletions in LRRC4C suggest a negative regulatory role of these exons found in the untranslated region of LRRC4C, which has a single, terminal coding exon. These data suggest that the proband’s autism may be due to the inheritance of disruptions in both DPP6 and LRRC4C, and may highlight the importance of the netrin G family and potassium channel interacting molecules in neurodevelopmental disorders. PMID:27759917
Fernández-Cancio, Mónica; Nistal, Manuel; Gracia, Ricardo; Molina, M Antonia; Tovar, Juan Antonio; Esteban, Cristina; Carrascosa, Antonio; Audí, Laura
2004-01-01
The goal of this study was to perform 5-alpha-reductase type 2 gene (SRD5A2) analysis in a male pseudohermaphrodite (MPH) patient with normal testosterone (T) production and normal androgen receptor (AR) gene coding sequences. A patient of Chinese origin with ambiguous genitalia at 14 months, a 46,XY karyotype, and normal T secretion under human chorionic gonadotropin (hCG) stimulation underwent a gonadectomy at 20 months. Exons 1-8 of the AR gene and exons 1-5 of the SRD5A2 gene were sequenced from peripheral blood DNA. AR gene coding sequences were normal. SRD5A2 gene analysis revealed 2 consecutive mutations in exon 4, each located in a different allele: 1) a T nucleotide deletion, which predicts a frameshift mutation from codon 219, and 2) a missense mutation at codon 227, where the substitution of guanine (CGA) by adenine (CAA) predicts a glutamine replacement of arginine (R227Q). Testes located in the inguinal canal showed a normal morphology for age. The patient was a compound heterozygote for SRD5A2 mutations, carrying 2 mutations in exon 4. The patient showed an R227Q mutation that has been described in an Asian population and MPH patients, along with a novel frameshift mutation, Tdel219. Testis morphology showed that, during early infancy, the 5-alpha-reductase enzyme deficiency may not have affected interstitial or tubular development.
Reengineering a transmembrane protein to treat muscular dystrophy using exon skipping.
Gao, Quan Q; Wyatt, Eugene; Goldstein, Jeff A; LoPresti, Peter; Castillo, Lisa M; Gazda, Alec; Petrossian, Natalie; Earley, Judy U; Hadhazy, Michele; Barefield, David Y; Demonbreun, Alexis R; Bönnemann, Carsten; Wolf, Matthew; McNally, Elizabeth M
2015-11-02
Exon skipping uses antisense oligonucleotides as a treatment for genetic diseases. The antisense oligonucleotides used for exon skipping are designed to bypass premature stop codons in the target RNA and restore reading frame disruption. Exon skipping is currently being tested in humans with dystrophin gene mutations who have Duchenne muscular dystrophy. For Duchenne muscular dystrophy, the rationale for exon skipping derived from observations in patients with naturally occurring dystrophin gene mutations that generated internally deleted but partially functional dystrophin proteins. We have now expanded the potential for exon skipping by testing whether an internal, in-frame truncation of a transmembrane protein γ-sarcoglycan is functional. We generated an internally truncated γ-sarcoglycan protein that we have termed Mini-Gamma by deleting a large portion of the extracellular domain. Mini-Gamma provided functional and pathological benefits to correct the loss of γ-sarcoglycan in a Drosophila model, in heterologous cell expression studies, and in transgenic mice lacking γ-sarcoglycan. We generated a cellular model of human muscle disease and showed that multiple exon skipping could be induced in RNA that encodes a mutant human γ-sarcoglycan. Since Mini-Gamma represents removal of 4 of the 7 coding exons in γ-sarcoglycan, this approach provides a viable strategy to treat the majority of patients with γ-sarcoglycan gene mutations.
SEASTAR: systematic evaluation of alternative transcription start sites in RNA.
Qin, Zhiyi; Stoilov, Peter; Zhang, Xuegong; Xing, Yi
2018-05-04
Alternative first exons diversify the transcriptomes of eukaryotes by producing variants of the 5' Untranslated Regions (5'UTRs) and N-terminal coding sequences. Accurate transcriptome-wide detection of alternative first exons typically requires specialized experimental approaches that are designed to identify the 5' ends of transcripts. We developed a computational pipeline SEASTAR that identifies first exons from RNA-seq data alone then quantifies and compares alternative first exon usage across multiple biological conditions. The exons inferred by SEASTAR coincide with transcription start sites identified directly by CAGE experiments and bear epigenetic hallmarks of active promoters. To determine if differential usage of alternative first exons can yield insights into the mechanism controlling gene expression, we applied SEASTAR to an RNA-seq dataset that tracked the reprogramming of mouse fibroblasts into induced pluripotent stem cells. We observed dynamic temporal changes in the usage of alternative first exons, along with correlated changes in transcription factor expression. Using a combined sequence motif and gene set enrichment analysis we identified N-Myc as a regulator of alternative first exon usage in the pluripotent state. Our results demonstrate that SEASTAR can leverage the available RNA-seq data to gain insights into the control of gene expression and alternative transcript variation in eukaryotic transcriptomes.
Reengineering a transmembrane protein to treat muscular dystrophy using exon skipping
Gao, Quan Q.; Wyatt, Eugene; Goldstein, Jeff A.; LoPresti, Peter; Castillo, Lisa M.; Gazda, Alec; Petrossian, Natalie; Earley, Judy U.; Hadhazy, Michele; Barefield, David Y.; Demonbreun, Alexis R.; Bönnemann, Carsten; Wolf, Matthew; McNally, Elizabeth M.
2015-01-01
Exon skipping uses antisense oligonucleotides as a treatment for genetic diseases. The antisense oligonucleotides used for exon skipping are designed to bypass premature stop codons in the target RNA and restore reading frame disruption. Exon skipping is currently being tested in humans with dystrophin gene mutations who have Duchenne muscular dystrophy. For Duchenne muscular dystrophy, the rationale for exon skipping derived from observations in patients with naturally occurring dystrophin gene mutations that generated internally deleted but partially functional dystrophin proteins. We have now expanded the potential for exon skipping by testing whether an internal, in-frame truncation of a transmembrane protein γ-sarcoglycan is functional. We generated an internally truncated γ-sarcoglycan protein that we have termed Mini-Gamma by deleting a large portion of the extracellular domain. Mini-Gamma provided functional and pathological benefits to correct the loss of γ-sarcoglycan in a Drosophila model, in heterologous cell expression studies, and in transgenic mice lacking γ-sarcoglycan. We generated a cellular model of human muscle disease and showed that multiple exon skipping could be induced in RNA that encodes a mutant human γ-sarcoglycan. Since Mini-Gamma represents removal of 4 of the 7 coding exons in γ-sarcoglycan, this approach provides a viable strategy to treat the majority of patients with γ-sarcoglycan gene mutations. PMID:26457733
An RRM–ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion
Collins, Katherine M.; Kainov, Yaroslav A.; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A.
2017-01-01
Abstract RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1–ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. PMID:28379442
An RRM-ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion.
Collins, Katherine M; Kainov, Yaroslav A; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A; Makeyev, Eugene V; Ramos, Andres
2017-06-20
RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1-ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Unusual Intron Conservation near Tissue-Regulated Exons Found by Splicing Microarrays
Sugnet, Charles W; Srinivasan, Karpagam; Clark, Tyson A; O'Brien, Georgeann; Cline, Melissa S; Wang, Hui; Williams, Alan; Kulp, David; Blume, John E; Haussler, David; Ares, Manuel
2006-01-01
Alternative splicing contributes to both gene regulation and protein diversity. To discover broad relationships between regulation of alternative splicing and sequence conservation, we applied a systems approach, using oligonucleotide microarrays designed to capture splicing information across the mouse genome. In a set of 22 adult tissues, we observe differential expression of RNA containing at least two alternative splice junctions for about 40% of the 6,216 alternative events we could detect. Statistical comparisons identify 171 cassette exons whose inclusion or skipping is different in brain relative to other tissues and another 28 exons whose splicing is different in muscle. A subset of these exons is associated with unusual blocks of intron sequence whose conservation in vertebrates rivals that of protein-coding exons. By focusing on sets of exons with similar regulatory patterns, we have identified new sequence motifs implicated in brain and muscle splicing regulation. Of note is a motif that is strikingly similar to the branchpoint consensus but is located downstream of the 5′ splice site of exons included in muscle. Analysis of three paralogous membrane-associated guanylate kinase genes reveals that each contains a paralogous tissue-regulated exon with a similar tissue inclusion pattern. While the intron sequences flanking these exons remain highly conserved among mammalian orthologs, the paralogous flanking intron sequences have diverged considerably, suggesting unusually complex evolution of the regulation of alternative splicing in multigene families. PMID:16424921
Li, Chenhong; Riethoven, Jean-Jack M; Naylor, Gavin J P
2012-09-01
Recent innovations in next-generation sequencing have lowered the cost of genome projects. Nevertheless, sequencing entire genomes for all representatives in a study remains expensive and unnecessary for most studies in ecology, evolution and conservation. It is still more cost-effective and efficient to target and sequence single-copy nuclear gene markers for such studies. Many tools have been developed for identifying nuclear markers, but most of these have focused on particular taxonomic groups. We have built a searchable database, EvolMarkers, for developing single-copy coding sequence (CDS) and exon-primed-intron-crossing (EPIC) markers that is designed to work across a broad range of phylogenetic divergences. The database is made up of single-copy CDS derived from BLAST searches of a variety of metazoan genomes. Users can search the database for different types of markers (CDS or EPIC) that are common to different sets of input species with different divergence characteristics. EvolMarkers can be applied to any taxonomic group for which genome data are available for two or more species. We included 82 genomes in the first version of EvolMarkers and have found the methods to be effective across Placozoa, Cnidaria, Arthropod, Nematoda, Annelida, Mollusca, Echinodermata, Hemichordata, Chordata and plants. We demonstrate the effectiveness of searching for CDS markers within annelids and show how to find potentially useful intronic markers within the lizard Anolis. © 2012 Blackwell Publishing Ltd.
Kongchum, Pawapol; Hallerman, Eric M; Hulata, Gideon; David, Lior; Palti, Yniv
2011-01-01
Induction of innate immune pathways is critical for early host defense, but there is limited understanding of how teleost fishes recognize pathogen molecules and activate these pathways. In mammals, cells of the innate immune system detect pathogenic molecular structures using pattern recognition receptors (PRRs). TLR9 functions as a PRR that recognizes CpG motifs in bacterial and viral DNA and requires adaptor molecules MyD88 and TRAF6 for signal transduction. Here we report full-length cDNA isolation, structural characterization and tissue mRNA expression analysis of the common carp (cc) TLR9, MyD88 and TRAF6 gene orthologs. The ccTLR9 open-reading frame (ORF) is predicted to encode a 1064-amino acid (aa) protein. We found that MyD88 and TRAF6 genes are duplicated in common carp. This is the first report of TRAF6 duplication in a vertebrate genome and stronger evidence in support of MyD88 duplication is provided. The ccMyD88a and b ORFs are predicted to encode 288-aa and 284-aa peptides, respectively. They share 91% aa sequence identity between paralogs. The ccTRAF6a and b ORFs are both predicted to encode 543-aa peptides sharing 95% aa sequence identity between paralogs. The ccTLR9 gene is contained in a single large exon. The ccMyD88a and ccMyD88b coding sequences span five exons. The TRAF6b gene spans six exons. PCR amplification to obtain the entire coding sequence of ccTRAF6a gene was not successful. The 2104-bp fragment amplified covers the 3' end of the gene and it contains a partial sequence of one exon and three complete exons. The predicated protein domains of the ccTLR9, ccMyD88 and ccTRAF6 are conserved and resemble orthologs from other vertebrates. Real-time quantitative PCR assays of the ccTLR9, MyD88a and b, and TRAF6a and b gene transcripts in healthy common carp indicated that mRNA expression varied between tissues. Differential expression of duplicate copies were found for ccMyD88 and ccTRAF6 in white and red muscle tissues, suggesting that paralogs may have evolved and attained a new function. The genomic information we describe in this paper provides evidence of sequence and structural conservation of immune response genes in common carp. Published by Elsevier Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kerr, J.M.; Fisher, L.W.; Termine, J.D.
The authors have isolated and partially sequenced the human bone sialoprotein gene (IBSP). IBSP has been sublocalized by in situ hybridization to chromosome 4q38-q31 and is composed of six small exons (51 to 159 bp) and 1 large exon ([approximately]2.6 kb). The intron/exon junctions defined by sequence analysis are of class O, retaining an intact coding triplet. Sequence analysis of the 5[prime] upstream region revealed a TATAA (nucleotides -30 to-25 from the transcriptional start point) and a CCAAT (nucleotides -56 to-52) box, both in the reverse orientation. Intron 1 contains interesting structural elements composed of polypyrimidine repeats followed by amore » poly(AC)[sub n] tract. Both types of structural elements have been detected in promoter regions of other genes and have been implicated in transcriptional regulation. Several differences between the previously published cDNA sequence and the authors' sequence have been identified, most of which are contained within the untranslated exon 1. Three base revisions in the coding region include a G to T (Gly to Val, amino acid 195), T to C (Val to Ala, amino acid 268), and T to A (Glu to Asp, amino acid 270). In conclusion, the genomic organization and potential regulatory elements of human IBSP have been elucidated. 42 refs., 4 figs., 1 tab.« less
Novel folliculin (FLCN) mutation and familial spontaneous pneumothorax.
Zhu, J-F; Shen, X-Q; Zhu, F; Tian, L
2017-01-01
Familial spontaneous pneumothorax is one of the characteristics of Birt-Hogg-Dubé syndrome (BHDS), which is an autosomal dominant disease caused by the mutation of folliculin (FLCN). To investigate the mutation of FLCN gene in a familial spontaneous pneumothorax. Prospective case study. Clinical and genetic data of a Chinese family with four patients who presented spontaneous pneumothorax in the absence of skin lesions or renal tumors were collected. CT scan of patient's lung was applied for observation of pneumothorax. DNA sequencing of the coding exons (4-14 exons) of FLCN was performed for all 11 members of the family and 100 unrelated healthy controls. CT scan of patient's lung showed spontaneous pneumothorax. A mutation (c. 510C > G) that leads to a premature stop codon (p. Y170X) was found in the proband using DNA sequencing of coding exons (4-14 exons) of FLCN. This mutation was also observed in the other affected members of the family. A nonsense mutation of FLCN was found in a spontaneous pneumothorax family. Our results expand the mutational spectrum of FLCN in patients with BHDS. © The Author 2016. Published by Oxford University Press on behalf of the Association of Physicians. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
A novel mutation in SCN9A in a child with congenital insensitivity to pain.
Shorer, Zamir; Wajsbrot, Einav; Liran, Tamir-Hostovsky; Levy, Jacov; Parvari, Ruti
2014-01-01
[corrected] Congenital insensitivity to pain (CIP) is a rare condition in which patients have no pain perception and anosmia but are otherwise essentially normal (OMIM 243000). The recent discovery of the genetic defects underlying 3 monogenic pain disorders has provided additional and important insights about some components of human pain. Genetic studies in families demonstrating recessively inherited channelopathy-associated insensitivity to pain have identified nonsense mutations that result in truncation of the voltage-gated sodium channel type IX subunit (SCN9A), a 113.5-kb gene comprising coding 26 exons. Here we describe a patient with CIP with a new mutation in SCN9A not described yet. All exons were sequenced. All 26 coding exons were sequenced and two changes were identified in homozygosity in exon 10: c.1126 A > C causing K376Q and c.1124delG causing p.G375Afs* frame shift. We report a novel, loss-of-function mutation in homozygosity that causes congenital insensitivity to pain and provide a comprehensive clinical description of the patient. This contributes to the clinical and neurophysiological characteristic of the sodium channel Nav1.7 channelopathy and expand our genetic knowledge which might provide more accurate and comprehensive clinical electrophysiological and genetic information. Copyright © 2014 Elsevier Inc. All rights reserved.
Bedeschi, Maria Francesca; Marangi, Giuseppe; Calvello, Maria Rosaria; Ricciardi, Stefania; Leone, Francesca Pia Chiara; Baccarin, Marco; Guerneri, Silvana; Orteschi, Daniela; Murdolo, Marina; Lattante, Serena; Frangella, Silvia; Keena, Beth; Harr, Margaret H; Zackai, Elaine; Zollino, Marcella
2017-11-01
Pitt-Hopkins syndrome is a neurodevelopmental disorder characterized by severe intellectual disability and a distinctive facial gestalt. It is caused by haploinsufficiency of the TCF4 gene. The TCF4 protein has different functional domains, with the NLS (nuclear localization signal) domain coded by exons 7-8 and the bHLH (basic Helix-Loop-Helix) domain coded by exon 18. Several alternatively spliced TCF4 variants have been described, allowing for translation of variable protein isoforms. Typical PTHS patients have impairment of at least the bHLH domain. To which extent impairment of the remaining domains contributes to the final phenotype is not clear. There is recent evidence that certain loss-of-function variants disrupting TCF4 are associated with mild ID, but not with typical PTHS. We describe a frameshift-causing partial gene deletion encompassing exons 4-6 of TCF4 in an adult patient with mild ID and nonspecific facial dysmorphisms but without the typical features of PTHS, and a c.520C > T nonsense variant within exon 8 in a child presenting with a severe phenotype largely mimicking PTHS, but lacking the typical facial dysmorphism. Investigation on mRNA, along with literature review, led us to suggest a preliminary phenotypic map of loss-of-function variants affecting TCF4. An intragenic phenotypic map of loss-of-function variants in TCF4 is suggested here for the first time: variants within exons 1-4 and exons 4-6 give rise to a recurrent phenotype with mild ID not in the spectrum of Pitt-Hopkins syndrome (biallelic preservation of both the NLS and bHLH domains); variants within exons 7-8 cause a severe phenotype resembling PTHS but in absence of the typical facial dysmorphism (impairment limited to the NLS domain); variants within exons 9-19 cause typical Pitt-Hopkins syndrome (impairment of at least the bHLH domain). Understanding the TCF4 molecular syndromology can allow for proper nosology in the current era of whole genomic investigations. Copyright © 2017. Published by Elsevier Masson SAS.
Adenylosuccinate lyase (ADSL) and infantile autism: Absence of previously reported point mutation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fon, E.A.; Sarrazin, J.; Rouleau, G.A.
Autism is a heterogeneous neuropsychiatric syndrome of unknown etiology. There is evidence that a deficiency in the enzyme adenylosuccinate lyase (ADSL), essential for de novo purine biosynthesis, could be involved in the pathogenesis of certain cases. A point mutation in the ADSL gene, resulting in a predicted serine-to-proline substitution and conferring structural instability to the mutant enzyme, has been reported previously in 3 affected siblings. In order to determine the prevalence of the mutation, we PCR-amplified the exon spanning the site of this mutation from the genomic DNA of patients fulfilling DSM-III-R criteria for autistic disorder. None of the 119more » patients tested were found to have this mutation. Furthermore, on preliminary screening using single-strand conformation polymorphism (SSCP), no novel mutations were detected in the coding sequence of four ADSL exons, spanning approximately 50% of the cDNA. In light of these findings, it appears that mutations in the ADSL gene represent a distinctly uncommon cause of autism. 12 refs., 2 figs.« less
Laitinen, Eeva-Maria; Tommiska, Johanna; Virtanen, Helena E; Oehlandt, Heidi; Koivu, Rosanna; Vaaralahti, Kirsi; Toppari, Jorma; Raivio, Taneli
2011-07-20
Mutations in FGFR1, GNRHR, PROK2, PROKR2, TAC3, or TACR3 underlie isolated hypogonadotropic hypogonadism (IHH) with clinically variable phenotypes, and, by causing incomplete intrauterine activation of the hypothalamic-pituitary-gonadal axis, may lead to cryptorchidism. To investigate the role of defects in these genes in the etiology of isolated cryptorchidism, we screened coding exons and exon-intron boundaries of these genes in 54 boys or men from 46 families with a history of cryptorchidism. Control subjects (200) included 120 males. None of the patients carried mutation(s) in FGFR1, PROK2, PROKR2, TAC3 or TACR3. Two of the 46 index subjects with unilateral cryptorchidism were heterozygous carriers of a single GNRHR mutation (Q106R or R262Q), also present in male controls with a similar frequency (3/120; p=0.62). No homozygous or compound heterozygous GNRHR mutations were found. In conclusion, cryptorchidism is not commonly caused by defects in genes involved in IHH. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Dynamic ASXL1 Exon Skipping and Alternative Circular Splicing in Single Human Cells
Natarajan, Sivaraman; Carter, Robert; Brown, Patrick O.
2016-01-01
Circular RNAs comprise a poorly understood new class of noncoding RNA. In this study, we used a combination of targeted deletion, high-resolution splicing detection, and single-cell sequencing to deeply probe ASXL1 circular splicing. We found that efficient circular splicing required the canonical transcriptional start site and inverted AluSx elements. Sequencing-based interrogation of isoforms after ASXL1 overexpression identified promiscuous linear splicing between all exons, with the two most abundant non-canonical linear products skipping the exons that produced the circular isoforms. Single-cell sequencing revealed a strong preference for either the linear or circular ASXL1 isoforms in each cell, and found the predominant exon skipping product is frequently co-expressed with its reciprocal circular isoform. Finally, absolute quantification of ASXL1 isoforms confirmed our findings and suggests that standard methods overestimate circRNA abundance. Taken together, these data reveal a dynamic new view of circRNA genesis, providing additional framework for studying their roles in cellular biology. PMID:27736885
Using the NCBI Genome Databases to Compare the Genes for Human & Chimpanzee Beta Hemoglobin
ERIC Educational Resources Information Center
Offner, Susan
2010-01-01
The beta hemoglobin protein is identical in humans and chimpanzees. In this tutorial, students see that even though the proteins are identical, the genes that code for them are not. There are many more differences in the introns than in the exons, which indicates that coding regions of DNA are more highly conserved than non-coding regions.
Kobayashi, Eri; Shimizu, Ritsuko; Kikuchi, Yuko; Takahashi, Satoru; Yamamoto, Masayuki
2010-01-01
GATA1 is essential for the differentiation of erythroid cells and megakaryocytes. The Gata1 gene is composed of multiple untranslated first exons and five common coding exons. The erythroid first exon (IE exon) is important for Gata1 gene expression in hematopoietic lineages. Because previous IE exon knockdown analyses resulted in embryonic lethality, less is understood about the contribution of the IE exon to adult hematopoiesis. Here, we achieved specific deletion of the floxed IE exon in adulthood using an inducible Cre expression system. In this conditional knock-out mouse line, the Gata1 mRNA level was significantly down-regulated in the megakaryocyte lineage, resulting in thrombocytopenia with a marked proliferation of megakaryocytes. By contrast, in the erythroid lineage, Gata1 mRNA was expressed abundantly utilizing alternative first exons. Especially, the IEb/c and newly identified IEd exons were transcribed at a level comparable with that of the IE exon in control mice. Surprisingly, in the IE-null mouse, these transcripts failed to produce full-length GATA1 protein, but instead yielded GATA1 lacking the N-terminal domain inefficiently. With low level expression of the short form of GATA1, IE-null mice showed severe anemia with skewed erythroid maturation. Notably, the hematological phenotypes of adult IE-null mice substantially differ from those observed in mice harboring conditional ablation of the entire Gata1 gene. The present study demonstrates that the IE exon is instrumental to adult erythropoiesis by regulating the proper level of transcription and selecting the correct transcription start site of the Gata1 gene. PMID:19854837
Four novel mutations in the lactase gene (LCT) underlying congenital lactase deficiency (CLD).
Torniainen, Suvi; Freddara, Roberta; Routi, Taina; Gijsbers, Carolien; Catassi, Carlo; Höglund, Pia; Savilahti, Erkki; Järvelä, Irma
2009-01-22
Congenital lactase deficiency (CLD) is a severe gastrointestinal disorder of newborns. The diagnosis is challenging and based on clinical symptoms and low lactase activity in intestinal biopsy specimens. The disease is enriched in Finland but is also present in other parts of the world. Mutations encoding the lactase (LCT) gene have recently been shown to underlie CLD. The purpose of this study was to identify new mutations underlying CLD in patients with different ethnic origins, and to increase awareness of this disease so that the patients could be sought out and treated correctly. Disaccharidase activities in intestinal biopsy specimens were assayed and the coding region of LCT was sequenced from five patients from Europe with clinical features compatible with CLD. In the analysis and prediction of mutations the following programs: ClustalW, Blosum62, PolyPhen, SIFT and Panther PSEC were used. Four novel mutations in the LCT gene were identified. A single nucleotide substitution leading to an amino acid change S688P in exon 7 and E1612X in exon 12 were present in a patient of Italian origin. Five base deletion V565fsX567 leading to a stop codon in exon 6 was found in one and a substitution R1587H in exon 12 from another Finnish patient. Both Finnish patients were heterozygous for the Finnish founder mutation Y1390X. The previously reported mutation G1363S was found in a homozygous state in two siblings of Turkish origin. This is the first report of CLD mutations in patients living outside Finland. It seems that disease is more common than previously thought. All mutations in the LCT gene lead to a similar phenotype despite the location and/or type of mutation.
Perreault-Micale, Cynthia; Frieden, Alexander; Kennedy, Caleb J; Neitzel, Dana; Sullivan, Jessica; Faulkner, Nicole; Hallam, Stephanie; Greger, Valerie
2014-11-01
Loss of function variants in the PCDH15 gene can cause Usher syndrome type 1F, an autosomal recessive disease associated with profound congenital hearing loss, vestibular dysfunction, and retinitis pigmentosa. The Ashkenazi Jewish population has an increased incidence of Usher syndrome type 1F (founder variant p.Arg245X accounts for 75% of alleles), yet the variant spectrum in a panethnic population remains undetermined. We sequenced the coding region and intron-exon borders of PCDH15 using next-generation DNA sequencing technology in approximately 14,000 patients from fertility clinics. More than 600 unique PCDH15 variants (single nucleotide changes and small indels) were identified, including previously described pathogenic variants p.Arg3X, p.Arg245X (five patients), p.Arg643X, p.Arg929X, and p.Arg1106X. Novel truncating variants were also found, including one in the N-terminal extracellular domain (p.Leu877X), but all other novel truncating variants clustered in the exon 33 encoded C-terminal cytoplasmic domain (52 patients, 14 variants). One variant was observed predominantly in African Americans (carrier frequency of 2.3%). The high incidence of truncating exon 33 variants indicates that they are unlikely to cause Usher syndrome type 1F even though many remove a large portion of the gene. They may be tolerated because PCDH15 has several alternate cytoplasmic domain exons and differentially spliced isoforms may function redundantly. Effects of some PCDH15 truncating variants were addressed by deep sequencing of a panethnic population. Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Chung, H Y; Choi, Y C; Park, H N
2015-05-18
We investigated the phylogenetic relationships between pig breeds, compared the genetic similarity between humans and pigs, and provided basic genetic information on Korean native pigs (KNPs), using genetic variants of the swine leukocyte antigen 3 (SLA-3) gene. Primers were based on sequences from GenBank (accession Nos. AF464010 and AF464009). Polymerase chain reaction analysis amplified approximately 1727 bp of segments, which contained 1086 bp of coding regions and 641 bp of the 3'- and 5'-untranslated regions. Bacterial artificial chromosome clones of miniature pigs were used for sequencing the SLA-3 genomic region, which was 3114 bp in total length, including the coding (1086 bp) and non-coding (2028 bp) regions. Sequence analysis detected 53 single nucleotide polymorphisms (SNPs), based on a minor allele frequency greater than 0.01, which is low compared with other pig breeds, and the results suggest that there is low genetic variability in KNPs. Comparative analysis revealed that humans possess approximately three times more genetic variation than do pigs. Approximately 71% of SNPs in exons 2 and 3 were detected in KNPs, and exon 5 in humans is a highly polymorphic region. Newly identified sequences of SLA-3 using KNPs were submitted to GenBank (accession No. DQ992512-18). Cluster analysis revealed that KNPs were grouped according to three major alleles: SLA-3*0502 (DQ992518), SLA-3*0302 (DQ992513 and DQ992516), and SLA-3*0303 (DQ992512, DQ992514, DQ992515, and DQ992517). Alignments revealed that humans have a relatively close genetic relationship with pigs and chimpanzees. The information provided by this study may be useful in KNP management.
Friedberg, Felix
2009-05-01
In this paper we examine (restricted to homo sapiens) the products resulting from gene duplication and the subsequent alternative splicing for the members of a multidomain group of proteins which possess the evolutionary conserved calponin homology CH domain, i.e. an "actin binding domain", as a singlet and which, in addition, contain the conserved cysteine rich double Zn finger possessing Lim domain, also as a singlet. Seven genes, resulting from gene duplications, were identified that code for seven group members for which pre-mRNAs appear to have undergone multiple alternative splicing: Mical 1, 2 and 3 are located on chromosomes 6q21, 11p15 and 22q11, respectively. The LMO7 gene is present on chromosome 13q22 and the LIMCH1 gene on chromosome 4p13. Micall1 is mapped to chromosome 22q13 and Micall2 to chromosome 7p22. Translated Gen/Bank ESTs suggest the existence of multiple products alternatively spliced from the pre-mRNAs encoded by these genes. Characteristic indicators of such splicing among the proteins derived from one gene must include containment of some common extensive 100% identical regions. In some instances only one exon might be partly or completely eliminated. Sometimes alternative splicing is also associated with an increased frequency of creation of an exon or part of an exon from an intron. Not only coding regions for the body of the protein but also for its N- or -C ends could be affected by the splicing. If created forms are merely beginning at different starting points but remain identical in sequence thereafter, their existence as products of alternate splicing must be questioned. In the splicings, described in this paper, multiple isoforms rather than a single isoform appear as products during the gene expression.
Bahrami, A; Behzadi, Sh; Miraei-Ashtiani, S R; Roh, S-G; Katoh, K
2013-09-15
The somatotropic axis, the control system for growth hormone (GH) secretion and its endogenous factors involved in the regulation of metabolism and energy partitioning, has promising potentials for producing economically valuable traits in farm animals. Here we investigated single nucleotide polymorphisms (SNPs) of the genes of factors involved in the somatotropic axis for growth hormone (GH1), growth hormone receptor (GHR), ghrelin (GHRL), insulin-like growth factor 1 (IGF-I) and leptin (LEP), using polymerase chain reaction-single-strand conformation polymorphism (PCR-SSCP) and DNA sequencing methods in 452 individual Mehraban sheep. A nonradioactive method to allow SSCP detection was used for genomic DNA and PCR amplification of six fragments: exons 4 and 5 of GH1; exon 10 of GH receptor (GHR); exon 1 of ghrelin (GHRL); exon 1 of insulin-like growth factor-I (IGF-I), and exon 3 of leptin (LEP). Polymorphisms were detected in five of the six PCR products. Two electrophoretic patterns were detected for GH1 exon 4. Five conformational patterns were detected for GH1 exon 5 and LEP exon 3, and three for IGF-I exon 1. Only GHR and GHRL were monomorphic. Changes in protein structures due to variable SNPs were also analyzed. The results suggest that Mehraban sheep, a major breed that is important for the animal industry in Middle East countries, has high genetic variability, opening interesting prospects for future selection programs and preservation strategies. Copyright © 2013 Elsevier B.V. All rights reserved.
Shirts, Brian H; Salipante, Stephen J; Casadei, Silvia; Ryan, Shawnia; Martin, Judith; Jacobson, Angela; Vlaskin, Tatyana; Koehler, Karen; Livingston, Robert J; King, Mary-Claire; Walsh, Tom; Pritchard, Colin C
2014-10-01
Single-exon inversions have rarely been described in clinical syndromes and are challenging to detect using Sanger sequencing. We report the case of a 40-year-old woman with adenomatous colon polyps too numerous to count and who had a complex inversion spanning the entire exon 10 in APC (the gene encoding for adenomatous polyposis coli), causing exon skipping and resulting in a frameshift and premature protein truncation. In this study, we employed complete APC gene sequencing using high-coverage next-generation sequencing by ColoSeq, analysis with BreakDancer and SLOPE software, and confirmatory transcript analysis. ColoSeq identified a complex small genomic rearrangement consisting of an inversion that results in translational skipping of exon 10 in the APC gene. This mutation would not have been detected by traditional sequencing or gene-dosage methods. We report a case of adenomatous polyposis resulting from a complex single-exon inversion. Our report highlights the benefits of large-scale sequencing methods that capture intronic sequences with high enough depth of coverage-as well as the use of informatics tools-to enable detection of small pathogenic structural rearrangements.
The human cytochrome P450 3A locus. Gene evolution by capture of downstream exons.
Finta, C; Zaphiropoulos, P G
2000-12-30
Using a bacterial artificial chromosome (BAC) clone, we have mapped the human cytochrome P450 3A (CYP3A) locus containing the genes encoding for CYP3A4, CYP3A5 and CYP3A7. The genes lie in a head-to-tail orientation in the order of 3A4, 3A7 and 3A5. In both intergenic regions (3A4-3A7 and 3A7-3A5), we have detected several additional cytochrome P450 3A exons, forming two CYP3A pseudogenes. These pseudogenes have the same orientation as the CYP3A genes. To our surprise, a 3A7 mRNA species has been detected in which the exons 2 and 13 of one of the pseudogenes (the one that is downstream of 3A7) are spliced after the 3A7 terminal exon. This results in an mRNA molecule that consists of the 13 3A7 exons and two additional exons at the 3' end. The additional two exons originating from the pseudogene are in an altered reading frame and consequently have the capability to code a completely different amino acid sequence than the canonical CYP3A exons 2 and 13. These findings may represent a generalized evolutionary process with genes having the potential to capture neighboring sequences and use them as functional exons.
Lin, Michael F.; Deoras, Ameya N.; Rasmussen, Matthew D.; Kellis, Manolis
2008-01-01
Comparative genomics of multiple related species is a powerful methodology for the discovery of functional genomic elements, and its power should increase with the number of species compared. Here, we use 12 Drosophila genomes to study the power of comparative genomics metrics to distinguish between protein-coding and non-coding regions. First, we study the relative power of different comparative metrics and their relationship to single-species metrics. We find that even relatively simple multi-species metrics robustly outperform advanced single-species metrics, especially for shorter exons (≤240 nt), which are common in animal genomes. Moreover, the two capture largely independent features of protein-coding genes, with different sensitivity/specificity trade-offs, such that their combinations lead to even greater discriminatory power. In addition, we study how discovery power scales with the number and phylogenetic distance of the genomes compared. We find that species at a broad range of distances are comparably effective informants for pairwise comparative gene identification, but that these are surpassed by multi-species comparisons at similar evolutionary divergence. In particular, while pairwise discovery power plateaued at larger distances and never outperformed the most advanced single-species metrics, multi-species comparisons continued to benefit even from the most distant species with no apparent saturation. Last, we find that genes in functional categories typically considered fast-evolving can nonetheless be recovered at very high rates using comparative methods. Our results have implications for comparative genomics analyses in any species, including the human. PMID:18421375
Mutations in the Promoter Region of the Aldolase B Gene that cause Hereditary Fructose Intolerance
Coffee, Erin M.; Tolan, Dean R.
2010-01-01
SUMMARY Hereditary fructose intolerance (HFI) is a potentially fatal inherited metabolic disease caused by a deficiency of aldolase B activity in the liver and kidney. Over 40 disease-causing mutations are known in the protein-coding region of ALDOB. Mutations upstream of the protein-coding portion of ALDOB are reported here for the first time. DNA sequence analysis of 61 HFI patients revealed single base mutations in the promoter, intronic enhancer, and the first exon, which is entirely untranslated. One mutation, g.–132G>A, is located within the promoter at an evolutionarily conserved nucleotide within a transcription factor-binding site. A second mutation, IVS1+1G>C, is at the donor splice site of the first exon. In vitro electrophoretic mobility shift assays show a decrease in nuclear extract-protein binding at the g.–132G>A mutant site. The promoter mutation results in decreased transcription using luciferase reporter plasmids. Analysis of cDNA from cells transfected with plasmids harboring the IVS1+1G>C mutation results in aberrant splicing leading to complete retention of the first intron (~ 5 kb). The IVS1+1G>C splicing mutation results in loss of luciferase activity from a reporter plasmid. These novel mutations in ALDOB represent 2% of alleles in American HFI patients, with IVS1+1G>C representing a significantly higher allele frequency (6%) among HFI patients of Hispanic and African-American ethnicity. PMID:20882353
NASA Astrophysics Data System (ADS)
Liu, Meng; Liu, Yuan; Hui, Min; Song, Chengwen; Cui, Zhaoxia
2017-03-01
Clip domain serine proteases (cSPs) and their homologs (SPHs) play an important role in various biological processes that are essential components of extracellular signaling cascades, especially in the innate immune responses of invertebrates. Here, polymorphisms of PtcSP and PtSPH from the swimming crab Portunus trituberculatus were investigated to explore their association with resistance/susceptibility to Vibrio alginolyticus. Polymorphic loci were identified using Clustal X, and characterized with SPSS 16.0 software, and then the significance of genotype and allele frequencies between resistant and susceptible stocks was determined by a χ 2 test. A total of 109 and 77 single nucleotide polymorphisms (SNPs) were identified in the genomic fragments of PtcSP and PtSPH, respectively. Notably, nearly half of PtSPH polymorphisms were found in the non-coding exon 1. Fourteen SNPs investigated were significantly associated with susceptibility/resistance to V. alginolyticus ( P <0.05). Among them, eight SNPs were observed in introns, and one synonymous, four non-synonymous SNPs and one ins-del were found in coding exons. In addition, five simple sequence repeats (SSRs) were detected in intron 3 of PtcSP. Although there was no statistically significant difference of allele frequencies, the SSRs showed different polymorphic alleles on the basis of the repeat number between resistant and susceptible stocks. After further validation, polymorphisms investigated here might be applied to select potential molecular markers of P. trituberculatus with resistance to V. alginolyticus.
ACTG: novel peptide mapping onto gene models.
Choi, Seunghyuk; Kim, Hyunwoo; Paek, Eunok
2017-04-15
In many proteogenomic applications, mapping peptide sequences onto genome sequences can be very useful, because it allows us to understand origins of the gene products. Existing software tools either take the genomic position of a peptide start site as an input or assume that the peptide sequence exactly matches the coding sequence of a given gene model. In case of novel peptides resulting from genomic variations, especially structural variations such as alternative splicing, these existing tools cannot be directly applied unless users supply information about the variant, either its genomic position or its transcription model. Mapping potentially novel peptides to genome sequences, while allowing certain genomic variations, requires introducing novel gene models when aligning peptide sequences to gene structures. We have developed a new tool called ACTG (Amino aCids To Genome), which maps peptides to genome, assuming all possible single exon skipping, junction variation allowing three edit distances from the original splice sites, exon extension and frame shift. In addition, it can also consider SNVs (single nucleotide variations) during mapping phase if a user provides the VCF (variant call format) file as an input. Available at http://prix.hanyang.ac.kr/ACTG/search.jsp . eunokpaek@hanyang.ac.kr. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Regularized rare variant enrichment analysis for case-control exome sequencing data.
Larson, Nicholas B; Schaid, Daniel J
2014-02-01
Rare variants have recently garnered an immense amount of attention in genetic association analysis. However, unlike methods traditionally used for single marker analysis in GWAS, rare variant analysis often requires some method of aggregation, since single marker approaches are poorly powered for typical sequencing study sample sizes. Advancements in sequencing technologies have rendered next-generation sequencing platforms a realistic alternative to traditional genotyping arrays. Exome sequencing in particular not only provides base-level resolution of genetic coding regions, but also a natural paradigm for aggregation via genes and exons. Here, we propose the use of penalized regression in combination with variant aggregation measures to identify rare variant enrichment in exome sequencing data. In contrast to marginal gene-level testing, we simultaneously evaluate the effects of rare variants in multiple genes, focusing on gene-based least absolute shrinkage and selection operator (LASSO) and exon-based sparse group LASSO models. By using gene membership as a grouping variable, the sparse group LASSO can be used as a gene-centric analysis of rare variants while also providing a penalized approach toward identifying specific regions of interest. We apply extensive simulations to evaluate the performance of these approaches with respect to specificity and sensitivity, comparing these results to multiple competing marginal testing methods. Finally, we discuss our findings and outline future research. © 2013 WILEY PERIODICALS, INC.
On splice site prediction using weight array models: a comparison of smoothing techniques
NASA Astrophysics Data System (ADS)
Taher, Leila; Meinicke, Peter; Morgenstern, Burkhard
2007-11-01
In most eukaryotic genes, protein-coding exons are separated by non-coding introns which are removed from the primary transcript by a process called "splicing". The positions where introns are cut and exons are spliced together are called "splice sites". Thus, computational prediction of splice sites is crucial for gene finding in eukaryotes. Weight array models are a powerful probabilistic approach to splice site detection. Parameters for these models are usually derived from m-tuple frequencies in trusted training data and subsequently smoothed to avoid zero probabilities. In this study we compare three different ways of parameter estimation for m-tuple frequencies, namely (a) non-smoothed probability estimation, (b) standard pseudo counts and (c) a Gaussian smoothing procedure that we recently developed.
Novel methodologies for spectral classification of exon and intron sequences
NASA Astrophysics Data System (ADS)
Kwan, Hon Keung; Kwan, Benjamin Y. M.; Kwan, Jennifer Y. Y.
2012-12-01
Digital processing of a nucleotide sequence requires it to be mapped to a numerical sequence in which the choice of nucleotide to numeric mapping affects how well its biological properties can be preserved and reflected from nucleotide domain to numerical domain. Digital spectral analysis of nucleotide sequences unfolds a period-3 power spectral value which is more prominent in an exon sequence as compared to that of an intron sequence. The success of a period-3 based exon and intron classification depends on the choice of a threshold value. The main purposes of this article are to introduce novel codes for 1-sequence numerical representations for spectral analysis and compare them to existing codes to determine appropriate representation, and to introduce novel thresholding methods for more accurate period-3 based exon and intron classification of an unknown sequence. The main findings of this study are summarized as follows: Among sixteen 1-sequence numerical representations, the K-Quaternary Code I offers an attractive performance. A windowed 1-sequence numerical representation (with window length of 9, 15, and 24 bases) offers a possible speed gain over non-windowed 4-sequence Voss representation which increases as sequence length increases. A winner threshold value (chosen from the best among two defined threshold values and one other threshold value) offers a top precision for classifying an unknown sequence of specified fixed lengths. An interpolated winner threshold value applicable to an unknown and arbitrary length sequence can be estimated from the winner threshold values of fixed length sequences with a comparable performance. In general, precision increases as sequence length increases. The study contributes an effective spectral analysis of nucleotide sequences to better reveal embedded properties, and has potential applications in improved genome annotation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Umans, L.; Serneels, L.; Hilliker, C.
1994-08-01
The authors have cloned the mouse gene coding for {alpha}{sub 2}-macroglobulin in overlapping {lambda} clones and have analyzed its structure. The gene contains 36 exons, coding for the 4.8-kb cDNA that we cloned previously. Including putative control elements in the 5{prime} flanking region, the gene covers about 45 kb. A region of 3.8 kb, stretching from 835 bases upstream of the cDNA start site to exon 4, including all intervening sequences, was sequenced completely. The analysis demonstrated that the putative promoter region of the mouse A2M gene differed considerably from the known promoter sequences of the human A2M gene andmore » of the rat acute-phas A2M gene. Comparison of the exon-intron structure of all known genes of the A2M family confirmed that the rat acute phase A2M gene is more closely related to the human gene than to the mouse A2M gene. To generate mice with the A2M gene inactivated, an insertion type of construct containing 7.5 kb of genomic DNA of the mouse strain 129/J, encompassing exons 16 to 19, was synthesized. A hygromycin marker gene was embedded in intron 17. After electroporation, 198 hygromycin-resistant ES cell lines were isolated and analyzed by Southern blotting. Five ES cell lines were obtained with one allele of the mouse A2M gene targeted by this insertion construct, demonstrating that the position and the characteristics of the vector served the intended goal.« less
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-01-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield. PMID:25333064
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-09-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield.
An UPF3-based nonsense-mediated decay in Paramecium.
Contreras, Julia; Begley, Victoria; Macias, Sandra; Villalobo, Eduardo
2014-12-01
Nonsense-mediated decay recognises mRNAs containing premature termination codons. One of its components, UPF3, is a molecular link bridging through its binding to the exon junction complex nonsense-mediated decay and splicing. In protists UPF3 has not been identified yet. We report that Paramecium tetraurelia bears an UPF3 gene and that it has a role in nonsense-mediated decay. Interestingly, the identified UPF3 has not conserved the essential amino acids required to bind the exon junction complex. Though, our data indicates that this ciliate bears genes coding for core proteins of the exon junction complex. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Saturation mutagenesis reveals manifold determinants of exon definition.
Ke, Shengdong; Anquetil, Vincent; Zamalloa, Jorge Rojas; Maity, Alisha; Yang, Anthony; Arias, Mauricio A; Kalachikov, Sergey; Russo, James J; Ju, Jingyue; Chasin, Lawrence A
2018-01-01
To illuminate the extent and roles of exonic sequences in the splicing of human RNA transcripts, we conducted saturation mutagenesis of a 51-nt internal exon in a three-exon minigene. All possible single and tandem dinucleotide substitutions were surveyed. Using high-throughput genetics, 5560 minigene molecules were assayed for splicing in human HEK293 cells. Up to 70% of mutations produced substantial (greater than twofold) phenotypes of either increased or decreased splicing. Of all predicted secondary structural elements, only a single 15-nt stem-loop showed a strong correlation with splicing, acting negatively. The in vitro formation of exon-protein complexes between the mutant molecules and proteins associated with spliceosome formation (U2AF35, U2AF65, U1A, and U1-70K) correlated with splicing efficiencies, suggesting exon definition as the step affected by most mutations. The measured relative binding affinities of dozens of human RNA binding protein domains as reported in the CISBP-RNA database were found to correlate either positively or negatively with splicing efficiency, more than could fit on the 51-nt test exon simultaneously. The large number of these functional protein binding correlations point to a dynamic and heterogeneous population of pre-mRNA molecules, each responding to a particular collection of binding proteins. © 2018 Ke et al.; Published by Cold Spring Harbor Laboratory Press.
Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Fang, Zhide; Deininger, Prescott; Zhang, Kun
2013-08-28
The exonization of transposable elements (TEs) has proven to be a significant mechanism for the creation of novel exons. Existing knowledge of the retention patterns of TE exons in mRNAs were mainly established by the analysis of Expressed Sequence Tag (EST) data and microarray data. This study seeks to validate and extend previous studies on the expression of TE exons by an integrative statistical analysis of high throughput RNA sequencing data. We collected 26 RNA-seq datasets spanning multiple tissues and cancer types. The exon-level digital expressions (indicating retention rates in mRNAs) were quantified by a double normalized measure, called the rescaled RPKM (Reads Per Kilobase of exon model per Million mapped reads). We analyzed the distribution profiles and the variability (across samples and between tissue/disease groups) of TE exon expressions, and compared them with those of other constitutive or cassette exons. We inferred the effects of four genomic factors, including the location, length, cognate TE family and TE nucleotide proportion (RTE, see Methods section) of a TE exon, on the exons' expression level and expression variability. We also investigated the biological implications of an assembly of highly-expressed TE exons. Our analysis confirmed prior studies from the following four aspects. First, with relatively high expression variability, most TE exons in mRNAs, especially those without exact counterparts in the UCSC RefSeq (Reference Sequence) gene tables, demonstrate low but still detectable expression levels in most tissue samples. Second, the TE exons in coding DNA sequences (CDSs) are less highly expressed than those in 3' (5') untranslated regions (UTRs). Third, the exons derived from chronologically ancient repeat elements, such as MIRs, tend to be highly expressed in comparison with those derived from younger TEs. Fourth, the previously observed negative relationship between the lengths of exons and the inclusion levels in transcripts is also true for exonized TEs. Furthermore, our study resulted in several novel findings. They include: (1) for the TE exons with non-zero expression and as shown in most of the studied biological samples, a high TE nucleotide proportion leads to their lower retention rates in mRNAs; (2) the considered genomic features (i.e. a continuous variable such as the exon length or a category indicator such as 3'UTR) influence the expression level and the expression variability (CV) of TE exons in an inverse manner; (3) not only the exons derived from Alu elements but also the exons from the TEs of other families were preferentially established in zinc finger (ZNF) genes.
The genomic structure: proof of the role of non-coding DNA.
Bouaynaya, Nidhal; Schonfeld, Dan
2006-01-01
We prove that the introns play the role of a decoy in absorbing mutations in the same way hollow uninhabited structures are used by the military to protect important installations. Our approach is based on a probability of error analysis, where errors are mutations which occur in the exon sequences. We derive the optimal exon length distribution, which minimizes the probability of error in the genome. Furthermore, to understand how can Nature generate the optimal distribution, we propose a diffusive random walk model for exon generation throughout evolution. This model results in an alpha stable exon length distribution, which is asymptotically equivalent to the optimal distribution. Experimental results show that both distributions accurately fit the real data. Given that introns also drive biological evolution by increasing the rate of unequal crossover between genes, we conclude that the role of introns is to maintain a genius balance between stability and adaptability in eukaryotic genomes.
The D4 receptor gene and mood disorders: An association study
DOE Office of Scientific and Technical Information (OSTI.GOV)
Macciardi, F.; Cavalini, M.C.; Petronis, A.
1994-09-01
The problem of a gene-disease association is of major relevance in the current research of Psychiatric Disorders, mostly because of the lack of unequivocal results obtained with the linkage approach. However, some points of an association study must also be carefully considered, namely the statistical methodology and the strategy to select a gene to be tested. The gene coding for the D4 receptor (DRD4) might be theoretically relevant as a component of the genetic susceptibility for mood disorders. We now know that DRD4 has at least 2 functional polymorphisms in the coding regions of the gene, in exon 3 andmore » exon 1, thus conferring etiologic relevance to a potentially positive association. In our work, we investigated the DRD4 genotypes of the 3rd and 1st exon for 93 patients with bipolar disorder and 57 patients with major depression, recurrent disorder. Patients have been diagnosed either by traditional DSMIII-R criteria or by clustering their lifetime psychopathological symptomatology. A random control group consisted of 151 subjects. A significant association has been found with DRD4 exon 3 genotypes, revealing an increase of genotypes 2-4 in Bipolar patients (chi-square=23.07, df=12, p=0.02). Even though a definitive confirmation of our finding requires an independent replication of the study, this result emphasizes the importance of DRD4 in mood disorders.« less
Mendes-Junior, C T; Castelli, E C; Meyer, D; Simões, A L; Donadi, E A
2013-12-01
HLA-G has an important role in the modulation of the maternal immune system during pregnancy, and evidence that balancing selection acts in the promoter and 3'UTR regions has been previously reported. To determine whether selection acts on the HLA-G coding region in the Amazon Rainforest, exons 2, 3 and 4 were analyzed in a sample of 142 Amerindians from nine villages of five isolated tribes that inhabit the Central Amazon. Six previously described single-nucleotide polymorphisms (SNPs) were identified and the Expectation-Maximization (EM) and PHASE algorithms were used to computationally reconstruct SNP haplotypes (HLA-G alleles). A new HLA-G allele, which originated in Amerindian populations by a crossing-over event between two widespread HLA-G alleles, was identified in 18 individuals. Neutrality tests evidenced that natural selection has a complex part in the HLA-G coding region. Although balancing selection is the type of selection that shapes variability at a local level (Native American populations), we have also shown that purifying selection may occur on a worldwide scale. Moreover, the balancing selection does not seem to act on the coding region as strongly as it acts on the flanking regulatory regions, and such coding signature may actually reflect a hitchhiking effect.
Nanoscale studies link amyloid maturity with polyglutamine diseases onset
NASA Astrophysics Data System (ADS)
Ruggeri, F. S.; Vieweg, S.; Cendrowska, U.; Longo, G.; Chiki, A.; Lashuel, H. A.; Dietler, G.
2016-08-01
The presence of expanded poly-glutamine (polyQ) repeats in proteins is directly linked to the pathogenesis of several neurodegenerative diseases, including Huntington’s disease. However, the molecular and structural basis underlying the increased toxicity of aggregates formed by proteins containing expanded polyQ repeats remain poorly understood, in part due to the size and morphological heterogeneity of the aggregates they form in vitro. To address this knowledge gap and technical limitations, we investigated the structural, mechanical and morphological properties of fibrillar aggregates at the single molecule and nanometer scale using the first exon of the Huntingtin protein as a model system (Exon1). Our findings demonstrate a direct correlation of the morphological and mechanical properties of Exon1 aggregates with their structural organization at the single aggregate and nanometric scale and provide novel insights into the molecular and structural basis of Huntingtin Exon1 aggregation and toxicity.
Tollefson, Ann E.; Ying, Baoling; Doronin, Konstantin; Sidor, Peter D.; Wold, William S. M.
2007-01-01
A short open reading frame named the “U exon,” located on the adenovirus (Ad) l-strand (for leftward transcription) between the early E3 region and the fiber gene, is conserved in mastadenoviruses. We have observed that Ad5 mutants with large deletions in E3 that infringe on the U exon display a mild growth defect, as well as an aberrant Ad E2 DNA-binding protein (DBP) intranuclear localization pattern and an apparent failure to organize replication centers during late infection. Mutants in which the U exon DNA is reconstructed have a reversed phenotype. Chow et al. (L. T. Chow et al., J. Mol. Biol. 134:265-303, 1979) described mRNAs initiating in the region of the U exon and spliced to downstream sequences in the late DBP mRNA leader and the DBP-coding region. We have cloned this mRNA (as cDNA) from Ad5 late mRNA; the predicted protein is 217 amino acids, initiating in the U exon and continuing in frame in the DBP leader and in the DBP-coding region but in a different reading frame from DBP. Polyclonal and monoclonal antibodies generated against the predicted U exon protein (UXP) showed that UXP is ∼24K in size by immunoblot and is a late protein. At 18 to 24 h postinfection, UXP is strongly associated with nucleoli and is found throughout the nucleus; later, UXP is associated with the periphery of replication centers, suggesting a function relevant to Ad DNA replication or RNA transcription. UXP is expressed by all four species C Ads. When expressed in transient transfections, UXP complements the aberrant DBP localization pattern of UXP-negative Ad5 mutants. Our data indicate that UXP is a previously unrecognized protein derived from a novel late l-strand transcription unit. PMID:17881437
Polymorphism of BMP4 gene in Indian goat breeds differing in prolificacy.
Sharma, Rekha; Ahlawat, Sonika; Maitra, A; Roy, Manoranjan; Mandakmale, S; Tantia, M S
2013-12-10
Bone morphogenetic proteins (BMPs) are members of the TGF-β (transforming growth factor-beta) superfamily, of which BMP4 is the most important due to its crucial role in follicular growth and differentiation, cumulus expansion and ovulation. Reproduction is a crucial trait in goat breeding and based on the important role of BMP4 gene in reproduction it was considered as a possible candidate gene for the prolificacy of goats. The objective of the present study was to detect polymorphism in intronic, exonic and 3' un-translated regions of BMP4 gene in Indian goats. Nine different goat breeds (Barbari, Beetal, Black Bengal, Malabari, Jakhrana (Twinning>40%), Osmanabadi, Sangamneri (Twinning 20-30%), Sirohi and Ganjam (Twinning<10%)) differing in prolificacy and geographic distribution were employed for polymorphism scanning. Cattle sequence (AC_000167.1) was used to design primers for the amplification of a targeted region followed by direct DNA sequencing to identify the genetic variations. Single nucleotide polymorphisms (SNPs) were not detected in exon 3, the intronic region and the 3' flanking region. A SNP (G1534A) was identified in exon 2. It was a non-synonymous mutation resulting in an arginine to lysine change in a corresponding protein sequence. G to A transition at the 1534 locus revealed two genotypes GG and GA in the nine investigated goat breeds. The GG genotype was predominant with a genotype frequency of 0.98. The GA genotype was present in the Black Bengal as well as Jakhrana breed with a genotype frequency of 0.02. A microsatellite was identified in the 3' flanking region, only 20 nucleotides downstream from the termination site of the coding region, as a short sequence with more than nineteen continuous and repeated CA dinucleotides. Since the gene is highly evolutionarily conserved, identification of a non-synonymous SNP (G1534A) in the coding region gains further importance. To our knowledge, this is the first report of a mutation in the coding region of the caprine BMP4 gene. But whether the reproduction trait of goat is associated with the BMP4 polymorphism, needs to be further defined by association studies in more populations so as to delineate an effect on it. © 2013 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
He, Feng; Wen, Haishen; Yu, Dahui; Li, Jifang; Shi, Bao; Chen, Caifang; Zhang, Jiaren; Jin, Guoxiong; Chen, Xiaoyan; Shi, Dan; Yang, Yanping
2010-12-01
Follicle stimulating hormone β (FSHβ) of Japanese flounder ( Paralichthys olivaceus) plays a key role in the regulation of gonadal development. This study aimed to investigate molecular genetic characteristics of the FSHβ gene and elucidate the effects of single nucleotide polymorphisms (SNPs) of FSHβ on reproductive traits in Japanese flounder. We used polymerase chain reaction single-strand conformation polymorphism (PCR-SSCP) and sequencing of the FSHβ gene in 60 individuals. We identified only an SNP (T/C) in the coding region of exon3 of FSHβ. The SNP (T/C) did not lead to amino acid changes at the position 340 bp of FSHβ gene. Statistical analysis showed that the SNP was significantly associated with testosterone (T) level and gonadosomatic index (GSI) ( P < 0.05). Individuals with genotype TC of the SNP had significantly higher serum T levels and GSI ( P < 0.05) than that of genotype CC. Therefore, FSHβ gene could be a useful molecular marker in selection for prominent reproductive trait in Japanese Flounder.
Hewett, Duncan; Samuelsson, Lena; Polding, Joanne; Enlund, Fredrik; Smart, Devi; Cantone, Kathryn; See, Chee Gee; Chadha, Sapna; Inerot, Annica; Enerback, Charlotta; Montgomery, Doug; Christodolou, Chris; Robinson, Phil; Matthews, Paul; Plumpton, Mary; Wahlstrom, Jan; Swanbeck, Gunnar; Martinsson, Tommy; Roses, Allen; Riley, John; Purvis, Ian
2002-03-01
Psoriasis is a chronic inflammatory disease of the skin with both genetic and environmental risk factors. Here we describe the creation of a single-nucleotide polymorphism (SNP) map spanning 900-1200 kb of chromosome 3q21, which had been previously recognized as containing a psoriasis susceptibility locus, PSORS5. We genotyped 644 individuals, from 195 Swedish psoriatic families, for 19 polymorphisms. Linkage disequilibrium (LD) between marker and disease was assessed using the transmission/disequilibrium test (TDT). In the TDT analysis, alleles of three of these SNPs showed significant association with disease (P<0.05). A 160-kb interval encompassing these three SNPs was sequenced, and a coding sequence consisting of 13 exons was identified. The predicted protein shares 30-40% homology with the family of cation/chloride cotransporters. A five-marker haplotype spanning the 3' half of this gene is associated with psoriasis to a P value of 3.8<10(-5). We have called this gene SLC12A8, coding for a member of the solute carrier family 12 proteins. It belongs to a class of genes that were previously unrecognized as playing a role in psoriasis pathogenesis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thanh, L.T.; Man, Nguyen Thi; Morris, G.E.
1995-08-28
We have produced a new panel of 20 monoclonal antibodies (mAbs) against a region of the dystrophin protein corresponding to a deletion-prone region of the Duchenne muscular dystrophy gene (exons 45-50). We show that immunohistochemistry or Western blotting with these {open_quotes}exon-specific{close_quotes} mAbs can provide a valuable addition to Southern blotting or PCR methods for the accurate identification of genetic deletions in Becker muscular dystrophy patients. The antibodies were mapped to the following exons: exon 45 (2 mAbs), exon 46 (6), exon 47 (1), exons 47/48 (4), exons 48-50 (6), and exon 50 (1). PCR amplification of single exons or groupsmore » of exons was used both to produce specific dystrophin immunogens and to map the mAbs obtained. PCR-mediated mutagenesis was also used to identify regions of dystrophin important for mAb binding. Because the mAbs can be used to characterize the dystrophin produced by individual muscle fibres, they will also be useful for studying {open_quotes}revertant{close_quotes} fibres in Duchenne muscle and for monitoring the results of myoblast therapy trials in MD patients with deletions in this region of the dystrophin gene. 27 refs., 7 figs., 3 tabs.« less
NASA Technical Reports Server (NTRS)
Pelzer, T.; Lyons, G. E.; Kim, S.; Moreadith, R. W.; Blomqvist, C. G. (Principal Investigator)
1996-01-01
The cellular function(s) of the SNO protein remain undefined. To gain a better understanding of possible developmental roles of this cellular proto-oncogene, we have cloned two murine sno cDNAs and have investigated their expression patterns in embryonic and postnatal tissues. A single major transcript of 7.5 kb is detected in multiple tissues by Northern blot. However, reverse transcriptase polymerase chain reaction (RT-PCR) and RNAse protection assays revealed a novel splice variant in every tissue examined. Two isoforms, termed sno N and sno-dE3 (dE3, deletion within exon 3), were identified. The sno-dE3 isoform employs a novel 5' splice site located within the coding region of the third exon and deletes potential kinase recognition motifs. Transcripts of both sno isoforms accumulate ubiquitously but are most abundant in the developing central nervous system. The in situ hybridization patterns of sno expression during murine development suggest potential roles in tissues with a high degree of cellular proliferation. Expression in terminally differentiated tissues such as muscle and neurons indicates that SNO may have multiple functional activities.
Gandhi, Manish J; Pendergrass, Thomas W; Cummings, Carrie C; Ihara, Kenji; Blau, C Anthony; Drachman, Jonathan G
2005-10-01
An 11-year-old girl, presenting with fatigue and bruising, was found to be profoundly pancytopenic. Bone marrow exam and clinical evaluation were consistent with aplastic anemia. Family members were studied as potential stem cell donors, revealing that both younger siblings displayed significant thrombocytopenia, whereas both parents had normal blood counts. We evaluated this pedigree to understand the unusually late presentation of congenital amegakaryocytic thrombocytopenia (CAMT). The coding region and the intron/exon junctions of MPL were sequenced from each family member. Vectors representing each of the mutations were constructed and tested for the ability to support growth of Baf3/Mpl(mutant) cells. All three siblings had elevated thrombopoietin levels. Analysis of genomic DNA demonstrated that each parent had mutations/polymorphisms in a single MPL allele and that each child was a compound heterozygote, having inherited both abnormal alleles. The maternal allele encoded a mutation of the donor splice-junction at the exon-3/intron-3 boundary. A mini-gene construct encoding normal vs mutant versions of the intron-3 donor-site demonstrated that physiologic splicing was significantly reduced in the mutant construct. Mutations that incompletely eliminate Mpl expression/function may result in delayed diagnosis of CAMT and confusion with aplastic anemia.
Ren, Zi; Zeng, Hai-tao; Xu, Yan-wen; Zhuang, Guang-lun; Deng, Jie; Zhang, Cheng; Zhou, Can-quan
2009-02-01
To evaluate the use of multiple displacement amplification (MDA) in preimplantation genetic diagnosis (PGD) for female carriers with Duchenne muscular dystrophy (DMD). MDA was used to amplify a whole genome of single cells. Following the setup on single cells, the test was applied in two clinical cases of PGD. One mutant exon, six short tandem repeats (STR) markers within the dystrophin gene, and amelogenin were incorporated into singleplex polymerase chain reaction (PCR) assays on MDA products of single blastomeres. Center for reproductive medicine in First Affiliated Hospital, Sun Yat-sen University, China. Two female carriers with a duplication of exons 3-11 and a deletion of exons 47-50, respectively. The MDA of single cells and fluorescent PCR assays for PGD. The ability to analyze single blastomeres for DMD using MDA. The protocol setup previously allowed for the accurate diagnosis of each embryo. Two clinical cases resulted in a healthy girl, which was the first successful clinical application of MDA in PGD for DMD. We suggest that this protocol is reliable to increase the accuracy of the PGD for DMD.
Cao, Wei; Yan, Ming; Hao, QianYun; Wang, ShuLin; Wu, LiHua; Liu, Qing; Li, MingYan; Biddle, Fred G; Wu, Wei
2013-04-01
Meesmann epithelial corneal dystrophy (MECD) is a dominantly inherited disorder, characterized by fragility of the anterior corneal epithelium and formation of intraepithelial microcysts. It has been described in a number of different ancestral groups. To date, all reported cases of MECD have been associated with either a single mutation in one exon of the keratin-3 gene (KRT3) or a single mutation in one of two exons of the keratin-12 gene (KRT12). Each mutation leads to a predicted amino acid change in the respective keratin-3 or keratin-12 proteins that combine to form the corneal-specific heterodimeric intermediate filament protein. This case report describes a four-generation Chinese kindred with typical autosomal-dominant MECD. Exon sequencing of KRT3 and KRT12 in six affected and eight unaffected individuals (including two spouses) did not detect any mutations or nucleotide sequence variants. This kindred demonstrates that single mis-sense mutations may be sufficient but are not required in all individuals with the MECD phenotype. It provides a unique opportunity to investigate further genomic and functional heterogeneity in MECD.
Plant Proteins Are Smaller Because They Are Encoded by Fewer Exons than Animal Proteins.
Ramírez-Sánchez, Obed; Pérez-Rodríguez, Paulino; Delaye, Luis; Tiessen, Axel
2016-12-01
Protein size is an important biochemical feature since longer proteins can harbor more domains and therefore can display more biological functionalities than shorter proteins. We found remarkable differences in protein length, exon structure, and domain count among different phylogenetic lineages. While eukaryotic proteins have an average size of 472 amino acid residues (aa), average protein sizes in plant genomes are smaller than those of animals and fungi. Proteins unique to plants are ∼81aa shorter than plant proteins conserved among other eukaryotic lineages. The smaller average size of plant proteins could neither be explained by endosymbiosis nor subcellular compartmentation nor exon size, but rather due to exon number. Metazoan proteins are encoded on average by ∼10 exons of small size [∼176 nucleotides (nt)]. Streptophyta have on average only ∼5.7 exons of medium size (∼230nt). Multicellular species code for large proteins by increasing the exon number, while most unicellular organisms employ rather larger exons (>400nt). Among subcellular compartments, membrane proteins are the largest (∼520aa), whereas the smallest proteins correspond to the gene ontology group of ribosome (∼240aa). Plant genes are encoded by half the number of exons and also contain fewer domains than animal proteins on average. Interestingly, endosymbiotic proteins that migrated to the plant nucleus became larger than their cyanobacterial orthologs. We thus conclude that plants have proteins larger than bacteria but smaller than animals or fungi. Compared to the average of eukaryotic species, plants have ∼34% more but ∼20% smaller proteins. This suggests that photosynthetic organisms are unique and deserve therefore special attention with regard to the evolutionary forces acting on their genomes and proteomes. Copyright © 2016 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
Zipper plot: visualizing transcriptional activity of genomic regions.
Avila Cobos, Francisco; Anckaert, Jasper; Volders, Pieter-Jan; Everaert, Celine; Rombaut, Dries; Vandesompele, Jo; De Preter, Katleen; Mestdagh, Pieter
2017-05-02
Reconstructing transcript models from RNA-sequencing (RNA-seq) data and establishing these as independent transcriptional units can be a challenging task. Current state-of-the-art tools for long non-coding RNA (lncRNA) annotation are mainly based on evolutionary constraints, which may result in false negatives due to the overall limited conservation of lncRNAs. To tackle this problem we have developed the Zipper plot, a novel visualization and analysis method that enables users to simultaneously interrogate thousands of human putative transcription start sites (TSSs) in relation to various features that are indicative for transcriptional activity. These include publicly available CAGE-sequencing, ChIP-sequencing and DNase-sequencing datasets. Our method only requires three tab-separated fields (chromosome, genomic coordinate of the TSS and strand) as input and generates a report that includes a detailed summary table, a Zipper plot and several statistics derived from this plot. Using the Zipper plot, we found evidence of transcription for a set of well-characterized lncRNAs and observed that fewer mono-exonic lncRNAs have CAGE peaks overlapping with their TSSs compared to multi-exonic lncRNAs. Using publicly available RNA-seq data, we found more than one hundred cases where junction reads connected protein-coding gene exons with a downstream mono-exonic lncRNA, revealing the need for a careful evaluation of lncRNA 5'-boundaries. Our method is implemented using the statistical programming language R and is freely available as a webtool.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cuppens, H.; Marynen, P.; Cassiman, J.J.
1993-12-01
The authors have previously shown that about 85% of the mutations in 194 Belgian cystic fibrosis alleles could be detected by a reverse dot-blot assay. In the present study, 50 Belgian chromosomes were analyzed for mutations in the cystic fibrosis transmembrane conductance regulator gene by means of direct solid phase automatic sequencing of PCR products of individual exons. Twenty-six disease mutations and 14 polymorphisms were found. Twelve of these mutations and 3 polymorphisms were not described before. With the exception of one mutant allele carrying two mutations, these mutations were the only mutations found in the complete coding region andmore » their exon/intron boundaries. The total sensitivity of mutant CF alleles that could be identified was 98.5%. Given the heterogeneity of these mutations, most of them very rare, CFTR mutation screening still remains rather complex in the population, and population screening, whether desirable or not, does not appear to be technically feasible with the methods currently available. 24 refs., 1 fig., 2 tabs.« less
Swalla, B J; Just, M A; Pederson, E L; Jeffery, W R
1999-04-01
The Manx gene is required for the development of the tail and other chordate features in the ascidian tadpole larva. To determine the structure of the Manx gene, we isolated and sequenced genomic clones from the tailed ascidian Molgula oculata. The Manx gene contains 9 exons and encodes both major and minor Manx mRNAs, which differ in the length of their 5' untranslated regions. The coding region of the single-copy bobcat gene, which encodes a DEAD-box RNA helicase, is embedded within the first Manx intron. The organization of the bobcat and Manx transcription units was determined by comparing genomic and cDNA clones. The Manx-bobcat gene locus has an unusual organization in which a non-coding first exon is alternatively spliced at the 5' end of two different mRNAs. The bobcat and Manx genes are expressed coordinately during oogenesis and embryogenesis, but not during spermatogenesis, in which bobcat mRNA accumulates independently of Manx mRNA. Similar to Manx, zygotic bobcat transcripts accumulate in the embryonic primordia responsible for generating chordate features, including the dorsal neural tube and notochord, are downregulated during embryogenesis in the tailless species Molgula occulta and are upregulated in M. occulta X M. oculata hybrids, which restore these chordate features. Antisense experiments indicate that zygotic bobcat expression is required for development of the same suite of chordate features as Manx. The results show that the Manx-bobcat gene complex has a role in the development of chordate features in ascidian tadpole larvae.
Splendore, A; Silva, E O; Alonso, L G; Richieri-Costa, A; Alonso, N; Rosa, A; Carakushanky, G; Cavalcanti, D P; Brunoni, D; Passos-Bueno, M R
2000-10-01
Twenty-eight families with a clinical diagnosis of Treacher Collins syndrome were screened for mutations in the 25 coding exons of TCOF1 and their adjacent splice junctions through SSCP and direct sequencing. Pathogenic mutations were detected in 26 patients, yielding the highest detection rate reported so far for this disease (93%) and bringing the number of known disease-causing mutations from 35 to 51. This is the first report to describe clustering of pathogenic mutations. Thirteen novel polymorphic alterations were characterized, confirming previous reports that TCOF1 has an unusually high rate of single-nucleotide polymorphisms (SNPs) within its coding region. We suggest a possible different mechanism leading to TCS or genetic heterogeneity for this condition, as we identified two families with no apparent pathogenic mutation in the gene. Furthermore, our data confirm the absence of genotype-phenotype correlation and reinforce that the apparent anticipation often observed in TCS families is due to ascertainment bias. Copyright 2000 Wiley-Liss, Inc.
Gardner, Elliot M.; Johnson, Matthew G.; Ragone, Diane; Wickett, Norman J.; Zerega, Nyree J. C.
2016-01-01
Premise of the study: We used moderately low-coverage (17×) whole-genome sequencing of Artocarpus camansi (Moraceae) to develop genomic resources for Artocarpus and Moraceae. Methods and Results: A de novo assembly of Illumina short reads (251,378,536 pairs, 2 × 100 bp) accounted for 93% of the predicted genome size. Predicted coding regions were used in a three-way orthology search with published genomes of Morus notabilis and Cannabis sativa. Phylogenetic markers for Moraceae were developed from 333 inferred single-copy exons. Ninety-eight putative MADS-box genes were identified. Analysis of all predicted coding regions resulted in preliminary annotation of 49,089 genes. An analysis of synonymous substitutions for pairs of orthologs (Ks analysis) in M. notabilis and A. camansi strongly suggested a lineage-specific whole-genome duplication in Artocarpus. Conclusions: This study substantially increases the genomic resources available for Artocarpus and Moraceae and demonstrates the value of low-coverage de novo assemblies for nonmodel organisms with moderately large genomes. PMID:27437173
GENCODE: the reference human genome annotation for The ENCODE Project.
Harrow, Jennifer; Frankish, Adam; Gonzalez, Jose M; Tapanari, Electra; Diekhans, Mark; Kokocinski, Felix; Aken, Bronwen L; Barrell, Daniel; Zadissa, Amonida; Searle, Stephen; Barnes, If; Bignell, Alexandra; Boychenko, Veronika; Hunt, Toby; Kay, Mike; Mukherjee, Gaurab; Rajan, Jeena; Despacio-Reyes, Gloria; Saunders, Gary; Steward, Charles; Harte, Rachel; Lin, Michael; Howald, Cédric; Tanzer, Andrea; Derrien, Thomas; Chrast, Jacqueline; Walters, Nathalie; Balasubramanian, Suganthi; Pei, Baikang; Tress, Michael; Rodriguez, Jose Manuel; Ezkurdia, Iakes; van Baren, Jeltje; Brent, Michael; Haussler, David; Kellis, Manolis; Valencia, Alfonso; Reymond, Alexandre; Gerstein, Mark; Guigó, Roderic; Hubbard, Tim J
2012-09-01
The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.
Identification of four novel mutations in the COL4A5 gene of patients with Alport syndrome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lemmink, H.H.; Schroeder, C.H.; Brunner, H.G.
1993-08-01
The type IV collagen [alpha]5 chain (COL4A5) genes of patients with Alport syndrome were tested for major gene rearrangements by Southern blot analysis, using COL4A5 cDNA clones as probes. In addition, individual exons were screened for small mutations by single-strand conformation polymorphism (SSCP) analysis. Four new COL4A5 mutations were detected. A duplication of the nine most 3[prime] located nucleotides of exon 49 and the first nucleotide of intron 49 was identified in the COL4A5 gene of one patient. Two patients displayed single base substitutions leading to, respectively, a proline to threonine and an arginine to glutamine substitution in the C-terminalmore » end. Both substitutions involve amino acids conserved through evolution. In COL4A5 intron 41 a mutation changing the splice acceptor site from AG to AA was identified. All mutations cosegregate with the clinical phenotype of Alport syndrome in affected family members. In a control population of 50 individuals tested by PCR-SSCP these mutations were never identified. Together with two mutations reported previously, a total of six mutations were found in 26 patients with Alport syndrome (23%) after systematic screening of about 30% of the COL4A5 coding region. The clinical features of these six patients are described in detail. 21 refs., 2 figs., 3 tabs.« less
Rahbarnia, Leila; Farajnia, Safar; Babaei, Hossein; Majidi, Jafar; Akbari, Bahman; Ahdi Khosroshahi, Shiva
2016-12-01
Purpose: EGFRvIII as the most common mutant variant of the epidermal growth factor receptor is resulting from deletion of exons 2-7 in the coding sequence and junction of exons 1 and 8 through a novel glycine residue. EGFRvIII is highly expressed in glioblastoma, carcinoma of the breast, ovary, and lung but not in normal cells. The aim of the present study was identification of a novel single chain antibody against EGFRvIII as a promising target for cancer therapy. Methods: In this study, a synthetic peptide corresponding to EGFRvIII protein was used for screening a naive human scFv phage library. A novel five-round selection strategy was used for enrichment of rare specific clones. Results: After five rounds of screening, six positive scFv clones against EGFRvIII were selected using monoclonal phage ELISA, among them, only three clones had expected size in PCR reaction. The specific interaction of two of the scFv clones with EGFRvIII was confirmed by indirect ELISA. One phage clone with higher affinity in scFv ELISA was purified for further analysis. The purity of the produced scFv antibody was confirmed using SDS-PAGE and Western blotting analyses. Conclusion: In the present study, a human anti- EGFRvIII scFv with high affinity was first identified from a scFv phage library. This study can be the groundwork for developing more effective diagnostic and therapeutic agents against EGFRvIII expressing cancers.
zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs.
Parekh, Swati; Ziegenhain, Christoph; Vieth, Beate; Enard, Wolfgang; Hellmann, Ines
2018-06-01
Single-cell RNA-sequencing (scRNA-seq) experiments typically analyze hundreds or thousands of cells after amplification of the cDNA. The high throughput is made possible by the early introduction of sample-specific bar codes (BCs), and the amplification bias is alleviated by unique molecular identifiers (UMIs). Thus, the ideal analysis pipeline for scRNA-seq data needs to efficiently tabulate reads according to both BC and UMI. zUMIs is a pipeline that can handle both known and random BCs and also efficiently collapse UMIs, either just for exon mapping reads or for both exon and intron mapping reads. If BC annotation is missing, zUMIs can accurately detect intact cells from the distribution of sequencing reads. Another unique feature of zUMIs is the adaptive downsampling function that facilitates dealing with hugely varying library sizes but also allows the user to evaluate whether the library has been sequenced to saturation. To illustrate the utility of zUMIs, we analyzed a single-nucleus RNA-seq dataset and show that more than 35% of all reads map to introns. Also, we show that these intronic reads are informative about expression levels, significantly increasing the number of detected genes and improving the cluster resolution. zUMIs flexibility makes if possible to accommodate data generated with any of the major scRNA-seq protocols that use BCs and UMIs and is the most feature-rich, fast, and user-friendly pipeline to process such scRNA-seq data.
Regulation of alternative splicing at the single-cell level.
Faigenbloom, Lior; Rubinstein, Nimrod D; Kloog, Yoel; Mayrose, Itay; Pupko, Tal; Stein, Reuven
2015-12-28
Alternative splicing is a key cellular mechanism for generating distinct isoforms, whose relative abundances regulate critical cellular processes. It is therefore essential that inclusion levels of alternative exons be tightly regulated. However, how the precision of inclusion levels among individual cells is governed is poorly understood. Using single-cell gene expression, we show that the precision of inclusion levels of alternative exons is determined by the degree of evolutionary conservation at their flanking intronic regions. Moreover, the inclusion levels of alternative exons, as well as the expression levels of the transcripts harboring them, also contribute to this precision. We further show that alternative exons whose inclusion levels are considerably changed during stem cell differentiation are also subject to this regulation. Our results imply that alternative splicing is coordinately regulated to achieve accuracy in relative isoform abundances and that such accuracy may be important in determining cell fate. © 2015 The Authors. Published under the terms of the CC BY 4.0 license.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rorsman, F.; Bywater, M.; Knott, T.J.
The human platelet-derived growth factor (PDGF) A-chain locus was characterized by restriction endonuclease analysis, and the nucleotide sequence of its exons was determined. Seven exons were identified, spanning approximately 22 kilobase pairs of genomic DNA. Alternative exon usage, identified by cDNA cloning, occurs in a human glioblastoma cell line and may give rise to two types of A-chain precursors with different C termini. The exon-intron arrangement was similar to that of the PDGF B-chain/sis locus and seemed to divide the precursor proteins into functional domains. Southern blot analysis of genomic DNA showed that a single PDGF A-chain gene was presentmore » in the human genome.« less
A 5′ Splice Site-Proximal Enhancer Binds SF1 and Activates Exon Bridging of a Microexon
Carlo, Troy; Sierra, Rebecca; Berget, Susan M.
2000-01-01
Internal exon size in vertebrates occurs over a narrow size range. Experimentally, exons shorter than 50 nucleotides are poorly included in mRNA unless accompanied by strengthened splice sites or accessory sequences that act as splicing enhancers, suggesting steric interference between snRNPs and other splicing factors binding simultaneously to the 3′ and 5′ splice sites of microexons. Despite these problems, very small naturally occurring exons exist. Here we studied the factors and mechanism involved in recognizing a constitutively included six-nucleotide exon from the cardiac troponin T gene. Inclusion of this exon is dependent on an enhancer located downstream of the 5′ splice site. This enhancer contains six copies of the simple sequence GGGGCUG. The enhancer activates heterologous microexons and will work when located either upstream or downstream of the target exon, suggesting an ability to bind factors that bridge splicing units. A single copy of this sequence is sufficient for in vivo exon inclusion and is the binding site for the known bridging mammalian splicing factor 1 (SF1). The enhancer and its bound SF1 act to increase recognition of the upstream exon during exon definition, such that competition of in vitro reactions with RNAs containing the GGGGCUG repeated sequence depress splicing of the upstream intron, assembly of the spliceosome on the 3′ splice site of the exon, and cross-linking of SF1. These results suggest a model in which SF1 bridges the small exon during initial assembly, thereby effectively extending the domain of the exon. PMID:10805741
Dutta, Shruti; Guhathakurta, Subhrangshu; Sinha, Swagata; Chatterjee, Anindita; Ahmed, Shabina; Ghosh, Saurabh; Gangopadhyay, Prasanta K; Singh, Manoranjan; Usha, Rajamma
2007-01-05
Autism is a neurodevelopmental disorder with high heritability factor and the reelin gene, which codes for an extracellular matrix protein involved with neuronal migration and lamination is being investigated as a positional and functional candidate gene for autism. It is located on chromosome 7q22 within the autism susceptible locus (AUTS1); identified in earlier genome scans and several investigations have been carried out on various ethnic groups to assess possible association and linkage of the gene with autism. However, the findings are still inconclusive. In the present study which represents the first report of such a study on the Indian population, genotyping analyses of CGG repeat polymorphism at 5'UTR, two single nucleotide polymorphisms (SNP) at exon 6 and exon 50 were performed in 73 autistic subjects, 129 parents, and 80 controls. The allelic distributions of the repeat polymorphism and exon 50 T/C SNP were quite different from earlier reports in other populations. Allelic and genotypic distribution of the markers did not show any differences between the cases and controls. While our preliminary data on family-based association studies on 58 trios showed no preferential transmission of any allele from the parents to the affected offspring, TDT and HHRR analyses revealed significant paternal transmission distortions for 10- and > or =11-repeat alleles of CGG repeat polymorphism. Thus, the present study suggests that 5'UTR of reelin gene may have a role in the susceptibility towards autism with the paternal transmission and non-transmission respectively of 10- and > or =11-repeat alleles, to the affected offspring.
Conservation of CD44 exon v3 functional elements in mammals
Vela, Elena; Hilari, Josep M; Delclaux, María; Fernández-Bellon, Hugo; Isamat, Marcos
2008-01-01
Background The human CD44 gene contains 10 variable exons (v1 to v10) that can be alternatively spliced to generate hundreds of different CD44 protein isoforms. Human CD44 variable exon v3 inclusion in the final mRNA depends on a multisite bipartite splicing enhancer located within the exon itself, which we have recently described, and provides the protein domain responsible for growth factor binding to CD44. Findings We have analyzed the sequence of CD44v3 in 95 mammalian species to report high conservation levels for both its splicing regulatory elements (the 3' splice site and the exonic splicing enhancer), and the functional glycosaminglycan binding site coded by v3. We also report the functional expression of CD44v3 isoforms in peripheral blood cells of different mammalian taxa with both consensus and variant v3 sequences. Conclusion CD44v3 mammalian sequences maintain all functional splicing regulatory elements as well as the GAG binding site with the same relative positions and sequence identity previously described during alternative splicing of human CD44. The sequence within the GAG attachment site, which in turn contains the Y motif of the exonic splicing enhancer, is more conserved relative to the rest of exon. Amplification of CD44v3 sequence from mammalian species but not from birds, fish or reptiles, may lead to classify CD44v3 as an exclusive mammalian gene trait. PMID:18710510
Detection of BRCA1 gross rearrangements by droplet digital PCR.
Preobrazhenskaya, Elena V; Bizin, Ilya V; Kuligina, Ekatherina Sh; Shleykina, Alla Yu; Suspitsin, Evgeny N; Zaytseva, Olga A; Anisimova, Elena I; Laptiev, Sergey A; Gorodnova, Tatiana V; Belyaev, Alexey M; Imyanitov, Evgeny N; Sokolenko, Anna P
2017-10-01
Large genomic rearrangements (LGRs) constitute a significant share of pathogenic BRCA1 mutations. Multiplex ligation-dependent probe amplification (MLPA) is a leading method for LGR detection; however, it is entirely based on the use of commercial kits, includes relatively time-consuming hybridization step, and is not convenient for large-scale screening of recurrent LGRs. We developed and validated the droplet digital PCR (ddPCR) assay, which covers the entire coding region of BRCA1 gene and is capable to precisely quantitate the copy number for each exon. 141 breast cancer (BC) patients, who demonstrated evident clinical features of hereditary BC but turned out to be negative for founder BRCA1/2 mutations, were subjected to the LGR analysis. Four patients with LGR were identified, with three cases of exon 8 deletion and one women carrying the deletion of exons 5-7. Excellent concordance with MLPA test was observed. Exon 8 copy number was tested in additional 720 BC and 184 ovarian cancer (OC) high-risk patients, and another four cases with the deletion were revealed; MLPA re-analysis demonstrated that exon 8 loss was a part of a larger genetic alteration in two cases, while the remaining two patients had isolated defect of exon 8. Long-range PCR and next generation sequencing of DNA samples carrying exon 8 deletion revealed two types of recurrent LGRs. Droplet digital PCR is a reliable tool for the detection of large genomic rearrangements.
Global Identification and Characterization of Transcriptionally Active Regions in the Rice Genome
Stolc, Viktor; Deng, Wei; He, Hang; Korbel, Jan; Chen, Xuewei; Tongprasit, Waraporn; Ronald, Pamela; Chen, Runsheng; Gerstein, Mark; Wang Deng, Xing
2007-01-01
Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza. sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome. PMID:17372628
Lack of mutations in the leptin receptor gene in severely obese children.
Dias, Natasha Favoretto; Fernandes, Ariana Ester; Melo, Maria Edna de; Reinhardt, Heidi Lui; Cercato, Cintia; Villares, Sandra Mara Ferreira; Halpern, Alfredo; Mancini, Marcio C
2012-04-01
To analyze the LEPR gene in obese children and to investigate the associations between molecular findings and anthropometric and metabolic features. Thirty-two patients were evaluated regarding anthropometric characteristics, blood pressure, heart rate, serum glucose, insulin, leptin levels, and lipid profile. The molecular study consisted of the amplification and automatic sequencing of the coding region of LEPR in order to investigate new mutations. We identified a high prevalence of metabolic disorders: impaired fasting glucose in 12.5% of the patients, elevated HOMA-IR in 85.7%, low HDL-cholesterol levels in 46.9%, high triglyceride levels in 40.6%, and hypertension in 58.6% of the patients. The molecular study identified 6 already described allelic variants: rs1137100 (exon-2), rs1137101 (exon-4), rs1805134 (exon-7), rs8179183 (exon-12), rs1805096 (exon-18), and the deletion/insertion of the pentanucleotide CTTTA at 3'untranslated region. The frequency of alleles observed in this cohort is similar to that described in the literature, and was not correlated with any clinical feature. The molecular findings in the analysis of the LEPR did not seem to be implicated in the etiology of obesity in these patients.
Decoding of exon splicing patterns in the human RUNX1-RUNX1T1 fusion gene.
Grinev, Vasily V; Migas, Alexandr A; Kirsanava, Aksana D; Mishkova, Olga A; Siomava, Natalia; Ramanouskaya, Tatiana V; Vaitsiankova, Alina V; Ilyushonak, Ilia M; Nazarov, Petr V; Vallar, Laurent; Aleinikova, Olga V
2015-11-01
The t(8;21) translocation is the most widespread genetic defect found in human acute myeloid leukemia. This translocation results in the RUNX1-RUNX1T1 fusion gene that produces a wide variety of alternative transcripts and influences the course of the disease. The rules of combinatorics and splicing of exons in the RUNX1-RUNX1T1 transcripts are not known. To address this issue, we developed an exon graph model of the fusion gene organization and evaluated its local exon combinatorics by the exon combinatorial index (ECI). Here we show that the local exon combinatorics of the RUNX1-RUNX1T1 gene follows a power-law behavior and (i) the vast majority of exons has a low ECI, (ii) only a small part is represented by "exons-hubs" of splicing with very high ECI values, and (iii) it is scale-free and very sensitive to targeted skipping of "exons-hubs". Stochasticity of the splicing machinery and preferred usage of exons in alternative splicing can explain such behavior of the system. Stochasticity may explain up to 12% of the ECI variance and results in a number of non-coding and unproductive transcripts that can be considered as a noise. Half-life of these transcripts is increased due to the deregulation of some key genes of the nonsense-mediated decay system in leukemia cells. On the other hand, preferred usage of exons may explain up to 75% of the ECI variability. Our analysis revealed a set of splicing-related cis-regulatory motifs that can explain "attractiveness" of exons in alternative splicing but only when they are considered together. Cis-regulatory motifs are guides for splicing trans-factors and we observed a leukemia-specific profile of expression of the splicing genes in t(8;21)-positive blasts. Altogether, our results show that alternative splicing of the RUNX1-RUNX1T1 transcripts follows strict rules and that the power-law component of the fusion gene organization confers a high flexibility to this process. Copyright © 2015 Elsevier Ltd. All rights reserved.
Interactive web-based identification and visualization of transcript shared sequences.
Azhir, Alaleh; Merino, Louis-Henri; Nauen, David W
2018-05-12
We have developed TraC (Transcript Consensus), a web-based tool for detecting and visualizing shared sequences among two or more mRNA transcripts such as splice variants. Results including exon-exon boundaries are returned in a highly intuitive, data-rich, interactive plot that permits users to explore the similarities and differences of multiple transcript sequences. The online tool (http://labs.pathology.jhu.edu/nauen/trac/) is free to use. The source code is freely available for download (https://github.com/nauenlab/TraC). Copyright © 2018 Elsevier Inc. All rights reserved.
Palti, Y.; Rodriguez, M.F.; Gahr, S.A.; Purcell, M.K.; Rexroad, C. E.; Wiens, G.D.
2010-01-01
Induction of innate immune pathways is critical for early anti-microbial defense but there is limited understanding of how teleosts recognize microbial molecules and activate these pathways. In mammals, Toll-like receptors (TLR) 1 and 2 form a heterodimer involved in recognizing peptidoglycans and lipoproteins of microbial origin. Herein, we identify and describe the rainbow trout (Oncorhynchus mykiss) TLR1 gene ortholog and its mRNA expression. Two TLR1 loci were identified from a rainbow trout bacterial artificial chromosome (BAC) library using DNA sequencing and genetic linkage analyses. Full length cDNA clone and direct sequencing of four BACs revealed an intact omTLR1 open reading frame (ORF) located on chromosome 14 and a second locus on chromosome 25 that contains a TLR1 pseudogene. The duplicated trout loci exhibit conserved synteny with other fish genomes that extends beyond the TLR1 gene sequences. The omTLR1 gene includes a single large coding exon similar to all other described TLR1 genes, but unlike other teleosts it also has a 5??? UTR exon and intron preceding the large coding exon. The omTLR1 ORF is predicted to encode an 808 amino-acid protein with 69% similarity to the Fugu TLR1 and a conserved pattern of predicted leucine-rich repeats (LRR). Phylogenetic analysis grouped omTLR1 with other fish TLR1 genes on a separate branch from the avian TLR1 and mammalian TLR1, 6 and 10. omTLR1 expression levels in rainbow trout anterior kidney leukocytes were not affected by the human TLR2/6 and TLR2/1 agonists diacylated lipoprotein (Pam2CSK4) and triacylated lipoprotein (Pam3CSK4). However, due to the lack of TLR6 and 10 genes in teleost genomes and up-regulation of TLR1 mRNA in response to LPS and bacterial infection in other fish species we hypothesize an important role for omTLR1 in anti-microbial immunity. Therefore, the identification of a TLR2 ortholog in rainbow trout and the development of assays to measure ligand binding and downstream signaling are critical for future elucidation of omTLR1 functions.
Palti, Yniv; Rodriguez, M. Fernanda; Gahr, Scott A.; Purcell, Maureen K.; Rexroad, Caird E.; Wiens, Gregory D.
2010-01-01
Induction of innate immune pathways is critical for early anti-microbial defense but there is limited understanding of how teleosts recognize microbial molecules and activate these pathways. In mammals, Toll-like receptors (TLR) 1 and 2 form a heterodimer involved in recognizing peptidoglycans and lipoproteins of microbial origin. Herein, we identify and describe the rainbow trout (Oncorhynchus mykiss) TLR1 gene ortholog and its mRNA expression. Two TLR1 loci were identified from a rainbow trout bacterial artificial chromosome (BAC) library using DNA sequencing and genetic linkage analyses. Full length cDNA clone and direct sequencing of four BACs revealed an intact omTLR1 open reading frame (ORF) located on chromosome 14 and a second locus on chromosome 25 that contains a TLR1 pseudogene. The duplicated trout loci exhibit conserved synteny with other fish genomes that extends beyond the TLR1 gene sequences. The omTLR1 gene includes a single large coding exon similar to all other described TLR1 genes, but unlike other teleosts it also has a 5' UTR exon and intron preceding the large coding exon. The omTLR1 ORF is predicted to encode an 808 amino-acid protein with 69% similarity to the Fugu TLR1 and a conserved pattern of predicted leucine-rich repeats (LRR). Phylogenetic analysis grouped omTLR1 with other fish TLR1 genes on a separate branch from the avian TLR1 and mammalian TLR1, 6 and 10. omTLR1 expression levels in rainbow trout anterior kidney leukocytes were not affected by the human TLR2/6 and TLR2/1 agonists diacylated lipoprotein (Pam2CSK4) and triacylated lipoprotein (Pam3CSK4). However, due to the lack of TLR6 and 10 genes in teleost genomes and up-regulation of TLR1 mRNA in response to LPS and bacterial infection in other fish species we hypothesize an important role for omTLR1 in anti-microbial immunity. Therefore, the identification of a TLR2 ortholog in rainbow trout and the development of assays to measure ligand binding and downstream signaling are critical for future elucidation of omTLR1 functions.
Li, Rui; Liao, Xian-Hua; Ye, Jun-Zhao; Li, Min-Rui; Wu, Yan-Qin; Hu, Xuan; Zhong, Bi-Hui
2017-06-14
To test the hypothesis that K8/K18 variants predispose humans to non-alcoholic fatty liver disease (NAFLD) progression and its metabolic phenotypes. We selected a total of 373 unrelated adult subjects from our Physical Examination Department, including 200 unrelated NAFLD patients and 173 controls of both genders and different ages. Diagnoses of NAFLD were established according to ultrasonic signs of fatty liver. All subjects were tested for population characteristics, lipid profile, liver tests, as well as glucose tests. Genomic DNA was obtained from peripheral blood with a DNeasy Tissue Kit. K8/K18 coding regions were analyzed, including 15 exons and exon-intron boundaries. Among 200 NAFLD patients, 10 (5%) heterozygous carriers of keratin variants were identified. There were 5 amino-acid-altering heterozygous variants and 6 non-coding heterozygous variants. One novel amino-acid-altering heterozygous variant (K18 N193S) and three novel non-coding variants were observed (K8 IVS5-9A→G, K8 IVS6+19G→A, K18 T195T). A total of 9 patients had a single variant and 1 patient had compound variants (K18 N193S+K8 IVS3-15C→G). Only one R341H variant was found in the control group (1 of 173, 0.58%). The frequency of keratin variants in NAFLD patients was significantly higher than that in the control group (5% vs 0.58%, P = 0.015). Notably, the keratin variants were significantly associated with insulin resistance (IR) in NAFLD patients (8.86% in NAFLD patients with IR vs 2.5% in NAFLD patients without IR, P = 0.043). K8/K18 variants are overrepresented in Chinese NAFLD patients and might accelerate liver fat storage through IR.
Intron self-complementarity enforces exon inclusion in a yeast pre-mRNA
Howe, Kenneth James; Ares, Manuel
1997-01-01
Skipping of internal exons during removal of introns from pre-mRNA must be avoided for proper expression of most eukaryotic genes. Despite significant understanding of the mechanics of intron removal, mechanisms that ensure inclusion of internal exons in multi-intron pre-mRNAs remain mysterious. Using a natural two-intron yeast gene, we have identified distinct RNA–RNA complementarities within each intron that prevent exon skipping and ensure inclusion of internal exons. We show that these complementarities are positioned to act as intron identity elements, bringing together only the appropriate 5′ splice sites and branchpoints. Destroying either intron self-complementarity allows exon skipping to occur, and restoring the complementarity using compensatory mutations rescues exon inclusion, indicating that the elements act through formation of RNA secondary structure. Introducing new pairing potential between regions near the 5′ splice site of intron 1 and the branchpoint of intron 2 dramatically enhances exon skipping. Similar elements identified in single intron yeast genes contribute to splicing efficiency. Our results illustrate how intron secondary structure serves to coordinate splice site pairing and enforce exon inclusion. We suggest that similar elements in vertebrate genes could assist in the splicing of very large introns and in the evolution of alternative splicing. PMID:9356473
Drosha Promotes Splicing of a Pre-microRNA-like Alternative Exon
Havens, Mallory A.; Reich, Ashley A.; Hastings, Michelle L.
2014-01-01
The ribonuclease III enzyme Drosha has a central role in the biogenesis of microRNA (miRNA) by binding and cleaving hairpin structures in primary RNA transcripts into precursor miRNAs (pre-miRNAs). Many miRNA genes are located within protein-coding host genes and cleaved by Drosha in a manner that is coincident with splicing of introns by the spliceosome. The close proximity of splicing and pre-miRNA biogenesis suggests a potential for co-regulation of miRNA and host gene expression, though this relationship is not completely understood. Here, we describe a cleavage-independent role for Drosha in the splicing of an exon that has a predicted hairpin structure resembling a Drosha substrate. We find that Drosha can cleave the alternatively spliced exon 5 of the eIF4H gene into a pre-miRNA both in vitro and in cells. However, the primary role of Drosha in eIF4H gene expression is to promote the splicing of exon 5. Drosha binds to the exon and enhances splicing in a manner that depends on RNA structure but not on cleavage by Drosha. We conclude that Drosha can function like a splicing enhancer and promote exon inclusion. Our results reveal a new mechanism of alternative splicing regulation involving a cleavage-independent role for Drosha in splicing. PMID:24786770
The sequence, structure and evolutionary features of HOTAIR in mammals
2011-01-01
Background An increasing number of long noncoding RNAs (lncRNAs) have been identified recently. Different from all the others that function in cis to regulate local gene expression, the newly identified HOTAIR is located between HoxC11 and HoxC12 in the human genome and regulates HoxD expression in multiple tissues. Like the well-characterised lncRNA Xist, HOTAIR binds to polycomb proteins to methylate histones at multiple HoxD loci, but unlike Xist, many details of its structure and function, as well as the trans regulation, remain unclear. Moreover, HOTAIR is involved in the aberrant regulation of gene expression in cancer. Results To identify conserved domains in HOTAIR and study the phylogenetic distribution of this lncRNA, we searched the genomes of 10 mammalian and 3 non-mammalian vertebrates for matches to its 6 exons and the two conserved domains within the 1800 bp exon6 using Infernal. There was just one high-scoring hit for each mammal, but many low-scoring hits were found in both mammals and non-mammalian vertebrates. These hits and their flanking genes in four placental mammals and platypus were examined to determine whether HOTAIR contained elements shared by other lncRNAs. Several of the hits were within unknown transcripts or ncRNAs, many were within introns of, or antisense to, protein-coding genes, and conservation of the flanking genes was observed only between human and chimpanzee. Phylogenetic analysis revealed discrete evolutionary dynamics for orthologous sequences of HOTAIR exons. Exon1 at the 5' end and a domain in exon6 near the 3' end, which contain domains that bind to multiple proteins, have evolved faster in primates than in other mammals. Structures were predicted for exon1, two domains of exon6 and the full HOTAIR sequence. The sequence and structure of two fragments, in exon1 and the domain B of exon6 respectively, were identified to robustly occur in predicted structures of exon1, domain B of exon6 and the full HOTAIR in mammals. Conclusions HOTAIR exists in mammals, has poorly conserved sequences and considerably conserved structures, and has evolved faster than nearby HoxC genes. Exons of HOTAIR show distinct evolutionary features, and a 239 bp domain in the 1804 bp exon6 is especially conserved. These features, together with the absence of some exons and sequences in mouse, rat and kangaroo, suggest ab initio generation of HOTAIR in marsupials. Structure prediction identifies two fragments in the 5' end exon1 and the 3' end domain B of exon6, with sequence and structure invariably occurring in various predicted structures of exon1, the domain B of exon6 and the full HOTAIR. PMID:21496275
Maia, Rafaela M; Valente, Valeria; Cunha, Marco A V; Sousa, Josane F; Araujo, Daniela D; Silva, Wilson A; Zago, Marco A; Dias-Neto, Emmanuel; Souza, Sandro J; Simpson, Andrew J G; Monesi, Nadia; Ramos, Ricardo G P; Espreafico, Enilza M; Paçó-Larson, Maria L
2007-07-24
The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome. Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA+ northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis. In agreement with the proposal that this locus is co-regulated in response to microorganisms infection, we show here that SP212 is also up-regulated upon injury. Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that the computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data.
Maia, Rafaela M; Valente, Valeria; Cunha, Marco AV; Sousa, Josane F; Araujo, Daniela D; Silva, Wilson A; Zago, Marco A; Dias-Neto, Emmanuel; Souza, Sandro J; Simpson, Andrew JG; Monesi, Nadia; Ramos, Ricardo GP; Espreafico, Enilza M; Paçó-Larson, Maria L
2007-01-01
Background The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome. Results Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA+ northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis. In agreement with the proposal that this locus is co-regulated in response to microorganisms infection, we show here that SP212 is also up-regulated upon injury. Conclusion Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that the computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data. PMID:17650329
Molecular characterization of the vitamin D receptor (VDR) gene in Holstein cows.
Ali, Mayar O; El-Adl, Mohamed A; Ibrahim, Hussam M M; Elseedy, Youssef Y; Rizk, Mohamed A; El-Khodery, Sabry A
2018-06-01
Vitamin D plays a vital role in calcium homeostasis, growth, and immunoregulation. Because little is known about the vitamin D receptor (VDR) gene in cattle, the aim of the present investigation was to present the molecular characterization of exons 5 and 6 of the VDR gene in Holstein cows. DNA extraction, genomic sequencing, phylogenetic analysis, synteny mapping and single nucleotide gene polymorphism analysis of the VDR gene were performed to assess blood samples collected from 50 clinically healthy Holstein cows. The results revealed the presence of a 450-base pair (bp) nucleotide sequence that resembled exons 5 and 6 with intron 5 enclosed between these exons. Sequence alignment and phylogenetic analysis revealed a close relationship between the sequenced VDR region and that found in Hereford cattle. A close association between this region and the corresponding region in small ruminants was also documented. Moreover, a single nucleotide polymorphism (SNP) that caused the replacement of a glutamate with an arginine in the deduced amino acid sequence was detected at position 7 of exon 5. In conclusion, Holstein and Hereford cattle differ with respect to exon 5 of the VDR gene. Phylogenetic analysis of the VDR gene based on nucleotide sequence produced different results from prior analyses based on amino acid sequence. Copyright © 2018 Elsevier Ltd. All rights reserved.
Geyer, David D.; Spence, M. Anne; Johannes, Meriam; Flodman, Pamela; Clancy, Kevin P.; Berry, Rebecca; Sparkes, Robert S.; Jonsen, Matthew D.; Isenberg, Sherwin J.; Bateman, J. Bronwyn
2006-01-01
PURPOSE To further elucidate the cataract phenotype, and identify the gene and mutation for autosomal dominant cataract (ADC) in an American family of European descent (ADC2) by sequencing the major intrinsic protein gene (MIP), a candidate based on linkage to chromosome 12q13. DESIGN Observational case series and laboratory experimental study. METHODS We examined two at-risk individuals in ADC2. We PCR-amplified and sequenced all four exons and all intron-exon boundaries of the MIP gene from genomic and cloned DNA in affected members to confirm one variant as the putative mutation. RESULTS We found a novel single deletion of nucleotide (nt) 3223 (within codon 235) in exon four, causing a frameshift that alters 41 of 45 subsequent amino acids and creates a premature stop codon. CONCLUSIONS We identified a novel single base pair deletion in the MIP gene and conclude that it is a pathogenic sequence alteration. PMID:16564824
... exons, the parts of DNA that code for proteins in the body. Researchers like this method because it is faster and cheaper. Learn More More still needs to be done before whole genome sequencing becomes a routine part of medical care. Many ...
The Rise and Fall of the Gene.
ERIC Educational Resources Information Center
Mahadeva, Madhu; Randerson, Sherman
1985-01-01
Summarizes the current state of genetics, highlighting major historical events in the development of the field and discussing topics related to introns ("silent" or noncoding base sequences in eucaryotic genes) and exons (the coding parts of DNA). (JN)
Combined array CGH plus SNP genome analyses in a single assay for optimized clinical testing
Wiszniewska, Joanna; Bi, Weimin; Shaw, Chad; Stankiewicz, Pawel; Kang, Sung-Hae L; Pursley, Amber N; Lalani, Seema; Hixson, Patricia; Gambin, Tomasz; Tsai, Chun-hui; Bock, Hans-Georg; Descartes, Maria; Probst, Frank J; Scaglia, Fernando; Beaudet, Arthur L; Lupski, James R; Eng, Christine; Wai Cheung, Sau; Bacino, Carlos; Patel, Ankita
2014-01-01
In clinical diagnostics, both array comparative genomic hybridization (array CGH) and single nucleotide polymorphism (SNP) genotyping have proven to be powerful genomic technologies utilized for the evaluation of developmental delay, multiple congenital anomalies, and neuropsychiatric disorders. Differences in the ability to resolve genomic changes between these arrays may constitute an implementation challenge for clinicians: which platform (SNP vs array CGH) might best detect the underlying genetic cause for the disease in the patient? While only SNP arrays enable the detection of copy number neutral regions of absence of heterozygosity (AOH), they have limited ability to detect single-exon copy number variants (CNVs) due to the distribution of SNPs across the genome. To provide comprehensive clinical testing for both CNVs and copy-neutral AOH, we enhanced our custom-designed high-resolution oligonucleotide array that has exon-targeted coverage of 1860 genes with 60 000 SNP probes, referred to as Chromosomal Microarray Analysis – Comprehensive (CMA-COMP). Of the 3240 cases evaluated by this array, clinically significant CNVs were detected in 445 cases including 21 cases with exonic events. In addition, 162 cases (5.0%) showed at least one AOH region >10 Mb. We demonstrate that even though this array has a lower density of SNP probes than other commercially available SNP arrays, it reliably detected AOH events >10 Mb as well as exonic CNVs beyond the detection limitations of SNP genotyping. Thus, combining SNP probes and exon-targeted array CGH into one platform provides clinically useful genetic screening in an efficient manner. PMID:23695279
Glaus, Esther; Lorenz, Birgit; Netzer, Christian; Li, Yün; Schambeck, Maria; Wittmer, Mariana; Feil, Silke; Kirschner-Schwabe, Renate; Rosenberg, Thomas; Cremers, Frans P.M.; Bergen, Arthur A.B.; Barthelmes, Daniel; Baraki, Husnia; Schmid, Fabian; Tanner, Gaby; Fleischhauer, Johannes; Orth, Ulrike; Becker, Christian; Wegscheider, Erika; Nürnberg, Gudrun; Nürnberg, Peter; Bolz, Hanno Jörn; Gal, Andreas; Berger, Wolfgang
2008-01-01
Purpose The goal of this study was to identify mutations in X-chromosomal genes associated with retinitis pigmentosa (RP) in patients from Germany, The Netherlands, Denmark, and Switzerland. Methods In addition to all coding exons of RP2, exons 1 through 15, 9a, ORF15, 15a and 15b of RPGR were screened for mutations. PCR products were amplified from genomic DNA extracted from blood samples and analyzed by direct sequencing. In one family with apparently dominant inheritance of RP, linkage analysis identified an interval on the X chromosome containing RPGR, and mutation screening revealed a pathogenic variant in this gene. Patients of this family were examined clinically and by X-inactivation studies. Results This study included 141 RP families with possible X-chromosomal inheritance. In total, we identified 46 families with pathogenic sequence alterations in RPGR and RP2, of which 17 mutations have not been described previously. Two of the novel mutations represent the most 3’-terminal pathogenic sequence variants in RPGR and RP2 reported to date. In exon ORF15 of RPGR, we found eight novel and 14 known mutations. All lead to a disruption of open reading frame. Of the families with suggested X-chromosomal inheritance, 35% showed mutations in ORF15. In addition, we found five novel mutations in other exons of RPGR and four in RP2. Deletions in ORF15 of RPGR were identified in three families in which female carriers showed variable manifestation of the phenotype. Furthermore, an ORF15 mutation was found in an RP patient who additionally carries a 6.4 kbp deletion downstream of the coding region of exon ORF15. We did not identify mutations in 39 sporadic male cases from Switzerland. Conclusions RPGR mutations were confirmed to be the most frequent cause of RP in families with an X-chromosomal inheritance pattern. We propose a screening strategy to provide molecular diagnostics in these families. PMID:18552978
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ploos van Amstel, H.; Reitsma, P.H.; van der Logt, C.P.
The human protein S locus on chromosome 3 consists of two protein S genes, PS{alpha} and PS{beta}. Here the authors report the cloning and characterization of both genes. Fifteen exons of the PS{alpha} gene were identified that together code for protein S mRNA as derived from the reported protein S cDNAs. Analysis by primer extension of liver protein S mRNA, however, reveals the presence of two mRNA forms that differ in the length of their 5{prime}-noncoding region. Both transcripts contain a 5{prime}-noncoding region longer than found in the protein S cDNAs. The two products may arise from alternative splicing ofmore » an additional intron in this region or from the usage of two start sites for transcription. The intron-exon organization of the PS{alpha} gene fully supports the hypothesis that the protein S gene is the product of an evolutional assembling process in which gene modules coding for structural/functional protein units also found in other coagulation proteins have been put upstream of the ancestral gene of a steroid hormone binding protein. The PS{beta} gene is identified as a pseudogene. It contains a large variety of detrimental aberrations, viz., the absence of exon I, a splice site mutation, three stop codons, and a frame shift mutation. Overall the two genes PS{alpha} and PS{beta} show between their exonic sequences 96.5% homology. Southern analysis of primate DNA showed that the duplication of the ancestral protein S gene has occurred after the branching of the orangutan from the African apes. A nonsense mutation that is present in the pseudogene of man also could be identified in one of the two protein S genes of both chimpanzee and gorilla. This implicates that silencing of one of the two protein S genes must have taken place before the divergence of the three African apes.« less
Li, Chun-Xiao; Jiang, Mei-Shan; Chen, Shi-Yi; Lai, Song-Jia
2008-07-01
Single nucleotide polymorphism (SNP) in exon 1 and 3 of fibroblast growth factor (FGF5) gene was studied by DNA sequencing in Yingjing angora rabbit, Tianfu black rabbit and California rabbit. A frameshift mutation (TCT insert) at base position 217 (site A) of exon 1 and a T/C missense mutation at base position 59 (site B) of exon 3 were found in Yingjing angora rabbit with a high frequency; a T/C same-sense mutation at base position 3 (site C) of exon 3 was found with similar frequency in three rabbit breeds. Least square analysis showed that different genotypes had no significant association with wool yield in site A, and had high significant association with wool yield in site B (P<0.01) and significant association with wool yield in site C (P<0.05). It was concluded from the results that FGF5 gene could be the potential major gene affecting wool yield or link with the major gene, and polymorphic loci B and C may be used as molecular markers for im-proving wool yield in angora rabbits.
Comparative architecture of silks, fibrous proteins and their encoding genes in insects and spiders.
Craig, Catherine L; Riekel, Christian
2002-12-01
The known silk fibroins and fibrous glues are thought to be encoded by members of the same gene family. All silk fibroins sequenced to date contain regions of long-range order (crystalline regions) and/or short-range order (non-crystalline regions). All of the sequenced fibroin silks (Flag or silk from flagelliform gland in spiders; Fhc or heavy chain fibroin silks produced by Lepidoptera larvae) are made up of hierarchically organized, repetitive arrays of amino acids. Fhc fibroin genes are characterized by a similar molecular genetic architecture of two exons and one intron, but the organization and size of these units differs. The Flag, Ser (sericin gene) and BR (Balbiani ring genes; both fibrous proteins) genes are made up of multiple exons and introns. Sequences coding for crystalline and non-crystalline protein domains are integrated in the repetitive regions of Fhc and MA exons, but not in the protein glues Ser1 and BR-1. Genetic 'hot-spots' promote recombination errors in Fhc, MA, and Flag. Codon bias, structural constraint, point mutations, and shortened coding arrays may be alternative means of stabilizing precursor mRNA transcripts. Differential regulation of gene expression and selective splicing of the mRNA transcript may allow rapid adaptation of silk functional properties to different physical environments.
Quantifying the mechanisms of domain gain in animal proteins.
Buljan, Marija; Frankish, Adam; Bateman, Alex
2010-01-01
Protein domains are protein regions that are shared among different proteins and are frequently functionally and structurally independent from the rest of the protein. Novel domain combinations have a major role in evolutionary innovation. However, the relative contributions of the different molecular mechanisms that underlie domain gains in animals are still unknown. By using animal gene phylogenies we were able to identify a set of high confidence domain gain events and by looking at their coding DNA investigate the causative mechanisms. Here we show that the major mechanism for gains of new domains in metazoan proteins is likely to be gene fusion through joining of exons from adjacent genes, possibly mediated by non-allelic homologous recombination. Retroposition and insertion of exons into ancestral introns through intronic recombination are, in contrast to previous expectations, only minor contributors to domain gains and have accounted for less than 1% and 10% of high confidence domain gain events, respectively. Additionally, exonization of previously non-coding regions appears to be an important mechanism for addition of disordered segments to proteins. We observe that gene duplication has preceded domain gain in at least 80% of the gain events. The interplay of gene duplication and domain gain demonstrates an important mechanism for fast neofunctionalization of genes.
Bhasi, Ashwini; Philip, Philge; Manikandan, Vinu; Senapathy, Periannan
2009-01-01
We have developed ExDom, a unique database for the comparative analysis of the exon–intron structures of 96 680 protein domains from seven eukaryotic organisms (Homo sapiens, Mus musculus, Bos taurus, Rattus norvegicus, Danio rerio, Gallus gallus and Arabidopsis thaliana). ExDom provides integrated access to exon-domain data through a sophisticated web interface which has the following analytical capabilities: (i) intergenomic and intragenomic comparative analysis of exon–intron structure of domains; (ii) color-coded graphical display of the domain architecture of proteins correlated with their corresponding exon-intron structures; (iii) graphical analysis of multiple sequence alignments of amino acid and coding nucleotide sequences of homologous protein domains from seven organisms; (iv) comparative graphical display of exon distributions within the tertiary structures of protein domains; and (v) visualization of exon–intron structures of alternative transcripts of a gene correlated to variations in the domain architecture of corresponding protein isoforms. These novel analytical features are highly suited for detailed investigations on the exon–intron structure of domains and make ExDom a powerful tool for exploring several key questions concerning the function, origin and evolution of genes and proteins. ExDom database is freely accessible at: http://66.170.16.154/ExDom/. PMID:18984624
Foulkes, William D; Ghadirian, Parviz; Akbari, Mohammed Reza; Hamel, Nancy; Giroux, Sylvie; Sabbaghian, Nelly; Darnel, Andrew; Royer, Robert; Poll, Aletta; Fafard, Eve; Robidoux, André; Martin, Ginette; Bismar, Tarek A; Tischkowitz, Marc; Rousseau, Francois; Narod, Steven A
2007-01-01
PALB2 has recently been identified as a breast cancer susceptibility gene. PALB2 mutations are rare causes of hereditary breast cancer but may be important in countries such as Finland where a founder mutation is present. We sought to estimate the contribution of PALB2 mutations to the burden of breast cancer in French Canadians from Quebec. We screened all coding exons of PALB2 in a sample of 50 French-Canadian women diagnosed with either early-onset breast cancer or familial breast cancer at a single Montreal hospital. The genetic variants identified in this sample were then studied in 356 additional women with breast cancer diagnosed before age 50 and in 6,448 newborn controls. We identified a single protein-truncating mutation in PALB2 (c.2323 C>T, resulting in Q775X) in 1 of the 50 high-risk women. This variant was present in 2 of 356 breast cancer cases and in none of 6,440 newborn French-Canadian controls (P = 0.003). We also identified two novel new non-synonymous single nucleotide polymorphisms in exon 4 of PALB2 (c.5038 A>G [I76V] and c.5156 G>T [G115V]). G115V was found in 1 of 356 cases and in 15 of 6,442 controls (P = 0.6). The I76V variant was not identified in either the extended case series or the controls. We have identified a novel truncating mutation in PALB2. The mutation was found in approximately 0.5% of unselected French-Canadian women with early-onset breast cancer and appears to have a single origin. Although mutations are infrequent, PALB2 can be added to the list of breast cancer susceptibility genes for which founder mutations have been identified in the French-Canadian population.
Foulkes, William D; Ghadirian, Parviz; Akbari, Mohammed Reza; Hamel, Nancy; Giroux, Sylvie; Sabbaghian, Nelly; Darnel, Andrew; Royer, Robert; Poll, Aletta; Fafard, Eve; Robidoux, André; Martin, Ginette; Bismar, Tarek A; Tischkowitz, Marc; Rousseau, Francois; Narod, Steven A
2007-01-01
Background PALB2 has recently been identified as a breast cancer susceptibility gene. PALB2 mutations are rare causes of hereditary breast cancer but may be important in countries such as Finland where a founder mutation is present. We sought to estimate the contribution of PALB2 mutations to the burden of breast cancer in French Canadians from Quebec. Methods We screened all coding exons of PALB2 in a sample of 50 French-Canadian women diagnosed with either early-onset breast cancer or familial breast cancer at a single Montreal hospital. The genetic variants identified in this sample were then studied in 356 additional women with breast cancer diagnosed before age 50 and in 6,448 newborn controls. Results We identified a single protein-truncating mutation in PALB2 (c.2323 C>T, resulting in Q775X) in 1 of the 50 high-risk women. This variant was present in 2 of 356 breast cancer cases and in none of 6,440 newborn French-Canadian controls (P = 0.003). We also identified two novel new non-synonymous single nucleotide polymorphisms in exon 4 of PALB2 (c.5038 A>G [I76V] and c.5156 G>T [G115V]). G115V was found in 1 of 356 cases and in 15 of 6,442 controls (P = 0.6). The I76V variant was not identified in either the extended case series or the controls. Conclusion We have identified a novel truncating mutation in PALB2. The mutation was found in approximately 0.5% of unselected French-Canadian women with early-onset breast cancer and appears to have a single origin. Although mutations are infrequent, PALB2 can be added to the list of breast cancer susceptibility genes for which founder mutations have been identified in the French-Canadian population. PMID:18053174
Species-Specific Exon Loss in Human Transcriptomes
Wang, Jinkai; Lu, Zhi-xiang; Tokheim, Collin J.; Miller, Sara E.; Xing, Yi
2015-01-01
Changes in exon–intron structures and splicing patterns represent an important mechanism for the evolution of gene functions and species-specific regulatory networks. Although exon creation is widespread during primate and human evolution and has been studied extensively, much less is known about the scope and potential impact of human-specific exon loss events. Historically, transcriptome data and exon annotations are significantly biased toward humans over nonhuman primates. This ascertainment bias makes it challenging to discover human-specific exon loss events. We carried out a transcriptome-wide search of human-specific exon loss events, by taking advantage of RNA sequencing (RNA-seq) as a powerful and unbiased tool for exon discovery and annotation. Using RNA-seq data of humans, chimpanzees, and other primates, we reconstructed and compared transcript structures across the primate phylogeny. We discovered 33 candidate human-specific exon loss events, among which six exons passed stringent experimental filters for the complete loss of splicing activities in diverse human tissues. These events may result from human-specific deletion of genomic DNA, or small-scale sequence changes that inactivated splicing signals. The impact of human-specific exon loss events is predominantly regulatory. Three of the six events occurred in the 5′ untranslated region (5′-UTR) and affected cis-regulatory elements of mRNA translation. In SLC7A6, a gene encoding an amino acid transporter, luciferase reporter assays suggested that both a human-specific exon loss event and an independent human-specific single nucleotide substitution in the 5′-UTR increased mRNA translational efficiency. Our study provides novel insights into the molecular mechanisms and evolutionary consequences of exon loss during human evolution. PMID:25398629
Delineation of the Marfan phenotype associated with mutations in exons 23-32 of the FBN1 gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Putnam, E.A.; Cho, M.; Milewicz, D.M.
Marfan syndrome is a dominantly inherited connective tissue disorder with a wide range of phenotypic severity. The condition is the result of mutations in FBN1, a large gene composed of 65 exons encoding the fibrillin-1 protein. While mutations causing classic manifestations of Marfan syndrome have been identified throughout the FBN1 gene, the six previously characterized mutations resulting in the severe, perinatal lethal form of Marfan syndrome have clustered in exons 24-32 of the gene. We screened 8 patients with either neonatal Marfan syndrome or severe cardiovascular complications of Marfan syndrome for mutations in this region of the gene. Using intron-basedmore » exon-specific primers, we amplified exons 23-32 from genomic DNAs, screened these fragments by single-stranded conformational polymorphism analysis, and sequenced indicated exons. This analysis documented mutations in exons 25-27 of the FBN1 mutations in 6 of these patients. These results, taken together with previously published FBN1 mutations in this region, further define the phenotype associated with mutations in exons 24-32 of the FBN1 gene, information important for the development of possible diagnostic tests and genetic counseling. 49 refs., 4 figs., 2 tabs.« less
Myelin protein zero gene sequencing diagnoses Charcot-Marie-Tooth Type 1B disease
DOE Office of Scientific and Technical Information (OSTI.GOV)
Su, Y.; Zhang, H.; Madrid, R.
1994-09-01
Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, affects about 1 in 2600 people in Norway and is found worldwide. CMT Type 1 (CMT1) has slow nerve conduction with demyelinated Schwann cells. Autosomal dominant CMT Type 1B (CMT1B) results from mutations in the myelin protein zero gene which directs the synthesis of more than half of all Schwann cell protein. This gene was mapped to the chromosome 1q22-1q23.1 borderline by fluorescence in situ hybridization. The first 7 of 7 reported CMT1B mutations are unique. Thus the most effective means to identify CMT1B mutations in at-risk family members and fetuses ismore » to sequence the entire coding sequence in dominant or sporadic CMT patients without the CMT1A duplication. Of the 19 primers used in 16 pars to uniquely amplify the entire MPZ coding sequence, 6 primer pairs were used to amplify and sequence the 6 exons. The DyeDeoxy Terminator cycle sequencing method used with four different color fluorescent lables was superior to manual sequencing because it sequences more bases unambiguously from extracted genomic DNA samples within 24 hours. This protocol was used to test 28 CMT and Dejerine-Sottas patients without CMT1A gene duplication. Sequencing MPZ gene-specific amplified fragments identified 9 polymorphic sites within the 6 exons that encode the 248 amino acid MPZ protein. The large number of major CMT1B mutations identified by single strand sequencing are being verified by reverse strand sequencing and when possible, by restriction enzyme analysis. This protocol can be used to distringuish CMT1B patients from othre CMT phenotypes and to determine the CMT1B status of relatives both presymptomatically and prenatally.« less
Hypoparathyroidism-retardation-Dysmorphism (HRD) syndrome--a review.
Hershkovitz, Eli; Parvari, Ruti; Diaz, George A; Gorodischer, Rafael
2004-12-01
Hypoparathyroidism, retardation, and dysmorphism (HRD) is a newly recognized genetic syndrome, described in patients of Arab origin. The syndrome consists of permanent congenital hypoparathyroidism, severe prenatal and postnatal growth retardation, and profound global developmental delay. The patients are susceptible to severe infections including life-threatening pneumococcal infections especially during infancy. The main dysmorphic features are microcephaly, deep-set eyes or microphthalmia, ear abnormalities, depressed nasal bridge, thin upper lip, hooked small nose, micrognathia, and small hands and feet. A single 12-bp deletion (del52-55) in the second coding exon of the tubulin cofactor E (TCFE) gene, located on the long arm of chromosome 1, is the cause of HRD among Arab patients. Early recognition and therapy of hypocalcemia is important as is daily antibiotic prophylaxis against pneumococcal infections.
Molecular evolution of the leptin exon 3 in some species of the family Canidae.
Chmurzynska, Agata; Zajac, Magdalena; Switonski, Marek
2003-01-01
The structure of the leptin gene seems to be well conserved. The polymorphism of this gene in four species belonging to the Canidae family (the dog (Canis familiaris)--16 different breeds, the Chinese racoon dog (Nyctereutes procyonoides procyonoides), the red fox (Vulpes vulpes) and the arctic fox (Alopex lagopus)) were studied with the use of single strand conformation polymorphism (SSCP), restriction fragment length polymorphism (RFLP) and DNA sequencing techniques. For exon 2, all species presented the same SSCP pattern, while in exon 3 some differences were found. DNA sequencing of exon 3 revealed the presence of six nucleotide substitutions, differentiating the studied species. Three of them cause amino acid substitutions as well. For all dog breeds studied, SSCP patterns were identical.
Novel germline PALB2 truncating mutations in African-American breast cancer patients
Zheng, Yonglan; Zhang, Jing; Niu, Qun; Huo, Dezheng; Olopade, Olufunmilayo I.
2011-01-01
Background It has been demonstrated that PALB2 acts as a bridging molecule between the BRCA1 and BRCA2 proteins and is responsible for facilitating BRCA2-mediated DNA repair. Truncating mutations in the PALB2 gene have been reported to be enriched in Fanconi anemia and breast cancer patients in various populations. Methods We evaluated the contribution of PALB2 germline mutations in 279 African-American breast cancer patients including 29 patients with a strong family history, 29 patients with a moderate family history, 75 patients with a weak family history, and 146 non-familial or sporadic breast cancer cases. Results After direct sequencing of all the coding exons, exon/intron boundaries, 5′UTR and 3′UTR of PALB2, three (1.08%; 3 in 279) novel monoallelic truncating mutations were identified: c.758dupT (exon4), c.1479delC (exon4) and c.3048delT (exon 10); together with 50 sequence variants, 27 of which are novel. None of the truncating mutations were found in 262 controls from the same population. Conclusions PALB2 mutations are present in both familial and non-familial breast cancer among African-Americans. Rare PALB2 mutations account for a small but substantial proportion of breast cancer patients. PMID:21932393
Mutation in Pyrroline-5-Carboxylate Reductase 1 Gene in Families with Cutis Laxa Type 2
Guernsey, Duane L.; Jiang, Haiyan; Evans, Susan C.; Ferguson, Meghan; Matsuoka, Makoto; Nightingale, Mathew; Rideout, Andrea L.; Provost, Sylvie; Bedard, Karen; Orr, Andrew; Dubé, Marie-Pierre; Ludman, Mark; Samuels, Mark E.
2009-01-01
Autosomal-recessive cutis laxa type 2 (ARCL2) is a multisystem disorder characterized by the appearance of premature aging, wrinkled and lax skin, joint laxity, and a general developmental delay. Cutis laxa includes a family of clinically overlapping conditions with confusing nomenclature, generally requiring molecular analyses for definitive diagnosis. Six genes are currently known to mutate to yield one of these related conditions. We ascertained a cohort of typical ARCL2 patients from a subpopulation isolate within eastern Canada. Homozygosity mapping with high-density SNP genotyping excluded all six known genes, and instead identified a single homozygous region near the telomere of chromosome 17, shared identically by state by all genotyped affected individuals from the families. A putative pathogenic variant was identified by direct DNA sequencing of genes within the region. The single nucleotide change leads to a missense mutation adjacent to a splice junction in the gene encoding pyrroline-5-carboxylate reductase 1 (PYCR1). Bioinformatic analysis predicted a pathogenic effect of the variant on splice donor site function. Skipping of the associated exon was confirmed in RNA from blood lymphocytes of affected homozygotes and heterozygous mutation carriers. Exon skipping leads to deletion of the reductase functional domain-coding region and an obligatory downstream frameshift. PYCR1 plays a critical role in proline biosynthesis. Pathogenicity of the genetic variant in PYCR1 is likely, given that a similar clinical phenotype has been documented for mutation carriers of another proline biosynthetic enzyme, pyrroline-5-carboxylate synthase. Our results support a significant role for proline in normal development. PMID:19576563
Drögemüller, Cord; Reichart, Ursula; Seuberlich, Torsten; Oevermann, Anna; Baumgartner, Martin; Kühni Boghenbor, Kathrin; Stoffel, Michael H.; Syring, Claudia; Meylan, Mireille; Müller, Simone; Müller, Mathias; Gredler, Birgit
2011-01-01
Tyrolean Grey cattle represent a local breed with a population size of ∼5000 registered cows. In 2003, a previously unknown neurological disorder was recognized in Tyrolean Grey cattle. The clinical signs of the disorder are similar to those of bovine progressive degenerative myeloencephalopathy (weaver syndrome) in Brown Swiss cattle but occur much earlier in life. The neuropathological investigation of an affected calf showed axonal degeneration in the central nervous system (CNS) and femoral nerve. The pedigrees of the affected calves suggested a monogenic autosomal recessive inheritance. We localized the responsible mutation to a 1.9 Mb interval on chromosome 16 by genome-wide association and haplotype mapping. The MFN2 gene located in this interval encodes mitofusin 2, a mitochondrial membrane protein. A heritable human axonal neuropathy, Charcot-Marie-Tooth disease-2A2 (CMT2A2), is caused by MFN2 mutations. Therefore, we considered MFN2 a positional and functional candidate gene and performed mutation analysis in affected and control Tyrolean Grey cattle. We did not find any non-synonymous variants. However, we identified a perfectly associated silent SNP in the coding region of exon 20 of the MFN2 gene. This SNP is located within a putative exonic splice enhancer (ESE) and the variant allele leads to partial retention of the entire intron 19 and a premature stop codon in the aberrant MFN2 transcript. Thus we have identified a highly unusual splicing defect, where an exonic single base exchange leads to the retention of the preceding intron. This splicing defect represents a potential explanation for the observed degenerative axonopathy. Marker assisted selection can now be used to eliminate degenerative axonopathy from Tyrolean Grey cattle. PMID:21526202
Drögemüller, Cord; Reichart, Ursula; Seuberlich, Torsten; Oevermann, Anna; Baumgartner, Martin; Kühni Boghenbor, Kathrin; Stoffel, Michael H; Syring, Claudia; Meylan, Mireille; Müller, Simone; Müller, Mathias; Gredler, Birgit; Sölkner, Johann; Leeb, Tosso
2011-04-15
Tyrolean Grey cattle represent a local breed with a population size of ∼5000 registered cows. In 2003, a previously unknown neurological disorder was recognized in Tyrolean Grey cattle. The clinical signs of the disorder are similar to those of bovine progressive degenerative myeloencephalopathy (weaver syndrome) in Brown Swiss cattle but occur much earlier in life. The neuropathological investigation of an affected calf showed axonal degeneration in the central nervous system (CNS) and femoral nerve. The pedigrees of the affected calves suggested a monogenic autosomal recessive inheritance. We localized the responsible mutation to a 1.9 Mb interval on chromosome 16 by genome-wide association and haplotype mapping. The MFN2 gene located in this interval encodes mitofusin 2, a mitochondrial membrane protein. A heritable human axonal neuropathy, Charcot-Marie-Tooth disease-2A2 (CMT2A2), is caused by MFN2 mutations. Therefore, we considered MFN2 a positional and functional candidate gene and performed mutation analysis in affected and control Tyrolean Grey cattle. We did not find any non-synonymous variants. However, we identified a perfectly associated silent SNP in the coding region of exon 20 of the MFN2 gene. This SNP is located within a putative exonic splice enhancer (ESE) and the variant allele leads to partial retention of the entire intron 19 and a premature stop codon in the aberrant MFN2 transcript. Thus we have identified a highly unusual splicing defect, where an exonic single base exchange leads to the retention of the preceding intron. This splicing defect represents a potential explanation for the observed degenerative axonopathy. Marker assisted selection can now be used to eliminate degenerative axonopathy from Tyrolean Grey cattle.
Characterization and mapping of the mouse NDP (Norrie disease) locus (Ndp).
Battinelli, E M; Boyd, Y; Craig, I W; Breakefield, X O; Chen, Z Y
1996-02-01
Norrie disease is a severe X-linked recessive neurological disorder characterized by congenital blindness with progressive loss of hearing. Over half of Norrie patients also manifest different degrees of mental retardation. The gene for Norrie disease (NDP) has recently been cloned and characterized. With the human NDP cDNA, mouse genomic phage libraries were screened for the homolog of the gene. Comparison between mouse and human genomic DNA blots hybridized with the NDP cDNA, as well as analysis of phage clones, shows that the mouse NDP gene is 29 kb in size (28 kb for the human gene). The organization in the two species is very similar. Both have three exons with similar-sized introns and identical exon-intron boundaries between exon 2 and 3. The mouse open reading frame is 393 bp and, like the human coding sequence, is encoded in exons 2 and 3. The absence of six nucleotides in the second mouse exon results in the encoded protein being two amino acids smaller than its human counterpart. The overall homology between the human and mouse NDP protein is 95% and is particularly high (99%) in exon 3, consistent with the apparent functional importance of this region. Analysis of transcription initiation sites suggests the presence of multiple start sites associated with expression of the mouse NDP gene. Pedigree analysis of an interspecific mouse backcross localizes the mouse NDP gene close to Maoa in the conserved segment, which runs from CYBB to PFC in both human and mouse.
Origins of Genes: "Big Bang" or Continuous Creation?
NASA Astrophysics Data System (ADS)
Kesse, Paul K.; Gibbs, Adrian
1992-10-01
Many protein families are common to all cellular organisms, indicating that many genes have ancient origins. Genetic variation is mostly attributed to processes such as mutation, duplication, and rearrangement of ancient modules. Thus it is widely assumed that much of present-day genetic diversity can be traced by common ancestry to a molecular "big bang." A rarely considered alternative is that proteins may arise continuously de novo. One mechanism of generating different coding sequences is by "overprinting," in which an existing nucleotide sequence is translated de novo in a different reading frame or from noncoding open reading frames. The clearest evidence for overprinting is provided when the original gene function is retained, as in overlapping genes. Analysis of their phylogenies indicates which are the original genes and which are their informationally novel partners. We report here the phylogenetic relationships of overlapping coding sequences from steroid-related receptor genes and from tymovirus, luteovirus, and lentivirus genomes. For each pair of overlapping coding sequences, one is confined to a single lineage, whereas the other is more widespread. This suggests that the phylogenetically restricted coding sequence arose only in the progenitor of that lineage by translating an out-of-frame sequence to yield the new polypeptide. The production of novel exons by alternative splicing in thyroid receptor and lentivirus genes suggests that introns can be a valuable evolutionary source for overprinting. New genes and their products may drive major evolutionary changes.
[Clinical and genetic analysis of a patient with Treacher Collins syndrome in TCOF1 gene].
Li, Hongbo; Zhang, Xu; Li, Zhenyue; Chen, Jing; Lu, Yu; Jia, Jingjie; Yuan, Huijun; Han, Dongyi
2012-05-01
To analyze the clinical and genetic features of a patient with Treacher Collins syndrome (TCS), and identify the mutation in TCOF1 gene. The medical history was taken, and general physical examinations and otological examinations were conducted in this patient. Genomic DNA was extracted from this patient and his parents and complete TCOF1 gene coding exons were amplified by specific PCR primers. Direct sequencing was carried out to identify the mutations. The raw data was analyzed with GeneTool software and molecular biological website. We detected a heterozygous c. 1639 delAG mutation in exon 11 of TCOF1, which resulted in a truncated protein lacking normal function. This mutation is a novel mutation and the second case identified in exon 11 of in TCS. TCS patient reported in this study has unique clinical phenotype. TCOF1 gene mutation is the specific risk factor.
Peng, Tao; Xue, Chenghai; Bi, Jianning; Li, Tingting; Wang, Xiaowo; Zhang, Xuegong; Li, Yanda
2008-04-26
Alternative splicing expands transcriptome diversity and plays an important role in regulation of gene expression. Previous studies focus on the regulation of a single cassette exon, but recent experiments indicate that multiple cassette exons within a gene may interact with each other. This interaction can increase the potential to generate various transcripts and adds an extra layer of complexity to gene regulation. Several cases of exon interaction have been discovered. However, the extent to which the cassette exons coordinate with each other remains unknown. Based on EST data, we employed a metric of correlation coefficients to describe the interaction between two adjacent cassette exons and then categorized these exon pairs into three different groups by their interaction (correlation) patterns. Sequence analysis demonstrates that strongly-correlated groups are more conserved and contain a higher proportion of pairs with reading frame preservation in a combinatorial manner. Multiple genome comparison further indicates that different groups of correlated pairs have different evolutionary courses: (1) The vast majority of positively-correlated pairs are old, (2) most of the weakly-correlated pairs are relatively young, and (3) negatively-correlated pairs are a mixture of old and young events. We performed a large-scale analysis of interactions between adjacent cassette exons. Compared with weakly-correlated pairs, the strongly-correlated pairs, including both the positively and negatively correlated ones, show more evidence that they are under delicate splicing control and tend to be functionally important. Additionally, the positively-correlated pairs bear strong resemblance to constitutive exons, which suggests that they may evolve from ancient constitutive exons, while negatively and weakly correlated pairs are more likely to contain newly emerging exons.
Nowacka-Woszuk, J; Switonski, M
2010-02-01
Numerous mutations of the human androgen receptor (AR) gene cause an intersexual phenotype, called the androgen insensitivity syndrome. The intersexual phenotype is also quite often diagnosed in dogs. The aim of this study was to conduct a comparative analysis of the entire coding sequence (eight exons) of the AR gene in healthy and four intersex dogs, as well as in three other canids (the red fox, arctic fox and Chinese raccoon dog). The coding sequence of the studied species appeared to be conserved (similarity above 97%) and polymorphism was found in exon 1 only. Altogether, 2 SNPs were identified in healthy dogs, 14 in red foxes, 16 in arctic foxes and 6 were found in Chinese raccoon dogs, respectively. Moreover, a variable number of tandem repeats (CAG and CAA), encoding an array of glutamines, was also observed in this exon. The CAA codon numbers were invariable within species, but the CAG repeats were polymorphic. The highest number of the CAG and CAA repeats was found in dogs (from 40 to 42) and the observed variability was similar in intersex and healthy dogs. In the other canids the variability fell within the following ranges: 29-37 (red fox), 37-39 (arctic fox) and 29-32 (Chinese raccoon dog). In addition, a polymorphic microsatellite marker in intron 2 was found in the dog, red fox and Chinese raccoon dog. It was concluded that the polymorphism level of the AR gene in the dog was lower than in the other canids and none of the detected polymorphisms, including variability of the CAG tandem repeats, could be related with the intersexual phenotype of the studied dogs.
Lourenco-Jaramillo, Diana Lelidett; Sifuentes-Rincón, Ana María; Parra-Bracamonte, Gaspar Manuel; de la Rosa-Reyna, Xochitl Fabiola; Segura-Cabrera, Aldo; Arellano-Vera, Williams
2012-01-01
DNA from four cattle breeds was used to re-sequence all of the exons and 56% of the introns of the bovine tyrosine hydroxylase (TH) gene and 97% and 13% of the bovine dopamine β-hydroxylase (DBH) coding and non-coding sequences, respectively. Two novel single nucleotide polymorphisms (SNPs) and a microsatellite motif were found in the TH sequences. The DBH sequences contained 62 nucleotide changes, including eight non-synonymous SNPs (nsSNPs) that are of particular interest because they may alter protein function and therefore affect the phenotype. These DBH nsSNPs resulted in amino acid substitutions that were predicted to destabilize the protein structure. Six SNPs (one from TH and five from DBH non-synonymous SNPs) were genotyped in 140 animals; all of them were polymorphic and had a minor allele frequency of > 9%. There were significant differences in the intra- and inter-population haplotype distributions. The haplotype differences between Brahman cattle and the three B. t. taurus breeds (Charolais, Holstein and Lidia) were interesting from a behavioural point of view because of the differences in temperament between these breeds. PMID:22888292
Exome-wide DNA capture and next generation sequencing in domestic and wild species.
Cosart, Ted; Beja-Pereira, Albano; Chen, Shanyuan; Ng, Sarah B; Shendure, Jay; Luikart, Gordon
2011-07-05
Gene-targeted and genome-wide markers are crucial to advance evolutionary biology, agriculture, and biodiversity conservation by improving our understanding of genetic processes underlying adaptation and speciation. Unfortunately, for eukaryotic species with large genomes it remains costly to obtain genome sequences and to develop genome resources such as genome-wide SNPs. A method is needed to allow gene-targeted, next-generation sequencing that is flexible enough to include any gene or number of genes, unlike transcriptome sequencing. Such a method would allow sequencing of many individuals, avoiding ascertainment bias in subsequent population genetic analyses.We demonstrate the usefulness of a recent technology, exon capture, for genome-wide, gene-targeted marker discovery in species with no genome resources. We use coding gene sequences from the domestic cow genome sequence (Bos taurus) to capture (enrich for), and subsequently sequence, thousands of exons of B. taurus, B. indicus, and Bison bison (wild bison). Our capture array has probes for 16,131 exons in 2,570 genes, including 203 candidate genes with known function and of interest for their association with disease and other fitness traits. We successfully sequenced and mapped exon sequences from across the 29 autosomes and X chromosome in the B. taurus genome sequence. Exon capture and high-throughput sequencing identified thousands of putative SNPs spread evenly across all reference chromosomes, in all three individuals, including hundreds of SNPs in our targeted candidate genes. This study shows exon capture can be customized for SNP discovery in many individuals and for non-model species without genomic resources. Our captured exome subset was small enough for affordable next-generation sequencing, and successfully captured exons from a divergent wild species using the domestic cow genome as reference.
Role of interleukin-15 receptor alpha polymorphisms in normal weight obese syndrome.
Di Renzo, L; Gloria-Bottini, F; Saccucci, P; Bigioni, M; Abenavoli, L; Gasbarrini, G; De Lorenzo, A
2009-01-01
Previous published studies have identified a class of women, Normal Weight Obese women (NWO) with normal BMI and high fat content. An important role of Interleukin-15 (IL-15) has been documented in facilitating muscle proliferation and promoting fat depletion. Indeed the presence of three types of IL-15 receptor subunits in fat tissue suggests a direct effect on adipose tissue. We studied three single nucleotide polymorphisms (SNP) of IL-15R-alpha receptor gene and investigated their relationship with NWO phenotype. We considered two classes of women according to their BMI and percent fat mass (percent FAT), class 1: including 72 overweight-obese women (high BMI-high fat mass) and class 2: including 36 NWO (normal BMI, high fat mass). Three sites of Interleukin-15 receptor subunit á gene were examined, located respectively in exon4, exon5 intron-exon border and exon7. Genotyping of the identified polymorphisms was performed by restriction fragment length polymorphism. Haplotype frequency estimation was performed by using the Mendel-University of Chicago program. Odds ratio analyses were calculated by EPISTAT program. Highly significant differences were observed for exon 7- exon5 intron-exon border and exon 4-exon 7 haplotype distribution between class 1 and class 2 women. These results strongly support the hypothesis that genetic variability of the IL-15 receptor has an important role in body fat composition. Our data underscore previous findings that suggest a potential role of IL-15 cytokine in NWO syndrome.
Molecular evolution of the leptin exon 3 in some species of the family Canidae
Chmurzynska, Agata; Zajac, Magdalena; Switonski, Marek
2003-01-01
The structure of the leptin gene seems to be well conserved. The polymorphism of this gene in four species belonging to the Canidae family (the dog (Canis familiaris) – 16 different breeds, the Chinese racoon dog (Nyctereutes procyonoides procyonoides), the red fox (Vulpes vulpes) and the arctic fox (Alopex lagopus)) were studied with the use of single strand conformation polymorphism (SSCP), restriction fragment length polymorphism (RFLP) and DNA sequencing techniques. For exon 2, all species presented the same SSCP pattern, while in exon 3 some differences were found. DNA sequencing of exon 3 revealed the presence of six nucleotide substitutions, differentiating the studied species. Three of them cause amino acid substitutions as well. For all dog breeds studied, SSCP patterns were identical. PMID:12939206
Novel exon 1 protein-coding regions N-terminally extend human KCNE3 and KCNE4.
Abbott, Geoffrey W
2016-08-01
The 5 human (h)KCNE β subunits each regulate various cation channels and are linked to inherited cardiac arrhythmias. Reported here are previously undiscovered protein-coding regions in exon 1 of hKCNE3 and hKCNE4 that extend their encoded extracellular domains by 44 and 51 residues, which yields full-length proteins of 147 and 221 residues, respectively. Full-length hKCNE3 and hKCNE4 transcript and protein are expressed in multiple human tissues; for hKCNE4, only the longer protein isoform is detectable. Two-electrode voltage-clamp electrophysiology revealed that, when coexpressed in Xenopus laevis oocytes with various potassium channels, the newly discovered segment preserved conversion of KCNQ1 by hKCNE3 to a constitutively open channel, but prevented its inhibition of Kv4.2 and KCNQ4. hKCNE4 slowing of Kv4.2 inactivation and positive-shifted steady-state inactivation were also preserved in the longer form. In contrast, full-length hKCNE4 inhibition of KCNQ1 was limited to 40% at +40 mV vs. 80% inhibition by the shorter form, and augmentation of KCNQ4 activity by hKCNE4 was entirely abolished by the additional segment. Among the genome databases analyzed, the longer KCNE3 is confined to primates; full-length KCNE4 is widespread in vertebrates but is notably absent from Mus musculus Findings highlight unexpected KCNE gene diversity, raise the possibility of dynamic regulation of KCNE partner modulation via splice variation, and suggest that the longer hKCNE3 and hKCNE4 proteins should be adopted in future mechanistic and genetic screening studies.-Abbott, G. W. Novel exon 1 protein-coding regions N-terminally extend human KCNE3 and KCNE4. © FASEB.
CVD-associated non-coding RNA, ANRIL, modulates expression of atherogenic pathways in VSMC
DOE Office of Scientific and Technical Information (OSTI.GOV)
Congrains, Ada; Kamide, Kei; Katsuya, Tomohiro
Highlights: Black-Right-Pointing-Pointer ANRIL maps in the strongest susceptibility locus for cardiovascular disease. Black-Right-Pointing-Pointer Silencing of ANRIL leads to altered expression of tissue remodeling-related genes. Black-Right-Pointing-Pointer The effects of ANRIL on gene expression are splicing variant specific. Black-Right-Pointing-Pointer ANRIL affects progression of cardiovascular disease by regulating proliferation and apoptosis pathways. -- Abstract: ANRIL is a newly discovered non-coding RNA lying on the strongest genetic susceptibility locus for cardiovascular disease (CVD) in the chromosome 9p21 region. Genome-wide association studies have been linking polymorphisms in this locus with CVD and several other major diseases such as diabetes and cancer. The role of thismore » non-coding RNA in atherosclerosis progression is still poorly understood. In this study, we investigated the implication of ANRIL in the modulation of gene sets directly involved in atherosclerosis. We designed and tested siRNA sequences to selectively target two exons (exon 1 and exon 19) of the transcript and successfully knocked down expression of ANRIL in human aortic vascular smooth muscle cells (HuAoVSMC). We used a pathway-focused RT-PCR array to profile gene expression changes caused by ANRIL knock down. Notably, the genes affected by each of the siRNAs were different, suggesting that different splicing variants of ANRIL might have distinct roles in cell physiology. Our results suggest that ANRIL splicing variants play a role in coordinating tissue remodeling, by modulating the expression of genes involved in cell proliferation, apoptosis, extra-cellular matrix remodeling and inflammatory response to finally impact in the risk of cardiovascular disease and other pathologies.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, O.; Masters, C.; Lewis, M.B.
1994-09-01
In an 8-year-old girl and her father, both of whom have severe type III OI, we have previously used RNA/RNA hybrid analysis to demonstrate a mismatch in the region of {alpha}1(I) mRNA coding for aa 558-861. We used SSCP to further localize the abnormality to a subregion coding for aa 579-679. This region was subcloned and sequenced. Each patient`s cDNA has a deletion of the sequences coding for the last residue of exon 34, and all of exons 35 and 36 (aa 604-639), followed by an insertion of 156 nt from the 3{prime}-end of intron 36. PCR amplification of leukocytemore » DNA from the patients and the clinically normal paternal grandmother yielded two fragments: a 1007 bp fragment predicted from normal genomic sequences and a 445 bp fragment. Subcloning and sequencing of the shorter genomic PCR product confirmed the presence of a 565 bp genomic deletion from the end of exon 34 to the middle of intron 36. The abnormal protein is apparently synthesized and incorporated into helix. The inserted nucleotides are in frame with the collagenous sequence and contain no stop codons. They encode a 52 aa non-collagenous region. The fibroblast procollagen of the patients has both normal and electrophoretically delayed pro{alpha}(I) bands. The electrophoretically delayed procollagen is very sensitive to pepsin or trypsin digestion, as predicted by its non-collagenous sequence, and cannot be visualized as collagen. This unique OI collagen mutation is an excellent candidate for molecular targeting to {open_quotes}turn off{close_quotes} a dominant mutant allele.« less
Exon 2-mediated c-myc mRNA decay in vivo is independent of its translation.
Pistoi, S; Roland, J; Babinet, C; Morello, D
1996-01-01
We have previously shown that the steady-state level of c-myc mRNA in vivo is primarily controlled by posttranscriptional regulatory mechanisms. To identify the sequences involved in this process, we constructed a series of H-2/myc transgenic lines in which various regions of the human c-MYC gene were placed under the control of the quasi-ubiquitous H-2K class I regulatory sequences. We demonstrated that the presence of one of the two coding exons, exon 2 or exon 3, is sufficient to confer a level of expression of transgene mRNA similar to that of endogenous c-myc in various adult tissues as well as after partial hepatectomy or after protein synthesis inhibition. We now focus on the molecular mechanisms involved in modulation of expression of mRNAs containing c-myc exon 2 sequences, with special emphasis on the coupling between translation and c-myc mRNA turnover. We have undertaken an analysis of expression, both at the mRNA level and at the protein level, of new transgenic constructs in which the translation is impaired either by disruption of the initiation codon or by addition of stop codons upstream of exon 2. Our results show that the translation of c-myc exon 2 is not required for regulated expression of the transgene in the different situations analyzed, and therefore they indicate that the mRNA destabilizing function of exon 2 is independent of translation by ribosomes. Our investigations also reveal that, in the thymus, some H-2/myc transgenes express high levels of mRNA but low levels of protein. Besides the fact that these results suggest the existence of tissue-specific mechanisms that control c-myc translatability in vivo, they also bring another indication of the uncoupling of c-myc mRNA translation and degradation. PMID:8756668
Fisher, S E; van Bakel, I; Lloyd, S E; Pearce, S H; Thakker, R V; Craig, I W
1995-10-10
Dent disease, an X-linked familial renal tubular disorder, is a form of Fanconi syndrome associated with proteinuria, hypercalciuria, nephrocalcinosis, kidney stones, and eventual renal failure. We have previously used positional cloning to identify the 3' part of a novel kidney-specific gene (initially termed hClC-K2, but now referred to as CLCN5), which is deleted in patients from one pedigree segregating Dent disease. Mutations that disrupt this gene have been identified in other patients with this disorder. Here we describe the isolation and characterization of the complete open reading frame of the human CLCN5 gene, which is predicted to encode a protein of 746 amino acids, with significant homology to all known members of the ClC family of voltage-gated chloride channels. CLCN5 belongs to a distinct branch of this family, which also includes the recently identified genes CLCN3 and CLCN4. We have shown that the coding region of CLCN5 is organized into 12 exons, spanning 25-30 kb of genomic DNA, and have determined the sequence of each exon-intron boundary. The elucidation of the coding sequence and exon-intron organization of CLCN5 will both expedite the evaluation of structure/function relationships of these ion channels and facilitate the screening of other patients with renal tubular dysfunction for mutations at this locus.
Characterization of a novel 132-bp exon of the human maxi-K channel.
Korovkina, V P; Fergus, D J; Holdiman, A J; England, S K
2001-07-01
The large-conductance Ca2+-activated voltage-dependent K+ channel (maxi-K channel) induces a significant repolarizing current that buffers cell excitability. This channel can derive its diversity by alternative splicing of its transcript-producing isoforms that differ in their sensitivity to voltage and intracellular Ca2+. We have identified a novel 132-bp exon of the maxi-K channel from human myometrial cells that encodes 44 amino acids within the first intracellular loop of the channel protein. Distribution analysis reveals that this exon is expressed predominantly in human smooth muscle tissues with the highest abundance in the uterus and aorta and resembles the previously reported distribution of the total maxi-K channel transcript. Single-channel K+ current measurements in fibroblasts transfected with the maxi-K channel containing this novel 132-bp exon demonstrate that the presence of this insert attenuates the sensitivity to voltage and intracellular Ca2+. Alternative splicing to introduce this 132-bp exon into the maxi-K channel may elicit another mode to modulate cell excitability.
Abdoli, R; Zamani, P; Deljou, A; Rezvan, H
2013-07-25
BMPR-1B and GDF9 genes are well known due to their important effects on litter size and mechanisms controlling ovulation rate in sheep. In the present study, polymorphisms of BMPR-1B gene exon 8 and GDF9 gene exon 1 were detected by single strand conformational polymorphism (SSCP) analysis and DNA sequencing methods in 100 Mehraban ewes. The PCR reaction forced to amplify 140 and 380-bp fragments of BMPR-1B and GDF9 genes, respectively. Two single nucleotide polymorphisms (SNPS) were identified in two different SSCP patterns of BMPR-1B gene (CC and CA genotypes) that deduced one amino acid exchange. Also, two SNPS were identified in three different SSCP patterns of GDF9 gene (AA, AG and GG genotypes) that deduced one amino acid exchanges. Two different secondary structures of protein were predicted for BMPR-1B exon 8, but the secondary protein structures predicted for GDF9 exon 1 were similar together. The evaluation of the associations between the SSCP patterns and the protein structure changes with reproduction traits showed that BMPR-1B exon 8 genotypes have significant effects on some of reproduction traits but the GDF9 genotypes did not have any significant effect. The CA genotype of BMPR-1B exon 8 had a significant positive effect on reproduction performance and could be considered as an important and new mutation, affecting the ewes reproduction performance. Marker assisted selection using BMPR-IB gene could be noticed to improve the reproduction traits in Mehraban sheep. Copyright © 2013 Elsevier B.V. All rights reserved.
Esmaeili, Rezvan; Abdoli, Nasrin; Yadegari, Fatemeh; Neishaboury, Mohamadreza; Farahmand, Leila; Kaviani, Ahmad; Majidzadeh-A, Keivan
2018-01-01
CD44 encoded by a single gene is a cell surface transmembrane glycoprotein. Exon 2 is one of the important exons to bind CD44 protein to hyaluronan. Experimental evidences show that hyaluronan-CD44 interaction intensifies the proliferation, migration, and invasion of breast cancer cells. Therefore, the current study aimed at investigating the association between specific polymorphisms in exon 2 and its flanking region of CD44 with predisposition to breast cancer. In the current study, 175 Iranian female patients with breast cancer and 175 age-matched healthy controls were recruited in biobank, Breast Cancer Research Center, Tehran, Iran. Single nucleotide polymorphisms of CD44 exon 2 and its flanking were analyzed via polymerase chain reaction and gene sequencing techniques. Association between the observed variation with breast cancer risk and clinico-pathological characteristics were studied. Subsequently, bioinformatics analysis was conducted to predict potential exonic splicing enhancer (ESE) motifs changed as the result of a mutation. A unique polymorphism of the gene encoding CD44 was identified at position 14 nucleotide upstream of exon 2 (A37692→G) by the sequencing method. The A > G polymorphism exhibited a significant association with higher-grades of breast cancer, although no significant relation was found between this polymorphism and breast cancer risk. Finally, computational analysis revealed that the intronic mutation generated a new consensus-binding motif for the splicing factor, SC35, within intron 1. The current study results indicated that A > G polymorphism was associated with breast cancer development; in addition, in silico analysis with ESE finder prediction software showed that the change created a new SC35 binding site.
Li, Jun; Hakata, Yoshiyuki; Takeda, Eri; Liu, Qingping; Iwatani, Yasumasa; Kozak, Christine A.; Miyazawa, Masaaki
2012-01-01
Mouse apolipoprotein B mRNA-editing enzyme catalytic polypeptide-like editing complex 3 (mA3), an intracellular antiviral factor, has 2 allelic variations that are linked with different susceptibilities to beta- and gammaretrovirus infections among various mouse strains. In virus-resistant C57BL/6 (B6) mice, mA3 transcripts are more abundant than those in susceptible BALB/c mice both in the spleen and bone marrow. These strains of mice also express mA3 transcripts with different splicing patterns: B6 mice preferentially express exon 5-deficient (Δ5) mA3 mRNA, while BALB/c mice produce exon 5-containing full-length mA3 mRNA as the major transcript. Although the protein product of the Δ5 mRNA exerts stronger antiretroviral activities than the full-length protein, how exon 5 affects mA3 antiviral activity, as well as the genetic mechanisms regulating exon 5 inclusion into the mA3 transcripts, remains largely uncharacterized. Here we show that mA3 exon 5 is indeed a functional element that influences protein synthesis at a post-transcriptional level. We further employed in vitro splicing assays using genomic DNA clones to identify two critical polymorphisms affecting the inclusion of exon 5 into mA3 transcripts: the number of TCCT repeats upstream of exon 5 and the single nucleotide polymorphism within exon 5 located 12 bases upstream of the exon 5/intron 5 boundary. Distribution of the above polymorphisms among different Mus species indicates that the inclusion of exon 5 into mA3 mRNA is a relatively recent event in the evolution of mice. The widespread geographic distribution of this exon 5-including genetic variant suggests that in some Mus populations the cost of maintaining an effective but mutagenic enzyme may outweigh its antiviral function. PMID:22275865
Wise, C A; Chiang, L C; Paznekas, W A; Sharma, M; Musy, M M; Ashley, J A; Lovett, M; Jabs, E W
1997-04-01
Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development.
Genomic organization of the neurofibromatosis 1 gene (NF1)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Y.; O`Connell, P.; Huntsman Breidenbach, H.
Neurofibromatosis 1 maps to chromosome band 17q11.2, and the NF1 locus has been partially characterized. Even though the full-length NF1 cDNA has been sequenced, the complete genomic structure of the NF1 gene has not been elucidated. The 5{prime} end of NF1 is embedded in a CpG island containing a NotI restriction site, and the remainder of the gene lies in the adjacent 350-kb NotI fragment. In our efforts to develop a comprehensive screen for NF1 mutations, we have isolated genomic DNA clones that together harbor the entire NF1 cDNA sequence. We have identified all intron-exon boundaries of the coding regionmore » and established that it is composed of 59 exons. Furthermore, we have defined the 3{prime}-untranslated region (3{prime}-UTR) of the NF1 gene; it spans approximately 3.5 kb of genomic DNA sequence and is continuous with the stop codon. Oligonucleotide primer pairs synthesized from exon-flanking DNA sequences were used in the polymerase chain reaction with cloned, chromosome 17-specific genomic DNA as template to amplify NF1 exons 1 through 27b and the exon containing the 3{prime}-UTR separately. This information should be useful for implementing a comprehensive NF1 mutation screen using genomic DNA as template. 41 refs., 3 figs., 2 tabs.« less
Hereditary Angioedema Nationwide Study in Slovenia Reveals Four Novel Mutations in SERPING1 Gene
Rijavec, Matija; Korošec, Peter; Šilar, Mira; Zidarn, Mihaela; Miljković, Jovan; Košnik, Mitja
2013-01-01
Hereditary angioedema (HAE) is a rare autosomal dominant disease characterized by swelling of the face, lips, tongue, larynx, genitalia, or extremities, with abdominal pain caused by intra-abdominal edema. HAE is caused by mutations affecting the C1 inhibitor gene, SERPING1, resulting in low levels of C1 inhibitor (Type I HAE) or normal levels of ineffective C1 inhibitor (Type II HAE). A nationwide survey identified nine unrelated families with HAE in Slovenia, among whom 17 individuals from eight families were recruited for genetic analyses. A diagnosis of HAE was established in the presence of clinical and laboratory criteria (low C1 inhibitor antigenic levels and/or function), followed up by a positive family history. Genetic studies were carried out using PCR and sequencing to detect SERPING1 mutations in promoter, noncoding exon 1, the 7 coding exons, and exon-intron boundaries. Multiplex ligation-dependent probe amplification was performed in order to search for large deletions/duplications in SERPING1 gene. A mutation responsible for HAE was identified in patients from seven families with the disease. In HAE type I families, one previously reported substitution (Gln67Stop, c.265C>T) and four novel mutations were identified. The new mutations included two missense substitutions, Ser128Phe (c.449C>T), and Glu429Lys (c.1351G>A), together with two frameshift mutations, indel (c.49delGinsTT) and deletion (c.593_594delCT). Both families with HAE type II harbored the two well-known substitutions affecting the arginyl residue at the reactive center in exon 8, Arg444Cys (c.1396C>T) and Arg444His (c.1397G>A), respectively. In one patient only the homozygous variant g.566T>C (c.-21T>C) was identified. Our study identified four novel mutations in the Slovenian HAE population, highlighting the heterogeneity of mutations in the SERPING1 gene causing C1 inhibitor deficiency and HAE. In a single patient with HAE a homozygous variant g.566T>C (c.-21T>C) might be responsible for the disease. PMID:23437219
Hereditary angioedema nationwide study in Slovenia reveals four novel mutations in SERPING1 gene.
Rijavec, Matija; Korošec, Peter; Šilar, Mira; Zidarn, Mihaela; Miljković, Jovan; Košnik, Mitja
2013-01-01
Hereditary angioedema (HAE) is a rare autosomal dominant disease characterized by swelling of the face, lips, tongue, larynx, genitalia, or extremities, with abdominal pain caused by intra-abdominal edema. HAE is caused by mutations affecting the C1 inhibitor gene, SERPING1, resulting in low levels of C1 inhibitor (Type I HAE) or normal levels of ineffective C1 inhibitor (Type II HAE). A nationwide survey identified nine unrelated families with HAE in Slovenia, among whom 17 individuals from eight families were recruited for genetic analyses. A diagnosis of HAE was established in the presence of clinical and laboratory criteria (low C1 inhibitor antigenic levels and/or function), followed up by a positive family history. Genetic studies were carried out using PCR and sequencing to detect SERPING1 mutations in promoter, noncoding exon 1, the 7 coding exons, and exon-intron boundaries. Multiplex ligation-dependent probe amplification was performed in order to search for large deletions/duplications in SERPING1 gene. A mutation responsible for HAE was identified in patients from seven families with the disease. In HAE type I families, one previously reported substitution (Gln67Stop, c.265C>T) and four novel mutations were identified. The new mutations included two missense substitutions, Ser128Phe (c.449C>T), and Glu429Lys (c.1351G>A), together with two frameshift mutations, indel (c.49delGinsTT) and deletion (c.593_594delCT). Both families with HAE type II harbored the two well-known substitutions affecting the arginyl residue at the reactive center in exon 8, Arg444Cys (c.1396C>T) and Arg444His (c.1397G>A), respectively. In one patient only the homozygous variant g.566T>C (c.-21T>C) was identified. Our study identified four novel mutations in the Slovenian HAE population, highlighting the heterogeneity of mutations in the SERPING1 gene causing C1 inhibitor deficiency and HAE. In a single patient with HAE a homozygous variant g.566T>C (c.-21T>C) might be responsible for the disease.
Chillón, Isabel; Pyle, Anna M.
2016-01-01
LincRNA-p21 is a long intergenic non-coding RNA (lincRNA) involved in the p53-mediated stress response. We sequenced the human lincRNA-p21 (hLincRNA-p21) and found that it has a single exon that includes inverted repeat Alu elements (IRAlus). Sense and antisense Alu elements fold independently of one another into a secondary structure that is conserved in lincRNA-p21 among primates. Moreover, the structures formed by IRAlus are involved in the localization of hLincRNA-p21 in the nucleus, where hLincRNA-p21 colocalizes with paraspeckles. Our results underscore the importance of IRAlus structures for the function of hLincRNA-p21 during the stress response. PMID:27378782
Shabanpoor, Fazel; McClorey, Graham; Saleh, Amer F.; Järver, Peter; Wood, Matthew J.A.; Gait, Michael J.
2015-01-01
The potential for therapeutic application of splice-switching oligonucleotides (SSOs) to modulate pre-mRNA splicing is increasingly evident in a number of diseases. However, the primary drawback of this approach is poor cell and in vivo oligonucleotide uptake efficacy. Biological activities can be significantly enhanced through the use of synthetically conjugated cationic cell penetrating peptides (CPPs). Studies to date have focused on the delivery of a single SSO conjugated to a CPP, but here we describe the conjugation of two phosphorodiamidate morpholino oligonucleotide (PMO) SSOs to a single CPP for simultaneous delivery and pre-mRNA targeting of two separate genes, exon 23 of the Dmd gene and exon 5 of the Acvr2b gene, in a mouse model of Duchenne muscular dystrophy. Conjugations of PMOs to a single CPP were carried out through an amide bond in one case and through a triazole linkage (‘click chemistry’) in the other. The most active bi-specific CPP–PMOs demonstrated comparable exon skipping levels for both pre-mRNA targets when compared to individual CPP–PMO conjugates both in cell culture and in vivo in the mdx mouse model. Thus, two SSOs with different target sequences conjugated to a single CPP are biologically effective and potentially suitable for future therapeutic exploitation. PMID:25468897
Multi-step splicing of sphingomyelin synthase linear and circular RNAs.
Filippenkov, Ivan B; Sudarkina, Olga Yu; Limborska, Svetlana A; Dergunova, Lyudmila V
2018-05-15
The SGMS1 gene encodes the enzyme sphingomyelin synthase 1 (SMS1), which is involved in the regulation of lipid metabolism, apoptosis, intracellular vesicular transport and other significant processes. The SGMS1 gene is located on chromosome 10 and has a size of 320 kb. Previously, we showed that dozens of alternative transcripts of the SGMS1 gene are present in various human tissues. In addition to mRNAs that provide synthesis of the SMS1 protein, this gene participates in the synthesis of non-coding transcripts, including circular RNAs (circRNAs), which include exons of the 5'-untranslated region (5'-UTR) and are highly represented in the brain. In this study, using the high-throughput technology RNA-CaptureSeq, many new SGMS1 transcripts were identified, including both intronic unspliced RNAs (premature RNAs) and RNAs formed via alternative splicing. Recursive exons (RS-exons) that can participate in the multi-step splicing of long introns of the gene were also identified. These exons participate in the formation of circRNAs. Thus, multi-step splicing may provide a variety of linear and circular RNAs of eukaryotic genes in tissues. Copyright © 2018 Elsevier B.V. All rights reserved.
Eisman, Robert C.; Phelps, Melissa A. S.; Kaufman, Thomas
2015-01-01
The formation of the pericentriolar matrix (PCM) and a fully functional centrosome in syncytial Drosophila melanogaster embryos requires the rapid transport of Cnn during initiation of the centrosome replication cycle. We show a Cnn and Polo kinase interaction is apparently required during embryogenesis and involves the exon 1A-initiating coding exon, suggesting a subset of Cnn splice variants is regulated by Polo kinase. During PCM formation exon 1A Cnn-Long Form proteins likely bind Polo kinase before phosphorylation by Polo for Cnn transport to the centrosome. Loss of either of these interactions in a portion of the total Cnn protein pool is sufficient to remove native Cnn from the pool, thereby altering the normal localization dynamics of Cnn to the PCM. Additionally, Cnn-Short Form proteins are required for polar body formation, a process known to require Polo kinase after the completion of meiosis. Exon 1A Cnn-LF and Cnn-SF proteins, in conjunction with Polo kinase, are required at the completion of meiosis and for the formation of functional centrosomes during early embryogenesis. PMID:26447129
Eisman, Robert C; Phelps, Melissa A S; Kaufman, Thomas
2015-10-01
The formation of the pericentriolar matrix (PCM) and a fully functional centrosome in syncytial Drosophila melanogaster embryos requires the rapid transport of Cnn during initiation of the centrosome replication cycle. We show a Cnn and Polo kinase interaction is apparently required during embryogenesis and involves the exon 1A-initiating coding exon, suggesting a subset of Cnn splice variants is regulated by Polo kinase. During PCM formation exon 1A Cnn-Long Form proteins likely bind Polo kinase before phosphorylation by Polo for Cnn transport to the centrosome. Loss of either of these interactions in a portion of the total Cnn protein pool is sufficient to remove native Cnn from the pool, thereby altering the normal localization dynamics of Cnn to the PCM. Additionally, Cnn-Short Form proteins are required for polar body formation, a process known to require Polo kinase after the completion of meiosis. Exon 1A Cnn-LF and Cnn-SF proteins, in conjunction with Polo kinase, are required at the completion of meiosis and for the formation of functional centrosomes during early embryogenesis. Copyright © 2015 by the Genetics Society of America.
Li, Hongying; Zhang, Kaihui; Xu, Qun; Ma, Lixia; Lv, Xin; Sun, Ruopeng
2015-03-01
Alkaptonuria (AKU) is an autosomal recessive disorder of tyrosine metabolism, which is caused by a defect in the enzyme homogentisate 1,2-dioxygenase (HGD) with subsequent accumulation of homogentisic acid. Presently, more than 100 HGD mutations have been identified as the cause of the inborn error of metabolism across different populations worldwide. However, the HGD mutation is very rarely reported in Asia, especially China. In this study, we present mutational analyses of HGD gene in one Chinese Han child with AKU, which had been identified by gas chromatography-mass spectrometry detection of organic acids in urine samples. PCR and DNA sequencing of the entire coding region as well as exon-intron boundaries of HGD have been performed. Two novel mutations were identified in the HGD gene in this AKU case, a frameshift mutation of c.115delG in exon 3 and the splicing mutation of IVS5+3 A>C, a donor splice site of the exon 5 and exon-intron junction. The identification of these mutations in this study further expands the spectrum of known HGD gene mutations and contributes to prenatal molecular diagnosis of AKU.
Xu, Wen-Ning; Jiang, Zu-Jun; Li, Yong-Hua; Xiao, Hao-Wen; Gao, Yang; Pang, Yan; Ouyang, Lin; Liu, Zeng-Hui; Zhang, Le-Qing; Wang, Yang; Xiao, Yang
2015-10-01
To explore the correlation between MBL ExonI 54 and NFκB1-94ins/del ATTG polymorphism and fever during neutropenia in patients with acute leukaemia (AL) (except M3) after first chemotherapy in Chinese Han population. Blood samples obtained from 76 fever patients with AL during neutropenia episodes were detected to analyse single nucleotide polymorphism (SNP) in the MBL ExonI 54 and NFκB1-94ins/del ATTG gene, and analyse the correlation between above-mentioned 2 polymorphisms and fever during neutropenia of AL patients after chemotherapy. In 76 patients, no correlation were found between MBL ExonI 54 and NFκB1-94ins/del ATTG polymorphism and fever during neutropenia in patients with acute leukaemia after chemotherapy (P > 0.05). No significant relation were found in sex, age, underlying disease, disease status or degrees of neutropenia in febrile neutropenia between MBL ExonI 54 and NFκB1-94ins/del ATTG polymorphism (P > 0.05). However, patients with MBL ExonI 54 mutation presented longer febrile duration with a median of 5 days compared to 3 days of patients with wildtype MBL ExonI 54 genotype (P < 0.05). There is no clear correlation between MBL ExonI 54 and NFκB1-94ins/del ATTG polymorphism and fever during neutropenia in patients with acute leukaemia after chemotherapy. However, the patients with MBL ExonI 54 mutation have been observed to present a longer febrile duration.
Xu, Dong-Qing; Mattox, William
2006-01-01
Exonic splicing enhancers (ESEs) are sequences that facilitate recognition of splice sites and prevent exon-skipping. Because ESEs are often embedded within proteincoding sequences, alterations in them can also often be interpreted as nonsense, missense or silent mutations. To correctly interpret exonic mutations and their roles in disease, it is important to develop strategies that identify ESE mutations. Potential ESEs can be found computationally in many exons but it has proven difficult to predict if a given mutation will have effects on splicing based on sequence alone. Here we describe a flexible in vitro method that can be used to functionally compare the effects of multiple sequence variants on ESE activity in a single in vitro splicing reaction. We have applied this method in parallel with conventional splicing assays to test for a splicing enhancer in exon 17 of the human MLH1 gene. Point mutations associated with hereditary nonpolyposis colorectal cancer (HNPCC) have previously been found to correlate with exon-skipping in both lymphocytes and tumors from patients. We show that sequences from this exon can replace an ESE from the mouse IgM gene to support RNA splicing in HeLa nuclear extracts. ESE activity was reduced by HNPCC point mutations in codon 659 indicating that their primary effect is on splicing. Surprisingly the strongest enhancer function mapped to a different region of the exon upstream of this codon. Together our results indicate that HNPCC point mutations in codon 659 affect an auxillary element that augments the enhancer function to ensure exon inclusion. PMID:16357104
Analysis and recognition of 5′ UTR intron splice sites in human pre-mRNA
Eden, E.; Brunak, S.
2004-01-01
Prediction of splice sites in non-coding regions of genes is one of the most challenging aspects of gene structure recognition. We perform a rigorous analysis of such splice sites embedded in human 5′ untranslated regions (UTRs), and investigate correlations between this class of splice sites and other features found in the adjacent exons and introns. By restricting the training of neural network algorithms to ‘pure’ UTRs (not extending partially into protein coding regions), we for the first time investigate the predictive power of the splicing signal proper, in contrast to conventional splice site prediction, which typically relies on the change in sequence at the transition from protein coding to non-coding. By doing so, the algorithms were able to pick up subtler splicing signals that were otherwise masked by ‘coding’ noise, thus enhancing significantly the prediction of 5′ UTR splice sites. For example, the non-coding splice site predicting networks pick up compositional and positional bias in the 3′ ends of non-coding exons and 5′ non-coding intron ends, where cytosine and guanine are over-represented. This compositional bias at the true UTR donor sites is also visible in the synaptic weights of the neural networks trained to identify UTR donor sites. Conventional splice site prediction methods perform poorly in UTRs because the reading frame pattern is absent. The NetUTR method presented here performs 2–3-fold better compared with NetGene2 and GenScan in 5′ UTRs. We also tested the 5′ UTR trained method on protein coding regions, and discovered, surprisingly, that it works quite well (although it cannot compete with NetGene2). This indicates that the local splicing pattern in UTRs and coding regions is largely the same. The NetUTR method is made publicly available at www.cbs.dtu.dk/services/NetUTR. PMID:14960723
Vongvanrungruang, A; Mongkolsiriwatana, C; Boonkaew, T; Sawatdichaikul, O; Srikulnath, K; Peyachoknagul, S
2016-09-19
The fragrance gene, betaine aldehyde dehydrogenase 2 (Badh2), has been well studied in many plant species. The objectives of this study were to clone Badh2 and compare the sequences between aromatic and non-aromatic coconuts. The complete coding region was cloned from cDNA of both aromatic and non-aromatic coconuts. The nucleotide sequences were highly homologous to Badh2 genes of other plants. Badh2 consisted of a 1512-bp open reading frame encoding 503 amino acids. A single nucleotide difference between aromatic and non-aromatic coconuts resulted in the conversion of alanine (non-aromatic) to proline (aromatic) at position 442, which was the substrate binding site of BADH2. The ring side chain of proline could destabilize the structure leading to a non-functional enzyme. Badh2 genomic DNA was cloned from exon 1 to 4, and from exon 5 to 15 from the two coconut types, except for intron 4 that was very long. The intron sequences of the two coconut groups were highly homologous. No differences in Badh2 expression were found among the tissues of aromatic coconut or between aromatic and non-aromatic coconuts. The amino acid sequences of BADH2 from coconut and other plants were compared and the genetic relationship was analyzed using MEGA 7.0. The phylogenetic tree reconstructed by the Bayesian information criterion consisted of two distinct groups of monocots and dicots. Among the monocots, coconut (Cocos nucifera) and oil palm (Elaeis guineensis) were the most closely related species. A marker for coconut differentiation was developed from one-base substitution site and could be successfully used.
Zhu, Fu-Yuan; Chen, Mo-Xian; Ye, Neng-Hui; Shi, Lu; Ma, Kai-Long; Yang, Jing-Fang; Cao, Yun-Ying; Zhang, Youjun; Yoshida, Takuya; Fernie, Alisdair R; Fan, Guang-Yi; Wen, Bo; Zhou, Ruo; Liu, Tie-Yuan; Fan, Tao; Gao, Bei; Zhang, Di; Hao, Ge-Fei; Xiao, Shi; Liu, Ying-Gao; Zhang, Jianhua
2017-08-01
In eukaryotes, mechanisms such as alternative splicing (AS) and alternative translation initiation (ATI) contribute to organismal protein diversity. Specifically, splicing factors play crucial roles in responses to environment and development cues; however, the underlying mechanisms are not well investigated in plants. Here, we report the parallel employment of short-read RNA sequencing, single molecule long-read sequencing and proteomic identification to unravel AS isoforms and previously unannotated proteins in response to abscisic acid (ABA) treatment. Combining the data from the two sequencing methods, approximately 83.4% of intron-containing genes were alternatively spliced. Two AS types, which are referred to as alternative first exon (AFE) and alternative last exon (ALE), were more abundant than intron retention (IR); however, by contrast to AS events detected under normal conditions, differentially expressed AS isoforms were more likely to be translated. ABA extensively affects the AS pattern, indicated by the increasing number of non-conventional splicing sites. This work also identified thousands of unannotated peptides and proteins by ATI based on mass spectrometry and a virtual peptide library deduced from both strands of coding regions within the Arabidopsis genome. The results enhance our understanding of AS and alternative translation mechanisms under normal conditions, and in response to ABA treatment. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Gan, W; Song, Q; Zhang, N N; Xiong, X P; Wang, D M C; Li, L
2015-06-18
The fat mass and obesity-associated gene (FTO) is an excellent candidate gene that affects energy metabolism. Single nucleotide polymorphisms (SNPs) in FTO are associated with carcass and meat quality traits in pigs, cattle, and rabbits. The aim of this study was to investigate the association between novel SNPs in the FTO coding region and carcass and meat quality traits in 95 crossbred ducks, using DNA sequencing. We found two transitions G/A (SNP 387 and 473) within exon 3. SNP 387 was a synonymous mutation, whereas SNP 473 was a missense mutation. Association analysis suggested that SNP g.387G>A was significantly associated with all of the carcass traits measured, the intramuscular fat content (IMF), cooking yield (CY), pH values 45 min after slaughter (pH45m), drip losses from the breast muscle, and the leg muscle (P < 0.05). For SNP g.473G>A, the genotype AA exhibited greater leg muscle weight than the genotypes GG or AG (P < 0.05). The D value suggested that the two SNPs exhibited strong linkage disequilibrium. Three haplotypes (G1G2, G1A2, and A1A2) were significantly associated with IMF, CY, the a* value, and all of the carcass traits measured (P < 0.05). The results suggest that FTO is a candidate locus that affects carcass and meat quality traits in ducks.
Nlend, Rachel Nlend; Meyer, Kathrin
2010-01-01
Recent analyses of complete genomes have revealed that alternative splicing became more prevalent and important during eukaryotic evolution. Alternative splicing augments the protein repertoire—particularly that of the human genome—and plays an important role in the development and function of differentiated cell types. However, splicing is also extremely vulnerable, and defects in the proper recognition of splicing signals can give rise to a variety of diseases. In this review, we discuss splicing correction therapies, by using the inherited disease Spinal Muscular Atrophy (SMA) as an example. This lethal early childhood disorder is caused by deletions or other severe mutations of SMN1, a gene coding for the essential survival of motoneurons protein. A second gene copy present in humans and few non-human primates, SMN2, can only partly compensate for the defect because of a single nucleotide change in exon 7 that causes this exon to be skipped in the majority of mRNAs. Thus SMN2 is a prime therapeutic target for SMA. In recent years, several strategies based on small molecule drugs, antisense oligonucleotides or in vivo expressed RNAs have been developed that allow a correction of SMN2 splicing. For some of these, a therapeutic benefit has been demonstrated in mouse models for SMA. This means that clinical trials of such splicing therapies for SMA may become possible in the near future. PMID:20523126
Rademakers, Rosa; Cruts, Marc; Sleegers, Kristel; Dermaut, Bart; Theuns, Jessie; Aulchenko, Yurii; Weckx, Stefan; De Pooter, Tim; Van den Broeck, Marleen; Corsmit, Ellen; De Rijk, Peter; Del-Favero, Jurgen; van Swieten, John; van Duijn, Cornelia M; Van Broeckhoven, Christine
2005-10-01
We obtained conclusive linkage of Alzheimer disease (AD) with a candidate region of 19.7 cM at 7q36 in an extended multiplex family, family 1270, ascertained in a population-based study of early-onset AD in the northern Netherlands. Single-nucleotide polymorphism and haplotype association analyses of a Dutch patient-control sample further supported the linkage at 7q36. In addition, we identified a shared haplotype at 7q36 between family 1270 and three of six multiplex AD-affected families from the same geographical region, which is indicative of a founder effect and defines a priority region of 9.3 cM. Mutation analysis of coding exons of 29 candidate genes identified one linked synonymous mutation, g.38030G-->C in exon 10, that affected codon 626 of the PAX transactivation domain interacting protein gene (PAXIP1). It remains to be determined whether PAXIP1 has a functional role in the expression of AD in family 1270 or whether another mutation at this locus explains the observed linkage and sharing. Together, our linkage data from the informative family 1270 and the association data in the population-based early-onset AD patient-control sample strongly support the identification of a novel AD locus at 7q36 and re-emphasize the genetic heterogeneity of AD.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schriner, J.E.; Yi, W.; Hofmann, S.L.
Palmitoyl-protein thioesterase (PPT) is a small glycoprotein that removes palmitate groups from cysteine residues in lipid-modified proteins. We recently reported mutations in PPT in patients with infantile neuronal ceroid lipofuscinosis (INCL), a severe neurodegenerative disorder. INCL is characterized by the accumulation of proteolipid storage material in brain and other tissues, suggesting that the disease is a consequence of abnormal catabolism of acylated proteins. In the current paper, we report the sequence of the human PPT cDNA and the structure of the human PPT gene. The cDNA predicts a protein of 306 amino acids that contains a 25-amino-acid signal peptide, threemore » N-linked glycosylation sites, and consensus motifs characteristic of thioesterases. Northern analysis of a human tissue blot revealed ubiquitous expression of a single 2.5-kb mRNA, with highest expression in lung, brain, and heart. The human PPT gene spans 25 kb and is composed of seven coding exons and a large eighth exon, containing the entire 3{prime}-untranslated region of 1388 bp. An Alu repeat and promoter elements corresponding to putative binding sites for several general transcription factors were identified in the 1060 nucleotides upstream of the transcription start site. The human PPT cDNA sequence and gene structure will provide the means for the identification of further causative mutations in INCL and facilitate genetic screening in selected high-risk populations. 31 refs., 5 figs., 1 tab.« less
Detection limit of intragenic deletions with targeted array comparative genomic hybridization
2013-01-01
Background Pathogenic mutations range from single nucleotide changes to deletions or duplications that encompass a single exon to several genes. The use of gene-centric high-density array comparative genomic hybridization (aCGH) has revolutionized the detection of intragenic copy number variations. We implemented an exon-centric design of high-resolution aCGH to detect single- and multi-exon deletions and duplications in a large set of genes using the OGT 60 K and 180 K arrays. Here we describe the molecular characterization and breakpoint mapping of deletions at the smaller end of the detectable range in several genes using aCGH. Results The method initially implemented to detect single to multiple exon deletions, was able to detect deletions much smaller than anticipated. The selected deletions we describe vary in size, ranging from over 2 kb to as small as 12 base pairs. The smallest of these deletions are only detectable after careful manual review during data analysis. Suspected deletions smaller than the detection size for which the method was optimized, were rigorously followed up and confirmed with PCR-based investigations to uncover the true detection size limit of intragenic deletions with this technology. False-positive deletion calls often demonstrated single nucleotide changes or an insertion causing lower hybridization of probes demonstrating the sensitivity of aCGH. Conclusions With optimizing aCGH design and careful review process, aCGH can uncover intragenic deletions as small as dozen bases. These data provide insight that will help optimize probe coverage in array design and illustrate the true assay sensitivity. Mapping of the breakpoints confirms smaller deletions and contributes to the understanding of the mechanism behind these events. Our knowledge of the mutation spectra of several genes can be expected to change as previously unrecognized intragenic deletions are uncovered. PMID:24304607
Origins of genes: "big bang" or continuous creation?
Keese, P K; Gibbs, A
1992-01-01
Many protein families are common to all cellular organisms, indicating that many genes have ancient origins. Genetic variation is mostly attributed to processes such as mutation, duplication, and rearrangement of ancient modules. Thus it is widely assumed that much of present-day genetic diversity can be traced by common ancestry to a molecular "big bang." A rarely considered alternative is that proteins may arise continuously de novo. One mechanism of generating different coding sequences is by "overprinting," in which an existing nucleotide sequence is translated de novo in a different reading frame or from noncoding open reading frames. The clearest evidence for overprinting is provided when the original gene function is retained, as in overlapping genes. Analysis of their phylogenies indicates which are the original genes and which are their informationally novel partners. We report here the phylogenetic relationships of overlapping coding sequences from steroid-related receptor genes and from tymovirus, luteovirus, and lentivirus genomes. For each pair of overlapping coding sequences, one is confined to a single lineage, whereas the other is more widespread. This suggests that the phylogenetically restricted coding sequence arose only in the progenitor of that lineage by translating an out-of-frame sequence to yield the new polypeptide. The production of novel exons by alternative splicing in thyroid receptor and lentivirus genes suggests that introns can be a valuable evolutionary source for overprinting. New genes and their products may drive major evolutionary changes. PMID:1329098
RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease.
Xiong, Hui Y; Alipanahi, Babak; Lee, Leo J; Bretschneider, Hannes; Merico, Daniele; Yuen, Ryan K C; Hua, Yimin; Gueroussov, Serge; Najafabadi, Hamed S; Hughes, Timothy R; Morris, Quaid; Barash, Yoseph; Krainer, Adrian R; Jojic, Nebojsa; Scherer, Stephen W; Blencowe, Benjamin J; Frey, Brendan J
2015-01-09
To facilitate precision medicine and whole-genome annotation, we developed a machine-learning technique that scores how strongly genetic variants affect RNA splicing, whose alteration contributes to many diseases. Analysis of more than 650,000 intronic and exonic variants revealed widespread patterns of mutation-driven aberrant splicing. Intronic disease mutations that are more than 30 nucleotides from any splice site alter splicing nine times as often as common variants, and missense exonic disease mutations that have the least impact on protein function are five times as likely as others to alter splicing. We detected tens of thousands of disease-causing mutations, including those involved in cancers and spinal muscular atrophy. Examination of intronic and exonic variants found using whole-genome sequencing of individuals with autism revealed misspliced genes with neurodevelopmental phenotypes. Our approach provides evidence for causal variants and should enable new discoveries in precision medicine. Copyright © 2015, American Association for the Advancement of Science.
Lu, Jiamiao; Williams, James A.; Luke, Jeremy; Zhang, Feijie; Chu, Kirk; Kay, Mark A.
2017-01-01
We previously developed a mini-intronic plasmid (MIP) expression system in which the essential bacterial elements for plasmid replication and selection are placed within an engineered intron contained within a universal 5′ UTR noncoding exon. Like minicircle DNA plasmids (devoid of bacterial backbone sequences), MIP plasmids overcome transcriptional silencing of the transgene. However, in addition MIP plasmids increase transgene expression by 2 and often >10 times higher than minicircle vectors in vivo and in vitro. Based on these findings, we examined the effects of the MIP intronic sequences in a recombinant adeno-associated virus (AAV) vector system. Recombinant AAV vectors containing an intron with a bacterial replication origin and bacterial selectable marker increased transgene expression by 40 to 100 times in vivo when compared with conventional AAV vectors. Therefore, inclusion of this noncoding exon/intron sequence upstream of the coding region can substantially enhance AAV-mediated gene expression in vivo. PMID:27903072
DOE Office of Scientific and Technical Information (OSTI.GOV)
Steinlein, O.; Weiland, S.; Stoodt, J.
1996-03-01
The human neuronal nicotinic acetylcholine receptor {alpha}4 subunit gene (CHRNA4) is located in the candidate region for three different phenotypes: benign familial neonatal convulsions, autosomal dominant nocturnal frontal lobe epilepsy, and low-voltage EEG. Recently, a missense mutation in transmembrane domain 2 of CHRNA4 was found to be associated with autosomal dominant nocturnal frontal lobe epilepsy in one extended pedigree. We have determined the genomic organization of CHRNA4, which consists of six exons distributed over approximately 17 kb of genomic DNA. The nucleotide sequence obtained from the genomic regions adjacent to the exon boundaries enabled us to develop a set ofmore » primer pairs for PCR amplification of the complete coding region. The sequence analysis provides the basis for a comprehensive mutation screening of CHRNA4 in the above-mentioned phenotypes and possibly in other types of idopathic epilepsies. 29 refs., 3 figs., 1 tab.« less
Nicolas, Francisco Esteban; Moxon, Simon; de Haro, Juan P.; Calo, Silvia; Grigoriev, Igor V.; Torres-Martínez, Santiago; Moulton, Vincent; Ruiz-Vázquez, Rosa M.; Dalmay, Tamas
2010-01-01
Endogenous short RNAs (esRNAs) play diverse roles in eukaryotes and usually are produced from double-stranded RNA (dsRNA) by Dicer. esRNAs are grouped into different classes based on biogenesis and function but not all classes are present in all three eukaryotic kingdoms. The esRNA register of fungi is poorly described compared to other eukaryotes and it is not clear what esRNA classes are present in this kingdom and whether they regulate the expression of protein coding genes. However, evidence that some dicer mutant fungi display altered phenotypes suggests that esRNAs play an important role in fungi. Here, we show that the basal fungus Mucor circinelloides produces new classes of esRNAs that map to exons and regulate the expression of many protein coding genes. The largest class of these exonic-siRNAs (ex-siRNAs) are generated by RNA-dependent RNA Polymerase 1 (RdRP1) and dicer-like 2 (DCL2) and target the mRNAs of protein coding genes from which they were produced. Our results expand the range of esRNAs in eukaryotes and reveal a new role for esRNAs in fungi. PMID:20427422
Gene analysis of steroid 5 alpha-reductase 1 in hyperandrogenic women.
Eminović, Izet; Komel, Radovan; Prezelj, Janez; Karamehić, Jasenko; Gavrankapetanović, Faris; Heljić, Becir
2005-08-01
To examine the gene encoding for 5alpha-reductase type 1 in hyperandrogenic women, and assess the association of its eventual mutations or polymorphisms with the development of the hyperandrogenic female pattern. Sixteen hyperandrogenic women were included in the study. Single-stranded conformation polymorphism analysis (SSCP) and DNA sequencing were performed after polymerase chain reaction amplification of each of the 5 exons of the SRD5A1 gene in both hyperandrogenic and control group (16 participants). Sequence analysis identified the existence of many polymorphisms; in codon 24 of exon 1, GGC (Gly) into GAC (Asp); in codon 30 of exon 1, CGG (Arg) into CGC (Arg); in exon 3 codon 169, ACA to ACG (both encoding for threonine); in exon 5, AGA to AGG (both encoding for arginine, codon 260); and T/C polymorphism in intron 2. Polymorphisms were found in both groups. Polymorphisms of SRD5A1 gene were the same in both hyperandrogenic and healthy women, indicating no significant associations of genetic polymorphisms/variations of SRD5A1 gene with clinical manifestations of hyperandrogenic disorders in women.
Expression of exon-8-skipped kindlin-1 does not compensate for defects of Kindler syndrome.
Natsuga, Ken; Nishie, Wataru; Shinkuma, Satoru; Nakamura, Hideki; Matsushima, Yoichiro; Tatsuta, Aya; Komine, Mayumi; Shimizu, Hiroshi
2011-01-01
Kindler syndrome (KS) is a rare, inherited skin disease characterized by blister formation and generalized poikiloderma. Mutations in KIND1, which encodes kindlin-1, are responsible for KS. c.1089del/1089+1del is a recurrent splice-site deletion mutation in KS patients. To elucidate the effects of c.1089del/1089+1del at the mRNA and protein level. Two KS patients with c.1089del/1089+1del were included in this study. Immunofluorescence analysis of KS skin samples using antibodies against the dermo-epidermal junction proteins was performed. Exon-trapping experiments were performed to isolate the mRNA sequences transcribed from genomic DNA harbouring c.1089del/1089+1del. β1 integrin activation in HeLa cells transfected with truncated KIND1 cDNA was analyzed. Immunofluorescence study showed positive expression of kindlin-1 in KS skin with c.1089del/1089+1del mutation. We identified the exon-8-skipped in-frame transcript as the main product among multiple splicing variants derived from that mutation. HeLa cells transfected with KIND1 cDNA without exon 8 showed impaired β1 integrin activation. Exon-8-coding amino acids are located in the FERM F2 domain, which is conserved among species, and the unstructured region between F2 and the pleckstrin homology domain. This study suggests that exon-8-skipped truncated kindlin-1 is functionally defective and does not compensate for the defects of KS, even though kindlin-1 expression in skin is positive. Copyright © 2010 Japanese Society for Investigative Dermatology. Published by Elsevier Ireland Ltd. All rights reserved.
Lenglet, Marion; Robriquet, Florence; Schwarz, Klaus; Camps, Carme; Couturier, Anne; Hoogewijs, David; Buffet, Alexandre; Knight, Samantha Jl; Gad, Sophie; Couvé, Sophie; Chesnel, Franck; Pacault, Mathilde; Lindenbaum, Pierre; Job, Sylvie; Dumont, Solenne; Besnard, Thomas; Cornec, Marine; Dreau, Helene; Pentony, Melissa; Kvikstad, Erika; Deveaux, Sophie; Burnichon, Nelly; Ferlicot, Sophie; Vilaine, Mathias; Mazzella, Jean-Michaël; Airaud, Fabrice; Garrec, Céline; Heidet, Laurence; Irtan, Sabine; Mantadakis, Elpis; Bouchireb, Karim; Debatin, Klaus-Michael; Redon, Richard; Bezieau, Stéphane; Bressac-de Paillerets, Brigitte; Teh, Bin Tean; Girodon, François; Randi, Maria-Luigia; Putti, Maria Caterina; Bours, Vincent; Van Wijk, Richard; Göthert, Joachim R; Kattamis, Antonis; Janin, Nicolas; Bento, Celeste; Taylor, Jenny C; Arlot-Bonnemains, Yannick; Richard, Stéphane; Gimenez-Roqueplo, Anne-Paule; Cario, Holger; Gardie, Betty
2018-06-11
Chuvash polycythemia is an autosomal recessive form of erythrocytosis associated with a homozygous p.Arg200Trp mutation in the von Hippel-Lindau (VHL) gene. Since this discovery, additional VHL mutations have been identified in patients with congenital erythrocytosis, in a homozygous or compound-heterozygous state. VHL is a major tumor suppressor gene, mutations in which were first described in patients presenting with von Hippel-Lindau disease, which is characterized by the development of highly vascularized tumors. Here, we identified a new VHL cryptic-exon (termed E1') deep in intron 1 that is naturally expressed in many tissues. More importantly, we identified mutations in E1' in seven families with erythrocytosis (one homozygous case and six compound-heterozygous cases with a mutation in E1' in addition to a mutation in VHL coding sequences) and in one large family with typical VHL disease but without any alteration in the other VHL exons. In this study we have shown that the mutations induced a dysregulation of the VHL splicing with excessive retention of E1' and are associated with a downregulation of VHL protein expression. In addition, we have demonstrated a pathogenic role for synonymous mutations in VHL-Exon 2 that alter splicing through E2-skipping in five families with erythrocytosis or VHL disease. In all the studied cases, the mutations differentially impact splicing, correlating with phenotype severity. This study demonstrates that cryptic-exon-retention or exon-skipping are new VHL alterations and reveals a novel complex splicing regulation of the VHL gene. These findings open new avenues for diagnosis and research into the VHL-related-hypoxia-signaling pathway. Copyright © 2018 American Society of Hematology.
Kaer, Kristel; Branovets, Jelena; Hallikma, Anni; Nigumann, Pilvi; Speek, Mart
2011-01-01
Background Transcriptional interference has been recently recognized as an unexpectedly complex and mostly negative regulation of genes. Despite a relatively few studies that emerged in recent years, it has been demonstrated that a readthrough transcription derived from one gene can influence the transcription of another overlapping or nested gene. However, the molecular effects resulting from this interaction are largely unknown. Methodology/Principal Findings Using in silico chromosome walking, we searched for prematurely terminated transcripts bearing signatures of intron retention or exonization of intronic sequence at their 3′ ends upstream to human L1 retrotransposons, protein-coding and noncoding nested genes. We demonstrate that transcriptional interference induced by intronic L1s (or other repeated DNAs) and nested genes could be characterized by intron retention, forced exonization and cryptic polyadenylation. These molecular effects were revealed from the analysis of endogenous transcripts derived from different cell lines and tissues and confirmed by the expression of three minigenes in cell culture. While intron retention and exonization were comparably observed in introns upstream to L1s, forced exonization was preferentially detected in nested genes. Transcriptional interference induced by L1 or nested genes was dependent on the presence or absence of cryptic splice sites, affected the inclusion or exclusion of the upstream exon and the use of cryptic polyadenylation signals. Conclusions/Significance Our results suggest that transcriptional interference induced by intronic L1s and nested genes could influence the transcription of the large number of genes in normal as well as in tumor tissues. Therefore, this type of interference could have a major impact on the regulation of the host gene expression. PMID:22022525
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.
Pauciullo, Alfredo; Erhardt, Georg
2015-01-01
In the present paper, we report for the first time the characterization of llama (Lama glama) caseins at transcriptomic and genetic level. A total of 288 casein clones transcripts were analysed from two lactating llamas. The most represented mRNA populations were those correctly assembled (85.07%) and they encoded for mature proteins of 215, 217, 187 and 162 amino acids respectively for the CSN1S1, CSN2, CSN1S2 and CSN3 genes. The exonic subdivision evidenced a structure made of 21, 9, 17 and 6 exons for the αs1-, β-, αs2- and κ-casein genes respectively. Exon skipping and duplication events were evidenced. Two variants A and B were identified in the αs1-casein gene as result of the alternative out-splicing of the exon 18. An additional exon coding for a novel esapeptide was found to be cryptic in the κ-casein gene, whereas one extra exon was found in the αs2-casein gene by the comparison with the Camelus dromedaries sequence. A total of 28 putative phosphorylated motifs highlighted a complex heterogeneity and a potential variable degree of post-translational modifications. Ninety-six polymorphic sites were found through the comparison of the lama casein cDNAs with the homologous camel sequences, whereas the first description and characterization of the 5’- and 3’-regulatory regions allowed to identify the main putative consensus sequences involved in the casein genes expression, thus opening the way to new investigations -so far- never achieved in this species. PMID:25923814
Pauciullo, Alfredo; Erhardt, Georg
2015-01-01
In the present paper, we report for the first time the characterization of llama (Lama glama) caseins at transcriptomic and genetic level. A total of 288 casein clones transcripts were analysed from two lactating llamas. The most represented mRNA populations were those correctly assembled (85.07%) and they encoded for mature proteins of 215, 217, 187 and 162 amino acids respectively for the CSN1S1, CSN2, CSN1S2 and CSN3 genes. The exonic subdivision evidenced a structure made of 21, 9, 17 and 6 exons for the αs1-, β-, αs2- and κ-casein genes respectively. Exon skipping and duplication events were evidenced. Two variants A and B were identified in the αs1-casein gene as result of the alternative out-splicing of the exon 18. An additional exon coding for a novel esapeptide was found to be cryptic in the κ-casein gene, whereas one extra exon was found in the αs2-casein gene by the comparison with the Camelus dromedaries sequence. A total of 28 putative phosphorylated motifs highlighted a complex heterogeneity and a potential variable degree of post-translational modifications. Ninety-six polymorphic sites were found through the comparison of the lama casein cDNAs with the homologous camel sequences, whereas the first description and characterization of the 5'- and 3'-regulatory regions allowed to identify the main putative consensus sequences involved in the casein genes expression, thus opening the way to new investigations -so far- never achieved in this species.
Veenstra, Jan A; Khammassi, Hela
2017-04-01
RYamides are arthropod neuropeptides with unknown function. In 2011 two RYamides were isolated from D. melanogaster as the ligands for the G-protein coupled receptor CG5811. The D. melanogaster gene encoding these neuropeptides is highly unusual, as there are four RYamide encoding exons in the current genome assembly, but an exon encoding a signal peptide is absent. Comparing the D. melanogaster gene structure with those from other species, including D. virilis, suggests that the gene is degenerating. RNAseq data from 1634 short sequence read archives at NCBI containing more than 34 billion spots yielded numerous individual spots that correspond to the RYamide encoding exons, of which a large number include the intron-exon boundary at the start of this exon. Although 72 different sequences have been spliced onto this RYamide encoding exon, none codes for the signal peptide of this gene. Thus, the RNAseq data for this gene reveal only noise and no signal. The very small quantities of peptide recovered during isolation and the absence of credible RNAseq data, indicates that the gene is very little expressed, while the RYamide gene structure in D. melanogaster suggests that it might be evolving into a pseudogene. Yet, the identification of the peptides it encodes clearly shows it is still functional. Using region specific antisera, we could localize numerous neurons and enteroendocrine cells in D. willistoni, D. virilis and D. pseudoobscura, but only two adult abdominal neurons in D. melanogaster. Those two neurons project to and innervate the rectal papillae, suggesting that RYamides may be involved in the regulation of water homeostasis. Copyright © 2017 Elsevier Ltd. All rights reserved.
Probing the Boundaries of Orthology: The Unanticipated Rapid Evolution of Drosophila centrosomin
Eisman, Robert C.; Kaufman, Thomas C.
2013-01-01
The rapid evolution of essential developmental genes and their protein products is both intriguing and problematic. The rapid evolution of gene products with simple protein folds and a lack of well-characterized functional domains typically result in a low discovery rate of orthologous genes. Additionally, in the absence of orthologs it is difficult to study the processes and mechanisms underlying rapid evolution. In this study, we have investigated the rapid evolution of centrosomin (cnn), an essential gene encoding centrosomal protein isoforms required during syncytial development in Drosophila melanogaster. Until recently the rapid divergence of cnn made identification of orthologs difficult and questionable because Cnn violates many of the assumptions underlying models for protein evolution. To overcome these limitations, we have identified a group of insect orthologs and present conserved features likely to be required for the functions attributed to cnn in D. melanogaster. We also show that the rapid divergence of Cnn isoforms is apparently due to frequent coding sequence indels and an accelerated rate of intronic additions and eliminations. These changes appear to be buffered by multi-exon and multi-reading frame maximum potential ORFs, simple protein folds, and the splicing machinery. These buffering features also occur in other genes in Drosophila and may help prevent potentially deleterious mutations due to indels in genes with large coding exons and exon-dense regions separated by small introns. This work promises to be useful for future investigations of cnn and potentially other rapidly evolving genes and proteins. PMID:23749319
PTEN/MMAC1 Mutations in Hepatocellular Carcinomas: Somatic Inactivation of Both Alleles in Tumors
Kawamura, Naoki; Nagai, Hisaki; Bando, Koichi; Koyama, Masaaki; Matsumoto, Satoshi; Tajiri, Takashi; Onda, Masahiko; Fujimoto, Jiro; Ueki, Takahiro; Konishi, Noboru; Shiba, Tadayoshi
1999-01-01
Allelic loss of loci on chromosome 10q occurs frequently in hepatocellular carcinomas. Somatic mutations of the PTEN/MMAC1 gene on this chromosome at 10q23 were recently identified in sporadic cancers of the uterus, brain, prostate and breast. To investigate the potential role of PTEN/MMAC1 gene in the genesis of hepatocellular carcinomas, we examined 96 tumors for allelic loss on 10q and also for subtle mutations anywhere within the coding region of PTEN/MMAC1 gene. Allelic loss was identified in 25 of the 89 (27%) tumors that were informative for polymorphic markers in the region. Somatic mutations were identified in five of those tumors: three frameshift mutations, a 1‐bp insertion at codon 83–84 in exon 4 and two 4‐bp deletions, both at codon 318–319 in exon 8; two C‐to‐G transversion mutation, both at ‐9 bp from the initiation codon in the 5’non‐coding region of exon 1. No missense mutation was observed in this panel of tumors. In most of the informative tumors carrying intragenic mutations of one allele, we were able to detect loss of heterozygosity as well. These findings suggest that two alleles of the PTEN/MMAC1 gene may be inactivated by a combination of intragenic point mutation on one allele and loss of chromosomal material on the other allele in some of these tumors. PMID:10363579
Lücke, S; Xu, G L; Palfi, Z; Cross, M; Bellofatto, V; Bindereif, A
1996-01-01
In trypanosomes mRNAs are generated through trans splicing. The spliced leader (SL) RNA, which donates the 5'-terminal mini-exon to each of the protein coding exons, plays a central role in the trans splicing process. We have established in vivo assays to study in detail trans splicing, cap4 modification, and RNP assembly of the SL RNA in the trypanosomatid species Leptomonas seymouri. First, we found that extensive sequences within the mini-exon are required for SL RNA function in vivo, although a conserved length of 39 nt is not essential. In contrast, the intron sequence appears to be surprisingly tolerant to mutation; only the stem-loop II structure is indispensable. The asymmetry of the sequence requirements in the stem I region suggests that this domain may exist in different functional conformations. Second, distinct mini-exon sequences outside the modification site are important for efficient cap4 formation. Third, all SL RNA mutations tested allowed core RNP assembly, suggesting flexible requirements for core protein binding. In sum, the results of our mutational analysis provide evidence for a discrete domain structure of the SL RNA and help to explain the strong phylogenetic conservation of the mini-exon sequence and of the overall SL RNA secondary structure; they also suggest that there may be certain differences between trans splicing in nematodes and trypanosomes. This approach provides a basis for studying RNA-RNA interactions in the trans spliceosome. Images PMID:8861965
Is a Genome a Codeword of an Error-Correcting Code?
Kleinschmidt, João H.; Silva-Filho, Márcio C.; Bim, Edson; Herai, Roberto H.; Yamagishi, Michel E. B.; Palazzo, Reginaldo
2012-01-01
Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction. PMID:22649495
Complex mosaic CDKL5 deletion with two distinct mutant alleles in a 4-year-old girl.
Boutry-Kryza, Nadia; Ville, Dorothée; Labalme, Audrey; Calender, Alain; Dupont, Jean-Michel; Touraine, Renaud; Edery, Patrick; des Portes, Vincent; Sanlaville, Damien; Lesca, Gaetan
2014-08-01
Mutations of the CDKL5 gene cause early epileptic encephalopathy. Patients manifest refractory epilepsy, beginning before the age of 3 months, which is associated with severe psychomotor delay and features that overlap with Rett syndrome. We report here a patient with mosaicism for CDKL5 exonic deletion, with the presence of two mutant alleles. The affected 4-year-old girl presented with infantile spasms, beginning at the age of 9 months, but subsequent progression of the disease was consistent with the classical CDKL5-related phenotype. A deletion of exons 17 and 18 was suspected on the basis of Multiplex Ligation Probe Amplification analysis, but unexpected results for cDNA analysis, which showed the presence of an abnormal transcript with the deletion of exon 18 only, led us to suspect that two distinct events might have occurred. We used custom array-CGH to determine the size and breakpoints of these deletions. Exon 18 was deleted from one of the abnormal alleles, and exon 17 was deleted from the other. A Fork Stalling and Template Switching (FoSTeS) mechanism was proposed to explain the two events, given the presence of regions of microhomology at the breakpoints. We propose here an original involvement of the FoSTeS mechanism to explain the co-occurrence of these two events in the CDKL5 gene in a single patient. This patient highlights the difficulties involved in the detection of such abnormalities, particularly when they occur in a mosaic state and involve two distinct mutational events in a single gene. © 2014 Wiley Periodicals, Inc.
Chen, Xiuhua; Qi, Xiling; Tan, Yanhong; Xu, Zhifang; Xu, Aining; Zhang, Linlin; Wang, Hongwei
2011-06-15
JAK2V617F mutation has been reported in 90% of patients with polycythemia vera (PV) and about 50% of patients with essential thromobocythemia (ET) and primary myelofibrosis (PMF). Recently, acquired mutations in the transmembrane-juxtamembrane region of MPL (MPLW515 mutations) have been reported in approximately 5% of JAK2V617F-negative PMF and about 1% of all cases of ET. MPL is the receptor for thrombopoietin that regulates the production of platelets by bone marrow. It is likely that some mutations more closely related to ET in MPL exon10 may have been missed by current assays. We inferred that there might be other mutations in MPL exon10 for MPN patients in addition to MPLW515 mutations. To investigate its mutation types and prevalence in Chinese patients with myeloproliferative neoplasms (MPN), we performed mutation detection on MPL exon10 in 103 JAK2V617F-negative MPN patients by single strand conformation polymorphism (SSCP) and allele-specific PCR (AS-PCR) combined with sequencing. As a result, one previously unrecognized MPL mutation (12-bp in-frame insertion) was identified in one patient with ET in addition to an MPLW515K mutation identified in one PMF patient. This confirms our hypothesis that BCR/ABL negative and JAK2V617F-negative MPN patients have other mutations besides W515 mutation in MPL exon10 and mutations other than single nucleotide exchange also exist. In addition, MPL mutation was associated with Chinese MPN patients. Copyright © 2011 Elsevier Inc. All rights reserved.
Gu, Wanjun; Gurguis, Christopher I.; Zhou, Jin J.; Zhu, Yihua; Ko, Eun-A.; Ko, Jae-Hong; Wang, Ting; Zhou, Tong
2015-01-01
Genetic variation arising from single nucleotide polymorphisms (SNPs) is ubiquitously found among human populations. While disease-causing variants are known in some cases, identifying functional or causative variants for most human diseases remains a challenging task. Rare SNPs, rather than common ones, are thought to be more important in the pathology of most human diseases. We propose that rare SNPs should be divided into two categories dependent on whether the minor alleles are derived or ancestral. Derived alleles are less likely to have been purified by evolutionary processes and may be more likely to induce deleterious effects. We therefore hypothesized that the rare SNPs with derived minor alleles would be more important for human diseases and predicted that these variants would have larger functional or structural consequences relative to the rare variants for which the minor alleles are ancestral. We systematically investigated the consequences of the exonic SNPs on protein function, mRNA structure, and translation. We found that the functional and structural consequences are more significant for the rare exonic variants for which the minor alleles are derived. However, this pattern is reversed when the minor alleles are ancestral. Thus, the rare exonic SNPs with derived minor alleles are more likely to be deleterious. Age estimation of rare SNPs confirms that these potentially deleterious SNPs are recently evolved in the human population. These results have important implications for understanding the function of genetic variations in human exonic regions and for prioritizing functional SNPs in genome-wide association studies of human diseases. PMID:26454016
Allelic combinations of promoter and exon 2 in DQB1 in dogs and wolves.
Berggren, Karin T; Seddon, Jennifer M
2008-07-01
Polymorphism of PBRs of the major histocompatibility complex (MHC) genes is well recognized, but the polymorphism also extends to proximal promoter regions. Examining DQB1 variability in dogs and wolves, we identified 7 promoter variants and 13 exon 2 alleles among 89 dogs, including a previously unknown DQB1 exon 2 allele, and 8 promoter variants and 9 exon 2 alleles among 85 wolves. As expected from previous studies and from a close chromosomal location, strong linkage disequilibrium was demonstrated in both wolves and dogs by having significantly fewer promoter/exon 2 combinations than expected from simulations of randomized data sets. Interestingly, we noticed weaker haplotypic associations in dogs than in wolves. Dogs had twice as many promoter/exon 2 combinations as wolves and an almost 2-fold difference in the number of exon 2 alleles per promoter variant. This difference was not caused by an admixture of breeds in our group of dogs because the high ratio of observed to expected number of haplotypes persisted within a single dog breed, the German Shepherd. Ewens-Watterson tests indicated that both the promoter and exon 2 are under the balancing selection, and both regions appear to be more recently derived in the dog than in the wolf. Hence, although reasons for the differences are unknown, they may relate to altered selection pressure on patterns of expression. Deviations from normal MHC expression patterns have been associated with autoimmune diseases, which occur frequently in several dog breeds. Further knowledge about these deviations may help us understand the source of such diseases.
Wang, Jinyan; Yang, Yuwen; Jin, Lamei; Ling, Xitie; Liu, Tingli; Chen, Tianzi; Ji, Yinghua; Yu, Wengui; Zhang, Baolong
2018-06-04
Long Noncoding-RNAs (LncRNAs) are known to be involved in some biological processes, but their roles in plant-virus interactions remain largely unexplored. While circular RNAs (circRNAs) have been studied in animals, there has yet to be extensive research on them in a plant system, especially in tomato-tomato yellow leaf curl virus (TYLCV) interaction. In this study, RNA transcripts from the susceptible tomato line JS-CT-9210 either infected with TYLCV or untreated, were sequenced in a pair-end strand-specific manner using ribo-zero rRNA removal library method. A total of 2056 lncRNAs including 1767 long intergenic non-coding RNA (lincRNAs) and 289 long non-coding natural antisense transcripts (lncNATs) were obtained. The expression patterns in lncRNAs were similar in susceptible tomato plants between control check (CK) and TYLCV infected samples. Our analysis suggested that lncRNAs likely played a role in a variety of functions, including plant hormone signaling, protein processing in the endoplasmic reticulum, RNA transport, ribosome function, photosynthesis, glulathione metabolism, and plant-pathogen interactions. Using virus-induced gene silencing (VIGS) analysis, we found that reduced expression of the lncRNA S-slylnc0957 resulted in enhanced resistance to TYLCV in susceptible tomato plants. Moreover, we identified 184 circRNAs candidates using the CircRNA Identifier (CIRI) software, of which 32 circRNAs were specifically expressed in untreated samples and 83 circRNAs in TYLCV samples. Approximately 62% of these circRNAs were derived from exons. We validated the circRNAs by both PCR and Sanger sequencing using divergent primers, and found that most of circRNAs were derived from the exons of protein coding genes. The silencing of these circRNAs parent genes resulted in decreased TYLCV virus accumulation. In this study, we identified novel lncRNAs and circRNAs using bioinformatic approaches and showed that these RNAs function as negative regulators of TYLCV infection. Moreover, the expression patterns of lncRNAs in susceptible tomato plants were different from that of resistant tomato plants, while exonic circRNAs expression positively associated with their respective protein coding genes. This work provides a foundation for elaborating the novel roles of lncRNAs and circRNAs in susceptible tomatoes following TYLCV infection.
Evaluation of non-coding variation in GLUT1 deficiency.
Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S
2016-12-01
Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.
Zhang, Xiaoli; Colleoni, Christophe; Ratushna, Vlada; Sirghie-Colleoni, Mirella; James, Martha G; Myers, Alan M
2004-04-01
Mutations in the maize gene sugary2 ( su2 ) affect starch structure and its resultant physiochemical properties in useful ways, although the gene has not been characterized previously at the molecular level. This study tested the hypothesis that su2 codes for starch synthase IIa (SSIIa). Two independent mutations of the su2 locus, su2-2279 and su2-5178 , were identified in a Mutator -active maize population. The nucleotide sequence of the genomic locus that codes for SSIIa was compared between wild type plants and those homozygous for either novel mutation. Plants bearing su2-2279 invariably contained a Mutator transposon in exon 3 of the SSIIa gene, and su2-5178 mutants always contained a small retrotransposon-like insertion in exon 10. Six allelic su2 (-) mutations conditioned loss or reduction in abundance of the SSIIa protein detected by immunoblot. These data indicate that su2 codes for SSIIa and that deficiency in this isoform is ultimately responsible for the altered physiochemical properties of su2 (-) mutant starches. A specific starch synthase isoform among several identified in soluble endosperm extracts was absent in su2-2279 or su2-5178 mutants, indicating that SSIIa is active in the soluble phase during kernel development. The immediate structural effect of the su2 (-) mutations was shown to be increased abundance of short glucan chains in amylopectin and a proportional decrease in intermediate length chains, similar to the effects of SSII deficiency in other species.
Grzes, M; Nowacka-Woszuk, J; Szczerbal, I; Czerwinska, J; Gracz, J; Switonski, M
2009-01-01
The gene encoding myostatin (MSTN), due to its crucial function for growth of skeletal muscle mass, is an important candidate for muscularity. In this study we analyzed the nucleotide sequence and FISH localization of this gene in 4 canids, including 3 farm species. The nucleotide sequence of the MSTN coding fragment turned out to be highly conserved, since its identity among the studied species was very high and varied between 99.4 and 99.7%. Only 1, widely spread, silent single nucleotide polymorphism (SNP) was found in exon 1 of the Chinese raccoon dog. The MSTN gene was localized close to the centromere in one-armed chromosomes of the dog (37q11) and bi-armed chromosomes of the red fox (16p11) and arctic fox (10q11), with an exception of the Chinese raccoon dog chromosome (2q14-q21). This chromosome is orthologous to 3 canine chromosomes and thus the MSTN was found more interstitially. Our results are in agreement with the hypothesis that karyotypes of the canids evolved mainly through centric fusion/fission events, while tandem fusions occurred rarely. (c) 2009 S. Karger AG, Basel.
Agouti sequence polymorphisms in coyotes, wolves and dogs suggest hybridization.
Schmutz, Sheila M; Berryere, Thomas G; Barta, Jodi L; Reddick, Kimberley D; Schmutz, Josef K
2007-01-01
Domestic dogs have been shown to have multiple alleles of the Agouti Signal Peptide (ASIP) in exon 4 and we wished to determine the level of polymorphism in the common wild canids of Canada, wolves and coyotes, in comparison. All Canadian coyotes and most wolves have banded hairs. The ASIP coding sequence of the wolf did not vary from the domestic dog but one variant was detected in exon 4 of coyotes that did not alter the arginine at this position. Two other differences were found in the sequence flanking exon 4 of coyotes compared with the 45 dogs and 1 wolf. The coyotes also demonstrated a relatively common polymorphism in the 3' UTR sequence that could be used for population studies. One of the ASIP alleles (R96C) in domestic dogs causes a solid black coat color in homozygotes. Although some wolves are melanistic, this phenotype does not appear to be caused by this same mutation. However, one wolf, potentially a dog-wolf hybrid or descendant thereof, was heterozygous for this allele. Likewise 2 coyotes, potentially dog-coyote or wolf-coyote hybrid descendants, were heterozygous for the several polymorphisms in and flanking exon 4. We could conclude that these were coyote-dog hybrids because both were heterozygous for 2 mutations causing fawn coat color in dogs.
Chograni, Manèl; Rejeb, Imen; Jemaa, Lamia Ben; Châabouni, Myriam; Bouhamed, Habiba Chaabouni
2011-01-01
Nance-Horan Syndrome (NHS) or X-linked cataract-dental syndrome is a disease of unknown gene action mechanism, characterized by congenital cataract, dental anomalies, dysmorphic features and, in some cases, mental retardation. We performed linkage analysis in a Tunisian family with NHS in which affected males and obligate carrier female share a common haplotype in the Xp22.32-p11.21 region that contains the NHS gene. Direct sequencing of NHS coding exons and flanking intronic sequences allowed us to identify the first missense mutation (P551S) and a reported SNP-polymorphism (L1319F) in exon 6, a reported UTR–SNP (c.7422 C>T) and a novel one (c.8239 T>A) in exon 8. Both variations P551S and c.8239 T>A segregate with NHS phenotype in this family. Although truncations, frame-shift and copy number variants have been reported in this gene, no missense mutations have been found to segregate previously. This is the first report of a missense NHS mutation causing NHS phenotype (including cardiac defects). We hypothesize also that the non-reported UTR–SNP of the exon 8 (3′-UTR) is specific to the Tunisian population. PMID:21559051
Chograni, Manèl; Rejeb, Imen; Jemaa, Lamia Ben; Châabouni, Myriam; Bouhamed, Habiba Chaabouni
2011-08-01
Nance-Horan Syndrome (NHS) or X-linked cataract-dental syndrome is a disease of unknown gene action mechanism, characterized by congenital cataract, dental anomalies, dysmorphic features and, in some cases, mental retardation. We performed linkage analysis in a Tunisian family with NHS in which affected males and obligate carrier female share a common haplotype in the Xp22.32-p11.21 region that contains the NHS gene. Direct sequencing of NHS coding exons and flanking intronic sequences allowed us to identify the first missense mutation (P551S) and a reported SNP-polymorphism (L1319F) in exon 6, a reported UTR-SNP (c.7422 C>T) and a novel one (c.8239 T>A) in exon 8. Both variations P551S and c.8239 T>A segregate with NHS phenotype in this family. Although truncations, frame-shift and copy number variants have been reported in this gene, no missense mutations have been found to segregate previously. This is the first report of a missense NHS mutation causing NHS phenotype (including cardiac defects). We hypothesize also that the non-reported UTR-SNP of the exon 8 (3'-UTR) is specific to the Tunisian population.
A rare male patient with classic Rett syndrome caused by MeCP2_e1 mutation.
Tokaji, Narumi; Ito, Hiromichi; Kohmoto, Tomohiro; Naruto, Takuya; Takahashi, Rizu; Goji, Aya; Mori, Tatsuo; Toda, Yoshihiro; Saito, Masako; Tange, Shoichiro; Masuda, Kiyoshi; Kagami, Shoji; Imoto, Issei
2018-03-01
Rett syndrome (RTT) is a severe neurodevelopmental disorder typically affecting females. It is mainly caused by loss-of-function mutations that affect the coding sequence of exon 3 or 4 of methyl-CpG-binding protein 2 (MECP2). Severe neonatal encephalopathy resulting in death before the age of 2 years is the most common phenotype observed in males affected by a pathogenic MECP2 variant. Mutations in MECP2 exon 1 affecting the MeCP2_e1 isoform are relatively rare causes of RTT in females, and only one case of a male patient with MECP2-related severe neonatal encephalopathy caused by a mutation in MECP2 exon 1 has been reported. This is the first reported case of a male with classic RTT caused by a 5-bp duplication in the open-reading frame of MECP2 exon 1 (NM_001110792.1:c.23_27dup) that introduced a premature stop codon [p.(Ser10Argfs*36)] in the MeCP2_e1 isoform, which has been reported in one female patient with classic RTT. Therefore, both males and females displaying at least some type of MeCP2_e1 mutation may exhibit the classic RTT phenotype. © 2018 Wiley Periodicals, Inc.
Liu, Hong Yan; Huang, Jia; Wang, Rui Li; Wang, Yue; Guo, Liang Jie; Li, Tao; Wu, Dong; Wang, Hong Dan; Guo, Qian Nan; Dong, Dao Quan
2016-11-01
Familial exudative vitreoretinopathy (FEVR) is a hereditary ocular disorder characterized by a failure of peripheral retinal vascularization. In this report, we describe a novel missense mutation of the Norrie disease gene (NDP) in a Chinese family with X-linked FEVR. Ophthalmologic evaluation was performed on four male patients and seven unaffected individuals after informed consent was obtained. Venous blood was collected from the 11 members of this family, and genomic DNA was extracted using standard methods. The coding exons 2 and 3 and their corresponding exon-intron junctions of NDP were amplified by polymerase chain reaction and then subjected to direct DNA sequencing. A novel missense mutation (c.310A>C) in exon 3, leading to a lysine-to-glutamine substitution at position 104 (p.Lys104Gln), was identified in all four patients with X-linked FEVR. Three unaffected female individuals (III2, IV3, and IV11) were found to be carriers of the mutation. This mutation was not detected in other unaffected individuals. The mutation c.310A>C (p.Lys104Gln) in exon 3 of NDP is associated with FEVR in the studied family. This result further enriches the mutation spectrum of FEVR. Copyright © 2016. Published by Elsevier Taiwan LLC.
PTESFinder: a computational method to identify post-transcriptional exon shuffling (PTES) events.
Izuogu, Osagie G; Alhasan, Abd A; Alafghani, Hani M; Santibanez-Koref, Mauro; Elliott, David J; Elliot, David J; Jackson, Michael S
2016-01-13
Transcripts, which have been subject to Post-transcriptional exon shuffling (PTES), have an exon order inconsistent with the underlying genomic sequence. These have been identified in a wide variety of tissues and cell types from many eukaryotes, and are now known to be mostly circular, cytoplasmic, and non-coding. Although there is no uniformly ascribed function, several have been shown to be involved in gene regulation. Accurate identification of these transcripts can, however, be difficult due to artefacts from a wide variety of sources. Here, we present a computational method, PTESFinder, to identify these transcripts from high throughput RNAseq data. Uniquely, it systematically excludes potential artefacts emanating from pseudogenes, segmental duplications, and template switching, and outputs both PTES and canonical exon junction counts to facilitate comparative analyses. In comparison with four existing methods, PTESFinder achieves highest specificity and comparable sensitivity at a variety of read depths. PTESFinder also identifies between 13 % and 41.6 % more structures, compared to publicly available methods recently used to identify human circular RNAs. With high sensitivity and specificity, user-adjustable filters that target known sources of false positives, and tailored output to facilitate comparison of transcript levels, PTESFinder will facilitate the discovery and analysis of these poorly understood transcripts.
Geiss, K T; Abbas, G M; Makaroff, C A
1994-04-01
The mitochondrial gene coding for subunit 4 of the NADH dehydrogenase complex I (nad4) has been isolated and characterized from lettuce, Lactuca sativa. Analysis of nad4 genes in a number of plants by Southern hybridization had previously suggested that the intron content varied between species. Characterization of the lettuce gene confirms this observation. Lettuce nad4 contains two exons and one group IIA intron, whereas previously sequenced nad4 genes from turnip and wheat contain three group IIA introns. Northern analysis identified a transcript of 1600 nucleotides, which represents the mature nad4 mRNA and a primary transcript of 3200 nucleotides. Sequence analysis of lettuce and turnip nad4 cDNAs was used to confirm the intron/exon border sequences and to examine RNA editing patterns. Editing is observed at the 5' and 3' ends of the lettuce transcript, but is absent from sequences that correspond to exons two, three and the 5' end of exon four in turnip and wheat. In contrast, turnip transcripts are highly edited in this region, suggesting that homologous recombination of an edited and spliced cDNA intermediate was involved in the loss of introns two and three from an ancestral lettuce nad4 gene.
Lunova, Mariia; Guldiken, Nurdan; Lienau, Tim C.; Stickel, Felix; Omary, M. Bishr
2012-01-01
Background Keratins 8 and 18 (K8/K18) are intermediate filament proteins that protect the liver from various forms of injury. Exonic K8/K18 variants associate with adverse outcome in acute liver failure and with liver fibrosis progression in patients with chronic hepatitis C infection or primary biliary cirrhosis. Given the association of K8/K18 variants with end-stage liver disease and progression in several chronic liver disorders, we studied the importance of keratin variants in patients with hemochromatosis. Methods The entire K8/K18 exonic regions were analyzed in 162 hemochromatosis patients carrying homozygous C282Y HFE (hemochromatosis gene) mutations. 234 liver-healthy subjects were used as controls. Exonic regions were PCR-amplified and analyzed using denaturing high-performance liquid chromatography and DNA sequencing. Previously-generated transgenic mice overexpressing K8 G62C were studied for their susceptibility to iron overload. Susceptibility to iron toxicity of primary hepatocytes that express K8 wild-type and G62C was also assessed. Results We identified amino-acid-altering keratin heterozygous variants in 10 of 162 hemochromatosis patients (6.2%) and non-coding heterozygous variants in 6 additional patients (3.7%). Two novel K8 variants (Q169E/R275W) were found. K8 R341H was the most common amino-acid altering variant (4 patients), and exclusively associated with an intronic KRT8 IVS7+10delC deletion. Intronic, but not amino-acid-altering variants associated with the development of liver fibrosis. In mice, or ex vivo, the K8 G62C variant did not affect iron-accumulation in response to iron-rich diet or the extent of iron-induced hepatocellular injury. Conclusion In patients with hemochromatosis, intronic but not exonic K8/K18 variants associate with liver fibrosis development. PMID:22412904
Barley whole exome capture: a tool for genomic research in the genus Hordeum and beyond
Mascher, Martin; Richmond, Todd A; Gerhardt, Daniel J; Himmelbach, Axel; Clissold, Leah; Sampath, Dharanya; Ayling, Sarah; Steuernagel, Burkhard; Pfeifer, Matthias; D'Ascenzo, Mark; Akhunov, Eduard D; Hedley, Pete E; Gonzales, Ana M; Morrell, Peter L; Kilian, Benjamin; Blattner, Frank R; Scholz, Uwe; Mayer, Klaus FX; Flavell, Andrew J; Muehlbauer, Gary J; Waugh, Robbie; Jeddeloh, Jeffrey A; Stein, Nils
2013-01-01
Advanced resources for genome-assisted research in barley (Hordeum vulgare) including a whole-genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole-genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA-coding exome reduces barley genomic complexity more than 50-fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in-solution hybridization-based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full-length cDNAs and de novo assembled RNA-Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA-coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping-by-sequencing and genetic diversity analyzes. PMID:23889683
Genome-wide Discovery of Circular RNAs in the Leaf and Seedling Tissues of Arabidopsis Thaliana
Dou, Yongchao; Li, Shengjun; Yang, Weilong; Liu, Kan; Du, Qian; Ren, Guodong; Yu, Bin; Zhang, Chi
2017-01-01
Background: Recently, identification and functional studies of circular RNAs, a type of non-coding RNAs arising from a ligation of 3’ and 5’ ends of a linear RNA molecule, were conducted in mammalian cells with the development of RNA-seq technology. Method: Since compared with animals, studies on circular RNAs in plants are less thorough, a genome-wide identification of circular RNA candidates in Arabidopsis was conducted with our own developed bioinformatics tool to several existing RNA-seq datasets specifically for non-coding RNAs. Results: A total of 164 circular RNA candidates were identified from RNA-seq data, and 4 circular RNA transcripts, including both exonic and intronic circular RNAs, were experimentally validated. Interestingly, our results show that circular RNA transcripts are enriched in the photosynthesis system for the leaf tissue and correlated to the higher expression levels of their parent genes. Sixteen out of all 40 genes that have circular RNA candidates are related to the photosynthesis system, and out of the total 146 exonic circular RNA candidates, 63 are found in chloroplast. PMID:29081691
Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R
1997-04-28
We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.
Analysis of alterative cleavage and polyadenylation by 3′ region extraction and deep sequencing
Hoque, Mainul; Ji, Zhe; Zheng, Dinghai; Luo, Wenting; Li, Wencheng; You, Bei; Park, Ji Yeon; Yehia, Ghassan; Tian, Bin
2012-01-01
Alternative cleavage and polyadenylation (APA) leads to mRNA isoforms with different coding sequences (CDS) and/or 3′ untranslated regions (3′UTRs). Using 3′ Region Extraction And Deep Sequencing (3′READS), a method which addresses the internal priming and oligo(A) tail issues that commonly plague polyA site (pA) identification, we comprehensively mapped pAs in the mouse genome, thoroughly annotating 3′ ends of genes and revealing over five thousand pAs (~8% of total) flanked by A-rich sequences, which have hitherto been overlooked. About 79% of mRNA genes and 66% of long non-coding RNA (lncRNA) genes have APA; but these two gene types have distinct usage patterns for pAs in introns and upstream exons. Promoter-distal pAs become relatively more abundant during embryonic development and cell differentiation, a trend affecting pAs in both 3′-most exons and upstream regions. Upregulated isoforms generally have stronger pAs, suggesting global modulation of the 3′ end processing activity in development and differentiation. PMID:23241633
Characterization of an Equine α-S2-Casein Variant Due to a 1.3 kb Deletion Spanning Two Coding Exons
Brinkmann, Julia; Koudelka, Tomas; Keppler, Julia K.; Tholey, Andreas; Schwarz, Karin; Thaller, Georg; Tetens, Jens
2015-01-01
The production and consumption of mare’s milk in Europe has gained importance, mainly based on positive health effects and a lower allergenic potential as compared to cows’ milk. The allergenicity of milk is to a certain extent affected by different genetic variants. In classical dairy species, much research has been conducted into the genetic variability of milk proteins, but the knowledge in horses is scarce. Here, we characterize two major forms of equine αS2-casein arising from genomic 1.3 kb in-frame deletion involving two coding exons, one of which represents an equid specific duplication. Findings at the DNA-level have been verified by cDNA sequencing from horse milk of mares with different genotypes. At the protein-level, we were able to show by SDS-page and in-gel digestion with subsequent LC-MS analysis that both proteins are actually expressed. The comparison with published sequences of other equids revealed that the deletion has probably occurred before the ancestor of present-day asses and zebras diverged from the horse lineage. PMID:26444874
Evaluation of the phospholamban gene in purebred large-breed dogs with dilated cardiomyopathy.
Stabej, Polona; Leegwater, Peter A; Stokhof, Arnold A; Domanjko-Petric, Aleksandra; van Oost, Bernard A
2005-03-01
To evaluate the role of the phospholamban gene in purebred large-breed dogs with dilated cardiomyopathy (DCM). 6 dogs with DCM, including 2 Doberman Pinschers, 2 Newfoundlands, and 2 Great Danes. All dogs had clinical signs of congestive heart failure, and a diagnosis of DCM was made on the basis of echocardiographic findings. Blood samples were collected from each dog, and genomic DNA was isolated by a salt extraction method. Specific oligonucleotides were designed to amplify the promoter, exon 1, the 5'-part of exon 2 including the complete coding region, and part of intron 1 of the canine phospholamban gene via polymerase chain reaction procedures. These regions were screened for mutations in DNA obtained from the 6 dogs with DCM. No mutations were identified in the promoter, 5' untranslated region, part of intron 1, part of the 3' untranslated region, and the complete coding region of the phospholamban gene in dogs with DCM. Results indicate that mutations in the phospholamban gene are not a frequent cause of DCM in Doberman Pinschers, Newfoundlands, and Great Danes.
Daher, Tamas; Tur, Mehmet Kemal; Brobeil, Alexander; Etschmann, Benjamin; Witte, Biruta; Engenhart-Cabillic, Rita; Krombach, Gabriele; Blau, Wolfgang; Grimminger, Friedrich; Seeger, Werner; Klussmann, Jens Peter; Bräuninger, Andreas; Gattenlöhner, Stefan
2018-06-01
In head and neck squamous cell carcinoma (HNSCC), the occurrence of concurrent lung malignancies poses a significant diagnostic challenge because metastatic HNSCC is difficult to discern from second primary lung squamous cell carcinoma (SCC). However, this differentiation is crucial because the recommended treatments for metastatic HNSCC and second primary lung SCC differ profoundly. We analyzed the origin of lung tumors in 32 patients with HNSCC using human papillomavirus (HPV) typing and targeted next generation sequencing of all coding exons of tumor protein 53 (TP53). Lung tumors were clearly identified as HNSCC metastases or second primary tumors in 29 patients, thus revealing that 16 patients had received incorrect diagnoses based on clinical and morphological data alone. The HPV typing and mutation analysis of all TP53 coding exons is a valuable diagnostic tool in patients with HNSCC and concurrent lung SCC, which can help to ensure that patients receive the most suitable treatment. © 2018 Wiley Periodicals, Inc.
Dai, Hanjun; Zhang, Xiaohui; Zhao, Xin; Deng, Ting; Dong, Bing; Wang, Jingzhao; Li, Yang
2008-01-01
Usher syndrome type II (USH2) is the most common form of Usher syndrome, an autosomal recessive disorder characterized by moderate to severe hearing loss, postpuberal onset of retinitis pigmentosa (RP), and normal vestibular function. Mutations in the USH2A gene have been shown to be responsible for most cases of USH2. To further elucidate the role of USH2A in USH2, mutation screening was undertaken in three Chinese families with USH2. Three unrelated Chinese families, consisting of six patients and 10 unaffected relatives, were examined clinically, and 100 normal Chinese individuals served as controls. Genomic DNA was extracted from the venous blood of all participants. The coding region (exons 2-72), including the intron-exon boundary of USH2A, was amplified by polymerase chain reaction (PCR). The PCR products amplified from the three probands were analyzed using direct sequencing to screen sequence variants. Whenever substitutions were identified in a patient, restriction fragment length polymorphism analysis, or single strand conformation polymorphism analysis was performed on all available family members and the control group. Fundus examination revealed typical fundus features of RP, including narrowing of the vessels, bone-speckle pigmentation, and waxy optic discs. The ERG wave amplitudes of three probands were undetectable. Audiometric tests indicated moderate to severe sensorineural hearing impairment. Vestibular function was normal. Five novel mutations (one small insertion, one small deletion, one nonsense, one missense, and one splice site) were detected in three families after sequence analysis of USH2A. Of the five mutations, four were located in exons 22-72, specific to the long isoform of USH2A. The mutations found in our study broaden the spectrum of USH2A mutations. Our results further indicate that the long isoform of USH2A may harbor even more mutations of the USH2A gene.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bonaventure, J.; Lasselin, C.; Toutain, A.
1994-09-01
The Stickler syndrome is an arthro-ophthalmopathy which associates progressive myopia with vitreal degeneration and retinal detachment. Cleft palate, cranio-facial abnormalities, deafness and osteoarthritis are often associated symptoms. Genetic heterogeneity of this autosomal dominant disease was consistent with its large clinical variability. Linkage studies have provided evidence for cosegregation of the disease with COL2A1, the gene coding for type II collagen, in about 50% of the families. Four additional families are reported here. Linkage analyses by using a VNTR located in the 3{prime} region of the gene were achieved. In three families, positive lod scores were obtained with a cumulative maximalmore » value of 3.5 at a recombination fraction of 0. In one of these families, single strand conformation analysis of 25 exons disclosed a new mutation in exon 42. Codon for glutamic acid at position a1-803 was converted into a stop codon. The mutation was detected in DNA samples from all the affected members of the family but not in the unaffected. This result confirms that most of the Stickler syndromes linked to COL2A1 are due to premature stop codons. In a second family, an abnormal SSCP pattern of exon 34 was detected in all the affected individuals. The mutation is likely to correspond to a splicing defect in the acceptor site of intron 33. In one family the disease did not segregate with the COL2A1 locus. Further linkage studies with intragenic dimorphic sites in the COL10A1 gene and highly polymorphic markers close to the COL9A1 locus indicated that this disorder did not result from defects in these two genes.« less
Truncating mutations in the last exon of NOTCH3 cause lateral meningocele syndrome.
Gripp, Karen W; Robbins, Katherine M; Sobreira, Nara L; Witmer, P Dane; Bird, Lynne M; Avela, Kristiina; Makitie, Outi; Alves, Daniela; Hogue, Jacob S; Zackai, Elaine H; Doheny, Kimberly F; Stabley, Deborah L; Sol-Church, Katia
2015-02-01
Lateral meningocele syndrome (LMS, OMIM%130720), also known as Lehman syndrome, is a very rare skeletal disorder with facial anomalies, hypotonia and meningocele-related neurologic dysfunction. The characteristic lateral meningoceles represent the severe end of the dural ectasia spectrum and are typically most severe in the lower spine. Facial features of LMS include hypertelorism and telecanthus, high arched eyebrows, ptosis, midfacial hypoplasia, micrognathia, high and narrow palate, low-set ears and a hypotonic appearance. Hyperextensibility, hernias and scoliosis reflect a connective tissue abnormality, and aortic dilation, a high-pitched nasal voice, wormian bones and osteolysis may be present. Lateral meningocele syndrome has phenotypic overlap with Hajdu-Cheney syndrome. We performed exome resequencing in five unrelated individuals with LMS and identified heterozygous truncating NOTCH3 mutations. In an additional unrelated individual Sanger sequencing revealed a deleterious variant in the same exon 33. In total, five novel de novo NOTCH3 mutations were identified in six unrelated patients. One had a 26 bp deletion (c.6461_6486del, p.G2154fsTer78), two carried the same single base pair insertion (c.6692_93insC, p.P2231fsTer11), and three individuals had a nonsense point mutation at c.6247A > T (pK2083*), c.6663C > G (p.Y2221*) or c.6732C > A, (p.Y2244*). All mutations cluster into the last coding exon, resulting in premature termination of the protein and truncation of the negative regulatory proline-glutamate-serine-threonine rich PEST domain. Our results suggest that mutant mRNA products escape nonsense mediated decay. The truncated NOTCH3 may cause gain-of-function through decreased clearance of the active intracellular product, resembling NOTCH2 mutations in the clinically related Hajdu-Cheney syndrome and contrasting the NOTCH3 missense mutations causing CADASIL. © 2014 Wiley Periodicals, Inc.
First Report of a Single Exon Deletion in TCOF1 Causing Treacher Collins Syndrome
Beygo, J.; Buiting, K.; Seland, S.; Lüdecke, H.-J.; Hehr, U.; Lich, C.; Prager, B.; Lohmann, D.R.; Wieczorek, D.
2012-01-01
Treacher Collins syndrome (TCS) is a rare craniofacial disorder characterized by facial anomalies and ear defects. TCS is caused by mutations in the TCOF1 gene and follows autosomal dominant inheritance. Recently, mutations in the POLR1D and POLR1C genes have also been identified to cause TCS. However, in a subset of patients no causative mutation could be found yet. Inter- and intrafamilial phenotypic variability is high as is the variety of mainly family-specific mutations identified throughout TCOF1. No obvious correlation between pheno- and genotype could be observed. The majority of described point mutations, small insertions and deletions comprising only a few nucleotides within TCOF1 lead to a premature termination codon. We investigated a cohort of 112 patients with a tentative clinical diagnosis of TCS by multiplex ligation-dependent probe amplification (MLPA) to search for larger deletions not detectable with other methods used. All patients were selected after negative screening for mutations in TCOF1, POLR1D and POLR1C. In 1 patient with an unequivocal clinical diagnosis of TCS, we identified a 3.367 kb deletion. This deletion abolishes exon 3 and is the first described single exon deletion within TCOF1. On RNA level we observed loss of this exon which supposedly leads to haploinsufficiency of TREACLE, the nucleolar phosphoprotein encoded by TCOF1. PMID:22712005
First Report of a Single Exon Deletion in TCOF1 Causing Treacher Collins Syndrome.
Beygo, J; Buiting, K; Seland, S; Lüdecke, H-J; Hehr, U; Lich, C; Prager, B; Lohmann, D R; Wieczorek, D
2012-01-01
Treacher Collins syndrome (TCS) is a rare craniofacial disorder characterized by facial anomalies and ear defects. TCS is caused by mutations in the TCOF1 gene and follows autosomal dominant inheritance. Recently, mutations in the POLR1D and POLR1C genes have also been identified to cause TCS. However, in a subset of patients no causative mutation could be found yet. Inter- and intrafamilial phenotypic variability is high as is the variety of mainly family-specific mutations identified throughout TCOF1. No obvious correlation between pheno- and genotype could be observed. The majority of described point mutations, small insertions and deletions comprising only a few nucleotides within TCOF1 lead to a premature termination codon. We investigated a cohort of 112 patients with a tentative clinical diagnosis of TCS by multiplex ligation-dependent probe amplification (MLPA) to search for larger deletions not detectable with other methods used. All patients were selected after negative screening for mutations in TCOF1, POLR1D and POLR1C. In 1 patient with an unequivocal clinical diagnosis of TCS, we identified a 3.367 kb deletion. This deletion abolishes exon 3 and is the first described single exon deletion within TCOF1. On RNA level we observed loss of this exon which supposedly leads to haploinsufficiency of TREACLE, the nucleolar phosphoprotein encoded by TCOF1.
Tejedor, J. Ramón; Tilgner, Hagen; Iannone, Camilla; Guigó, Roderic; Valcárcel, Juan
2015-01-01
The OLR1 gene encodes the oxidized low-density lipoprotein receptor (LOX-1), which is responsible for the cellular uptake of oxidized LDL (Ox-LDL), foam cell formation in atheroma plaques and atherosclerotic plaque rupture. Alternative splicing (AS) of OLR1 exon 5 generates two protein isoforms with antagonistic functions in Ox-LDL uptake. Previous work identified six single nucleotide polymorphisms (SNPs) in linkage disequilibrium that influence the inclusion levels of OLR1 exon 5 and correlate with the risk of cardiovascular disease. Here we use minigenes to recapitulate the effects of two allelic series (Low- and High-Risk) on OLR1 AS and identify one SNP in intron 4 (rs3736234) as the main contributor to the differences in exon 5 inclusion, while the other SNPs in the allelic series attenuate the drastic effects of this key SNP. Bioinformatic, proteomic, mutational and functional high-throughput analyses allowed us to define regulatory sequence motifs and identify SR protein family members (SRSF1, SRSF2) and HMGA1 as factors involved in the regulation of OLR1 AS. Our results suggest that antagonism between SRSF1 and SRSF2/HMGA1, and differential recognition of their regulatory motifs depending on the identity of the rs3736234 polymorphism, influence OLR1 exon 5 inclusion and the efficiency of Ox-LDL uptake, with potential implications for atherosclerosis and coronary disease. PMID:25904137
Meyer, S; Ipek, M; Keth, A; Minnemann, T; von Mach, M A; Weise, A; Ittner, J R; Nawroth, P P; Plöckinger, U; Stalla, G K; Tuschy, U; Weber, M M; Kann, P H
2007-08-01
Genetic factors play an expanding role in understanding growth hormone (GH) disorders, therefore the German KIMS Pharmacogenetics Study was initiated with the aim of genotyping various GH-/IGF-I-axis-related genes of GH-deficient adult patients to investigate genotype:phenotype relationships and response to GH therapy. 129 consecutively enrolled GH-deficient adult patients were genotyped for variant 1 (V1) of the alternatively spliced noncoding exons in the 5'-untranslated region and for the nine coding exons of the GH receptor (GHR) gene, which obviously play a striking role in the function of the GH-IGF-I-axis. After detection of a heterozygous, non-synonymous mutation R179C in exon 6 in one single patient with acquired GH-deficiency (GHD) in late adulthood, analysis of her clinical data followed, leading to the diagnosis of mild short stature (-1.5SD). For further endocrine evaluation, five pituitary stimulation tests (arginine) of this patient were statistically compared to stimulation tests (arginine) of ten GH-deficient control patients, retrospectively. The formerly in patients with Laron syndrome and idiopathic short stature reported mutation R179C leads to an amino acid change from an arginine residue (codon CGC) to a cysteine residue (codon TGC) in position 179 of the extracellular domain of the GHR. Statistical analysis revealed significant decreased IGF-I/GH(0) ratio (p=0.004) and IGF-I/GH(max) ratio (p=0.001) of the index patient compared to the control patients, implying growth hormone resistance of the index patient at the level of the GHR, according to the detected R179C mutation. This study reports on the unusual case of a patient with mild short stature, who acquired GHD in late adulthood due to a non-secreting pituitary adenoma and get additionally diagnosed for pre-existing growth hormone insensitivity due to a formerly in two short statured patients described, single, heterozygous, non-synonymous mutation in the GHR. Our findings support the theory that heterozygous mutations in the GHR gene can have mild phenotypical consequences.
Ni, Lianghong; Zhao, Zhili; Xu, Hongxi; Chen, Shilin; Dorje, Gaawe
2016-02-15
Endemic to the Sino-Himalayan subregion, the medicinal alpine plant Gentiana straminea is a threatened species. The genetic and molecular data about it is deficient. Here we report the complete chloroplast (cp) genome sequence of G. straminea, as the first sequenced member of the family Gentianaceae. The cp genome is 148,991bp in length, including a large single copy (LSC) region of 81,240bp, a small single copy (SSC) region of 17,085bp and a pair of inverted repeats (IRs) of 25,333bp. It contains 112 unique genes, including 78 protein-coding genes, 30 tRNAs and 4 rRNAs. The rps16 gene lacks exon2 between trnK-UUU and trnQ-UUG, which is the first rps16 pseudogene found in the nonparasitic plants of Asterids clade. Sequence analysis revealed the presence of 13 forward repeats, 13 palindrome repeats and 39 simple sequence repeats (SSRs). An entire cp genome comparison study of G. straminea and four other species in Gentianales was carried out. Phylogenetic analyses using maximum likelihood (ML) and maximum parsimony (MP) were performed based on 69 protein-coding genes from 36 species of Asterids. The results strongly supported the position of Gentianaceae as one member of the order Gentianales. The complete chloroplast genome sequence will provide intragenic information for its conservation and contribute to research on the genetic and phylogenetic analyses of Gentianales and Asterids. Copyright © 2015 Elsevier B.V. All rights reserved.
Applications of statistical physics and information theory to the analysis of DNA sequences
NASA Astrophysics Data System (ADS)
Grosse, Ivo
2000-10-01
DNA carries the genetic information of most living organisms, and the of genome projects is to uncover that genetic information. One basic task in the analysis of DNA sequences is the recognition of protein coding genes. Powerful computer programs for gene recognition have been developed, but most of them are based on statistical patterns that vary from species to species. In this thesis I address the question if there exist universal statistical patterns that are different in coding and noncoding DNA of all living species, regardless of their phylogenetic origin. In search for such species-independent patterns I study the mutual information function of genomic DNA sequences, and find that it shows persistent period-three oscillations. To understand the biological origin of the observed period-three oscillations, I compare the mutual information function of genomic DNA sequences to the mutual information function of stochastic model sequences. I find that the pseudo-exon model is able to reproduce the mutual information function of genomic DNA sequences. Moreover, I find that a generalization of the pseudo-exon model can connect the existence and the functional form of long-range correlations to the presence and the length distributions of coding and noncoding regions. Based on these theoretical studies I am able to find an information-theoretical quantity, the average mutual information (AMI), whose probability distributions are significantly different in coding and noncoding DNA, while they are almost identical in all studied species. These findings show that there exist universal statistical patterns that are different in coding and noncoding DNA of all studied species, and they suggest that the AMI may be used to identify genes in different living species, irrespective of their taxonomic origin.
Zhang, Genxi; Ding, Fuxiang; Wang, Jinyu; Dai, Guojun; Xie, Kaizhou; Zhang, Lijun; Wang, Wei; Zhou, Shenghua
2011-02-01
In our research, single nucleotide polymorphisms (SNPs) of exon regions of the myostatin gene were detected by PCR-SSCP in the Bian chicken and three reference chicken populations (Jinghai, Youxi, and Arbor Acre). Four novel SNPs (G2283A, C7552T, C7638T, and T7661A) were detected. The findings from the least square means showed that Bian chickens with EE and DE genotypes had significantly higher body weight, at 6-18 weeks of age, than those of the DD genotype (P < 0.05). The results suggest that the mutation G2283A, detected in exon 1, has potential as a genetic marker for body weight traits in the Bian chicken.
Frawley, Thomas; O'Brien, Cathal P; Conneally, Eibhlin; Vandenberghe, Elisabeth; Percy, Melanie; Langabeer, Stephen E; Haslam, Karl
2018-02-01
The classical Philadelphia chromosome-negative myeloproliferative neoplasms (MPNs), consisting of polycythemia vera, essential thrombocythemia, and primary myelofibrosis, are a heterogeneous group of neoplasms that harbor driver mutations in the JAK2, CALR, and MPL genes. The detection of mutations in these genes has been incorporated into the recent World Health Organization (WHO) diagnostic criteria for MPN. Given a pressing clinical need to screen for mutations in these genes in a routine diagnostic setting, a targeted next-generation sequencing (NGS) assay for the detection of MPN-associated mutations located in JAK2 exon 14, JAK2 exon 12, CALR exon 9, and MPL exon 10 was developed to provide a single platform alternative to reflexive, stepwise diagnostic algorithms. Polymerase chain reaction (PCR) primers were designed to target mutation hotspots in JAK2 exon 14, JAK2 exon 12, MPL exon 10, and CALR exon 9. Multiplexed PCR conditions were optimized by using qualitative PCR followed by NGS. Diagnostic genomic DNA from 35 MPN patients, known to harbor driver mutations in one of the target genes, was used to validate the assay. One hundred percent concordance was observed between the previously-identified mutations and those detected by NGS, with no false positives, nor any known mutations missed (specificity = 100%, CI = 0.96, sensitivity = 100%, CI = 0.89). Improved resolution of mutation sequences was also revealed by NGS analysis. Detection of diagnostically relevant driver mutations of MPN is enhanced by employing a targeted multiplex NGS approach. This assay presents a robust solution to classical MPN mutation screening, providing an alternative to time-consuming sequential analyses.
Lee, Su-Jun; Usmani, Khawja A; Chanas, Brian; Ghanayem, Burhan; Xi, Tina; Hodgson, Ernest; Mohrenweiser, Harvey W; Goldstein, Joyce A
2003-08-01
Genetic polymorphisms of cytochromes P450 (CYPs) are a principal reason for inter-individual variations in the metabolism of therapeutic drugs and environmental chemicals in humans. The present study identifies 34 single nucleotide polymorphisms (SNPs) of CYP3A5 including 27 previously unidentified SNPs by direct sequencing of the exons, intron-exon junctions and 5'-upstream region of CYP3A5 from 92 racially diverse individuals (24 Caucasians, 24 Africans, 24 Asians, and 20 individuals of unknown racial origin). Four new CYP3A5 SNPs produced coding changes: R28C, L82R, A337T, and F446S. CYP3A5 R28C occurred in African populations (allelic frequency of 4%). CYP3A5 A337T occurred in Asians (2% allelic frequency), CYP3A5 L82R (occurred in the racially unidentified group) and CYP3A5 F446S (identified in Caucasians with a 2% allelic frequency) were on an allele containing the splice change g.6986A>G known as CYP3A5*3. The newly identified allelic proteins were constructed by site-directed mutagenesis, expressed in Escherichia coli and purified. CYP3A5 L82R was expressed only as denatured CYP420, suggesting it may be unstable. CYP3A5*1 exhibited the highest maximal clearance for testosterone followed by CYP3A5 A337T > CYP3A5 R28C > CYP3A5 F446S. CYP3A5*1 exhibited a higher V(max) for nifedipine oxidation than CYP3A5 A337T > CYP3A5 R28C > CYP3A5 F446S. CYP3A5 A337T and CYP3A5 R28C exhibited a 42-64% lower V(max) for nifedipine oxidation than CYP3A5*1. CYP3A5 F446S exhibited a > 95% decrease in the intrinsic clearance for both 6beta-hydroxytestosterone and nifedipine oxidation. This study identifies four new potentially defective coding alleles. CYP3A5 F446S is predicted to be more catalytically defective than the splice change alone.
Machiavelli, Gloria A; Caputo, Mariela; Rivolta, Carina M; Olcese, María C; Gruñeiro-Papendieck, Laura; Chiesa, Ana; González-Sarmiento, Rogelio; Targovnik, Héctor M
2010-01-01
Thyroglobulin (TG) deficiency is an autosomal-recessive disorder that results in thyroid dyshormonogenesis. A number of distinct mutations have been identified as causing human hypothyroid goitre. The purpose of this study was to identify and characterize new mutations in the TG gene in an attempt to increase the understanding of the genetic mechanism responsible for this disorder. A total of six patients from four nonconsanguineous families with marked impairment of TG synthesis were studied. Single-strand conformation polymorphism (SSCP) analysis, sequencing of DNA, genotyping, expression of chimeric minigenes and bioinformatic analysis were performed. Four different inactivating TG mutations were identified: one novel mutation (c.7006C>T [p.R2317X]) and three previously reported (c.886C>T [p.R277X], c.6701C>A [p.A2215D] and c.6725G>A [p.R2223H]). Consequently, one patient carried a compound heterozygous for p.R2223H/p.R2317X mutations; two brothers showed a homozygous p.A2215D substitution and the remaining three patients, from two families with typical phenotype, had a single p.R277X mutated allele. We also showed functional evidences that premature stop codons inserted at different positions in exon 7, which disrupt exonic splicing enhancer (ESE) sequences, do not interfere with exon definition and processing. In this study, we have identified a novel nonsense mutation p.R2317X in the acetylcholinesterase homology domain of TG. We have also observed that nonsense mutations do not interfere with the pre-mRNA splicing of exon 7. The results are in accordance with previous observations confirming the genetic heterogeneity of TG defects.
Hochbach, Anne; Schneider, Julia; Röser, Martin
2015-06-01
To investigate phylogenetic relationships within the grass subfamily Pooideae we studied about 50 taxa covering all recognized tribes, using one plastid DNA (cpDNA) marker (matK gene-3'trnK exon) and for the first time four nuclear single copy gene loci. DNA sequence information from two parts of the nuclear genes topoisomerase 6 (Topo6) spanning the exons 8-13 and 17-19, the exons 9-13 encoding plastid acetyl-CoA-carboxylase (Acc1) and the partial exon 1 of phytochrome B (PhyB) were generated. Individual and nuclear combined data were evaluated using maximum parsimony, maximum likelihood and Bayesian methods. All of the phylogenetic results show Brachyelytrum and the tribe Nardeae as earliest diverging lineages within the subfamily. The 'core' Pooideae (Hordeeae and the Aveneae/Poeae tribe complex) are also strongly supported, as well as the monophyly of the tribes Brachypodieae, Meliceae and Stipeae (except PhyB). The beak grass tribe Diarrheneae and the tribe Duthieeae are not monophyletic in some of the analyses. However, the combined nuclear DNA (nDNA) tree yields the highest resolution and the best delimitation of the tribes, and provides the following evolutionary hypothesis for the tribes: Brachyelytrum, Nardeae, Duthieeae, Meliceae, Stipeae, Diarrheneae, Brachypodieae and the 'core' Pooideae. Within the individual datasets, the phylogenetic trees obtained from Topo6 exon 8-13 shows the most interesting results. The divergent positions of some clone sequences of Ampelodesmos mauritanicus and Trikeraia pappiformis, for instance, may indicate a hybrid origin of these stipoid taxa. Copyright © 2015 Elsevier Inc. All rights reserved.
Characterization of a splicing mutation in group A xeroderma pigmentosum
DOE Office of Scientific and Technical Information (OSTI.GOV)
Satokata, Ichiro; Tanaka, Kiyoji; Miura, Naoyuki
1990-12-01
The molecular basis of group A xeroderma pigmentosum (WP) was investigated by comparison of the nucleotide sequences of multiple clones of the XP group A complementing gene (XPAC) from a patient with group A XP with that of a normal gene. The clones showed a G {r arrow} C substitution at the 3{prime} splice acceptor site of intron 3, which altered the obligatory AG acceptor dinucleotide to AC. Nucleotide sequencing of cDNAs amplified by the polymerase chain reaction revealed that this single base substitution abolishes the canonical 3{prime} splice site, thus creating two abnormally spliced mRNA forms. The larger formmore » is identical with normal mRNA except for a dinucleotide deletion at the 5{prime} end of exon 4. This deletion results in a frameshift with premature translation termination in exon 4. The smaller form has a deletion of the entire exon 3 and the dinucleotide at the 5{prime} end of exon 4. The result of a transfection study provided additional evidence that this single base substitution is the disease-causing mutation. This single base substitution creates a new cleavage site for the restriction nuclease AlwNI. Analysis of AlwNI restriction fragment length polymorphism showed a high frequency of this mutation in Japanese patients with group A XP: 16 of 21 unrelated Japanese patients were homozygous and 4 were heterozygous for this mutation. However, 11 Caucasians and 2 Blacks with group A XP did not have this mutant allele. The polymorphic AlwNI restriction fragments are concluded to be useful for diagnosis of group A XP in Japanese subjects, including prenatal cases and carriers.« less
Lee, Tai-Sung; Ma, Wanlong; Zhang, Xi; Kantarjian, Hagop; Albitar, Maher
2009-01-01
Background The functional relevance of many of the recently detected JAK2 mutations, except V617F and exon 12 mutants, in patients with chronic myeloproliferative neoplasia (MPN) has been significantly overlooked. To explore atomic-level explanations of the possible mutational effects from those overlooked mutants, we performed a set of molecular dynamics simulations on clinically observed mutants, including newly discovered mutations (K539L, R564L, L579F, H587N, S591L, H606Q, V617I, V617F, C618R, L624P, whole exon 14-deletion) and control mutants (V617C, V617Y, K603Q/N667K). Results Simulation results are consistent with all currently available clinical/experimental evidence. The simulation-derived putative interface, not possibly obtained from static models, between the kinase (JH1) and pseudokinase (JH2) domains of JAK2 provides a platform able to explain the mutational effect for all mutants, including presumably benign control mutants, at the atomic level. Conclusion The results and analysis provide structural bases for mutational mechanisms of JAK2, may advance the understanding of JAK2 auto-regulation, and have the potential to lead to therapeutic approaches. Together with recent mutation profiling results demonstrating the breadth of clinically observed JAK2 mutations, our findings suggest that molecular testing/diagnostics of JAK2 should extend beyond V617F and exon 12 mutations, and perhaps should encompass most of the pseudo-kinase domain-coding region. PMID:19744331
Chang, Cheng; Shen, Wen-Kai; Wang, Tzu-Ting; Lin, Ying-Hsi; Hsu, Err-Lieh; Dai, Shu-Mei
2009-04-01
To identify pertinent mutations associated with knockdown resistance to permethrin, the entire coding sequence of the voltage-gated sodium channel gene Aa-para was sequenced and analyzed from a Per-R strain with 190-fold resistance to permethrin and two susceptible strains of Aedes aegypti. The longest transcript, a 6441bp open reading frame, encodes 2147 amino acid residues with an estimated molecular mass of 241kDa. A total of 33 exons were found in the Aa-para gene over 293kb of genomic DNA. Three previously unreported optional exons were identified. The first two exons, m and n, were located within the intracellular domain I/II, and the third, f', was found within the II/III linkers. The two mutually exclusive exons, d and l, were the only alternative exons in all the cDNA clones sequenced in this study. The most distinct finding was a novel amino acid substitution mutation, D1794Y, located within the extracellular linker between IVS5 and IVS6, which is concurrent with the known V1023G mutation in Aa-para of the Per-R strain. The high frequency and coexistence of the two mutations in the Per-R strain suggest that they might exert a synergistic effect to provide the knockdown resistance to permethrin. Furthermore, both cDNA and genomic DNA data from the same individual mosquitoes have demonstrated that RNA editing was not involved in amino acid substitutions of the Per-R strain.
Molecular defects leading to human complement component C6 deficiency in an African-American family
Zhu, Z-B; Totemchokchyakarn, K; Atkinson, T P; Volanakis, J E
1998-01-01
Complement component C6 deficiency (C6D) was diagnosed in a 16-year-old African-American male with meningococcal meningitis. The patient's father and two brothers also had C6D, but gave no history of meningitis or other neisserial infection. By using exon-specific polymerase chain reaction (PCR)/single-strand conformation polymorphism as a screening step and nucleotide sequencing of target exons, we determined that the proband was a compound heterozygote for two C6 gene mutations. The first, 1195delC located in exon 7, is a novel mutation, while the second, 1936delG in exon 12, has been described before to cause C6D in an unrelated African-American individual. Both mutations result in premature termination codons and C6 null alleles. Allele-specific PCR indicated that the proband's two brothers also inherited the 1195delC mutation from their heterozygous mother and the 1936delG mutation from their homozygous father. PMID:9472666
2012-01-01
Background It is known from recent studies that more than 90% of human multi-exon genes are subject to Alternative Splicing (AS), a key molecular mechanism in which multiple transcripts may be generated from a single gene. It is widely recognized that a breakdown in AS mechanisms plays an important role in cellular differentiation and pathologies. Polymerase Chain Reactions, microarrays and sequencing technologies have been applied to the study of transcript diversity arising from alternative expression. Last generation Affymetrix GeneChip Human Exon 1.0 ST Arrays offer a more detailed view of the gene expression profile providing information on the AS patterns. The exon array technology, with more than five million data points, can detect approximately one million exons, and it allows performing analyses at both gene and exon level. In this paper we describe BEAT, an integrated user-friendly bioinformatics framework to store, analyze and visualize exon arrays datasets. It combines a data warehouse approach with some rigorous statistical methods for assessing the AS of genes involved in diseases. Meta statistics are proposed as a novel approach to explore the analysis results. BEAT is available at http://beat.ba.itb.cnr.it. Results BEAT is a web tool which allows uploading and analyzing exon array datasets using standard statistical methods and an easy-to-use graphical web front-end. BEAT has been tested on a dataset with 173 samples and tuned using new datasets of exon array experiments from 28 colorectal cancer and 26 renal cell cancer samples produced at the Medical Genetics Unit of IRCCS Casa Sollievo della Sofferenza. To highlight all possible AS events, alternative names, accession Ids, Gene Ontology terms and biochemical pathways annotations are integrated with exon and gene level expression plots. The user can customize the results choosing custom thresholds for the statistical parameters and exploiting the available clinical data of the samples for a multivariate AS analysis. Conclusions Despite exon array chips being widely used for transcriptomics studies, there is a lack of analysis tools offering advanced statistical features and requiring no programming knowledge. BEAT provides a user-friendly platform for a comprehensive study of AS events in human diseases, displaying the analysis results with easily interpretable and interactive tables and graphics. PMID:22536968
Mutation analysis in the long isoform of USH2A in American patients with Usher Syndrome type II.
Yan, Denise; Ouyang, Xiaomei; Patterson, D Michael; Du, Li Lin; Jacobson, Samuel G; Liu, Xue-Zhong
2009-12-01
Usher syndrome type II (USH2) is an autosomal recessive disorder characterized by moderate to severe hearing impairment and progressive visual loss due to retinitis pigmentosa (RP). To identify novel mutations and determine the frequency of USH2A mutations as a cause of USH2, we have carried out mutation screening of all 72 coding exons and exon-intron splice sites of the USH2A gene. A total of 20 USH2 American probands of European descent were analyzed using single strand conformational polymorphism (SSCP) and direct sequencing methods. Ten different USH2A mutations were identified in 55% of the probands, five of which were novel mutations. The detected mutations include three missense, three frameshifts and four nonsense mutations, with c.2299delG/p.E767fs mutation, accounting for 38.9% of the pathological alleles. Two cases were homozygotes, two cases were compound heterozygotes and one case had complex allele with three variants. In seven probands, only one USH2A mutation was detected and no pathological mutation was found in the remaining eight individuals. Altogether, our data support the fact that c.2299delG/p.E767fs is indeed the most common USH2A mutation found in USH2 patients of European Caucasian background. Thus, if screening for mutations in USH2A is considered, it is reasonable to screen for the c.2299delG mutation first.
Investigation of the role of TCF4 rare sequence variants in schizophrenia.
Basmanav, F Buket; Forstner, Andreas J; Fier, Heide; Herms, Stefan; Meier, Sandra; Degenhardt, Franziska; Hoffmann, Per; Barth, Sandra; Fricker, Nadine; Strohmaier, Jana; Witt, Stephanie H; Ludwig, Michael; Schmael, Christine; Moebus, Susanne; Maier, Wolfgang; Mössner, Rainald; Rujescu, Dan; Rietschel, Marcella; Lange, Christoph; Nöthen, Markus M; Cichon, Sven
2015-07-01
Transcription factor 4 (TCF4) is one of the most robust of all reported schizophrenia risk loci and is supported by several genetic and functional lines of evidence. While numerous studies have implicated common genetic variation at TCF4 in schizophrenia risk, the role of rare, small-sized variants at this locus-such as single nucleotide variants and short indels which are below the resolution of chip-based arrays requires further exploration. The aim of the present study was to investigate the association between rare TCF4 sequence variants and schizophrenia. Exon-targeted resequencing was performed in 190 German schizophrenia patients. Six rare variants at the coding exons and flanking sequences of the TCF4 gene were identified, including two missense variants and one splice site variant. These six variants were then pooled with nine additional rare variants identified in 379 European participants of the 1000 Genomes Project, and all 15 variants were genotyped in an independent German sample (n = 1,808 patients; n = 2,261 controls). These data were then analyzed using six statistical methods developed for the association analysis of rare variants. No significant association (P < 0.05) was found. However, the results from our association and power analyses suggest that further research into the possible involvement of rare TCF4 sequence variants in schizophrenia risk is warranted by the assessment of larger cohorts with higher statistical power to identify rare variant associations. © 2015 Wiley Periodicals, Inc.
Soheili, Fariborz; Jalili, Zahra; Rahbar, Mahtab; Khatooni, Zahed; Mashayekhi, Amir; Jafari, Hossein
2018-03-01
The mutations in GATA4 gene induce inherited atrial and ventricular septation defects, which is the most frequent forms of congenital heart defects (CHDs) constituting about half of all cases. We have performed High resolution melting (HRM) mutation scanning of GATA4 coding exons of nonsyndrome 100 patients as a case group including 39 atrial septal defects (ASD), 57 ventricular septal defects (VSD) and four patients with both above defects and 50 healthy individuals as a control group. Our samples are categorized according to their HRM graph. The genome sequencing has been done for 15 control samples and 25 samples of patients whose HRM analysis were similar to healthy subjects for each exon. The PolyPhen-2 and MUpro have been used to determine the causative possibility and structural stability prediction of GATA4 sequence variation. The HRM curve analysis exhibit that 21 patients and 3 normal samples have deviated curves for GATA4 coding exons. Sequencing analysis has revealed 12 nonsynonymous mutations while all of them resulted in stability structure of protein 10 of them are pathogenic and 2 of them are benign. Also we found two nucleotide deletions which one of them was novel and one new indel mutation resulting in frame shift mutation, and 4 synonymous variations or polymorphism in 6 of patients and 3 of normal individuals. Six or about 50% of these nonsynonymous mutations have not been previously reported. Our results show that there is a spectrum of GATA4 mutations resulting in septal defects. © 2018 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mercier, B.; Audrezet, M.P.; Guillermit, H.
Cystic fibrosis transmembrane conductance regulator (CFTR), the gene responsible, when mutated, for cystic fibrosis (CF), spans over 230 kb on the long arm of chromosome 7 and is composed of 27 exons. The most common mutation responsible for CF worldwide is the deletion of a phenylalanine amino acid at codon 508 in the first nucleotide-binding fold and accounts for approximately 70% of CF chromosomes studied. More than 250 other mutations have been reported through the CF Genetic Analysis Consortium. The majority of the mutations previously described lie in the two nucleotide-binding folds. To explore exhaustively other regions of the gene,more » particularly exons coding for transmembrane domains, the authors have initiated a collaborative study between different laboratories to screen 369 non-[Delta]F508 CF chromosomes of seven ethnic European populations (Belgian, French, Breton, Irish, Italian, Yugoslavian, Russian). Among these chromosomes carrying an unidentified mutation, 63 were from Brittany, 50 of various French origin, 45 of Irish origin, 56 of Italian origin, 41 of Belgian origin, 2 of Turkish origin, 38 of Yugoslavian origin, 22 of Russian origin, and 52 of Bulgarian origin. Diagnostic criteria for CF included at least one positive sweat test and pulmonary disease with or without pancreatic disease. Using a denaturing gradient gel electrophoresis (DGGE) assay, they have identified eight novel mutations in exon 17b coding for part of the second transmembrane domain of the CFTR and they describe them in this report. 8 refs., 1 fig., 1 tab.« less
Castrignanò, Tiziana; Canali, Alessandro; Grillo, Giorgio; Liuni, Sabino; Mignone, Flavio; Pesole, Graziano
2004-01-01
The identification and characterization of genome tracts that are highly conserved across species during evolution may contribute significantly to the functional annotation of whole-genome sequences. Indeed, such sequences are likely to correspond to known or unknown coding exons or regulatory motifs. Here, we present a web server implementing a previously developed algorithm that, by comparing user-submitted genome sequences, is able to identify statistically significant conserved blocks and assess their coding or noncoding nature through the measure of a coding potential score. The web tool, available at http://www.caspur.it/CSTminer/, is dynamically interconnected with the Ensembl genome resources and produces a graphical output showing a map of detected conserved sequences and annotated gene features. PMID:15215464
CTCF, a Novel Regulator of Alternative Splicing | Center for Cancer Research
Alternative splicing, or the inclusion of different patterns of exons from the same gene, plays an important role in expanding the coding possibilities of a limited genome. The immune system is an ideal system to study this since alternative splicing is used to generate an almost unlimited number of antibodies against any pathogen we might encounter.
Origin of sphinx, a young chimeric RNA gene in Drosophila melanogaster
Wang, Wen; Brunet, Frédéric G.; Nevo, Eviatar; Long, Manyuan
2002-01-01
Non-protein-coding RNA genes play an important role in various biological processes. How new RNA genes originated and whether this process is controlled by similar evolutionary mechanisms for the origin of protein-coding genes remains unclear. A young chimeric RNA gene that we term sphinx (spx) provides the first insight into the early stage of evolution of RNA genes. spx originated as an insertion of a retroposed sequence of the ATP synthase chain F gene at the cytological region 60DB since the divergence of Drosophila melanogaster from its sibling species 2–3 million years ago. This retrosequence, which is located at 102F on the fourth chromosome, recruited a nearby exon and intron, thereby evolving a chimeric gene structure. This molecular process suggests that the mechanism of exon shuffling, which can generate protein-coding genes, also plays a role in the origin of RNA genes. The subsequent evolutionary process of spx has been associated with a high nucleotide substitution rate, possibly driven by a continuous positive Darwinian selection for a novel function, as is shown in its sex- and development-specific alternative splicing. To test whether spx has adapted to different environments, we investigated its population genetic structure in the unique “Evolution Canyon” in Israel, revealing a similar haplotype structure in spx, and thus similar evolutionary forces operating on spx between environments. PMID:11904380
Rozhdestvensky, Timofey S; Robeck, Thomas; Galiveti, Chenna R; Raabe, Carsten A; Seeger, Birte; Wolters, Anna; Gubar, Leonid V; Brosius, Jürgen; Skryabin, Boris V
2016-02-05
Prader-Willi syndrome (PWS) is a neurogenetic disorder caused by loss of paternally expressed genes on chromosome 15q11-q13. The PWS-critical region (PWScr) contains an array of non-protein coding IPW-A exons hosting intronic SNORD116 snoRNA genes. Deletion of PWScr is associated with PWS in humans and growth retardation in mice exhibiting ~15% postnatal lethality in C57BL/6 background. Here we analysed a knock-in mouse containing a 5'HPRT-LoxP-Neo(R) cassette (5'LoxP) inserted upstream of the PWScr. When the insertion was inherited maternally in a paternal PWScr-deletion mouse model (PWScr(p-/m5'LoxP)), we observed compensation of growth retardation and postnatal lethality. Genomic methylation pattern and expression of protein-coding genes remained unaltered at the PWS-locus of PWScr(p-/m5'LoxP) mice. Interestingly, ubiquitous Snord116 and IPW-A exon transcription from the originally silent maternal chromosome was detected. In situ hybridization indicated that PWScr(p-/m5'LoxP) mice expressed Snord116 in brain areas similar to wild type animals. Our results suggest that the lack of PWScr RNA expression in certain brain areas could be a primary cause of the growth retardation phenotype in mice. We propose that activation of disease-associated genes on imprinted regions could lead to general therapeutic strategies in associated diseases.
Cost-effective sequencing of full-length cDNA clones powered by a de novo-reference hybrid assembly.
Kuroshu, Reginaldo M; Watanabe, Junichi; Sugano, Sumio; Morishita, Shinichi; Suzuki, Yutaka; Kasahara, Masahiro
2010-05-07
Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. We developed a program, MuSICA 2, that assembles millions of short (36-nucleotide) reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence approximately 800 human full-length cDNA clones. MuSICA 2 performs a hybrid assembly in which an external de novo assembler is run first and the result is then improved by reference alignment of shotgun reads. We compared the MuSICA 2 assembly with 200 pooled full-length cDNA clones finished independently by the conventional primer-walking using Sanger sequencers. The exon-intron structure of the coding sequence was correct for more than 95% of the clones with coding sequence annotation when we excluded cDNA clones insufficiently represented in the shotgun library due to PCR failure (42 out of 200 clones excluded), and the nucleotide-level accuracy of coding sequences of those correct clones was over 99.99%. We also applied MuSICA 2 to full-length cDNA clones from Toxoplasma gondii, to confirm that its ability was competent even for non-human species. The entire sequencing and shotgun assembly takes less than 1 week and the consumables cost only approximately US$3 per clone, demonstrating a significant advantage over previous approaches.
Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing
Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng
2017-01-01
ABSTRACT Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure. PMID:28277933
Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing.
Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng
2017-10-03
Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure.
Liskova, Petra; Tuft, Stephen J.; Gwilliam, Rhian; Ebenezer, Neil D.; Jirsova, Katerina; Prescott, Quincy; Martincova, Radka; Pretorius, Marike; Sinclair, Neil; Boase, David L.; Jeffrey, Margaret J.; Deloukas, Panos; Hardcastle, Alison J.; Filipec, Martin; Bhattacharya, Shomi S.
2009-01-01
We describe the search for mutations in six unrelated Czech and four unrelated British families with posterior polymorphous corneal dystrophy (PPCD); a relatively rare eye disorder. Coding exons and intron/exon boundaries of all three genes (VSX1, COL8A2, and ZEB1/TCF8) previously reported to be implicated in the pathogenesis of this disorder were screened by DNA sequencing. Four novel pathogenic mutations were identified in four families; two deletions, one nonsense, and one duplication within exon 7 in the ZEB1 gene located at 10p11.2. We also genotyped the Czech patients to test for a founder haplotype and lack of disease segregation with the 20p11.2 locus we previously described. Although a systematic clinical examination was not performed, our investigation does not support an association between ZEB1 changes and self reported non-ocular anomalies. In the remaining six families no disease causing mutations were identified thereby indicating that as yet unidentified gene(s) are likely to be responsible for PPCD. PMID:17437275
Exon Shuffling and Origin of Scorpion Venom Biodiversity
Wang, Xueli; Gao, Bin; Zhu, Shunyi
2016-01-01
Scorpion venom is a complex combinatorial library of peptides and proteins with multiple biological functions. A combination of transcriptomic and proteomic techniques has revealed its enormous molecular diversity, as identified by the presence of a large number of ion channel-targeted neurotoxins with different folds, membrane-active antimicrobial peptides, proteases, and protease inhibitors. Although the biodiversity of scorpion venom has long been known, how it arises remains unsolved. In this work, we analyzed the exon-intron structures of an array of scorpion venom protein-encoding genes and unexpectedly found that nearly all of these genes possess a phase-1 intron (one intron located between the first and second nucleotides of a codon) near the cleavage site of a signal sequence despite their mature peptides remarkably differ. This observation matches a theory of exon shuffling in the origin of new genes and suggests that recruitment of different folds into scorpion venom might be achieved via shuffling between body protein-coding genes and ancestral venom gland-specific genes that presumably contributed tissue-specific regulatory elements and secretory signal sequences. PMID:28035955
Exon Shuffling and Origin of Scorpion Venom Biodiversity.
Wang, Xueli; Gao, Bin; Zhu, Shunyi
2016-12-26
Scorpion venom is a complex combinatorial library of peptides and proteins with multiple biological functions. A combination of transcriptomic and proteomic techniques has revealed its enormous molecular diversity, as identified by the presence of a large number of ion channel-targeted neurotoxins with different folds, membrane-active antimicrobial peptides, proteases, and protease inhibitors. Although the biodiversity of scorpion venom has long been known, how it arises remains unsolved. In this work, we analyzed the exon-intron structures of an array of scorpion venom protein-encoding genes and unexpectedly found that nearly all of these genes possess a phase-1 intron (one intron located between the first and second nucleotides of a codon) near the cleavage site of a signal sequence despite their mature peptides remarkably differ. This observation matches a theory of exon shuffling in the origin of new genes and suggests that recruitment of different folds into scorpion venom might be achieved via shuffling between body protein-coding genes and ancestral venom gland-specific genes that presumably contributed tissue-specific regulatory elements and secretory signal sequences.
Chang, Ya-Sian; Lin, Chien-Yu; Yang, Shu-Fen; Ho, Cheng Mao; Chang, Jan-Gowth
2016-02-01
There have been many different mutations reported for the large adenomatous polyposis coli (APC) tumor suppressor gene. APC mutations result in inactivation of APC tumor suppressor action, allowing the progression of tumorigenesis. The present study utilized a highly efficient method to identify APC mutations and investigated the association between the APC genetic variants Y486Y, A545A, T1493T, and D1822V and susceptibility to oral squamous cell carcinoma (OSCC). High-resolution melting (HRM) analysis was used to characterize APC mutations. Genomic DNA was extracted from 83 patient specimens of OSCC and 50 blood samples from healthy control subjects. The 14 exons and mutation cluster region of exon 15 were screened by HRM analysis. All mutations were confirmed by direct DNA sequencing. Three mutations and 4 single nucleotide polymorphisms (SNPs) were found in this study. The mutations were c.573T>C (Y191Y) in exon 5, c.1005A>G (L335L) in exon 9, and c.1488A>T (T496T) in exon 11. Two SNPs, c.4479G>A (T1493T) and c.5465A>T (D1822V), were located in exon 15, whereas c.1458T>C (Y486Y) and c.1635G>A (A545A) were located in exon 11 and 13, respectively. There was no observed association between OSCC risk and genotype for any of the 4 APC SNPs. The mutation of APC is rare in Taiwanese patients with OSCC. HRM analysis is a reliable, accurate, and fast screening method for APC mutations.
Expression Profiling Smackdown: Human Transcriptome Array HTA 2.0 vs. RNA-Seq
Palermo, Meghann; Driscoll, Heather; Tighe, Scott; Dragon, Julie; Bond, Jeff; Shukla, Arti; Vangala, Mahesh; Vincent, James; Hunter, Tim
2014-01-01
The advent of both microarray and massively parallel sequencing have revolutionized high-throughput analysis of the human transcriptome. Due to limitations in microarray technology, detecting and quantifying coding transcript isoforms, in addition to non-coding transcripts, has been challenging. As a result, RNA-Seq has been the preferred method for characterizing the full human transcriptome, until now. A new high-resolution array from Affymetrix, GeneChip Human Transcriptome Array 2.0 (HTA 2.0), has been designed to interrogate all transcript isoforms in the human transcriptome with >6 million probes targeting coding transcripts, exon-exon splice junctions, and non-coding transcripts. Here we compare expression results from GeneChip HTA 2.0 and RNA-Seq data using identical RNA extractions from three samples each of healthy human mesothelial cells in culture, LP9-C1, and healthy mesothelial cells treated with asbestos, LP9-A1. For GeneChip HTA 2.0 sample preparation, we chose to compare two target preparation methods, NuGEN Ovation Pico WTA V2 with the Encore Biotin Module versus Affymetrix's GeneChip WT PLUS with the WT Terminal Labeling Kit, on identical RNA extractions from both untreated and treated samples. These same RNA extractions were used for the RNA-Seq library preparation. All analyses were performed in Partek Genomics Suite 6.6. Expression profiles for control and asbestos-treated mesothelial cells prepared with NuGEN versus Affymetrix target preparation methods (GeneChip HTA 2.0) are compared to each other as well as to RNA-Seq results.
Mutation Screening of 1,237 Cancer Genes across Six Model Cell Lines of Basal-Like Breast Cancer.
Olsson, Eleonor; Winter, Christof; George, Anthony; Chen, Yilun; Törngren, Therese; Bendahl, Pär-Ola; Borg, Åke; Gruvberger-Saal, Sofia K; Saal, Lao H
2015-01-01
Basal-like breast cancer is an aggressive subtype generally characterized as poor prognosis and lacking the expression of the three most important clinical biomarkers, estrogen receptor, progesterone receptor, and HER2. Cell lines serve as useful model systems to study cancer biology in vitro and in vivo. We performed mutational profiling of six basal-like breast cancer cell lines (HCC38, HCC1143, HCC1187, HCC1395, HCC1954, and HCC1937) and their matched normal lymphocyte DNA using targeted capture and next-generation sequencing of 1,237 cancer-associated genes, including all exons, UTRs and upstream flanking regions. In total, 658 somatic variants were identified, of which 378 were non-silent (average 63 per cell line, range 37-146) and 315 were novel (not present in the Catalogue of Somatic Mutations in Cancer database; COSMIC). 125 novel mutations were confirmed by Sanger sequencing (59 exonic, 48 3'UTR and 10 5'UTR, 1 splicing), with a validation rate of 94% of high confidence variants. Of 36 mutations previously reported for these cell lines but not detected in our exome data, 36% could not be detected by Sanger sequencing. The base replacements C/G>A/T, C/G>G/C, C/G>T/A and A/T>G/C were significantly more frequent in the coding regions compared to the non-coding regions (OR 3.2, 95% CI 2.0-5.3, P<0.0001; OR 4.3, 95% CI 2.9-6.6, P<0.0001; OR 2.4, 95% CI 1.8-3.1, P<0.0001; OR 1.8, 95% CI 1.2-2.7, P = 0.024, respectively). The single nucleotide variants within the context of T[C]T/A[G]A and T[C]A/T[G]A were more frequent in the coding than in the non-coding regions (OR 3.7, 95% CI 2.2-6.1, P<0.0001; OR 3.8, 95% CI 2.0-7.2, P = 0.001, respectively). Copy number estimations were derived from the targeted regions and correlated well to Affymetrix SNP array copy number data (Pearson correlation 0.82 to 0.96 for all compared cell lines; P<0.0001). These mutation calls across 1,237 cancer-associated genes and identification of novel variants will aid in the design and interpretation of biological experiments using these six basal-like breast cancer cell lines.
A Novel Nonsense Mutation in Exon 5 of KIND1 Gene in an Iranian Family with Kindler Syndrome.
Heidari, Mohammad Mehdi; Khatami, Mehri; Kargar, Saeed; Azari, Mojdeh; Hoseinzadeh, Hassan; Fallah, Hamedeh
2016-06-01
Kindler syndrome (KS) is an autosomal recessive skin disease characterized by actual blistering, photosensitivity and a progressive poikiloderma. The disorder results from rare mutations in the KIND1 gene. This gene contains 15 exons and expresses two kindlin-1 isoforms. The aim of this investigation was to analyze mutations in the exons 1 to 15 of KIND1 gene in an Iranian family clinically affected with Kindler syndrome. The mutations analysis of 15 coding exons of KIND1 gene was performed with PCR-SSCP and direct sequencing in 14 subjects from one Iranian family clinically affected with Kindler syndrome. We identified eight new nucleotide changes in KIND1 in this family. These changes were found in g.3892delA, g.3951T>C, g.3962T>G, g.4190G>T, g.7497G>A, g.11076T>C, g.11102C>T and g.13177C>T positions. Among them, the g.13177C>T mutation resulting in the formation of a premature stop codon (Q226X) was detected only in seven affected family individuals as homozygous but was not present in 100 unrelated healthy controls. This study suggests that nonsense mutation may lead to incomplete and non-functional protein products and is pathogenic and has meaningful implications for the diagnosis of patients with Kindler syndrome.
Lenarduzzi, S; Morgutti, M; Crovella, S; Coiana, A; Rosatelli, M C
2014-11-14
Cystic fibrosis (CF) is a common recessive genetic disease caused by mutations in the gene encoding for the cystic fibrosis transmembrane conductance regulator (CFTR) protein. More than 1800 different mutations have been described to date. Here, we report 3 novel mutations in CFTR in 3 Italian CF patients. To detect and identify 36 frequent mutations in Caucasians, we used the INNO-LiPA CFTR19 and INNO-LiPA CFTR17+Tn Update kits (Innogenetics; Ghent, Belgium). Our first analysis did not reveal both of the responsible mutations; thus, direct sequencing of the CFTR gene coding region was performed. The 3 patients were compound heterozygous. In one allele, the F508del (c.1521_1523delCTT, p.PHE508del) mutation in exon 11 was observed in each case. For the second allele, in patient No.1, direct sequencing revealed an 11-base pair deletion (GAGGCGATACT) in exon 14 (c.2236_2246del; pGlu746Alafs*29). In patient No. 2, direct sequencing revealed a nonsense mutation at nucleotide 3892 (c.3892G>T) in exon 24. In patient No. 3, direct sequencing revealed a deletion of cytosine in exon 27 (c.4296delC; p.Asn1432Lysfs*16). These 3 novel mutations indicate the production of a truncated protein, which consequently results in a non-functional polypeptide.
Organization of the murine Cd22 locus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Law, Che-Leung; Torres, R.M.; Sundeberg, H.A.
1993-07-01
Murine CD22 (mCD22) is a B cell-associated adhesion protein with seven extracellular Ig-like domains that has 62% amino acid identify to its human homologue. Southern analysis on genomic DNA isolated from tissues and cell lines from several mouse strains using mCD22 cDNA demonstrated that the Cd22 locus encoding mCD22 is a single copy gene of [le]30 kb. Digestion of genomic DNA preparations with four restriction endonucleases revealed the presence of restriction fragment length polymorphisms (RFLP) in BALB/c, C57BL/6, and C3H strains vs DBA/2j, NZB, and NZC strains, suggesting the presence of two or more Cd22 alleles. Using a mCD22 cDNAmore » clone derived from the BALB/c strain, the authors isolated genomic clones from a DBA/2 genomic library that contained all the exons necessary to encode the full length mCD22 cDNA. Fifteen exons, including exon 3 that encodes the translation start codon, were identified. Each extracellular Ig-like domain of mCD22 is encoded by a single exon. A comparison between the nucleotide sequences of the BALB/c CD22 cDNA and the exons of the DBA/2j CD22 genomic clones revealed an 18-nucleotide deletion in exon 4 (encoding the most distal Ig-like domain 1 of mCD22) of the DBA/2j genomic sequence in addition to a number of substitutions, insertions, and deletions in other exons. These nucleotide differences were also present in a cDNA clone isolated from total RNA of LPS-activated DBA/2j splenocytes mosome 7, a region sytenic to human chromosome 19q, close to the previously reported loci, Lyb-8 and Mag (a homologue of Cd22). An antibody (CY34) against the Lyb-8.2 B cell marker reacted with a BHK transfectant expressing the full length mCd22 cDNA, thus demonstrating that Lyb-8 and Cd22 loci are identical. Furthermore, a rat anti-mCD22 mAb, NIM-R6, bound to slgM[sup +] DBA/2j B cells, confirming the expression of a CD22 protein by the Cd22[sup a]/lyb-8[sup a] allele. 63 refs., 7 figs., 1 tab.« less
Genetics of Type III Bartter Syndrome in Spain, Proposed Diagnostic Algorithm
García Castaño, Alejandro; Pérez de Nanclares, Gustavo; Madariaga, Leire; Aguirre, Mireia; Madrid, Alvaro; Nadal, Inmaculada; Navarro, Mercedes; Lucas, Elena; Fijo, Julia; Espino, Mar; Espitaletta, Zilac; Castaño, Luis; Ariceta, Gema
2013-01-01
The p.Ala204Thr mutation (exon 7) of the CLCNKB gene is a "founder" mutation that causes most of type III Bartter syndrome cases in Spain. We performed genetic analysis of the CLCNKB gene, which encodes for the chloride channel protein ClC-Kb, in a cohort of 26 affected patients from 23 families. The diagnostic algorithm was: first, detection of the p.Ala204Thr mutation; second, detecting large deletions or duplications by Multiplex Ligation-dependent Probe Amplification and Quantitative Multiplex PCR of Short Fluorescent Fragments; and third, sequencing of the coding and flanking regions of the whole CLCNKB gene. In our genetic diagnosis, 20 families presented with the p.Ala204Thr mutation. Of those, 15 patients (15 families) were homozygous (57.7% of overall patients). Another 8 patients (5 families) were compound heterozygous for the founder mutation together with a second one. Thus, 3 patients (2 siblings) presented with the c. -19-?_2053+? del deletion (comprising the entire gene); one patient carried the p.Val170Met mutation (exon 6); and 4 patients (3 siblings) presented with the novel p.Glu442Gly mutation (exon 14). On the other hand, another two patients carried two novel mutations in compound heterozygosis: one presented the p.Ile398_Thr401del mutation (exon 12) associated with the c. -19-?_2053+? del deletion, and the other one carried the c.1756+1G>A splice-site mutation (exon 16) as well as the already described p.Ala210Val change (exon 7). One case turned out to be negative in our genetic screening. In addition, 51 relatives were found to be heterozygous carriers of the described CLCNKB mutations. In conclusion, different mutations cause type III Bartter syndrome in Spain. The high prevalence of the p.Ala204Thr in Spanish families thus justifies an initial screen for this mutation. However, should it not be detected further investigation of the CLCNKB gene is warranted in clinically diagnosed families. PMID:24058621
Genetics of type III Bartter syndrome in Spain, proposed diagnostic algorithm.
García Castaño, Alejandro; Pérez de Nanclares, Gustavo; Madariaga, Leire; Aguirre, Mireia; Madrid, Alvaro; Nadal, Inmaculada; Navarro, Mercedes; Lucas, Elena; Fijo, Julia; Espino, Mar; Espitaletta, Zilac; Castaño, Luis; Ariceta, Gema
2013-01-01
The p.Ala204Thr mutation (exon 7) of the CLCNKB gene is a "founder" mutation that causes most of type III Bartter syndrome cases in Spain. We performed genetic analysis of the CLCNKB gene, which encodes for the chloride channel protein ClC-Kb, in a cohort of 26 affected patients from 23 families. The diagnostic algorithm was: first, detection of the p.Ala204Thr mutation; second, detecting large deletions or duplications by Multiplex Ligation-dependent Probe Amplification and Quantitative Multiplex PCR of Short Fluorescent Fragments; and third, sequencing of the coding and flanking regions of the whole CLCNKB gene. In our genetic diagnosis, 20 families presented with the p.Ala204Thr mutation. Of those, 15 patients (15 families) were homozygous (57.7% of overall patients). Another 8 patients (5 families) were compound heterozygous for the founder mutation together with a second one. Thus, 3 patients (2 siblings) presented with the c. -19-?_2053+? del deletion (comprising the entire gene); one patient carried the p.Val170Met mutation (exon 6); and 4 patients (3 siblings) presented with the novel p.Glu442Gly mutation (exon 14). On the other hand, another two patients carried two novel mutations in compound heterozygosis: one presented the p.Ile398_Thr401del mutation (exon 12) associated with the c. -19-?_2053+? del deletion, and the other one carried the c.1756+1G>A splice-site mutation (exon 16) as well as the already described p.Ala210Val change (exon 7). One case turned out to be negative in our genetic screening. In addition, 51 relatives were found to be heterozygous carriers of the described CLCNKB mutations. In conclusion, different mutations cause type III Bartter syndrome in Spain. The high prevalence of the p.Ala204Thr in Spanish families thus justifies an initial screen for this mutation. However, should it not be detected further investigation of the CLCNKB gene is warranted in clinically diagnosed families.
Vasconcelos, O; Sivakumar, K; Dalakas, M C; Quezado, M; Nagle, J; Leon-Monzon, M; Dubnick, M; Gajdusek, D C; Goldfarb, L G
1995-01-01
Mutations in the human phosphofructokinase muscle subunit gene (PFKM) are known to cause myopathy classified as glycogenosis type VII (Tarui disease). Previously described molecular defects include base substitutions altering encoded amino acids or resulting in abnormal splicing. We report a mutation resulting in phosphofructokinase deficiency in three patients from an Ashkenazi Jewish family. Using a reverse transcription PCR assay, PFKM subunit transcripts differing by length were detected in skeletal muscle tissue of all three affected subjects. In the longer transcript, an insertion of 252 nucleotides totally homologous to the structure of the 10th intron of the PFKM gene was found separating exon 10 from exon 11. In addition, two single base transitions were identified by direct sequencing: [exon 6; codon 95; CGA (Arg) to TGA (stop)] and [exon 7; codon 172; ACC (Thr) to ACT (Thr)] in either transcript. Single-stranded conformational polymorphism and restriction enzyme analyses confirmed the presence of these point substitutions in genomic DNA and strongly suggested homozygosity for the pathogenic allele. The nonsense mutation at codon 95 appeared solely responsible for the phenotype in these patients, further expanding genetic heterogeneity of Tarui disease. Transcripts with and without intron 10 arising from identical mutant alleles probably resulted from differential pre-mRNA processing and may represent a novel message from the PFKM gene. Images Fig. 2 Fig. 4 Fig. 5 PMID:7479776
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solera, J.; Magallon, M.; Martin-Villar, J.
1992-02-01
DNA from a patient with severe hemophilia B was evaluated by RFLP analysis, producing results which suggested the existence of a partial deletion within the factor IX gene. The deletion was further localized and characterized by PCR amplification and sequencing. The altered allele has a 4,442-bp deletion which removes both the donor splice site located at the 5[prime] end of intron d and the two last coding nucleotides located at the 3[prime] end of exon IV in the normal factor IX gene; this fragment has been inserted in inverted orientation. Two homologous sequences have been discovered at the ends ofmore » the deleted DNA fragment.« less
USDA-ARS?s Scientific Manuscript database
KCC3 and KCC1 are potassium chloride transporters with partially overlapping function, and KCC3 knockout mice exhibit hypertension. Two KCC3 isoforms differ by alternate promoters and first coding exons: KCC3a is widely expressed, and KCC3b is highly expressed in kidney proximal convoluted tubule. W...
Nucleotide sequences of two genomic DNAs encoding peroxidase of Arabidopsis thaliana.
Intapruk, C; Higashimura, N; Yamamoto, K; Okada, N; Shinmyo, A; Takano, M
1991-02-15
The peroxidase (EC 1.11.1.7)-encoding gene of Arabidopsis thaliana was screened from a genomic library using a cDNA encoding a neutral isozyme of horseradish, Armoracia rusticana, peroxidase (HRP) as a probe, and two positive clones were isolated. From the comparison with the sequences of the HRP-encoding genes, we concluded that two clones contained peroxidase-encoding genes, and they were named prxCa and prxEa. Both genes consisted of four exons and three introns; the introns had consensus nucleotides, GT and AG, at the 5' and 3' ends, respectively. The lengths of each putative exon of the prxEa gene were the same as those of the HRP-basic-isozyme-encoding gene, prxC3, and coded for 349 amino acids (aa) with a sequence homology of 89% to that encoded by prxC3. The prxCa gene was very close to the HRP-neutral-isozyme-encoding gene, prxC1b, and coded for 354 aa with 91% homology to that encoded by prxC1b. The aa sequence homology was 64% between the two peroxidases encoded by prxCa and prxEa.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Feder, J.N.; Jan, L.Y.; Jan, Y.N.
The Drosophila hairy gene encodes a basic helix- loop-helix protein that functions in at least two steps during Drosophila development: (1) during embryogenesis, when it partakes in the establishment of segments, and (2) during the larval stage, when it functions negatively in determining the pattern of sensory bristles on the adult fly. In the rat, a structurally homologous gene (RHL) behaves as an immediate-early gene in its response to growth factors and can, like that in Drosophila, suppress neuronal differentiation events. Here, the authors report the genomic cloning of the human hairy gene homolog (HRY). The coding region of themore » gene is contained within four exons. The predicted amino acid sequence reveals only four amino acid differences between the human and rat genes. Analysis of the DNA sequence 5[prime] to the coding region reveals a putatitve untranslated exon. To increase the value of the HRY gene as a genetic marker and to assess its potential involvement in genetic disorders, they sublocalized the locus to chromosome 3q28-q29 by fluorescence in situ hybridization. 34 refs., 4 figs., 1 tab.« less
Yang, Huiqin; Li, Shiqiang; Xiao, Xueshan; Guo, Xiangming; Zhang, Qingjiong
2012-08-01
To screen mutations in the norrin (NDP) gene in 44 unrelated Chinese patients with familial exudative vitreoretinopathy (FEVR, 38 cases) or Norrie disease (6 cases) and to describe the associated phenotypes. Of the 44 patients, mutation in FZD4, LRP5, and TSPAN12 was excluded in 38 patients with FEVR in previous study. Sanger sequencing was used to analyze the 2 coding exons and their adjacent regions of NDP in the 44 patients. Clinical data were presented for patients with mutation. NDP variants in 5 of the 6 patients with Norrie disease were identified, including a novel missense mutation (c.164G>A, p.Cys55Phe) in one patient, two known missense mutations (c.122G>A, p.Arg41Lys; c.220C>T, p.Arg74Cys) in two patients, and a gross deletion encompassing the two coding exons in two patients. Of the 5 patients, 3 had a family history and 2 were singleton cases. No mutation in NDP was detected in the 38 patients with FEVR. NDP mutations are common cause of Norrie disease but might be rare cause for FEVR in Chinese.
Deletion in the EVC2 gene causes chondrodysplastic dwarfism in Tyrolean Grey cattle.
Murgiano, Leonardo; Jagannathan, Vidhya; Benazzi, Cinzia; Bolcato, Marilena; Brunetti, Barbara; Muscatello, Luisa Vera; Dittmer, Keren; Piffer, Christian; Gentile, Arcangelo; Drögemüller, Cord
2014-01-01
During the summer of 2013 seven Italian Tyrolean Grey calves were born with abnormally short limbs. Detailed clinical and pathological examination revealed similarities to chondrodysplastic dwarfism. Pedigree analysis showed a common founder, assuming autosomal monogenic recessive transmission of the defective allele. A positional cloning approach combining genome wide association and homozygosity mapping identified a single 1.6 Mb genomic region on BTA 6 that was associated with the disease. Whole genome re-sequencing of an affected calf revealed a single candidate causal mutation in the Ellis van Creveld syndrome 2 (EVC2) gene. This gene is known to be associated with chondrodysplastic dwarfism in Japanese Brown cattle, and dwarfism, abnormal nails and teeth, and dysostosis in humans with Ellis-van Creveld syndrome. Sanger sequencing confirmed the presence of a 2 bp deletion in exon 19 (c.2993_2994ACdel) that led to a premature stop codon in the coding sequence of bovine EVC2, and was concordant with the recessive pattern of inheritance in affected and carrier animals. This loss of function mutation confirms the important role of EVC2 in bone development. Genetic testing can now be used to eliminate this form of chondrodysplastic dwarfism from Tyrolean Grey cattle.
Deletion in the EVC2 Gene Causes Chondrodysplastic Dwarfism in Tyrolean Grey Cattle
Murgiano, Leonardo; Jagannathan, Vidhya; Benazzi, Cinzia; Bolcato, Marilena; Brunetti, Barbara; Muscatello, Luisa Vera; Dittmer, Keren; Piffer, Christian; Gentile, Arcangelo; Drögemüller, Cord
2014-01-01
During the summer of 2013 seven Italian Tyrolean Grey calves were born with abnormally short limbs. Detailed clinical and pathological examination revealed similarities to chondrodysplastic dwarfism. Pedigree analysis showed a common founder, assuming autosomal monogenic recessive transmission of the defective allele. A positional cloning approach combining genome wide association and homozygosity mapping identified a single 1.6 Mb genomic region on BTA 6 that was associated with the disease. Whole genome re-sequencing of an affected calf revealed a single candidate causal mutation in the Ellis van Creveld syndrome 2 (EVC2) gene. This gene is known to be associated with chondrodysplastic dwarfism in Japanese Brown cattle, and dwarfism, abnormal nails and teeth, and dysostosis in humans with Ellis-van Creveld syndrome. Sanger sequencing confirmed the presence of a 2 bp deletion in exon 19 (c.2993_2994ACdel) that led to a premature stop codon in the coding sequence of bovine EVC2, and was concordant with the recessive pattern of inheritance in affected and carrier animals. This loss of function mutation confirms the important role of EVC2 in bone development. Genetic testing can now be used to eliminate this form of chondrodysplastic dwarfism from Tyrolean Grey cattle. PMID:24733244
Law, Yee-Song; Gudimella, Ranganath; Song, Beng-Kah; Ratnam, Wickneswari; Harikrishna, Jennifer Ann
2012-01-01
Many of the plant leucine rich repeat receptor-like kinases (LRR-RLKs) have been found to regulate signaling during plant defense processes. In this study, we selected and sequenced an LRR-RLK gene, designated as Oryza rufipogon receptor-like protein kinase 1 (OrufRPK1), located within yield QTL yld1.1 from the wild rice Oryza rufipogon (accession IRGC105491). A 2055 bp coding region and two exons were identified. Southern blotting determined OrufRPK1 to be a single copy gene. Sequence comparison with cultivated rice orthologs (OsI219RPK1, OsI9311RPK1 and OsJNipponRPK1, respectively derived from O. sativa ssp. indica cv. MR219, O. sativa ssp. indica cv. 9311 and O. sativa ssp. japonica cv. Nipponbare) revealed the presence of 12 single nucleotide polymorphisms (SNPs) with five non-synonymous substitutions, and 23 insertion/deletion sites. The biological role of the OrufRPK1 as a defense related LRR-RLK is proposed on the basis of cDNA sequence characterization, domain subfamily classification, structural prediction of extra cellular domains, cluster analysis and comparative gene expression. PMID:22942769
Prenatal diagnosis for a Chinese family with a de novo DMD gene mutation
Li, Tao; Zhang, Zhao-jing; Ma, Xin; Lv, Xue; Xiao, Hai; Guo, Qian-nan; Liu, Hong-yan; Wang, Hong-dan; Wu, Dong; Lou, Gui-yu; Wang, Xin; Zhang, Chao-yang; Liao, Shi-xiu
2017-01-01
Abstract Background: Patients with Duchenne muscular dystrophy (DMD) usually have severe and fatal symptoms. At present, there is no effective treatment for DMD, thus it is very important to avoid the birth of children with DMD by effective prenatal diagnosis. We identified a de novo DMD gene mutation in a Chinese family, and make a prenatal diagnosis. Methods: First, multiplex ligation-dependent probe amplification (MLPA) was applied to analyze DMD gene exon deletion/duplication in all family members. The coding sequences of 79 exons in DMD gene were analyzed by Sanger sequencing in the patient; and then according to DMD gene exon mutation in the patient, DMD gene sequencing was performed in the family members. On the basis of results above, the pathogenic mutation in DMD gene was identified. Results: MLPA showed no DMD gene exon deletion/duplication in all family members. Sanger sequencing revealed c.2767_2767delT [p.Ser923LeufsX26] mutation in DMD gene of the patient. Heterozygous deletion mutation (T/-) at this locus was observed in the pregnant woman and her mother and younger sister. The analyses of amniotic fluid samples indicated negative Y chromosome sex-determining gene, no DMD gene exon deletion/duplication, no mutations at c.2767 locus, and the inherited maternal X chromosome different from that of the patient. Conclusion: The pathogenic mutation in DMD gene, c.2767_2767delT [p.Ser923LeufsX26], identified in this family is a de novo mutation. On the basis of specific conditions, it is necessary to select suitable methods to make prenatal diagnosis more effective, accurate, and economic. PMID:29390271
Mutations in GNA11 in Uveal Melanoma
Van Raamsdonk, Catherine D.; Griewank, Klaus G.; Crosby, Michelle B.; Garrido, Maria C.; Vemula, Swapna; Wiesner, Thomas; Obenauf, Anna C.; Wackernagel, Werner; Green, Gary; Bouvier, Nancy; Sozen, M. Mert; Baimukanova, Gail; Roy, Ritu; Heguy, Adriana; Dolgalev, Igor; Khanin, Raya; Busam, Klaus; Speicher, Michael R.; O’Brien, Joan; Bastian, Boris C.
2011-01-01
BACKGROUND Uveal melanoma is the most common intraocular cancer. There are no effective therapies for metastatic disease. Mutations in GNAQ, the gene encoding an alpha subunit of heterotrimeric G proteins, are found in 40% of uveal melanomas. METHODS We sequenced exon 5 of GNAQ and GNA11, a paralogue of GNAQ, in 713 melanocytic neoplasms of different types (186 uveal melanomas, 139 blue nevi, 106 other nevi, and 282 other melanomas). We sequenced exon 4 of GNAQ and GNA11 in 453 of these samples and in all coding exons of GNAQ and GNA11 in 97 uveal melanomas and 45 blue nevi. RESULTS We found somatic mutations in exon 5 (affecting Q209) and in exon 4 (affecting R183) in both GNA11 and GNAQ, in a mutually exclusive pattern. Mutations affecting Q209 in GNA11 were present in 7% of blue nevi, 32% of primary uveal melanomas, and 57% of uveal melanoma metastases. In contrast, we observed Q209 mutations in GNAQ in 55% of blue nevi, 45% of uveal melanomas, and 22% of uveal melanoma metastases. Mutations affecting R183 in either GNAQ or GNA11 were less prevalent (2% of blue nevi and 6% of uveal melanomas) than the Q209 mutations. Mutations in GNA11 induced spontaneously metastasizing tumors in a mouse model and activated the mitogen-activated protein kinase pathway. CONCLUSIONS Of the uveal melanomas we analyzed, 83% had somatic mutations in GNAQ or GNA11. Constitutive activation of the pathway involving these two genes appears to be a major contributor to the development of uveal melanoma. (Funded by the National Institutes of Health and others.) PMID:21083380
Vysokovsky, A; Saxena, R; Landau, M; Zivelin, A; Eskaraev, R; Rosenberg, N; Seligsohn, U; Inbal, A
2004-10-01
Hereditary factor (F)XIII deficiency is a rare bleeding disorder mostly due to mutations in FXIII A subunit. We studied the molecular basis of FXIII deficiency in patients from 10 unrelated families originating from Israel, India and Tunisia. Exons 2-15 of genomic DNA consisting of coding regions and intron/exon boundaries were amplified and sequenced. Structural analysis of the mutations was undertaken by computer modeling. Seven novel mutations were identified in the FXIIIA gene. The propositus from the Ethiopian-Jewish family was found to be a compound heterozygote for two novel mutations: a 10-bp deletion in exon 12 at nucleotides 1652-1661 (followed by 22 altered amino acids and termination codon) and Ala318Val mutation. The propositus of the Tunisian family was homozygous for C insertion after nucleotide 863 within a stretch of six cytosines of exon 7. This insertion results in generation of eight altered amino acids followed by a termination codon downstream. The propositus from Indian-Jewish origin was found to be homozygous for G to T substitution at IVS 11 [+1] resulting in skipping of exons 10 and 11. In addition to the Ala318Val mutation, three of the novel mutations identified are missense mutations: Arg260Leu, Thr398Asn and Gly210Arg each occurring in a homozygous state in an Israeli-Arab and two Indian families, respectively. Structure-function correlation analysis by computer modeling of the new missense mutations predicted that Gly210Arg will cause protein misfolding, Ala318Val and Thr398Asn will interfere with the catalytic process or protein stability, and Arg260Leu will impair dimerization.
Yeakley, J M; Hedjran, F; Morfin, J P; Merillat, N; Rosenfeld, M G; Emeson, R B
1993-01-01
The calcitonin/calcitonin gene-related peptide (CGRP) primary transcript is alternatively spliced in thyroid C cells and neurons, resulting in the tissue-specific production of calcitonin and CGRP mRNAs. Analyses of mutated calcitonin/CGRP transcription units in permanently transfected cell lines have indicated that alternative splicing is regulated by a differential capacity to utilize the calcitonin-specific splice acceptor. The analysis of an extensive series of mutations suggests that tissue-specific regulation of calcitonin mRNA production does not depend on the presence of a single, unique cis-active element but instead appears to be a consequence of suboptimal constitutive splicing signals. While only those mutations that altered constitutive splicing signals affected splice choices, the action of multiple regulatory sequences cannot be formally excluded. Further, we have identified a 13-nucleotide purine-rich element from a constitutive exon that, when placed in exon 4, entirely switches splice site usage in CGRP-producing cells. These data suggest that specific exon recruitment sequences, in combination with other constitutive elements, serve an important function in exon recognition. These results are consistent with the hypothesis that tissue-specific alternative splicing of the calcitonin/CGRP primary transcript is mediated by cell-specific differences in components of the constitutive splicing machinery. Images PMID:8413203
van den Berg, L; Kwant, L; Hestand, M S; van Oost, B A; Leegwater, P A J
2005-01-01
Aggressive behavior is the most frequently encountered behavioral problem in dogs. Abnormalities in brain serotonin metabolism have been described in aggressive dogs. We studied canine serotonergic genes to investigate genetic factors underlying canine aggression. Here, we describe the characterization of three genes of the canine serotonergic system: the serotonin receptor 1A and 2A gene (htr1A and htr2A) and the serotonin transporter gene (slc6A4). We isolated canine bacterial artificial chromosome clones containing these genes and designed oligonucleotides for genomic sequencing of coding regions and intron-exon boundaries. Golden retrievers were analyzed for DNA sequence variations. We found two nonsynonymous single nucleotide polymorphisms (SNPs) in the coding sequence of htr1A; one SNP close to a splice site in htr2A; and two SNPs in slc6A4, one in the coding sequence and one close to a splice site. In addition, we identified a polymorphic microsatellite marker for each gene. Htr1A is a strong candidate for involvement in the domestication of the dog. We genotyped the htr1A SNPs in 41 dogs of seven breeds with diverse behavioral characteristics. At least three SNP haplotypes were found. Our results do not support involvement of the gene in domestication.
RPS8—a New Informative DNA Marker for Phylogeny of Babesia and Theileria Parasites in China
Tian, Zhan-Cheng; Liu, Guang-Yuan; Yin, Hong; Luo, Jian-Xun; Guan, Gui-Quan; Luo, Jin; Xie, Jun-Ren; Shen, Hui; Tian, Mei-Yuan; Zheng, Jin-feng; Yuan, Xiao-song; Wang, Fang-fang
2013-01-01
Piroplasmosis is a serious debilitating and sometimes fatal disease. Phylogenetic relationships within piroplasmida are complex and remain unclear. We compared the intron–exon structure and DNA sequences of the RPS8 gene from Babesia and Theileria spp. isolates in China. Similar to 18S rDNA, the 40S ribosomal protein S8 gene, RPS8, including both coding and non-coding regions is a useful and novel genetic marker for defining species boundaries and for inferring phylogenies because it tends to have little intra-specific variation but considerable inter-specific difference. However, more samples are needed to verify the usefulness of the RPS8 (coding and non-coding regions) gene as a marker for the phylogenetic position and detection of most Babesia and Theileria species, particularly for some closely related species. PMID:24244571
Garuti, R; Lelli, N; Barozzini, M; Tiozzo, R; Ghisellini, M; Simone, M L; Li Volti, S; Garozzo, R; Mollica, F; Vergoni, W; Bertolini, S; Calandra, S
1996-03-01
In the present study we report two novel partial deletions of the LDL-R gene. The first (FH Siracusa), found in an FH-heterozygote, consists of a 20 kb deletion spanning from the 5' flanking region to the intron 2 of the LDL-receptor gene. The elimination of the promoter and the first two exons prevents the transcription of the deleted allele, as shown by Northern blot analysis of LDL-R mRNA isolated from the proband's fibroblasts. The second deletion (FH Reggio Emilia), which eliminates 11 nucleotides of exon 10, was also found in an FH heterozygote. The characterization of this deletion was made possible by a combination of techniques such as single strand conformation polymorphism (SSCP) analysis, direct sequence of exon 10 and cloning of the normal and deleted exon 10 from the proband's DNA. The 11 nt deletion occurs in a region of exon 10 which contains three triplets (CTG) and two four-nucleotides (CTGG) direct repeats. This structural feature might render this region more susceptible to a slipped mispairing during DNA duplication. Since this deletion causes a shift of the BamHI site at the 5' end of exon 10, a method has been devised for its rapid screening which is based on the PCR amplification of exon 10 followed by BamHI digestion. FH Reggio Emilia deletion produces a shift in the reading frame downstream from Lys458, leading to a sequence of 51 novel amino acids before the occurrence of a premature stop codon (truncated receptor). However, since RT-PCR failed to demonstrate the presence of the mutant LDL-R mRNA in proband fibroblasts, it is likely that the amount of truncated receptor produced in these cells is negligible.
Intergenic mRNA molecules resulting from trans-splicing.
Finta, Csaba; Zaphiropoulos, Peter G
2002-02-22
Accumulated recent evidence is indicating that alternative splicing represents a generalized process that increases the complexity of human gene expression. Here we show that mRNA production may not necessarily be limited to single genes, as human liver also has the potential to produce a variety of hybrid cytochrome P450 3A mRNA molecules. The four known cytochrome P450 3A genes in humans, CYP3A4, CYP3A5, CYP3A7, and CYP3A43, share a high degree of similarity, consist of 13 exons with conserved exon-intron boundaries, and form a cluster on chromosome 7. The chimeric CYP3A mRNA molecules described herein are characterized by CYP3A43 exon 1 joined at canonical splice sites to distinct sets of CYP3A4 or CYP3A5 exons. Because the CYP3A43 gene is in a head-to-head orientation with the CYP3A4 and CYP3A5 genes, bypassing transcriptional termination can not account for the formation of hybrid CYP3A mRNAs. Thus, the mechanism generating these molecules has to be an RNA processing event that joins exons of independent pre-mRNA molecules, i.e. trans-splicing. Using quantitative real-time polymerase chain reaction, the ratio of one CYP3A43/3A4 intergenic combination was estimated to be approximately 0.15% that of the CYP3A43 mRNAs. Moreover, trans-splicing has been found not to interfere with polyadenylation. Heterologous expression of the chimeric species composed of CYP3A43 exon 1 joined to exons 2-13 of CYP3A4 revealed catalytic activity toward testosterone.
Zimowski, Janusz G; Pilch, Jacek; Pawelec, Magdalena; Purzycka, Joanna K; Kubalska, Jolanta; Ziora-Jakutowicz, Karolina; Dudzińska, Magdalena; Zaremba, Jacek
2017-08-01
In the material of 227 families with Becker muscular dystrophy (BMD), we found nine non-consanguineous families with 17 male individuals carrying a rare mutation-a single exon 48 deletion of the dystrophin gene-who were affected with a very mild or subclinical form of BMD. They were usually detected thanks to accidental findings of elevated serum creatine phosphokinase (sCPK). A thorough clinical analysis of the carriers, both children (12) and adults (5), revealed in some of them muscle hypotonia (10/17) and/or very mild muscle weakness (9/17), as well as decreased tendon reflexes (6/17). Adults, apart from very mild muscle weakness and calf hypertrophy in some, had no significant abnormalities on neurological assessments and had good exercise tolerance. Parents of the children carriers of the exon 48 deletion are usually unaware of their children being affected, and possibly at risk of developing life-threatening cardiomyopathy. The same concerns the adult male carriers. Therefore, the authors postulate undertaking preventive measures such as cascade screening of the relatives of the probands. Newborn screening programmes of Duchenne muscular dystrophy (DMD)/BMD based on sCPK marked increase may be considered.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hu, P.Y.; Ernst, A.R.; Sly, W.S.
1994-04-01
To date, three different structural gene mutations have been identified in patients with carbonic anhydrase II deficiency (osteopetrosis with renal tubular acidosis and cerebral calcification). These include a missense mutation (H107Y) in two families, a splice junction mutation in intron 5 in one of these families, and a splice junction mutation in intron 2 for which many Arabic patients are homozygous. The authors report here a novel mutation for which carbonic anhydrase II-deficient patients from seven unrelated Hispanic families were found to be homozygous. The proband was a 2 1/2-year-old Hispanic girl of Puerto Rican ancestry who was unique clinically,more » in that she had no evidence of renal tubular acidosis, even though she did have osteopetrosis, developmental delay, and cerebral calcification. She proved to be homozygous for a single-base deletion in the coding region of exon 7 that produces a frameshift that changes the next 12 amino acids before leading to chain termination and that also introduces a new MaeIII restriction site. The 27-kD truncated enzyme produced when the mutant cDNA was expressed in COS cells was enzymatically inactive, present mainly in insoluble aggregates, and detectable immunologically at only 5% the level of the 29-kD normal carbonic anhydrase II expressed from the wild-type cDNA. Metabolic labeling revealed that this 27-kD mutant protein has an accelerated rate of degradation. Six subsequent Hispanic patients of Caribbean ancestry, all of whom had osteopetrosis and renal tubular acidosis but who varied widely in clinical severity, were found to be homozygous for the same mutation. These findings identify a novel mutation common to Hispanic patients from the Caribbean islands and provide a ready means for PCR-based diagnosis of the [open quotes]Hispanic mutation.[close quotes] The basis for their phenotypic variability is not yet clear. 15 refs., 5 figs., 1 tab.« less
Suzuki, Yoshiaki; Ohya, Susumu; Yamamura, Hisao; Giles, Wayne R; Imaizumi, Yuji
2016-11-11
Large conductance Ca 2+ -activated K + (BK) channels play essential roles in both excitable and non-excitable cells. For example, in chondrocytes, agonist-induced Ca 2+ release from intracellular store activates BK channels, and this hyperpolarizes these cells, augments Ca 2+ entry, and forms a positive feed-back mechanism for Ca 2+ signaling and stimulation-secretion coupling. In the present study, functional roles of a newly identified splice variant in the BK channel α subunit (BKαΔe2) were examined in a human chondrocyte cell line, OUMS-27, and in a HEK293 expression system. Although BKαΔe2 lacks exon2, which codes the intracellular S0-S1 linker (Glu-127-Leu-180), significant expression was detected in several tissues from humans and mice. Molecular image analyses revealed that BKαΔe2 channels are not expressed on plasma membrane but can traffic to the plasma membrane after forming hetero-tetramer units with wild-type BKα (BKαWT). Single-channel current analyses demonstrated that BKα hetero-tetramers containing one, two, or three BKαΔe2 subunits are functional. These hetero-tetramers have a smaller single channel conductance and exhibit lower trafficking efficiency than BKαWT homo-tetramers in a stoichiometry-dependent manner. Site-directed mutagenesis of residues in exon2 identified Helix2 and the linker to S1 (Trp-158-Leu-180, particularly Arg-178) as an essential segment for channel function including voltage dependence and trafficking. BKαΔe2 knockdown in OUMS-27 chondrocytes increased BK current density and augmented the responsiveness to histamine assayed as cyclooxygenase-2 gene expression. These findings provide significant new evidence that BKαΔe2 can modulate cellular responses to physiological stimuli in human chondrocyte and contribute under pathophysiological conditions, such as osteoarthritis. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Song, Yiqing; Hsu, Yi-Hsiang; Niu, Tianhua; Manson, Joann E; Buring, Julie E; Liu, Simin
2009-01-17
Ion channel transient receptor potential membrane melastatin 6 and 7 (TRPM6 and TRPM7) play a central role in magnesium homeostasis, which is critical for maintaining glucose and insulin metabolism. However, it is unclear whether common genetic variation in TRPM6 and TRPM7 contributes to risk of type 2 diabetes. We conducted a nested case-control study in the Women's Health Study. During a median of 10 years of follow-up, 359 incident diabetes cases were diagnosed and matched by age and ethnicity with 359 controls. We analyzed 20 haplotype-tagging single nucleotide polymorphisms (SNPs) in TRPM6 and 5 common SNPs in TRPM7 for their association with diabetes risk. Overall, there was no robust and significant association between any single SNP and diabetes risk. Neither was there any evidence of association between common TRPM6 and TRPM7 haplotypes and diabetes risk. Our haplotype analyses suggested a significant risk of type 2 diabetes among carriers of both the rare alleles from two non-synomous SNPs in TRPM6 (Val1393Ile in exon 26 [rs3750425] and Lys1584Glu in exon 27 [rs2274924]) when their magnesium intake was lower than 250 mg per day. Compared with non-carriers, women who were carriers of the haplotype 1393Ile-1584Glu had an increased risk of type 2 diabetes (OR, 4.92, 95% CI, 1.05-23.0) only when they had low magnesium intake (<250 mg/day). Our results provide suggestive evidence that two common non-synonymous TRPM6 coding region variants, Ile1393Val and Lys1584Glu polymorphisms, might confer susceptibility to type 2 diabetes in women with low magnesium intake. Further replication in large-scale studies is warranted.
Lyons, Brendan M; McHenry, Monique A; Barrington, David S
2017-07-01
Cytosolic phosphoglucose isomerase (pgiC) is an enzyme essential to glycolysis found universally in eukaryotes, but broad understanding of variation in the gene coding for pgiC is lacking for ferns. We used a substantially expanded representation of the gene for Andean species of the fern genus Polystichum to characterize pgiC in ferns relative to angiosperms, insects, and an amoebozoan; assess the impact of selection versus neutral evolutionary processes on pgiC; and explore evolutionary relationships of selected Andean species. The dataset of complete sequences comprised nine accessions representing seven species and one hybrid from the Andes and Serra do Mar. The aligned sequences of the full data set comprised 3376 base pairs (70% of the entire gene) including 17 exons and 15 introns from two central areas of the gene. The exons are highly conserved relative to angiosperms and retain substantial homology to insect pgiC, but intron length and structure are unique to the ferns. Average intron size is similar to angiosperms; intron number and location in insects are unlike those of the plants we considered. The introns included an array of indels and, in intron 7, an extensive microsatellite array with potential utility in analyzing population-level histories. Bayesian and maximum-parsimony analysis of 129 variable nucleotides in the Andean polystichums revealed that 59 (1.7% of the 3376 total) were phylogenetically informative; most of these united sister accessions. The phylogenetic trees for the Andean polystichums were incongruent with previously published cpDNA trees for the same taxa, likely the result of rapid evolutionary change in the introns and contrasting stability in the exons. The exons code a total of seven amino-acid substitutions. Comparison of non-synonymous to synonymous substitutions did not suggest that the pgiC gene is under selection in the Andes. Variation in pgiC including two additional accessions represented by incomplete sequences provided new insights into reticulate relationships among Andean taxa. Copyright © 2017 Elsevier Inc. All rights reserved.
The effect of the common c.2299delG mutation in USH2A on RNA splicing.
Lenassi, Eva; Saihan, Zubin; Bitner-Glindzicz, Maria; Webster, Andrew R
2014-05-01
Recessive variants in the USH2A gene are an important cause of both Usher syndrome and nonsyndromic retinitis pigmentosa. A single base-pair deletion in exon 13 (c.2299delG, p.Glu767Serfs*21) is considered the most frequent mutation of USH2A. It is predicted to generate a premature termination codon and is presumed to lead to nonsense mediated decay. However the effect of this variant on RNA has not been formally investigated. It is not uncommon for exonic sequence alterations to cause aberrant splicing and the aim of the present report is to evaluate the effect of c.2299delG on USH2A transcripts. Nasal cells represent the simplest available tissue to study splicing defects in USH2A. Nasal brushing, RNA extraction from nasal epithelial cells and reverse transcription PCR were performed in five Usher syndrome patients who were homozygous for c.2299delG, two unaffected c.2299delG heterozygotes and seven control individuals. Primers to amplify between exons 12 and 15 and exons 10 and 14 were utilised. Significant variability was observed between different RT-PCR experiments. Importantly, in controls, PCR product of the expected size were amplified on all occasions (13/13 experiments); for patients this was true in only 4/14 experiments (Fisher exact test p = 0.0002). Bioinformatics tools predict the c.2299delG change to disrupt an exonic splicing enhancer and to create an exonic splicing silencer within exon 13. Here, we report an effect of the common c.2299delG mutation on splicing of exons 12 and 13 of USH2A. Future studies are expected to provide important insights into the contribution of this effect on the phenotype. Copyright © 2014 Elsevier Ltd. All rights reserved.
Exclusion of alternative exon 33 of CaV1.2 calcium channels in heart is proarrhythmogenic
Li, Guang; Wang, Juejin; Liao, Ping; Bartels, Peter; Zhang, Hengyu; Yu, Dejie; Liang, Mui Cheng; Poh, Kian Keong; Yu, Chye Yun; Jiang, Fengli; Yong, Tan Fong; Wong, Yuk Peng; Hu, Zhenyu; Huang, Hua; Zhang, Guangqin; Galupo, Mary Joyce; Bian, Jin-Song; Ponniah, Sathivel; Trasti, Scott Lee; Foo, Roger; Hoppe, Uta C.; Herzig, Stefan; Soong, Tuck Wah
2017-01-01
Alternative splicing changes the CaV1.2 calcium channel electrophysiological property, but the in vivo significance of such altered channel function is lacking. Structure–function studies of heterologously expressed CaV1.2 channels could not recapitulate channel function in the native milieu of the cardiomyocyte. To address this gap in knowledge, we investigated the role of alternative exon 33 of the CaV1.2 calcium channel in heart function. Exclusion of exon 33 in CaV1.2 channels has been reported to shift the activation potential −10.4 mV to the hyperpolarized direction, and increased expression of CaV1.2Δ33 channels was observed in rat myocardial infarcted hearts. However, how a change in CaV1.2 channel electrophysiological property, due to alternative splicing, might affect cardiac function in vivo is unknown. To address these questions, we generated mCacna1c exon 33−/−-null mice. These mice contained CaV1.2Δ33 channels with a gain-of-function that included conduction of larger currents that reflects a shift in voltage dependence and a modest increase in single-channel open probability. This altered channel property underscored the development of ventricular arrhythmia, which is reflected in significantly more deaths of exon 33−/− mice from β-adrenergic stimulation. In vivo telemetric recordings also confirmed increased frequencies in premature ventricular contractions, tachycardia, and lengthened QT interval. Taken together, the significant decrease or absence of exon 33-containing CaV1.2 channels is potentially proarrhythmic in the heart. Of clinical relevance, human ischemic and dilated cardiomyopathy hearts showed increased inclusion of exon 33. However, the possible role that inclusion of exon 33 in CaV1.2 channels may play in the pathogenesis of human heart failure remains unclear. PMID:28490495
Lam, V M; Huang, W; Lam, S T; Yeung, C Y; Johnson, P H
1996-03-01
We describe here the use of denaturing gradient gel electrophoresis (DGGE) to detect the most common Chinese glucose-6-phosphate dehydrogenase (G6PD) variants, which are the single point mutations: G-->T at nt 1376, G-->A at 1388 both in exon 12 and A-->G at nt 95 in exon 02. In each case, the mutant allele resolves well from the normal allele(s). The distinct heteroduplex bands are characteristic of a particular genotype suggesting that this feature is very useful for identifying all heterozygous carriers for this and other X-linked diseases. When the analysis is extended to other exons, DGGE scans the gene and coupled with direct sequencing, it leads to the identification of new G6PD variation(s). With this approach, we identified a mutation in exon 9 which had not been reported in Hong Kong. Since DGGE can rapidly screen many unknown samples in one gel, this approach could be used to diagnose these G6PD mutations and to identify the at-risk for counselling.
Suzuki, Takashi; Brown, Judy J.; Swift, Larry L.
2016-01-01
Microsomal triglyceride transfer protein (MTP) is essential for the assembly of triglyceride-rich apolipoprotein B-containing lipoproteins. Previous studies in our laboratory identified a novel splice variant of MTP in mice that we named MTP-B. MTP-B has a unique first exon (1B) located 2.7 kB upstream of the first exon (1A) for canonical MTP (MTP-A). The two mature isoforms, though nearly identical in sequence and function, have different tissue expression patterns. In this study we report the identification of a second MTP splice variant (MTP-C), which contains both exons 1B and 1A. MTP-C is expressed in all the tissues we tested. In cells transfected with MTP-C, protein expression was less than 15% of that found when the cells were transfected with MTP-A or MTP-B. In silico analysis of the 5’-UTR of MTP-C revealed seven ATGs upstream of the start site for MTP-A, which is the only viable start site in frame with the main coding sequence. One of those ATGs was located in the 5’-UTR for MTP-A. We generated reporter constructs in which the 5’-UTRs of MTP-A or MTP-C were inserted between an SV40 promoter and the coding sequence of the luciferase gene and transfected these constructs into HEK 293 cells. Luciferase activity was significantly reduced by the MTP-C 5’-UTR, but not by the MTP-A 5’-UTR. We conclude that alternative splicing plays a key role in regulating MTP expression by introducing unique 5’-UTRs, which contain elements that alter translation efficiency, enabling the cell to optimize MTP levels and activity. PMID:26771188
Gonzalez, Luis Miguel; Bonay, Pedro; Benitez, Laura; Ferrer, Elizabeth; Harrison, Leslie J S; Parkhouse, R Michael E; Garate, Teresa
2007-02-01
Two clones from an activated Taenia saginata oncosphere cDNA library, Ts45W and Ts45S, were isolated and sequenced. Both of these genes belong to the Taenia ovis 45W gene family. The Ts45W and Ts45S cDNAs are 997- and 1,004-bp-long, each corresponding to 255 amino acids and with theoretical molecular masses of 27.8 and 27.7 kDa, respectively. Southern blot profiles obtained with Ts45W cDNA as a probe suggest that these two genes are members of a multigene family with tandem organization. The full genomic sequence was determined for the Ts45W gene and a new family member, the Ts45W/2 gene. The genomic sequences of the T. saginata Ts45W and Ts45W/2 genes were at least 2.2 kb in length with four exons separated by three introns. Exons 1 and 4 coded for hydrophobic domains, while, importantly, exons 2 and 3 coded for fibronectin homologous domains. These domains are presumably responsible for the demonstrated cell adhesion and, perhaps, the protective nature of this family of molecules and the acronym TAF (Taenia adhesion family) is proposed for this group of genes. We hypothesize that these TAF proteins and another T. saginata-protective antigen, HP6, have evolved the dual functions of facilitating tissue invasion and stimulating protective immunity to first ensure primary infection and subsequently to establish a concomitant protective immunity to protect the host from death or debilitation through superinfection by subsequent infections and thus help ensure parasite survival.
Screening for microsatellite instability target genes in colorectal cancers
Vilkki, S; Launonen, V; Karhu, A; Sistonen, P; Vastrik, I; Aaltonen, L
2002-01-01
Background: Defects in the DNA repair system lead to genetic instability because replication errors are not corrected. This type of genetic instability is a key event in the malignant progression of HNPCC and a subset of sporadic colon cancers and mutation rates are particularly high at short repetitive sequences. Somatic deletions of coding mononucleotide repeats have been detected, for example, in the TGFßRII and BAX genes, and recently many novel target genes for microsatellite instability (MSI) have been proposed. Novel target genes are likely to be discovered in the future. More data should be created on background mutation rates in MSI tumours to evaluate mutation rates observed in the candidate target genes. Methods: Mutation rates in 14 neutral intronic repeats were evaluated in MSI tumours. Bioinformatic searches combined with keywords related to cancer and tumour suppressor or CRC related gene homology were used to find new candidate MSI target genes. By comparison of mutation frequencies observed in intronic mononucleotide repeats versus exonic coding repeats of potential MSI target genes, the significance of the exonic mutations was estimated. Results: As expected, the length of an intronic mononucleotide repeat correlated positively with the number of slippages for both G/C and A/T repeats (p=0.0020 and p=0.0012, respectively). BRCA1, CtBP1, and Rb1 associated CtIP and other candidates were found in a bioinformatic search combined with keywords related to cancer. Sequencing showed a significantly increased mutation rate in the exonic A9 repeat of CtIP (25/109=22.9%) as compared with similar intronic repeats (p≤0.001). Conclusions: We propose a new candidate MSI target gene CtIP to be evaluated in further studies. PMID:12414815
Yao, Q; Fischer, K P; Tyrrell, D L; Gutfreund, K S
2015-04-01
Programmed death ligand-1 (PD-L1) plays an important role in the attenuation of adaptive immune responses in higher vertebrates. Here, we describe the identification of the Pekin duck PD-L1 orthologue (duPD-L1) and its gene structure. The duPD-L1 cDNA encodes a 311-amino acid protein that has an amino acid identity of 78% and 42% with chicken and human PD-L1, respectively. Mapping of the duPD-L1 cDNA with duck genomic sequences revealed an exonic structure of its coding sequence similar to those of other vertebrates but lacked a noncoding exon 1. Homology modelling of the duPD-L1 extracellular domain was compatible with the tandem IgV-like and IgC-like IgSF domain structure of human PD-L1 (PDB ID: 3BIS). Residues known to be important for receptor binding of human PD-L1 were mostly conserved in duPD-L1 within the N-terminus and the G sheet, and partially conserved within the F sheet but not within sheets C and C'. DuPD-L1 mRNA was constitutively expressed in all tissues examined with highest expression levels in lung and spleen and very low levels of expression in muscle, kidney and brain. Mitogen stimulation of duck peripheral blood mononuclear cells transiently increased duPD-L1 mRNA expression. Our observations demonstrate evolutionary conservation of the exonic structure of its coding sequence, the extracellular domain structure and residues implicated in receptor binding, but the role of the longer cytoplasmic tail in avian PD-L1 proteins remains to be determined. © 2014 John Wiley & Sons Ltd.
Rozhdestvensky, Timofey S.; Robeck, Thomas; Galiveti, Chenna R.; Raabe, Carsten A.; Seeger, Birte; Wolters, Anna; Gubar, Leonid V.; Brosius, Jürgen; Skryabin, Boris V.
2016-01-01
Prader-Willi syndrome (PWS) is a neurogenetic disorder caused by loss of paternally expressed genes on chromosome 15q11-q13. The PWS-critical region (PWScr) contains an array of non-protein coding IPW-A exons hosting intronic SNORD116 snoRNA genes. Deletion of PWScr is associated with PWS in humans and growth retardation in mice exhibiting ~15% postnatal lethality in C57BL/6 background. Here we analysed a knock-in mouse containing a 5′HPRT-LoxP-NeoR cassette (5′LoxP) inserted upstream of the PWScr. When the insertion was inherited maternally in a paternal PWScr-deletion mouse model (PWScrp−/m5′LoxP), we observed compensation of growth retardation and postnatal lethality. Genomic methylation pattern and expression of protein-coding genes remained unaltered at the PWS-locus of PWScrp−/m5′LoxP mice. Interestingly, ubiquitous Snord116 and IPW-A exon transcription from the originally silent maternal chromosome was detected. In situ hybridization indicated that PWScrp−/m5′LoxP mice expressed Snord116 in brain areas similar to wild type animals. Our results suggest that the lack of PWScr RNA expression in certain brain areas could be a primary cause of the growth retardation phenotype in mice. We propose that activation of disease-associated genes on imprinted regions could lead to general therapeutic strategies in associated diseases. PMID:26848093
Dynamic and Widespread lncRNA Expression in a Sponge and the Origin of Animal Complexity
Gaiti, Federico; Fernandez-Valverde, Selene L.; Nakanishi, Nagayasu; Calcino, Andrew D.; Yanai, Itai; Tanurdzic, Milos; Degnan, Bernard M.
2015-01-01
Long noncoding RNAs (lncRNAs) are important developmental regulators in bilaterian animals. A correlation has been claimed between the lncRNA repertoire expansion and morphological complexity in vertebrate evolution. However, this claim has not been tested by examining morphologically simple animals. Here, we undertake a systematic investigation of lncRNAs in the demosponge Amphimedon queenslandica, a morphologically simple, early-branching metazoan. We combine RNA-Seq data across multiple developmental stages of Amphimedon with a filtering pipeline to conservatively predict 2,935 lncRNAs. These include intronic overlapping lncRNAs, exonic antisense overlapping lncRNAs, long intergenic nonprotein coding RNAs, and precursors for small RNAs. Sponge lncRNAs are remarkably similar to their bilaterian counterparts in being relatively short with few exons and having low primary sequence conservation relative to protein-coding genes. As in bilaterians, a majority of sponge lncRNAs exhibit typical hallmarks of regulatory molecules, including high temporal specificity and dynamic developmental expression. Specific lncRNA expression profiles correlate tightly with conserved protein-coding genes likely involved in a range of developmental and physiological processes, such as the Wnt signaling pathway. Although the majority of Amphimedon lncRNAs appears to be taxonomically restricted with no identifiable orthologs, we find a few cases of conservation between demosponges in lncRNAs that are antisense to coding sequences. Based on the high similarity in the structure, organization, and dynamic expression of sponge lncRNAs to their bilaterian counterparts, we propose that these noncoding RNAs are an ancient feature of the metazoan genome. These results are consistent with lncRNAs regulating the development of animals, regardless of their level of morphological complexity. PMID:25976353
A mutation of the p63 gene in non‐syndromic cleft lip
Leoyklang, P; Siriwan, P; Shotelersuk, V
2006-01-01
Mutations in the p63 gene (TP63) underlie several monogenic malformation syndromes manifesting cleft lip with or without cleft palate (CL/P). We investigated whether p63 mutations also result in non‐syndromic CL/P. Specifically, we performed mutation analysis of the 16 exons of the p63 gene for 100 Thai patients with non‐syndromic CL/P. In total, 21 variant sites were identified. All were single nucleotide changes, with six in coding regions, including three novel non‐synonymous changes: S90L, R313G, and D564H. The R313G was concluded to be pathogenic on the basis of its amino acid change, evolutionary conservation, its occurrence in a functionally important domain, its predicted damaging function, its de novo occurrence, and its absence in 500 control individuals. Our data strongly suggest, for the first time, a causative role of a heterozygous mutation in the p63 gene in non‐syndromic CL/P, highlighting the wide phenotypic spectrum of p63 gene mutations. PMID:16740912
Kassabov, Stefan R.; Choi, Yun-Beom; Karl, Kevin A.; Vishwasrao, Harshad D.; Bailey, Craig H.; Kandel, Eric R.
2014-01-01
Summary Neurotrophins control the development and adult plasticity of the vertebrate nervous system. Failure to identify invertebrate neurotrophin orthologs, however, has precluded studies in invertebrate models, limiting understanding of fundamental aspects of neurotrophin biology and function. We identified a neurotrophin (ApNT) and Trk receptor (ApTrk) in the mollusk Aplysia and find they play a central role in learning related synaptic plasticity. ApNT increases the magnitude and lowers the threshold for induction of long-term facilitation and initiates the growth of new synaptic varicosities at the monosynaptic connection between sensory and motor neurons of the gill-withdrawal reflex. Unlike vertebrate neurotrophins, ApNT has multiple coding exons and exerts distinct synaptic effects through differentially processed and secreted splice isoforms. Our findings demonstrate the existence of bona-fide neurotrophin signaling in invertebrates and reveal a novel, post-transcriptional mechanism, regulating neurotrophin processing and the release of pro- and mature neurotrophins which differentially modulate synaptic plasticity. PMID:23562154
Cost-Effective Sequencing of Full-Length cDNA Clones Powered by a De Novo-Reference Hybrid Assembly
Sugano, Sumio; Morishita, Shinichi; Suzuki, Yutaka
2010-01-01
Background Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. Methodology We developed a program, MuSICA 2, that assembles millions of short (36-nucleotide) reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence ∼800 human full-length cDNA clones. MuSICA 2 performs a hybrid assembly in which an external de novo assembler is run first and the result is then improved by reference alignment of shotgun reads. We compared the MuSICA 2 assembly with 200 pooled full-length cDNA clones finished independently by the conventional primer-walking using Sanger sequencers. The exon-intron structure of the coding sequence was correct for more than 95% of the clones with coding sequence annotation when we excluded cDNA clones insufficiently represented in the shotgun library due to PCR failure (42 out of 200 clones excluded), and the nucleotide-level accuracy of coding sequences of those correct clones was over 99.99%. We also applied MuSICA 2 to full-length cDNA clones from Toxoplasma gondii, to confirm that its ability was competent even for non-human species. Conclusions The entire sequencing and shotgun assembly takes less than 1 week and the consumables cost only ∼US$3 per clone, demonstrating a significant advantage over previous approaches. PMID:20479877
Kuroyanagi, Hidehito; Watanabe, Yohei; Suzuki, Yutaka; Hagiwara, Masatoshi
2013-01-01
A large fraction of protein-coding genes in metazoans undergo alternative pre-mRNA splicing in tissue- or cell-type-specific manners. Recent genome-wide approaches have identified many putative-binding sites for some of tissue-specific trans-acting splicing regulators. However, the mechanisms of splicing regulation in vivo remain largely unknown. To elucidate the modes of splicing regulation by the neuron-specific CELF family RNA-binding protein UNC-75 in Caenorhabditis elegans, we performed deep sequencing of poly(A)+ RNAs from the unc-75(+)- and unc-75-mutant worms and identified more than 20 cassette and mutually exclusive exons repressed or activated by UNC-75. Motif searches revealed that (G/U)UGUUGUG stretches are enriched in the upstream and downstream introns of the UNC-75-repressed and -activated exons, respectively. Recombinant UNC-75 protein specifically binds to RNA fragments carrying the (G/U)UGUUGUG stretches in vitro. Bi-chromatic fluorescence alternative splicing reporters revealed that the UNC-75-target exons are regulated in tissue-specific and (G/U)UGUUGUG element-dependent manners in vivo. The unc-75 mutation affected the splicing reporter expression specifically in the nervous system. These results indicate that UNC-75 regulates alternative splicing of its target exons in neuron-specific and position-dependent manners through the (G/U)UGUUGUG elements in C. elegans. This study thus reveals the repertoire of target events for the CELF family in the living organism. PMID:23416545
Lack of haplotype structuring for two candidate genes for trypanotolerance in cattle.
Álvarez, I; Pérez-Pardal, L; Traoré, A; Fernández, I; Goyache, F
2016-04-01
Bovine trypanotolerance is a heritable trait associated to the ability of the individuals to control parasitaemia and anaemia. The INHBA (BTA4) and TICAM1 (BTA7) genes are strong candidates for trypanotolerance-related traits. The coding sequence of both genes (3951 bp in total) were analysed in a panel including 79 Asian, African and European cattle (Bos taurus and B. indicus) to identify naturally occurring polymorphisms on both genes. In general, the genetic diversity was low. Nineteen of the 33 mutations identified were found just one time. Seventeen different haplotypes were defined for the TICAM1 gene, and 9 and 12 were defined for the exon 1 and the exon 2 of the INHBA gene, respectively. There was no clear separation between cattle groups. The most frequent haplotypes identified in West African taurine samples were also identified in other cattle groups including Asian zebu and European cattle. Phylogenetic trees and principal component analysis confirmed that divergence among the cattle groups analysed was poor, particularly for the INHBA sequences. The European cattle subset had the lowest values of haplotype diversity for both the exon1 (monomorphic) and the exon2 (0.077 ± 0.066) of the INHBA gene. Neutrality tests, in general, did not suggest that the analysed genes were under positive selection. The assessed scenario would be consistent with the identification of recent mutations in evolutionary terms. © 2015 Blackwell Verlag GmbH.
Hypoxia regulates alternative splicing of HIF and non-HIF target genes.
Sena, Johnny A; Wang, Liyi; Heasley, Lynn E; Hu, Cheng-Jun
2014-09-01
Hypoxia is a common characteristic of many solid tumors. The hypoxic microenvironment stabilizes hypoxia-inducible transcription factor 1α (HIF1α) and 2α (HIF2α/EPAS1) to activate gene transcription, which promotes tumor cell survival. The majority of human genes are alternatively spliced, producing RNA isoforms that code for functionally distinct proteins. Thus, an effective hypoxia response requires increased HIF target gene expression as well as proper RNA splicing of these HIF-dependent transcripts. However, it is unclear if and how hypoxia regulates RNA splicing of HIF targets. This study determined the effects of hypoxia on alternative splicing (AS) of HIF and non-HIF target genes in hepatocellular carcinoma cells and characterized the role of HIF in regulating AS of HIF-induced genes. The results indicate that hypoxia generally promotes exon inclusion for hypoxia-induced, but reduces exon inclusion for hypoxia-reduced genes. Mechanistically, HIF activity, but not hypoxia per se is found to be necessary and sufficient to increase exon inclusion of several HIF targets, including pyruvate dehydrogenase kinase 1 (PDK1). PDK1 splicing reporters confirm that transcriptional activation by HIF is sufficient to increase exon inclusion of PDK1 splicing reporter. In contrast, transcriptional activation of a PDK1 minigene by other transcription factors in the absence of endogenous HIF target gene activation fails to alter PDK1 RNA splicing. This study demonstrates a novel function of HIF in regulating RNA splicing of HIF target genes. ©2014 American Association for Cancer Research.
De novo insertion of an intron into the mammalian sex determining gene, SRY
O’Neill, Rachel J. Waugh; Brennan, Francine E.; Delbridge, Margaret L.; Crozier, Ross H.; Graves, Jennifer A. Marshall
1998-01-01
Two theories have been proposed to explain the evolution of introns within eukaryotic genes. The introns early theory, or “exon theory of genes,” proposes that introns are ancient and that recombination within introns provided new exon structure, and thus new genes. The introns late theory, or “insertional theory of introns,” proposes that ancient genes existed as uninterrupted exons and that introns have been introduced during the course of evolution. There is still controversy as to how intron–exon structure evolved and whether the majority of introns are ancient or novel. Although there is extensive evidence in support of the introns early theory, phylogenetic comparisons of several genes indicate recent gain and loss of introns within these genes. However, no example has been shown of a protein coding gene, intronless in its ancestral form, which has acquired an intron in a derived form. The mammalian sex determining gene, SRY, is intronless in all mammals studied to date, as is the gene from which it recently evolved. However, we report here comparisons of genomic and cDNA sequences that now provide evidence of a de novo insertion of an intron into the SRY gene of dasyurid marsupials. This recently (approximately 45 million years ago) inserted sequence is not homologous with known transposable elements. Our data demonstrate that introns may be inserted as spliced units within a developmentally crucial gene without disrupting its function. PMID:9465071
Pastor, André F.; Moura, Laís Rodrigues; Neto, José W.D.; Nascimento, Eduardo J.M.; Calzavara-Silva, Carlos E.; Gomes, Ana Lisa V.; da Silva, Ana Maria; Cordeiro, Marli T.; Braga-Neto, Ulisses; Crovella, Sergio; Gil, Laura H.V.G.; Marques, Ernesto T.A.; Acioli-Santos, Bartolomeu
2013-01-01
Four genetic polymorphisms located at the promoter (C-257T) and coding regions of CFH gene (exon 2 G257A, exon 14 A2089G and exon 19 G2881T) were investigated in 121 dengue patients (DENV-3) in order to assess the relationship between allele/haplotypes variants and clinical outcomes. A statistical value was found between the CFH-257T allele (TT/TC genotypes) and reduced susceptibility to severe dengue (SD). Statistical associations indicate that individuals bearing a T allele presented significantly higher protein levels in plasma. The –257T variant is located within a NF-κB binding site, suggesting that this variant might have effect on the ability of the CFH gene to respond to signals via the NF-κB pathway. The G257A allelic variant showed significant protection against severe dengue. When CFH haplotypes effect was considered, the ancestral CG/CG promoter-exon 2 SNP genotype showed significant risk to SD either in a general comparison (ancestral × all variant genotypes), as well as in individual genotypes comparison (ancestral × each variant genotype), where the most prevalent effect was observed in the CG/CG × CA/TG comparison. These findings support the involvement of –257T, 257A allele variants and haplotypes on severe dengue phenotype protection, related with high basal CFH expression. PMID:23747994
Campos, W N; Massaro, J D; Martinelli, A L C; Halliwell, J A; Marsh, S G E; Mendes-Junior, C T; Donadi, E A
2017-10-01
The HFE molecule controls iron uptake from gut, and defects in the molecule have been associated with iron overload, particularly in hereditary hemochromatosis. The HFE gene including both coding and boundary intronic regions were sequenced in 304 Brazilian individuals, encompassing healthy individuals and patients exhibiting hereditary or acquired iron overload. Six sites of variation were detected: (1) H63D C>G in exon 2, (2) IVS2 (+4) T>C in intron 2, (3) a C>G transversion in intron 3, (4) C282Y G>A in exon 4, (5) IVS4 (-44) T>C in intron 4, and (6) a new guanine deletion (G>del) in intron 5, which were used for haplotype inference. Nine HFE alleles were detected and six of these were officially named on the basis of the HLA Nomenclature, defined by the World Health Organization (WHO) Nomenclature Committee for Factors of the HLA System, and published via the IPD-IMGT/HLA website. Four alleles, HFE*001, *002, *003, and *004 exhibited variation within their exon sequences. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
PrimerZ: streamlined primer design for promoters, exons and human SNPs.
Tsai, Ming-Fang; Lin, Yi-Jung; Cheng, Yu-Chang; Lee, Kuo-Hsi; Huang, Cheng-Chih; Chen, Yuan-Tsong; Yao, Adam
2007-07-01
PrimerZ (http://genepipe.ngc.sinica.edu.tw/primerz/) is a web application dedicated primarily to primer design for genes and human SNPs. PrimerZ accepts genes by gene name or Ensembl accession code, and SNPs by dbSNP rs or AFFY_Probe IDs. The promoter and exon sequence information of all gene transcripts fetched from the Ensembl database (http://www.ensembl.org) are processed before being passed on to Primer3 (http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi) for individual primer design. All results returned from Primer 3 are organized and integrated in a specially designed web page for easy browsing. Besides the web page presentation, csv text file export is also provided for enhanced user convenience. PrimerZ automates highly standard but tedious gene primer design to improve the success rate of PCR experiments. More than 2000 primers have been designed with PrimerZ at our institute since 2004 and the success rate is over 70%. The addition of several new features has made PrimerZ even more useful to the research community in facilitating primer design for promoters, exons and SNPs.
Peng, Ting; Wang, Li; Zhou, Shu-Feng; Li, Xiaotian
2010-12-01
A number of mutations in GATA4 and NKX2.5 have been identified to be causative for a subset of familial congenital heart defects (CHDs) and a small number of sporadic CHDs. In this study, we evaluated common GATA4 and NKX2.5 mutations in 135 Chinese pediatric patients with non-familial congenital heart defects. Two novel mutations in the coding region of GATA4 were identified, namely, 487C >T (Pro163Ser) in exon 1 in a child with tetralogy of Fallot and 1220C >A (Pro407Gln) in exon 6 in a pediatric patient with outlet membranous ventricular septal defect. We also found 848C >A (Pro283Gln) in exon 2 of the NKX2.5 gene in a pediatric patient with ventricular septal defect, patent ductus arteriosus and aortic isthmus stenosis. None of the mutations was detected in healthy control subjects (n = 114). This study suggests that GATA4 and NKX2.5 missense mutations may be associated with congenital heart defects in pediatric Chinese patients. Further clinical studies with large samples are warranted.
Zhao, Zhanqi; Chu, Chan-Ching; Chang, Mei-Yun; Chang, Hao-Tai; Hsu, Yeong-Long
2018-06-01
Methylmalonic acidemia (MMA) is an autosomal recessive disease of organic acidemia. We report a 26-year-old male who presented with metabolic acidosis, acute renal failure required hemodialysis and acute respiratory failure required mechanical ventilation support. Progressive hypotonia of muscles made weaning from mechanical ventilator difficult. High level of serum methylmalonic acid and the mut genotype sequences confirmed the diagnosis of this adult-onset MMA. Two mut genotype sequences were found by analyzing all coding exons and exon-intron junctions. One genotype was well documented (Exon 6 Mutation, c. 1280G>A. p. G427D, heterozygous). The other mut genotype sequence had never been reported elsewhere (Intron 6 Novel, c. 1333-13_c. 1333-8delTTTTTC, heterozygous). Diet modification, medication, regular hemodialysis and physical rehabilitation. Weaning strategy adjusted with help of electrical impedance tomography. The muscle power of the patient gradually recovered. Extubation of the patient was successful and he was discharged without oxygen required. This case gives us the lesson that MMA can be newly diagnosed in adult patient. A new mut genotype sequence was discovered. The use of electrical impedance tomography to select a suitable method for inspiratory muscle training was possible and useful.
Novel RS1 mutations associated with X-linked juvenile retinoschisis
YI, JUNHUI; LI, SHIQIANG; JIA, XIAOYUN; XIAO, XUESHAN; WANG, PANFENG; GUO, XIANGMING; ZHANG, QINGJIONG
2012-01-01
To identify mutations in the retinoschisin (RS1) gene in families with X-linked retinoschisis (XLRS). Twenty families with XLRS were enrolled in this study. All six coding exons and adjacent intronic regions of RS1 were amplified by polymerase chain reaction (PCR). The nucleotide sequences of the amplicons were determined by Sanger sequencing. Ten hemizygous mutations in RS1 were detected in patients from 14 of the 20 families. Four of the ten mutations were novel, including c:176G>A (p:Cys59Tyr) in exon 3, c:531T>G (p:Tyr177X), c:607C>G (p:Pro203Ala) and c:668G>A (p:Cys223Tyr) in exon 6. These four novel mutations were not present in 176 normal individuals. The remaining six were recurrent mutations, including c:214G>A (p:Glu72Lys), c:304C>T (p:Arg102Trp), c:436G>A (p:Glu146Lys), c:544C>T (p:Arg182Cys), c:599G>A (p:Arg200His) and c:644A>T (p:Glu215Val). Our study expanded the mutation spectrum of RS1 and enriches our understanding of the molecular basis of XLRS. PMID:22245991
Novel RS1 mutations associated with X-linked juvenile retinoschisis.
Yi, Junhui; Li, Shiqiang; Jia, Xiaoyun; Xiao, Xueshan; Wang, Panfeng; Guo, Xiangming; Zhang, Qingjiong
2012-04-01
To identify mutations in the retinoschisin (RS1) gene in families with X-linked retinoschisis (XLRS). Twenty families with XLRS were enrolled in this study. All six coding exons and adjacent intronic regions of RS1 were amplified by polymerase chain reaction (PCR). The nucleotide sequences of the amplicons were determined by Sanger sequencing. Ten hemizygous mutations in RS1 were detected in patients from 14 of the 20 families. Four of the ten mutations were novel, including c:176G>A (p:Cys59Tyr) in exon 3, c:531T>G (p:Tyr177X), c:607C>G (p:Pro203Ala) and c:668G>A (p:Cys223Tyr) in exon 6. These four novel mutations were not present in 176 normal individuals. The remaining six were recurrent mutations, including c:214G>A (p:Glu72Lys), c:304C>T (p:Arg102Trp), c:436G>A (p:Glu146Lys), c:544C>T (p:Arg182Cys), c:599G>A (p:Arg200His) and c:644A>T (p:Glu215Val). Our study expanded the mutation spectrum of RS1 and enriches our understanding of the molecular basis of XLRS.
Yuan, Kejun; Wang, Changjun; Xin, Li; Zhang, Anning; Ai, Chengxiang
2013-07-25
A farnesyl diphosphate synthase gene (FPPS2), which contains 11 introns and 12 exons, was isolated from the apple cultivar "White Winter Pearmain". When it was compared to our previously reported FPPS1, its each intron size was different, its each exon size was the same as that of FPPS1 gene, 30 nucleotide differences were found in its coding sequence. Based on these nucleotide differences, specific primers were designed to perform expression analysis; the results showed that it expressed in both fruit and leaf, its expression level was obviously lower than that of FPPS1 gene in fruit which was stored at 4°C for 5 weeks. This is the first report concerning two FPPS genes and their expression comparison in apples. Copyright © 2013 Elsevier B.V. All rights reserved.
Yan, S Q; Hou, J N; Bai, C Y; Jiang, Y; Zhang, X J; Ren, H L; Sun, B X; Zhao, Z H; Sun, J H
2014-04-01
The dominant white coat colour of farmed blue fox is inherited as a monogenic autosomal dominant trait and is suggested to be embryonic lethal in the homozygous state. In this study, the transcripts of KIT were identified by RT-PCR for a dominant white fox and a normal blue fox. Sequence analysis showed that the KIT transcript in normal blue fox contained the full-length coding sequence of 2919 bp (GenBank Acc. No KF530833), but in the dominant white individual, a truncated isoform lacking the entire exon 12 specifically co-expressed with the normal transcript. Genomic DNA sequencing revealed that a single nucleotide polymorphism (c.1867+1G>T) in intron 12 appeared only in the dominant white individuals and a 1-bp ins/del polymorphism in the same intron showed in individuals representing two different coat colours. Genotyping results of the SNP with PCR-RFLP in 185 individuals showed all 90 normal blue foxes were homozygous for the G allele, and all dominant white individuals were heterozygous. Due to the truncated protein with a deletion of 35 amino acids and an amino acid replacement (p.Pro623Ala) located in the conserved ATP binding domain, we propose that the mutant receptor had absent tyrosine kinase activity. These findings reveal that the base substitution at the first nucleotide of intron 12 of KIT gene, resulting in skipping of exon 12, is a causative mutation responsible for the dominant white phenotype of blue fox. © 2013 Stichting International Foundation for Animal Genetics.
Kimura, Hiroki; Tsuboi, Daisuke; Wang, Chenyao; Kushima, Itaru; Koide, Takayoshi; Ikeda, Masashi; Iwayama, Yoshimi; Toyota, Tomoko; Yamamoto, Noriko; Kunimoto, Shohko; Nakamura, Yukako; Yoshimi, Akira; Banno, Masahiro; Xing, Jingrui; Takasaki, Yuto; Yoshida, Mami; Aleksic, Branko; Uno, Yota; Okada, Takashi; Iidaka, Tetsuya; Inada, Toshiya; Suzuki, Michio; Ujike, Hiroshi; Kunugi, Hiroshi; Kato, Tadafumi; Yoshikawa, Takeo; Iwata, Nakao; Kaibuchi, Kozo; Ozaki, Norio
2015-01-01
Background: Nuclear distribution E homolog 1 (NDE1), located within chromosome 16p13.11, plays an essential role in microtubule organization, mitosis, and neuronal migration and has been suggested by several studies of rare copy number variants to be a promising schizophrenia (SCZ) candidate gene. Recently, increasing attention has been paid to rare single-nucleotide variants (SNVs) discovered by deep sequencing of candidate genes, because such SNVs may have large effect sizes and their functional analysis may clarify etiopathology. Methods and Results: We conducted mutation screening of NDE1 coding exons using 433 SCZ and 145 pervasive developmental disorders samples in order to identify rare single nucleotide variants with a minor allele frequency ≤5%. We then performed genetic association analysis using a large number of unrelated individuals (3554 SCZ, 1041 bipolar disorder [BD], and 4746 controls). Among the discovered novel rare variants, we detected significant associations between SCZ and S214F (P = .039), and between BD and R234C (P = .032). Furthermore, functional assays showed that S214F affected axonal outgrowth and the interaction between NDE1 and YWHAE (14-3-3 epsilon; a neurodevelopmental regulator). Conclusions: This study strengthens the evidence for association between rare variants within NDE1 and SCZ, and may shed light into the molecular mechanisms underlying this severe psychiatric disorder. PMID:25332407
nGASP - the nematode genome annotation assessment project
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coghlan, A; Fiedler, T J; McKay, S J
2008-12-19
While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner'more » algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders. While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner' algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders.« less
Multiplex amplification of large sets of human exons.
Porreca, Gregory J; Zhang, Kun; Li, Jin Billy; Xie, Bin; Austin, Derek; Vassallo, Sara L; LeProust, Emily M; Peck, Bill J; Emig, Christopher J; Dahl, Fredrik; Gao, Yuan; Church, George M; Shendure, Jay
2007-11-01
A new generation of technologies is poised to reduce DNA sequencing costs by several orders of magnitude. But our ability to fully leverage the power of these technologies is crippled by the absence of suitable 'front-end' methods for isolating complex subsets of a mammalian genome at a scale that matches the throughput at which these platforms will routinely operate. We show that targeting oligonucleotides released from programmable microarrays can be used to capture and amplify approximately 10,000 human exons in a single multiplex reaction. Additionally, we show integration of this protocol with ultra-high-throughput sequencing for targeted variation discovery. Although the multiplex capture reaction is highly specific, we found that nonuniform capture is a key issue that will need to be resolved by additional optimization. We anticipate that highly multiplexed methods for targeted amplification will enable the comprehensive resequencing of human exons at a fraction of the cost of whole-genome resequencing.
Carpinelli, Marina R.; Wicks, Ian P.; Sims, Natalie A.; O’Donnell, Kristy; Hanzinikolas, Katherine; Burt, Rachel; Foote, Simon J.; Bahlo, Melanie; Alexander, Warren S.; Hilton, Douglas J.
2002-01-01
We describe the clinical, genetic, biochemical, and molecular characterization of a mouse that arose in the first generation (G1) of a random mutagenesis screen with the chemical mutagen ethyl-nitrosourea. The mouse was observed to have skeletal abnormalities inherited with an X-linked dominant pattern of inheritance. The causative mutation, named Skeletal abnormality 1 (Ska1), was shown to be a single base pair mutation in a splice donor site immediately following exon 8 of the Phex (phosphate-regulating gene with homologies to endopeptidases located on the X-chromosome) gene. This point mutation caused skipping of exon 8 from Phex mRNA, hypophosphatemia, and features of rickets. This experimentally induced phenotype mirrors the human condition X-linked hypophosphatemia; directly confirms the role of Phex in phosphate homeostasis, normal skeletal development, and rickets; and illustrates the power of mutagenesis in exploring animal models of human disease. PMID:12414538
Carpinelli, Marina R; Wicks, Ian P; Sims, Natalie A; O'Donnell, Kristy; Hanzinikolas, Katherine; Burt, Rachel; Foote, Simon J; Bahlo, Melanie; Alexander, Warren S; Hilton, Douglas J
2002-11-01
We describe the clinical, genetic, biochemical, and molecular characterization of a mouse that arose in the first generation (G(1)) of a random mutagenesis screen with the chemical mutagen ethyl-nitrosourea. The mouse was observed to have skeletal abnormalities inherited with an X-linked dominant pattern of inheritance. The causative mutation, named Skeletal abnormality 1 (Ska1), was shown to be a single base pair mutation in a splice donor site immediately following exon 8 of the Phex (phosphate-regulating gene with homologies to endopeptidases located on the X-chromosome) gene. This point mutation caused skipping of exon 8 from Phex mRNA, hypophosphatemia, and features of rickets. This experimentally induced phenotype mirrors the human condition X-linked hypophosphatemia; directly confirms the role of Phex in phosphate homeostasis, normal skeletal development, and rickets; and illustrates the power of mutagenesis in exploring animal models of human disease.
Two distinct promoters drive transcription of the human D1A dopamine receptor gene.
Lee, S H; Minowa, M T; Mouradian, M M
1996-10-11
The human D1A dopamine receptor gene has a GC-rich, TATA-less promoter located upstream of a small, noncoding exon 1, which is separated from the coding exon 2 by a 116-base pair (bp)-long intron. Serial 3'-deletions of the 5'-noncoding region of this gene, including the intron and 5'-end of exon 2, resulted in 80 and 40% decrease in transcriptional activity of the upstream promoter in two D1A-expressing neuroblastoma cell lines, SK-N-MC and NS20Y, respectively. To investigate the function of this region, the intron and 245 bp at the 5'-end of exon 2 were investigated. Transient expression analyses using various chloramphenicol acetyltransferase constructs showed that the transcriptional activity of the intron is higher than that of the upstream promoter by 12-fold in SK-N-MC cells and by 5.5-fold in NS20Y cells in an orientation-dependent manner, indicating that the D1A intron is a strong promoter. Primer extension and ribonuclease protection assays revealed that transcription driven by the intron promoter is initiated at the junction of intron and exon 2 and at a cluster of nucleotides located 50 bp downstream from this junction. The same transcription start sites are utilized by the chloramphenicol acetyltransferase constructs employed in transfections as well as by the D1A gene expressed within the human caudate. The relative abundance of D1A transcripts originating from the upstream promoter compared with those transcribed from the intron promoter is 1.5-2.9 times in SK-N-MC cells and 2 times in the human caudate. Transcript stability studies in SK-N-MC cells revealed that longer D1A mRNA molecules containing exon 1 are degraded 1.8 times faster than shorter transcripts lacking exon 1. Although gel mobility shift assay could not detect DNA-protein interaction at the D1A intron, competitive co-transfection using the intron as competitor confirmed the presence of trans-acting factors at the intron. These data taken together indicate that the human D1A gene has two functional TATA-less promoters, both in D1A expressing cultured neuroblastoma cells and in the human striatum.
Guttman, Mitchell; Garber, Manuel; Levin, Joshua Z.; Donaghey, Julie; Robinson, James; Adiconis, Xian; Fan, Lin; Koziol, Magdalena J.; Gnirke, Andreas; Nusbaum, Chad; Rinn, John L.; Lander, Eric S.; Regev, Aviv
2010-01-01
RNA-Seq provides an unbiased way to study a transcriptome, including both coding and non-coding genes. To date, most RNA-Seq studies have critically depended on existing annotations, and thus focused on expression levels and variation in known transcripts. Here, we present Scripture, a method to reconstruct the transcriptome of a mammalian cell using only RNA-Seq reads and the genome sequence. We apply it to mouse embryonic stem cells, neuronal precursor cells, and lung fibroblasts to accurately reconstruct the full-length gene structures for the vast majority of known expressed genes. We identify substantial variation in protein-coding genes, including thousands of novel 5′-start sites, 3′-ends, and internal coding exons. We then determine the gene structures of over a thousand lincRNA and antisense loci. Our results open the way to direct experimental manipulation of thousands of non-coding RNAs, and demonstrate the power of ab initio reconstruction to render a comprehensive picture of mammalian transcriptomes. PMID:20436462
Whyte, Michael P; Totty, William G; Novack, Deborah V; Zhang, Xiafang; Wenkert, Deborah; Mumm, Steven
2011-05-01
We report a 32-year-old man and his 59-year-old mother with a unique and extensive variant of Camurati-Engelmann disease (CED) featuring histopathological changes of osteomalacia and alterations within TGFβ1 and TNFSF11 encoding TGFβ1 and RANKL, respectively. He suffered leg pain and weakness since childhood and reportedly grew until his late 20s, reaching 7 feet in height. He had deafness, perforated nasal septum, torus palatinus, disproportionately long limbs with knock-knees, low muscle mass, and pseudoclubbing. Radiographs revealed generalized skeletal abnormalities, including wide bones and cortical and trabecular bone thickening in keeping with CED, except that long bone ends were also affected. Lumbar spine and hip BMD Z-scores were + 7.7 and + 4.4, respectively. Biochemical markers of bone turnover were elevated. Hypocalciuria accompanied low serum 25-hydroxyvitamin D (25[OH]D) levels. Pituitary hypogonadism and low serum insulin-like growth factor (IGF)-1 were present. Karyotype was normal. Despite vitamin D repletion, iliac crest histology revealed severe osteomalacia. Exon 1 of TNFRSF11A (RANK), exons 2, 3, and 4 of LRP5, and all coding exons and adjacent mRNA splice junctions of TNFRSF11B (OPG), SQSTM1 (sequestosome 1), and TNSALP (tissue nonspecific alkaline phosphatase) were intact. His asymptomatic and less dysmorphic 5'11″ mother, also with low serum 25(OH)D, had milder clinical, radiological, biochemical, and histopathological findings. Both individuals were heterozygous for a novel 12-bp duplication (c.27_38dup, p.L10_L13dup) in exon 1 of TGFβ1, predicting four additional leucine residues in the latency-associated-peptide segment of TGFβ1, consistent with CED. The son was also homozygous for a single base transversion in TNFSF11, predicting a nonconservative amino acid change (c.107C > G, p.Pro36Arg) in the intracellular domain of RANKL that was heterozygous in his nonconsanguineous parents. This TNFSF11 variant was not found in the SNP Database, nor in published TNFSF11 association studies, but it occurred in four of the 134 TNFSF11 alleles (3.0%) we tested randomly among individuals without CED. Perhaps the unique phenotype of this CED family is conditioned by altered RANKL activity. Copyright © 2011 American Society for Bone and Mineral Research.
Semerci, C Nur; Kalay, Ersan; Yıldırım, Cem; Dinçer, Tuba; Olmez, Akgün; Toraman, Bayram; Koçyiğit, Ali; Bulgu, Yunus; Okur, Volkan; Satıroğlu-Tufan, Lale; Akarsu, Nurten A
2014-06-01
This study aimed to identify the underlying genetic defect responsible for anophthalmia/microphthalmia. In total, two Turkish families with a total of nine affected individuals were included in the study. Affymetrix 250 K single nucleotide polymorphism genotyping and homozygosity mapping were used to identify the localisation of the genetic defect in question. Coding region of the ALDH1A3 gene was screened via direct sequencing. cDNA samples were generated from primary fibroblast cell cultures for expression analysis. Reverse transcriptase PCR (RT-PCR) analysis was performed using direct sequencing of the obtained fragments. The causative genetic defect was mapped to chromosome 15q26.3. A homozygous G>A substitution (c.666G>A) at the last nucleotide of exon 6 in the ALDH1A3 gene was identified in the first family. Further cDNA sequencing of ALDH1A3 showed that the c.666G>A mutation caused skipping of exon 6, which predicted in-frame loss of 43 amino acids (p.Trp180_Glu222del). A novel missense c.1398C>A mutation in exon 12 of ALDH1A3 that causes the substitution of a conserved asparagine by lysine at amino acid position 466 (p.Asn466Lys) was observed in the second family. No extraocular findings-except for nevus flammeus in one affected individual and a variant of Dandy-Walker malformation in another affected individual-were observed. Autistic-like behaviour and mental retardation were observed in three cases. In conclusion, novel ALDH1A3 mutations identified in the present study confirm the pivotal role of ALDH1A3 in human eye development. Autistic features, previously reported as an associated finding, were considered to be the result of social deprivation and inadequate parenting during early infancy in the presented families. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Molecular Analysis of Glucose-6-Phosphate Dehydrogenase Gene Mutations in Bangladeshi Individuals.
Sarker, Suprovath Kumar; Islam, Md Tarikul; Eckhoff, Grace; Hossain, Mohammad Amir; Qadri, Syeda Kashfi; Muraduzzaman, A K M; Bhuyan, Golam Sarower; Shahidullah, Mohammod; Mannan, Mohammad Abdul; Tahura, Sarabon; Hussain, Manzoor; Akhter, Shahida; Nahar, Nazmun; Shirin, Tahmina; Qadri, Firdausi; Mannoor, Kaiissar
2016-01-01
Glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common X-linked human enzyme defect of red blood cells (RBCs). Individuals with this gene defect appear normal until exposed to oxidative stress which induces hemolysis. Consumption of certain foods such as fava beans, legumes; infection with bacteria or virus; and use of certain drugs such as primaquine, sulfa drugs etc. may result in lysis of RBCs in G6PD deficient individuals. The genetic defect that causes G6PD deficiency has been identified mostly as single base missense mutations. One hundred and sixty G6PD gene mutations, which lead to amino acid substitutions, have been described worldwide. The purpose of this study was to detect G6PD gene mutations in hospital-based settings in the local population of Dhaka city, Bangladesh. Qualitative fluorescent spot test and quantitative enzyme activity measurement using RANDOX G6PDH kit were performed for analysis of blood specimens and detection of G6PD-deficient participants. For G6PD-deficient samples, PCR was done with six sets of primers specific for G6PD gene. Automated Sanger sequencing of the PCR products was performed to identify the mutations in the gene. Based on fluorescence spot test and quantitative enzyme assay followed by G6PD gene sequencing, 12 specimens (11 males and one female) among 121 clinically suspected patient-specimens were found to be deficient, suggesting a frequency of 9.9% G6PD deficiency. Sequencing of the G6PD-deficient samples revealed c.C131G substitution (exon-3: Ala44Gly) in six samples, c.G487A substitution (exon-6:Gly163Ser) in five samples and c.G949A substitution (exon-9: Glu317Lys) of coding sequence in one sample. These mutations either affect NADP binding or disrupt protein structure. From the study it appears that Ala44Gly and Gly163Ser are the most common G6PD mutations in Dhaka, Bangladesh. This is the first study of G6PD mutations in Bangladesh.
Molecular Analysis of Glucose-6-Phosphate Dehydrogenase Gene Mutations in Bangladeshi Individuals
Sarker, Suprovath Kumar; Hossain, Mohammad Amir; Qadri, Syeda Kashfi; Muraduzzaman, A. K. M.; Bhuyan, Golam Sarower; Shahidullah, Mohammod; Mannan, Mohammad Abdul; Tahura, Sarabon; Hussain, Manzoor; Akhter, Shahida; Nahar, Nazmun; Shirin, Tahmina; Qadri, Firdausi; Mannoor, Kaiissar
2016-01-01
Glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common X-linked human enzyme defect of red blood cells (RBCs). Individuals with this gene defect appear normal until exposed to oxidative stress which induces hemolysis. Consumption of certain foods such as fava beans, legumes; infection with bacteria or virus; and use of certain drugs such as primaquine, sulfa drugs etc. may result in lysis of RBCs in G6PD deficient individuals. The genetic defect that causes G6PD deficiency has been identified mostly as single base missense mutations. One hundred and sixty G6PD gene mutations, which lead to amino acid substitutions, have been described worldwide. The purpose of this study was to detect G6PD gene mutations in hospital-based settings in the local population of Dhaka city, Bangladesh. Qualitative fluorescent spot test and quantitative enzyme activity measurement using RANDOX G6PDH kit were performed for analysis of blood specimens and detection of G6PD-deficient participants. For G6PD-deficient samples, PCR was done with six sets of primers specific for G6PD gene. Automated Sanger sequencing of the PCR products was performed to identify the mutations in the gene. Based on fluorescence spot test and quantitative enzyme assay followed by G6PD gene sequencing, 12 specimens (11 males and one female) among 121 clinically suspected patient-specimens were found to be deficient, suggesting a frequency of 9.9% G6PD deficiency. Sequencing of the G6PD-deficient samples revealed c.C131G substitution (exon-3: Ala44Gly) in six samples, c.G487A substitution (exon-6:Gly163Ser) in five samples and c.G949A substitution (exon-9: Glu317Lys) of coding sequence in one sample. These mutations either affect NADP binding or disrupt protein structure. From the study it appears that Ala44Gly and Gly163Ser are the most common G6PD mutations in Dhaka, Bangladesh. This is the first study of G6PD mutations in Bangladesh. PMID:27880809
El-Magd, Mohammed Abu; Abo-Al-Ela, Haitham G; El-Nahas, Abeer; Saleh, Ayman A; Mansour, Ali A
2014-05-01
Insulin-like growth factor 2 receptor (IGF2R) is responsible for degradation of the muscle development initiator, IGF2, and thus it can be used as a marker for selection strategies in the farm animals. The aim of this study was to search for polymorphisms in three coding loci of IGF2R, and to analyze their effect on the growth traits and on the expression levels of IGF2R and IGF2 genes in the gluteus medius muscle of Egyptian buffaloes. A novel A266C SNP was detected in the coding sequences of the third IGF2R locus (at nucleotide number 51 of exon 23) among Egyptian water buffaloes. This SNP was non-synonymous mutation and led to replacement of Y (tyrosine) amino acid (aa) by D (aspartic acid) aa. Three different single-strand conformation polymorphism patterns were observed in the third IGF2R locus: AA, AC, and CC with frequencies of 0.555, 0.195, and 0.250, respectively. Statistical analysis showed that the homozygous AA genotype significantly associated with the average daily gain than AC and CC genotypes from birth to 9 mo of age. Expression analysis showed that the A266C SNP was correlated with IGF2, but not with IGF2R, mRNA levels in the gluteus medius muscle of Egyptian buffaloes. The highest IGF2 mRNA level was estimated in the muscle of animals with the AA homozygous genotype as compared to the AC heterozygotes and CC homozygotes. We conclude that A266C SNP at nucleotide number 51 of exon 23 of the IGF2R gene is associated with the ADG during the early stages of life (from birth to 9 mo of age) and this effect is accompanied by, and may be caused by, increased expression levels of the IGF2 gene. Copyright © 2014 Elsevier B.V. All rights reserved.
Ren, Hong-tao; Zhang, Guang-qin; Li, Jian-lin; Tang, Yong-kai; Li, Hong-xia; Yu, Ju-hua; Xu, Pao
2013-08-01
Δ6-Desaturase is the rate-limiting enzyme involved in highly unsaturated fatty acid (HUFA) biosynthesis. There is very little information on the evolution and functional characterization of Δ6Fad-a and Δ6Fad-b in common carp (Cyprinus carpio var. Jian). In the present study, the genomic sequences and structures of two putative Δ6-desaturase-like genes in common carp genome were obtained. We investigated the mRNA expression patterns of Δ6Fad-a and Δ6Fad-b in tissue, hatching carp embryos, larvae by temperature shock and juveniles under nutritional regulation. Our results showed that the two Δ6Fad genes had identical coding exon structures, being comprised of 12 coding exons, and with introns of distinct size and sequence composition. They were not allelic variants of a single gene. Both Δ6Fad genes were highly expressed in liver, intestine (pyloric caeca) and brain. The Δ6Fad-a and Δ6Fad-b mRNAs showed an increase in expression from newly hatched to 25 days after hatching. The expression levels of Δ6Fad-a were obviously regulated by temperature, whereas Δ6Fad-b was not affected by temperature. The regulation of Δ6Fad-a and Δ6Fad-b in response to dietary fatty acid composition was determined in liver, brain and intestine (pyloric caeca) of common carp fed with diets: diet1with fish oil (FO) rich in n-3 HUFA, diet2 with corn oil (CO, 18:2n-6) and diet3 with linseed oil (LO, 18:3n-3). The differential expression of Δ6Fad-a and Δ6Fad-b genes in liver, brain and intestine in common carps was fed with different oil sources, respectively. Further work is in progress to determine the mechanism of differential expression of the Δ6Fad-a and Δ6Fad-b genes in different tissues and the roles of transcription factors in regulating HUFA synthesis. Copyright © 2013 Elsevier B.V. All rights reserved.
The developmental transcriptome of Drosophila melanogaster
DOE Office of Scientific and Technical Information (OSTI.GOV)
University of Connecticut; Graveley, Brenton R.; Brooks, Angela N.
Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, predictionmore » and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development. Drosophila melanogaster is an important non-mammalian model system that has had a critical role in basic biological discoveries, such as identifying chromosomes as the carriers of genetic information and uncovering the role of genes in development. Because it shares a substantial genic content with humans, Drosophila is increasingly used as a translational model for human development, homeostasis and disease. High-quality maps are needed for all functional genomic elements. Previous studies demonstrated that a rich collection of genes is deployed during the life cycle of the fly. Although expression profiling using microarrays has revealed the expression of, 13,000 annotated genes, it is difficult to map splice junctions and individual base modifications generated by RNA editing using such approaches. Single-base resolution is essential to define precisely the elements that comprise the Drosophila transcriptome. Estimates of the number of transcript isoforms are less accurate than estimates of the number of genes. Whereas, 20% of Drosophila genes are annotated as encoding alternatively spliced premRNAs, splice-junction microarray experiments indicate that this number is at least 40% (ref. 7). Determining the diversity of mRNAs generated by alternative promoters, alternative splicing and RNA editing will substantially increase the inferred protein repertoire. Non-coding RNA genes (ncRNAs) including short interfering RNAs (siRNAs) and microRNAS (miRNAs) (reviewed in ref. 10), and longer ncRNAs such as bxd (ref. 11) and rox (ref. 12), have important roles in gene regulation, whereas others such as small nucleolar RNAs (snoRNAs)and small nuclear RNAs (snRNAs) are important components of macromolecular machines such as the ribosome and spliceosome. The transcription and processing of these ncRNAs must also be fully documented and mapped. As part of the modENCODE project to annotate the functional elements of the D. melanogaster and Caenorhabditis elegans genomes, we used RNA-Seq and tiling microarrays to sample the Drosophila transcriptome at unprecedented depth throughout development from early embryo to ageing male and female adults. We report on a high-resolution view of the discovery, structure and dynamic expression of the D. melanogaster transcriptome.« less
Zhang, Yang; Zhu, Zhen; Xu, Qi; Chen, Guohong
2014-01-07
Primers based on the cDNA sequence of the goose growth hormone (GH) gene in GenBank were designed to amplify exon 2 of the GH gene in Huoyan goose. A total of 552 individuals were brooded in one batch and raised in Liaoning and Jiangsu Provinces, China. Single nucleotide polymorphisms (SNPs) of exon 2 in the GH gene were detected by the polymerase chain reaction (single strand conformation polymorphism method). Homozygotes were subsequently cloned, sequenced and analyzed. Two SNP mutations were detected, and 10 genotypes (referred to as AA, BB, CC, DD, AB, AC, AD, BC, BD and CD) were obtained. Allele D was predominant, and the frequencies of the 10 genotypes fit the Hardy-Weinberg equilibrium in the male, female and whole populations according to the chi-square test. Based on SNP types, the 10 genotypes were combined into three main genotypes. Multiple comparisons were carried out between different genotypes and production traits when the geese were 10 weeks old. Some indices of production performance were significantly (p < 0.05) associated with the genotype. Particularly, geese with genotype AB or BB were highly productive. Thus, these genotypes may serve as selection markers for production traits in Huoyan geese.
Abo-Al-Ela, Haitham G; El-Magd, Mohammed Abu; El-Nahas, Abeer F; Mansour, Ali A
2014-08-01
Insulin-like growth factor 2 (IGF2) plays an important role in muscle growth and it might be used as a marker for the growth traits selection strategies in farm animals. The objectives of this study were to detect polymorphisms in exon 10 of IGF2 and to determine associations between these polymorphisms and growth traits in Egyptian water buffalo. PCR-single-strand conformation polymorphism (SSCP) and DNA sequencing methods were used to detect any prospective polymorphism. A novel single nucleotide polymorphism (SNP), C287A, was detected. It was a non-synonymous mutation and led to replacement of glutamine (Q) amino acid (aa) by histidine (H) aa. Three different SSCP patterns were observed: AA, AC, and CC, with frequencies of 0.540, 0.325, and 0.135, respectively. Association analyses revealed that the AA individuals had a higher average daily gain (ADG) than other individuals (CC and AC) from birth to 9 months of age. We conclude that the AA genotype in C287A SNP in the exon 10 of the IGF2 gene is associated with the ADG during the age from birth to 9 months and could be used as a potential genetic marker for selection of growth traits in Egyptian buffalo.
Translational and regulatory challenges for exon skipping therapies.
Aartsma-Rus, Annemieke; Ferlini, Alessandra; Goemans, Nathalie; Pasmooij, Anna M G; Wells, Dominic J; Bushby, Katerine; Vroom, Elizabeth; Balabanov, Pavel
2014-10-01
Several translational challenges are currently impeding the therapeutic development of antisense-mediated exon skipping approaches for rare diseases. Some of these are inherent to developing therapies for rare diseases, such as small patient numbers and limited information on natural history and interpretation of appropriate clinical outcome measures. Others are inherent to the antisense oligonucleotide (AON)-mediated exon skipping approach, which employs small modified DNA or RNA molecules to manipulate the splicing process. This is a new approach and only limited information is available on long-term safety and toxicity for most AON chemistries. Furthermore, AONs often act in a mutation-specific manner, in which case multiple AONs have to be developed for a single disease. A workshop focusing on preclinical development, trial design, outcome measures, and different forms of marketing authorization was organized by the regulatory models and biochemical outcome measures working groups of Cooperation of Science and Technology Action: "Networking towards clinical application of antisense-mediated exon skipping for rare diseases." The workshop included participants from patient organizations, academia, and members of staff from the European Medicine Agency and Medicine Evaluation Board (the Netherlands). This statement article contains the key outcomes of this meeting.
Short, Stephen; Peterkin, Tessa; Guille, Matthew; Patient, Roger; Sharpe, Colin
2015-01-01
Vertebrate NCoR-family co-repressors play central roles in the timing of embryo and stem cell differentiation by repressing the activity of a range of transcription factors. They interact with nuclear receptors using short linear motifs (SLiMs) termed co-repressor for nuclear receptor (CoRNR) boxes. Here, we identify the pathway leading to increasing co-repressor diversity across the deuterostomes. The final complement of CoRNR boxes arose in an ancestral cephalochordate, and was encoded in one large exon; the urochordates and vertebrates then split this region between 10 and 12 exons. In Xenopus, alternative splicing is prevalent in NCoR2, but absent in NCoR1. We show for one NCoR1 exon that alternative splicing can be recovered by a single point mutation, suggesting NCoR1 lost the capacity for alternative splicing. Analyses in Xenopus and zebrafish identify that cellular context, rather than gene sequence, predominantly determines species differences in alternative splicing. We identify a pathway to diversity for the NCoR family beginning with the addition of a SLiM, followed by gene duplication, the generation of alternatively spliced isoforms and their differential deployment. PMID:26289800
Bogdanowicz, Brian S; Hoch, Matthew A; Hartranft, Megan E
2017-04-01
Purpose The approval history, pharmacology, pharmacokinetics, clinical trials, efficacy, dosing recommendations, drug interactions, safety, place in therapy, and economic considerations of gefitinib are reviewed. Summary Lung cancer is one of the most commonly diagnosed cancers and is the leading cause of cancer death. Platinum-based chemotherapy and tyrosine kinase inhibitors, such as erlotinib and afatinib, are recommended therapies for nonsmall cell lung cancer. The European Medicines Association based their approval of gefitinib on the randomized, multicenter Iressa Pan-Asia Study (IPASS, NCT00322452) and a single-arm study showing effectiveness in Caucasians (IFUM, NCT01203917). Both studies were recently referenced by the United States Food & Drug Administration to reapprove gefitinib for the first-line treatment of advanced nonsmall cell lung cancer with epidermal growth factor receptor exon 19 deletions or exon 21 substitution. Diarrhea, acneiform rash, and interstitial lung disease are known side effects of gefitinib. Conclusion Use of gefitinib for the first-line therapy of metastatic nonsmall cell lung cancer with epidermal growth factor receptor exon 19 deletions (residues 747-750) or exon 21 substitution mutation (L858R) is well-documented and supported.
Typing of artiodactyl MHC-DRB genes with the help of intronic simple repeated DNA sequences.
Schwaiger, F W; Buitkamp, J; Weyers, E; Epplen, J T
1993-02-01
An efficient oligonucleotide typing method for the highly polymorphic MHC-DRB genes is described for artiodactyls like cattle, sheep and goat. By means of the polymerase chain reaction, the second exon of MHC-DRB is amplified as well as part of the adjacent intron containing a mixed simple repeat sequence. Using this primer combination we were able to amplify the MHC-DRB exons 2 and adjacent introns from all of the investigated 10 species of the family of Bovidae and giraffes. Therefore, the DRB genes of novel artiodactyl species can also be readily studied. Oligonucleotide probes specific for the polymorphisms of ungulate DRB genes are used with which sequences differing in at least one single base can be distinguished. Exonic polymorphism was found to be correlated with the allele lengths and the patterns of the repeat structures. Hence oligonucleotide probes specific for different simple repeats and polymorphic positions serve also for typing across species barriers. The strict correlation of sequence length and exonic polymorphism permits a preselection of specific oligonucleotides for hybridization. Thus more than 20 alleles can already be differentiated from each of the three species.
Nakao, Minoru; Lavikainen, Antti; Iwaki, Takashi; Haukisalmi, Voitto; Konyaev, Sergey; Oku, Yuzaburo; Okamoto, Munehiro; Ito, Akira
2013-05-01
The cestode family Taeniidae generally consists of two valid genera, Taenia and Echinococcus. The genus Echinococcus is monophyletic due to a remarkable similarity in morphology, features of development and genetic makeup. By contrast, Taenia is a highly diverse group formerly made up of different genera. Recent molecular phylogenetic analyses strongly suggest the paraphyly of Taenia. To clarify the genetic relationships among the representative members of Taenia, molecular phylogenies were constructed using nuclear and mitochondrial genes. The nuclear phylogenetic trees of 18S ribosomal DNA and concatenated exon regions of protein-coding genes (phosphoenolpyruvate carboxykinase and DNA polymerase delta) demonstrated that both Taenia mustelae and a clade formed by Taenia parva, Taenia krepkogorski and Taenia taeniaeformis are only distantly related to the other members of Taenia. Similar topologies were recovered in mitochondrial genomic analyses using 12 complete protein-coding genes. A sister relationship between T. mustelae and Echinococcus spp. was supported, especially in protein-coding gene trees inferred from both nuclear and mitochondrial data sets. Based on these results, we propose the resurrection of Hydatigera Lamarck, 1816 for T. parva, T. krepkogorski and T. taeniaeformis and the creation of a new genus, Versteria, for T. mustelae. Due to obvious morphological and ecological similarities, Taenia brachyacantha is also included in Versteria gen. nov., although molecular evidence is not available. Taenia taeniaeformis has been historically regarded as a single species but the present data clearly demonstrate that it consists of two cryptic species. Copyright © 2013 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Freytsis, Marina; Wang, Xueding; Peter, Inga; Guillemette, Chantal; Hazarika, Suwagmani; Duan, Su X.; Greenblatt, David J.; Lee, William M.
2013-01-01
Acetaminophen is cleared primarily by hepatic glucuronidation. Polymorphisms in genes encoding the acetaminophen UDP-glucuronosyltransferase (UGT) enzymes could explain interindividual variability in acetaminophen glucuronidation and variable risk for liver injury after acetaminophen overdose. In this study, human liver bank samples were phenotyped for acetaminophen glucuronidation activity and genotyped for the major acetaminophen-glucuronidating enzymes (UGTs 1A1, 1A6, 1A9, and 2B15). Of these, only three linked single nucleotide polymorphisms (SNPs) located in the shared UGT1A-3′UTR region (rs10929303, rs1042640, rs8330) were associated with acetaminophen glucuronidation activity, with rs8330 consistently showing higher acetaminophen glucuronidation at all the tested concentrations of acetaminophen. Mechanistic studies using luciferase-UGT1A-3′UTR reporters indicated that these SNPs do not alter mRNA stability or translation efficiency. However, there was evidence for allelic imbalance and a gene-dose proportional increase in the amount of exon 5a versus exon 5b containing UGT1A mRNA spliced transcripts in livers with the rs8330 variant allele. Cotransfection studies demonstrated an inhibitory effect of exon 5b containing cDNAs on acetaminophen glucuronidation by UGT1A1 and UGT1A6 cDNAs containing exon 5a. In silico analysis predicted that rs8330 creates an exon splice enhancer site that could favor exon 5a (over exon 5b) utilization during splicing. Finally, the prevalence of rs8330 was significantly lower (P = 0.027, χ2 test) in patients who had acute liver failure from unintentional acetaminophen overdose compared with patients with acute liver failure from other causes or a race- or ethnicity-matched population. Together, these findings suggest that rs8330 is an important determinant of acetaminophen glucuronidation and could affect an individual’s risk for acetaminophen-induced liver injury. PMID:23408116
Diniz, Erik Trovão; Jorge, Alexander A L; Arnhold, Ivo J P; Rosenbloom, Arlan L; Bandeira, Francisco
2008-11-01
To date, about sixty different mutations within GH receptor (GHR) gene have been described in patients with GH insensitivity syndrome (GHI). In this report, we described a novel nonsense mutation of GHR. The patient was evaluated at the age of 6 yr, for short stature associated to clinical phenotype of GHI. GH, IGF-1, and GHBP levels were determined. The PCR products from exons 2-10 were sequenced. The patient had high GH (26 microg/L), low IGF-1 (22.5 ng/ml) and undetectable GHBP levels. The sequencing of GHR exon 5 disclosed adenine duplication at nucleotide 338 of GHR coding sequence (c.338dupA) in homozygous state. We described a novel mutation that causes a truncated GHR and a loss of receptor function due to the lack of amino acids comprising the transmembrane and intracellular regions of GHR protein, leading to GHI.
Circular RNAs and hereditary bone diseases.
Zhai, Naixiang; Lu, Yanqin; Wang, Yanzhou; Ren, Xiuzhi; Han, Jinxiang
2018-02-01
Circular RNA (circRNA) is a non-linear form of RNA derived from exonic, intronic, and exon-intron gene regions. circRNAs are characterized by covalent closed loops, highly stable nuclease resistance, and specific expression in species and developmental stages. CircRNA molecules have been identified as playing roles in the regulation of cell transcription, transcriptional expression after translation, interactions with microRNAs, and protein coding. A high stability and tissue- and disease-specific expression allow circRNAs to serve as potential biomarkers both for diseases and prognosis. CircRNAs function in bone remodeling by directly participating in bone-related signaling pathways and by forming the circRNA-miRNA-mRNA axis. Studies have seldom reported on the low incidence of circRNAs in genetic bone disorders. The current study reviews the characteristics of circRNAs and recent research on their role in rare hereditary bone diseases.
Permanent Neonatal Diabetes Caused by Creation of an Ectopic Splice Site within the INS Gene
Gastaldo, Elena; Harries, Lorna W.; Rubio-Cabezas, Oscar; Castaño, Luis
2012-01-01
Background The aim of this study was to characterize the genetic etiology in a patient who presented with permanent neonatal diabetes at 2 months of age. Methodology/Principal Findings Regulatory elements and coding exons 2 and 3 of the INS gene were amplified and sequenced from genomic and complementary DNA samples. A novel heterozygous INS mutation within the terminal intron of the gene was identified in the proband and her affected father. This mutation introduces an ectopic splice site leading to the insertion of 29 nucleotides from the intronic sequence into the mature mRNA, which results in a longer and abnormal transcript. Conclusions/Significance This study highlights the importance of routinely sequencing the exon-intron boundaries and the need to carry out additional studies to confirm the pathogenicity of any identified intronic genetic variants. PMID:22235272
Chen, Yong; Yang, Fuwei; Zheng, Hexin; Zhu, Ganghua; Hu, Peng; Wu, Weijing
2015-12-01
To explore the molecular etiology of two pedigrees affected with type II Waardenburg syndrome (WS2) and to provide genetic diagnosis and counseling. Blood samples were collected from the proband and his family members. Following extraction of genomic DNA, the coding sequences of PAX3, MITF, SOX10 and SNAI2 genes were amplified with PCR and subjected to DNA sequencing to detect potential mutations. A heterozygous deletional mutation c.649_651delAGA in exon 7 of the MITF gene has been identified in all patients from the first family, while no mutation was found in the other WS2 related genes including PAX3, MITF, SOX10 and SNAI2. The heterozygous deletion mutation c.649_651delAGA in exon 7 of the MITF gene probably underlies the disease in the first family. It is expected that other genes may also underlie WS2.
Zouheir Habbal, Mohammad; Bou-Assi, Tarek; Zhu, Jun; Owen, Renius; Chehab, Farid F
2014-01-01
Alkaptonuria is often diagnosed clinically with episodes of dark urine, biochemically by the accumulation of peripheral homogentisic acid and molecularly by the presence of mutations in the homogentisate 1,2-dioxygenase gene (HGD). Alkaptonuria is invariably associated with HGD mutations, which consist of single nucleotide variants and small insertions/deletions. Surprisingly, the presence of deletions beyond a few nucleotides among over 150 reported deleterious mutations has not been described, raising the suspicion that this gene might be protected against the detrimental mechanisms of gene rearrangements. The quest for an HGD mutation in a proband with AKU revealed with a SNP array five large regions of homozygosity (5-16 Mb), one of which includes the HGD gene. A homozygous deletion of 649 bp deletion that encompasses the 72 nucleotides of exon 2 and surrounding DNA sequences in flanking introns of the HGD gene was unveiled in a proband with AKU. The nature of this deletion suggests that this in-frame deletion could generate a protein without exon 2. Thus, we modeled the tertiary structure of the mutant protein structure to determine the effect of exon 2 deletion. While the two β-pleated sheets encoded by exon 2 were missing in the mutant structure, other β-pleated sheets are largely unaffected by the deletion. However, nine novel α-helical coils substituted the eight coils present in the native HGD crystal structure. Thus, this deletion results in a deleterious enzyme, which is consistent with the proband's phenotype. Screening for mutations in the HGD gene, particularly in the Middle East, ought to include this exon 2 deletion in order to determine its frequency and uncover its origin.
Habbal, Mohammad Zouheir; Bou-Assi, Tarek; Zhu, Jun; Owen, Renius; Chehab, Farid F.
2014-01-01
Alkaptonuria is often diagnosed clinically with episodes of dark urine, biochemically by the accumulation of peripheral homogentisic acid and molecularly by the presence of mutations in the homogentisate 1,2-dioxygenase gene (HGD). Alkaptonuria is invariably associated with HGD mutations, which consist of single nucleotide variants and small insertions/deletions. Surprisingly, the presence of deletions beyond a few nucleotides among over 150 reported deleterious mutations has not been described, raising the suspicion that this gene might be protected against the detrimental mechanisms of gene rearrangements. The quest for an HGD mutation in a proband with AKU revealed with a SNP array five large regions of homozygosity (5–16 Mb), one of which includes the HGD gene. A homozygous deletion of 649 bp deletion that encompasses the 72 nucleotides of exon 2 and surrounding DNA sequences in flanking introns of the HGD gene was unveiled in a proband with AKU. The nature of this deletion suggests that this in-frame deletion could generate a protein without exon 2. Thus, we modeled the tertiary structure of the mutant protein structure to determine the effect of exon 2 deletion. While the two β-pleated sheets encoded by exon 2 were missing in the mutant structure, other β-pleated sheets are largely unaffected by the deletion. However, nine novel α-helical coils substituted the eight coils present in the native HGD crystal structure. Thus, this deletion results in a deleterious enzyme, which is consistent with the proband’s phenotype. Screening for mutations in the HGD gene, particularly in the Middle East, ought to include this exon 2 deletion in order to determine its frequency and uncover its origin. PMID:25233259
DOE Office of Scientific and Technical Information (OSTI.GOV)
Klebig, M.L.; Woychik, R.P.; Wilkinson, J.E.
1994-09-01
The lethal yellow (A{sup y/-}) and viable yellow (A{sup vy/-}) mouse agouti mutants have a predominantly yellow pelage and display a complex syndrome that includes obesity, hyperinsulinemia, and insulin resistance, hallmark features of obesity-associated noninsulin-dependent diabetes mellitus (NIDDM) in humans. A new dominant agouti allele, A{sup iapy}, has recently been identified; like the A{sup vy} allele, it is homozygous viable and confers obesity and yellow fur in heterozygotes. The agouti gene was cloned and characterized at the molecular level. The gene is expressed in the skin during hair growth and is predicted to encode a 131 amino acid protein, thatmore » is likely to be a secreted factor. In both Ay/- and A{sup iapy}/- mice, the obesity and other dominant pleiotropic effects are associated with an ectopic expression of agouti in many tissues where the gene product is normally not produced. In Ay, a 170-kb deletion has occurred that causes an upstream promoter to drive the ectopic expression of the wild-type agouti coding exons. In A{sup iapy}, the coding region of the gene is expressed from a cryptic promoter within the LTR of an intracisternal A-particle (IAP), which has integrated within the region just upstream of the first agouti coding exon. Transgenic mice ubiquitously expressing the cloned agouti gene under the influence of the beta-actin and phosphoglycerate kinase promoters display obesity, hyperinsulinemia, and yellow coat color. This demonstrates unequivocally that ectopic expression of agouti is responsible for the yellow obese syndrome.« less
Mulder, Kevin P.; Cortazar-Chinarro, Maria; Harris, D. James; Crottini, Angelica; Grant, Evan H. Campbell; Fleischer, Robert C.; Savage, Anna E.
2017-01-01
The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa.
Mulder, Kevin P; Cortazar-Chinarro, Maria; Harris, D James; Crottini, Angelica; Campbell Grant, Evan H; Fleischer, Robert C; Savage, Anna E
2017-11-01
The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa. Copyright © 2017 Elsevier Ltd. All rights reserved.
Pastor, André F; Rodrigues Moura, Laís; Neto, José W D; Nascimento, Eduardo J M; Calzavara-Silva, Carlos E; Gomes, Ana Lisa V; Silva, Ana Maria da; Cordeiro, Marli T; Braga-Neto, Ulisses; Crovella, Sergio; Gil, Laura H V G; Marques, Ernesto T A; Acioli-Santos, Bartolomeu
2013-09-01
Four genetic polymorphisms located at the promoter (C-257T) and coding regions of CFH gene (exon 2 G257A, exon 14 A2089G and exon 19 G2881T) were investigated in 121 dengue patients (DENV-3) in order to assess the relationship between allele/haplotypes variants and clinical outcomes. A statistical value was found between the CFH-257T allele (TT/TC genotypes) and reduced susceptibility to severe dengue (SD). Statistical associations indicate that individuals bearing a T allele presented significantly higher protein levels in plasma. The -257T variant is located within a NF-κB binding site, suggesting that this variant might have effect on the ability of the CFH gene to respond to signals via the NF-κB pathway. The G257A allelic variant showed significant protection against severe dengue. When CFH haplotypes effect was considered, the ancestral CG/CG promoter-exon 2 SNP genotype showed significant risk to SD either in a general comparison (ancestral × all variant genotypes), as well as in individual genotypes comparison (ancestral × each variant genotype), where the most prevalent effect was observed in the CG/CG × CA/TG comparison. These findings support the involvement of -257T, 257A allele variants and haplotypes on severe dengue phenotype protection, related with high basal CFH expression. Copyright © 2013 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Callén, E; Tischkowitz, M D; Creus, A; Marcos, R; Bueren, J A; Casado, J A; Mathew, C G; Surrallés, J
2004-01-01
Fanconi anaemia is an autosomal recessive disease characterized by chromosome fragility, multiple congenital abnormalities, progressive bone marrow failure and a high predisposition to develop malignancies. Most of the Fanconi anaemia patients belong to complementation group FA-A due to mutations in the FANCA gene. This gene contains 43 exons along a 4.3-kb coding sequence with a very heterogeneous mutational spectrum that makes the mutation screening of FANCA a difficult task. In addition, as the FANCA gene is rich in Alu sequences, it was reported that Alu-mediated recombination led to large intragenic deletions that cannot be detected in heterozygous state by conventional PCR, SSCP analysis, or DNA sequencing. To overcome this problem, a method based on quantitative fluorescent multiplex PCR was proposed to detect intragenic deletions in FANCA involving the most frequently deleted exons (exons 5, 11, 17, 21 and 31). Here we apply the proposed method to detect intragenic deletions in 25 Spanish FA-A patients previously assigned to complementation group FA-A by FANCA cDNA retroviral transduction. A total of eight heterozygous deletions involving from one to more than 26 exons were detected. Thus, one third of the patients carried a large intragenic deletion that would have not been detected by conventional methods. These results are in agreement with previously published data and indicate that large intragenic deletions are one of the most frequent mutations leading to Fanconi anaemia. Consequently, this technology should be applied in future studies on FANCA to improve the mutation detection rate. Copyright 2003 S. Karger AG, Basel
Glucocorticoid receptor represses brain-derived neurotrophic factor expression in neuron-like cells.
Chen, Hui; Lombès, Marc; Le Menuet, Damien
2017-04-12
Brain-derived neurotrophic factor (BDNF) is involved in many functions such as neuronal growth, survival, synaptic plasticity and memorization. Altered expression levels are associated with many pathological situations such as depression, epilepsy, Alzheimer's, Huntington's and Parkinson's diseases. Glucocorticoid receptor (GR) is also crucial for neuron functions, via binding of glucocorticoid hormones (GCs). GR actions largely overlap those of BDNF. It has been proposed that GR could be a regulator of BDNF expression, however the molecular mechanisms involved have not been clearly defined yet. Herein, we analyzed the effect of a GC agonist dexamethasone (DEX) on BDNF expression in mouse neuronal primary cultures and in the newly characterized, mouse hippocampal BZ cell line established by targeted oncogenesis. Mouse Bdnf gene exhibits a complex genomic structure with 8 untranslated exons (I to VIII) splicing onto one common and unique coding exon IX. We found that DEX significantly downregulated total BDNF mRNA expression by around 30%. Expression of the highly expressed exon IV and VI containing transcripts was also reduced by DEX. The GR antagonist RU486 abolished this effect, which is consistent with specific GR-mediated action. Transient transfection assays allowed us to define a short 275 bp region within exon IV promoter responsible for GR-mediated Bdnf repression. Chromatin immunoprecipitation experiments demonstrated GR recruitment onto this fragment, through unidentified transcription factor tethering. Altogether, GR downregulates Bdnf expression through direct binding to Bdnf regulatory sequences. These findings bring new insights into the crosstalk between GR and BDNF signaling pathways both playing a major role in physiology and pathology of the central nervous system.
A splice variant in the ACSL5 gene relates migraine with fatty acid activation in mitochondria
Matesanz, Fuencisla; Fedetz, María; Barrionuevo, Cristina; Karaky, Mohamad; Catalá-Rabasa, Antonio; Potenciano, Victor; Bello-Morales, Raquel; López-Guerrero, Jose-Antonio; Alcina, Antonio
2016-01-01
Genome-wide association studies (GWAS) in migraine are providing the molecular basis of this heterogeneous disease, but the understanding of its aetiology is still incomplete. Although some biomarkers have currently been accepted for migraine, large amount of studies for identifying new ones is needed. The migraine-associated variant rs12355831:A>G (P=2 × 10−6), described in a GWAS of the International Headache Genetic Consortium, is localized in a non-coding sequence with unknown function. We sought to identify the causal variant and the genetic mechanism involved in the migraine risk. To this end, we integrated data of RNA sequences from the Genetic European Variation in Health and Disease (GEUVADIS) and genotypes from 1000 GENOMES of 344 lymphoblastoid cell lines (LCLs), to determine the expression quantitative trait loci (eQTLs) in the region. We found that the migraine-associated variant belongs to a linkage disequilibrium block associated with the expression of an acyl-coenzyme A synthetase 5 (ACSL5) transcript lacking exon 20 (ACSL5-Δ20). We showed by exon-skipping assay a direct causality of rs2256368-G in the exon 20 skipping of approximately 20 to 40% of ACSL5 RNA molecules. In conclusion, we identified the functional variant (rs2256368:A>G) affecting ACSL5 exon 20 skipping, as a causal factor linked to the migraine-associated rs12355831:A>G, suggesting that the activation of long-chain fatty acids by the spliced ACSL5-Δ20 molecules, a mitochondrial located enzyme, is involved in migraine pathology. PMID:27189022
Wang, Xinsheng; Zhao, Xiangzhong; Wang, Xiaoling; Yao, Jian; Zhang, Feifei; Lang, Yanhua; Tuffery-Giraud, Sylvie; Bottillo, Irene; Shao, Leping
2015-01-01
Twenty-six HOGA1 mutations have been reported in primary hyperoxaluria (PH) type 3 (PH3) patients with c.700 + 5G>T accounting for about 50% of the total alleles. However, PH3 has never been described in Asians. A Chinese child with early-onset nephrolithiasis was suspected of having PH. We searched for AGXT, GRHPR and HOGA1 gene mutations in this patient and his parents. All coding regions, including intron-exon boundaries, were analyzed using PCR followed by direct sequence analysis. Two heterozygous mutations not previously described in the literature about HOGA1 were identified (compound heterozygous). One mutation was a successive 2 bp substitution at the last nucleotide of exon 6 and at the first nucleotide of intron 6, respectively (c.834_834 + 1GG>TT), while the other one was a guanine to adenine substitution of the last nucleotide of exon 6 (c.834G>A). Direct sequencing analysis failed to find these mutations in 100 unrelated healthy subjects and the functional role on splicing of both variants found in this study was confirmed by a minigene assay based on the pSPL3 exon trapping vector. In addition, we found a SNP in this family (c.715G>A, p.V239I). There were no mutations detected in AGXT and GRHPR. Two novel HOGA1 mutations were identified in association with PH3. This is the first description and investigation on mutant gene analysis of PH3 in an Asian. © 2015 S. Karger AG, Basel
Sarkar, Debina; Oghabian, Ali; Bodiyabadu, Pasani K; Joseph, Wayne R; Leung, Euphemia Y; Finlay, Graeme J; Baguley, Bruce C; Askarian-Amiri, Marjan E
2017-06-27
The long non-coding RNA ANRIL , antisense to the CDKN2B locus, is transcribed from a gene that encompasses multiple disease-associated polymorphisms. Despite the identification of multiple isoforms of ANRIL , expression of certain transcripts has been found to be tissue-specific and the characterisation of ANRIL transcripts remains incomplete. Several functions have been associated with ANRIL . In our judgement, studies on ANRIL functionality are premature pending a more complete appreciation of the profusion of isoforms. We found differential expression of ANRIL exons, which indicates that multiple isoforms exist in melanoma cells. In addition to linear isoforms, we identified circular forms of ANRIL ( circANRIL ). Further characterisation of circANR IL in two patient-derived metastatic melanoma cell lines (NZM7 and NZM37) revealed the existence of a rich assortment of circular isoforms. Moreover, in the two melanoma cell lines investigated, the complements of circANRIL isoforms were almost completely different. Novel exons were also discovered. We also found the family of linear ANRIL was enriched in the nucleus, whilst the circular isoforms were enriched in the cytoplasm and they differed markedly in stability. With respect to the variable processing of circANRIL species, bioinformatic analysis indicated that intronic Arthrobacter luteus (Alu) restriction endonuclease inverted repeats and exon skipping were not involved in selection of back-spliced exon junctions. Based on our findings, we hypothesise that " ANRIL " has wholly distinct dual sets of functions in melanoma. This reveals the dynamic nature of the locus and constitutes a basis for investigating the functions of ANRIL in melanoma.
PANAGOPOULOS, IOANNIS; GORUNOVA, LUDMILA; BJERKEHAGEN, BODIL; LOBMAIER, INGVILD; HEIM, SVERRE
2015-01-01
Lipomas are the most common soft tissue tumors in adults. They often carry chromosome aberrations involving 12q13~15 leading to rearrangements of the HMGA2 gene in 12q14.3, with breakpoints occurring within or outside of the gene. Here, we present eleven lipomas and one osteochondrolipoma with a novel recurrent chromosome aberration, t(12;18) (q14~15;q12~21). Molecular studies on eight of the tumors showed that full-length HMGA2 transcript was expressed in three and a chimeric HMGA2 transcript in five of them. In three lipomas and in the osteochondrolipoma, exons 1–3 of HMGA2 were fused to a sequence of SETBP1 on 18q12.3 or an intragenic sequence from 18q12.3 circa 10 kbp distal to SETBP1. In another lipoma, exons 1–4 of HMGA2 were fused to an intronic sequence of GRIP1 which maps to chromosome band 12q14.3, distal to HMGA2. The ensuing HMGA2 fusion transcripts code for putative proteins which contain amino acid residues of HMGA2 corresponding to exons 1–3 (or exons 1–4 in one case) followed by amino acid residues corresponding to the fused sequences. Thus, the pattern is similar to the rearrangements of HMGA2 found in other lipomas, i.e., disruption of the HMGA2 locus leaves intact exons 1–3 which encode the AT-hooks domains and separates them from the 3′-terminal part of the gene. The fact that the examined osteochondrolipoma had a t(12;18) and a HMGA2-SETBP1 fusion identical to the findings in the much more common ordinary lipomas, underscores the close developmental relationship between the two tumor types. PMID:26202160
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tomatsu, Shunji; Fukuda, Seiji; Yamagishi, Atsushi
1996-05-01
We report four new mutations in Japanese patients with mucopolysaccharidosis IVA (MPSIVA) who were heterozygous for a common double gene deletion. A nonsense mutation of CAG to TAG at codon 148 in exon 4 was identified, resulting in a change of Q to a stop codon and three missense mutations: V (GTC) to A (GCC) at codon 138 in exon 4, P (CCC) to S (TCC) at codon 151 in exon 5, and P (CCC) to L (CTC) at codon 151 in exon 5. Introduction of these mutations into the normal GALNS cDNA and transient expression in cultured fibroblasts resultedmore » in a significant decrease in the enzyme activity. V138A and Q148X mutations result in changes of restriction site, which were analyzed by restriction-enzyme assay. P151S and P151L mutations that did not alter the restriction site were detected by direct sequencing or allele specific oligohybridization. Detection of the double gene deletion was initially done using Southern blots and was confirmed by PCR. Haplotypes were determined using seven polymorphisms to the GALNS locus in families with the double gene deletion. Haplotype analysis showed that the common double gene deletion occurred on a single haplotype, except for some variation in a VNTR-like polymorphism. This finding is consistent with a common founder for all individuals with this mutation. 48 refs., 5 figs., 1 tab.« less
Boyd, Elaine M; Bench, Anthony J; Goday-Fernández, Andrea; Anand, Shubha; Vaghela, Krishna J; Beer, Phillip; Scott, Mike A; Bareford, David; Green, Anthony R; Huntly, Brian; Erber, Wendy N
2010-04-01
Approximately 50% of essential thrombocythaemia and primary myelo-fibrosis patients do not have a JAK2 V617F mutation. Up to 5% of these are reported to have a MPL exon 10 mutation but testing for MPL is not routine as there are multiple mutation types. The ability to routinely assess both JAK2 and MPL mutations would be beneficial in the differential diagnosis of unexplained thrombocytosis or myelofibrosis. We developed and applied a high resolution melt (HRM) assay, capable of detecting all known MPL mutations in a single analysis, for the detection of MPL exon 10 mutations. We assessed 175 ET and PMF patients, including 67 that were JAK2 V617F-negative by real time polymerase chain reaction (PCR). Overall, 19/175 (11%) patients had a MPL exon 10 mutation, of whom 16 were JAK2 V617F-negative (16/67; 24%). MPL mutation types were W515L (11), W515K (4), W515R (2) and W515A (1). One patient had both W515L and S505N MPL mutations and these were present in the same haemopoietic colonies. Real time PCR for JAK2 V617F analysis and HRM for MPL exon 10 status identified one or more clonal marker in 71% of patients. This combined genetic approach increases the sensitivity of meeting the World Health Organization diagnostic criteria for these myeloproliferative neoplasms.
Regions of extreme synonymous codon selection in mammalian genes
Schattner, Peter; Diekhans, Mark
2006-01-01
Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911
Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H
2006-04-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.
Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.
2006-01-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031
Yue, M; Tian, Y G; Wang, Y J; Gu, Y; Bayaer, N; Hu, Q; Gu, W W
2014-02-27
The IGF-1 gene is an important regulating factor that has a growth-promoting effect on growth hormone. The IGF-1 gene promotes muscle cell differentiation in the muscle cell formation process. The IGF-1 gene also regulates the growth of skeletal muscle during skeletal muscle growth. In addition, the IGF-1 gene plays an important role in the formation of mammals and poultry embryos, and the process of postnatal growth. The IGF-1 gene has been implicated as a candidate gene for the regulation of pig growth traits. We analyzed exon 3 of the IGF-1 gene polymorphism in Tibetan miniature pigs (N = 128) by polymerase chain reaction-single-strand conformation polymorphism and DNA sequencing. One single nucleotide polymorphism (T40C) was found on exon 3 of the IGF-1 gene. Statistical analysis of genotype frequencies revealed that the T allele was dominant in Tibetan miniature pigs at the T40C locus. The association analysis showed that the IGF-1 mutation had an effect on the body weight, body length, and chest circumference of pigs aged 6-8 months. In addition, the IGF-1 mutation had an effect on body weight in pigs aged 9-11 months (P < 0.05). We speculated that the pigs with the TT genotype grow more rapidly compared to those with the TC genotype. The TC genotype of the Tibetan miniature pig has a smaller body type. This information provides a theoretical basis for the genetic background of Tibetan miniature pigs.
Poon, Kok Siong; Sng, Andrew Anjian; Ho, Cindy Weili; Koay, Evelyn Siew-Chuan
2015-01-01
Loss-of-function mutations in the phosphate regulating gene with homologies to endopeptidases on the X-chromosome (PHEX) have been causally associated with X-linked hypophosphatemic rickets (XLHR). The early diagnosis of XLHR in infants is challenging when it is based solely on clinical features and biochemical findings. We report a 7-month-old boy with a family history of hypophosphatemic rickets., who demonstrated early clinical evidence of rickets, although serial biochemical findings could not definitively confirm rickets. A sequencing assay targeting the PHEX gene was first performed on the mother’s DNA to screen for mutations in the 5′UTR, 22 coding exons, and the exon-intron junctions. Targeted mutation analysis and mRNA studies were subsequently performed on the boys’ DNA to investigate the pathogenicity of the identified mutation. Genetic screening of the PHEX gene revealed a novel mutation, c.1080-2A>C, at the splice acceptor site in intron 9. The detection of an aberrant mRNA transcript with skipped (loss of) exon 10 establishes its pathogenicity and confirms the diagnosis of XLHR in this infant. Genetic testing of the PHEX gene resulted in early diagnosis of XLHR, thus enabling initiation of therapy and prevention of progressive rachitic changes in the infant. PMID:26904698
Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R
2004-01-01
A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563
Pleiotropic biological activities of alternatively spliced TMPRSS2/ERG fusion gene transcripts
Wang, Jianghua; Cai, Yi; Yu, Wendong; Ren, Chengxi; Spencer, David M.; Ittmann, Michael
2008-01-01
TMPRSS2/ERG gene fusions are found in the majority of prostate cancers; however, there is significant heterogeneity in the 5′ region of the alternatively spliced fusion gene transcripts. We have found that there is also significant heterogeneity within the coding exons as well. There is variable inclusion of a 72-bp exon and other novel alternatively spliced isoforms. To assess the biological significance of these alternatively spliced transcripts, we expressed various transcripts in primary prostatic epithelial cells and in an immortalized prostatic epithelial cell line, PNT1a. The fusion gene transcripts promoted proliferation, invasion and motility with variable activities that depended on the structure of the 5′ region encoding the TMPRSS2/ERG fusion and the presence of the 72-bp exon. Cotransfection of different isoforms further enhanced biological activity, mimicking the situation in vivo, in which multiple isoforms are expressed. Finally, knockdown of the fusion gene in VCaP cells resulted in inhibition of proliferation in vitro and tumor progression in an in vivo orthotopic mice model. Our results indicate that TMPRSS2/ERG fusion isoforms have variable biological activities promoting tumor initiation and progression and are consistent with our previous clinical observations indicating that certain TMPRSS2/ERG fusion isoforms are significantly correlated with more aggressive disease. PMID:18922926
[Identifying and sequence analysis of HLA-B*2736].
Li, Zhen; Zou, Hong-Yan; Shao, Chao-Peng; Tang, Si; Wang, Da-Ming; Cheng, Liang-Hong
2007-11-01
An unknown HLA-B allele which was similar to HLA-B*270401 was detected by FLOW-SSOPCR-SSP and heterozygous sequence-based typing (SBT) in Chinese Han individual. Its anomalous patterns suggested the possible presence of new allele. Amplifying exon 2-5(include intron 2-4) of the HLA-B*27 allele separately by using allele-specific primers and sequencing in both directions. Identifying the difference between the novel B*27 allele and B*270401. The sequence of novel B*27 from exon 2 to partial exon 5 is 1 815 bp. There are 10 nt changes from B*270401 in exon 3-4, at nt634where A-->C(codon130 AGC-->CGC, 130 S-->R); nt670 where A-->T (codon142 ACC-->TCC, 142 T-->S); nt683 where G-->T (codon146 TGG-->TTG, 146 W-->L); nt698 where A-->T (codon151 GAG-->GTG, 151 E-->V); nt774 where G-->C (codon176 GAG-->GAC, 176 E-->D); nt776 where C-->A (codon177 ACG-->AAG, 177 T-->K); nt781 where C-->G (codon179 CAG-->GAG, 179Q-->E); nt789 where G-->T (codon181 GCG-->GCT) resulting no coding change; nt1438 where C-->T (codon206 GGC-->GGT) resulting no coding change; nt1449 where G-->C (codon210 GGG-->GCG, 210G-->A). In IMGT/HLA database, only three alleles (B*270502/2706/2732) have sequences of introns. The same sequence in intron 2 showed homology between the novel HLA-B*27 allele and B*2706, but their homology could not be supported in intron 3-4. Comparing the sequence of the novel B*27 allele in intron 3 and 4 with B*27 group, it showed there are three mutations at nt106 C-->G, nt179 G-->A, nt536 G-->A and one deletion at nt168 in intron 3 and one mutations at nt82 T-->C in intron 4, but the sequence of the novel B*27 allele in intron 3 and 4 was all the same to B*070201. The sequence was submitted to Gen-Bank and the accession number was DQ915176. The allele has been confirmed as an extension of B*2736 by the WHO Nomenclature committee in November 2006.
Diane Dietrich; Casey Crooks
2009-01-01
A pyranose 2-oxidase gene from the brown-rot basidiomycete Gloeophyllum trabeum was isolated using homology-based degenerate PCR. The gene structure was determined and compared to that of several pyranose 2-oxidases cloned from white-rot fungi. The G. trabeum pyranose 2-oxidase gene consists of 16 coding exons with canonical promoter CAAT and TATA elements in the 5âUTR...
Han, R-L; Lan, X-Y; Zhang, L-Z; Ren, G; Jing, Y-J; Li, M-J; Zhang, B; Zhao, M; Guo, Y-K; Kang, X-T; Chen, H
2010-01-01
Visfatin is a peptide that is predominantly expressed in visceral adipose tissue and is hypothesized to be related to obesity and insulin resistance. In this study, a novel silent single-nucleotide polymorphism (SNP) was found in exon 7 of the chicken visfatin gene (also known as PBEF1) by single-stranded conformation polymorphism (SSCP) and DNA sequencing. In total, 836 chickens forming an F2 resource population of Gushi chicken crossed with Anka broiler were genotyped by XbaI forced RFLP, and the associations of this polymorphism with chicken growth, carcass characteristics, and meat quality were analyzed. Significant associations were found between the polymorphism and 4-week body weight (BW4), 6-week body weight (BW6), 4-week body slanting length (BSL4), fat bandwidth (FBW), breast muscle water loss rate (BWLR) and breast muscle fiber density (BFD) (P < 0.05), as well as 4-week breastbone length (BBL4) (P < 0.01). These observations suggested that the polymorphism in exon7 of the visfatin gene had significant effects on the early growth traits of chicken.
Chan, Kuang-Lim; Rosli, Rozana; Tatarinova, Tatiana V; Hogan, Michael; Firdaus-Raih, Mohd; Low, Eng-Ti Leslie
2017-01-27
Gene prediction is one of the most important steps in the genome annotation process. A large number of software tools and pipelines developed by various computing techniques are available for gene prediction. However, these systems have yet to accurately predict all or even most of the protein-coding regions. Furthermore, none of the currently available gene-finders has a universal Hidden Markov Model (HMM) that can perform gene prediction for all organisms equally well in an automatic fashion. We present an automated gene prediction pipeline, Seqping that uses self-training HMM models and transcriptomic data. The pipeline processes the genome and transcriptome sequences of the target species using GlimmerHMM, SNAP, and AUGUSTUS pipelines, followed by MAKER2 program to combine predictions from the three tools in association with the transcriptomic evidence. Seqping generates species-specific HMMs that are able to offer unbiased gene predictions. The pipeline was evaluated using the Oryza sativa and Arabidopsis thaliana genomes. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed that the pipeline was able to identify at least 95% of BUSCO's plantae dataset. Our evaluation shows that Seqping was able to generate better gene predictions compared to three HMM-based programs (MAKER2, GlimmerHMM and AUGUSTUS) using their respective available HMMs. Seqping had the highest accuracy in rice (0.5648 for CDS, 0.4468 for exon, and 0.6695 nucleotide structure) and A. thaliana (0.5808 for CDS, 0.5955 for exon, and 0.8839 nucleotide structure). Seqping provides researchers a seamless pipeline to train species-specific HMMs and predict genes in newly sequenced or less-studied genomes. We conclude that the Seqping pipeline predictions are more accurate than gene predictions using the other three approaches with the default or available HMMs.
Novel mutations of MYO7A and USH1G in Israeli Arab families with Usher syndrome type 1.
Rizel, Leah; Safieh, Christine; Shalev, Stavit A; Mezer, Eedy; Jabaly-Habib, Haneen; Ben-Neriah, Ziva; Chervinsky, Elena; Briscoe, Daniel; Ben-Yosef, Tamar
2011-01-01
This study investigated the genetic basis for Usher syndrome type 1 (USH1) in four consanguineous Israeli Arab families. Haplotype analysis for all known USH1 loci was performed in each family. In families for which haplotype analysis was inconclusive, we performed genome-wide homozygosity mapping using a single nucleotide polymorphism (SNP) array. For mutation analysis, specific primers were used to PCR amplify the coding exons of the MYO7A, USH1C, and USH1G genes including intron-exon boundaries. Mutation screening was performed with direct sequencing. A combination of haplotype analysis and genome-wide homozygosity mapping indicated linkage to the USH1B locus in two families, USH1C in one family and USH1G in another family. Sequence analysis of the relevant genes (MYO7A, USH1C, and USH1G) led to the identification of pathogenic mutations in all families. Two of the identified mutations are novel (c.1135-1147dup in MYO7A and c.206-207insC in USH1G). USH1 is a genetically heterogenous condition. Of the five USH1 genes identified to date, USH1C and USH1G are the rarest contributors to USH1 etiology worldwide. It is therefore interesting that two of the four Israeli Arab families reported here have mutations in these two genes. This finding further demonstrates the unique genetic structure of the Israeli population in general, and the Israeli Arab population in particular, which due to high rates of consanguinity segregates many rare autosomal recessive genetic conditions.
Ding, X Z; Liang, C N; Guo, X; Xing, C F; Bao, P J; Chu, M; Pei, J; Zhu, X S; Yan, P
2012-01-01
Lipoprotein lipase (LPL) is considered as a key enzyme in the lipid deposition and metabolism in tissues. It is assumed to be a major candidate gene for genetic markers in lipid deposition. Therefore, the polymorphisms of the LPL gene and associations with carcass traits and viscera fat content were examined in 398 individuals from five yak (Bos grunniens) breeds using PCR-SSCP analysis and DNA sequencing. A novel nucleotide polymorphism (SNP)-C→T (nt19913) was identified located in exon 7 in the coding region of the LPL gene, which replacement was responsible for a Phe-to-Ser substitution at amino acid. Two alleles (A and B) and three genotypes designed as AA, AB and BB were detected in the PCR products. The frequencies of allele A were 0.7928, 0.7421, 0.7357, 0.6900 and 0.7083 for Tianzhu white yak (WY), Gannan yak (GY), Qinghai-Plateau yak (PY), Xinjiang yak (XY) and Datong yak (DY), respectively. The SNP loci was in Hardy-Weinberg equilibrium in five yak populations (P>0.05). Polymorphism of LPL gene was shown to be associated with carcass traits and lipid deposition. Least squares analysis revealed that there was a significant effect on live-weight (LW) (P<0.01), average daily weight gain (ADG) and carcass weight (P<0.05). Individuals with genotype BB had lower mean values than those with genotype AA and AB for loin eye area and viscera fat weight (% of LW) in 25-36 months (P<0.05). The results indicated that LPL gene is a strong candidate gene that affects carcass traits and fat deposition in yak.
Foster, R; Byrnes, E; Meldrum, C; Griffith, R; Ross, G; Upjohn, E; Braue, A; Scott, R; Varigos, G; Ferrao, P; Ashman, L K
2008-11-01
The receptor tyrosine kinase c-KIT plays a key role in normal mast cell development. Point mutations in c-KIT have been associated with sporadic or familial mastocytosis. Two unrelated pairs of apparently identical twins affected by cutaneous mastocytosis attending the Mastocytosis Clinic at the Royal Children's Hospital, Melbourne, provided an opportunity to assess the possible contribution of c-KIT germline mutations or polymorphisms in this disease. Tissue biopsy, blood and/or buccal swab specimens were collected from 10 children with mastocytosis. To detect germline mutations/polymorphisms in c-KIT, we studied all coding exons by denaturing high pressure liquid chromatography. Exons showing mismatches were examined by direct sequencing. The influence of the substitution identified was further examined by expressing the variant form of c-KIT in factor-dependent FDC-P1 cells. In both pairs of twins, a heterozygous ATG to CTG transition in codon 541 was observed, resulting in the substitution of a methionine residue in the transmembrane domain by leucine (M541L). In each case, one parent was also heterozygous for this allele. Expression of M541L KIT in FDC-P1 cells enabled them to grow in human KIT ligand (stem cell factor, SCF) but did not confer factor independence. Compared with cells expressing wild-type KIT at a similar level, M541L KIT-expressing cells displayed enhanced growth at low levels of SCF, and heightened sensitivity to the KIT inhibitor, imatinib mesylate. The data suggest that the single nucleotide polymorphism resulting in the substitution M541L may predispose to paediatric mastocytosis.
Mutations in the GIGYF2 (TNRC15) Gene at the PARK11 Locus in Familial Parkinson Disease
Lautier, Corinne; Goldwurm, Stefano; Dürr, Alexandra; Giovannone, Barbara; Tsiaras, William G.; Pezzoli, Gianni; Brice, Alexis; Smith, Robert J.
2008-01-01
The genetic basis for association of the PARK11 region of chromosome 2 with familial Parkinson disease (PD) is unknown. This study examined the GIGYF2 (Grb10-Interacting GYF Protein-2) (TNRC15) gene, which contains the PARK11 microsatellite marker with the highest linkage score (D2S206, LOD 5.14). The 27 coding exons of the GIGYF2 gene were sequenced in 123 Italian and 126 French patients with familial PD, plus 131 Italian and 96 French controls. A total of seven different GIGYF2 missense mutations resulting in single amino acid substitutions were present in 12 unrelated PD index patients (4.8%) and not in controls. Three amino acid insertions or deletions were found in four other index patients and absent in controls. Specific exon sequencing showed that these ten sequence changes were absent from a further 91 controls. In four families with amino acid substitutions in which at least one other PD case was available, the GIGYF2 mutations (Asn56Ser, Thr112Ala, and Asp606Glu) segregated with PD. There were, however, two unaffected carriers in one family, suggesting age-dependent or incomplete penetrance. One index case (PD onset age 33) inherited a GIGYF2 mutation (Ile278Val) from her affected father (PD onset age 66) and a previously described PD-linked mutation in the LRRK2 gene (Ile1371Val) from her affected mother (PD onset age 61). The earlier onset and severe clinical course in the index patient suggest additive effects of the GIGYF2 and LRRK2 mutations. These data strongly support GIGYF2 as a PARK11 gene with a causal role in familial PD. PMID:18358451
Marini, Francesca; Giusti, Francesca; Fossi, Caterina; Cioppi, Federica; Cianferotti, Luisella; Masi, Laura; Boaretto, Francesca; Zovato, Stefania; Cetani, Filomena; Colao, Annamaria; Davì, Maria Vittoria; Faggiano, Antongiulio; Fanciulli, Giuseppe; Ferolla, Piero; Ferone, Diego; Loli, Paola; Mantero, Franco; Marcocci, Claudio; Opocher, Giuseppe; Beck-Peccoz, Paolo; Persani, Luca; Scillitani, Alfredo; Guizzardi, Fabiana; Spada, Anna; Tomassetti, Paola; Tonelli, Francesco; Brandi, Maria Luisa
2018-03-01
Multiple endocrine neoplasia type 1 (MEN1) is caused by germline inactivating mutations of the MEN1 gene. Currently, no direct genotype-phenotype correlation is identified. We aim to analyze MEN1 mutation site and features, and possible correlations between the mutation type and/or the affected menin functional domain and clinical presentation in patients from the Italian multicenter MEN1 database, one of the largest worldwide MEN1 mutation series published to date. The study included the analysis of MEN1 mutation profile in 410 MEN1 patients [370 familial cases from 123 different pedigrees (48 still asymptomatic at the time of this study) and 40 single cases]. We identified 99 different mutations: 41 frameshift [small intra-exon deletions (28) or insertions (13)], 13 nonsense, 26 missense and 11 splicing site mutations, 4 in-frame small deletions, and 4 intragenic large deletions spanning more than one exon. One family had two different inactivating MEN1 mutations on the same allele. Gastro-entero-pancreatic tumors resulted more frequent in patients with a nonsense mutation, and thoracic neuroendocrine tumors in individuals bearing a splicing-site mutation. Our data regarding mutation type frequency and distribution are in accordance with previously published data: MEN1 mutations are scattered through the entire coding region, and truncating mutations are the most common in MEN1 syndrome. A specific direct correlation between MEN1 genotype and clinical phenotype was not found in all our families, and wide intra-familial clinical variability and variable disease penetrance were both confirmed, suggesting a role for modifying, still undetermined, factors, explaining the variable MEN1 tumorigenesis.
Waardenburg syndrome: Novel mutations in a large Brazilian sample.
Bocángel, Magnolia Astrid Pretell; Melo, Uirá Souto; Alves, Leandro Ucela; Pardono, Eliete; Lourenço, Naila Cristina Vilaça; Marcolino, Humberto Vicente Cezar; Otto, Paulo Alberto; Mingroni-Netto, Regina Célia
2018-06-01
This paper deals with the molecular investigation of Waardenburg syndrome (WS) in a sample of 49 clinically diagnosed probands (most from southeastern Brazil), 24 of them having the type 1 (WS1) variant (10 familial and 14 isolated cases) and 25 being affected by the type 2 (WS2) variant (five familial and 20 isolated cases). Sequential Sanger sequencing of all coding exons of PAX3, MITF, EDN3, EDNRB, SOX10 and SNAI2 genes, followed by CNV detection by MLPA of PAX3, MITF and SOX10 genes in selected cases revealed many novel pathogenic variants. Molecular screening, performed in all patients, revealed 19 causative variants (19/49 = 38.8%), six of them being large whole-exon deletions detected by MLPA, seven (four missense and three nonsense substitutions) resulting from single nucleotide substitutions (SNV), and six representing small indels. A pair of dizygotic affected female twins presented the c.430delC variant in SOX10, but the mutation, imputed to gonadal mosaicism, was not found in their unaffected parents. At least 10 novel causative mutations, described in this paper, were found in this Brazilian sample. Copy-number-variation detected by MLPA identified the causative mutation in 12.2% of our cases, corresponding to 31.6% of all causative mutations. In the majority of cases, the deletions were sporadic, since they were not present in the parents of isolated cases. Our results, as a whole, reinforce the fact that the screening of copy-number-variants by MLPA is a powerful tool to identify the molecular cause in WS patients. Copyright © 2018 Elsevier Masson SAS. All rights reserved.
D'Amora, Paulo; Sato, Hélio; Girão, Manoel J B C; Silva, Ismael D C G; Schor, Eduardo
2006-09-01
To study possible correlation between the prevalence of polymorphisms in the type I interleukin-1 receptor gene and pelvic endometriosis. Genotypes of 223 women were analyzed: 109 women with surgically and histologically confirmed endometriosis and 114 healthy women. Distributions of two single-base polymorphisms of the human interleukin-1 receptor type I (IL-1RI) gene were evaluated: PstI, due to a C-->T transition in exon 1B and BsrBI a C-->A transition at position 52 in exon 1C. Polymorphisms were detected by polymerase chain reaction (PCR) followed by restriction fragment length polymorphism analysis (RFLP) resolved on 3% agarose gels stained with ethidium bromide. Genotypes for PstI polymorphisms did not differ significantly among control and endometriosis (P = 0.058). However, in relation to BsrBI polymorphism, protective risk was observed for the development of endometriosis [OR 0.39-IC 95% (0.2-0.9)]. BsrBI heterozygote genotype (C/A) showed protective effect against endometriosis development.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mebarki, F.; Forest, M.G.; Josso, N.
The androgen insensivity syndrome (AIS) is a recessive X-linked disorder resulting from a deficient function of the androgen receptor (AR). The human AR gene has 3 functional domains: N-terminal encoded by exon 1, DNA-binding domain encoded by exons 2 and 3, and androgen-binding domain encoded by exons 4 to 8. In order to characterize the molecular defects of the AR gene in AIS, the entire coding regions and the intronic bording sequences of the AR gene were amplified by PCR before automatic direct sequencing in 45 patients. Twenty seven different point mutations were found in 32 unrelated AIS patients: 18more » with a complete form (CAIS), 14 with a partial form (PAIS); 18 of these mutations are novel mutations, not published to date. Only 3 mutations were repeatedly found: R804H in 3 families; M780I in 3 families and R774C in 2 families. For 26 patients out of the 32 found to have a mutation, maternal DNA was collected and sequenced: 6 de novo mutations were detected (i.e. 23% of the cases). Finally, no mutation was detected in 13 patients (29%): 7 with CAIS and 6 familial severe PAIS. The latter all presented with perineal hypospadias, micropenis, 4 out of 6 being raised as girl. Diagnosis of AIS in these 13 families in whom no mutation was detected is supported by the following criteria: clinical data, familial history (2 or 3 index cases in the same family), familial segregation of the polymorphic CAG repeat of the AR gene. Mutations in intronic regions or the promoter of the AR gene could not explain all cases of AIS without mutations in the AR coding regions, because AR binding (performed in 9 out of 13) was normal in 6, suggesting the synthesis of an AR protein. This situation led us to speculate that another X-linked factor associated with the AR could be implicated in some cases of AIS.« less
Johnson, Alexander A. T.
2017-01-01
Iron (Fe) uptake in graminaceous plant species occurs via the release and uptake of Fe-chelating compounds known as mugineic acid family phytosiderophores (MAs). In the MAs biosynthetic pathway, nicotianamine aminotransferase (NAAT) and deoxymugineic acid synthase (DMAS) enzymes catalyse the formation of 2’-deoxymugineic acid (DMA) from nicotianamine (NA). Here we describe the identification and characterisation of six TaNAAT and three TaDMAS1 genes in bread wheat (Triticum aestivum L.). The coding sequences of all six TaNAAT homeologs consist of seven exons with ≥88.0% nucleotide sequence identity and most sequence variation present in the first exon. The coding sequences of the three TaDMAS1 homeologs consist of three exons with ≥97.8% nucleotide sequence identity. Phylogenetic analysis revealed that the TaNAAT and TaDMAS1 proteins are most closely related to the HvNAAT and HvDMAS1 proteins of barley and that there are two distinct groups of TaNAAT proteins—TaNAAT1 and TaNAAT2 –that correspond to the HvNAATA and HvNAATB proteins, respectively. Quantitative reverse transcription-PCR analysis revealed that the TaNAAT2 genes are expressed at highest levels in anther tissues whilst the TaNAAT1 and TaDMAS1 genes are expressed at highest levels in root tissues of bread wheat. Furthermore, the TaNAAT1, TaNAAT2 and TaDMAS1 genes were differentially regulated by plant Fe status and their expression was significantly upregulated in root tissues from day five onwards during a seven-day Fe deficiency treatment. The identification and characterization of the TaNAAT1, TaNAAT2 and TaDMAS1 genes provides a valuable genetic resource for improving bread wheat growth on Fe deficient soils and enhancing grain Fe nutrition. PMID:28475636
Grossen, Christine; Keller, Lukas; Biebach, Iris; Croll, Daniel
2014-01-01
The major histocompatibility complex (MHC) is a crucial component of the vertebrate immune system and shows extremely high levels of genetic polymorphism. The extraordinary genetic variation is thought to be ancient polymorphisms maintained by balancing selection. However, introgression from related species was recently proposed as an additional mechanism. Here we provide evidence for introgression at the MHC in Alpine ibex (Capra ibex ibex). At a usually very polymorphic MHC exon involved in pathogen recognition (DRB exon 2), Alpine ibex carried only two alleles. We found that one of these DRB alleles is identical to a DRB allele of domestic goats (Capra aegagrus hircus). We sequenced 2489 bp of the coding and non-coding regions of the DRB gene and found that Alpine ibex homozygous for the goat-type DRB exon 2 allele showed nearly identical sequences (99.8%) to a breed of domestic goats. Using Sanger and RAD sequencing, microsatellite and SNP chip data, we show that the chromosomal region containing the goat-type DRB allele has a signature of recent introgression in Alpine ibex. A region of approximately 750 kb including the DRB locus showed high rates of heterozygosity in individuals carrying one copy of the goat-type DRB allele. These individuals shared SNP alleles both with domestic goats and other Alpine ibex. In a survey of four Alpine ibex populations, we found that the region surrounding the DRB allele shows strong linkage disequilibria, strong sequence clustering and low diversity among haplotypes carrying the goat-type allele. Introgression at the MHC is likely adaptive and introgression critically increased MHC DRB diversity in the genetically impoverished Alpine ibex. Our finding contradicts the long-standing view that genetic variability at the MHC is solely a consequence of ancient trans-species polymorphism. Introgression is likely an underappreciated source of genetic diversity at the MHC and other loci under balancing selection. PMID:24945814
Haut, Donald D.; Pintel, D. J.
1998-01-01
Alternative splicing of pre-mRNAs plays a critical role in maximizing the coding capacity of the small parvovirus genome. The small-intron region of minute virus of mice (MVM) pre-mRNAs undergoes an unusual pattern of overlapping alternative splicing—using two donors (D1 and D2) and two acceptors (A1 and A2) within a region of 120 nucleotides—that determines the steady-state ratios of the various viral mRNAs. In this report, we show that the determinants that govern excision of the small intron are complex and are also required for efficient definition of the upstream exon. For the MVM small intron in its natural context, the two donors appear to compete for the splicing machinery: the position of D1 favors its usage, while the primary sequence of D2 must be more like the consensus sequence than is D1 to be used efficiently. We have genetically defined the branch points that are used for generation of the major and minor spliced forms and show that recognition of components of the small-intron acceptors is likely to be the dominant determinant in alternative small-intron excision. We have also identified a G-rich intronic enhancer sequence within the small intron that is essential for splicing of the minor form (D2 to A2) but not the major form (D1 to A1) of MVM mRNAs and is required for efficient definition of the upstream NS2-specific exon. In its natural context, the small intron appears to be excised by a mechanism consistent with intron definition. When the MVM small intron is expanded, various parameters of its excision are altered, indicating that critical cis-acting signals are context dependent. Relative use of the donors and acceptors is altered, and the upstream NS2-specific exon is no longer efficiently defined. The fact that definition of the upstream NS2-specific exon can be achieved by the MVM small intron in its natural context, but not when it is expanded, suggests that the multiple determinants that govern definition and excision of the small intron are required, in concert, for upstream exon definition. Our data are consistent with a model in which alternative splicing of the MVM P4-generated pre-mRNAs is governed by a hybrid of intron- and exon-defining mechanisms. PMID:9499034
NASA Technical Reports Server (NTRS)
Piao, C. Q.; Willey, J. C.; Hei, T. K.; Hall, E. J. (Principal Investigator)
1999-01-01
The cellular and molecular mechanisms of radiation-induced lung cancer are not known. In the present study, alterations of p53 in tumorigenic human papillomavirus-immortalized human bronchial epithelial (BEP2D) cells induced by a single low dose of either alpha-particles or 1 GeV/nucleon (56)Fe were analyzed by PCR-single-stranded conformation polymorphism (SSCP) coupled with sequencing analysis and immunoprecipitation assay. A total of nine primary and four secondary tumor cell lines, three of which were metastatic, together with the parental BEP2D and primary human bronchial epithelial (NHBE) cells were studied. The immunoprecipitation assay showed overexpression of mutant p53 proteins in all the tumor lines but not in NHBE and BEP2D cells. PCR-SSCP and sequencing analysis found band shifts and gene mutations in all four of the secondary tumors. A G-->T transversion in codon 139 in exon 5 that replaced Lys with Asn was detected in two tumor lines. One mutation each, involving a G-->T transversion in codon 215 in exon 6 (Ser-->lle) and a G-->A transition in codon 373 in exon 8 (Arg-->His), was identified in the remaining two secondary tumors. These results suggest that p53 alterations correlate with tumorigenesis in the BEP2D cell model and that mutations in the p53 gene may be indicative of metastatic potential.
A deep intronic mutation in the SLC12A3 gene leads to Gitelman syndrome.
Nozu, Kandai; Iijima, Kazumoto; Nozu, Yoshimi; Ikegami, Ei; Imai, Takehide; Fu, Xue Jun; Kaito, Hiroshi; Nakanishi, Koichi; Yoshikawa, Norishige; Matsuo, Masafumi
2009-11-01
Many mutations have been detected in the SLC12A3 gene of Gitelman syndrome (GS, OMIM 263800) patients. In previous studies, only one mutant allele was detected in approximately 20 to 41% of patients with GS; however, the exact reason for the nonidentification has not been established. In this study, we used RT-PCR using mRNA to investigate for the first time transcript abnormalities caused by deep intronic mutation. Direct sequencing analysis of leukocyte DNA identified one base insertion in exon 6 (c.818_819insG), but no mutation was detected in another allele. We analyzed RNA extracted from leukocytes and urine sediments and detected unknown sequence containing 238bp between exons 13 and 14. The genomic DNA analysis of intron 13 revealed a single-base substitution (c.1670-191C>T) that creates a new donor splice site within the intron resulting in the inclusion of a novel cryptic exon in mRNA. This is the first report of creation of a splice site by a deep intronic single-nucleotide change in GS and the first report to detect the onset mechanism in a patient with GS and missing mutation in one allele. This molecular onset mechanism may partly explain the poor success rate of mutation detection in both alleles of patients with GS.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chuang, J.L.; Fisher, C.R.; Chuang, D.T.
1994-08-01
The authors report the occurrence of three novel mutations in the E1[alpha] (BCKDHA) locus of the branched-chain [alpha]-keto acid dehydrogenase (BCKAD) complex that cause maple syrup urine disease (MSUD). An 8-bp deletion in exon 7 is present in one allele of a compound-heterozygous patient (GM-649). A single C nucleotide insertion in exon 2 occurs in one allele of an intermediate-MSUD patient (Lo). The second allele of patient Lo carries an A-to-G transition in exon 9 of the E1[alpha] gene. This missense mutation changes Tyr-368 to Cys (Y368C) in the E1[alpha] subunit. Both the 8-bp deletion and the single C insertionmore » generate a downstream nonsense codon. Both mutations appear to be associated with a low abundance of the mutant E1[alpha] mRNA, as determined by allele-specific oligonucleotide probing. Transfection studies strongly suggest that the Y368C substitution in the E1[alpha] subunit impairs its proper assembly with the normal E1[beta]. Unassembled as well as misassembled E1[alpha] and E1[beta] subunits are degraded in the cell. 32 refs., 8 figs.« less
Mata López, Sara; Hammond, James J; Rigsby, Madison B; Balog-Alvarez, Cynthia J; Kornegay, Joe N; Nghiem, Peter P
2018-05-29
Boys with Duchenne muscular dystrophy (DMD) have DMD gene mutations, with associated loss of the dystrophin protein and progressive muscle degeneration and weakness. Corticosteroids and palliative support are currently the best treatment options. The long-term benefits of recently approved compounds such as eteplirsen and ataluren remain to be seen. Dogs with naturally occurring dystrophinopathies show progressive disease akin to that of DMD. Accordingly, canine DMD models are useful for studies of pathogenesis and preclinical therapy development. A dystrophin-deficient, male border collie dog was evaluated at the age of 5 months for progressive muscle weakness and dysphagia. Dramatically increased serum creatine kinase levels (41,520 U/L; normal range 59-895 U/L) were seen on a biochemistry panel. Histopathologic changes characteristic of dystrophinopathy were seen. Dystrophin was absent in the skeletal muscle on immunofluorescence microscopy and western blot. Whole genome sequencing, polymerase chain reaction, and Sanger sequencing revealed a frameshift, single nucleotide deletion in canine DMD exon 20, position 27,626,466 (c.2841delT mRNA), resulting in a stop codon six nucleotides downstream. Semen was archived for future line perpetuation. This spontaneous canine dystrophinopathy occurred due to a novel mutation in the minor DMD mutation hotspot (between exons 2 through 20). Perpetuating this line could allow for preclinical testing of genetic therapies targeted to this area of the DMD gene.
Natural gene therapy in monozygotic twins with Fanconi anemia.
Mankad, Anuj; Taniguchi, Toshiyasu; Cox, Barbara; Akkari, Yassmine; Rathbun, R Keaney; Lucas, Lora; Bagby, Grover; Olson, Susan; D'Andrea, Alan; Grompe, Markus
2006-04-15
Monozygotic twin sisters, with nonhematologic symptoms of Fanconi anemia (FA), were discovered to be somatic mosaics for mutations in the FANCA gene. Skin fibroblasts, but not lymphocytes or committed hematopoietic progenitors, were sensitive to DNA cross-linking agents. Molecular analysis revealed, in skin cells of both twins, a frameshift causing deletion in exon 27 (2555deltaT) and an exon 28 missense mutation (2670G>A/R880Q). The latter resulted in primarily cytoplasmic expression and reduced function of the mutant FANCA (R880Q) protein. Surprisingly, the same acquired exon 30 missense change (2927G>A/E966K) was detected in the hematopoietic cells of both sisters, but not in their fibroblasts, nor in either parent. This compensatory mutation existed in cis with the maternal exon 28 mutation, and it restored function and nuclear localization of the resulting protein. Both sisters have been free of hematologic symptoms for more than 2 decades, suggesting that this de novo mutation occurred prenatally in a single hematopoietic stem cell (HSC) in one twin and that descendants of this functionally corrected HSC, via intra-uterine circulation, repopulated the blood lineages of both sisters. This finding suggests that treating FA patients with gene therapy might require transduction of only a few hematopoietic stem cells.
Hinrich, Anthony J; Jodelka, Francine M; Chang, Jennifer L; Brutman, Daniella; Bruno, Angela M; Briggs, Clark A; James, Bryan D; Stutzmann, Grace E; Bennett, David A; Miller, Steven A; Rigo, Frank; Marr, Robert A; Hastings, Michelle L
2016-04-01
Apolipoprotein E receptor 2 (ApoER2) is an apolipoprotein E receptor involved in long-term potentiation, learning, and memory. Given its role in cognition and its association with the Alzheimer's disease (AD) risk gene, apoE, ApoER2 has been proposed to be involved in AD, though a role for the receptor in the disease is not clear. ApoER2 signaling requires amino acids encoded by alternatively spliced exon 19. Here, we report that the balance of ApoER2 exon 19 splicing is deregulated in postmortem brain tissue from AD patients and in a transgenic mouse model of AD To test the role of deregulated ApoER2 splicing in AD, we designed an antisense oligonucleotide (ASO) that increases exon 19 splicing. Treatment of AD mice with a single dose of ASO corrected ApoER2 splicing for up to 6 months and improved synaptic function and learning and memory. These results reveal an association between ApoER2 isoform expression and AD, and provide preclinical evidence for the utility of ASOs as a therapeutic approach to mitigate Alzheimer's disease symptoms by improving ApoER2 exon 19 splicing. © 2016 The Authors. Published under the terms of the CC BY 4.0 license.
An insight into the sialome of the horse fly, Tabanus bromius
Ribeiro, José M.C.; Kazimirova, Maria; Takac, Peter; Andersen, John F.; Francischetti, Ivo M.B.
2015-01-01
Blood feeding animals face their host's defenses against tissue injury and blood loss while attempting to feed. One adaptation to surmount these barriers involves the evolution of a salivary potion that disarms their host's inflammatory and anti-hemostatic processes. The composition of the peptide moiety of this potion, or sialome (from the Greek sialo=saliva), can be deducted in part by proper interpretation of the blood feeder' sialotranscriptome. In this work we disclose the sialome of the blood feeding adult female Tabanus bromius. Following assembly of over 75 million Illumina reads (101 nt long) 16,683 contigs were obtained from which 4,078 coding sequences were extracted. From these, 320 were assigned as coding for putative secreted proteins. These 320 contigs mapped 85% of the reads. The antigen-5 proteins family was studied in detail, indicating three Tabanus specific clades with and without disintegrin domains, as well as with and without leukotriene binding domains. Defensins were also detailed; a clade of salivary tabanid peptides was found lacking the propeptide domain ending in the KR dipeptide signaling furin cleavage. Novel protein families were also disclosed. Viral transcripts were identified closely matching the Kotonkan virus capsid proteins. Full length Mariner transposases were also identified. A total of 3,043 coding sequences and their protein products were deposited in Genbank. Hyperlinked excel spreadsheets containing the coding sequences and their annotation are available at http://exon.niaid.nih.gov/transcriptome/T_bromius/Tbromius-web.xlsx (hyperlinked excel spreadsheet, 11 MB) and http://exon.niaid.nih.gov/transcriptome/T_bromius/Tbromius-SA.zip (Standalone excel with all local links, 360 MB). These sequences provide for a platform from which further proteomic studies may be designed to identify salivary proteins from T. bromius that are of pharmacological interest or used as immunological markers of host exposure. PMID:26369729
2015-10-01
a promising target for precision therapy , but the mechanisms leading to hypermutation, optimal methods to measure hypermutation status in the ...1 was largely completed in Year 1 and is summarized below. We published a manuscript in Nature Communications based on the work accomplished in Aim...multiplexing 24 samples per lane on a HiSeq2500. The BROCA assay uses the Agilent SureSelect enrichment system to capture the coding exons and
Age of heart disease presentation and dysmorphic nuclei in patients with LMNA mutations
Core, Jason Q.; Mehrabi, Mehrsa; Robinson, Zachery R.; Ochs, Alexander R.; McCarthy, Linda A.; Zaragoza, Michael V.
2017-01-01
Nuclear shape defects are a distinguishing characteristic in laminopathies, cancers, and other pathologies. Correlating these defects to the symptoms, mechanisms, and progression of disease requires unbiased, quantitative, and high-throughput means of quantifying nuclear morphology. To accomplish this, we developed a method of automatically segmenting fluorescently stained nuclei in 2D microscopy images and then classifying them as normal or dysmorphic based on three geometric features of the nucleus using a package of Matlab codes. As a test case, cultured skin-fibroblast nuclei of individuals possessing LMNA splice-site mutation (c.357-2A>G), LMNA nonsense mutation (c.736 C>T, pQ246X) in exon 4, LMNA missense mutation (c.1003C>T, pR335W) in exon 6, Hutchinson-Gilford Progeria Syndrome, and no LMNA mutations were analyzed. For each cell type, the percentage of dysmorphic nuclei, and other morphological features such as average nuclear area and average eccentricity were obtained. Compared to blind observers, our procedure implemented in Matlab codes possessed similar accuracy to manual counting of dysmorphic nuclei while being significantly more consistent. The automatic quantification of nuclear defects revealed a correlation between in vitro results and age of patients for initial symptom onset. Our results demonstrate the method’s utility in experimental studies of diseases affecting nuclear shape through automated, unbiased, and accurate identification of dysmorphic nuclei. PMID:29149195
Age of heart disease presentation and dysmorphic nuclei in patients with LMNA mutations.
Core, Jason Q; Mehrabi, Mehrsa; Robinson, Zachery R; Ochs, Alexander R; McCarthy, Linda A; Zaragoza, Michael V; Grosberg, Anna
2017-01-01
Nuclear shape defects are a distinguishing characteristic in laminopathies, cancers, and other pathologies. Correlating these defects to the symptoms, mechanisms, and progression of disease requires unbiased, quantitative, and high-throughput means of quantifying nuclear morphology. To accomplish this, we developed a method of automatically segmenting fluorescently stained nuclei in 2D microscopy images and then classifying them as normal or dysmorphic based on three geometric features of the nucleus using a package of Matlab codes. As a test case, cultured skin-fibroblast nuclei of individuals possessing LMNA splice-site mutation (c.357-2A>G), LMNA nonsense mutation (c.736 C>T, pQ246X) in exon 4, LMNA missense mutation (c.1003C>T, pR335W) in exon 6, Hutchinson-Gilford Progeria Syndrome, and no LMNA mutations were analyzed. For each cell type, the percentage of dysmorphic nuclei, and other morphological features such as average nuclear area and average eccentricity were obtained. Compared to blind observers, our procedure implemented in Matlab codes possessed similar accuracy to manual counting of dysmorphic nuclei while being significantly more consistent. The automatic quantification of nuclear defects revealed a correlation between in vitro results and age of patients for initial symptom onset. Our results demonstrate the method's utility in experimental studies of diseases affecting nuclear shape through automated, unbiased, and accurate identification of dysmorphic nuclei.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tsugu, H.; Horowitz, R.; Gibson, N.
1994-12-01
Sera from approximately 30% of patients with systemic lupus erythematosus (SLE) contain high titers of autoantibodies that bind to the 52-kDa Ro/SSA protein. We previously detected polymorphisms in the 52-kDa Ro/SSA gene (SSA1) with restriction enzymes, one of which is strongly associated with the presence of SLE (P < 0.0005) in African Americans. A higher disease frequency and more severe forms of the disease are commonly noted among these female patients. To determine the location and nature of this polymorphism, we obtained two clones that span 8.5 kb of the 52-kDa Ro/SSA locus including its upstream regulatory region. Six exonsmore » were identified, and their nucleotide sequences plus adjacent noncoding regions were determined. No differences were found between these exons and the coding region of one of the reported cDNAs. The disease-associated polymorphic site suggested by a restriction enzyme map and confirmed by DNA amplification and nucleotide sequencing was present upstream of exon 1. This polymorphism may be a genetic marker for a disease-related variation in the coding region for the protein or in the upstream regulatory region of this gene. Although this RFLP is present in Japanese, it is not associated with lupus in this race. 41 refs., 4 figs., 2 tabs.« less
Facial asymmetry and clinical manifestations in patients with novel insertion of the TCOF1 gene.
Su, P-H; Liu, Y-F; Yu, J-S; Chen, J-Y; Chen, S-J; Lai, Y-J
2012-11-01
This study explored the role of TCOF1 insertion mutations in Taiwanese patients with craniofacial anomalies. Twelve patients with single or multiple, asymmetrical congenital craniofacial anomalies were enrolled. Genomic DNA was prepared from leukocytes; the coding regions of TCOF1 were analyzed by polymerase chain reaction and direct sequencing. Clinical manifestations were correlated to the TCOF1 mutation. Six of 12 patients diagnosed with hemifacial microsomia exhibited a novel insertion mutation 4127 ins G (frameshift) in exon 24 in the TCOF1 gene. All six patients were diagnosed with anomalies on the left side. In addition, four of these six patients had hearing impairment; three had other major anomalies; and two had developmental delay. The insertion caused a frameshift, an early truncation, the loss of two putative nuclear localization signals (residues 1404-1420 and 1424-1440), and the loss of coiled coil domain (1406-1426) in treacle protein. These findings support the existence of two regulators of growth of the mandibular condyles. © 2011 John Wiley & Sons A/S.
The ATRX cDNA is prone to bacterial IS10 element insertions that alter its structure.
Valle-García, David; Griffiths, Lyra M; Dyer, Michael A; Bernstein, Emily; Recillas-Targa, Félix
2014-01-01
The SWI/SNF-like chromatin-remodeling protein ATRX has emerged as a key factor in the regulation of α-globin gene expression, incorporation of histone variants into the chromatin template and, more recently, as a frequently mutated gene across a wide spectrum of cancers. Therefore, the availability of a functional ATRX cDNA for expression studies is a valuable tool for the scientific community. We have identified two independent transposon insertions of a bacterial IS10 element into exon 8 of ATRX isoform 2 coding sequence in two different plasmids derived from a single source. We demonstrate that these insertion events are common and there is an insertion hotspot within the ATRX cDNA. Such IS10 insertions produce a truncated form of ATRX, which significantly compromises its nuclear localization. In turn, we describe ways to prevent IS10 insertion during propagation and cloning of ATRX-containing vectors, including optimal growth conditions, bacterial strains, and suggested sequencing strategies. Finally, we have generated an insertion-free plasmid that is available to the community for expression studies of ATRX.
FERMT1 promoter mutations in patients with Kindler syndrome.
Has, C; Chmel, N; Levati, L; Neri, I; Sonnenwald, T; Pigors, M; Godbole, K; Dudhbhate, A; Bruckner-Tuderman, L; Zambruno, G; Castiglia, D
2015-09-01
Mutations in the FERMT1 gene, encoding the focal adhesion protein kindlin-1 underlie the Kindler syndrome (KS), an autosomal recessive skin disorder with a phenotype comprising skin blistering, photosensitivity, progressive poikiloderma with extensive skin atrophy, and propensity to skin cancer. The FERMT1 mutational spectrum comprises gross genomic deletions, splice site, nonsense, and frameshift mutations, which are scattered over the coding region spanning exon 2-15. We now report three KS families with mutations affecting the promoter region of FERMT1. Two of these mutations are large deletions (∼38.0 and 1.9 kb in size) and one is a single nucleotide variant (c.-20A>G) within the 5' untranslated region (UTR). Each mutation resulted in loss of gene expression in patient skin or cultured keratinocytes. Reporter assays showed the functional relevance of the genomic regions deleted in our patients for FERMT1 gene transcription and proved the causal role of the c.-20A>G variant in reducing transcriptional activity. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Lack of association between sigma receptor gene variants and schizophrenia.
Satoh, Fumiaki; Miyatake, Ryosuke; Furukawa, Aizo; Suwaki, Hiroshi
2004-08-01
Several pharmacological studies suggest the possible involvement of sigma(1) receptors in the pathogenesis of schizophrenia. An association has been reported between schizophrenia and two variants (GC-241-240TT and Gln2Pro) in the sigma(1) receptor gene (SIGMAR1). We also previously reported that, along with T-485 A, these two variants alter SIGMAR1 function. To investigate the role of SIGMAR1 in conveying susceptibility to schizophrenia, we performed a case-control study. We initially screened for polymorphisms in the SIGMAR1 coding region using PCR-single strand conformation polymorphism analysis. The distribution of SIGMAR1 polymorphisms was analyzed in 100 schizophrenic and 104 control subjects. A novel G620A variant was detected in exon4. G620A was predicted to alter the amino acid represented by codon 211 from arginine to glutamine. Our case-control study showed no significant association between the T-485 A, GC-241-240TT, Gln2Pro, and G620A (Arg211Gln) variants and schizophrenia and clinical characteristics. These findings suggest that these SIGMAR1 variants may not affect susceptibility to schizophrenia.
Tavira, Beatriz; Coto, Eliecer; Diaz-Corte, Carmen; Alvarez, Victoria; López-Larrea, Carlos; Ortega, Francisco
2013-08-01
The CYP3A5*3 and CYP3A4*1B alleles have been related with tacrolimus (Tac) dose requirements. The rare CYP3A4*22 variant has also been associated with a significantly lower Tac dose. We genotyped the three single-nucleotide polymorphisms in 206 kidney-transplanted patients who received Tac as the primary immunosuppressor. CYP3A5*1 and CYP3A4*1B allele carriers received a significantly higher Tac dose (P<0.01) compared with wild-type homozygotes. We did not find significant differences between the CYP3A4*22 genotypes, either nominally or according to the CYP3A5 genotype (expressers vs. nonexpressers). Sequencing of CYP3A4 coding exons in a total of 15 patients revealed only one nonreported missense change (p.P227>T) in one patient. We concluded that CYP3A5*3 and CYP3A4*1B were the main determinants of the Tac dose-adjusted blood concentration in our cohort of renal-transplanted patients.
Mutation Scanning in Wheat by Exon Capture and Next-Generation Sequencing.
King, Robert; Bird, Nicholas; Ramirez-Gonzalez, Ricardo; Coghill, Jane A; Patil, Archana; Hassani-Pak, Keywan; Uauy, Cristobal; Phillips, Andrew L
2015-01-01
Targeted Induced Local Lesions in Genomes (TILLING) is a reverse genetics approach to identify novel sequence variation in genomes, with the aims of investigating gene function and/or developing useful alleles for breeding. Despite recent advances in wheat genomics, most current TILLING methods are low to medium in throughput, being based on PCR amplification of the target genes. We performed a pilot-scale evaluation of TILLING in wheat by next-generation sequencing through exon capture. An oligonucleotide-based enrichment array covering ~2 Mbp of wheat coding sequence was used to carry out exon capture and sequencing on three mutagenised lines of wheat containing previously-identified mutations in the TaGA20ox1 homoeologous genes. After testing different mapping algorithms and settings, candidate SNPs were identified by mapping to the IWGSC wheat Chromosome Survey Sequences. Where sequence data for all three homoeologues were found in the reference, mutant calls were unambiguous; however, where the reference lacked one or two of the homoeologues, captured reads from these genes were mis-mapped to other homoeologues, resulting either in dilution of the variant allele frequency or assignment of mutations to the wrong homoeologue. Competitive PCR assays were used to validate the putative SNPs and estimate cut-off levels for SNP filtering. At least 464 high-confidence SNPs were detected across the three mutagenized lines, including the three known alleles in TaGA20ox1, indicating a mutation rate of ~35 SNPs per Mb, similar to that estimated by PCR-based TILLING. This demonstrates the feasibility of using exon capture for genome re-sequencing as a method of mutation detection in polyploid wheat, but accurate mutation calling will require an improved genomic reference with more comprehensive coverage of homoeologues.
X-linked Alport syndrome: An SSCP-based mutation survey over all 51 exons of the COL4A5 gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Renieri, A.; Bruttini, M.; Galli, L.
1996-06-01
The COL4A5 gene encodes the {alpha}5 (type IV) collagen chain and is defective in X-linked Alport syndrome (AS). Here, we report the first systematic analysis of all 51 exons of COL4A5 gene in a series of 201 Italian AS patients. We have previously reported nine major rearrangements, as well as 18 small mutations identified in the same patient series by SSCP analysis of several exons. After systematic analysis of all 51 exons of COL4A5, we have now identified 30 different mutations: 10 glycine substitutions in the triple helical domain of the protein, 9 frameshift mutations, 4 in-frame deletions, 1 startmore » codon, 1 nonsense, and 5 splice-site mutations. These mutations were either unique or found in two unrelated families, thus excluding the presence of a common mutation in the coding part of the gene. Overall, mutations were detected in only 45% of individuals with a certain or likely diagnosis of X-linked AS. This finding suggests that mutations in noncoding segments of COL4A5 account for a high number of X-linked AS cases. An alternative hypothesis is the presence of locus heterogeneity, even within the X-linked form of the disease. A genotype/phenotype comparison enabled us to better substantiate a significant correlation between the degree of predicted disruption of the {alpha}5 chain and the severity of phenotype in affected male individuals. Our study has significant implications in the diagnosis and follow-up of AS patients. 44 refs., 3 figs., 4 tabs.« less
A novel CYP27B1 mutation causes a feline vitamin D-dependent rickets type IA.
Grahn, Robert A; Ellis, Melanie R; Grahn, Jennifer C; Lyons, Leslie A
2012-08-01
A 12-week-old domestic cat presented at a local veterinary clinic with hypocalcemia and skeletal abnormalities suggestive of rickets. Osteomalacia (rickets) is a disease caused by impaired bone mineralization leading to an increased prevalence of fractures and deformity. Described in a variety of species, rickets is most commonly caused by vitamin D or calcium deficiencies owing to both environmental and or genetic abnormalities. Vitamin D-dependent rickets type 1A (VDDR-1A) is a result of the enzymatic pathway defect caused by mutations in the 25-hydroxyvitamin D(3)-1-alpha-hydroxylase gene [cytochrome P27 B1 (CYP27B1)]. Calcitriol, the active form of vitamin D(3), regulates calcium homeostasis, which requires sufficient dietary calcium availability and correct hormonal function for proper bone growth and maintenance. Patient calcitriol concentrations were low while calcidiol levels were normal suggestive of VDDR-1A. The entire DNA coding sequencing of CYP27B1 was evaluated. The affected cat was wild type for previously identified VDDR-1A causative mutations. However, six novel mutations were identified, one of which was a nonsense mutation at G637T in exon 4. The exon 4 G637T nonsense mutation results in a premature protein truncation, changing a glutamic acid to a stop codon, E213X, likely causing the clinical presentation of rickets. The previously documented genetic mutation resulting in feline VDDR-1A rickets, as well as the case presented in this research, result from novel exon 4 CYP27B1 mutations, thus exon 4 should be the initial focus of future sequencing efforts.
Distribution of MICA alleles and haplotypes associated with HLA in the Korean population.
Pyo, Chul-Woo; Hur, Seong-Suk; Kim, Yang-Kyum; Choi, Hee-Baeg; Kim, Tae-Yoon; Kim, Tai-Gyu
2003-03-01
The MICA (MHC class I chain-related gene A) is a polymorphic gene located 46 kb centromeric of the HLA-B gene, and is preferentially expressed in epithelial cells and intestinal mucosa. The MICA gene, similar to human leukocyte antigen (HLA) class I, displays a high degree of genetic polymorphism in exons 2, 3, 4, and 5, amounting to 54 alleles. In this study, we investigated the polymorphisms at exons coding for extracellular domains (exons 2, 3, and 4), and the GCT repeat polymorphism at the transmembrane (exon 5) of MICA in 199 unrelated healthy Koreans. Eight alleles were observed in the Korean population, with allele frequencies for MICA*010, MICA*00201, MICA*027, MICA*004, MICA*012, MICA*00801, MICA*00901, and MICA*00701 being 18.3%, 17.8%, 13.6%, 12.3%, 11.1%, 10.8%, 10.6%, and 3.3%, respectively. Strong linkage disequilibria were also observed between the MICA and HLA-B gene-MICA*00201-B58, MICA*004-B44, MICA*00701-B27, MICA*00801-B60, MICA*00901-B51, MICA*010-B62, MICA*012-B54, and MICA*027-B61. In the analysis of the haplotypes of HLA class I genes (HLA-A, B, and C) and the MICA, the most common haplotype was MICA*004-A33-B44-Cw*07, followed by MICA*00201-A2-B58-Cw*0302 and MICA*012-A2-B54-Cw*0102. The MICA null haplotype might be identified in the HLA-B48 homozygous individual. These results will provide an understanding of the role of MICA in transplantation, disease association, and population analyses in Koreans.
ANXA11 mutations prevail in Chinese ALS patients with and without cognitive dementia.
Zhang, Kang; Liu, Qing; Liu, Keqiang; Shen, Dongchao; Tai, Hongfei; Shu, Shi; Ding, Qingyun; Fu, Hanhui; Liu, Shuangwu; Wang, Zhili; Li, Xiaoguang; Liu, Mingsheng; Zhang, Xue; Cui, Liying
2018-06-01
To investigate the genetic contribution of ANXA11 , a gene associated with amyotrophic lateral sclerosis (ALS), in Chinese ALS patients with and without cognitive dementia. Sequencing all the coding exons of ANXA11 and intron-exon boundaries in 18 familial amyotrophic lateral sclerosis (FALS), 353 unrelated sporadic amyotrophic lateral sclerosis (SALS), and 12 Chinese patients with ALS-frontotemporal lobar dementia (ALS-FTD). The transcripts in peripheral blood generated from a splicing mutation were examined by reverse transcriptase PCR. We identified 6 nonsynonymous heterozygous mutations (5 novel and 1 recurrent), 1 splice site mutation, and 1 deletion of 10 amino acids (not accounted in the mutant frequency) in 11 unrelated patients, accounting for a mutant frequency of 5.6% (1/18) in FALS, 2.3% (8/353) in SALS, and 8.3% (1/12) in ALS-FTD. The deletion of 10 amino acids was detected in 1 clinically undetermined male with an ALS family history who had atrophy in hand muscles and myotonic discharges revealed by EMG. The novel p. P36R mutation was identified in 1 FALS index, 1 patient with SALS, and 1 ALS-FTD. The splicing mutation (c.174-2A>G) caused in-frame skipping of the entire exon 6. The rest missense mutations including p.D40G, p.V128M, p.S229R, p.R302C and p.G491R were found in 6 unrelated patients with SALS. The ANXA11 gene is one of the most frequently mutated genes in Chinese patients with SALS. A canonical splice site mutation leading to skipping of the entire exon 6 further supports the loss-of-function mechanism. In addition, the study findings further expand the ANXA11 phenotype, first highlighting its pathogenic role in ALS-FTD.
Li, Xiaoxin; Ma, Xiang; Tao, Yong
2007-06-07
To describe the clinical phenotype of X linked juvenile retinoschisis (XLRS) in 12 Chinese families with 11 different mutations in the XLRS1 (RS1) gene. Complete ophthalmic examinations were carried out in 29 affected males (12 probands), 38 heterozygous females carriers, and 100 controls. The coding regions of the RS1 gene that encodes retinoschisin were amplified by polymerase chain reaction and directly sequenced. Of the 29 male participants, 28 (96.6%) displayed typical foveal schisis. Eleven different RS1 mutations were identified in 12 families; four of these mutations, two frameshift mutations (26 del T of exon 1 and 488 del G of exon 5), and two missense mutations (Asp145His and Arg156Gly) of exon 5, had not been previously described. One non-disease-related polymorphism (NSP): 576C to T (Pro192Pro) change was also newly reported herein. We compared genotypes and observed more severe clinical features in families with the following mutations: frameshift mutation (26 del T) of exon 1, the splice donor site mutation (IVS1+2T to C),or Arg102Gln, Arg209His, and Arg213Gln mutations. Severe XLRS phenotypes are associated with the frameshift mutation 26 del T, splice donor site mutation (IVS1+2T to C), and Arg102Gln, Asp145His, Arg209His, and Arg213Gln mutations. The wide variability in the phenotype in Chinese patients with XLRS and different mutations in the RS1 gene is described. Identification of mutations in the RS1 gene and expanded information on clinical manifestations will facilitate early diagnosis, appropriate early therapy, and genetic counseling regarding the prognosis of XLRS.
Ratnam, Kavitha; Birch, David G.; Sundquist, Sanna M.; Lucero, Anna S.; Zhang, Yuhua; Meltzer, Meira; Smaoui, Nizar; Roorda, Austin
2011-01-01
Purpose. To evaluate macular cone structure in patients with X-linked retinoschisis (XLRS) caused by mutations in exon 6 of the RS1 gene. Methods. High-resolution macular images were obtained with adaptive optics scanning laser ophthalmoscopy (AOSLO) and spectral domain optical coherence tomography (SD-OCT) in two patients with XLRS and 27 age-similar healthy subjects. Retinal structure was correlated with best-corrected visual acuity, kinetic and static perimetry, fundus-guided microperimetry, full-field electroretinography (ERG), and multifocal ERG. The six coding exons and the flanking intronic regions of the RS1 gene were sequenced in each patient. Results. Two unrelated males, ages 14 and 29, with visual acuity ranging from 20/32 to 20/63, had macular schisis with small relative central scotomas in each eye. The mixed scotopic ERG b-wave was reduced more than the a-wave. SD-OCT showed schisis cavities in the outer and inner nuclear and plexiform layers. Cone spacing was increased within the largest foveal schisis cavities but was normal elsewhere. In each patient, a mutation in exon 6 of the RS1 gene was identified and was predicted to change the amino acid sequence in the discoidin domain of the retinoschisin protein. Conclusions. AOSLO images of two patients with molecularly characterized XLRS revealed increased cone spacing and abnormal packing in the macula of each patient, but cone coverage and function were near normal outside the central foveal schisis cavities. Although cone density is reduced, the preservation of wave-guiding cones at the fovea and eccentric macular regions has prognostic and therapeutic implications for XLRS patients with foveal schisis. (Clinical Trials.gov number, NCT00254605.) PMID:22110067
Ma, Xiang; Tao, Yong
2007-01-01
Purpose To describe the clinical phenotype of X linked juvenile retinoschisis (XLRS) in 12 Chinese families with 11 different mutations in the XLRS1 (RS1) gene. Methods Complete ophthalmic examinations were carried out in 29 affected males (12 probands), 38 heterozygous females carriers, and 100 controls. The coding regions of the RS1 gene that encodes retinoschisin were amplified by polymerase chain reaction and directly sequenced. Results Of the 29 male participants, 28 (96.6%) displayed typical foveal schisis. Eleven different RS1 mutations were identified in 12 families; four of these mutations, two frameshift mutations (26 del T of exon 1 and 488 del G of exon 5), and two missense mutations (Asp145His and Arg156Gly) of exon 5, had not been previously described. One non-disease-related polymorphism (NSP): 576C to T (Pro192Pro) change was also newly reported herein. We compared genotypes and observed more severe clinical features in families with the following mutations: frameshift mutation (26 del T) of exon 1, the splice donor site mutation (IVS1+2T to C),or Arg102Gln, Arg209His, and Arg213Gln mutations. Conclusions Severe XLRS phenotypes are associated with the frameshift mutation 26 del T, splice donor site mutation (IVS1+2T to C), and Arg102Gln, Asp145His, Arg209His, and Arg213Gln mutations. The wide variability in the phenotype in Chinese patients with XLRS and different mutations in the RS1 gene is described. Identification of mutations in the RS1 gene and expanded information on clinical manifestations will facilitate early diagnosis, appropriate early therapy, and genetic counseling regarding the prognosis of XLRS. PMID:17615541
Promoter mutation is a common variant in GJC2-associated Pelizaeus-Merzbacher-like disease.
Meyer, E; Kurian, M A; Morgan, N V; McNeill, A; Pasha, S; Tee, L; Younis, R; Norman, A; van der Knaap, M S; Wassmer, E; Trembath, R C; Brueton, L; Maher, E R
2011-12-01
Pelizaeus-Merzbacher-like disease (PMLD) is a clinically and genetically heterogeneous neurological disorder of cerebral hypomyelination. It is clinically characterised by early onset (usually infantile) nystagmus, impaired motor development, ataxia, choreoathetoid movements, dysarthria and progressive limb spasticity. We undertook autozygosity mapping studies in a large consanguineous family of Pakistani origin in which affected children had progressive lower limb spasticity and features of cerebral hypomyelination on MR brain imaging. SNP microarray and microsatellite marker analysis demonstrated linkage to chromosome 1q42.13-1q42.2. Direct sequencing of the gap junction protein gamma-2 gene, GJC2, identified a promoter region mutation (c.-167A>G) in the non-coding exon 1. The c.-167A>G promoter mutation was identified in a further 4 individuals from two families (who were also of Pakistani origin) with clinical and radiological features of PMLD in whom previous routine diagnostic screening of GJC2 had been reported as negative. A common haplotype was identified at the GJC2 locus in the three mutation-positive families, consistent with a common origin for the mutation and likely founder effect. This promoter mutation has only recently been reported in GJC2-PMLD but it has been postulated to affect the binding of the transcription factor SOX10 and appears to be a prevalent mutation, accounting for ~29% of reported patients with GJC2-PMLD. We propose that diagnostic screening of GJC2 should include sequence analysis of the non-coding exon 1, as well as the coding regions to avoid misdiagnosis or diagnostic delay in suspected PMLD. Copyright © 2011 Elsevier Inc. All rights reserved.
Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M
2015-05-15
The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.
GeneBuilder: interactive in silico prediction of gene structure.
Milanesi, L; D'Angelo, D; Rogozin, I B
1999-01-01
Prediction of gene structure in newly sequenced DNA becomes very important in large genome sequencing projects. This problem is complicated due to the exon-intron structure of eukaryotic genes and because gene expression is regulated by many different short nucleotide domains. In order to be able to analyse the full gene structure in different organisms, it is necessary to combine information about potential functional signals (promoter region, splice sites, start and stop codons, 3' untranslated region) together with the statistical properties of coding sequences (coding potential), information about homologous proteins, ESTs and repeated elements. We have developed the GeneBuilder system which is based on prediction of functional signals and coding regions by different approaches in combination with similarity searches in proteins and EST databases. The potential gene structure models are obtained by using a dynamic programming method. The program permits the use of several parameters for gene structure prediction and refinement. During gene model construction, selecting different exon homology levels with a protein sequence selected from a list of homologous proteins can improve the accuracy of the gene structure prediction. In the case of low homology, GeneBuilder is still able to predict the gene structure. The GeneBuilder system has been tested by using the standard set (Burset and Guigo, Genomics, 34, 353-367, 1996) and the performances are: 0.89 sensitivity and 0.91 specificity at the nucleotide level. The total correlation coefficient is 0.88. The GeneBuilder system is implemented as a part of the WebGene a the URL: http://www.itba.mi. cnr.it/webgene and TRADAT (TRAncription Database and Analysis Tools) launcher URL: http://www.itba.mi.cnr.it/tradat.
Izuogu, Osagie G; Alhasan, Abd A; Mellough, Carla; Collin, Joseph; Gallon, Richard; Hyslop, Jonathon; Mastrorosa, Francesco K; Ehrmann, Ingrid; Lako, Majlinda; Elliott, David J; Santibanez-Koref, Mauro; Jackson, Michael S
2018-04-20
Circular RNAs (circRNAs) are predominantly derived from protein coding genes, and some can act as microRNA sponges or transcriptional regulators. Changes in circRNA levels have been identified during human development which may be functionally important, but lineage-specific analyses are currently lacking. To address this, we performed RNAseq analysis of human embryonic stem (ES) cells differentiated for 90 days towards 3D laminated retina. A transcriptome-wide increase in circRNA expression, size, and exon count was observed, with circRNA levels reaching a plateau by day 45. Parallel statistical analyses, controlling for sample and locus specific effects, identified 239 circRNAs with expression changes distinct from the transcriptome-wide pattern, but these all also increased in abundance over time. Surprisingly, circRNAs derived from long non-coding RNAs (lncRNAs) were found to account for a significantly larger proportion of transcripts from their loci of origin than circRNAs from coding genes. The most abundant, circRMST:E12-E6, showed a > 100X increase during differentiation accompanied by an isoform switch, and accounts for > 99% of RMST transcripts in many adult tissues. The second most abundant, circFIRRE:E10-E5, accounts for > 98% of FIRRE transcripts in differentiating human ES cells, and is one of 39 FIRRE circRNAs, many of which include multiple unannotated exons. Our results suggest that during human ES cell differentiation, changes in circRNA levels are primarily globally controlled. They also suggest that RMST and FIRRE, genes with established roles in neurogenesis and topological organisation of chromosomal domains respectively, are processed as circular lncRNAs with only minor linear species.
Role of LRRK2 and SNCA in autosomal dominant Parkinson's disease in Turkey.
Kessler, Christoph; Atasu, Burcu; Hanagasi, Hasmet; Simón-Sánchez, Javier; Hauser, Ann-Kathrin; Pak, Meltem; Bilgic, Basar; Erginel-Unaltuna, Nihan; Gurvit, Hakan; Gasser, Thomas; Lohmann, Ebba
2018-03-01
Mutations in the LRRK2 and alpha-synuclein (SNCA) genes are well-established causes of autosomal dominant Parkinson's disease (PD). However, their frequency differs widely between ethnic groups. Only three studies have screened all coding regions of LRRK2 and SNCA in European samples so far. In Turkey, the role of LRRK2 in Parkinson's disease has been studied fragmentarily, and the incidence of SNCA copy number variations is unknown. The purpose of this study is to determine the frequency of LRRK2 and SNCA mutations in autosomal dominant PD in Turkey. We performed Sanger sequencing of all coding LRRK2 and SNCA exons in a sample of 91 patients with Parkinsonism. Copy number variations in SNCA, PRKN, PINK1, DJ1 and ATP13A2 were assessed using the MLPA method. All patients had a positive family history compatible with autosomal dominant inheritance. Known mutations in LRRK2 and SNCA were found in 3.3% of cases: one patient harbored the LRRK2 G2019S mutation, and two patients carried a SNCA gene duplication. Furthermore, we found a heterozygous deletion of PRKN exon 2 in one patient, and four rare coding variants of unknown significance (LRRK2: A211V, R1067Q, T2494I; SNCA: T72T). Genetic testing in one affected family identified the LRRK2 R1067Q variant as a possibly pathogenic substitution. Point mutations in LRRK2 and SNCA are a rare cause of autosomal dominant PD in Turkey. However, copy number variations should be considered. The unclassified variants, especially LRRK2 R1067Q, demand further investigation. Copyright © 2017. Published by Elsevier Ltd.
Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro
2008-01-03
The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13 nucleotides long open reading frame and a second exon comprising the remaining of the putative coding region. The expression of these genes is induced at10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.
Raj, Towfique; Ryan, Katie J.; Replogle, Joseph M.; Chibnik, Lori B.; Rosenkrantz, Laura; Tang, Anna; Rothamel, Katie; Stranger, Barbara E.; Bennett, David A.; Evans, Denis A.; De Jager, Philip L.; Bradshaw, Elizabeth M.
2014-01-01
We previously demonstrated that the Alzheimer's disease (AD) associated risk allele, rs3865444C, results in a higher surface density of CD33 on monocytes. Here, we find alternative splicing of exon 2 to be the primary mechanism of the genetically driven differential expression of CD33 protein. We report that the risk allele, rs3865444C, is associated with greater cell surface expression of CD33 in both subjects of European and African–American ancestry and that there is a single haplotype influencing CD33 surface expression. A meta-analysis of the two populations narrowed the number of significant SNPs in high linkage disequilibrium (LD) (r2 > 0.8) with rs3865444 to just five putative causal variants associated with increased protein expression. Using gene expression data from flow-sorted CD14+CD16− monocytes from 398 healthy subjects of three populations, we show that the rs3865444C risk allele is strongly associated with greater expression of CD33 exon 2 (pMETA = 2.36 × 10−60). Western blotting confirms increased protein expression of the full-length CD33 isoform containing exon 2 relative to the rs3865444C allele (P < 0.0001). Of the variants in strong LD with rs3865444, rs12459419, which is located in a putative SRSF2 splice site of exon 2, is the most likely candidate to mediate the altered alternative splicing of CD33's Immunoglobulin V-set domain 2 and ultimately influence AD susceptibility. PMID:24381305
Hypervariable and highly divergent intron-exon organizations in the chordate Oikopleura dioica.
Edvardsen, Rolf B; Lerat, Emmanuelle; Maeland, Anne Dorthea; Flåt, Mette; Tewari, Rita; Jensen, Marit F; Lehrach, Hans; Reinhardt, Richard; Seo, Hee-Chan; Chourrout, Daniel
2004-10-01
Oikopleura dioica is a pelagic tunicate with a very small genome and a very short life cycle. In order to investigate the intron-exon organizations in Oikopleura, we have isolated and characterized ribosomal protein EF-1alpha, Hox, and alpha-tubulin genes. Their intron positions have been compared with those of the same genes from various invertebrates and vertebrates, including four species with entirely sequenced genomes. Oikopleura genes, like Caenorhabditis genes, have introns at a large number of nonconserved positions, which must originate from late insertions or intron sliding of ancient insertions. Both species exhibit hypervariable intron-exon organization within their alpha-tubulin gene family. This is due to localization of most nonconserved intron positions in single members of this gene family. The hypervariability and divergence of intron positions in Oikopleura and Caenorhabditis may be related to the predominance of short introns, the processing of which is not very dependent upon the exonic environment compared to large introns. Also, both species have an undermethylated genome, and the control of methylation-induced point mutations imposes a control on exon size, at least in vertebrate genes. That introns placed at such variable positions in Oikopleura or C. elegans may serve a specific purpose is not easy to infer from our current knowledge and hypotheses on intron functions. We propose that new introns are retained in species with very short life cycles, because illegitimate exchanges including gene conversion are repressed. We also speculate that introns placed at gene-specific positions may contribute to suppressing these exchanges and thereby favor their own persistence.
Poultney, Christopher S.; Goldberg, Arthur P.; Drapeau, Elodie; Kou, Yan; Harony-Nicolas, Hala; Kajiwara, Yuji; De Rubeis, Silvia; Durand, Simon; Stevens, Christine; Rehnström, Karola; Palotie, Aarno; Daly, Mark J.; Ma’ayan, Avi; Fromer, Menachem; Buxbaum, Joseph D.
2013-01-01
Copy number variation (CNV) is an important determinant of human diversity and plays important roles in susceptibility to disease. Most studies of CNV carried out to date have made use of chromosome microarray and have had a lower size limit for detection of about 30 kilobases (kb). With the emergence of whole-exome sequencing studies, we asked whether such data could be used to reliably call rare exonic CNV in the size range of 1–30 kilobases (kb), making use of the eXome Hidden Markov Model (XHMM) program. By using both transmission information and validation by molecular methods, we confirmed that small CNV encompassing as few as three exons can be reliably called from whole-exome data. We applied this approach to an autism case-control sample (n = 811, mean per-target read depth = 161) and observed a significant increase in the burden of rare (MAF ≤1%) 1–30 kb CNV, 1–30 kb deletions, and 1–10 kb deletions in ASD. CNV in the 1–30 kb range frequently hit just a single gene, and we were therefore able to carry out enrichment and pathway analyses, where we observed enrichment for disruption of genes in cytoskeletal and autophagy pathways in ASD. In summary, our results showed that XHMM provided an effective means to assess small exonic CNV from whole-exome data, indicated that rare 1–30 kb exonic deletions could contribute to risk in up to 7% of individuals with ASD, and implicated a candidate pathway in developmental delay syndromes. PMID:24094742
Georgiou, Theodoros; Chuang, Jacinta L.; Wynn, R. Max; Stylianidou, Goula; Korson, Mark; Chuang, David T.
2009-01-01
We report five mutations, three of them novel, responsible for maple syrup urine disease in four unrelated Cypriot families. The five children studied are the first cases of classic maple syrup urine disease to be reported among Cypriots. The first novel mutation identified is a single-base deletion in exon 6 of the Elα gene (c.718delG), which leads to a frameshift after Ala240 and to a stop codon 89 residues further downstream. The other two novel mutations identified are in the Elβ subunit: a two-base deletion in exon 6, c.662_663delCC, which leads to a frameshift after Ala221 and creates a stop codon 17 residues further downstream, as well as a splice mutation, IVS3[+3]delA, which results in the skipping of exon 3. The two known mutations identified are in the Elα gene: the G > C transversion at the 3′-splice acceptor site, (IVS5-1G > C), which results in the deletion of the entire exon 6, and the missense mutation in exon 5 (c.632C > T), which corresponds to a p.Thr211Met substitution. The p.Thr211Met substitution is located in a potassium-ion pocket in the E1 component required for stability of the bound cofactor thiamine diphosphate. The mutant E1 protein harboring the p.Thr211Met substitution was shown unable to bind thiamine diphosphate, leading to undetectable E1 activity. PMID:19715473
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willing, M.; Deschenes, S.
We have identified a G to A substitution in the 5{prime} donor splice site of intron 18 of one COL1A1 allele in two unrelated families with osteogenesis imperfecta (OI) type I. A third OI type I family has a G to A substitution at the identical position in intron 48 of one COL1A1 allele. Both mutations abolish normal splicing and lead to reduced steady-state levels of mRNA from the mutant COL1A1 allele. The intron 18 mutation leads to both exon 18 skipping in the mRNA and to utilization of a single alternative splice site near the 3{prime} end of exonmore » 18. The latter results in deletion of the last 8 nucleotides of exon 18 from the mRNA, a shift in the translational reading-frame, and the creation of a premature termination codon in exon 19. Of the potential alternative 5{prime} splice sites in exon 18 and intron 18, the one utilized has a surrounding nucleotide sequence which most closely resembles that of the natural splice site. Although a G to A mutation was detected at the identical position in intron 48 of one COL1A1 allele in another OI type I family, nine complex alternative splicing patterns were identified by sequence analysis of cDNA clones derived from fibroblast mRNA from this cell strain. All result in partial or complete skipping of exon 48, with in-frame deletions of portions of exons 47 and/or 49. The different patterns of RNA splicing were not explained by their sequence homology with naturally occuring 5{prime} splice sites, but rather by recombination between highly homologous exon sequences, suggesting that we may not have identified the major splicing alternative(s) in this cell strain. Both G to A mutations result in decreased production of type I collagen, the common biochemical correlate of OI type I.« less
Plasmodium vivax rhomboid-like protease 1 gene diversity in Thailand.
Mataradchakul, Touchchapol; Uthaipibull, Chairat; Nosten, Francois; Vega-Rodriguez, Joel; Jacobs-Lorena, Marcelo; Lek-Uthai, Usa
2017-10-01
Plasmodium vivax infection remains a major public health problem, especially along the Thailand border regions. We examined the genetic diversity of this parasite by analyzing single-nucleotide polymorphisms (SNPs) of the P. vivax rhomboid-like protease 1 gene (Pvrom1) in parasites collected from western (Tak province, Thai-Myanmar border) and eastern (Chanthaburi province, Thai-Cambodia border) regions. Data were collected by a cross-sectional survey, consisting of 47 and 45 P. vivax-infected filter paper-spotted blood samples from the western and eastern regions of Thailand, respectively during September 2013 to May 2014. Extracted DNA was examined for presence of P. vivax using Plasmodium species-specific nested PCR. Pvrom1 gene was PCR amplified, sequenced and the SNP diversity was analyzed using F-STAT, DnaSP, MEGA and LIAN programs. Comparison of sequences of the 92 Pvrom1 831-base open reading frames with that of a reference sequence (GenBank acc. no. XM001615211) revealed 17 samples with a total of 8 polymorphic sites, consisting of singleton (exon 3, nt 645) and parsimony informative (exon 1, nt 22 and 39; exon 3, nt 336, 537 and 656; and exon 4, nt 719 and 748) sites, which resulted in six different deduced Pvrom1 variants. Non-synonymous to synonymous substitutions ratio estimated by the DnaSP program was 1.65 indicating positive selection, but the Z-tests of selection showed no significant deviations from neutrality for Pvrom1 samples from western region of Thailand. In addition McDonald Kreitman test (MK) showed not significant, and Fst values are not different between the two regions and the regions combined. Interestingly, only Pvrom1 exon 2 was the most conserved sequences among the four exons. The relatively high degree of Pvrom1 polymorphism suggests that the protein is important for parasite survival in face of changes in both insect vector and human populations. These polymorphisms could serve as a sensitive marker for studying plasmodial genetic diversity. The significance of Pvrom1 conserved exon 2 sequence remains to be investigated. Copyright © 2017 Mahidol University. Published by Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Poliakov, Alexander; Couronne, Olivier
2002-11-04
Aligning large vertebrate genomes that are structurally complex poses a variety of problems not encountered on smaller scales. Such genomes are rich in repetitive elements and contain multiple segmental duplications, which increases the difficulty of identifying true orthologous SNA segments in alignments. The sizes of the sequences make many alignment algorithms designed for comparing single proteins extremely inefficient when processing large genomic intervals. We integrated both local and global alignment tools and developed a suite of programs for automatically aligning large vertebrate genomes and identifying conserved non-coding regions in the alignments. Our method uses the BLAT local alignment program tomore » find anchors on the base genome to identify regions of possible homology for a query sequence. These regions are postprocessed to find the best candidates which are then globally aligned using the AVID global alignment program. In the last step conserved non-coding segments are identified using VISTA. Our methods are fast and the resulting alignments exhibit a high degree of sensitivity, covering more than 90% of known coding exons in the human genome. The GenomeVISTA software is a suite of Perl programs that is built on a MySQL database platform. The scheduler gets control data from the database, builds a queve of jobs, and dispatches them to a PC cluster for execution. The main program, running on each node of the cluster, processes individual sequences. A Perl library acts as an interface between the database and the above programs. The use of a separate library allows the programs to function independently of the database schema. The library also improves on the standard Perl MySQL database interfere package by providing auto-reconnect functionality and improved error handling.« less
Kühne, Annett; Kaiser, Rolf; Schirmer, Markus; Heider, Ulrike; Muhlke, Sabine; Niere, Wiebke; Overbeck, Tobias; Hohloch, Karin; Trümper, Lorenz; Sezer, Orhan; Brockmöller, Jürgen
2007-07-01
Melphalan is widely used in the treatment of multiple myeloma. Pharmacokinetics of this alkylating drug shows high inter-individual variability. As melphalan is a phenylalanine derivative, the pharmacokinetic variability may be determined by genetic polymorphisms in the L-type amino acid transporters LAT1 (SLC7A5) and LAT2 (SLC7A8). Pharmacokinetics were analysed in 64 patients after first administration of intravenous melphalan. Severity of side effects was documented according to WHO criteria. Genomic DNA was analysed for polymorphisms in LAT1 and LAT2 by sequencing of the entire coding region, intron-exon boundaries and 2 kb upstream promoter region. Selected polymorphisms in the common heavy chain of both transporters, the protein 4F2hc (SLC3A2), were analysed by single nucleotide primer extension. Melphalan pharmacokinetics was highly variable with up to 6.2-fold differences in total clearance. A total of 44 polymorphisms were identified in LAT1 and 21 polymorphisms in LAT2. From all variants, only five were in the coding region and only one heterozygous non-synonymous polymorphism (Ala94Thr) was found in LAT2. Numerous polymorphisms were found in the LAT1 and LAT2 5'-flanking regions but did not correlate with expression of the respective genes. No significant correlations could be observed between the polymorphisms in 4F2hc, LAT1, and LAT2 with melphalan pharmacokinetics or with melphalan side effects. The study confirmed that these transporter genes are highly conserved, particularly in the coding sequences. Genetic variation in 4F2hc, LAT1, and LAT2 does not appear to be a major cause of inter-individual variability in pharmacokinetics and of adverse reactions to melphalan.
Novel mutations in the TULP1 gene causing autosomal recessive retinitis pigmentosa.
Paloma, E; Hjelmqvist, L; Bayés, M; García-Sandoval, B; Ayuso, C; Balcells, S; Gonzàlez-Duarte, R
2000-03-01
To assess the contribution of TULP1 to autosomal recessive retinitis pigmentosa (arRP). Fifteen exons of the gene were screened by single-strand conformation polymorphism analysis of 7 (of 49) arRP pedigrees showing cosegregation with TULP1 locus markers. In one of the seven families two allelic mutations, IVS4-2delAGA and c.937delC, were found in exons 5 and 10, respectively. Two novel mutations in TULP1 were found to be associated with arRP. That they both compromise the gene product supports their pathogenicity. This gene was present in no more than 2% of a panel of 49 Spanish families affected by arRP.
FANCA Gene Mutations with 8 Novel Molecular Changes in Indian Fanconi Anemia Patients.
Solanki, Avani; Mohanty, Purvi; Shukla, Pallavi; Rao, Anita; Ghosh, Kanjaksha; Vundinti, Babu Rao
2016-01-01
Fanconi anemia (FA), a rare heterogeneous genetic disorder, is known to be associated with 19 genes and a spectrum of clinical features. We studied FANCA molecular changes in 34 unrelated and 2 siblings of Indian patients with FA and have identified 26 different molecular changes of FANCA gene, of which 8 were novel mutations (a small deletion c.2500delC, 4 non-sense mutations c.2182C>T, c.2630C>G, c.3677C>G, c.3189G>A; and 3 missense mutations; c.1273G>C, c.3679 G>C, and c.3992 T>C). Among these only 16 patients could be assigned FA-A complementation group, because we could not confirm single exon deletions detected by MLPA or cDNA amplification by secondary confirmation method and due to presence of heterozygous non-pathogenic variations or heterozygous pathogenic mutations. An effective molecular screening strategy should be developed for confirmation of these mutations and determining the breakpoints for single exon deletions.
FANCA Gene Mutations with 8 Novel Molecular Changes in Indian Fanconi Anemia Patients
Solanki, Avani; Mohanty, Purvi; Shukla, Pallavi; Rao, Anita; Ghosh, Kanjaksha; Vundinti, Babu Rao
2016-01-01
Fanconi anemia (FA), a rare heterogeneous genetic disorder, is known to be associated with 19 genes and a spectrum of clinical features. We studied FANCA molecular changes in 34 unrelated and 2 siblings of Indian patients with FA and have identified 26 different molecular changes of FANCA gene, of which 8 were novel mutations (a small deletion c.2500delC, 4 non-sense mutations c.2182C>T, c.2630C>G, c.3677C>G, c.3189G>A; and 3 missense mutations; c.1273G>C, c.3679 G>C, and c.3992 T>C). Among these only 16 patients could be assigned FA-A complementation group, because we could not confirm single exon deletions detected by MLPA or cDNA amplification by secondary confirmation method and due to presence of heterozygous non-pathogenic variations or heterozygous pathogenic mutations. An effective molecular screening strategy should be developed for confirmation of these mutations and determining the breakpoints for single exon deletions. PMID:26799702
DU, Zhi-Heng; Liu, Zong-Yue; Bai, Xiu-Juan
2010-06-01
Using single-strand conformation polymorphism (PCR-SSCP) and DNA sequencing, single nucleotide polymorphisms (SNPs) of growth hormone receptor (GHR) gene were detected in an arctic fox population. Correlation analysis between GHR polymorphisms and growth traits were carried out using the appropriate model. Four SNPs, G3A in the 5'UTR, C99T in the first exon, T59C and G65A in the fifth exon were identified on the arctic fox GHR gene. The G3A and C99T polymorphisms of GHR were associated with female fox body weight (Pamp;0.05) and the T59C and G65A polymorphisms of GHR were associated with male fox body weight (Pamp;0.05) and the skin length of the female fox (Pamp;0.01). Therefore, marker assistant selection on body weight and skin length of arctic foxes using these SNPs can be applied to get big and high quality arctic foxes.
Ren, H; Stiles, G L
1994-01-01
The human A1 adenosine receptor gene contains six exons with exons 1, 2, 3, 4, and part of 5 representing 5' untranslated regions. Reverse transcription-PCR with exon-specific primers showed two distinct transcripts containing either exons 3, 5, and 6 or exons 4, 5, and 6, with exons 3 and 4 being mutually exclusive. No mature mRNAs containing exons 1 and 2 have been detected. All human tissues that express any A1 receptors contain mRNA with exons 4, 5, and 6. Tissues which express high levels of A1 receptors contain mRNA with exons 3, 5, and 6. Exon 4 contains two upstream ATG codons whereas exon 3 contains none. COS cells transfected with expression vectors containing exon 4 (exons 1-6, 3-6, or Ex4-6) express much lower levels of A1 receptors than vectors without exon 4 (exons 3, 5, and 6). Mutation of upstream ATG codons in exon 4 leads to 3- to 7-fold increased A1 receptor expression, up to the level seen with the construct containing exons 3, 5, and 6. Thus, in human tissues "basal" levels of A1 receptors can be expressed by use of mRNA containing exons 4, 5, and 6, but when high levels are needed, alternative transcripts with exons 3, 5, and 6 are produced. Images PMID:8197148
Kralovicova, Jana; Knut, Marcin; Cross, Nicholas C. P.; Vorechovsky, Igor
2015-01-01
The auxiliary factor of U2 small nuclear RNA (U2AF) is a heterodimer consisting of 65- and 35-kD proteins that bind the polypyrimidine tract (PPT) and AG dinucleotides at the 3′ splice site (3′ss). The gene encoding U2AF35 (U2AF1) is alternatively spliced, giving rise to two isoforms U2AF35a and U2AF35b. Here, we knocked down U2AF35 and each isoform and characterized transcriptomes of HEK293 cells with varying U2AF35/U2AF65 and U2AF35a/b ratios. Depletion of both isoforms preferentially modified alternative RNA processing events without widespread failure to recognize 3′ss or constitutive exons. Over a third of differentially used exons were terminal, resulting largely from the use of known alternative polyadenylation (APA) sites. Intronic APA sites activated in depleted cultures were mostly proximal whereas tandem 3′UTR APA was biased toward distal sites. Exons upregulated in depleted cells were preceded by longer AG exclusion zones and PPTs than downregulated or control exons and were largely activated by PUF60 and repressed by CAPERα. The U2AF(35) repression and activation was associated with a significant interchange in the average probabilities to form single-stranded RNA in the optimal PPT and branch site locations and sequences further upstream. Although most differentially used exons were responsive to both U2AF subunits and their inclusion correlated with U2AF levels, a small number of transcripts exhibited distinct responses to U2AF35a and U2AF35b, supporting the existence of isoform-specific interactions. These results provide new insights into function of U2AF and U2AF35 in alternative RNA processing. PMID:25779042
AB033. Preimplantation genetic diagnosis of spinal muscular atrophy in Vietnam
Khoa, Tran Van; Nga, Nguyen Thi Thanh; Tao, Nguyen Dinh; Sang, Trieu Tien; Giang, Ngo Truong; Dung, Vu Chi
2015-01-01
Objective Spinal muscular atrophy (SMA) is a severe neurodegenerative autosomal recessive disorder. Most of patients are caused by the homozygous absence of exon 7 of the telomeric copy of the SMN gene (SMNt) on chromosome 5. Setting up a molecular diagnostic protocol for detecting exon 7 gen SMNT homozygous deletion in single cell is basic to preimplantation genetic diagnosis of spinal muscular atrophy. Methods This study was carried out on 17 patients and their parents. Firstly, lymphocytes of patients and their parents were isolated from fresh blood by ficoll. Taking a lymphocyte on stereoscopic microscope, lysing the cell, amplifying whole genome, then amplifying exon 7 of SMNT gene by using a polymerase chain reaction, followed by HinfI restriction digest enzyme of the PCR enabling the important SMNT gene to be distinguished from the centromic SMN gene (SMNc) which has no clinical phenotype to detect mutation. Electrophoresis PCR products after digesting by restriction enzyme and analysis. Besides, the minisequencing technique has also been used to detect the absence of exon 7 of SMNT gene based on the difference of one nucleotide at 214-position in exon 7 (C-SMNT, T-SMNc). Secondly, the application of the protocol was set up on one lymphocyte to preimplantation genetic diagnosis of spinal muscular atrophy on biopsied blastomeres. Results Two different protocols which were PCR-RFLP and minisequencing, were set up on 200 lymphocytes from 17 patients and their parents to screen the homozygous deletion in exon 7 SMNT gene with the PCR efficiency in 96%. The results were similar with the gene diagnosed from fresh blood. The methods were also efficient, providing interpretable result in 96.55% (28/29) of the blastomeres tested. Three couples were treated using this method. Three normal embryos were transfer which resulted in one clinical pregnancy. Conclusions We have successfully applied the technique of PCR-RFLP and minisequencing for the preimplantation genetic diagnosis of spinal muscular atrophy.
Sirdah, Mahmoud M; Shubair, Mohammad E; Al-Kahlout, Mustafa S; Al-Tayeb, Jamal M; Prchal, Josef T; Reading, N Scott
2017-07-01
Glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common X-linked inherited enzymopathic disorder affecting more than 500 million people worldwide. It has so far been linked to 217 distinct genetic variants in the exons and exon-intron boundaries of the G6PD gene, giving rise to a wide range of biochemical heterogeneity and clinical manifestations. Reports from different settings suggested the association of intronic and other mutations outside the reading frame of the G6PD gene with reduced enzyme activity and presenting clinical symptoms. The present study aimed to investigate any association of other variations apart of the exonic or exonic intronic boundaries in the development of G6PD deficiency. Sixty-seven unrelated Palestinian children admitted to the pediatric hospital with hemolytic crises due to G6PD deficiency were studied. In our Palestinian cohort of 67 [59 males (M) and 8 females (F)] G6PD-deficient children, previously hospitalized for acute hemolytic anemia due to favism, molecular sequencing of the G6PD gene revealed four cases (3M and 1F) that did not have any of the variants known to cause G6PD deficiency, but the 3' UTR c.*+357A>G (rs1050757) polymorphism in association with IVS 11 (c.1365-13T>C; rs2071429), and c.1311C>T (rs2230037). We now provide an additional evidence form Palestinian G6PD-deficient subjects for a possible role of 3' UTR c.*+357 A>G, c.1365-13T>C, and/or c.1311C>T polymorphism for G6PD deficiency, suggesting that not only a single variation in the exonic or exonic intronic boundaries, but also a haplotype of G6PD should considered as a cause for G6PD deficiency.
Characterisation of CDKL5 Transcript Isoforms in Human and Mouse
Dando, Owen; Landsberger, Nicoletta; Kilstrup-Nielsen, Charlotte; Kind, Peter C.; Bailey, Mark E. S.; Cobb, Stuart R.
2016-01-01
Mutations in the X-linked Cyclin-Dependent Kinase-Like 5 gene (CDKL5) cause early onset infantile spasms and subsequent severe developmental delay in affected children. Deleterious mutations have been reported to occur throughout the CDKL5 coding region. Several studies point to a complex CDKL5 gene structure in terms of exon usage and transcript expression. Improvements in molecular diagnosis and more extensive research into the neurobiology of CDKL5 and pathophysiology of CDKL5 disorders necessitate an updated analysis of the gene. In this study, we have analysed human and mouse CDKL5 transcript patterns both bioinformatically and experimentally. We have characterised the predominant brain isoform of CDKL5, a 9.7 kb transcript comprised of 18 exons with a large 6.6 kb 3’-untranslated region (UTR), which we name hCDKL5_1. In addition we describe new exonic regions and a range of novel splice and UTR isoforms. This has enabled the description of an updated gene model in both species and a standardised nomenclature system for CDKL5 transcripts. Profiling revealed tissue- and brain development stage-specific differences in expression between transcript isoforms. These findings provide an essential backdrop for the diagnosis of CDKL5-related disorders, for investigations into the basic biology of this gene and its protein products, and for the rational design of gene-based and molecular therapies for these disorders. PMID:27315173
Characterisation of CDKL5 Transcript Isoforms in Human and Mouse.
Hector, Ralph D; Dando, Owen; Landsberger, Nicoletta; Kilstrup-Nielsen, Charlotte; Kind, Peter C; Bailey, Mark E S; Cobb, Stuart R
2016-01-01
Mutations in the X-linked Cyclin-Dependent Kinase-Like 5 gene (CDKL5) cause early onset infantile spasms and subsequent severe developmental delay in affected children. Deleterious mutations have been reported to occur throughout the CDKL5 coding region. Several studies point to a complex CDKL5 gene structure in terms of exon usage and transcript expression. Improvements in molecular diagnosis and more extensive research into the neurobiology of CDKL5 and pathophysiology of CDKL5 disorders necessitate an updated analysis of the gene. In this study, we have analysed human and mouse CDKL5 transcript patterns both bioinformatically and experimentally. We have characterised the predominant brain isoform of CDKL5, a 9.7 kb transcript comprised of 18 exons with a large 6.6 kb 3'-untranslated region (UTR), which we name hCDKL5_1. In addition we describe new exonic regions and a range of novel splice and UTR isoforms. This has enabled the description of an updated gene model in both species and a standardised nomenclature system for CDKL5 transcripts. Profiling revealed tissue- and brain development stage-specific differences in expression between transcript isoforms. These findings provide an essential backdrop for the diagnosis of CDKL5-related disorders, for investigations into the basic biology of this gene and its protein products, and for the rational design of gene-based and molecular therapies for these disorders.
Sanger sequencing as a first-line approach for molecular diagnosis of Andersen-Tawil syndrome.
Totomoch-Serra, Armando; Marquez, Manlio F; Cervantes-Barragán, David E
2017-01-01
In 1977, Frederick Sanger developed a new method for DNA sequencing based on the chain termination method, now known as the Sanger sequencing method (SSM). Recently, massive parallel sequencing, better known as next-generation sequencing (NGS), is replacing the SSM for detecting mutations in cardiovascular diseases with a genetic background. The present opinion article wants to remark that "targeted" SSM is still effective as a first-line approach for the molecular diagnosis of some specific conditions, as is the case for Andersen-Tawil syndrome (ATS). ATS is described as a rare multisystemic autosomal dominant channelopathy syndrome caused mainly by a heterozygous mutation in the KCNJ2 gene . KCJN2 has particular characteristics that make it attractive for "directed" SSM. KCNJ2 has a sequence of 17,510 base pairs (bp), and a short coding region with two exons (exon 1=166 bp and exon 2=5220 bp), half of the mutations are located in the C-terminal cytosolic domain, a mutational hotspot has been described in residue Arg218, and this gene explains the phenotype in 60% of ATS cases that fulfill all the clinical criteria of the disease. In order to increase the diagnosis of ATS we urge cardiologists to search for facial and muscular abnormalities in subjects with frequent ventricular arrhythmias (especially bigeminy) and prominent U waves on the electrocardiogram.
Sanger sequencing as a first-line approach for molecular diagnosis of Andersen-Tawil syndrome
Totomoch-Serra, Armando; Marquez, Manlio F.; Cervantes-Barragán, David E.
2017-01-01
In 1977, Frederick Sanger developed a new method for DNA sequencing based on the chain termination method, now known as the Sanger sequencing method (SSM). Recently, massive parallel sequencing, better known as next-generation sequencing (NGS), is replacing the SSM for detecting mutations in cardiovascular diseases with a genetic background. The present opinion article wants to remark that “targeted” SSM is still effective as a first-line approach for the molecular diagnosis of some specific conditions, as is the case for Andersen-Tawil syndrome (ATS). ATS is described as a rare multisystemic autosomal dominant channelopathy syndrome caused mainly by a heterozygous mutation in the KCNJ2 gene . KCJN2 has particular characteristics that make it attractive for “directed” SSM. KCNJ2 has a sequence of 17,510 base pairs (bp), and a short coding region with two exons (exon 1=166 bp and exon 2=5220 bp), half of the mutations are located in the C-terminal cytosolic domain, a mutational hotspot has been described in residue Arg218, and this gene explains the phenotype in 60% of ATS cases that fulfill all the clinical criteria of the disease. In order to increase the diagnosis of ATS we urge cardiologists to search for facial and muscular abnormalities in subjects with frequent ventricular arrhythmias (especially bigeminy) and prominent U waves on the electrocardiogram. PMID:29093808
Agirbasli, Deniz; Hyatt, Tommy; Agirbasli, Mehmet
2018-04-26
This is a case report of a 38-year-old Syrian refugee male with early-onset extensive atherosclerosis. The physical and laboratory examination were remarkable with severe xanthomas in the upper and lower extremities and with low-density lipoprotein cholesterol (LDL-C) 417 mg/dL, total cholesterol 495 mg/dL, high-density lipoprotein cholesterol 30 mg/dL, and triglycerides 242 mg/dL. LDL-C level responded poorly to the high-dose statin treatment. The genetic analysis indicated that the patient had a large homozygous deletion in LDL receptor gene including the exons 7-14. A 12-kb deletion had occurred between the 2 Alu repetitive sequences that were oriented in opposite directions, one in intron 6 and the other in intron 14. This deletion eliminated exons 7-14, which exactly corresponded to the entire exon sequence coding the epidermal growth factor precursor homology domain. This deletion in LDL receptor was previously reported. This rare case of homozygous familial hypercholesterolemia presenting with multiple large and widely distributed xanthomas implicates the need for novel treatment options in familial hypercholesterolemia patients. The case is a Syrian refugee and emphasizes the urgent need to address orphan disease in refugee populations throughout the world. Copyright © 2018 National Lipid Association. Published by Elsevier Inc. All rights reserved.
In situ genetic correction of F8 intron 22 inversion in hemophilia A patient-specific iPSCs.
Wu, Yong; Hu, Zhiqing; Li, Zhuo; Pang, Jialun; Feng, Mai; Hu, Xuyun; Wang, Xiaolin; Lin-Peng, Siyuan; Liu, Bo; Chen, Fangping; Wu, Lingqian; Liang, Desheng
2016-01-08
Nearly half of severe Hemophilia A (HA) cases are caused by F8 intron 22 inversion (Inv22). This 0.6-Mb inversion splits the 186-kb F8 into two parts with opposite transcription directions. The inverted 5' part (141 kb) preserves the first 22 exons that are driven by the intrinsic F8 promoter, leading to a truncated F8 transcript due to the lack of the last 627 bp coding sequence of exons 23-26. Here we describe an in situ genetic correction of Inv22 in patient-specific induced pluripotent stem cells (iPSCs). By using TALENs, the 627 bp sequence plus a polyA signal was precisely targeted at the junction of exon 22 and intron 22 via homologous recombination (HR) with high targeting efficiencies of 62.5% and 52.9%. The gene-corrected iPSCs retained a normal karyotype following removal of drug selection cassette using a Cre-LoxP system. Importantly, both F8 transcription and FVIII secretion were rescued in the candidate cell types for HA gene therapy including endothelial cells (ECs) and mesenchymal stem cells (MSCs) derived from the gene-corrected iPSCs. This is the first report of an efficient in situ genetic correction of the large inversion mutation using a strategy of targeted gene addition.