flanking intron sequences: Topics by Science.gov

Sample records for flanking intron sequences

Nucleotide sequence of the ribosomal RNA gene of Physarum polycephalum: intron 2 and its flanking regions of the 26S rRNA gene.

PubMed Central

Nomiyama, H; Kuhara, S; Kukita, T; Otsuka, T; Sakaki, Y

1981-01-01

The 26S ribosomal RNA gene of Physarum polycephalum is interrupted by two introns, and we have previously determined the sequence of one of them (intron 1) (Nomiyama et al. Proc.Natl.Acad.Sci.USA 78, 1376-1380, 1981). In this study we sequenced the second intron (intron 2) of about 0.5 kb length and its flanking regions, and found that one nucleotide at each junction is identical in intron 1 and intron 2, though the junction regions share no other sequence homology. Comparison of the flanking exon sequences to E. coli 23S rRNA sequences shows that conserved sequences are interspersed with tracts having little homology. In particular, the region encompassing the intron 2 interruption site is highly conserved. The E. coli ribosomal protein L1 binding region is also conserved. Images PMID:6171776
Development of single-copy nuclear intron markers for species-level phylogenetics: Case study with Paullinieae (Sapindaceae).

PubMed

Chery, Joyce G; Sass, Chodon; Specht, Chelsea D

2017-09-01

We developed a bioinformatic pipeline that leverages a publicly available genome and published transcriptomes to design primers in conserved coding sequences flanking targeted introns of single-copy nuclear loci. Paullinieae (Sapindaceae) is used to demonstrate the pipeline. Transcriptome reads phylogenetically closer to the lineage of interest are aligned to the closest genome. Single-nucleotide polymorphisms are called, generating a "pseudoreference" closer to the lineage of interest. Several filters are applied to meet the criteria of single-copy nuclear loci with introns of a desired size. Primers are designed in conserved coding sequences flanking introns. Using this pipeline, we developed nine single-copy nuclear intron markers for Paullinieae. This pipeline is highly flexible and can be used for any group with available genomic and transcriptomic resources. This pipeline led to the development of nine variable markers for phylogenetic study without generating sequence data de novo.
COL1A1 transgene expression in stably transfected osteoblastic cells. Relative contributions of first intron, 3'-flanking sequences, and sequences derived from the body of the human COL1A1 minigene

NASA Technical Reports Server (NTRS)

Breault, D. T.; Lichtler, A. C.; Rowe, D. W.

1997-01-01

Collagen reporter gene constructs have be used to identify cell-specific sequences needed for transcriptional activation. The elements required for endogenous levels of COL1A1 expression, however, have not been elucidated. The human COL1A1 minigene is expressed at high levels and likely harbors sequence elements required for endogenous levels of activity. Using stably transfected osteoblastic Py1a cells, we studied a series of constructs (pOBColCAT) designed to characterize further the elements required for high level of expression. pOBColCAT, which contains the COL1A1 first intron, was expressed at 50-100-fold higher levels than ColCAT 3.6, which lacks the first intron. This difference is best explained by improved mRNA processing rather than a transcriptional effect. Furthermore, variation in activity observed with the intron deletion constructs is best explained by altered mRNA splicing. Two major regions of the human COL1A1 minigene, the 3'-flanking sequences and the minigene body, were introduced into pOBColCAT to assess both transcriptional enhancing activity and the effect on mRNA stability. Analysis of the minigene body, which includes the first five exons and introns fused with the terminal six introns and exons, revealed an orientation-independent 5-fold increase in CAT activity. In contrast the 3'-flanking sequences gave rise to a modest 61% increase in CAT activity. Neither region increased the mRNA half-life of the parent construct, suggesting that CAT-specific mRNA instability elements may serve as dominant negative regulators of stability. This study suggests that other sites within the body of the COL1A1 minigene are important for high expression, e.g. during periods of rapid extracellular matrix production.
Evaluation of the mechanisms of intron loss and gain in the social amoebae Dictyostelium.

PubMed

Ma, Ming-Yue; Che, Xun-Ru; Porceddu, Andrea; Niu, Deng-Ke

2015-12-18

Spliceosomal introns are a common feature of eukaryotic genomes. To approach a comprehensive understanding of intron evolution on Earth, studies should look beyond repeatedly studied groups such as animals, plants, and fungi. The slime mold Dictyostelium belongs to a supergroup of eukaryotes not covered in previous studies. We found 441 precise intron losses in Dictyostelium discoideum and 202 precise intron losses in Dictyostelium purpureum. Consistent with these observations, Dictyostelium discoideum was found to have significantly more copies of reverse transcriptase genes than Dictyostelium purpureum. We also found that the lost introns are significantly further from the 5' end of genes than the conserved introns. Adjacent introns were prone to be lost simultaneously in Dictyostelium discoideum. In both Dictyostelium species, the exonic sequences flanking lost introns were found to have a significantly higher GC content than those flanking conserved introns. Together, these observations support a reverse-transcription model of intron loss in which intron losses were caused by gene conversion between genomic DNA and cDNA reverse transcribed from mature mRNA. We also identified two imprecise intron losses in Dictyostelium discoideum that may have resulted from genomic deletions. Ninety-eight putative intron gains were also observed. Consistent with previous studies of other lineages, the source sequences were found in only a small number of cases, with only two instances of intron gain identified in Dictyostelium discoideum. Although they diverged very early from animals and fungi, Dictyostelium species have similar mechanisms of intron loss.
Processing of Archaebacterial Intron-Containing tRNA Gene Transcripts.

DTIC Science & Technology

1987-07-31

1{ 1. Project Goals: A. To determine the mechanism of tRNA intron processing in the halophilic archaebacteria. B. Characterize and compare the...enzyme(s) responsible for the removal of 5’-flanking sequences from halophilic and sulfur-dependent tRNA gene transcripts. C. Examine the structure and...distribution of tRNA introns in the halophilic archaebacteria. 2. Accomplishments: A. Intron processing mechanism We have succeeded in our primary
Fungal origin by horizontal transfer of a plant mitochondrial group I intron in the chimeric CoxI gene of Peperomia.

PubMed

Vaughn, J C; Mason, M T; Sper-Whitis, G L; Kuhlman, P; Palmer, J D

1995-11-01

We present phylogenetic evidence that a group I intron in an angiosperm mitochondrial gene arose recently by horizontal transfer from a fungal donor species. A 1,716-bp fragment of the mitochondrial coxI gene from the angiosperm Peperomia polybotrya was amplified via the polymerase chain reaction and sequenced. Comparison to other coxI genes revealed a 966-bp group I intron, which, based on homology with the related yeast coxI intron aI4, potentially encodes a 279-amino-acid site-specific DNA endonuclease. This intron, which is believed to function as a ribozyme during its own splicing, is not present in any of 19 coxI genes examined from other diverse vascular plant species. Phylogenetic analysis of intron origin was carried out using three different tree-generating algorithms, and on a variety of nucleotide and amino acid data sets from the intron and its flanking exon sequences. These analyses show that the Peperomia coxI gene intron and exon sequences are of fundamentally different evolutionary origin. The Peperomia intron is more closely related to several fungal mitochondrial introns, two of which are located at identical positions in coxI, than to identically located coxI introns from the land plant Marchantia and the green alga Prototheca. Conversely, the exon sequence of this gene is, as expected, most closely related to other angiosperm coxI genes. These results, together with evidence suggestive of co-conversion of exonic markers immediately flanking the intron insertion site, lead us to conclude that the Peperomia coxI intron probably arose by horizontal transfer from a fungal donor, using the double-strand-break repair pathway. The donor species may have been one of the symbiotic mycorrhizal fungi that live in close obligate association with most plants.
A pipeline of programs for collecting and analyzing group II intron retroelement sequences from GenBank

PubMed Central

2013-01-01

Background Accurate and complete identification of mobile elements is a challenging task in the current era of sequencing, given their large numbers and frequent truncations. Group II intron retroelements, which consist of a ribozyme and an intron-encoded protein (IEP), are usually identified in bacterial genomes through their IEP; however, the RNA component that defines the intron boundaries is often difficult to identify because of a lack of strong sequence conservation corresponding to the RNA structure. Compounding the problem of boundary definition is the fact that a majority of group II intron copies in bacteria are truncated. Results Here we present a pipeline of 11 programs that collect and analyze group II intron sequences from GenBank. The pipeline begins with a BLAST search of GenBank using a set of representative group II IEPs as queries. Subsequent steps download the corresponding genomic sequences and flanks, filter out non-group II introns, assign introns to phylogenetic subclasses, filter out incomplete and/or non-functional introns, and assign IEP sequences and RNA boundaries to the full-length introns. In the final step, the redundancy in the data set is reduced by grouping introns into sets of ≥95% identity, with one example sequence chosen to be the representative. Conclusions These programs should be useful for comprehensive identification of group II introns in sequence databases as data continue to rapidly accumulate. PMID:24359548
Unusual Intron Conservation near Tissue-Regulated Exons Found by Splicing Microarrays

PubMed Central

Sugnet, Charles W; Srinivasan, Karpagam; Clark, Tyson A; O'Brien, Georgeann; Cline, Melissa S; Wang, Hui; Williams, Alan; Kulp, David; Blume, John E; Haussler, David; Ares, Manuel

2006-01-01

Alternative splicing contributes to both gene regulation and protein diversity. To discover broad relationships between regulation of alternative splicing and sequence conservation, we applied a systems approach, using oligonucleotide microarrays designed to capture splicing information across the mouse genome. In a set of 22 adult tissues, we observe differential expression of RNA containing at least two alternative splice junctions for about 40% of the 6,216 alternative events we could detect. Statistical comparisons identify 171 cassette exons whose inclusion or skipping is different in brain relative to other tissues and another 28 exons whose splicing is different in muscle. A subset of these exons is associated with unusual blocks of intron sequence whose conservation in vertebrates rivals that of protein-coding exons. By focusing on sets of exons with similar regulatory patterns, we have identified new sequence motifs implicated in brain and muscle splicing regulation. Of note is a motif that is strikingly similar to the branchpoint consensus but is located downstream of the 5′ splice site of exons included in muscle. Analysis of three paralogous membrane-associated guanylate kinase genes reveals that each contains a paralogous tissue-regulated exon with a similar tissue inclusion pattern. While the intron sequences flanking these exons remain highly conserved among mammalian orthologs, the paralogous flanking intron sequences have diverged considerably, suggesting unusually complex evolution of the regulation of alternative splicing in multigene families. PMID:16424921
Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication.

PubMed

Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

2016-06-04

Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.
Changes in exon–intron structure during vertebrate evolution affect the splicing pattern of exons

PubMed Central

Gelfman, Sahar; Burstein, David; Penn, Osnat; Savchenko, Anna; Amit, Maayan; Schwartz, Schraga; Pupko, Tal; Ast, Gil

2012-01-01

Exon–intron architecture is one of the major features directing the splicing machinery to the short exons that are located within long flanking introns. However, the evolutionary dynamics of exon–intron architecture and its impact on splicing is largely unknown. Using a comparative genomic approach, we analyzed 17 vertebrate genomes and reconstructed the ancestral motifs of both 3′ and 5′ splice sites, as also the ancestral length of exons and introns. Our analyses suggest that vertebrate introns increased in length from the shortest ancestral introns to the longest primate introns. An evolutionary analysis of splice sites revealed that weak splice sites act as a restrictive force keeping introns short. In contrast, strong splice sites allow recognition of exons flanked by long introns. Reconstruction of the ancestral state suggests these phenomena were not prevalent in the vertebrate ancestor, but appeared during vertebrate evolution. By calculating evolutionary rate shifts in exons, we identified cis-acting regulatory sequences that became fixed during the transition from early vertebrates to mammals. Experimental validations performed on a selection of these hexamers confirmed their regulatory function. We additionally revealed many features of exons that can discriminate alternative from constitutive exons. These features were integrated into a machine-learning approach to predict whether an exon is alternative. Our algorithm obtains very high predictive power (AUC of 0.91), and using these predictions we have identified and successfully validated novel alternatively spliced exons. Overall, we provide novel insights regarding the evolutionary constraints acting upon exons and their recognition by the splicing machinery. PMID:21974994
Nucleotide sequence of the L1 ribosomal protein gene of Xenopus laevis: remarkable sequence homology among introns.

PubMed Central

Loreni, F; Ruberti, I; Bozzoni, I; Pierandrei-Amaldi, P; Amaldi, F

1985-01-01

Ribosomal protein L1 is encoded by two genes in Xenopus laevis. The comparison of two cDNA sequences shows that the two L1 gene copies (L1a and L1b) have diverged in many silent sites and very few substitution sites; moreover a small duplication occurred at the very end of the coding region of the L1b gene which thus codes for a product five amino acids longer than that coded by L1a. Quantitatively the divergence between the two L1 genes confirms that a whole genome duplication took place in Xenopus laevis approximately 30 million years ago. A genomic fragment containing one of the two L1 gene copies (L1a), with its nine introns and flanking regions, has been completely sequenced. The 5' end of this gene has been mapped within a 20-pyridimine stretch as already found for other vertebrate ribosomal protein genes. Four of the nine introns have a 60-nucleotide sequence with 80% homology; within this region some boxes, one of which is 16 nucleotides long, are 100% homologous among the four introns. This feature of L1a gene introns is interesting since we have previously shown that the activity of this gene is regulated at a post-transcriptional level and it involves the block of the normal splicing of some intron sequences. Images Fig. 3. Fig. 5. PMID:3841512
A comparative genomics strategy for targeted discovery of single-nucleotide polymorphisms and conserved-noncoding sequences in orphan crops.

PubMed

Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H

2006-04-01

Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.
A Comparative Genomics Strategy for Targeted Discovery of Single-Nucleotide Polymorphisms and Conserved-Noncoding Sequences in Orphan Crops1[W

PubMed Central

Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.

2006-01-01

Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031
Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication

PubMed Central

Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

2016-01-01

Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to “Gopoong” and “K-1” were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information. PMID:27271615
Increased complexity of circRNA expression during species evolution.

PubMed

Dong, Rui; Ma, Xu-Kai; Chen, Ling-Ling; Yang, Li

2017-08-03

Circular RNAs (circRNAs) are broadly identified from precursor mRNA (pre-mRNA) back-splicing across various species. Recent studies have suggested a cell-/tissue- specific manner of circRNA expression. However, the distinct expression pattern of circRNAs among species and its underlying mechanism still remain to be explored. Here, we systematically compared circRNA expression from human and mouse, and found that only a small portion of human circRNAs could be determined in parallel mouse samples. The conserved circRNA expression between human and mouse is correlated with the existence of orientation-opposite complementary sequences in introns that flank back-spliced exons in both species, but not the circRNA sequences themselves. Quantification of RNA pairing capacity of orientation-opposite complementary sequences across circRNA-flanking introns by Complementary Sequence Index (CSI) identifies that among all types of complementary sequences, SINEs, especially Alu elements in human, contribute the most for circRNA formation and that their diverse distribution across species leads to the increased complexity of circRNA expression during species evolution. Together, our integrated and comparative reference catalog of circRNAs in different species reveals a species-specific pattern of circRNA expression and suggests a previously under-appreciated impact of fast-evolved SINEs on the regulation of (circRNA) gene expression.
The chloroplast tRNALys(UUU) gene from mustard (Sinapis alba) contains a class II intron potentially coding for a maturase-related polypeptide.

PubMed

Neuhaus, H; Link, G

1987-01-01

The trnK gene endocing the tRNALys(UUU) has been located on mustard (Sinapis alba) chloroplast DNA, 263 bp upstream of the psbA gene on the same strand. The nucleotide sequence of the trnK gene and its flanking regions as well as the putative transcription start and termination sites are shown. The 5' end of the transcript lies 121 bp upstream of the 5' tRNA coding region and is preceded by procaryotic-type "-10" and "-35" sequence elements, while the 3' end maps 2.77 kb downstream to a DNA region with possible stemloop secondary structure. The anticodon loop of the tRNALys is interrupted by a 2,574 bp intron containing a long open reading frame, which codes for 524 amino acids. Based on conserved stem and loop structures, this intron has characteristic features of a class II intron. A region near the carboxyl terminus of the derived polypeptide appears structurally related to maturases.
A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals.

PubMed

Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M

2017-01-01

Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
Genomic organization of human fetal specific P-450IIIA7 (cytochrome P-450HFLa)-related gene(s) and interaction of transcriptional regulatory factor with its DNA element in the 5' flanking region.

PubMed

Itoh, S; Yanagimoto, T; Tagawa, S; Hashimoto, H; Kitamura, R; Nakajima, Y; Okochi, T; Fujimoto, S; Uchino, J; Kamataki, T

1992-03-24

P-450IIIA7 is a form of cytochrome P-450 which was isolated from human fetal livers and termed P-450HFLa. This form has been clarified to be expressed during fetal life specifically (Komori, M., Nishio, K., Kitada, M., Shiramatsu, K., Muroya, K., Soma, M., Nagashima, K. and Kamataki, T. (1990) Biochemistry 29, 4430-4433). In the present study, we isolated five independent clones which probably corresponded to the human P-450IIIA7 gene. These clones were completely sequenced, all exons, exon-intron junctions and the 5' flanking region from the cap site to-869. Although the sequences in the coding region were completely identical to P-450IIIA7, it is possible that genomic fragments sequenced in this study encode portions of other P-450IIIA7-related genes since we could not obtain a complete overlapping set of genomic clones. Within its 5' flanking sequence, the putative binding sites of several transcriptional regulatory factors existed. Among them, it was shown that a basic transcription element binding factor (BTEB) actually interacted with the 5' flanking region of this gene.
Resequencing IRS2 reveals rare variants for obesity but not fasting glucose homeostasis in Hispanic children

USDA-ARS?s Scientific Manuscript database

Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5' and 3' flanking regions of IRS2 (approx. 14.5 kb), were bidirectionally sequenced for single nucleotide...
Group I intron-mediated trans-splicing in mitochondria of Gigaspora rosea and a robust phylogenetic affiliation of arbuscular mycorrhizal fungi with Mortierellales.

PubMed

Nadimi, Maryam; Beaudet, Denis; Forget, Lise; Hijri, Mohamed; Lang, B Franz

2012-09-01

Gigaspora rosea is a member of the arbuscular mycorrhizal fungi (AMF; Glomeromycota) and a distant relative of Glomus species that are beneficial to plant growth. To allow for a better understanding of Glomeromycota, we have sequenced the mitochondrial DNA of G. rosea. A comparison with Glomus mitochondrial genomes reveals that Glomeromycota undergo insertion and loss of mitochondrial plasmid-related sequences and exhibit considerable variation in introns. The gene order between the two species is almost completely reshuffled. Furthermore, Gigaspora has fragmented cox1 and rns genes, and an unorthodox initiator tRNA that is tailored to decoding frequent UUG initiation codons. For the fragmented cox1 gene, we provide evidence that its RNA is joined via group I-mediated trans-splicing, whereas rns RNA remains in pieces. According to our model, the two cox1 precursor RNA pieces are brought together by flanking cox1 exon sequences that form a group I intron structure, potentially in conjunction with the nad5 intron 3 sequence. Finally, we present analyses that address the controversial phylogenetic association of Glomeromycota within fungi. According to our results, Glomeromycota are not a separate group of paraphyletic zygomycetes but branch together with Mortierellales, potentially also Harpellales.

Evaluation of the arrestin gene in patients with retinitis pigmentosa or an allied disease

DOE Office of Scientific and Technical Information (OSTI.GOV)

DeStefano, D.J.; Berson, E.L.; Dryja, T.P.

1994-09-01

Arrestin, also called 48K protein or S-antigen, plays a role in deactivating rhodopsin, the photosensitive, seven-helix, G-protein receptor found in rod photoreceptors. In Drosophila, null mutations in arrestin genes cause a light-dependent photoreceptor degeneration. It is possible that a comparable photoreceptor degeneration in humans is caused by defects in the rod arrestin gene. In order to evaluate this possibility, we are characterizing the human arrestin locus on chromosome 2q. We screened a genomic library (5 million plaques) using an arrestin cDNA clone. Sixty-eight hybridizing clones were identified; portions of 7 clones were sequenced to determine the intron sequence flanking themore » exons. We are using SSCP analysis and direct genomic sequencing to screen the entire coding region, splice donor and acceptor sites, and the promoter region of the arrestin gene in 188 patients with autosomal dominant and 104 patients with autosomal recessive retinitis pigmentosa. We have already obtained flanking intron sequences necessary for SSCP analysis for 13 of 16 exons. So far, we have identified 4 silent base changes at codons 67 (TGC-to-TGT), 107 (CTG-to-CTC), 163 (GCC-to-GCT), and 288 (CTG-to-TGT), all with allele frequencies at 1% or less. Several other variant bands detected by SSCP analysis are currently being sequenced.« less
The human myelin oligodendrocyte glycoprotein (MOG) gene: Complete nucleotide sequence and structural characterization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Paule Roth, M.; Malfroy, L.; Offer, C.

1995-07-20

Human myelin oligodendrocyte glycoprotein (MOG), a myelin component of the central nervous system, is a candidate target antigen for autoimmune-mediated demyelination. We have isolated and sequenced part of a cosmid clone that contains the entire human MOG gene. The primary nuclear transcript, extending from the putative start of transcription to the site of poly(A) addition, is 15,561 nucleotides in length. The human MOG gene contains 8 exons, separated by 7 introns; canonical intron/exon boundary sites are observed at each junction. The introns vary in size from 242 to 6484 bp and contain numerous repetitive DNA elements, including 14 Alu sequencesmore » within 3 introns. Another Alu element is located in the 3{prime}-untranslated region of the gene. Alu sequences were classified with respect to subfamily assignment. Seven hundred sixty-three nucleotides 5{prime} of the transcription start and 1214 nucleotides 3{prime} of the poly(A) addition sites were also sequenced. The 5{prime}-flanking region revealed the presence of several consensus sequences that could be relevant in the transcription of the MOG gene, in particular binding sites in common with other myelin gene promoters. Two polymorphic intragenic dinucleotide (CA){sub n} and tetranucleotide (TAAA){sub n} repeats were identified and may provide genetic marker tools for association and linkage studies. 50 refs., 3 figs., 3 tabs.« less
The structure of the coding and 5'-flanking region of the type 1 iodothyronine deiodinase (dio1) gene is normal in a patient with suspected congenital dio1 deficiency.

PubMed

Toyoda, N; Kleinhaus, N; Larsen, P R

1996-06-01

We analyzed the exon-intron structure of the human type 1 deiodinase gene (dio1) and compared it with that of a patient with suspected congenital type 1 deiodinase (D1) deficiency. The hdio1 gene is identical in exon-intron arrangement to the mouse gene, with coding sequences and a selenocysteine insertion sequence (SECIS) element contained in four exons. There were no mutations in the sequences of exons 1-4 of the patient's genomic DNA. Functional studies by transient expression techniques showed no difference in basal promoter activity or T3 responsiveness between the patient's and the normal dio1 gene. A structural abnormality in the dio1 gene is not a likely explanation for this patient's D1-deficient phenotype.
Molecular analysis of the glucocerebrosidase gene locus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Winfield, S.L.; Martin, B.M.; Fandino, A.

1994-09-01

Gaucher disease is due to a deficiency in the activity of the lysosomal enzyme glucocerebrosidase. Both the functional gene for this enzyme and a pseudogene are located in close proximity on chromosome 1q21. Analysis of the mutations present in patient samples has suggested interaction between the functional gene and the pseudogene in the origin of mutant genotypes. To investigate the involvement of regions flanking the functional gene and pseudogene in the origin of mutations found in Gaucher disease, a YAC clone containing DNA from this locus has been subcloned and characterized. The original YAC containing {approximately}360 kb was truncated withmore » the use of fragmentation plasmids to about 85 kb. A lambda library derived from this YAC was screened to obtain clones containing glucocerebrosidase sequences. PCR amplification was used to identify subclones containing 5{prime}, central, or 3{prime} sequences of the functional gene or of the pseudogene. Clones spanning the entire distance from the last exon of the functional gene to intron 1 of the pseudogene, the 5{prime} end of the functional gene and 16 kb of 5{prime} flanking region and approximately 15 kb of 3{prime} flanking region of the pseudogene were sequenced. Sequence data from 48 kb of intergenic and flanking regions of the glucocerebrosidase gene and its pseudogene has been generated. A large number of Alu sequences and several simple repeats have been found. Two of these repeats exhibit fragment length polymorphism. There is almost 100% homology between the 3{prime} flanking regions of the functional gene and the pseudogene, extending to about 4 kb past the termination codons. A much lower degree of homology is observed in the 5{prime} flanking region. Patient samples are currently being screened for polymorphisms in these flanking regions.« less
Genomic organization of the Neurospora crassa gsn gene: possible involvement of the STRE and HSE elements in the modulation of transcription during heat shock.

PubMed

Freitas, F Zanolli; Bertolini, M C

2004-12-01

Glycogen synthase, an enzyme involved in glycogen biosynthesis, is regulated by phosphorylation and by the allosteric ligand glucose-6-phosphate (G6P). In addition, enzyme levels can be regulated by changes in gene expression. We recently cloned a cDNA for glycogen synthase ( gsn) from Neurospora crassa, and showed that gsn transcription decreased when cells were exposed to heat shock (shifted from 30 degrees C to 45 degrees C). In order to understand the mechanisms that control gsn expression, we isolated the gene, including its 5' and 3' flanking regions, from the genome of N. crassa. An ORF of approximately 2.4 kb was identified, which is interrupted by four small introns (II-V). Intron I (482 bp) is located in the 5'UTR region. Three putative Transcription Initiation Sites (TISs) were mapped, one of which lies downstream of a canonical TATA-box sequence (5'-TGTATAAA-3'). Analysis of the 5'-flanking region revealed the presence of putative transcription factor-binding sites, including Heat Shock Elements (HSEs) and STress Responsive Elements (STREs). The possible involvement of these motifs in the negative regulation of gsn transcription was investigated using Electrophoretic Mobility Shift Assays (EMSA) with nuclear extracts of N. crassa mycelium obtained before and after heat shock, and DNA fragments encompassing HSE and STRE elements from the 5'-flanking region. While elements within the promoter region are involved in transcription under heat shock, elements in the 5'UTR intron may participate in transcription during vegetative growth. The results thus suggest that N. crassa possesses trans -acting elements that interact with the 5'-flanking region to regulate gsn transcription during heat shock and vegetative growth.
Impaired Spermatogenesis, Muscle, and Erythrocyte Function in U12 Intron Splicing-Defective Zrsr1 Mutant Mice.

PubMed

Horiuchi, Keiko; Perez-Cerezales, Serafín; Papasaikas, Panagiotis; Ramos-Ibeas, Priscila; López-Cardona, Angela Patricia; Laguna-Barraza, Ricardo; Fonseca Balvís, Noelia; Pericuesta, Eva; Fernández-González, Raul; Planells, Benjamín; Viera, Alberto; Suja, Jose Angel; Ross, Pablo Juan; Alén, Francisco; Orio, Laura; Rodriguez de Fonseca, Fernando; Pintado, Belén; Valcárcel, Juan; Gutiérrez-Adán, Alfonso

2018-04-03

The U2AF35-like ZRSR1 has been implicated in the recognition of 3' splice site during spliceosome assembly, but ZRSR1 knockout mice do not show abnormal phenotypes. To analyze ZRSR1 function and its precise role in RNA splicing, we generated ZRSR1 mutant mice containing truncating mutations within its RNA-recognition motif. Homozygous mutant mice exhibited severe defects in erythrocytes, muscle stretch, and spermatogenesis, along with germ cell sloughing and apoptosis, ultimately leading to azoospermia and male sterility. Testis RNA sequencing (RNA-seq) analyses revealed increased intron retention of both U2- and U12-type introns, including U12-type intron events in genes with key functions in spermatogenesis and spermatid development. Affected U2 introns were commonly found flanking U12 introns, suggesting functional cross-talk between the two spliceosomes. The splicing and tissue defects observed in mutant mice attributed to ZRSR1 loss of function suggest a physiological role for this factor in U12 intron splicing. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Recurrent Loss of Specific Introns during Angiosperm Evolution

PubMed Central

Wang, Hao; Devos, Katrien M.; Bennetzen, Jeffrey L.

2014-01-01

Numerous instances of presence/absence variations for introns have been documented in eukaryotes, and some cases of recurrent loss of the same intron have been suggested. However, there has been no comprehensive or phylogenetically deep analysis of recurrent intron loss. Of 883 cases of intron presence/absence variation that we detected in five sequenced grass genomes, 93 were confirmed as recurrent losses and the rest could be explained by single losses (652) or single gains (118). No case of recurrent intron gain was observed. Deep phylogenetic analysis often indicated that apparent intron gains were actually numerous independent losses of the same intron. Recurrent loss exhibited extreme non-randomness, in that some introns were removed independently in many lineages. The two larger genomes, maize and sorghum, were found to have a higher rate of both recurrent loss and overall loss and/or gain than foxtail millet, rice or Brachypodium. Adjacent introns and small introns were found to be preferentially lost. Intron loss genes exhibited a high frequency of germ line or early embryogenesis expression. In addition, flanking exon A+T-richness and intron TG/CG ratios were higher in retained introns. This last result suggests that epigenetic status, as evidenced by a loss of methylated CG dinucleotides, may play a role in the process of intron loss. This study provides the first comprehensive analysis of recurrent intron loss, makes a series of novel findings on the patterns of recurrent intron loss during the evolution of the grass family, and provides insight into the molecular mechanism(s) underlying intron loss. PMID:25474210
Multiple splicing defects in an intronic false exon.

PubMed

Sun, H; Chasin, L A

2000-09-01

Splice site consensus sequences alone are insufficient to dictate the recognition of real constitutive splice sites within the typically large transcripts of higher eukaryotes, and large numbers of pseudoexons flanked by pseudosplice sites with good matches to the consensus sequences can be easily designated. In an attempt to identify elements that prevent pseudoexon splicing, we have systematically altered known splicing signals, as well as immediately adjacent flanking sequences, of an arbitrarily chosen pseudoexon from intron 1 of the human hprt gene. The substitution of a 5' splice site that perfectly matches the 5' consensus combined with mutation to match the CAG/G sequence of the 3' consensus failed to get this model pseudoexon included as the central exon in a dhfr minigene context. Provision of a real 3' splice site and a consensus 5' splice site and removal of an upstream inhibitory sequence were necessary and sufficient to confer splicing on the pseudoexon. This activated context also supported the splicing of a second pseudoexon sequence containing no apparent enhancer. Thus, both the 5' splice site sequence and the polypyrimidine tract of the pseudoexon are defective despite their good agreement with the consensus. On the other hand, the pseudoexon body did not exert a negative influence on splicing. The introduction into the pseudoexon of a sequence selected for binding to ASF/SF2 or its replacement with beta-globin exon 2 only partially reversed the effect of the upstream negative element and the defective polypyrimidine tract. These results support the idea that exon-bridging enhancers are not a prerequisite for constitutive exon definition and suggest that intrinsically defective splice sites and negative elements play important roles in distinguishing the real splicing signal from the vast number of false splicing signals.
Compositional correlations in the chicken genome.

PubMed

Musto, H; Romero, H; Zavala, A; Bernardi, G

1999-09-01

This paper analyses the compositional correlations that hold in the chicken genome. Significant linear correlations were found among the regions studied-coding sequences (and their first, second, and third codon positions), flanking regions (5' and 3'), and introns-as is the case in the human genome. We found that these compositional correlations are not limited to global GC levels but even extend to individual bases. Furthermore, an analysis of 1037 coding sequences has confirmed a correlation among GC(3), GC(2), and GC(1). The implications of these results are discussed.
Structural and functional differences in the dio1 gene in mice with inherited type 1 deiodinase deficiency.

PubMed

Maia, A L; Berry, M J; Sabbag, R; Harney, J W; Larsen, P R

1995-08-01

The type 1 deiodinase (D1) provides the major portion of the circulating T3 in vertebrates. In C3H and certain other inbred mice, liver and kidney D1 activity is 5- to 10-fold lower than in the common phenotype, C57. The lower D1 levels are paralleled by a decreased normal-sized dio1 mRNA and hyperthyroxinemia. Low activity cosegregates with a restriction fragment length variant (RFLV) in both inbred and recombinant strains, indicating it is due to differences in the dio1 gene. The exonic structure and the deduced amino acid sequences are identical for both strains and highly homologous to that of the rat. The RFLV is due to an approximately 150-base pair expansion of repetitive sequences in the second intron of the C3H gene, but this segment does not differentially affect the transient expression of a human GH gene. The promoter and 5'-flanking regions of the C3H and C57 dio1 genes are very similar and are GC rich without TATA or CCAAT boxes. However, functional assays of 1.5-kilobase 5'-flanking dio1-CAT constructs showed 2- to 3-fold higher activity of the C57-CAT constructs. Deletion mutants showed that sequences between -705 and -162 were the cause of this. In this region, the only major difference between the two genes is a 21-base pair insert containing five CTG repeats in the C3H promoter. This difference also cosegregates with low D1 activity and the intron RFLV in four other mouse strains. The correlation of the CTG repeat insert with both in vitro and in vivo expression and the absence of other significant sequence differences in the 5'-flanking region argue that this is the major explanation for the impaired expression of the dio1 gene and the resulting hyperthyroxinemia of the C3H mouse.
Identification of Genetic Elements Associated with EPSPS Gene Amplification

PubMed Central

Gaines, Todd A.; Wright, Alice A.; Molin, William T.; Lorentz, Lothar; Riggins, Chance W.; Tranel, Patrick J.; Beffa, Roland; Westra, Philip; Powles, Stephen B.

2013-01-01

Weed populations can have high genetic plasticity and rapid responses to environmental selection pressures. For example, 100-fold amplification of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene evolved in the weed species Amaranthus palmeri to confer resistance to glyphosate, the world’s most important herbicide. However, the gene amplification mechanism is unknown. We sequenced the EPSPS gene and genomic regions flanking EPSPS loci in A. palmeri, and searched for mobile genetic elements or repetitive sequences. The EPSPS gene was 10,229 bp, containing 8 exons and 7 introns. The gene amplification likely proceeded through a DNA-mediated mechanism, as introns exist in the amplified gene copies and the entire amplified sequence is at least 30 kb in length. Our data support the presence of two EPSPS loci in susceptible (S) A. palmeri, and that only one of these was amplified in glyphosate-resistant (R) A. palmeri. The EPSPS gene amplification event likely occurred recently, as no sequence polymorphisms were found within introns of amplified EPSPS copies from R individuals. Sequences with homology to miniature inverted-repeat transposable elements (MITEs) were identified next to EPSPS gene copies only in R individuals. Additionally, a putative Activator (Ac) transposase and a repetitive sequence region were associated with amplified EPSPS genes. The mechanism controlling this DNA-mediated amplification remains unknown. Further investigation is necessary to determine if the gene amplification may have proceeded via DNA transposon-mediated replication, and/or unequal recombination between different genomic regions resulting in replication of the EPSPS gene. PMID:23762434
Structure and genomic organization of the human B1 receptor gene for kinins (BDKRB1).

PubMed

Bachvarov, D R; Hess, J F; Menke, J G; Larrivée, J F; Marceau, F

1996-05-01

Two subtypes of mammalian bradykinin receptors, B1 and B2 (BDKRB1 and BDKRB2), have been defined based on their pharmacological properties. The B1 type kinin receptors have weak affinity for intact BK or Lys-BK but strong affinity for kinin metabolites without the C-terminal arginine (e.g., des-Arg9-BK and Lys-des-Arg9-BK, also called des-Arg10-kallidin), which are generated by kininase I. The B1 receptor expression is up-regulated following tissue injury and inflammation (hyperemia, exudation, hyperalgesia, etc.). In the present study, we have cloned and sequenced the gene encoding human B1 receptor from a human genomic library. The human B1 receptor gene contains three exons separated by two introns. The first and the second exon are noncoding, while the coding region and the 3'-flanking region are located entirely on the third exon. The exon-intron arrangement of the human B1 receptor gene shows significant similarity with the genes encoding the B2 receptor subtype in human, mouse, and rat. Sequence analysis of the 5'-flanking region revealed the presence of a consensus TATA box and of numerous candidate transcription factor binding sequences. Primer extension experiments have shown the existence of multiple transcription initiation sites situated downstream and upstream from the consensus TATA box. Genomic Southern blot analysis indicated that the human B1 receptor is encoded by a single-copy gene.
Characterization of a marsupial sperm protamine gene and its transcripts from the North American opossum (Didelphis marsupialis).

PubMed

Winkfein, R J; Nishikawa, S; Connor, W; Dixon, G H

1993-07-01

A synthetic oligonucleotide primer, designed from marsupial protamine protein-sequence data [Balhorn, R., Corzett, M., Matrimas, J. A., Cummins, J. & Faden, B. (1989) Analysis of protamines isolated from two marsupials, the ring-tailed wallaby and gray short-tailed opossum, J. Cell. Biol. 107] was used to amplify, via the polymerase chain reaction, protamine sequences from a North American opossum (Didelphis marsupialis) cDNA. Using the amplified sequences as probes, several protamine cDNA clones were isolated. The protein sequence, predicted from the cDNA sequences, consisted of 57 amino acids, contained a large number of arginine residues and exhibited the sequence ARYR at its amino terminus, which is conserved in avian and most eutherian mammal protamines. Like the true protamines of trout and chicken, the opossum protamine lacked cysteine residues, distinguishing it from placental mammalian protamine 1 (P1 or stable) protamines. Examination of the protamine gene, isolated by polymerase-chain-reaction amplification of genomic DNA, revealed the presence of an intron dividing the protamine-coding region, a common characteristic of all mammalian P1 genes. In addition, extensive sequence identity in the 5' and 3' flanking regions between mouse and opossum sequences classify the marsupial protamine as being closely related to placental mammal P1. Protamine transcripts, in both birds and mammals, are present in two size classes, differing by the length of their poly(A) tails (either short or long). Examination of opossum protamine transcripts by Northern hybridization revealed four distinct mRNA species in the total RNA fraction, two of which were enriched in the poly(A)-rich fraction. Northern-blot analysis, using an intron-specific probe, revealed the presence of intron sequences in two of the four protamine transcripts. If expressed, the corresponding protein from intron-containing transcripts would differ from spliced transcripts by length (49 versus 57 amino acids) and would contain a cysteine residue.
Multiple recent horizontal transfers of the cox1 intron in Solanaceae and extended co-conversion of flanking exons

PubMed Central

2011-01-01

Background The most frequent case of horizontal transfer in plants involves a group I intron in the mitochondrial gene cox1, which has been acquired via some 80 separate plant-to-plant transfer events among 833 diverse angiosperms examined. This homing intron encodes an endonuclease thought to promote the intron's promiscuous behavior. A promising experimental approach to study endonuclease activity and intron transmission involves somatic cell hybridization, which in plants leads to mitochondrial fusion and genome recombination. However, the cox1 intron has not yet been found in the ideal group for plant somatic genetics - the Solanaceae. We therefore undertook an extensive survey of this family to find members with the intron and to learn more about the evolutionary history of this exceptionally mobile genetic element. Results Although 409 of the 426 species of Solanaceae examined lack the cox1 intron, it is uniformly present in three phylogenetically disjunct clades. Despite strong overall incongruence of cox1 intron phylogeny with angiosperm phylogeny, two of these clades possess nearly identical intron sequences and are monophyletic in intron phylogeny. These two clades, and possibly the third also, contain a co-conversion tract (CCT) downstream of the intron that is extended relative to all previously recognized CCTs in angiosperm cox1. Re-examination of all published cox1 genes uncovered additional cases of extended co-conversion and identified a rare case of putative intron loss, accompanied by full retention of the CCT. Conclusions We infer that the cox1 intron was separately and recently acquired by at least three different lineages of Solanaceae. The striking identity of the intron and CCT from two of these lineages suggests that one of these three intron captures may have occurred by a within-family transfer event. This is consistent with previous evidence that horizontal transfer in plants is biased towards phylogenetically local events. The discovery of extended co-conversion suggests that other cox1 conversions may be longer than realized but obscured by the exceptional conservation of plant mitochondrial sequences. Our findings provide further support for the rampant-transfer model of cox1 intron evolution and recommend the Solanaceae as a model system for the experimental analysis of cox1 intron transfer in plants. PMID:21943226
Unique CD44 intronic SNP is associated with tumor grade in breast cancer: a case control study and in silico analysis.

PubMed

Esmaeili, Rezvan; Abdoli, Nasrin; Yadegari, Fatemeh; Neishaboury, Mohamadreza; Farahmand, Leila; Kaviani, Ahmad; Majidzadeh-A, Keivan

2018-01-01

CD44 encoded by a single gene is a cell surface transmembrane glycoprotein. Exon 2 is one of the important exons to bind CD44 protein to hyaluronan. Experimental evidences show that hyaluronan-CD44 interaction intensifies the proliferation, migration, and invasion of breast cancer cells. Therefore, the current study aimed at investigating the association between specific polymorphisms in exon 2 and its flanking region of CD44 with predisposition to breast cancer. In the current study, 175 Iranian female patients with breast cancer and 175 age-matched healthy controls were recruited in biobank, Breast Cancer Research Center, Tehran, Iran. Single nucleotide polymorphisms of CD44 exon 2 and its flanking were analyzed via polymerase chain reaction and gene sequencing techniques. Association between the observed variation with breast cancer risk and clinico-pathological characteristics were studied. Subsequently, bioinformatics analysis was conducted to predict potential exonic splicing enhancer (ESE) motifs changed as the result of a mutation. A unique polymorphism of the gene encoding CD44 was identified at position 14 nucleotide upstream of exon 2 (A37692→G) by the sequencing method. The A > G polymorphism exhibited a significant association with higher-grades of breast cancer, although no significant relation was found between this polymorphism and breast cancer risk. Finally, computational analysis revealed that the intronic mutation generated a new consensus-binding motif for the splicing factor, SC35, within intron 1. The current study results indicated that A > G polymorphism was associated with breast cancer development; in addition, in silico analysis with ESE finder prediction software showed that the change created a new SC35 binding site.
Gene structure of CYP3A4, an adult-specific form of cytochrome P450 in human livers, and its transcriptional control.

PubMed

Hashimoto, H; Toide, K; Kitamura, R; Fujita, M; Tagawa, S; Itoh, S; Kamataki, T

1993-12-01

CYP3 A4 is the adult-specific form of cytochrome P450 in human livers [Komori, M., Nishio, K., Kitada, M., Shiramatsu, K., Muroya, K., Soma, M., Nagashima, K. & Kamataki, T. (1990) Biochemistry 29, 4430-4433]. The sequences of three genomic clones for CYP3A4 were analyzed for all exons, exon-intron junctions and the 5'-flanking region from the major transcription site to nucleotide position -1105, and compared with those of the CYP3A7 gene, a fetal-specific form of cytochrome P450 in humans. The results showed that the identity of 5'-flanking sequences between CYP3A4 and CYP3A7 genes was 91%, and that each 5'-flanking region had characteristic sequences termed as NFSE (P450NF-specific element) and HFLaSE (P450HFLa specific element), respectively. A basic transcription element (BTE) also lay in the 5'-flanking region of the CYP3A4 gene as seen in many CYP genes [Yanagida, A., Sogawa, K., Yasumoto, K. & Fujii-Kuriyama, Y. (1990) Mol. Cell. Biol. 10, 1470-1475]. The BTE binding factor (BTEB) was present in both adult and fetal human livers. To examine the transcriptional activity of the CYP3A4 gene, DNA fragments in the 5'-flanking region of the gene were inserted in front of the simian virus 40 promoter and the chloramphenicol acetyltransferase structural gene, and the constructs were transfected in HepG2 cells. The analysis of the chloramphenicol acetyltransferase activity indicated that (a) specific element(s) which could bind with a factor(s) in livers was present in the 5'-flanking region of the CYP3A4 gene to show the transcriptional activity.
A novel DSPP mutation causes dentinogenesis imperfecta type II in a large Mongolian family

PubMed Central

2010-01-01

Background Several studies have shown that the clinical phenotypes of dentinogenesis imperfecta type II (DGI-II) may be caused by mutations in dentin sialophosphoprotein (DSPP). However, no previous studies have documented the clinical phenotype and genetic basis of DGI-II in a Mongolian family from China. Methods We identified a large five-generation Mongolian family from China with DGI-II, comprising 64 living family members of whom 22 were affected. Linkage analysis of five polymorphic markers flanking DSPP gene was used to genotype the families and to construct the haplotypes of these families. All five DSPP exons including the intron-exon boundaries were PCR-amplified and sequenced in 48 members of this large family. Results All affected individuals showed discoloration and severe attrition of their teeth, with obliterated pulp chambers and without progressive high frequency hearing loss or skeletal abnormalities. No recombination was found at five polymorphic markers flanking DSPP in the family. Direct DNA sequencing identified a novel A→G transition mutation adjacent to the donor splicing site within intron 3 in all affected individuals but not in the unaffected family members and 50 unrelated Mongolian individuals. Conclusion This study identified a novel mutation (IVS3+3A→G) in DSPP, which caused DGI-II in a large Mongolian family. This expands the spectrum of mutations leading to DGI-II. PMID:20146806
The human serotonin 5-HT{sub 2C} receptor: Complete cDNA, genomic structure, and alternatively spliced variant

DOE Office of Scientific and Technical Information (OSTI.GOV)

Xie, Enzhong; Zhu, Lingyu; Zhao, Lingyun

1996-08-01

The complete 4775-nt cDNA encoding the human serotonin 5-HT{sub 2C} receptor (5-HT{sub 2C}R), a G-protein-coupled receptor, has been isolated. It contains a 1377-nt coding region flanked by a 728-nt 5{prime}-untranslated region and a 2670-nt 3{prime}-untranslated region. By using the cloned 5-HT{sub 2C}R cDNA probe, the complete human gene for this receptor has been isolated and shown to contain six exons and five introns spanning at least 230 kb of DNA. The coding region of the human 5-HT{sub 2C}R gene is interrupted by three introns, and the positions of the intron/exon junctions are conserved between the human and the rodent genes.more » In addition, an alternatively spliced 5-HT{sub 2C}R RNA that contains a 95-nt deletion in the region coding for the second intracellular loop and the fourth transmembrane domain of the receptor has been identified. This deletion leads to a frameshift and premature termination so that the short isoform RNA encodes a putative protein of 248 amino acids. The ratio for the short isoform over the 5-HT{sub 2C}R RNA was found to be higher in choroid plexus tumor than in normal brain tissue, suggesting the possibility of differential regulation of the 5-HT{sub 2C}R gene in different neural tissues or during tumorigenesis. Transcription of the human 5-HT{sub 2C}R gene was found to be initiated at multiple sites. No classical TATA-box sequence was found at the appropriate location, and the 5{prime}-flanking sequence contains many potential transcription factor-binding sites. A 7.3-kb 5{prime}-flanking 5-HT{sub 2C}R DNA directed the efficient expression of a luciferase reported gene in SK-N-SH and IMR32 neuroblastoma cells, indicating that is contains a functional promoter. 69 refs., 8 figs., 1 tab.« less
Myostatin-2 gene structure and polymorphism of the promoter and first intron in the marine fish Sparus aurata: evidence for DNA duplications and/or translocations.

PubMed

Nadjar-Boger, Elisabeth; Funkenstein, Bruria

2011-02-01

Myostatin (MSTN) is a member of the transforming growth factor-ß superfamily that functions as a negative regulator of skeletal muscle development and growth in mammals. Fish express at least two genes for MSTN: MSTN-1 and MSTN-2. To date, MSTN-2 promoters have been cloned only from salmonids and zebrafish. Here we described the cloning and sequence analysis of MSTN-2 gene and its 5' flanking region in the marine fish Sparus aurata (saMSTN-2). We demonstrate the existence of three alleles of the promoter and three alleles of the first intron. Sequence comparison of the promoter region in the three alleles revealed that although the sequences of the first 1050 bp upstream of the translation start site are almost identical in the three alleles, a substantial sequence divergence is seen further upstream. Careful sequence analysis of the region upstream of the first 1050 bp in the three alleles identified several elements that appear to be repeated in some or all sequences, at different positions. This suggests that the promoter region of saMSTN-2 has been subjected to various chromosomal rearrangements during the course of evolution, reflecting either insertion or deletion events. Screening of several genomic DNA collections indicated differences in allele frequency, with allele 'b' being the most abundant, followed by allele 'c', whereas allele 'a' is relatively rare. Sequence analysis of saMSTN-2 gene also revealed polymorphism in the first intron, identifying three alleles. The length difference in alleles '1R' and '2R' of the first intron is due to the presence of one or two copies of a repeated block of approximately 150 bp, located at the 5' end of the first intron. The third allele, '4R', has an additional insertion of 323 bp located 116 bp upstream of the 3' end of the first intron. Analysis of several DNA collections showed that the '2R' allele is the most common, followed by the '4R' allele, whereas the '1R' allele is relatively rare. Progeny analysis of a full-sib family showed a Mendelian mode of inheritance of the two genetic loci. No clear association was found between the two genetic markers and growth rate. These results show for the first time a substantial degree of polymorphism in both the promoter and first intron of MSTN-2 gene in a perciform fish species which points to chromosomal rearrangements that took place during evolution.
Isolation and sequencing of the gene encoding Sp23, a structural protein of spermatophore of the mealworm beetle, Tenebrio molitor.

PubMed

Feng, X; Happ, G M

1996-11-14

The cDNA for Sp23, a structural protein of the spermatophore of Tenebrio molitor, had been previously cloned and characterized (Paesen, G.C., Schwartz, M.B., Peferoen, M., Weyda, F. and Happ, G.M. (1992a) Amino acid sequence of Sp23, a structure protein of the spermatophore of the mealworm beetle, Tenebrio molitor. J. Biol. Chem. 257, 18852-18857). Using the labeled cDNA for Sp23 as a probe to screen a library of genomic DNA from Tenebrio molitor, we isolated a genomic clone for Sp23. A 5373-base pair (bp) restriction fragment containing the Sp23 gene was sequenced. The coding region is separated by a 55-bp intron which is located close to the translation start site. Three putative ecdysone response elements (EcRE) are identified in the 5' flanking region of the Sp23 gene. Comparison of the flanking regions of the Sp23 gene with those of the D-protein gene expressed in the accessory glands of Tenebrio reveals similar sequences present in the flanking regions of the two genes. The genomic organization of the coding region of the Sp23 gene shares similarities with that of the D-protein gene, three Drosophila accessory gland genes and two Drosophila 20-OH ecdysone-responsive genes.

Identification of the genomic locus for the human Rieske Fe-S Protein gene on Chromosome 19q12

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pennacchio, L.A.

1994-05-06

We have identified the chromosomal location of the human Rieske Iron-Sulfur Protein (UQCRFS1) gene. Mapping by hybridization to a panel of monochromosomal hybrid cell lines indicated that the gene was either on chromosome 19 or 22. By screening a human chromosome 19 specific genomic cosmid library with an oligonucleotide probe made from the published Rieske cDNA sequence, we identified a corresponding cosmid. Portions of this cosmid were sequenced directly. The exon, exon:intron junction, and flanking sequences verified that this cosmid contains the genomic locus. Fluorescent in situ hybridization (FISH) was performed to localize this cosmid to chromosome band 19q12.
Complete nucleotide sequence of the gene for human heparin cofactor II and mapping to chromosomal band 22q11

DOE Office of Scientific and Technical Information (OSTI.GOV)

Herzog, R.; Lutz, S.; Blin, N.

1991-02-05

Heparin cofactor II (HCII) is a 66-kDa plasma glycoprotein that inhibits thrombin rapidly in the presence of dermatan sulfate or heparin. Clones comprising the entire HCII gene were isolated from a human leukocyte genomic library in EMBL-3 {lambda} phage. The sequence of the gene was determined on both strands of DNA (15,849 bp) and included 1,749 bp of 5{prime}-flanking sequence, five exons, four introns, and 476 bp of DNA 3{prime} to the polyadenylation site. Ten complete and one partial Alu repeats were identified in the introns and 5{prime}-flanking region. The HCII gene was regionally mapped on chromosome 22 using rodent-humanmore » somatic cell hybrids, carrying only parts of human chromosome 22, and the chronic myelogenous leukemia cell line K562. With the cDNA probe HCII7.2, containing the entire coding region of the gene, the HCII gene was shown to be amplified 10-20-fold in K562 cells by Southern analysis and in situ hybridization. From these data, the authors concluded that the HCII gene is localized on the chromosomal band 22q11 proximal to the breakpoint cluster region (BCR). Analysis by pulsed-field gel electrophoresis indicated that the amplified HCII gene in K562 cells maps at least 2 Mbp proximal to BCR-1. Furthermore, the HCII7.2 cDNA probe detected two frequent restriction fragment length polymorphisms with the restriction enzymes BamHI and Hind III.« less
Mutation Spectrum of the ABCA4 Gene in a Greek Cohort with Stargardt Disease: Identification of Novel Mutations and Evidence of Three Prevalent Mutated Alleles

PubMed Central

Vassiliki, Kokkinou; George, Koutsodontis; Polixeni, Stamatiou; Christoforos, Giatzakis; Minas, Aslanides Ioannis; Stavrenia, Koukoula; Ioannis, Datseris

2018-01-01

Aim To evaluate the frequency and pattern of disease-associated mutations of ABCA4 gene among Greek patients with presumed Stargardt disease (STGD1). Materials and Methods A total of 59 patients were analyzed for ABCA4 mutations using the ABCR400 microarray and PCR-based sequencing of all coding exons and flanking intronic regions. MLPA analysis as well as sequencing of two regions in introns 30 and 36 reported earlier to harbor deep intronic disease-associated variants was used in 4 selected cases. Results An overall detection rate of at least one mutant allele was achieved in 52 of the 59 patients (88.1%). Direct sequencing improved significantly the complete characterization rate, that is, identification of two mutations compared to the microarray analysis (93.1% versus 50%). In total, 40 distinct potentially disease-causing variants of the ABCA4 gene were detected, including six previously unreported potentially pathogenic variants. Among the disease-causing variants, in this cohort, the most frequent was c.5714+5G>A representing 16.1%, while p.Gly1961Glu and p.Leu541Pro represented 15.2% and 8.5%, respectively. Conclusions By using a combination of methods, we completely molecularly diagnosed 48 of the 59 patients studied. In addition, we identified six previously unreported, potentially pathogenic ABCA4 mutations. PMID:29854428
Integrative analysis of Arabidopsis thaliana transcriptomics reveals intuitive splicing mechanism for circular RNA.

PubMed

Sun, Xiaoyong; Wang, Lin; Ding, Jiechao; Wang, Yanru; Wang, Jiansheng; Zhang, Xiaoyang; Che, Yulei; Liu, Ziwei; Zhang, Xinran; Ye, Jiazhen; Wang, Jie; Sablok, Gaurav; Deng, Zhiping; Zhao, Hongwei

2016-10-01

A new regulatory class of small endogenous RNAs called circular RNAs (circRNAs) has been described as miRNA sponges in animals. Using 16 Arabidopsis thaliana RNA-Seq data sets, we identified 803 circRNAs in RNase R-/non-RNase R-treated samples. The results revealed the following features: Canonical and noncanonical splicing can generate circRNAs; chloroplasts are a hotspot for circRNA generation; furthermore, limited complementary sequences exist not only in introns, but also in the sequences flanking splice sites. The latter finding suggests that multiple combinations between complementary sequences may facilitate the formation of the circular structure. Our results contribute to a better understanding of this novel class of plant circRNAs. © 2016 Federation of European Biochemical Societies.
Molecular cloning of actin genes in Trichomonas vaginalis and phylogeny inferred from actin sequences.

PubMed

Bricheux, G; Brugerolle, G

1997-08-01

The parasitic protozoan Trichomonas vaginalis is known to contain the ubiquitous and highly conserved protein actin. A genomic library and a cDNA library have been screened to identify and clone the actin gene(s) of T. vaginalis. The nucleotide sequence of one gene and its flanking regions have been determined. The open reading frame encodes a protein of 376 amino acids. The sequence is not interrupted by any introns and the promoter could be represented by a 10 bp motif close to a consensus motif also found upstream of most sequenced T. vaginalis genes. The five different clones isolated from the cDNA library have similar sequences and encode three actin proteins differing only by one or two amino acids. A phylogenetic analysis of 31 actin sequences by distance matrix and parsimony methods, using centractin as outgroup, gives congruent trees with Parabasala branching above Diplomonadida.
Cloning and sequencing of a laccase gene from the lignin-degrading basidiomycete Pleurotus ostreatus.

PubMed Central

Giardina, P; Cannio, R; Martirani, L; Marzullo, L; Palmieri, G; Sannia, G

1995-01-01

The gene (pox1) encoding a phenol oxidase from Pleurotus ostreatus, a lignin-degrading basidiomycete, was cloned and sequenced, and the corresponding pox1 cDNA was also synthesized and sequenced. The isolated gene consists of 2,592 bp, with the coding sequence being interrupted by 19 introns and flanked by an upstream region in which putative CAAT and TATA consensus sequences could be identified at positions -174 and -84, respectively. The isolation of a second cDNA (pox2 cDNA), showing 84% similarity, and of the corresponding truncated genomic clones demonstrated the existence of a multigene family coding for isoforms of laccase in P. ostreatus. PCR amplifications of specific regions on the DNA of isolated monokaryons proved that the two genes are not allelic forms. The POX1 amino acid sequence deduced was compared with those of other known laccases from different fungi. PMID:7793961
Combinatorial control of Drosophila circular RNA expression by intronic repeats, hnRNPs, and SR proteins

PubMed Central

Kramer, Marianne C.; Liang, Dongming; Tatomer, Deirdre C.; Gold, Beth; March, Zachary M.; Cherry, Sara; Wilusz, Jeremy E.

2015-01-01

Thousands of eukaryotic protein-coding genes are noncanonically spliced to produce circular RNAs. Bioinformatics has indicated that long introns generally flank exons that circularize in Drosophila, but the underlying mechanisms by which these circular RNAs are generated are largely unknown. Here, using extensive mutagenesis of expression plasmids and RNAi screening, we reveal that circularization of the Drosophila laccase2 gene is regulated by both intronic repeats and trans-acting splicing factors. Analogous to what has been observed in humans and mice, base-pairing between highly complementary transposable elements facilitates backsplicing. Long flanking repeats (∼400 nucleotides [nt]) promote circularization cotranscriptionally, whereas pre-mRNAs containing minimal repeats (<40 nt) generate circular RNAs predominately after 3′ end processing. Unlike the previously characterized Muscleblind (Mbl) circular RNA, which requires the Mbl protein for its biogenesis, we found that Laccase2 circular RNA levels are not controlled by Mbl or the Laccase2 gene product but rather by multiple hnRNP (heterogeneous nuclear ribonucleoprotein) and SR (serine–arginine) proteins acting in a combinatorial manner. hnRNP and SR proteins also regulate the expression of other Drosophila circular RNAs, including Plexin A (PlexA), suggesting a common strategy for regulating backsplicing. Furthermore, the laccase2 flanking introns support efficient circularization of diverse exons in Drosophila and human cells, providing a new tool for exploring the functional consequences of circular RNA expression across eukaryotes. PMID:26450910
Circular RNA biogenesis can proceed through an exon-containing lariat precursor.

PubMed

Barrett, Steven P; Wang, Peter L; Salzman, Julia

2015-06-09

Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical 'backsplicing' event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure.
Combinatorial control of Drosophila circular RNA expression by intronic repeats, hnRNPs, and SR proteins.

PubMed

Kramer, Marianne C; Liang, Dongming; Tatomer, Deirdre C; Gold, Beth; March, Zachary M; Cherry, Sara; Wilusz, Jeremy E

2015-10-15

Thousands of eukaryotic protein-coding genes are noncanonically spliced to produce circular RNAs. Bioinformatics has indicated that long introns generally flank exons that circularize in Drosophila, but the underlying mechanisms by which these circular RNAs are generated are largely unknown. Here, using extensive mutagenesis of expression plasmids and RNAi screening, we reveal that circularization of the Drosophila laccase2 gene is regulated by both intronic repeats and trans-acting splicing factors. Analogous to what has been observed in humans and mice, base-pairing between highly complementary transposable elements facilitates backsplicing. Long flanking repeats (∼ 400 nucleotides [nt]) promote circularization cotranscriptionally, whereas pre-mRNAs containing minimal repeats (<40 nt) generate circular RNAs predominately after 3' end processing. Unlike the previously characterized Muscleblind (Mbl) circular RNA, which requires the Mbl protein for its biogenesis, we found that Laccase2 circular RNA levels are not controlled by Mbl or the Laccase2 gene product but rather by multiple hnRNP (heterogeneous nuclear ribonucleoprotein) and SR (serine-arginine) proteins acting in a combinatorial manner. hnRNP and SR proteins also regulate the expression of other Drosophila circular RNAs, including Plexin A (PlexA), suggesting a common strategy for regulating backsplicing. Furthermore, the laccase2 flanking introns support efficient circularization of diverse exons in Drosophila and human cells, providing a new tool for exploring the functional consequences of circular RNA expression across eukaryotes. © 2015 Kramer et al.; Published by Cold Spring Harbor Laboratory Press.
Functional Analyses of a Novel Splice Variant in the CHD7 Gene, Found by Next Generation Sequencing, Confirm Its Pathogenicity in a Spanish Patient and Diagnose Him with CHARGE Syndrome.

PubMed

Villate, Olatz; Ibarluzea, Nekane; Fraile-Bethencourt, Eugenia; Valenzuela, Alberto; Velasco, Eladio A; Grozeva, Detelina; Raymond, F L; Botella, María P; Tejada, María-Isabel

2018-01-01

Mutations in CHD7 have been shown to be a major cause of CHARGE syndrome, which presents many symptoms and features common to other syndromes making its diagnosis difficult. Next generation sequencing (NGS) of a panel of intellectual disability related genes was performed in an adult patient without molecular diagnosis. A splice donor variant in CHD7 (c.5665 + 1G > T) was identified. To study its potential pathogenicity, exons and flanking intronic sequences were amplified from patient DNA and cloned into the pSAD ® splicing vector. HeLa cells were transfected with this construct and a wild-type minigene and functional analysis were performed. The construct with the c.5665 + 1G > T variant produced an aberrant transcript with an insert of 63 nucleotides of intron 28 creating a premature termination codon (TAG) 25 nucleotides downstream. This would lead to the insertion of 8 new amino acids and therefore a truncated 1896 amino acid protein. As a result of this, the patient was diagnosed with CHARGE syndrome. Functional analyses underline their usefulness for studying the pathogenicity of variants found by NGS and therefore its application to accurately diagnose patients.
An indicator gene to demonstrate intracellular transposition of defective retroviruses.

PubMed Central

Heidmann, T; Heidmann, O; Nicolas, J F

1988-01-01

An indicator gene for detection and quantitation of RNA-mediated transposition was constructed (neoRT). It was inserted into a Moloney murine leukemia provirus (Mo-MLV) deleted for the envelope gene to test for intracellular transposition of defective retroviruses [Mo-MLV(neo)RT]. NeoRT contains the selectable neo gene (which confers resistance to the drug G418), inactivated by a polyadenylylation sequence inserted between the neo promotor and coding sequence. The polyadenylylation sequence is flanked (on the antisense strand of the DNA) by a donor and an acceptor splice site so as to be removed upon passage of the provirus through an RNA intermediate. 3T3 cells transfected with the defective Mo-MLV(neo)RT provirus are sensitive to G418. After trans-complementation with Mo-MLV, viral transcripts confer resistance to G418 upon infection of test cells. In the resistant cells, the polyadenylylation sequence has been removed, as a result in most cases of precise splicing of the intronic domain. Retrotransposition of the defective Mo-MLV(neo)RT provirus was demonstrated by submitting transfected G418-sensitive clones to selection. Between 1 and 10 G418-resistant clones were obtained per 10(7) cells. Several possess additional copies, with evidence for precise removal of the intronic domain. By using target test cells in coculture experiments, extracellular intermediates of retrotransposition could not be detected. Images PMID:2832848
Intraspecific variations of Dekkera/Brettanomyces bruxellensis genome studied by capillary electrophoresis separation of the intron splice site profiles.

PubMed

Vigentini, Ileana; De Lorenzis, Gabriella; Picozzi, Claudia; Imazio, Serena; Merico, Annamaria; Galafassi, Silvia; Piškur, Jure; Foschino, Roberto

2012-06-15

In enology, "Brett" character refers to the wine spoilage caused by the yeast Dekkera/Brettanomyces bruxellensis and its production of volatile phenolic off-flavours. However, the spoilage potential of this yeast is strain-dependent. Therefore, a rapid and reliable recognition at the strain level is a key point to avoid serious economic losses. The present work provides an operative tool to assess the genetic intraspecific variation in this species through the use of introns as molecular targets. Firstly, the available partial D./B. bruxellensis genome sequence was investigated in order to build primers annealing to introns 5' splice site sequence (ISS). This analysis allowed the detection of a non-random vocabulary flanking the site and, exploiting this feature, the creation of specific probes for strain discrimination. Secondly, the separation of the intron splice site PCR fragments was obtained throughout the set up of a capillary electrophoresis protocol, giving a 94% repeatability threshold in our experimental conditions. The comparison of results obtained with ISS-PCR/CE versus the ones performed by mtDNA RFLP revealed that the former protocol is more discriminating and allowed a reliable identification at strain level. Actually sixty D./B. bruxellensis isolates were recognised as unique strains, showing a level of similarity below 79% and confirming the high genetic polymorphism existing within the species. Two main clusters were grouped at similarity levels of about 46% and 47%, respectively, showing a poor correlation with the geographic area of isolation. Moreover, from the evolutionary point of view, the proposed technique could determine the frequency of the genome rearrangements that can occur in D./B. bruxellesis populations. Copyright © 2012 Elsevier B.V. All rights reserved.
Characterization of novel RS1 exonic deletions in juvenile X-linked retinoschisis

PubMed Central

D’Souza, Leera; Cukras, Catherine; Antolik, Christian; Craig, Candice; He, Hong; Li, Shibo; Hejtmancik, James F.; Sieving, Paul A.; Wang, Xinjing

2013-01-01

Purpose X-linked juvenile retinoschisis (XLRS) is a vitreoretinal dystrophy characterized by schisis (splitting) of the inner layers of the neuroretina. Mutations within the retinoschisis (RS1) gene are responsible for this disease. The mutation spectrum consists of amino acid substitutions, splice site variations, small indels, and larger genomic deletions. Clinically, genomic deletions are rarely reported. Here, we characterize two novel full exonic deletions: one encompassing exon 1 and the other spanning exons 4–5 of the RS1 gene. We also report the clinical findings in these patients with XLRS with two different exonic deletions. Methods Unrelated XLRS men and boys and their mothers (if available) were enrolled for molecular genetics evaluation. The patients also underwent ophthalmologic examination and in some cases electroretinogram (ERG) recording. All the exons and the flanking intronic regions of the RS1 gene were analyzed with direct sequencing. Two patients with exonic deletions were further evaluated with array comparative genomic hybridization to define the scope of the genomic aberrations. After the deleted genomic region was identified, primer walking followed by direct sequencing was used to determine the exact breakpoints. Results Two novel exonic deletions of the RS1 gene were identified: one including exon 1 and the other spanning exons 4 and 5. The exon 1 deletion extends from the 5′ region of the RS1 gene (including the promoter) through intron 1 (c.(−35)-1723_c.51+2664del4472). The exon 4–5 deletion spans introns 3 to intron 5 (c.185–1020_c.522+1844del5764). Conclusions Here we report two novel exonic deletions within the RS1 gene locus. We have also described the clinical presentations and hypothesized the genomic mechanisms underlying these schisis phenotypes. PMID:24227916
Characterization of novel RS1 exonic deletions in juvenile X-linked retinoschisis.

PubMed

D'Souza, Leera; Cukras, Catherine; Antolik, Christian; Craig, Candice; Lee, Ji-Yun; He, Hong; Li, Shibo; Smaoui, Nizar; Hejtmancik, James F; Sieving, Paul A; Wang, Xinjing

2013-01-01

X-linked juvenile retinoschisis (XLRS) is a vitreoretinal dystrophy characterized by schisis (splitting) of the inner layers of the neuroretina. Mutations within the retinoschisis (RS1) gene are responsible for this disease. The mutation spectrum consists of amino acid substitutions, splice site variations, small indels, and larger genomic deletions. Clinically, genomic deletions are rarely reported. Here, we characterize two novel full exonic deletions: one encompassing exon 1 and the other spanning exons 4-5 of the RS1 gene. We also report the clinical findings in these patients with XLRS with two different exonic deletions. Unrelated XLRS men and boys and their mothers (if available) were enrolled for molecular genetics evaluation. The patients also underwent ophthalmologic examination and in some cases electroretinogram (ERG) recording. All the exons and the flanking intronic regions of the RS1 gene were analyzed with direct sequencing. Two patients with exonic deletions were further evaluated with array comparative genomic hybridization to define the scope of the genomic aberrations. After the deleted genomic region was identified, primer walking followed by direct sequencing was used to determine the exact breakpoints. Two novel exonic deletions of the RS1 gene were identified: one including exon 1 and the other spanning exons 4 and 5. The exon 1 deletion extends from the 5' region of the RS1 gene (including the promoter) through intron 1 (c.(-35)-1723_c.51+2664del4472). The exon 4-5 deletion spans introns 3 to intron 5 (c.185-1020_c.522+1844del5764). Here we report two novel exonic deletions within the RS1 gene locus. We have also described the clinical presentations and hypothesized the genomic mechanisms underlying these schisis phenotypes.
Gene organization and alternative splicing of human prohormone convertase PC8.

PubMed Central

Goodge, K A; Thomas, R J; Martin, T J; Gillespie, M T

1998-01-01

The mammalian Ca2+-dependent serine protease prohormone convertase PC8 is expressed ubiquitously, being transcribed as 3.5, 4.3 and 6.0 kb mRNA isoforms in various tissues. To determine the origin of these various mRNA isoforms we report the characterization of the human PC8 gene, which has been previously localized to chromosome 11q23-24. Consisting of 16 exons, the human PC8 gene spans approx. 27 kb. A comparison of the position of intron-exon junctions of the human PC8 gene with the gene structures of previously reported prohormone convertase genes demonstrated a divergence of the human PC8 from the highly conserved nature of the gene organization of this enzyme family. The nucleotide sequence of the 5'-flanking region of the human PC8 is reported and possesses putative promoter elements characteristic of a GC-rich promoter. Further supporting the potential role of a GC-rich promoter element, multiple transcriptional initiation sites within a 200 bp region were demonstrated. We propose that the various mRNA isoforms of PC8 result from the inclusion of intronic sequences within transcripts. PMID:9820811
Mhc class II B gene evolution in East African cichlid fishes.

PubMed

Figueroa, F; Mayer, W E; Sültmann, H; O'hUigin, C; Tichy, H; Satta, Y; Takezaki, N; Takahata, N; Klein, J

2000-06-01

A distinctive feature of essential major histocompatibility complex (Mhc) loci is their polymorphism characterized by large genetic distances between alleles and long persistence times of allelic lineages. Since the lineages often span several successive speciations, we investigated the behavior of the Mhc alleles during or close to the speciation phase. We sequenced exon 2 of the class II B locus 4 from 232 East African cichlid fishes representing 32 related species. The divergence times of the (sub)species ranged from 6,000 to 8.4 million years. Two types of evolutionary analysis were used to elucidate the pattern of exon 2 sequence divergence. First, phylogenetic methods were applied to reconstruct the most likely evolutionary pathways leading from the last common ancestor of the set to the extant sequences, and to assess the probable mechanisms involved in allelic diversification. Second, pairwise comparisons of sequences were carried out to detect differences seemingly incompatible with origin by nonparallel point mutations. The analysis revealed point mutations to be the most important mechanism behind allelic divergences, with recombination playing only an auxiliary part. Comparison of sequences from related species revealed evidence of random allelic (lineage) losses apparently associated with speciation. Sharing of identical alleles could be demonstrated between species that diverged 2 million years ago. The phylogeny of the exon was incongruent with that of the flanking introns, indicating either a high degree of convergent evolution at the peptide-binding region-encoding sites, or intron homogenization.
Structure of the human type IV collagen COL4A6 gene, which is mutated in Alport syndrome-associated leiomyomatosis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Xu; Zhou, Jing; Reeders, S.T.

1996-05-01

Basement membrane (type IV) collagen, a subfamily of the collagen protein family, is encoded by six distinct genes in mammals. Three of those, COL4A3, COL4A4, and COL4A5, are linked with Alport syndrome (hereditary nephritis). Patients with leimoyomatosis associated with Alport syndrome have been shown to have deletions in the 5{prime} end of the COL4A6 gene, in addition to having deletions in COL4A6. The human COL4A6 gene is reported to be 425 kb as determined by mapping of overlapping YAC clones by probes for its 5{prime} and 3{prime} ends. In the present study we describe the complete exon/intron size pattern ofmore » the human COL4A6 gene. The 12 {lambda} phage clones characterized in the study spanned a total of 110 kb, including 85 kb of the actual gene and 25 kb of flanking sequences. The overlapping clones contained all 46 exons of the gene and all introns, except for intron 2. Since the total size of the exons and all introns except for intron 2 is about 85 kb, intron 2 must be about 340 kb. All exons of the gene were assigned to EcoRI restriction fragments to facilitate analysis of the gene in patients with leiomyomatosis associated with Alport syndrome. The exon size pattern of COL4A6 is highly homologous with that of the human and mouse COL4A2 genes, with 27 of the 46 exons of COL4A6 being identical in size between the genes. 42 refs., 2 figs., 3 tabs.« less
Gene-Based Single Nucleotide Polymorphism Markers for Genetic and Association Mapping in Common Bean

PubMed Central

2012-01-01

Background In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. Results In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. Conclusions In short, this study illustrates the power of intron-based markers for linkage and association mapping in common bean. The utility of these markers is discussed in relation with the usefulness of microsatellites, the molecular markers by excellence in this crop. PMID:22734675
Using a minigene approach to characterize a novel splice site mutation in human F7 gene causing inherited factor VII deficiency in a Chinese pedigree.

PubMed

Yu, T; Wang, X; Ding, Q; Fu, Q; Dai, J; Lu, Y; Xi, X; Wang, H

2009-11-01

Factor VII deficiency which transmitted as an autosomal recessive disorder is a rare haemorrhagic condition. The aim of this study was to identify the molecular genetic defect and determine its functional consequences in a Chinese pedigree with FVII deficiency. The proband was diagnosed as inherited coagulation FVII deficiency by reduced plasma levels of FVII activity (4.4%) and antigen (38.5%). All nine exons and their flanking sequence of F7 gene were amplified by polymerase chain reaction (PCR) for the proband and the PCR products were directly sequenced. The compound heterozygous mutations of F7 (NM_000131.3) c.572-1G>A and F7 (NM_000131.3) c.1165T>G; p.Cys389Gly were identified in the proband's F7 gene. To investigate the splicing patterns associated with F7 c.572-1G>A, ectopic transcripts in leucocytes of the proband were analyzed. F7 minigenes, spanning from intron 4 to intron 7 and carrying either an A or a G at position -1 of intron 5, were constructed and transiently transfected into human embryonic kidney (HEK) 293T cells, followed by RT-PCR analysis. The aberrant transcripts from the F7 c.572-1G>A mutant allele were not detected by ectopic transcription study. Sequencing of the RT-PCR products from the mutant transfectant demonstrated the production of an erroneously spliced mRNA with exon 6 skipping, whereas a normal splicing occurred in the wide type transfectant. The aberrant mRNA produced from the F7 c.572-1G>A mutant allele is responsible for the factor VII deficiency in this pedigree.
Characterization of the human gene (TBXAS1) encoding thromboxane synthase.

PubMed

Miyata, A; Yokoyama, C; Ihara, H; Bandoh, S; Takeda, O; Takahashi, E; Tanabe, T

1994-09-01

The gene encoding human thromboxane synthase (TBXAS1) was isolated from a human EMBL3 genomic library using human platelet thromboxane synthase cDNA as a probe. Nucleotide sequencing revealed that the human thromboxane synthase gene spans more than 75 kb and consists of 13 exons and 12 introns, of which the splice donor and acceptor sites conform to the GT/AG rule. The exon-intron boundaries of the thromboxane synthase gene were similar to those of the human cytochrome P450 nifedipine oxidase gene (CYP3A4) except for introns 9 and 10, although the primary sequences of these enzymes exhibited 35.8% identity each other. The 1.2-kb of the 5'-flanking region sequence contained potential binding sites for several transcription factors (AP-1, AP-2, GATA-1, CCAAT box, xenobiotic-response element, PEA-3, LF-A1, myb, basic transcription element and cAMP-response element). Primer-extension analysis indicated the multiple transcription-start sites, and the major start site was identified as an adenine residue located 142 bases upstream of the translation-initiation site. However, neither a typical TATA box nor a typical CAAT box is found within the 100-b upstream of the translation-initiation site. Southern-blot analysis revealed the presence of one copy of the thromboxane synthase gene per haploid genome. Furthermore, a fluorescence in situ hybridization study revealed that the human gene for thromboxane synthase is localized to band q33-q34 of the long arm of chromosome 7. A tissue-distribution study demonstrated that thromboxane synthase mRNA is widely expressed in human tissues and is particularly abundant in peripheral blood leukocyte, spleen, lung and liver. The low but significant levels of mRNA were observed in kidney, placenta and thymus.

Conservation and diversification of Msx protein in metazoan evolution.

PubMed

Takahashi, Hirokazu; Kamiya, Akiko; Ishiguro, Akira; Suzuki, Atsushi C; Saitou, Naruya; Toyoda, Atsushi; Aruga, Jun

2008-01-01

Msx (/msh) family genes encode homeodomain (HD) proteins that control ontogeny in many animal species. We compared the structures of Msx genes from a wide range of Metazoa (Porifera, Cnidaria, Nematoda, Arthropoda, Tardigrada, Platyhelminthes, Mollusca, Brachiopoda, Annelida, Echiura, Echinodermata, Hemichordata, and Chordata) to gain an understanding of the role of these genes in phylogeny. Exon-intron boundary analysis suggested that the position of the intron located N-terminally to the HDs was widely conserved in all the genes examined, including those of cnidarians. Amino acid (aa) sequence comparison revealed 3 new evolutionarily conserved domains, as well as very strong conservation of the HDs. Two of the three domains were associated with Groucho-like protein binding in both a vertebrate and a cnidarian Msx homolog, suggesting that the interaction between Groucho-like proteins and Msx proteins was established in eumetazoan ancestors. Pairwise comparison among the collected HDs and their C-flanking aa sequences revealed that the degree of sequence conservation varied depending on the animal taxa from which the sequences were derived. Highly conserved Msx genes were identified in the Vertebrata, Cephalochordata, Hemichordata, Echinodermata, Mollusca, Brachiopoda, and Anthozoa. The wide distribution of the conserved sequences in the animal phylogenetic tree suggested that metazoan ancestors had already acquired a set of conserved domains of the current Msx family genes. Interestingly, although strongly conserved sequences were recovered from the Vertebrata, Cephalochordata, and Anthozoa, the sequences from the Urochordata and Hydrozoa showed weak conservation. Because the Vertebrata-Cephalochordata-Urochordata and Anthozoa-Hydrozoa represent sister groups in the Chordata and Cnidaria, respectively, Msx sequence diversification may have occurred differentially in the course of evolution. We speculate that selective loss of the conserved domains in Msx family proteins contributed to the diversification of animal body organization.
RNA structure in splicing: An evolutionary perspective.

PubMed

Lin, Chien-Ling; Taggart, Allison J; Fairbrother, William G

2016-09-01

Pre-mRNA splicing is a key post-transcriptional regulation process in which introns are excised and exons are ligated together. A novel class of structured intron was recently discovered in fish. Simple expansions of complementary AC and GT dimers at opposite boundaries of an intron were found to form a bridging structure, thereby enforcing correct splice site pairing across the intron. In some fish introns, the RNA structures are strong enough to bypass the need of regulatory protein factors for splicing. Here, we discuss the prevalence and potential functions of highly structured introns. In humans, structured introns usually arise through the co-occurrence of C and G-rich repeats at intron boundaries. We explore the potentially instructive example of the HLA receptor genes. In HLA pre-mRNA, structured introns flank the exons that encode the highly polymorphic β sheet cleft, making the processing of the transcript robust to variants that disrupt splicing factor binding. While selective forces that have shaped HLA receptor are fairly atypical, numerous other highly polymorphic genes that encode receptors contain structured introns. Finally, we discuss how the elevated mutation rate associated with the simple repeats that often compose structured intron can make structured introns themselves rapidly evolving elements.
Circular RNA biogenesis can proceed through an exon-containing lariat precursor

PubMed Central

Barrett, Steven P; Wang, Peter L; Salzman, Julia

2015-01-01

Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical ‘backsplicing’ event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure. DOI: http://dx.doi.org/10.7554/eLife.07540.001 PMID:26057830
Group I introns are widespread in archaea.

PubMed

Nawrocki, Eric P; Jones, Thomas A; Eddy, Sean R

2018-05-18

Group I catalytic introns have been found in bacterial, viral, organellar, and some eukaryotic genomes, but not in archaea. All known archaeal introns are bulge-helix-bulge (BHB) introns, with the exception of a few group II introns. It has been proposed that BHB introns arose from extinct group I intron ancestors, much like eukaryotic spliceosomal introns are thought to have descended from group II introns. However, group I introns have little sequence conservation, making them difficult to detect with standard sequence similarity searches. Taking advantage of recent improvements in a computational homology search method that accounts for both conserved sequence and RNA secondary structure, we have identified 39 group I introns in a wide range of archaeal phyla, including examples of group I introns and BHB introns in the same host gene.
The prediction of human exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames

DOE Office of Scientific and Technical Information (OSTI.GOV)

Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.

1994-12-31

Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less
Gaucher disease: A G[sup +1][yields]A[sup +1] IVS2 splice donor site mutation causing exon 2 skipping in the acid [beta]-glucosidase mRNA

DOE Office of Scientific and Technical Information (OSTI.GOV)

He, Guo-Shun; Grabowski, G.A.

1992-10-01

Gaucher disease is the most frequent lysosomal storage disease and the most prevalent Jewish genetic disease. About 30 identified missense mutations are causal to the defective activity of acid [beta]-glucosidase in this disease. cDNAs were characterized from a moderately affected 9-year-old Ashkenazi Jewish Gaucher disease type 1 patient whose 80-years-old, enzyme-deficient, 1226G (Asn[sup 370][yields]Ser [N370S]) homozygous grandfather was nearly asymptomatic. Sequence analyses revealed four populations of cDNAs with either the 1226G mutation, an exact exon 2 ([Delta] EX2) deletion, a deletion of exon 2 and the first 115 bp of exon 3 ([Delta] EX2-3), or a completely normal sequence. Aboutmore » 50% of the cDNAs were the [Delta] EX2, the [Delta] EX2-3, and the normal cDNAs, in a ratio of 6:3:1. Specific amplification and characterization of exon 2 and 5[prime] and 3[prime] intronic flanking sequences from the structural gene demonstrated clones with either the normal sequence or with a G[sup +1][yields]A[sup +1] transition at the exon 2/intron 2 boundary. This mutation destroyed the splice donor consensus site (U1 binding site) for mRNA processing. This transition also was present at the corresponding exon/intron boundary of the highly homologous pseudogene. This new mutation, termed [open quotes]IVS2 G[sup +1],[close quotes] is the first in the Ashkenazi Jewish population. The occurrence of this [open quotes]pseudogene[close quotes]-type mutation in the structural gene indicates the role of acid [beta]-glucosidase pseudogene and structural gene rearrangements in the pathogenesis of this disease. 33 refs., 8 figs., 1 tab.« less
Genomic organization of the human gene (CA5) and pseudogene for mitochondrial carbonic anhydrase V and their localization to chromosomes 16q and 16p

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nagao, Yoshiro; Sly, W.S.; Batanian, J.R.

1995-08-10

Carbonic anhydrase V (CA V) is expressed in mitochondrial matrix in liver and several other tissues. It is of interest for its putative roles in providing bicarbonate to carbamoyl phosphate synthetase for ureagenesis and to pyruvate carboxylase for gluconeogenesis and its possible importance in explaining certain inherited metabolic disorders with hyperammonemia and hypoglycemia. Following the recent characterization of the cDNA for human CA V, we report the isolation of the human gene from two {lambda} genomic libraries and its characterization. The CA V gene (CA5) is approximately 50 kb long and contains 7 exons and 6 introns. The exon-intron boundariesmore » are found in positions identical to those determined for the previously described CA II, CA III, and CA VII genes. Like the CA VII gene, CA5 does not contain typical TATA and CAAT promoter elements in the 5{prime} flanking region but does contain a TTTAA sequence 147 nucleotides upstream of the initiation codon. CA5 also contains a 12-bp GT-rich segment beginning 13 bp downstream of the polyadenylation signal in the 3{prime} untranslated region of exon 7. FISH analysis allowed CA5 to be assigned to chromosome 16q24.3. An unprocessed pseudogene containing sequence homologous to exons 3-7 and introns 3-6 was also isolated and was assigned by FISH analysis to chromosome 16p11.2-p12. 22 refs., 4 figs., 1 tab.« less
Evolution of the myosin heavy chain gene MYH14 and its intronic microRNA miR-499: muscle-specific miR-499 expression persists in the absence of the ancestral host gene.

PubMed

Bhuiyan, Sharmin Siddique; Kinoshita, Shigeharu; Wongwarangkana, Chaninya; Asaduzzaman, Md; Asakawa, Shuichi; Watabe, Shugo

2013-07-06

A novel sarcomeric myosin heavy chain gene, MYH14, was identified following the completion of the human genome project. MYH14 contains an intronic microRNA, miR-499, which is expressed in a slow/cardiac muscle specific manner along with its host gene; it plays a key role in muscle fiber-type specification in mammals. Interestingly, teleost fish genomes contain multiple MYH14 and miR-499 paralogs. However, the evolutionary history of MYH14 and miR-499 has not been studied in detail. In the present study, we identified MYH14/miR-499 loci on various teleost fish genomes and examined their evolutionary history by sequence and expression analyses. Synteny and phylogenetic analyses depict the evolutionary history of MYH14/miR-499 loci where teleost specific duplication and several subsequent rounds of species-specific gene loss events took place. Interestingly, miR-499 was not located in the MYH14 introns of certain teleost fish. An MYH14 paralog, lacking miR-499, exhibited an accelerated rate of evolution compared with those containing miR-499, suggesting a putative functional relationship between MYH14 and miR-499. In medaka, Oryzias latipes, miR-499 is present where MYH14 is completely absent in the genome. Furthermore, by using in situ hybridization and small RNA sequencing, miR-499 was expressed in the notochord at the medaka embryonic stage and slow/cardiac muscle at the larval and adult stages. Comparing the flanking sequences of MYH14/miR-499 loci between torafugu Takifugu rubripes, zebrafish Danio rerio, and medaka revealed some highly conserved regions, suggesting that cis-regulatory elements have been functionally conserved in medaka miR-499 despite the loss of its host gene. This study reveals the evolutionary history of the MYH14/miRNA-499 locus in teleost fish, indicating divergent distribution and expression of MYH14 and miR-499 genes in different teleost fish lineages. We also found that medaka miR-499 was even expressed in the absence of its host gene. To our knowledge, this is the first report that shows the conversion of intronic into non-intronic miRNA during the evolution of a teleost fish lineage.
A molecular study of a family with Greek hereditary persistence of fetal hemoglobin and beta-thalassemia.

PubMed Central

Giglioni, B; Casini, C; Mantovani, R; Merli, S; Comi, P; Ottolenghi, S; Saglio, G; Camaschella, C; Mazza, U

1984-01-01

A family was studied in which two inherited defects of the non-alpha-globin cluster segregate: Greek hereditary persistence of fetal hemoglobin (HPFH) and beta-thalassemia. Fragments of the non-alpha-globin cluster from two patients were cloned in cosmid and phage lambda vectors, and assigned to either the HPFH or beta-thalassemic chromosome on the basis of the demonstration of a polymorphic BglII site in the HPFH gamma-globin cluster. The thalassemic beta-globin gene carries a mutation at nucleotide 1 of the intervening sequence I, known to cause beta zero-thalassemia; the beta-globin gene from the HPFH chromosome is entirely normal, both in the intron-exon sequence and in 5' flanking regions required for transcription. As the compound HPFH/beta-thalassemia heterozygote synthesizes HbA, these data prove that the HPFH beta-globin gene is functional, although at a decreased rate; its lower activity is likely to be due to a distant mutation. The HPFH A gamma-globin gene shows only two mutations: a T----C substitution in the large intervening sequence (responsible for the BglII polymorphic site) and a C----T substitution 196 nucleotides 5' to the cap site; the 5' flanking sequence is normal up to -1350 nucleotides upstream from the gene. Circumstantial evidence suggests that the mutation at -196 may be responsible for the abnormally high expression of the A gamma-globin gene. Images Fig. 1. Fig. 3. Fig. 4. Fig. 5. PMID:6210198
Structure and polymorphism of the mouse myelin/oligodendrocyte glycoprotein gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Daubas, P.; Pham-Dinh, D.; Dautigny, A.

1994-09-01

The authors have isolated and characterized genomic clones containing the mouse myelin/oligodendrocyte glycoprotein (MOG) gene. It spans a region of 12.5 kb and consists of eight exons. Its exon-intron structure differs from that of classical MHC-class I genes, with which it is linked in the mouse genome. Nucleotide sequencing of the 5{prime} flanking region revelas that it contains several putative protein-binding sites, some of them in common with other myelin gene promoters. One intragenic polymorphism has been identified: it consists of a GA repeat, defining at least three alleles in mouse inbred strains, and is easily detectable using the polymerasemore » chain reaction method.« less
Evolutionary dynamics of an expressed MHC class IIβ locus in the Ranidae (Anura) uncovered by genome walking and high-throughput amplicon sequencing

USGS Publications Warehouse

Mulder, Kevin P.; Cortazar-Chinarro, Maria; Harris, D. James; Crottini, Angelica; Grant, Evan H. Campbell; Fleischer, Robert C.; Savage, Anna E.

2017-01-01

The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa.
Evolutionary dynamics of an expressed MHC class IIβ locus in the Ranidae (Anura) uncovered by genome walking and high-throughput amplicon sequencing.

PubMed

Mulder, Kevin P; Cortazar-Chinarro, Maria; Harris, D James; Crottini, Angelica; Campbell Grant, Evan H; Fleischer, Robert C; Savage, Anna E

2017-11-01

The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa. Copyright © 2017 Elsevier Ltd. All rights reserved.
Bioinformatics analysis of plant orthologous introns: identification of an intronic tRNA-like sequence.

PubMed

Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab; Khandekar, Sushant; Crawford, Erin; Zirbel, Craig L; Leisner, Scott; Prakash, Ashwin; Fedorova, Larisa; Fedorov, Alexei

2014-09-10

Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals. Copyright © 2014 Elsevier B.V. All rights reserved.
Mitochondrial genes in the colourless alga Prototheca wickerhamii resemble plant genes in their exons but fungal genes in their introns.

PubMed Central

Wolff, G; Burger, G; Lang, B F; Kück, U

1993-01-01

The mitochondrial DNA from the colourless alga Prototheca wickerhamii contains two mosaic genes as was revealed from complete sequencing of the circular extranuclear genome. The genes for the large subunit of the ribosomal RNA (LSUrRNA) as well as for subunit I of the cytochrome oxidase (coxI) carry two and three intronic sequences respectively. On the basis of their canonical nucleotide sequences they can be classified as group I introns. Phylogenetic comparisons of the coxI protein sequences allow us to conclude that the P.wickerhamii mtDNA is much closer related to higher plant mtDNAs than to those of the chlorophyte alga C.reinhardtii. The comparison of the intron sequences revealed several unusual features: (1) The P.wickerhamii introns are structurally related to mitochondrial introns from various ascomycetous fungi. (2) Phylogenetic analyses indicate a close relationship between fungal and algal intronic sequences. (3) The P. wickerhamii introns are located at positions within the structural genes which can be considered as preferred intron insertion sites in homologous mitochondrial genes from fungi or liverwort. In all cases, the sequences adjacent to the insertion sites are very well conserved over large evolutionary distances. Our finding of highly similar introns in fungi and algae is consistent with the idea that introns have already been present in the bacterial ancestors of present day mitochondria and evolved concomitantly with the organelles. PMID:7680126
Remarkable sequence conservation of the last intron in the PKD1 gene.

PubMed

Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P

2003-10-01

The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.
Microbial and Natural Metabolites That Inhibit Splicing: A Powerful Alternative for Cancer Treatment.

PubMed

Martínez-Montiel, Nancy; Rosas-Murrieta, Nora Hilda; Martínez-Montiel, Mónica; Gaspariano-Cholula, Mayra Patricia; Martínez-Contreras, Rebeca D

2016-01-01

In eukaryotes, genes are frequently interrupted with noncoding sequences named introns. Alternative splicing is a nuclear mechanism by which these introns are removed and flanking coding regions named exons are joined together to generate a message that will be translated in the cytoplasm. This mechanism is catalyzed by a complex machinery known as the spliceosome, which is conformed by more than 300 proteins and ribonucleoproteins that activate and regulate the precision of gene expression when assembled. It has been proposed that several genetic diseases are related to defects in the splicing process, including cancer. For this reason, natural products that show the ability to regulate splicing have attracted enormous attention due to its potential use for cancer treatment. Some microbial metabolites have shown the ability to inhibit gene splicing and the molecular mechanism responsible for this inhibition is being studied for future applications. Here, we summarize the main types of natural products that have been characterized as splicing inhibitors, the recent advances regarding molecular and cellular effects related to these molecules, and the applications reported so far in cancer therapeutics.
Bottomless barrel-sponge species in the Indo-Pacific?

PubMed

Setiawan, Edwin; Voogd, Nicole J De; Wörheide, Gert; Erpenbeck, Dirk

2016-07-06

The use of nuclear markers, in addition to traditional mitochondrial markers, helps to clarify hidden patterns of genetic structure in natural populations (Palumbi & Baker, 1994). This is particularly evident among demosponges that possess slow mitochondrial evolutionary rates compared to Bilateria, where nuclear intron markers can aid in the understanding of shallow level phylogenetic relationships (Shearer et al., 2002). Ideally, these nuclear markers (i) are evolutionary well-conserved across different lineages, (ii) produce amplicons holding a number of sites with sufficient variability to answer the relevant phylogenetic question, (iii) derive from single copy genes (see review in Zhang & Hewitt, 2003). A popular method to amplify intron markers uses EPIC (Exon-Primed, Intron-Crossing) primers that anneal to the more conserved flanking exon regions and subsequently bridge the intron during amplification (Palumbi & Baker, 1994).
A Splice Defect in the EDA Gene in Dogs with an X-Linked Hypohidrotic Ectodermal Dysplasia (XLHED) Phenotype.

PubMed

Waluk, Dominik P; Zur, Gila; Kaufmann, Ronnie; Welle, Monika M; Jagannathan, Vidhya; Drögemüller, Cord; Müller, Eliane J; Leeb, Tosso; Galichet, Arnaud

2016-09-08

X-linked hypohidrotic ectodermal dysplasia (XLHED) caused by variants in the EDA gene represents the most common ectodermal dysplasia in humans. We investigated three male mixed-breed dogs with an ectodermal dysplasia phenotype characterized by marked hypotrichosis and multifocal complete alopecia, almost complete absence of sweat and sebaceous glands, and altered dentition with missing and abnormally shaped teeth. Analysis of SNP chip genotypes and whole genome sequence data from the three affected dogs revealed that the affected dogs shared the same haplotype on a large segment of the X-chromosome, including the EDA gene. Unexpectedly, the whole genome sequence data did not reveal any nonsynonymous EDA variant in the affected dogs. We therefore performed an RNA-seq experiment on skin biopsies to search for changes in the transcriptome. This analysis revealed that the EDA transcript in the affected dogs lacked 103 nucleotides encoded by exon 2. We speculate that this exon skipping is caused by a genetic variant located in one of the large introns flanking this exon, which was missed by whole genome sequencing with the illumina short read technology. The altered EDA transcript splicing most likely causes the observed ectodermal dysplasia in the affected dogs. These dogs thus offer an excellent opportunity to gain insights into the complex splicing processes required for expression of the EDA gene, and other genes with large introns. Copyright © 2016 Waluk et al.
A novel lens epithelium gene, LEP503, is highly conserved in different vertebrate species and is developmentally regulated in postnatal rat lens.

PubMed

Wen, Y; Sachs, G; Athmann, C

2000-02-01

The development of the lens is dependent on the proliferation of lens epithelial cells and their differentiation into fiber cells near the lens bow/equator. Identification of genes specifically expressed in the lens epithelial cells and their functions may provide insight into molecular events that regulate the processes of lens epithelial cell differentiation. In this study, a novel lens epithelium gene product, LEP503, identified from rat by a subtractive cDNA cloning strategy was investigated in the genome organization, mRNA expression and protein localization. The genomic sequences for LEP503 isolated from rat, mouse and human span 1754 bp, 1694 bp and 1895 bp regions encompassing the 5'-flanking region, two exons, one intron and 3'-flanking region. All exon-intron junction sequences conform to the GT/AG rule. Both mouse and human LEP503 genes show very high identity (93% for mouse and 79% for human) to rat LEP503 gene in the exon 1 that contains an open reading frame coding for a protein of 61 amino acid residues with a leucine-rich domain. The deduced protein sequences also show high identity (91% between mouse and rat and 77% between human and rat). Western blot shows that LEP503 is present as a specific approximately 6.9 kDa band in the water-insoluble-urea-soluble fraction of lens cortex where lens epithelium is included. Immuno-staining shows that LEP503 is localized in the epithelial cells along the entire anterior surface of rat lens. Developmentally, LEP503 is expressed at a low level at newborn, and then the expression level increases by about ten-fold around postnatal day 14 and remains at this high level for about 25 days before it drops back to the low level by postnatal day 84. These data suggest that the LEP503 may be an important lens epithelial cell gene involving the processes of epithelial cell differentiation. Copyright 2000 Academic Press.
Organization, chromosomal localization and promoter analysis of the gene encoding human acidic fibroblast growth factor intracellular binding protein.

PubMed Central

Kolpakova, E; Frengen, E; Stokke, T; Olsnes, S

2000-01-01

Acidic fibroblast growth factor (aFGF) intracellular binding protein (FIBP) is a protein found mainly in the nucleus that might be involved in the intracellular function of aFGF. Here we present a comparative analysis of the deduced amino acid sequences of human, murine and Drosophila FIBP analogues and demonstrate that FIBP is an evolutionarily conserved protein. The human gene spans more than 5 kb, comprising ten exons and nine introns, and maps to chromosome 11q13.1. Two slightly different splice variants found in different tissues were isolated and characterized. Sequence analysis of the region surrounding the translation start revealed a CpG island, a classical feature of widely expressed genes. Functional studies of the promoter region with a luciferase reporter system suggested a strong transcriptional activity residing within 600 bp of the 5' flanking region. PMID:11104667

Introns: The Functional Benefits of Introns in Genomes.

PubMed

Jo, Bong-Seok; Choi, Sun Shim

2015-12-01

The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced.
Identification of some ectomycorrhizal basidiomycetes by PCR amplification of their gpd (glyceraldehyde-3-phosphate dehydrogenase) genes.

PubMed Central

Kreuzinger, N; Podeu, R; Gruber, F; Göbl, F; Kubicek, C P

1996-01-01

Degenerated oligonucleotide primers designed to flank an approximately 1.2-kb fragment of the gene encoding glyceraldehyde-3-phosphate dehydrogenase (gpd) from ascomycetes and basidiomycetes were used to amplify the corresponding gpd fragments from several species of the ectomycorrhizal fungal taxa Boletus, Amanita, and Lactarius. Those from B. edulis, A. muscaria, and L. deterrimus were cloned and sequenced. The respective nucleotide sequences of these gene fragments showed a moderate degree of similarity (72 to 76%) in the protein-encoding regions and only a low degree of similarity in the introns (56 to 66%). Introns, where present, occurred at conserved positions, but the respective positions and numbers of introns in a given taxon varied. The amplified fragment from a given taxon could be distinguished from that of others by both restriction nuclease cleavage analysis and Southern hybridization. A procedure for labeling DNA probes with fluorescein-12-dUTP by PCR was developed. These probes were used in a nonradioactive hybridization assay, with which the gene could be detected in 2 ng of chromosomal DNA of L. deterrimus on slot blots. Taxon-specific amplification was achieved by the design of specific oligonucleotide primers. The application of the gpd gene for the identification of mycorrhizal fungi under field conditions was demonstrated, with Picea abies (spruce) mycorrhizal roots harvested from a northern alpine forest area as well as from a plant-breeding nursery. The interference by inhibitory substances, which sometimes occurred in the DNA extracted from the root-fungus mixture, could be overcome by using very diluted concentrations of template DNA for a first round of PCR amplification followed by a second round with nested oligonucleotide primers. We conclude that gpd can be used to detect ectomycorrhizal fungi during symbiotic interaction. PMID:8795234
Molecular analysis of the split cox1 gene from the Basidiomycota Agrocybe aegerita: relationship of its introns with homologous Ascomycota introns and divergence levels from common ancestral copies.

PubMed

Gonzalez, P; Barroso, G; Labarère, J

1998-10-05

The Basidiomycota Agrocybe aegerita (Aa) mitochondrial cox1 gene (6790 nucleotides), encoding a protein of 527aa (58377Da), is split by four large subgroup IB introns possessing site-specific endonucleases assumed to be involved in intron mobility. When compared to other fungal COX1 proteins, the Aa protein is closely related to the COX1 one of the Basidiomycota Schizophyllum commune (Sc). This clade reveals a relationship with the studied Ascomycota ones, with the exception of Schizosaccharomyces pombe (Sp) which ranges in an out-group position compared with both higher fungi divisions. When comparison is extended to other kingdoms, fungal COX1 sequences are found to be more related to algae and plant ones (more than 57.5% aa similarity) than to animal sequences (53.6% aa similarity), contrasting with the previously established close relationship between fungi and animals, based on comparisons of nuclear genes. The four Aa cox1 introns are homologous to Ascomycota or algae cox1 introns sharing the same location within the exonic sequences. The percentages of identity of the intronic nucleotide sequences suggest a possible acquisition by lateral transfers of ancestral copies or of their derived sequences. These identities extend over the whole intronic sequences, arguing in favor of a transfer of the complete intron rather than a transfer limited to the encoded ORF. The intron i4 shares 74% of identity, at the nucleotidic level, with the Podospora anserina (Pa) intron i14, and up to 90.5% of aa similarity between the encoded proteins, i.e. the highest values reported to date between introns of two phylogenetically distant species. This low divergence argues for a recent lateral transfer between the two species. On the contrary, the low sequence identities (below 36%) observed between Aa i1 and the homologous Sp i1 or Prototheca wickeramii (Pw) i1 suggest a long evolution time after the separation of these sequences. The introns i2 and i3 possessed intermediate percentages of identity with their homologous Ascomycota introns. This is the first report of the complete nucleotide sequence and molecular organization of a mitochondrial cox1 gene of any member of the Basidiomycota division.
Splicing-Related Features of Introns Serve to Propel Evolution

PubMed Central

Luo, Yuping; Li, Chun; Gong, Xi; Wang, Yanlu; Zhang, Kunshan; Cui, Yaru; Sun, Yi Eve; Li, Siguang

2013-01-01

The role of spliceosomal intronic structures played in evolution has only begun to be elucidated. Comparative genomic analyses of fungal snoRNA sequences, which are often contained within introns and/or exons, revealed that about one-third of snoRNA-associated introns in three major snoRNA gene clusters manifested polymorphisms, likely resulting from intron loss and gain events during fungi evolution. Genomic deletions can clearly be observed as one mechanism underlying intron and exon loss, as well as generation of complex introns where several introns lie in juxtaposition without intercalating exons. Strikingly, by tracking conserved snoRNAs in introns, we found that some introns had moved from one position to another by excision from donor sites and insertion into target sties elsewhere in the genome without needing transposon structures. This study revealed the origin of many newly gained introns. Moreover, our analyses suggested that intron-containing sequences were more prone to sustainable structural changes than DNA sequences without introns due to intron's ability to jump within the genome via unknown mechanisms. We propose that splicing-related structural features of introns serve as an additional motor to propel evolution. PMID:23516505
Polymorphism of BMP4 gene in Indian goat breeds differing in prolificacy.

PubMed

Sharma, Rekha; Ahlawat, Sonika; Maitra, A; Roy, Manoranjan; Mandakmale, S; Tantia, M S

2013-12-10

Bone morphogenetic proteins (BMPs) are members of the TGF-β (transforming growth factor-beta) superfamily, of which BMP4 is the most important due to its crucial role in follicular growth and differentiation, cumulus expansion and ovulation. Reproduction is a crucial trait in goat breeding and based on the important role of BMP4 gene in reproduction it was considered as a possible candidate gene for the prolificacy of goats. The objective of the present study was to detect polymorphism in intronic, exonic and 3' un-translated regions of BMP4 gene in Indian goats. Nine different goat breeds (Barbari, Beetal, Black Bengal, Malabari, Jakhrana (Twinning>40%), Osmanabadi, Sangamneri (Twinning 20-30%), Sirohi and Ganjam (Twinning<10%)) differing in prolificacy and geographic distribution were employed for polymorphism scanning. Cattle sequence (AC_000167.1) was used to design primers for the amplification of a targeted region followed by direct DNA sequencing to identify the genetic variations. Single nucleotide polymorphisms (SNPs) were not detected in exon 3, the intronic region and the 3' flanking region. A SNP (G1534A) was identified in exon 2. It was a non-synonymous mutation resulting in an arginine to lysine change in a corresponding protein sequence. G to A transition at the 1534 locus revealed two genotypes GG and GA in the nine investigated goat breeds. The GG genotype was predominant with a genotype frequency of 0.98. The GA genotype was present in the Black Bengal as well as Jakhrana breed with a genotype frequency of 0.02. A microsatellite was identified in the 3' flanking region, only 20 nucleotides downstream from the termination site of the coding region, as a short sequence with more than nineteen continuous and repeated CA dinucleotides. Since the gene is highly evolutionarily conserved, identification of a non-synonymous SNP (G1534A) in the coding region gains further importance. To our knowledge, this is the first report of a mutation in the coding region of the caprine BMP4 gene. But whether the reproduction trait of goat is associated with the BMP4 polymorphism, needs to be further defined by association studies in more populations so as to delineate an effect on it. © 2013 Elsevier B.V. All rights reserved.
Analysis of alterative cleavage and polyadenylation by 3′ region extraction and deep sequencing

PubMed Central

Hoque, Mainul; Ji, Zhe; Zheng, Dinghai; Luo, Wenting; Li, Wencheng; You, Bei; Park, Ji Yeon; Yehia, Ghassan; Tian, Bin

2012-01-01

Alternative cleavage and polyadenylation (APA) leads to mRNA isoforms with different coding sequences (CDS) and/or 3′ untranslated regions (3′UTRs). Using 3′ Region Extraction And Deep Sequencing (3′READS), a method which addresses the internal priming and oligo(A) tail issues that commonly plague polyA site (pA) identification, we comprehensively mapped pAs in the mouse genome, thoroughly annotating 3′ ends of genes and revealing over five thousand pAs (~8% of total) flanked by A-rich sequences, which have hitherto been overlooked. About 79% of mRNA genes and 66% of long non-coding RNA (lncRNA) genes have APA; but these two gene types have distinct usage patterns for pAs in introns and upstream exons. Promoter-distal pAs become relatively more abundant during embryonic development and cell differentiation, a trend affecting pAs in both 3′-most exons and upstream regions. Upregulated isoforms generally have stronger pAs, suggesting global modulation of the 3′ end processing activity in development and differentiation. PMID:23241633
[Detection and prenatal diagnosis for RS1 gene mutations in two Chinese families with X-linked juvenile retinoschisis].

PubMed

Chu, Yan; Fang, Dong; Hou, Qiao-fang; Wang, Li-ya; Guo, Xi-rang; Wang, Ying-tai; Liao, Shi-xiu

2013-04-01

To identify potential mutations of retinoschisis 1 (RS1) gene responsible for X-linked retinoschisis (XLRS) in two Chinese families. The 6 exons and flanking intronic regions were analyzed with PCR and direct sequencing. Two RS1 mutations were identified in the two families, which included 1 frameshift mutation (c.573delG, p.Pro192fs) and 1 missense mutation (c.626G>A, p.Arg209His). Two RS1 mutations have been identified, among which Pro192fs mutation is discovered for the first time in Chinese population. Above results may enrich our understanding of the clinical manifestations of XLRS and facilitated early diagnosis and genetic counseling for the disease.
Colonization of heterochromatic genes by transposable elements in Drosophila.

PubMed

Dimitri, Patrizio; Junakovic, Nikolaj; Arcà, Bruno

2003-04-01

As a further step toward understanding transposable element-host genome interactions, we investigated the molecular anatomy of introns from five heterochromatic and 22 euchromatic protein-coding genes of Drosophila melanogaster. A total of 79 kb of intronic sequences from heterochromatic genes and 355 kb of intronic sequences from euchromatic genes have been used in Blast searches against Drosophila transposable elements (TEs). The results show that TE-homologous sequences belonging to 19 different families represent about 50% of intronic DNA from heterochromatic genes. In contrast, only 0.1% of the euchromatic intron DNA exhibits homology to known TEs. Intraspecific and interspecific size polymorphisms of introns were found, which are likely to be associated with changes in TE-related sequences. Together, the enrichment in TEs and the apparent dynamic state of heterochromatic introns suggest that TEs contribute significantly to the evolution of genes located in heterochromatin.
Evolution of Mhc-DRB introns: implications for the origin of primates.

PubMed

Kupfermann, H; Satta, Y; Takahata, N; Tichy, H; Klein, J

1999-06-01

Introns are generally believed to evolve too rapidly and too erratically to be of much use in phylogenetic reconstructions. Few phylogenetically informative intron sequences are available, however, to ascertain the validity of this supposition. In the present study the supposition was tested on the example of the mammalian class II major histocompatibility complex (Mhc) genes of the DRB family. Since the Mhc genes evolve under balancing selection and are believed to recombine or rearrange frequently, the evolution of their introns could be expected to be particularly rapid and subject to scrambling. Sequences of intron 4 and 5 DRB genes were obtained from polymerase chain reaction-amplified fragments of genomic DNA from representatives of six eutherian orders-Primates, Scandentia, Chiroptera, Dermoptera, Lagomorpha, and Insectivora. Although short stretches of the introns have indeed proved to be unalignable, the bulk of the intron sequences from all six orders, spanning >85 million years (my) of evolution, could be aligned and used in a study of the tempo and mode of intron evolution. The analysis has revealed the Mhc introns to evolve at a rate similar to that of other genes and of synonymous sites of non-Mhc genes. No evidence of homogenization or large-scale scrambling of the intron sequences could be found. The Mhc introns apparently evolve largely by point mutations and insertions/deletions. The phylogenetic signals contained in the intron sequences could be used to identify Scandentia as the sister group of Primates, to support the existence of the Archonta superorder, and to confirm the monophyly of the Chiroptera.
An intronic open reading frame was released from one of group II introns in the mitochondrial genome of the haptophyte Chrysochromulina sp. NIES-1333

PubMed Central

Nishimura, Yuki; Kamikawa, Ryoma; Hashimoto, Tetsuo; Inagaki, Yuji

2014-01-01

Mitochondrial (mt) genome sequences, which often bear introns, have been sampled from phylogenetically diverse eukaryotes. Thus, we can anticipate novel insights into intron evolution from previously unstudied mt genomes. We here investigated the origins and evolution of three introns in the mt genome of the haptophyte Chrysochromulina sp. NIES-1333, which was sequenced completely in this study. All the three introns were characterized as group II, on the basis of predicted secondary structure, and the conserved sequence motifs at the 5′ and 3′ termini. Our comparative studies on diverse mt genomes prompt us to propose that the Chrysochromulina mt genome laterally acquired the introns from mt genomes in distantly related eukaryotes. Many group II introns harbor intronic open reading frames for the proteins (intron-encoded proteins or IEPs), which likely facilitate the splicing of their host introns. However, we propose that a “free-standing,” IEP-like protein, which is not encoded within any introns in the Chrysochromulina mt genome, is involved in the splicing of the first cox1 intron that lacks any open reading frames. PMID:25054084
Flanking genes of an essential gene give information about the evolution of metazoa.

PubMed

Zimek, Alexander; Weber, Klaus

2011-04-01

We collected as much information as possible on new lamin genes and their flanking genes. The number of lamin genes varies from 1 to 4 depending more or less on the phylogenetic position of the species. Strong genome drift is recognised by fewer and unusually placed introns and a change in flanking genes. This applies to the nematode Caenorhabditis elegans, the insect Drosophila melanogaster, the urochordate Ciona intestinalis, the annelid Capitella teleta and the planaria Schmidtea mediterranea. In contrast stable genomes show astonishing conservation of the flanking genes. These are identical in the sea anemone Nematostella vectensis and the cephalochordate Branchiostoma floridae lamin B1 gene. Even in the lamin B1 genes from Xenopus tropicalis and man one of the flanking genes is conserved. Finally our analysis forms the basis for a molecular analysis of metazoan phylogeny. Copyright © 2010 Elsevier GmbH. All rights reserved.
Sequence Variation of the tRNALeu Intron as a Marker for Genetic Diversity and Specificity of Symbiotic Cyanobacteria in Some Lichens

PubMed Central

Paulsrud, Per; Lindblad, Peter

1998-01-01

We examined the genetic diversity of Nostoc symbionts in some lichens by using the tRNALeu (UAA) intron as a genetic marker. The nucleotide sequence was analyzed in the context of the secondary structure of the transcribed intron. Cyanobacterial tRNALeu (UAA) introns were specifically amplified from freshly collected lichen samples without previous DNA extraction. The lichen species used in the present study were Nephroma arcticum, Peltigera aphthosa, P. membranacea, and P. canina. Introns with different sizes around 300 bp were consistently obtained. Multiple clones from single PCRs were screened by using their single-stranded conformational polymorphism pattern, and the nucleotide sequence was determined. No evidence for sample heterogenity was found. This implies that the symbiont in situ is not a diverse community of cyanobionts but, rather, one Nostoc strain. Furthermore, each lichen thallus contained only one intron type, indicating that each thallus is colonized only once or that there is a high degree of specificity. The same cyanobacterial intron sequence was also found in samples of one lichen species from different localities. In a phylogenetic analysis, the cyanobacterial lichen sequences grouped together with the sequences from two free-living Nostoc strains. The size differences in the intron were due to insertions and deletions in highly variable regions. The sequence data were used in discussions concerning specificity and biology of the lichen symbiosis. It is concluded that the tRNALeu (UAA) intron can be of great value when examining cyanobacterial diversity. PMID:9435083
Human Ro60 (SSA2) genomic organization and sequence alterations, examined in cutaneous lupus erythematosus.

PubMed

Millard, T P; Ashton, G H S; Kondeatis, E; Vaughan, R W; Hughes, G R V; Khamashta, M A; Hawk, J L M; McGregor, J M; McGrath, J A

2002-02-01

The Ro 60 kDa protein (Ro60 or SSA2) is the major component of the Ro ribonucleoprotein (Ro RNP) complex, to which an immune response is a specific feature of several autoimmune diseases. The genomic organization and any sequence variation within the DNA encoding Ro60 are unknown. To characterize the Ro60 gene structure and to assess whether any sequence alterations might be associated with serum anti-Ro antibody in subacute cutaneous lupus erythematosus (SCLE), thus potentially providing new insight into disease pathogenesis. The cDNA sequence for Ro60 was obtained from the NCBI database and used for a BLAST search for a clone containing the entire genomic sequence. The intron-exon borders were confirmed by designing intronic primer pairs to flank each exon, which were then used to amplify genomic DNA for automated sequencing from 36 caucasian patients with SCLE (anti-Ro positive) and 49 with discoid LE (DLE, anti-Ro negative), in addition to 36 healthy caucasian controls. Heteroduplex analysis of polymerase chain reaction (PCR) products from patients and controls spanning all Ro60 exons (1-8) revealed a common bandshift in the PCR products spanning exon 7. Sequencing of the corresponding PCR products demonstrated an A > G substitution at nucleotide position 1318-7, within the consensus acceptor splice site of exon 7 (GenBank XM001901). The allele frequencies were major allele A (0.71) and minor allele G (0.29) in 72 control chromosomes, with no significant differences found between SCLE patients, DLE patients and controls. The genomic organization of the DNA encoding the Ro60 protein is described, including a common polymorphism within the consensus acceptor splice site of exon 7. Our delineation of a strategy for the genomic amplification of Ro60 forms a basis for further examination of the pathological functions of the Ro RNP in autoimmune disease.
Exon definition as a potential negative force against intron losses in evolution.

PubMed

Niu, Deng-Ke

2008-11-13

Previous studies have indicated that the wide variation in intron density (the number of introns per gene) among different eukaryotes largely reflects varying degrees of intron loss during evolution. The most popular model, which suggests that organisms lose introns through a mechanism in which reverse-transcribed cDNA recombines with the genomic DNA, concerns only one mutational force. Using exons as the units of splicing-site recognition, exon definition constrains the length of exons. An intron-loss event results in fusion of flanking exons and thus a larger exon. The large size of the newborn exon may cause splicing errors, i.e., exon skipping, if the splicing of pre-mRNAs is initiated by exon definition. By contrast, if the splicing of pre-mRNAs is initiated by intron definition, intron loss does not matter. Exon definition may thus be a selective force against intron loss. An organism with a high frequency of exon definition is expected to experience a low rate of intron loss throughout evolution and have a high density of spliceosomal introns. The majority of spliceosomal introns in vertebrates may be maintained during evolution not because of potential functions, but because of their splicing mechanism (i.e., exon definition). Further research is required to determine whether exon definition is a negative force in maintaining the high intron density of vertebrates. This article was reviewed by Dr. Scott W. Roy (nominated by Dr. John Logsdon), Dr.Eugene V. Koonin, and Dr. Igor B. Rogozin (nominated by Dr. Mikhail Gelfand). For the full reviews,please go to the Reviewers' comments section.
Mechanism for DNA transposons to generate introns on genomic scales

PubMed Central

Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.

2017-01-01

Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology1. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns2,3. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain2,3. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals4–8. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases2 and prevalence of nucleosome-sized exons9 observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution2,3. PMID:27760113
The development and mapping of functional markers in Fragaria and their transferability and potential for mapping in other genera.

PubMed

Sargent, D J; Rys, A; Nier, S; Simpson, D W; Tobutt, K R

2007-01-01

We have developed 46 primer pairs from exon sequences flanking polymorphic introns of 23 Fragaria gene sequences and one Malus sequence deposited in the EMBL database. Sequencing of a set of the PCR products amplified with the novel primer pairs in diploid Fragaria showed the products to be homologous to the sequences from which the primers were originally designed. By scoring the segregation of the 24 genes in two diploid Fragaria progenies FV x FN (F. vesca x F. nubicola F(2)) and 815 x 903BC (F. vesca x F. viridis BC(1)) 29 genetic loci at discrete positions on the seven linkage groups previously characterised could be mapped, bringing to 35 the total number of known function genes mapped in Fragaria. Twenty primer pairs, representing 14 genes, amplified a product of the expected size in both Malus and Prunus. To demonstrate the applicability of these gene-specific loci to comparative mapping in Rosaceae, five markers that displayed clear polymorphism between the parents of a Malus and a Prunus mapping population were selected. The markers were then scored and mapped in at least one of the two additional progenies.
Detection of Self Incompatibility Genotypes in Prunus africana: Characterization, Evolution and Spatial Analysis

PubMed Central

Nantongo, Judith Ssali; Eilu, Gerald; Geburek, Thomas; Schueler, Silvio; Konrad, Heino

2016-01-01

In flowering plants, self-incompatibility is an effective genetic mechanism that prevents self-fertilization. Most Prunus tree species exhibit a homomorphic gametophytic self-incompatibility (GSI) system, in which the pollen phenotype is encoded by its own haploid genome. To date, no identification of S-alleles had been done in Prunus africana, the only member of the genus in Africa. To identify S-RNase alleles and hence determine S-genotypes in African cherry (Prunus africana) from Mabira Forest Reserve, Uganda, primers flanking the first and second intron were designed and these amplified two bands in most individuals. PCR bands on agarose indicated 26 and 8 different S-alleles for second and first intron respectively. Partial or full sequences were obtained for all these fragments. Comparison with published S-RNase data indicated that the amplified products were S-RNase alleles with very high interspecies homology despite the high intraspecific variation. Against expectations for a locus under balancing selection, frequency and spatial distribution of the alleles in a study plot was not random. Implications of the results to breeding efforts in the species are discussed, and mating experiments are strongly suggested to finally prove the functionality of SI in P. africana. PMID:27348423
Using the Developmental Gene Bicoid to Identify Species of Forensically Important Blowflies (Diptera: Calliphoridae)

PubMed Central

Park, Seong Hwan; Park, Chung Hyun; Zhang, Yong; Piao, Huguo; Chung, Ukhee; Kim, Seong Yoon; Ko, Kwang Soo; Yi, Cheong-Ho; Jo, Tae-Ho; Hwang, Juck-Joon

2013-01-01

Identifying species of insects used to estimate postmortem interval (PMI) is a major subject in forensic entomology. Because forensic insect specimens are morphologically uniform and are obtained at various developmental stages, DNA markers are greatly needed. To develop new autosomal DNA markers to identify species, partial genomic sequences of the bicoid (bcd) genes, containing the homeobox and its flanking sequences, from 12 blowfly species (Aldrichina grahami, Calliphora vicina, Calliphora lata, Triceratopyga calliphoroides, Chrysomya megacephala, Chrysomya pinguis, Phormia regina, Lucilia ampullacea, Lucilia caesar, Lucilia illustris, Hemipyrellia ligurriens and Lucilia sericata; Calliphoridae: Diptera) were determined and analyzed. This study first sequenced the ten blowfly species other than C. vicina and L. sericata. Based on the bcd sequences of these 12 blowfly species, a phylogenetic tree was constructed that discriminates the subfamilies of Calliphoridae (Luciliinae, Chrysomyinae, and Calliphorinae) and most blowfly species. Even partial genomic sequences of about 500 bp can distinguish most blowfly species. The short intron 2 and coding sequences downstream of the bcd homeobox in exon 3 could be utilized to develop DNA markers for forensic applications. These gene sequences are important in the evolution of insect developmental biology and are potentially useful for identifying insect species in forensic science. PMID:23586044
Comparative Analysis of the Complete Plastomes of Apostasia wallichii and Neuwiedia singapureana (Apostasioideae) Reveals Different Evolutionary Dynamics of IR/SSC Boundary among Photosynthetic Orchids.

PubMed

Niu, Zhitao; Pan, Jiajia; Zhu, Shuying; Li, Ludan; Xue, Qingyun; Liu, Wei; Ding, Xiaoyu

2017-01-01

Apostasioideae, consists of only two genera, Apostasia and Neuwiedia , which are mainly distributed in Southeast Asia and northern Australia. The floral structure, taxonomy, biogeography, and genome variation of Apostasioideae have been intensively studied. However, detailed analyses of plastome composition and structure and comparisons with those of other orchid subfamilies have not yet been conducted. Here, the complete plastome sequences of Apostasia wallichii and Neuwiedia singapureana were sequenced and compared with 43 previously published photosynthetic orchid plastomes to characterize the plastome structure and evolution in the orchids. Unlike many orchid plastomes (e.g., Paphiopedilum and Vanilla ), the plastomes of Apostasioideae contain a full set of 11 functional NADH dehydrogenase ( ndh ) genes. The distribution of repeat sequences and simple sequence repeat elements enhanced the view that the mutation rate of non-coding regions was higher than that of coding regions. The 10 loci- ndhA intron, matK-5'trnK , clpP-psbB , rps8-rpl14 , trnT-trnL , 3'trnK-matK , clpP intron , psbK-trnK , trnS-psbC , and ndhF-rpl32 -that had the highest degrees of sequence variability were identified as mutational hotspots for the Apostasia plastome. Furthermore, our results revealed that plastid genes exhibited a variable evolution rate within and among different orchid genus. Considering the diversified evolution of both coding and non-coding regions, we suggested that the plastome-wide evolution of orchid species was disproportional. Additionally, the sequences flanking the inverted repeat/small single copy (IR/SSC) junctions of photosynthetic orchid plastomes were categorized into three types according to the presence/absence of ndh genes. Different evolutionary dynamics for each of the three IR/SSC types of photosynthetic orchid plastomes were also proposed.
Comparative Analysis of the Complete Plastomes of Apostasia wallichii and Neuwiedia singapureana (Apostasioideae) Reveals Different Evolutionary Dynamics of IR/SSC Boundary among Photosynthetic Orchids

PubMed Central

Niu, Zhitao; Pan, Jiajia; Zhu, Shuying; Li, Ludan; Xue, Qingyun; Liu, Wei; Ding, Xiaoyu

2017-01-01

Apostasioideae, consists of only two genera, Apostasia and Neuwiedia, which are mainly distributed in Southeast Asia and northern Australia. The floral structure, taxonomy, biogeography, and genome variation of Apostasioideae have been intensively studied. However, detailed analyses of plastome composition and structure and comparisons with those of other orchid subfamilies have not yet been conducted. Here, the complete plastome sequences of Apostasia wallichii and Neuwiedia singapureana were sequenced and compared with 43 previously published photosynthetic orchid plastomes to characterize the plastome structure and evolution in the orchids. Unlike many orchid plastomes (e.g., Paphiopedilum and Vanilla), the plastomes of Apostasioideae contain a full set of 11 functional NADH dehydrogenase (ndh) genes. The distribution of repeat sequences and simple sequence repeat elements enhanced the view that the mutation rate of non-coding regions was higher than that of coding regions. The 10 loci—ndhA intron, matK-5′trnK, clpP-psbB, rps8-rpl14, trnT-trnL, 3′trnK-matK, clpP intron, psbK-trnK, trnS-psbC, and ndhF-rpl32—that had the highest degrees of sequence variability were identified as mutational hotspots for the Apostasia plastome. Furthermore, our results revealed that plastid genes exhibited a variable evolution rate within and among different orchid genus. Considering the diversified evolution of both coding and non-coding regions, we suggested that the plastome-wide evolution of orchid species was disproportional. Additionally, the sequences flanking the inverted repeat/small single copy (IR/SSC) junctions of photosynthetic orchid plastomes were categorized into three types according to the presence/absence of ndh genes. Different evolutionary dynamics for each of the three IR/SSC types of photosynthetic orchid plastomes were also proposed. PMID:29046685

Genomic structure of two ras family genes in the slime mold Physarum polycephalum.

PubMed

Trzcińska-Danielewicz, Joanna; Kozlowski, Piotr; Gierdal, Katarzyna; Wiejak, Jolanta; Jagielski, Adam; Toczko, Kazimierz; Fronk, Jan

2002-08-01

Genomic structure of two Physarum polycephalum ras family genes, Ppras2 and Pprap1, has been determined, including the upstream region of the latter. The genes are interrupted by three and four introns, respectively. The first intron of Ppras2 has the same location within the coding sequence as the first intron in another ras homolog from this organism, Ppras1 [Trzcińska-Danielewicz, J., Kozlowski, P., and Toczko, K. (1996). "Cloning and genomic sequence of the Physarum polycephalum Ppras1 gene, a homologue of the ras protooncogene", Gene 169, pp. 143-144]. All introns, ranging from 53 to ca. 460 base pairs, have the canonical 5' and 3' ends, are greatly enriched in pyrimidines in the coding strand and have frequent pyrimidines-only tracts. These latter features seem to be responsible for the difficulties in cloning and sequencing of parts of these genes. Short sequences shared with P. polycephalum transposon-like repeats are common in the introns, indicating a possible role of transposition in intron evolution. In all three ras family genes phase zero introns are located mostly between sequences coding for regular protein secondary structure elements.
Alternative Splicing of STAT3 Is Affected by RNA Editing.

PubMed

Goldberg, Lior; Abutbul-Amitai, Mor; Paret, Gideon; Nevo-Caspi, Yael

2017-05-01

A-to-I RNA editing, carried out by adenosine deaminase acting on RNA (ADAR) enzymes, is an epigenetic phenomenon of posttranscriptional modifications on pre-mRNA. RNA editing in intronic sequences may influence alternative splicing of flanking exons. We have previously shown that conditions that induce editing result in elevated expression of signal transducer and activator of transcription 3 (STAT3), preferentially the alternatively-spliced STAT3β isoform. Mechanisms regulating alternative splicing of STAT3 have not been elucidated. STAT3 undergoes A-to-I RNA editing in an intron residing in proximity to the alternatively spliced exon. We hypothesized that RNA editing plays a role in regulating alternative splicing toward STAT3β. In this study we extend our observation connecting RNA editing to the preferential induction of STAT3β expression. We study the involvement of ADAR1 in STAT3 editing and reveal the connection between editing and alternative splicing of STAT3. Deferoaxamine treatment caused the induction in STAT3 RNA editing and STAT3β expression. Silencing ADAR1 caused a decrease in STAT3 editing and expression with a preferential decrease in STAT3β. Cells transfected with a mutated minigene showed preferential splicing toward the STAT3β transcript. Editing in the STAT3 intron is performed by ADAR1 and affects STAT3 alternative splicing. These results suggest that RNA editing is one of the molecular mechanisms regulating the expression of STAT3β.
Intronic variants in the dopa decarboxylase (DDC) gene are associated with smoking behavior in European-Americans and African-Americans.

PubMed

Yu, Yi; Panhuysen, Carolien; Kranzler, Henry R; Hesselbrock, Victor; Rounsaville, Bruce; Weiss, Roger; Brady, Kathleen; Farrer, Lindsay A; Gelernter, Joel

2006-07-15

We report here a study considering association of alleles and haplotypes at the DOPA decarboxylase (DDC) locus with the DSM-IV diagnosis of nicotine dependence (ND) or a quantitative measure for ND using the Fagerstrom Test for Nicotine Dependence (FTND). We genotyped 18 single nucleotide polymorphisms (SNPs) spanning a region of approximately 210 kb that includes DDC and the genes immediately flanking DDC in 1,590 individuals from 621 families of African-American (AA) or European-American (EA) ancestry. Evidence of association (family-based tests) was observed with several SNPs for both traits (0.0002
Hidden genetic history of the Japanese sand dollar Peronella (Echinoidea: Laganidae) revealed by nuclear intron sequences.

PubMed

Endo, Megumi; Hirose, Mamiko; Honda, Masanao; Koga, Hiroyuki; Morino, Yoshiaki; Kiyomoto, Masato; Wada, Hiroshi

2018-06-15

The marine environment around Japan experienced significant changes during the Cenozoic Era. In this study, we report findings suggesting that this dynamic history left behind traces in the genome of the Japanese sand dollar species Peronella japonica and P. rubra. Although mitochondrial Cytochrome C Oxidase I sequences did not indicate fragmentation of the current local populations of P. japonica around Japan, two different types of intron sequence were found in the Alx1 locus. We inferred that past fragmentation of the populations account for the presence of two types of nuclear sequences as alleles in the Alx1 intron of P. japonica. It is likely that the split populations have intermixed in recent times; hence, we did not detect polymorphisms in the sequences reflecting the current localization of the species. In addition, we found two allelic sequences of theAlx1 intron in the sister species P. rubra. The divergence times of the two types of Alx1 intron sequences were estimated at approximately 14.9 and 4.0 million years ago for P. japonica and P. rubra, respectively. Our study indicates that information from the intron sequences of nuclear genes can enhance our understanding of past genetic events in organisms. Copyright © 2018 Elsevier B.V. All rights reserved.
Comparative Analysis of Vertebrate Dystrophin Loci Indicate Intron Gigantism as a Common Feature

PubMed Central

Pozzoli, Uberto; Elgar, Greg; Cagliani, Rachele; Riva, Laura; Comi, Giacomo P.; Bresolin, Nereo; Bardoni, Alessandra; Sironi, Manuela

2003-01-01

The human DMD gene is the largest known to date, spanning > 2000 kb on the X chromosome. The gene size is mainly accounted for by huge intronic regions. We sequenced 190 kb of Fugu rubripes (pufferfish) genomic DNA corresponding to the complete dystrophin gene (FrDMD) and provide the first report of gene structure and sequence comparison among dystrophin genomic sequences from different vertebrate organisms. Almost all intron positions and phases are conserved between FrDMD and its mammalian counterparts, and the predicted protein product of the Fugu gene displays 55% identity and 71% similarity to human dystrophin. In analogy to the human gene, FrDMD presents several-fold longer than average intronic regions. Analysis of intron sequences of the human and murine genes revealed that they are extremely conserved in size and that a similar fraction of total intron length is represented by repetitive elements; moreover, our data indicate that intron expansion through repeat accumulation in the two orthologs is the result of independent insertional events. The hypothesis that intron length might be functionally relevant to the DMD gene regulation is proposed and substantiated by the finding that dystrophin intron gigantism is common to the three vertebrate genes. [Supplemental material is available online at www.genome.org.] PMID:12727896
Mollusk genes encoding lysine tRNA (UUU) contain introns.

PubMed

Matsuo, M; Abe, Y; Saruta, Y; Okada, N

1995-11-20

New intron-containing genes encoding tRNAs were discovered when genomic DNA isolated from various animal species was amplified by the polymerase chain reaction (PCR) with primers based on sequences of rabbit tRNA(Lys). From sequencing analysis of the products of PCR, we found that introns are present in several genes encoding tRNA(Lys) in mollusks, such as Loligo bleekeri (squid) and Octopus vulgaris (octopus). These introns were specific to genes encoding tRNA(Lys)(CUU) and were not present in genes encoding tRNA(Lys)(CUU). In addition, the sequences of the introns were different from one another. To confirm the results of our initial experiments, we isolated and sequenced genes encoding tRNA(Lys)(CUU) and tRNA(Lys)(UUU). The gene for tRNA(Lys)(UUU) from squid contained an intron, whose sequence was the same as that identified by PCR, and the gene formed a cluster with a corresponding pseudogene. Several DNA regions of 2.1 kb containing this cluster appeared to be tandemly arrayed in the squid genome. By contrast, the gene encoding tRNA(Lys)(CUU) did not contain an intron, as shown also by PCR. The tRNA(Lys)(UUU) that corresponded to the analyzed gene was isolated and characterized. The present study provides the first example of an intron-containing gene encoding a tRNA in mollusks and suggests the universality of introns in such genes in higher eukaryotes.
Genetic Variation among Major Human Geographic Groups Supports a Peculiar Evolutionary Trend in PAX9

PubMed Central

Paixão-Côrtes, Vanessa R.; Meyer, Diogo; Pereira, Tiago V.; Mazières, Stéphane; Elion, Jacques; Krishnamoorthy, Rajagopal; Zago, Marco A.; Silva, Wilson A.; Salzano, Francisco M.; Bortolini, Maria Cátira

2011-01-01

A total of 172 persons from nine South Amerindian, three African and one Eskimo populations were studied in relation to the Paired box gene 9 (PAX9) exon 3 (138 base pairs) as well as its 5′and 3′flanking intronic segments (232 bp and 220 bp, respectively) and integrated with the information available for the same genetic region from individuals of different geographical origins. Nine mutations were scored in exon 3 and six in its flanking regions; four of them are new South American tribe-specific singletons. Exon3 nucleotide diversity is several orders of magnitude higher than its intronic regions. Additionally, a set of variants in the PAX9 and 101 other genes related with dentition can define at least some dental morphological differences between Sub-Saharan Africans and non-Africans, probably associated with adaptations after the modern human exodus from Africa. Exon 3 of PAX9 could be a good molecular example of how evolvability works. PMID:21298044
A novel nonsense mutation in CRYBB1 associated with autosomal dominant congenital cataract

PubMed Central

Yang, Juhua; Zhu, Yihua; Gu, Feng; He, Xiang; Cao, Zongfu; Li, Xuexi; Tong, Yi

2008-01-01

Purpose To identify the molecular defect underlying an autosomal dominant congenital nuclear cataract in a Chinese family. Methods Twenty-two members of a three-generation pedigree were recruited, clinical examinations were performed, and genomic DNA was extracted from peripheral blood leukocytes. All members were genotyped with polymorphic microsatellite markers adjacent to each of the known cataract-related genes. Linkage analysis was performed after genotyping. Candidate genes were screened for mutation using direct sequencing. Individuals were screened for presence of a mutation by restriction fragment length polymorphism (RFLP) analysis. Results Linkage analysis identified a maximum LOD score of 3.31 (recombination fraction [θ]=0.0) with marker D22S1167 on chromosome 22, which flanks the β-crystallin gene cluster (CRYBB3, CRYBB2, CRYBB1, and CRYBA4). Sequencing the coding regions and the flanking intronic sequences of these four candidate genes identified a novel, heterozygous C→T transition in exon 6 of CRYBB1 in the affected individuals of the family. This single nucleotide change introduced a novel BfaI site and was predicted to result in a nonsense mutation at codon 223 that changed a phylogenetically conserved amino acid to a stop codon (p.Q223X). RFLP analysis confirmed that this mutation co-segregated with the disease phenotype in all available family members and was not found in 100 normal unrelated individuals from the same ethnic background. Conclusions This study has identified a novel nonsense mutation in CRYBB1 (p.Q223X) associated with autosomal dominant congenital nuclear cataract. PMID:18432316
MiMIC: a highly versatile transposon insertion resource for engineering Drosophila melanogaster genes

PubMed Central

Venken, Koen J. T.; Schulze, Karen L.; Haelterman, Nele A.; Pan, Hongling; He, Yuchun; Evans-Holm, Martha; Carlson, Joseph W.; Levis, Robert W.; Spradling, Allan C.; Hoskins, Roger A.; Bellen, Hugo J.

2011-01-01

We demonstrate the versatility of a collection of insertions of the transposon Minos mediated integration cassette (MiMIC), in Drosophila melanogaster. MiMIC contains a gene-trap cassette and the yellow+ marker flanked by two inverted bacteriophage ΦC31 attP sites. MiMIC integrates almost at random in the genome to create sites for DNA manipulation. The attP sites allow the replacement of the intervening sequence of the transposon with any other sequence through recombinase mediated cassette exchange (RMCE). We can revert insertions that function as gene traps and cause mutant phenotypes to wild type by RMCE and modify insertions to control GAL4 or QF overexpression systems or perform lineage analysis using the Flp system. Insertions within coding introns can be exchanged with protein-tag cassettes to create fusion proteins to follow protein expression and perform biochemical experiments. The applications of MiMIC vastly extend the Drosophila melanogaster toolkit. PMID:21985007
Genetic study of the PAH locus in the Iranian population: familial gene mutations and minihaplotypes.

PubMed

Razipour, Masoumeh; Alavinejad, Elaheh; Sajedi, Seyede Zahra; Talebi, Saeed; Entezam, Mona; Mohajer, Neda; Kazemi-Sefat, Golnaz-Ensieh; Gharesouran, Jalal; Setoodeh, Aria; Mohaddes Ardebili, Seyyed Mojtaba; Keramatipour, Mohammad

2017-10-01

Phenylketonuria (PKU), one of the most common inborn errors of amino acid metabolism, is caused by mutations in the phenylalanine hydroxylase (PAH) gene (PAH). PKU has wide allelic heterogeneity, and over 600 different disease-causing mutations in PAH have been detected to date. Up to now, there have been no reports on the minihaplotype (VNTR/STR) analysis of PAH locus in the Iranian population. The aims of the present study were to determine PAH mutations and minihaplotypes in Iranian families with PAH deficiency and to investigate the correlation between them. A total of 81 Iranian families with PAH deficiency were examined using PCR-sequencing of all 13 PAH exons and their flanking intron regions to identify sequence variations. Fragment analysis of the PAH minihaplotypes was performed by capillary electrophoresis for 59 families. In our study, 33 different mutations were found accounting for 95% of the total mutant alleles. The majority of these mutations (72%) were distributed across exons 7, 11, 2 and their flanking intronic regions. Mutation c.1066-11G > A was the most common with a frequency of 20.37%. The less frequent mutations, p.Arg261Gln (8%), p.Arg243Ter (7.4%), p.Leu48Ser (7.4%), p.Lys363Asnfs*37 (6.79%), c.969 + 5G > A (6.17%), p.Pro281Leu (5.56), c.168 + 5G > C (5.56), and p.Arg261Ter (4.94) together comprised about 52% of all mutant alleles. In this study, a total of seventeen PAH gene minihaplotypes were detected, six of which associated exclusively with particular mutations. Our findings indicate a broad PAH mutation spectrum in the Iranian population, which is consistent with previous studies reporting a wide range of PAH mutations, most likely due to ethnic heterogeneity. High prevalence of c.1066-11G > A mutation linked to minihaplotype 7/250 among both Iranian and Mediterranean populations is indicative of historical and geographical links between them. Also, strong association between particular mutations and minihaplotypes could be useful for prenatal diagnosis (PND) and preimplantation genetic diagnosis (PGD) in affected families.
Euglena gracilis chloroplast DNA: analysis of a 1.6 kb intron of the psb C gene containing an open reading frame of 458 codons.

PubMed

Montandon, P E; Vasserot, A; Stutz, E

1986-01-01

We retrieved a 1.6 kbp intron separating two exons of the psb C gene which codes for the 44 kDa reaction center protein of photosystem II. This intron is 3 to 4 times the size of all previously sequenced Euglena gracilis chloroplast introns. It contains an open reading frame of 458 codons potentially coding for a basic protein of 54 kDa of yet unknown function. The intron boundaries follow consensus sequences established for chloroplast introns related to class II and nuclear pre-mRNA introns. Its 3'-terminal segment has structural features similar to class II mitochondrial introns with an invariant base A as possible branch point for lariat formation.
Putative cross-kingdom horizontal gene transfer in sponge (Porifera) mitochondria.

PubMed

Rot, Chagai; Goldfarb, Itay; Ilan, Micha; Huchon, Dorothée

2006-09-14

The mitochondrial genome of Metazoa is usually a compact molecule without introns. Exceptions to this rule have been reported only in corals and sea anemones (Cnidaria), in which group I introns have been discovered in the cox1 and nad5 genes. Here we show several lines of evidence demonstrating that introns can also be found in the mitochondria of sponges (Porifera). A 2,349 bp fragment of the mitochondrial cox1 gene was sequenced from the sponge Tetilla sp. (Spirophorida). This fragment suggests the presence of a 1143 bp intron. Similar to all the cnidarian mitochondrial introns, the putative intron has group I intron characteristics. The intron is present in the cox1 gene and encodes a putative homing endonuclease. In order to establish the distribution of this intron in sponges, the cox1 gene was sequenced from several representatives of the demosponge diversity. The intron was found only in the sponge order Spirophorida. A phylogenetic analysis of the COI protein sequence and of the intron open reading frame suggests that the intron may have been transmitted horizontally from a fungus donor. Little is known about sponge-associated fungi, although in the last few years the latter have been frequently isolated from sponges. We suggest that the horizontal gene transfer of a mitochondrial intron was facilitated by a symbiotic relationship between fungus and sponge. Ecological relationships are known to have implications at the genomic level. Here, an ecological relationship between sponge and fungus is suggested based on the genomic analysis.
Determinism and randomness in the evolution of introns and sine inserts in mouse and human mitochondrial solute carrier and cytokine receptor genes.

PubMed

Cianciulli, Antonia; Calvello, Rosa; Panaro, Maria A

2015-04-01

In the homologous genes studied, the exons and introns alternated in the same order in mouse and human. We studied, in both species: corresponding short segments of introns, whole corresponding introns and complete homologous genes. We considered the total number of nucleotides and the number and orientation of the SINE inserts. Comparisons of mouse and human data series showed that at the level of individual relatively short segments of intronic sequences the stochastic variability prevails in the local structuring, but at higher levels of organization a deterministic component emerges, conserved in mouse and human during the divergent evolution, despite the ample re-editing of the intronic sequences and the fact that processes such as SINE spread had taken place in an independent way in the two species. Intron conservation is negatively correlated with the SINE occupancy, suggesting that virus inserts interfere with the conservation of the sequences inherited from the common ancestor. Copyright © 2015 Elsevier Ltd. All rights reserved.
Molecular cloning of rat sperm galactosyl receptor, a C-type lectin with in vitro egg binding activity.

PubMed

Rivkin, E; Tres, L L; Kaplan-Kraicer, R; Shalgi, R; Kierszenbaum, A L

2000-07-01

Rat sperm galactosyl receptor is a member of the C-type animal lectin family showing preferential binding to N-acetylgalactosamine compared to galactose. Binding is mediated by a Ca(2+)-dependent carbohydrate-recognition domain (CRD) identical to that of the minor variant of rat hepatic lectin receptor 2/3 (RHL-2/3). The molecular organization of the genomic DNA, cDNA, and derived amino acid sequence of rat testis galactosyl receptor have been determined and in vitro fertilization studies were conducted to ascertain its role. We have determined that the rat testis galactosyl receptor gene generates two mRNA species: one species, designated liver-type, is identical to RHL-2/3; the other, designated testis-type, contains one unspliced intron (86 nt) which alters the reading frame and changes the amino acid sequence of the carboxyl terminus. As a result, the CRD (glutamine-proline-aspartic acid/QPD) and flanked Ca(2+)-binding amino acid sequences were not present in the testis-type protein. Northern and Southern blots demonstrated presence of transcripts with unspliced intron in rat sperm but not liver. Similarly, antibody, raised against a synthetic 12-amino acid peptide (p12) encoded by the unspliced intron, recognized in immunoblots a 54 kDa receptor protein in protein extracts from testis but not from liver. Immunofluorescence and immunogold electron microscopy studies demonstrated that both protein species localized on the plasma membrane surface of the head and tail of rat sperm. Furthermore, capacitated rat sperm preincubated with polyclonal antisera to RHL-2/3 or to the CRD of the liver-type galactosyl receptor showed a statistically significant decrease in the in vitro fertilization rate. We conclude that rat sperm galactosyl receptor may play a role in egg binding and that an undetermined molecular mechanism operates to generate two proteins with identical intracellular amino terminal domain but only one of them displays a CRD and associated Ca(2+)-binding sites at the carboxyl terminal extracellular domain. Copyright 2000 Wiley-Liss, Inc.
Spliced RNA of woodchuck hepatitis virus.

PubMed

Ogston, C W; Razman, D G

1992-07-01

Polymerase chain reaction was used to investigate RNA splicing in liver of woodchucks infected with woodchuck hepatitis virus (WHV). Two spliced species were detected, and the splice junctions were sequenced. The larger spliced RNA has an intron of 1300 nucleotides, and the smaller spliced sequence shows an additional downstream intron of 1104 nucleotides. We did not detect singly spliced sequences from which the smaller intron alone was removed. Control experiments showed that spliced sequences are present in both RNA and DNA in infected liver, showing that the viral reverse transcriptase can use spliced RNA as template. Spliced sequences were detected also in virion DNA prepared from serum. The upstream intron produces a reading frame that fuses the core to the polymerase polypeptide, while the downstream intron causes an inframe deletion in the polymerase open reading frame. Whereas the splicing patterns in WHV are superficially similar to those reported recently in hepatitis B virus, we detected no obvious homology in the coding capacity of spliced RNAs from these two viruses.
Novel USH2A compound heterozygous mutations cause RP/USH2 in a Chinese family.

PubMed

Liu, Xiaowen; Tang, Zhaohui; Li, Chang; Yang, Kangjuan; Gan, Guanqi; Zhang, Zibo; Liu, Jingyu; Jiang, Fagang; Wang, Qing; Liu, Mugen

2010-03-17

To identify the disease-causing gene in a four-generation Chinese family affected with retinitis pigmentosa (RP). Linkage analysis was performed with a panel of microsatellite markers flanking the candidate genetic loci of RP. These loci included 38 known RP genes. The complete coding region and exon-intron boundaries of Usher syndrome 2A (USH2A) were sequenced with the proband DNA to screen the disease-causing gene mutation. Restriction fragment length polymorphism (RFLP) analysis and direct DNA sequence analysis were done to demonstrate co-segregation of the USH2A mutations with the family disease. One hundred normal controls were used without the mutations. The disease-causing gene in this Chinese family was linked to the USH2A locus on chromosome 1q41. Direct DNA sequence analysis of USH2A identified two novel mutations in the patients: one missense mutation p.G1734R in exon 26 and a splice site mutation, IVS32+1G>A, which was found in the donor site of intron 32 of USH2A. Neither the p.G1734R nor the IVS32+1G>A mutation was found in the unaffected family members or the 100 normal controls. One patient with a homozygous mutation displayed only RP symptoms until now, while three patients with compound heterozygous mutations in the family of study showed both RP and hearing impairment. This study identified two novel mutations: p.G1734R and IVS32+1G>A of USH2A in a four-generation Chinese RP family. In this study, the heterozygous mutation and the homozygous mutation in USH2A may cause Usher syndrome Type II or RP, respectively. These two mutations expand the mutant spectrum of USH2A.
Molecular cloning of the mouse gene coding for {alpha}{sub 2}-macroglobulin and targeting of the gene in embryonic stem cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Umans, L.; Serneels, L.; Hilliker, C.

1994-08-01

The authors have cloned the mouse gene coding for {alpha}{sub 2}-macroglobulin in overlapping {lambda} clones and have analyzed its structure. The gene contains 36 exons, coding for the 4.8-kb cDNA that we cloned previously. Including putative control elements in the 5{prime} flanking region, the gene covers about 45 kb. A region of 3.8 kb, stretching from 835 bases upstream of the cDNA start site to exon 4, including all intervening sequences, was sequenced completely. The analysis demonstrated that the putative promoter region of the mouse A2M gene differed considerably from the known promoter sequences of the human A2M gene andmore » of the rat acute-phas A2M gene. Comparison of the exon-intron structure of all known genes of the A2M family confirmed that the rat acute phase A2M gene is more closely related to the human gene than to the mouse A2M gene. To generate mice with the A2M gene inactivated, an insertion type of construct containing 7.5 kb of genomic DNA of the mouse strain 129/J, encompassing exons 16 to 19, was synthesized. A hygromycin marker gene was embedded in intron 17. After electroporation, 198 hygromycin-resistant ES cell lines were isolated and analyzed by Southern blotting. Five ES cell lines were obtained with one allele of the mouse A2M gene targeted by this insertion construct, demonstrating that the position and the characteristics of the vector served the intended goal.« less
Resequencing of IRS2 reveals rare variants for obesity but not fasting glucose homeostasis in Hispanic children.

PubMed

Butte, Nancy F; Voruganti, V Saroja; Cole, Shelley A; Haack, Karin; Comuzzie, Anthony G; Muzny, Donna M; Wheeler, David A; Chang, Kyle; Hawes, Alicia; Gibbs, Richard A

2011-09-22

Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5' and 3' flanking regions of IRS2 (∼14.5 kb), were bidirectionally sequenced for single nucleotide polymorphism (SNP) discovery in 934 Hispanic children using 3730XL DNA Sequencers. Additionally, 15 SNPs derived from Illumina HumanOmni1-Quad BeadChips were analyzed. Measured genotype analysis tested associations between SNPs and obesity and diabetes-related traits. Bayesian quantitative trait nucleotide analysis was used to statistically infer the most likely functional polymorphisms. A total of 140 SNPs were identified with minor allele frequencies (MAF) ranging from 0.001 to 0.47. Forty-two of the 70 coding SNPs result in nonsynonymous amino acid substitutions relative to the consensus sequence; 28 SNPs were detected in the promoter, 12 in introns, 28 in the 3'-UTR, and 2 in the 5'-UTR. Two insertion/deletions (indels) were detected. Ten independent rare SNPs (MAF = 0.001-0.009) were associated with obesity-related traits (P = 0.01-0.00002). SNP 10510452_139 in the promoter region was shown to have a high posterior probability (P = 0.77-0.86) of influencing BMI, fat mass, and waist circumference in Hispanic children. SNP 10510452_139 contributed between 2 and 4% of the population variance in body weight and composition. None of the SNPs or indels were associated with diabetes-related traits or accounted for a previously identified quantitative trait locus on chromosome 13 for fasting serum glucose. Rare but not common IRS2 variants may play a role in the regulation of body weight but not an essential role in fasting glucose homeostasis in Hispanic children.
Genetic variations of VDR/NR1I1 encoding vitamin D receptor in a Japanese population.

PubMed

Ukaji, Maho; Saito, Yoshiro; Fukushima-Uesaka, Hiromi; Maekawa, Keiko; Katori, Noriko; Kaniwa, Nahoko; Yoshida, Teruhiko; Nokihara, Hiroshi; Sekine, Ikuo; Kunitoh, Hideo; Ohe, Yuichiro; Yamamoto, Noboru; Tamura, Tomohide; Saijo, Nagahiro; Sawada, Jun-ichi

2007-12-01

The vitamin D receptor (VDR) is a transcriptional factor responsive to 1alpha,25-dihydroxyvitamin D(3) and lithocholic acid, and induces expression of drug metabolizing enzymes CYP3A4, CYP2B6 and CYP2C9. In this study, the promoter regions, 14 exons (including 6 exon 1's) and their flanking introns of VDR were comprehensively screened for genetic variations in 107 Japanese subjects. Sixty-one genetic variations including 25 novel ones were found: 9 in the 5'-flanking region, 2 in the 5'-untranslated region (UTR), 7 in the coding exons (5 synonymous and 2 nonsynonymous variations), 12 in the 3'-UTR, 19 in the introns between the exon 1's, and 12 in introns 2 to 8. Of these, one novel nonsynonymous variation, 154A>G (Met52Val), was detected with an allele frequency of 0.005. The single nucleotide polymorphisms (SNPs) that increase VDR expression or activity, -29649G>A, 2T>C and 1592((*)308)C>A tagging linked variations in the 3'-UTR, were detected at 0.430, 0.636, and 0.318 allele frequencies, respectively. Another SNP, -26930A>G, with reduced VDR transcription was found at a 0.028 frequency. These findings would be useful for association studies on VDR variations in Japanese.
The utility of DNA sequences of an intron from the beta-fibrinogen gene in phylogenetic analysis of woodpeckers (Aves: Picidae).

PubMed

Prychitko, T M; Moore, W S

1997-10-01

Estimating phylogenies from DNA sequence data has become the major methodology of molecular phylogenetics. To date, molecular phylogenetics of the vertebrates has been very dependent on mtDNA, but studies involving mtDNA are limited because the several genes comprising the mt-genome are inherited as a single linkage group. The only apparent solution to this problem is to sequence additional genes, each representing a distinct linkage group, so that the resultant gene trees provide independent estimates of the species tree. There exists the need to find novel gene sequences which contain enough phylogenetic information to resolve relationships between closely related species. A possible source is the nuclear-encoded introns, because they evolve more rapidly than exons. We designed primers to amplify and sequence the 7 intron from the beta-fibrinogen gene for a recently evolved group, the woodpeckers. We sequenced the entire intron for 10 specimens representing five species. Nucleotide substitutions are randomly distributed along the length of the intron, suggesting selective neutrality. A preliminary analysis indicates that the phylogenetic signal in the intron is as strong as that in the mitochondrial encoded cytochrome b (cyt b) gene. The topology of the beta-fibrinogen tree is identical to that of the cyt b tree. This analysis demonstrates the ability of the 7 intron of beta-fibrinogen to provide well resolved, independent gene trees for recently evolved groups and establishes it as a source of sequences to be used in other phylogenetic studies. Copyright 1997 Academic Press

The wheat cytochrome oxidase subunit II gene has an intron insert and three radical amino acid changes relative to maize

PubMed Central

Bonen, Linda; Boer, Poppo H.; Gray, Michael W.

1984-01-01

We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. ImagesFig. 3.Fig. 4.Fig. 5. PMID:16453565
PlantRNA, a database for tRNAs of photosynthetic eukaryotes.

PubMed

Cognat, Valérie; Pawlak, Gaël; Duchêne, Anne-Marie; Daujat, Magali; Gigant, Anaïs; Salinas, Thalia; Michaud, Morgane; Gutmann, Bernard; Giegé, Philippe; Gobert, Anthony; Maréchal-Drouard, Laurence

2013-01-01

PlantRNA database (http://plantrna.ibmp.cnrs.fr/) compiles transfer RNA (tRNA) gene sequences retrieved from fully annotated plant nuclear, plastidial and mitochondrial genomes. The set of annotated tRNA gene sequences has been manually curated for maximum quality and confidence. The novelty of this database resides in the inclusion of biological information relevant to the function of all the tRNAs entered in the library. This includes 5'- and 3'-flanking sequences, A and B box sequences, region of transcription initiation and poly(T) transcription termination stretches, tRNA intron sequences, aminoacyl-tRNA synthetases and enzymes responsible for tRNA maturation and modification. Finally, data on mitochondrial import of nuclear-encoded tRNAs as well as the bibliome for the respective tRNAs and tRNA-binding proteins are also included. The current annotation concerns complete genomes from 11 organisms: five flowering plants (Arabidopsis thaliana, Oryza sativa, Populus trichocarpa, Medicago truncatula and Brachypodium distachyon), a moss (Physcomitrella patens), two green algae (Chlamydomonas reinhardtii and Ostreococcus tauri), one glaucophyte (Cyanophora paradoxa), one brown alga (Ectocarpus siliculosus) and a pennate diatom (Phaeodactylum tricornutum). The database will be regularly updated and implemented with new plant genome annotations so as to provide extensive information on tRNA biology to the research community.
Putative cross-kingdom horizontal gene transfer in sponge (Porifera) mitochondria

PubMed Central

Rot, Chagai; Goldfarb, Itay; Ilan, Micha; Huchon, Dorothée

2006-01-01

Background The mitochondrial genome of Metazoa is usually a compact molecule without introns. Exceptions to this rule have been reported only in corals and sea anemones (Cnidaria), in which group I introns have been discovered in the cox1 and nad5 genes. Here we show several lines of evidence demonstrating that introns can also be found in the mitochondria of sponges (Porifera). Results A 2,349 bp fragment of the mitochondrial cox1 gene was sequenced from the sponge Tetilla sp. (Spirophorida). This fragment suggests the presence of a 1143 bp intron. Similar to all the cnidarian mitochondrial introns, the putative intron has group I intron characteristics. The intron is present in the cox1 gene and encodes a putative homing endonuclease. In order to establish the distribution of this intron in sponges, the cox1 gene was sequenced from several representatives of the demosponge diversity. The intron was found only in the sponge order Spirophorida. A phylogenetic analysis of the COI protein sequence and of the intron open reading frame suggests that the intron may have been transmitted horizontally from a fungus donor. Conclusion Little is known about sponge-associated fungi, although in the last few years the latter have been frequently isolated from sponges. We suggest that the horizontal gene transfer of a mitochondrial intron was facilitated by a symbiotic relationship between fungus and sponge. Ecological relationships are known to have implications at the genomic level. Here, an ecological relationship between sponge and fungus is suggested based on the genomic analysis. PMID:16972986
Engineering an efficient and tight D-amino acid-inducible gene expression system in Rhodosporidium/Rhodotorula species.

PubMed

Liu, Yanbin; Koh, Chong Mei John; Ngoh, Si Te; Ji, Lianghui

2015-10-26

Rhodosporidium and Rhodotorula are two genera of oleaginous red yeast with great potential for industrial biotechnology. To date, there is no effective method for inducible expression of proteins and RNAs in these hosts. We have developed a luciferase gene reporter assay based on a new codon-optimized LUC2 reporter gene (RtLUC2), which is flanked with CAR2 homology arms and can be integrated into the CAR2 locus in the nuclear genome at >90 % efficiency. We characterized the upstream DNA sequence of a D-amino acid oxidase gene (DAO1) from R. toruloides ATCC 10657 by nested deletions. By comparing the upstream DNA sequences of several putative DAO1 homologs of Basidiomycetous fungi, we identified a conserved DNA motif with a consensus sequence of AGGXXGXAGX11GAXGAXGG within a 0.2 kb region from the mRNA translation initiation site. Deletion of this motif led to strong mRNA transcription under non-inducing conditions. Interestingly, DAO1 promoter activity was enhanced about fivefold when the 108 bp intron 1 was included in the reporter construct. We identified a conserved CT-rich motif in the intron with a consensus sequence of TYTCCCYCTCCYCCCCACWYCCGA, deletion or point mutations of which drastically reduced promoter strength under both inducing and non-inducing conditions. Additionally, we created a selection marker-free DAO1-null mutant (∆dao1e) which displayed greatly improved inducible gene expression, particularly when both glucose and nitrogen were present in high levels. To avoid adding unwanted peptide to proteins to be expressed, we converted the original translation initiation codon to ATC and re-created a translation initiation codon at the start of exon 2. This promoter, named P DAO1-in1m1 , showed very similar luciferase activity to the wild-type promoter upon induction with D-alanine. The inducible system was tunable by adjusting the levels of inducers, carbon source and nitrogen source. The intron 1-containing DAO1 promoters coupled with a DAO1 null mutant makes an efficient and tight D-amino acid-inducible gene expression system in Rhodosporidium and Rhodotorula genera. The system will be a valuable tool for metabolic engineering and enzyme expression in these yeast hosts.
An experimental system for the evaluation of retroviral vector design to diminish the risk for proto-oncogene activation

PubMed Central

Ryu, Byoung Y.; Evans-Galea, Marguerite V.; Gray, John T.; Bodine, David M.; Persons, Derek A.

2008-01-01

Pathogenic activation of the LMO2 proto-oncogene by an oncoretroviral vector insertion in a clinical trial for X-linked severe combined immunodeficiency (X-SCID) has prompted safety concerns. We used an adeno-associated virus vector to achieve targeted insertion of a γ-retroviral long terminal repeat (LTR) driving a GFP expression cassette with flanking loxP sites in a human T-cell line at the precise location of vector integration in one of the patients with X-SCID. The LTR-GFP cassette was inserted into the first intron of the LMO2 gene, resulting in strong activation of LMO2. Cre-mediated cassette exchange was used to replace the original LTR-GFP cassette with one flanked by insulator elements leading to a several fold reduction in LMO2 expression. The LTR-GFP cassette was also replaced with a globin gene regulatory cassette that failed to activate the LMO2 gene in lymphoid cells. A γ-retroviral vector with 2 intact LTRs resulted in activation of the LMO2 gene when inserted into the first intron, but a self-inactivating lentiviral vector with an internal cellular promoter and flanking insulator elements did not activate the LMO2 gene. Thus, this system is useful for comparing the safety profiles of vector cassettes with various regulatory elements for their potential for proto-oncogene activation. PMID:17991809
Chlamydomonas chloroplasts can use short dispersed repeats and multiple pathways to repair a double-strand break in the genome.

PubMed

Odom, Obed W; Baek, Kwang-Hyun; Dani, Radhika N; Herrin, David L

2008-03-01

Certain group I introns insert into intronless DNA via an endonuclease that creates a double-strand break (DSB). There are two models for intron homing in phage: synthesis-dependent strand annealing (SDSA) and double-strand break repair (DSBR). The Cr.psbA4 intron homes efficiently from a plasmid into the chloroplast psbA gene in Chlamydomonas, but little is known about the mechanism. Analysis of co-transformants selected using a spectinomycin-resistant 16S gene (16S(spec)) provided evidence for both pathways. We also examined the consequences of the donor DNA having only one-sided or no homology with the psbA gene. When there was no homology with the donor DNA, deletions of up to 5 kb involving direct repeats that flank the psbA gene were obtained. Remarkably, repeats as short as 15 bp were used for this repair, which is consistent with the single-strand annealing (SSA) pathway. When the donor had one-sided homology, the DSB in most co-transformants was repaired using two DNAs, the donor and the 16S(spec) plasmid, which, coincidentally, contained a region that is repeated upstream of psbA. DSB repair using two separate DNAs provides further evidence for the SDSA pathway. These data show that the chloroplast can repair a DSB using short dispersed repeats located proximally, distally, or even on separate molecules relative to the DSB. They also provide a rationale for the extensive repertoire of repeated sequences in this genome.
Splicing of designer exons informs a biophysical model for exon definition

PubMed Central

Arias, Mauricio A.; Chasin, Lawrence A.

2015-01-01

Pre-mRNA molecules in humans contain mostly short internal exons flanked by longer introns. To explain the removal of such introns, exon recognition instead of intron recognition has been proposed. We studied this exon definition using designer exons (DEs) made up of three prototype modules of our own design: an exonic splicing enhancer (ESE), an exonic splicing silencer (ESS), and a Reference Sequence (R) predicted to be neither. Each DE was examined as the central exon in a three-exon minigene. DEs made of R modules showed a sharp size dependence, with exons shorter than 14 nt and longer than 174 nt splicing poorly. Changing the strengths of the splice sites improved longer exon splicing but worsened shorter exon splicing, effectively displacing the curve to the right. For the ESE we found, unexpectedly, that its enhancement efficiency was independent of its position within the exon. For the ESS we found a step-wise positional increase in its effects; it was most effective at the 3′ end of the exon. To apply these results quantitatively, we developed a biophysical model for exon definition of internal exons undergoing cotranscriptional splicing. This model features commitment to inclusion before the downstream exon is synthesized and competition between skipping and inclusion fates afterward. Collision of both exon ends to form an exon definition complex was incorporated to account for the effect of size; ESE/ESS effects were modeled on the basis of stabilization/destabilization. This model accurately predicted the outcome of independent experiments on more complex DEs that combined ESEs and ESSs. PMID:25492963
Evolution of EF-hand calcium-modulated proteins. IV. Exon shuffling did not determine the domain compositions of EF-hand proteins

NASA Technical Reports Server (NTRS)

Kretsinger, R. H.; Nakayama, S.

1993-01-01

In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.
Influence of intron length on interaction characters between post-spliced intron and its CDS in ribosomal protein genes

NASA Astrophysics Data System (ADS)

Zhao, Xiaoqing; Li, Hong; Bao, Tonglaga; Ying, Zhiqiang

2012-09-01

Many experiment evidences showed that sequence structures of introns and intron loss/gain can influence gene expression, but current mechanisms did not refer to the functions of post-spliced introns directly. We propose that postspliced introns play their functions in gene expression by interacting with their mRNA sequences and the interaction is characterized by the matched segments between introns and their CDS. In this study, we investigated the interaction characters with length series by improved Smith-Waterman local alignment software for the ribosomal protein genes in C. elegans and D. melanogaster. Our results showed that RF values of five intron groups are significantly high in the central non-conserved region and very low in 5'-end and 3'-end splicing region. It is interesting that the number of the optimal matched regions gradually increases with intron length. Distributions of the optimal matched regions are different for five intron groups. Our study revealed that there are more interaction regions between longer introns and their CDS than shorter, and it provides a positive pattern for regulating the gene expression.
Novel strains of mice deficient for the vesicular acetylcholine transporter: insights on transcriptional regulation and control of locomotor behavior.

PubMed

Martins-Silva, Cristina; De Jaeger, Xavier; Guzman, Monica S; Lima, Ricardo D F; Santos, Magda S; Kushmerick, Christopher; Gomez, Marcus V; Caron, Marc G; Prado, Marco A M; Prado, Vania F

2011-03-10

Defining the contribution of acetylcholine to specific behaviors has been challenging, mainly because of the difficulty in generating suitable animal models of cholinergic dysfunction. We have recently shown that, by targeting the vesicular acetylcholine transporter (VAChT) gene, it is possible to generate genetically modified mice with cholinergic deficiency. Here we describe novel VAChT mutant lines. VAChT gene is embedded within the first intron of the choline acetyltransferase (ChAT) gene, which provides a unique arrangement and regulation for these two genes. We generated a VAChT allele that is flanked by loxP sequences and carries the resistance cassette placed in a ChAT intronic region (FloxNeo allele). We show that mice with the FloxNeo allele exhibit differential VAChT expression in distinct neuronal populations. These mice show relatively intact VAChT expression in somatomotor cholinergic neurons, but pronounced decrease in other cholinergic neurons in the brain. VAChT mutant mice present preserved neuromuscular function, but altered brain cholinergic function and are hyperactive. Genetic removal of the resistance cassette rescues VAChT expression and the hyperactivity phenotype. These results suggest that release of ACh in the brain is normally required to "turn down" neuronal circuits controlling locomotion.
Molecular gene organisation and secondary structure of the mitochondrial large subunit ribosomal RNA from the cultivated Basidiomycota Agrocybe aegerita: a 13 kb gene possessing six unusual nucleotide extensions and eight introns.

PubMed

Gonzalez, P; Barroso, G; Labarère, J

1999-04-01

The complete gene sequence and secondary structure of the mitochondrial LSU rRNA from the cultivated Basidiomycota Agrocybe aegerita was derived by chromosome walking. The A.aegerita LSU rRNA gene (13 526 nt) represents, to date, the longest described, due to the highest number of introns (eight) and the occurrence of six long nucleotidic extensions. Seven introns belong to group I, while the intronic sequence i5 constitutes the first typical group II intron reported in a fungal mitochondrial LSU rDNA. As with most fungal LSU rDNA introns reported to date, four introns (i5-i8) are distributed in domain V associated with the peptidyl-transferase activity. One intron (i1) is located in domain I, and three (i2-i4) in domain II. The introns i2-i8 possess homologies with other fungal, algal or protozoan introns located at the same position in LSU rDNAs. One of them (i6) is located at the same insertion site as most Ascomycota or algae LSU introns, suggesting a possible inheritance from a common ancestor. On the contrary, intron i1 is located at a so-far unreported insertion site. Among the six unusual nucleotide extensions, five are located in domain I and one in domain V. This is the first report of a mitochondrial LSU rRNA gene sequence and secondary structure for the whole Basidiomycota division.
Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting

PubMed Central

Piazza, Carol Lyn; Smith, Dorie

2018-01-01

Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis, inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. PMID:29905149
Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting.

PubMed

Qu, Guosheng; Piazza, Carol Lyn; Smith, Dorie; Belfort, Marlene

2018-06-15

Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis , inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. © 2018, Qu et al.
A 5′ Noncoding Exon Containing Engineered Intron Enhances Transgene Expression from Recombinant AAV Vectors in vivo

PubMed Central

Lu, Jiamiao; Williams, James A.; Luke, Jeremy; Zhang, Feijie; Chu, Kirk; Kay, Mark A.

2017-01-01

We previously developed a mini-intronic plasmid (MIP) expression system in which the essential bacterial elements for plasmid replication and selection are placed within an engineered intron contained within a universal 5′ UTR noncoding exon. Like minicircle DNA plasmids (devoid of bacterial backbone sequences), MIP plasmids overcome transcriptional silencing of the transgene. However, in addition MIP plasmids increase transgene expression by 2 and often >10 times higher than minicircle vectors in vivo and in vitro. Based on these findings, we examined the effects of the MIP intronic sequences in a recombinant adeno-associated virus (AAV) vector system. Recombinant AAV vectors containing an intron with a bacterial replication origin and bacterial selectable marker increased transgene expression by 40 to 100 times in vivo when compared with conventional AAV vectors. Therefore, inclusion of this noncoding exon/intron sequence upstream of the coding region can substantially enhance AAV-mediated gene expression in vivo. PMID:27903072
The low information content of Neurospora splicing signals: implications for RNA splicing and intron origin.

PubMed

Collins, Richard A; Stajich, Jason E; Field, Deborah J; Olive, Joan E; DeAbreu, Diane M

2015-05-01

When we expressed a small (0.9 kb) nonprotein-coding transcript derived from the mitochondrial VS plasmid in the nucleus of Neurospora we found that it was efficiently spliced at one or more of eight 5' splice sites and ten 3' splice sites, which are present apparently by chance in the sequence. Further experimental and bioinformatic analyses of other mitochondrial plasmids, random sequences, and natural nuclear genes in Neurospora and other fungi indicate that fungal spliceosomes recognize a wide range of 5' splice site and branchpoint sequences and predict introns to be present at high frequency in random sequence. In contrast, analysis of intronless fungal nuclear genes indicates that branchpoint, 5' splice site and 3' splice site consensus sequences are underrepresented compared with random sequences. This underrepresentation of splicing signals is sufficient to deplete the nuclear genome of splice sites at locations that do not comprise biologically relevant introns. Thus, the splicing machinery can recognize a wide range of splicing signal sequences, but splicing still occurs with great accuracy, not because the splicing machinery distinguishes correct from incorrect introns, but because incorrect introns are substantially depleted from the genome. © 2015 Collins et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Sequence analyses reveal that a TPR-DP module, surrounded by recombinable flanking introns, could be at the origin of eukaryotic Hop and Hip TPR-DP domains and prokaryotic GerD proteins.

PubMed

Hernández Torres, Jorge; Papandreou, Nikolaos; Chomilier, Jacques

2009-05-01

The co-chaperone Hop [heat shock protein (HSP) organising protein] is known to bind both Hsp70 and Hsp90. Hop comprises three repeats of a tetratricopeptide repeat (TPR) domain, each consisting of three TPR motifs. The first and last TPR domains are followed by a domain containing several dipeptide (DP) repeats called the DP domain. These analyses suggest that the hop genes result from successive recombination events of an ancestral TPR-DP module. From a hydrophobic cluster analysis of homologous Hop protein sequences derived from gene families, we can postulate that shifts in the open reading frames are at the origin of the present sequences. Moreover, these shifts can be related to the presence or absence of biological function. We propose to extend the family of Hop co-chaperons into the kingdom of bacteria, as several structurally related genes have been identified by hydrophobic cluster analysis. We also provide evidence of common structural characteristics between hop and hip genes, suggesting a shared precursor of ancestral TPR-DP domains.
Cystinuria Associated with Different SLC7A9 Gene Variants in the Cat

PubMed Central

Raj, Karthik; Osborne, Carl; Giger, Urs

2016-01-01

Cystinuria is a classical inborn error of metabolism characterized by a selective proximal renal tubular defect affecting cystine, ornithine, lysine, and arginine (COLA) reabsorption, which can lead to uroliths and urinary obstruction. In humans, dogs and mice, cystinuria is caused by variants in one of two genes, SLC3A1 and SLC7A9, which encode the rBAT and bo,+AT subunits of the bo,+ basic amino acid transporter system, respectively. In this study, exons and flanking regions of the SLC3A1 and SLC7A9 genes were sequenced from genomic DNA of cats (Felis catus) with COLAuria and cystine calculi. Relative to the Felis catus-6.2 reference genome sequence, DNA sequences from these affected cats revealed 3 unique homozygous SLC7A9 missense variants: one in exon 5 (p.Asp236Asn) from a non-purpose-bred medium-haired cat, one in exon 7 (p.Val294Glu) in a Maine Coon and a Sphinx cat, and one in exon 10 (p.Thr392Met) from a non-purpose-bred long-haired cat. A genotyping assay subsequently identified another cystinuric domestic medium-haired cat that was homozygous for the variant originally identified in the purebred cats. These missense variants result in deleterious amino acid substitutions of highly conserved residues in the bo,+AT protein. A limited population survey supported that the variants found were likely causative. The remaining 2 sequenced domestic short-haired cats had a heterozygous variant at a splice donor site in intron 10 and a homozygous single nucleotide variant at a branchpoint in intron 11 of SLC7A9, respectively. This study identifies the first SLC7A9 variants causing feline cystinuria and reveals that, as in humans and dogs, this disease is genetically heterogeneous in cats. PMID:27404572
Selection, trans-species polymorphism, and locus identification of major histocompatibility complex class IIβ alleles of New World ranid frogs

USGS Publications Warehouse

Kiemnec-Tyburczy, Karen M.; Richmond, Jonathan Q.; Savage, Anna E.; Zamudio, Kelly R.

2010-01-01

Genes encoded by the major histocompatibility complex (MHC) play key roles in the vertebrate immune system. However, our understanding of the evolutionary processes and underlying genetic mechanisms shaping these genes is limited in many taxa, including amphibians, a group currently impacted by emerging infectious diseases. To further elucidate the evolution of the MHC in frogs (anurans) and develop tools for population genetics, we surveyed allelic diversity of the MHC class II ??1 domain in both genomic and complementary DNA of seven New World species in the genus Rana (Lithobates). To assign locus affiliation to our alleles, we used a "gene walking" technique to obtain intron 2 sequences that flanked MHC class II?? exon 2. Two distinct intron sequences were recovered, suggesting the presence of at least two class II?? loci in Rana. We designed a primer pair that successfully amplified an orthologous locus from all seven Rana species. In total, we recovered 13 alleles and documented trans-species polymorphism for four of the alleles. We also found quantitative evidence of selection acting on amino acid residues that are putatively involved in peptide binding and structural stability of the ??1 domain of anurans. Our results indicated that primer mismatch can result in polymerase chain reaction (PCR) bias, which influences the number of alleles that are recovered. Using a single locus may minimize PCR bias caused by primer mismatch, and the gene walking technique was an effective approach for generating single-copy orthologous markers necessary for future studies of MHC allelic variation in natural amphibian populations. ?? 2010 Springer-Verlag.
[Mutation analysis of the PAH gene in children with phenylketonuria from the Qinghai area of China].

PubMed

He, Jiang; Wang, Hui-Zhen; Xu, Fa-Liang; Yang, Xi; Wang, Rui; Zou, Hong-Yun; Yu, Wu-Zhong

2015-11-01

To study the mutation characteristics of the phenylalanine hydroxylase (PAH) gene in children with phenylketonuria (PKU) from the Qinghai area of China, in order to provide basic information for genetic counseling and prenatal diagnosis. Mutations of the PAH gene were detected in the promoter and exons 1-13 and their flanking intronic sequences of PAH gene by PCR and DNA sequencing in 49 children with PKU and their parents from the Qinghai area of China. A total of 30 different mutations were detected in 80 out of 98 mutant alleles (82%), including 19 missense (63%), 5 nonsense (17%), 3 splice-site (10%) and 3 deletions (10%). Most mutations were detected in exons 3, 6, 7, 11 and intron 4 of PAH gene. The most frequent mutations were p.R243Q (19%), IVS4-1G>A (9%), p.Y356X (7%) and p.EX6-96A>G(5%). Two novel mutations p.N93fsX5 (c.279-282delCATC) and p.G171E (c.512G>A) were found. p.H64fsX9(c.190delC) was documented for the second time in Chinese PAH gene. The mutation spectrum of the gene PAH in the Qinghai population was similar to that in other populations in North China while significantly different from that in the populations from some provinces in southern China, Japan and Europe. The mutations of PAH gene in the Qinghai area of China demonstrate a unique diversity, complexity and specificity.
Polymorphism in Mitochondrial Group I Introns among Cryptococcus neoformans and Cryptococcus gattii Genotypes and Its Association with Drug Susceptibility.

PubMed

Gomes, Felipe E E S; Arantes, Thales D; Fernandes, José A L; Ferreira, Leonardo C; Romero, Héctor; Bosco, Sandra M G; Oliveira, Maria T B; Del Negro, Gilda M B; Theodoro, Raquel C

2018-01-01

Cryptococcosis, one of the most important systemic mycosis in the world, is caused by different genotypes of Cryptococcus neoformans and Cryptococcus gattii , which differ in their ecology, epidemiology, and antifungal susceptibility. Therefore, the search for new molecular markers for genotyping, pathogenicity and drug susceptibility is necessary. Group I introns fulfill the requisites for such task because (i) they are polymorphic sequences; (ii) their self-splicing is inhibited by some drugs; and (iii) their correct splicing under parasitic conditions is indispensable for pathogen survival. Here, we investigated the presence of group I introns in the mitochondrial LSU rRNA gene in 77 Cryptococcus isolates and its possible relation to drug susceptibility. Sequencing revealed two new introns in the LSU rRNA gene. All the introns showed high sequence similarity to other mitochondrial introns from distinct fungi, supporting the hypothesis of an ancient non-allelic invasion. Intron presence was statistically associated with those genotypes reported to be less pathogenic ( p < 0.001). Further virulence assays are needed to confirm this finding. In addition, in vitro antifungal tests indicated that the presence of LSU rRNA introns may influence the minimum inhibitory concentration (MIC) of amphotericin B and 5-fluorocytosine. These findings point to group I introns in the mitochondrial genome of Cryptococcus as potential molecular markers for antifungal resistance, as well as therapeutic targets.

Genome-wide identification of conserved intronic non-coding sequences using a Bayesian segmentation approach.

PubMed

Algama, Manjula; Tasker, Edward; Williams, Caitlin; Parslow, Adam C; Bryson-Richardson, Robert J; Keith, Jonathan M

2017-03-27

Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome-wide analysis to improved alignment quality, suggesting that enhanced genomic alignments may reveal many more conserved intronic sequences.
A 3,387 bp 5'-flanking sequence of the goat alpha-S1-casein gene provides correct tissue-specific expression of human granulocyte colony-stimulating factor (hG-CSF) in the mammary gland of transgenic mice.

PubMed

Serova, Irina A; Dvoryanchikov, Gennady A; Andreeva, Ludmila E; Burkov, Ivan A; Dias, Luciene P B; Battulin, Nariman R; Smirnov, Alexander V; Serov, Oleg L

2012-06-01

A new expression vector containing the 1,944 bp 5'-flanking regulatory region together with exon 1 and intron 1 of the goat alpha-S1-casein gene (CSN1S1), the full-sized human granulocyte colony-stimulating factor gene (hGCSF) and the 3'-flanking sequence of the bovine CSN1S1, was created. The vector DNA was used for generation of four mouse transgenic lines. The transgene was integrated into chromosomes 8 and 12 of two founders as 2 and 5 copies, respectively. Tissue-specific secretion of hG-CSF into the milk of transgenic mice was in the range of 19-40 μg/ml. RT-PCR analysis of various tissues of the transgenic mice demonstrated that expression of hGCSF was detected in only the mammary gland in the progeny of all founders. Moreover, cells were shown to be positive for hG-CSF by immunofluorescent analysis in the mammary glands but not in any other tissues. There were no signs of mosaic expression in the mammary gland. Trace amounts of hG-CSF were detected in the serum of females of two transgenic lines during lactation only. However, no transgenic mice showed any changes in hematopoiesis based on the number of granulocytes in blood. Immunoblotting of hG-CSF in the milk of transgenic mice revealed two forms, presumably the glycosylated and non-glycosylated forms. The hematopoietic activity of hG-CSF in the milk of transgenic females is comparable to that of recombinant G-CSF. In general, the data obtained in this study show that the new expression vector is able to provide correct tissue-specific expression of hG-CSF with high biological activity in transgenic mice.
a Simple Symmetric Algorithm Using a Likeness with Introns Behavior in RNA Sequences

NASA Astrophysics Data System (ADS)

Regoli, Massimo

2009-02-01

The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences has some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algoritnm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
Interactions between the promoter and first intron are involved in transcriptional control of alpha 1(I) collagen gene expression.

PubMed Central

Bornstein, P; McKay, J; Liska, D J; Apone, S; Devarayalu, S

1988-01-01

The first intron of the human collagen alpha 1(I) gene contains several positively and negatively acting elements. We have studied the transcription of collagen-human growth hormone fusion genes, containing deletions and rearrangements of collagen intronic sequences, by transient transfection of chick tendon fibroblasts and NIH 3T3 cells. In chick tendon fibroblasts, but not in 3T3 cells, inversion of intronic sequences containing a previously studied 274-base-pair segment, A274, resulted in markedly reduced human growth hormone mRNA levels as determined by an RNase protection assay. This inhibitory effect was largely alleviated when deletions were introduced in the collagen promoter of plasmids containing negatively oriented intronic sequences. Evidence for interaction of the promoter with the intronic segment, A274, was obtained by gel mobility shift assays. We suggest that promoter-intron interactions, mediated by DNA-binding proteins, regulate collagen gene transcription. Inversion of intronic segments containing critical interactive elements might then lead to an altered geometry and reduced activity of a transcriptional complex in those cells with sufficiently high levels of appropriate transcription factors. We further suggest that the deleted promoter segment plays a key role in directing DNA interactions involved in transcriptional control. Images PMID:3211130
A common class of transcripts with 5'-intron depletion, distinct early coding sequence features, and N1-methyladenosine modification.

PubMed

Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P

2017-03-01

Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Genetic analysis of a Chinese family with members affected with Usher syndrome type II and Waardenburg syndrome type IV.

PubMed

Wang, Xueling; Lin, Xiao-Jiang; Tang, Xiangrong; Chai, Yong-Chuan; Yu, De-Hong; Chen, Dong-Ye; Wu, Hao

2017-11-01

The purpose of this study was to identify the genetic causes of a family presenting with multiple symptoms overlapping Usher syndrome type II (USH2) and Waardenburg syndrome type IV (WS4). Targeted next-generation sequencing including the exon and flanking intron sequences of 79 deafness genes was performed on the proband. Co-segregation of the disease phenotype and the detected variants were confirmed in all family members by PCR amplification and Sanger sequencing. The affected members of this family had two different recessive disorders, USH2 and WS4. By targeted next-generation sequencing, we identified that USH2 was caused by a novel missense mutation, p.V4907D in GPR98; whereas WS4 due to p.V185M in EDNRB. This is the first report of homozygous p.V185M mutation in EDNRB in patient with WS4. This study reported a Chinese family with multiple independent and overlapping phenotypes. In condition, molecular level analysis was efficient to identify the causative variant p.V4907D in GPR98 and p.V185M in EDNRB, also was helpful to confirm the clinical diagnosis of USH2 and WS4. Copyright © 2017 Elsevier B.V. All rights reserved.
An intron within the 16S ribosomal RNA gene of the archaeon Pyrobaculum aerophilum

NASA Technical Reports Server (NTRS)

Burggraf, S.; Larsen, N.; Woese, C. R.; Stetter, K. O.

1993-01-01

The 16S rRNA genes of Pyrobaculum aerophilum and Pyrobaculum islandicum were amplified by the polymerase chain reaction, and the resulting products were sequenced directly. The two organisms are closely related by this measure (over 98% similar). However, they differ in that the (lone) 16S rRNA gene of Pyrobaculum aerophilum contains a 713-bp intron not seen in the corresponding gene of Pyrobaculum islandicum. To our knowledge, this is the only intron so far reported in the small subunit rRNA gene of a prokaryote. Upon excision the intron is circularized. A secondary structure model of the intron-containing rRNA suggests a splicing mechanism of the same type as that invoked for the tRNA introns of the Archaea and Eucarya and 23S rRNAs of the Archaea. The intron contains an open reading frame whose protein translation shows no certain homology with any known protein sequence.
Patterns and rates of intron divergence between humans and chimpanzees

PubMed Central

Gazave, Elodie; Marqués-Bonet, Tomàs; Fernando, Olga; Charlesworth, Brian; Navarro, Arcadi

2007-01-01

Background Introns, which constitute the largest fraction of eukaryotic genes and which had been considered to be neutral sequences, are increasingly acknowledged as having important functions. Several studies have investigated levels of evolutionary constraint along introns and across classes of introns of different length and location within genes. However, thus far these studies have yielded contradictory results. Results We present the first analysis of human-chimpanzee intron divergence, in which differences in the number of substitutions per intronic site (Ki) can be interpreted as the footprint of different intensities and directions of the pressures of natural selection. Our main findings are as follows: there was a strong positive correlation between intron length and divergence; there was a strong negative correlation between intron length and GC content; and divergence rates vary along introns and depending on their ordinal position within genes (for instance, first introns are more GC rich, longer and more divergent, and divergence is lower at the 3' and 5' ends of all types of introns). Conclusion We show that the higher divergence of first introns is related to their larger size. Also, the lower divergence of short introns suggests that they may harbor a relatively greater proportion of regulatory elements than long introns. Moreover, our results are consistent with the presence of functionally relevant sequences near the 5' and 3' ends of introns. Finally, our findings suggest that other parts of introns may also be under selective constraints. PMID:17309804
Identification of Genomic Insertion and Flanking Sequence of G2-EPSPS and GAT Transgenes in Soybean Using Whole Genome Sequencing Method.

PubMed

Guo, Bingfu; Guo, Yong; Hong, Huilong; Qiu, Li-Juan

2016-01-01

Molecular characterization of sequence flanking exogenous fragment insertion is essential for safety assessment and labeling of genetically modified organism (GMO). In this study, the T-DNA insertion sites and flanking sequences were identified in two newly developed transgenic glyphosate-tolerant soybeans GE-J16 and ZH10-6 based on whole genome sequencing (WGS) method. More than 22.4 Gb sequence data (∼21 × coverage) for each line was generated on Illumina HiSeq 2500 platform. The junction reads mapped to boundaries of T-DNA and flanking sequences in these two events were identified by comparing all sequencing reads with soybean reference genome and sequence of transgenic vector. The putative insertion loci and flanking sequences were further confirmed by PCR amplification, Sanger sequencing, and co-segregation analysis. All these analyses supported that exogenous T-DNA fragments were integrated in positions of Chr19: 50543767-50543792 and Chr17: 7980527-7980541 in these two transgenic lines. Identification of genomic insertion sites of G2-EPSPS and GAT transgenes will facilitate the utilization of their glyphosate-tolerant traits in soybean breeding program. These results also demonstrated that WGS was a cost-effective and rapid method for identifying sites of T-DNA insertions and flanking sequences in soybean.
Tobacco chloroplast tRNALys(UUU) gene contains a 2.5-kilobase-pair intron: An open reading frame and a conserved boundary sequence in the intron

PubMed Central

Sugita, Mamoru; Shinozaki, Kazuo; Sugiura, Masahiro

1985-01-01

The nucleotide sequence of a tRNALys(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNAGly(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long. Images PMID:16593561
Tobacco chloroplast tRNA(UUU) gene contains a 2.5-kilobase-pair intron: An open reading frame and a conserved boundary sequence in the intron.

PubMed

Sugita, M; Shinozaki, K; Sugiura, M

1985-06-01

The nucleotide sequence of a tRNA(Lys)(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNA(Gly)(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long.
The gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis contains a group I intron.

PubMed Central

De Wachter, R; Neefs, J M; Goris, A; Van de Peer, Y

1992-01-01

The nucleotide sequence of the gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis was determined. It revealed the presence of a group I intron with a length of 411 nucleotides. This is the third occurrence of such an intron discovered in a small subunit rRNA gene encoded by a eukaryotic nuclear genome. The other two occurrences are in Pneumocystis carinii, a fungus of uncertain taxonomic status, and Ankistrodesmus stipitatus, a green alga. The nucleotides of the conserved core structure of 101 group I intron sequences present in different genes and genome types were aligned and their evolutionary relatedness was examined. This revealed a cluster including all group I introns hitherto found in eukaryotic nuclear genes coding for small and large subunit rRNAs. A secondary structure model was designed for the area of the Ustilago maydis small ribosomal subunit RNA precursor where the intron is situated. It shows that the internal guide sequence pairing with the intron boundaries fits between two helices of the small subunit rRNA, and that minimal rearrangement of base pairs suffices to achieve the definitive secondary structure of the 18S rRNA upon splicing. PMID:1561081
The intron 1 of HPV 16 has a suboptimal branch point at a guanosine.

PubMed

De la Rosa-Rios, Marco Antonio; Martínez-Salazar, Martha; Martínez-Garcia, Martha; González-Bonilla, César; Villegas-Sepúlveda, Nicolás

2006-06-01

The branch point sequence (BPS) of intron 1 of the HPV-16 was determined via RT-PCR in a cell free system, using lariat intermediates obtained by in vitro splicing reactions. We used synthetic E6/E7 transcripts and HeLa nuclear protein extracts to obtain the splicing intermediates. Then, a divergent oligonucleotide primer set, pairing on the lariat RNA that encompassed the 2'-5' phosphodiester bond formed between the 5' end of the intron and the BPS, was used for cDNA synthesis and PCR amplification. Subsequent RT-PCR assays revealed four splicing intermediates, made up of a major intermediary corresponding to the BPS and four cryptic branched sequences. Only intermediates bound at the 5' end of the intron are probably the authentic branch point sequence, and all of them branch at guanosine 328 instead of the typical adenosine. Unusually, the BPS of intron 1 of HPV-16 is a suboptimal sequence (AGUGAGU) that differs from the eukaryotic consensus BPS, which correlates with the splicing profile observed for early transcripts of HPV-16 in tumors and tumor derived cell lines. The implications of this unusual branch point sequence for splicing of the HPV-16 pre-mRNA are discussed.
[Identifying and sequence analysis of HLA-B*2736].

PubMed

Li, Zhen; Zou, Hong-Yan; Shao, Chao-Peng; Tang, Si; Wang, Da-Ming; Cheng, Liang-Hong

2007-11-01

An unknown HLA-B allele which was similar to HLA-B*270401 was detected by FLOW-SSOPCR-SSP and heterozygous sequence-based typing (SBT) in Chinese Han individual. Its anomalous patterns suggested the possible presence of new allele. Amplifying exon 2-5(include intron 2-4) of the HLA-B*27 allele separately by using allele-specific primers and sequencing in both directions. Identifying the difference between the novel B*27 allele and B*270401. The sequence of novel B*27 from exon 2 to partial exon 5 is 1 815 bp. There are 10 nt changes from B*270401 in exon 3-4, at nt634where A-->C(codon130 AGC-->CGC, 130 S-->R); nt670 where A-->T (codon142 ACC-->TCC, 142 T-->S); nt683 where G-->T (codon146 TGG-->TTG, 146 W-->L); nt698 where A-->T (codon151 GAG-->GTG, 151 E-->V); nt774 where G-->C (codon176 GAG-->GAC, 176 E-->D); nt776 where C-->A (codon177 ACG-->AAG, 177 T-->K); nt781 where C-->G (codon179 CAG-->GAG, 179Q-->E); nt789 where G-->T (codon181 GCG-->GCT) resulting no coding change; nt1438 where C-->T (codon206 GGC-->GGT) resulting no coding change; nt1449 where G-->C (codon210 GGG-->GCG, 210G-->A). In IMGT/HLA database, only three alleles (B*270502/2706/2732) have sequences of introns. The same sequence in intron 2 showed homology between the novel HLA-B*27 allele and B*2706, but their homology could not be supported in intron 3-4. Comparing the sequence of the novel B*27 allele in intron 3 and 4 with B*27 group, it showed there are three mutations at nt106 C-->G, nt179 G-->A, nt536 G-->A and one deletion at nt168 in intron 3 and one mutations at nt82 T-->C in intron 4, but the sequence of the novel B*27 allele in intron 3 and 4 was all the same to B*070201. The sequence was submitted to Gen-Bank and the accession number was DQ915176. The allele has been confirmed as an extension of B*2736 by the WHO Nomenclature committee in November 2006.
[Analysis of chloroplast rpS16 intron sequences in Lemnaceae].

PubMed

Martirosian, E V; Ryzhova, N N; Kochieva, E Z; Skriabin, K G

2009-01-01

Chloroplast rpS16 gene intron sequences were determined and characterized for twenty-five Lemnaceae accessions representing nine duckweed species. For each Lemnaceae species nucleotide substitutions and for Lemna minor, Lemna aequinoctialis, Wolffia arrhiza different indels were detected. Most of indels were found for Wolffia arrhiza and Lemna aequinoctialis. The analyses of intraspecific polymorphism resulted in identification of several gaplotypes in L. gibba and L. trisulca. Lemnaceae phylogenetic relationship based on rpS16 intron variability data has revealed significant differences between L. aequinoctialis and other Lemna species. Genetic distance values corroborated competence of Landoltia punctata separations from Spirodela into an independent generic taxon. The acceptability of rpS16 intron sequences for phylogenetic studies in Lemnaceae was shown.
cisprimertool: software to implement a comparative genomics strategy for the development of conserved intron scanning (CIS) markers.

PubMed

Jayashree, B; Jagadeesh, V T; Hoisington, D

2008-05-01

The availability of complete, annotated genomic sequence information in model organisms is a rich resource that can be extended to understudied orphan crops through comparative genomic approaches. We report here a software tool (cisprimertool) for the identification of conserved intron scanning regions using expressed sequence tag alignments to a completely sequenced model crop genome. The method used is based on earlier studies reporting the assessment of conserved intron scanning primers (called CISP) within relatively conserved exons located near exon-intron boundaries from onion, banana, sorghum and pearl millet alignments with rice. The tool is freely available to academic users at http://www.icrisat.org/gt-bt/CISPTool.htm. © 2007 ICRISAT.
Analysis of Claviceps africana and C. sorghi from India using AFLPs, EF-1alpha gene intron 4, and beta-tubulin gene intron 3.

PubMed

Tooley, Paul W; Bandyopadhyay, Ranajit; Carras, Marie M; Pazoutová, Sylvie

2006-04-01

Isolates of Claviceps causing ergot on sorghum in India were analysed by AFLP analysis, and by analysis of DNA sequences of the EF-1alpha gene intron 4 and beta-tubulin gene intron 3 region. Of 89 isolates assayed from six states in India, four were determined to be C. sorghi, and the rest C. africana. A relatively low level of genetic diversity was observed within the Indian C. africana population. No evidence of genetic exchange between C. africana and C. sorghi was observed in either AFLP or DNA sequence analysis. Phylogenetic analysis was conducted using DNA sequences from 14 different Claviceps species. A multigene phylogeny based on the EF-1alpha gene intron 4, the beta-tubulin gene intron 3 region, and rDNA showed that C. sorghi grouped most closely with C. gigantea and C. africana. Although the Claviceps species we analysed were closely related, they colonize hosts that are taxonomically very distinct suggesting that there is no direct coevolution of Claviceps with its hosts.
Thermostable group II intron reverse transcriptase fusion proteins and their use in cDNA synthesis and next-generation RNA sequencing.

PubMed

Mohr, Sabine; Ghanem, Eman; Smith, Whitney; Sheeter, Dennis; Qin, Yidan; King, Olga; Polioudakis, Damon; Iyer, Vishwanath R; Hunicke-Smith, Scott; Swamy, Sajani; Kuersten, Scott; Lambowitz, Alan M

2013-07-01

Mobile group II introns encode reverse transcriptases (RTs) that function in intron mobility ("retrohoming") by a process that requires reverse transcription of a highly structured, 2-2.5-kb intron RNA with high processivity and fidelity. Although the latter properties are potentially useful for applications in cDNA synthesis and next-generation RNA sequencing (RNA-seq), group II intron RTs have been difficult to purify free of the intron RNA, and their utility as research tools has not been investigated systematically. Here, we developed general methods for the high-level expression and purification of group II intron-encoded RTs as fusion proteins with a rigidly linked, noncleavable solubility tag, and we applied them to group II intron RTs from bacterial thermophiles. We thus obtained thermostable group II intron RT fusion proteins that have higher processivity, fidelity, and thermostability than retroviral RTs, synthesize cDNAs at temperatures up to 81°C, and have significant advantages for qRT-PCR, capillary electrophoresis for RNA-structure mapping, and next-generation RNA sequencing. Further, we find that group II intron RTs differ from the retroviral enzymes in template switching with minimal base-pairing to the 3' ends of new RNA templates, making it possible to efficiently and seamlessly link adaptors containing PCR-primer binding sites to cDNA ends without an RNA ligase step. This novel template-switching activity enables facile and less biased cloning of nonpolyadenylated RNAs, such as miRNAs or protein-bound RNA fragments. Our findings demonstrate novel biochemical activities and inherent advantages of group II intron RTs for research, biotechnological, and diagnostic methods, with potentially wide applications.
Modulation of hepatocyte growth factor gene expression by estrogen in mouse ovary.

PubMed

Liu, Y; Lin, L; Zarnegar, R

1994-09-01

Hepatocyte growth factor (HGF) is expressed in a variety of tissues and cell types under normal conditions and in response to various stimuli such as tissue injury. In the present study, we demonstrate that the transcription of the HGF gene is stimulated by estrogen in mouse ovary. A single injection of 17 beta-estradiol results in a dramatic and transient elevation of the levels of mouse HGF mRNA. Sequence analysis has found that two putative estrogen responsive elements (ERE) reside at -872 in the 5'-flanking region and at +511 in the first intron, respectively, of the mouse HGF gene. To test whether these ERE elements are responsible for estrogen induction of HGF gene expression, chimeric plasmids containing variable regions of the 5'-flanking sequence of HGF gene and the coding region for chloramphenicol acetyltransferase (CAT) gene were transiently transfected into both human endometrial carcinoma RL 95-2 cells and mouse fibroblast NIH 3T3 cells to assess hormone responsiveness. Transfection results indicate that the ERE elements of the mouse HGF gene can confer estrogen action to either homologous or heterologous promoters. Nuclear protein extracts either from RL95-2 cells transfected with the estrogen receptor expression vector or from mouse liver bound in vitro to ERE elements specifically, as shown by band shift assay. Therefore, our results demonstrate that the HGF gene is transcriptionally regulated by estrogen in mouse ovary; and such regulation is mediated via a direct interaction of the estrogen receptor complex with cis-acting ERE elements identified in the mouse HGF gene.
Knockdown of Zebrafish Lumican Gene (zlum) Causes Scleral Thinning and Increased Size of Scleral Coats*

PubMed Central

Yeh, Lung-Kun; Liu, Chia-Yang; Kao, Winston W.-Y.; Huang, Chang-Jen; Hu, Fung-Rong; Chien, Chung-Liang; Wang, I-Jong

2010-01-01

The lumican gene (lum), which encodes one of the major keratan sulfate proteoglycans (KSPGs) in the vertebrate cornea and sclera, has been linked to axial myopia in humans. In this study, we chose zebrafish (Danio rerio) as an animal model to elucidate the role of lumican in the development of axial myopia. The zebrafish lumican gene (zlum) spans ∼4.6 kb of the zebrafish genome. Like human (hLUM) and mouse (mlum), zlum consists of three exons, two introns, and a TATA box-less promoter at the 5′-flanking region of the transcription initiation site. Sequence analysis of the cDNA predicts that zLum encodes 344 amino acids. zLum shares 51% amino acid sequence identity with human lumican. Similar to hLUM and mlum, zlum mRNA is expressed in the eye and many other tissues, such as brain, muscle, and liver as well. Transgenic zebrafish harboring an enhanced GFP reporter gene construct downstream of a 1.7-kb zlum 5′-flanking region displayed enhanced GFP expression in the cornea and sclera, as well as throughout the body. Down-regulation of zlum expression by antisense zlum morpholinos manifested ocular enlargement resembling axial myopia due to disruption of the collagen fibril arrangement in the sclera and resulted in scleral thinning. Administration of muscarinic receptor antagonists, e.g. atropine and pirenzepine, effectively subdued the ocular enlargement caused by morpholinos in in vivo zebrafish larvae assays. The observation suggests that zebrafish can be used as an in vivo model for screening compounds in treating myopia. PMID:20551313

Intron loss from the NADH dehydrogenase subunit 4 gene of lettuce mitochondrial DNA: evidence for homologous recombination of a cDNA intermediate.

PubMed

Geiss, K T; Abbas, G M; Makaroff, C A

1994-04-01

The mitochondrial gene coding for subunit 4 of the NADH dehydrogenase complex I (nad4) has been isolated and characterized from lettuce, Lactuca sativa. Analysis of nad4 genes in a number of plants by Southern hybridization had previously suggested that the intron content varied between species. Characterization of the lettuce gene confirms this observation. Lettuce nad4 contains two exons and one group IIA intron, whereas previously sequenced nad4 genes from turnip and wheat contain three group IIA introns. Northern analysis identified a transcript of 1600 nucleotides, which represents the mature nad4 mRNA and a primary transcript of 3200 nucleotides. Sequence analysis of lettuce and turnip nad4 cDNAs was used to confirm the intron/exon border sequences and to examine RNA editing patterns. Editing is observed at the 5' and 3' ends of the lettuce transcript, but is absent from sequences that correspond to exons two, three and the 5' end of exon four in turnip and wheat. In contrast, turnip transcripts are highly edited in this region, suggesting that homologous recombination of an edited and spliced cDNA intermediate was involved in the loss of introns two and three from an ancestral lettuce nad4 gene.
Discovery and information-theoretic characterization of transcription factor binding sites that act cooperatively.

PubMed

Clifford, Jacob; Adami, Christoph

2015-09-02

Transcription factor binding to the surface of DNA regulatory regions is one of the primary causes of regulating gene expression levels. A probabilistic approach to model protein-DNA interactions at the sequence level is through position weight matrices (PWMs) that estimate the joint probability of a DNA binding site sequence by assuming positional independence within the DNA sequence. Here we construct conditional PWMs that depend on the motif signatures in the flanking DNA sequence, by conditioning known binding site loci on the presence or absence of additional binding sites in the flanking sequence of each site's locus. Pooling known sites with similar flanking sequence patterns allows for the estimation of the conditional distribution function over the binding site sequences. We apply our model to the Dorsal transcription factor binding sites active in patterning the Dorsal-Ventral axis of Drosophila development. We find that those binding sites that cooperate with nearby Twist sites on average contain about 0.5 bits of information about the presence of Twist transcription factor binding sites in the flanking sequence. We also find that Dorsal binding site detectors conditioned on flanking sequence information make better predictions about what is a Dorsal site relative to background DNA than detection without information about flanking sequence features.
Mutation analysis of the Fanconi Anemia Gene FACC

DOE Office of Scientific and Technical Information (OSTI.GOV)

Verlander, P.C.; Lin, J.D.; Udono, M.U.

1994-04-01

Fanconi anemia (FA) is a genetically heterogeneous autosomal recessive disorder characterized by a unique hypersensitivity of cells to DNA cross-linking agents; a gene for complementation group C (FACC) has recently been cloned. The authors have amplified FACC exons with their flanking intron sequences from genomic DNA from 174 racially and ethnically diverse families in the International Fanconi Anemia Registry and have screened for mutations by using SSCP analysis. They have identified eight different variants in 32 families; three were detected in exon 1, one in exon 4, one in intron 4, two in exon 6, and one in exon 14.more » Two of the eight variants, in seven families, did not segregate with the disease allele in multiplex families, suggesting that these variants represented benign polymorphisms. Disease-associated mutations in FACC were detected in a total of 25 (14.4%) of 174 families screened. The most frequent mutations were IVS4 + 4 A [yields] T (intron 4; 12 families) and 322delG (exon 1; 9 families). Other, less common mutations include Q13X in exon 1, R185X and D195V in exon 6, and L554P in exon 14. The polymorphisms were S26F in exon 1 and G139E in exon 4. All patients in the study with 322delG, Q13X, R185X, and D195V are of northern or eastern European or southern Italian ancestry, and 18 of 19 have a mild form of the disease, while the 2 patients with L554P, both from the same family, have a severe phenotype. All 19 patients with IVS4 + 4 A [yields] T have Jewish ancestry and have a severe phenotype. 19 refs., 1 fig., 3 tabs.« less
De novo insertion of an intron into the mammalian sex determining gene, SRY

PubMed Central

O’Neill, Rachel J. Waugh; Brennan, Francine E.; Delbridge, Margaret L.; Crozier, Ross H.; Graves, Jennifer A. Marshall

1998-01-01

Two theories have been proposed to explain the evolution of introns within eukaryotic genes. The introns early theory, or “exon theory of genes,” proposes that introns are ancient and that recombination within introns provided new exon structure, and thus new genes. The introns late theory, or “insertional theory of introns,” proposes that ancient genes existed as uninterrupted exons and that introns have been introduced during the course of evolution. There is still controversy as to how intron–exon structure evolved and whether the majority of introns are ancient or novel. Although there is extensive evidence in support of the introns early theory, phylogenetic comparisons of several genes indicate recent gain and loss of introns within these genes. However, no example has been shown of a protein coding gene, intronless in its ancestral form, which has acquired an intron in a derived form. The mammalian sex determining gene, SRY, is intronless in all mammals studied to date, as is the gene from which it recently evolved. However, we report here comparisons of genomic and cDNA sequences that now provide evidence of a de novo insertion of an intron into the SRY gene of dasyurid marsupials. This recently (approximately 45 million years ago) inserted sequence is not homologous with known transposable elements. Our data demonstrate that introns may be inserted as spliced units within a developmentally crucial gene without disrupting its function. PMID:9465071
Identification and analysis of multigene families by comparison of exon fingerprints.

PubMed

Brown, N P; Whittaker, A J; Newell, W R; Rawlings, C J; Beck, S

1995-06-02

Gene families are often recognised by sequence homology using similarity searching to find relationships, however, genomic sequence data provides gene architectural information not used by conventional search methods. In particular, intron positions and phases are expected to be relatively conserved features, because mis-splicing and reading frame shifts should be selected against. A fast search technique capable of detecting possible weak sequence homologies apparent at the intron/exon level of gene organization is presented for comparing spliceosomal genes and gene fragments. FINEX compares strings of exons delimited by intron/exon boundary positions and intron phases (exon fingerprint) using a global dynamic programming algorithm with a combined intron phase identity and exon size dissimilarity score. Exon fingerprints are typically two orders of magnitude smaller than their nucleic acid sequence counterparts giving rise to fast search times: a ranked search against a library of 6755 fingerprints for a typical three exon fingerprint completes in under 30 seconds on an ordinary workstation, while a worst case largest fingerprint of 52 exons completes in just over one minute. The short "sequence" length of exon fingerprints in comparisons is compensated for by the large exon alphabet compounded of intron phase types and a wide range of exon sizes, the latter contributing the most information to alignments. FINEX performs better in some searches than conventional methods, finding matches with similar exon organization, but low sequence homology. A search using a human serum albumin finds all members of the multigene family in the FINEX database at the top of the search ranking, despite very low amino acid percentage identities between family members. The method should complement conventional sequence searching and alignment techniques, offering a means of identifying otherwise hard to detect homologies where genomic data are available.
Phylogenetics and Gene Structure Dynamics of Polygalacturonase Genes in Aspergillus and Neurospora crassa

PubMed Central

Hong, Jin-Sung; Ryu, Ki-Hyun; Kwon, Soon-Jae; Kim, Jin-Won; Kim, Kwang-Soo; Park, Kyong-Cheul

2013-01-01

Polygalacturonase (PG) gene is a typical gene family present in eukaryotes. Forty-nine PGs were mined from the genomes of Neurospora crassa and five Aspergillus species. The PGs were classified into 3 clades such as clade 1 for rhamno-PGs, clade 2 for exo-PGs and clade 3 for exo- and endo-PGs, which were further grouped into 13 sub-clades based on the polypeptide sequence similarity. In gene structure analysis, a total of 124 introns were present in 44 genes and five genes lacked introns to give an average of 2.5 introns per gene. Intron phase distribution was 64.5% for phase 0, 21.8% for phase 1, and 13.7% for phase 2, respectively. The introns varied in their sequences and their lengths ranged from 20 bp to 424 bp with an average of 65.9 bp, which is approximately half the size of introns in other fungal genes. There were 29 homologous intron blocks and 26 of those were sub-clade specific. Intron losses were counted in 18 introns in which no obvious phase preference for intron loss was observed. Eighteen introns were placed at novel positions, which is considerably higher than those of plant PGs. In an evolutionary sense both intron loss and gain must have taken place for shaping the current PGs in these fungi. Together with the small intron size, low conservation of homologous intron blocks and higher number of novel introns, PGs of fungal species seem to have recently undergone highly dynamic evolution. PMID:25288950
Deep intronic GPR143 mutation in a Japanese family with ocular albinism

PubMed Central

Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei

2015-01-01

Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease. PMID:26061757
Deep intronic GPR143 mutation in a Japanese family with ocular albinism.

PubMed

Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei

2015-06-10

Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease.
A Detailed History of Intron-rich Eukaryotic Ancestors Inferred from a Global Survey of 100 Complete Genomes

PubMed Central

Csuros, Miklos; Rogozin, Igor B.; Koonin, Eugene V.

2011-01-01

Protein-coding genes in eukaryotes are interrupted by introns, but intron densities widely differ between eukaryotic lineages. Vertebrates, some invertebrates and green plants have intron-rich genes, with 6–7 introns per kilobase of coding sequence, whereas most of the other eukaryotes have intron-poor genes. We reconstructed the history of intron gain and loss using a probabilistic Markov model (Markov Chain Monte Carlo, MCMC) on 245 orthologous genes from 99 genomes representing the three of the five supergroups of eukaryotes for which multiple genome sequences are available. Intron-rich ancestors are confidently reconstructed for each major group, with 53 to 74% of the human intron density inferred with 95% confidence for the Last Eukaryotic Common Ancestor (LECA). The results of the MCMC reconstruction are compared with the reconstructions obtained using Maximum Likelihood (ML) and Dollo parsimony methods. An excellent agreement between the MCMC and ML inferences is demonstrated whereas Dollo parsimony introduces a noticeable bias in the estimations, typically yielding lower ancestral intron densities than MCMC and ML. Evolution of eukaryotic genes was dominated by intron loss, with substantial gain only at the bases of several major branches including plants and animals. The highest intron density, 120 to 130% of the human value, is inferred for the last common ancestor of animals. The reconstruction shows that the entire line of descent from LECA to mammals was intron-rich, a state conducive to the evolution of alternative splicing. PMID:21935348
Mitochondrial Intronic Open Reading Frames in Podospora: Mobility and Consecutive Exonic Sequence Variations

PubMed Central

Sellem, C. H.; d'Aubenton-Carafa, Y.; Rossignol, M.; Belcour, L.

1996-01-01

The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optional sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably account for such sequence differences. In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group I intronic ORFs are mobile elements and that their transfer, and comcomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes. PMID:8725226
Mitochondrial intronic open reading frames in Podospora: mobility and consecutive exonic sequence variations.

PubMed

Sellem, C H; d'Aubenton-Carafa, Y; Rossignol, M; Belcour, L

1996-06-01

The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optional sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably account for such sequence differences. In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group 1 intronic ORFs are mobile elements and that their transfer, and concomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes.
Accurate, simple, and inexpensive assays to diagnose F8 gene inversion mutations in hemophilia A patients and carriers.

PubMed

Dutta, Debargh; Gunasekera, Devi; Ragni, Margaret V; Pratt, Kathleen P

2016-12-27

The most frequent mutations resulting in hemophilia A are an intron 22 or intron 1 gene inversion, which together cause ∼50% of severe hemophilia A cases. We report a simple and accurate RNA-based assay to detect these mutations in patients and heterozygous carriers. The assays do not require specialized equipment or expensive reagents; therefore, they may provide useful and economic protocols that could be standardized for central laboratory testing. RNA is purified from a blood sample, and reverse transcription nested polymerase chain reaction (RT-NPCR) reactions amplify DNA fragments with the F8 sequence spanning the exon 22 to 23 splice site (intron 22 inversion test) or the exon 1 to 2 splice site (intron 1 inversion test). These sequences will be amplified only from F8 RNA without an intron 22 or intron 1 inversion mutation, respectively. Additional RT-NPCR reactions are then carried out to amplify the inverted sequences extending from F8 exon 19 to the first in-frame stop codon within intron 22 or a chimeric transcript containing F8 exon 1 and the VBP1 gene. These latter 2 products are produced only by individuals with an intron 22 or intron 1 inversion mutation, respectively. The intron 22 inversion mutations may be further classified (eg, as type 1 or type 2, reflecting the specific homologous recombination sites) by the standard DNA-based "inverse-shifting" PCR assay if desired. Efficient Bcl I and T4 DNA ligase enzymes that cleave and ligate DNA in minutes were used, which is a substantial improvement over previous protocols that required overnight incubations. These protocols can accurately detect F8 inversion mutations via same-day testing of patient samples.
Sequencing of mitochondrial genomes of nine Aspergillus and Penicillium species identifies mobile introns and accessory genes as main sources of genome size variability.

PubMed

Joardar, Vinita; Abrams, Natalie F; Hostetler, Jessica; Paukstelis, Paul J; Pakala, Suchitra; Pakala, Suman B; Zafar, Nikhat; Abolude, Olukemi O; Payne, Gary; Andrianopoulos, Alex; Denning, David W; Nierman, William C

2012-12-12

The genera Aspergillus and Penicillium include some of the most beneficial as well as the most harmful fungal species such as the penicillin-producer Penicillium chrysogenum and the human pathogen Aspergillus fumigatus, respectively. Their mitochondrial genomic sequences may hold vital clues into the mechanisms of their evolution, population genetics, and biology, yet only a handful of these genomes have been fully sequenced and annotated. Here we report the complete sequence and annotation of the mitochondrial genomes of six Aspergillus and three Penicillium species: A. fumigatus, A. clavatus, A. oryzae, A. flavus, Neosartorya fischeri (A. fischerianus), A. terreus, P. chrysogenum, P. marneffei, and Talaromyces stipitatus (P. stipitatum). The accompanying comparative analysis of these and related publicly available mitochondrial genomes reveals wide variation in size (25-36 Kb) among these closely related fungi. The sources of genome expansion include group I introns and accessory genes encoding putative homing endonucleases, DNA and RNA polymerases (presumed to be of plasmid origin) and hypothetical proteins. The two smallest sequenced genomes (A. terreus and P. chrysogenum) do not contain introns in protein-coding genes, whereas the largest genome (T. stipitatus), contains a total of eleven introns. All of the sequenced genomes have a group I intron in the large ribosomal subunit RNA gene, suggesting that this intron is fixed in these species. Subsequent analysis of several A. fumigatus strains showed low intraspecies variation. This study also includes a phylogenetic analysis based on 14 concatenated core mitochondrial proteins. The phylogenetic tree has a different topology from published multilocus trees, highlighting the challenges still facing the Aspergillus systematics. The study expands the genomic resources available to fungal biologists by providing mitochondrial genomes with consistent annotations for future genetic, evolutionary and population studies. Despite the conservation of the core genes, the mitochondrial genomes of Aspergillus and Penicillium species examined here exhibit significant amount of interspecies variation. Most of this variation can be attributed to accessory genes and mobile introns, presumably acquired by horizontal gene transfer of mitochondrial plasmids and intron homing.
HFE gene polymorphism defined by sequence-based typing of the Brazilian population and a standardized nomenclature for HFE allele sequences.

PubMed

Campos, W N; Massaro, J D; Martinelli, A L C; Halliwell, J A; Marsh, S G E; Mendes-Junior, C T; Donadi, E A

2017-10-01

The HFE molecule controls iron uptake from gut, and defects in the molecule have been associated with iron overload, particularly in hereditary hemochromatosis. The HFE gene including both coding and boundary intronic regions were sequenced in 304 Brazilian individuals, encompassing healthy individuals and patients exhibiting hereditary or acquired iron overload. Six sites of variation were detected: (1) H63D C>G in exon 2, (2) IVS2 (+4) T>C in intron 2, (3) a C>G transversion in intron 3, (4) C282Y G>A in exon 4, (5) IVS4 (-44) T>C in intron 4, and (6) a new guanine deletion (G>del) in intron 5, which were used for haplotype inference. Nine HFE alleles were detected and six of these were officially named on the basis of the HLA Nomenclature, defined by the World Health Organization (WHO) Nomenclature Committee for Factors of the HLA System, and published via the IPD-IMGT/HLA website. Four alleles, HFE*001, *002, *003, and *004 exhibited variation within their exon sequences. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Proliferation of group II introns in the chloroplast genome of the green alga Oedocladium carolinianum (Chlorophyceae).

PubMed

Brouard, Jean-Simon; Turmel, Monique; Otis, Christian; Lemieux, Claude

2016-01-01

The chloroplast genome sustained extensive changes in architecture during the evolution of the Chlorophyceae, a morphologically and ecologically diverse class of green algae belonging to the Chlorophyta; however, the forces driving these changes are poorly understood. The five orders recognized in the Chlorophyceae form two major clades: the CS clade consisting of the Chlamydomonadales and Sphaeropleales, and the OCC clade consisting of the Oedogoniales, Chaetophorales, and Chaetopeltidales. In the OCC clade, considerable variations in chloroplast DNA (cpDNA) structure, size, gene order, and intron content have been observed. The large inverted repeat (IR), an ancestral feature characteristic of most green plants, is present in Oedogonium cardiacum (Oedogoniales) but is lacking in the examined members of the Chaetophorales and Chaetopeltidales. Remarkably, the Oedogonium 35.5-kb IR houses genes that were putatively acquired through horizontal DNA transfer. To better understand the dynamics of chloroplast genome evolution in the Oedogoniales, we analyzed the cpDNA of a second representative of this order, Oedocladium carolinianum . The Oedocladium cpDNA was sequenced and annotated. The evolutionary distances separating Oedocladium and Oedogonium cpDNAs and two other pairs of chlorophycean cpDNAs were estimated using a 61-gene data set. Phylogenetic analysis of an alignment of group IIA introns from members of the OCC clade was performed. Secondary structures and insertion sites of oedogonialean group IIA introns were analyzed. The 204,438-bp Oedocladium genome is 7.9 kb larger than the Oedogonium genome, but its repertoire of conserved genes is remarkably similar and gene order differs by only one reversal. Although the 23.7-kb IR is missing the putative foreign genes found in Oedogonium , it contains sequences coding for a putative phage or bacterial DNA primase and a hypothetical protein. Intergenic sequences are 1.5-fold longer and dispersed repeats are more abundant, but a smaller fraction of the Oedocladium genome is occupied by introns. Six additional group II introns are present, five of which lack ORFs and carry highly similar sequences to that of the ORF-less IIA intron shared with Oedogonium . Secondary structure analysis of the group IIA introns disclosed marked differences in the exon-binding sites; however, each intron showed perfect or nearly perfect base pairing interactions with its target site. Our results suggest that chloroplast genes rearrange more slowly in the Oedogoniales than in the Chaetophorales and raise questions as to what was the nature of the foreign coding sequences in the IR of the common ancestor of the Oedogoniales. They provide the first evidence for intragenomic proliferation of group IIA introns in the Viridiplantae, revealing that intron spread in the Oedocladium lineage likely occurred by retrohoming after sequence divergence of the exon-binding sites.
Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

PubMed

Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

1996-02-15

Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U).
Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

PubMed Central

Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

1996-01-01

Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U). PMID:8604302
Antisense Masking of an hnRNP A1/A2 Intronic Splicing Silencer Corrects SMN2 Splicing in Transgenic Mice

PubMed Central

Hua, Yimin; Vickers, Timothy A.; Okunola, Hazeem L.; Bennett, C. Frank; Krainer, Adrian R.

2008-01-01

survival of motor neuron 2, centromeric (SMN2) is a gene that modifies the severity of spinal muscular atrophy (SMA), a motor-neuron disease that is the leading genetic cause of infant mortality. Increasing inclusion of SMN2 exon 7, which is predominantly skipped, holds promise to treat or possibly cure SMA; one practical strategy is the disruption of splicing silencers that impair exon 7 recognition. By using an antisense oligonucleotide (ASO)-tiling method, we systematically screened the proximal intronic regions flanking exon 7 and identified two intronic splicing silencers (ISSs): one in intron 6 and a recently described one in intron 7. We analyzed the intron 7 ISS by mutagenesis, coupled with splicing assays, RNA-affinity chromatography, and protein overexpression, and found two tandem hnRNP A1/A2 motifs within the ISS that are responsible for its inhibitory character. Mutations in these two motifs, or ASOs that block them, promote very efficient exon 7 inclusion. We screened 31 ASOs in this region and selected two optimal ones to test in human SMN2 transgenic mice. Both ASOs strongly increased hSMN2 exon 7 inclusion in the liver and kidney of the transgenic animals. Our results show that the high-resolution ASO-tiling approach can identify cis-elements that modulate splicing positively or negatively. Most importantly, our results highlight the therapeutic potential of some of these ASOs in the context of SMA. PMID:18371932
Resequencing of IRS2 reveals rare variants for obesity but not fasting glucose homeostasis in Hispanic children

PubMed Central

Voruganti, V. Saroja; Cole, Shelley A.; Haack, Karin; Comuzzie, Anthony G.; Muzny, Donna M.; Wheeler, David A.; Chang, Kyle; Hawes, Alicia; Gibbs, Richard A.

2011-01-01

Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5′ and 3′ flanking regions of IRS2 (∼14.5 kb), were bidirectionally sequenced for single nucleotide polymorphism (SNP) discovery in 934 Hispanic children using 3730XL DNA Sequencers. Additionally, 15 SNPs derived from Illumina HumanOmni1-Quad BeadChips were analyzed. Measured genotype analysis tested associations between SNPs and obesity and diabetes-related traits. Bayesian quantitative trait nucleotide analysis was used to statistically infer the most likely functional polymorphisms. A total of 140 SNPs were identified with minor allele frequencies (MAF) ranging from 0.001 to 0.47. Forty-two of the 70 coding SNPs result in nonsynonymous amino acid substitutions relative to the consensus sequence; 28 SNPs were detected in the promoter, 12 in introns, 28 in the 3′-UTR, and 2 in the 5′-UTR. Two insertion/deletions (indels) were detected. Ten independent rare SNPs (MAF = 0.001–0.009) were associated with obesity-related traits (P = 0.01–0.00002). SNP 10510452_139 in the promoter region was shown to have a high posterior probability (P = 0.77–0.86) of influencing BMI, fat mass, and waist circumference in Hispanic children. SNP 10510452_139 contributed between 2 and 4% of the population variance in body weight and composition. None of the SNPs or indels were associated with diabetes-related traits or accounted for a previously identified quantitative trait locus on chromosome 13 for fasting serum glucose. Rare but not common IRS2 variants may play a role in the regulation of body weight but not an essential role in fasting glucose homeostasis in Hispanic children. PMID:21771880
HybPiper: Extracting coding sequence and introns for phylogenetics from high-throughput sequencing reads using target enrichment1

PubMed Central

Johnson, Matthew G.; Gardner, Elliot M.; Liu, Yang; Medina, Rafael; Goffinet, Bernard; Shaw, A. Jonathan; Zerega, Nyree J. C.; Wickett, Norman J.

2016-01-01

Premise of the study: Using sequence data generated via target enrichment for phylogenetics requires reassembly of high-throughput sequence reads into loci, presenting a number of bioinformatics challenges. We developed HybPiper as a user-friendly platform for assembly of gene regions, extraction of exon and intron sequences, and identification of paralogous gene copies. We test HybPiper using baits designed to target 333 phylogenetic markers and 125 genes of functional significance in Artocarpus (Moraceae). Methods and Results: HybPiper implements parallel execution of sequence assembly in three phases: read mapping, contig assembly, and target sequence extraction. The pipeline was able to recover nearly complete gene sequences for all genes in 22 species of Artocarpus. HybPiper also recovered more than 500 bp of nontargeted intron sequence in over half of the phylogenetic markers and identified paralogous gene copies in Artocarpus. Conclusions: HybPiper was designed for Linux and Mac OS X and is freely available at https://github.com/mossmatters/HybPiper. PMID:27437175

The complete plastid genome sequence of Eustrephus latifolius (Asparagaceae: Lomandroideae).

PubMed

Kim, Hyoung Tae; Kim, Jung Sung; Kim, Joo-Hwan

2016-01-01

The complete chloroplast (cp) genome sequence of Eustrephus latifolius was firstly determined in subfamily Lomandriodeae of family Asparagaceae. It was 159,736 bp and contained a large single copy region (82,403 bp) and a small single copy region (13,607 bp) which were separated by two inverted repeat regions (31,863 bp). In total, 132 genes were identified and they were consisted of 83 coding genes, 8 rRNA genes, 38 tRNA genes, 3 pseudogenes. rpl23 and clpP were pseudogenes due to sequence deletions. Among 23 genes containing introns, rps12 and ycf3 contained two introns and the rest had just one intron. The intact ycf68 was identified within an intron of trnI-GAU. The amino acid sequence was almost identical with Phoenix dactylifera in Aracales. Ycf1 of E. latifolius was completely located in IR. It was similar to cp genome structure of Lemna minor, Spirodela polyrhiza, Wolffiella lingulata, Wolffia australiana in Alismatales.
Phylogenetic inferences of Nepenthes species in Peninsular Malaysia revealed by chloroplast (trnL intron) and nuclear (ITS) DNA sequences.

PubMed

Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd

2017-01-26

The chloroplastic trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationship and to evaluate the usage of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consisting of N. ampullaria, N. mirabilis, N. gracilis and N. rafflesiana, and another containing both intermediately distributed species (N. albomarginata and N. benstonei) and four highland species (N. sanguinea, N. macfarlanei, N. ramispina and N. alba). The trnL intron and ITS sequences proved to provide phylogenetic informative characters for deriving a phylogeny of Nepenthes species in Peninsular Malaysia. To our knowledge, this is the first molecular phylogenetic study of Nepenthes species occurring along an altitudinal gradient in Peninsular Malaysia.
Sequence analyses reveal that a TPR–DP module, surrounded by recombinable flanking introns, could be at the origin of eukaryotic Hop and Hip TPR–DP domains and prokaryotic GerD proteins

PubMed Central

Papandreou, Nikolaos; Chomilier, Jacques

2008-01-01

The co-chaperone Hop [heat shock protein (HSP) organising protein] is known to bind both Hsp70 and Hsp90. Hop comprises three repeats of a tetratricopeptide repeat (TPR) domain, each consisting of three TPR motifs. The first and last TPR domains are followed by a domain containing several dipeptide (DP) repeats called the DP domain. These analyses suggest that the hop genes result from successive recombination events of an ancestral TPR–DP module. From a hydrophobic cluster analysis of homologous Hop protein sequences derived from gene families, we can postulate that shifts in the open reading frames are at the origin of the present sequences. Moreover, these shifts can be related to the presence or absence of biological function. We propose to extend the family of Hop co-chaperons into the kingdom of bacteria, as several structurally related genes have been identified by hydrophobic cluster analysis. We also provide evidence of common structural characteristics between hop and hip genes, suggesting a shared precursor of ancestral TPR–DP domains. Electronic supplementary material The online version of this article (doi:10.1007/s12192-008-0083-8) contains supplementary material, which is available to authorized users. PMID:18987995
Congenital protein losing enteropathy: an inborn error of lipid metabolism due to DGAT1 mutations.

PubMed

Stephen, Joshi; Vilboux, Thierry; Haberman, Yael; Pri-Chen, Hadass; Pode-Shakked, Ben; Mazaheri, Sina; Marek-Yagel, Dina; Barel, Ortal; Di Segni, Ayelet; Eyal, Eran; Hout-Siloni, Goni; Lahad, Avishay; Shalem, Tzippora; Rechavi, Gideon; Malicdan, May Christine V; Weiss, Batia; Gahl, William A; Anikster, Yair

2016-08-01

Protein-losing enteropathy (PLE) is a clinical disorder of protein loss from the gastrointestinal system that results in hypoproteinemia and malnutrition. This condition is associated with a wide range of gastrointestinal disorders. Recently, a unique syndrome of congenital PLE associated with biallelic mutations in the DGAT1 gene has been reported in a single family. We hypothesize that mutations in this gene are responsible for undiagnosed cases of PLE in infancy. Here we investigated three children in two families presenting with severe diarrhea, hypoalbuminemia and PLE, using clinical studies, homozygosity mapping, and exome sequencing. In one family, homozygosity mapping using SNP arrays revealed the DGAT1 gene as the best candidate gene for the proband. Sequencing of all the exons including flanking regions and promoter regions of the gene identified a novel homozygous missense variant, p.(Leu295Pro), in the highly conserved membrane-bound O-acyl transferase (MBOAT) domain of the DGAT1 protein. Expression studies verified reduced amounts of DGAT1 in patient fibroblasts. In a second family, exome sequencing identified a previously reported splice site mutation in intron 8. These cases of DGAT1 deficiency extend the molecular and phenotypic spectrum of PLE, suggesting a re-evaluation of the use of DGAT1 inhibitors for metabolic disorders including obesity and diabetes.
An RNAi-enhanced Logic Circuit for Cancer Specific Detection and Destruction

DTIC Science & Technology

2010-07-01

Bcl-2 family: mBax (Mus musculus), hBax ( Homo sapiens ), and its mutant hBax-S184A [4]. A plasmid containing the tested gene was transfected into HEK...the far-red fluorescent protein mKate to express the Gata3 mStaple. Intron- feature sequences – donor site, branch point, poly- pyrimidine tract, and...intron-exon junction. Among the donor and acceptor sequences found in literature our intron features were chosen according SplicePort [5], an
Discovering weighted patterns in intron sequences using self-adaptive harmony search and back-propagation algorithms.

PubMed

Huang, Yin-Fu; Wang, Chia-Ming; Liou, Sing-Wu

2013-01-01

A hybrid self-adaptive harmony search and back-propagation mining system was proposed to discover weighted patterns in human intron sequences. By testing the weights under a lazy nearest neighbor classifier, the numerical results revealed the significance of these weighted patterns. Comparing these weighted patterns with the popular intron consensus model, it is clear that the discovered weighted patterns make originally the ambiguous 5SS and 3SS header patterns more specific and concrete.
Discovering Weighted Patterns in Intron Sequences Using Self-Adaptive Harmony Search and Back-Propagation Algorithms

PubMed Central

Wang, Chia-Ming; Liou, Sing-Wu

2013-01-01

A hybrid self-adaptive harmony search and back-propagation mining system was proposed to discover weighted patterns in human intron sequences. By testing the weights under a lazy nearest neighbor classifier, the numerical results revealed the significance of these weighted patterns. Comparing these weighted patterns with the popular intron consensus model, it is clear that the discovered weighted patterns make originally the ambiguous 5SS and 3SS header patterns more specific and concrete. PMID:23737711
Insertion of a self-splicing intron into the mtDNA of atriploblastic animal

DOE Office of Scientific and Technical Information (OSTI.GOV)

Valles, Y.; Halanych, K.; Boore, J.L.

2006-04-14

Nephtys longosetosa is a carnivorous polychaete worm that lives in the intertidal and subtidal zones with worldwide distribution (pleijel&rouse2001). Its mitochondrial genome has the characteristics typical of most metazoans: 37 genes; circular molecule; almost no intergenic sequence; and no significant gene rearrangements when compared to other annelid mtDNAs (booremoritz19981995). Ubiquitous features as small intergenic regions and lack of introns suggested that metazoan mtDNAs are under strong selective pressures to reduce their genome size allowing for faster replication requirements (booremoritz19981995Lynch2005). Yet, in 1996 two type I introns were found in the mtDNA of the basal metazoan Metridium senile (FigureX). Breaking amore » long-standing rule (absence of introns in metazoan mtDNA), this finding was later supported by the further presence of group I introns in other cnidarians. Interestingly, only the class Anthozoa within cnidarians seems to harbor such introns. Although several hundreds of triploblastic metazoan mtDNAs have been sequenced, this study is the first evidence of mitochondrial introns in triploblastic metazoans. The cox1 gene of N. longosetosa has an intron of almost 2 kbs in length. This finding represents as well the first instance of a group II intron (anthozoans harbor group I introns) in all metazoan lineages. Opposite trends are observed within plants, fungi and protist mtDNAs, where introns (both group I and II) and other non-coding sequences are widespread. Plant, fungal and protist mtDNA structure and organization differ enormously from that of metazoan mtDNA. Both, plant and fungal mtDNA are dynamic molecules that undergo high rates of recombination, contain long intergenic spacer regions and harbor both group I and group II introns. However, as metazoans they have a conserved gene content. Protists, on the other hand have a striking variation of gene content and introns that account for the genome size variation. In contrast to this mtDNA structure and organization diversity, current genome level studies point to a monophyletic origin of the mitochondria (REFS), raising questions such as: what are the pressures at work shaping the evolution of the mitochondrial genome at 'higher' levels? What drives the absence of introns and other non-coding spacers in metazoan mtDNA? What characteristics must have an intron to be maintained in an environment where 'extra chromosomes' are usually selected against?« less
Validation of high-resolution DNA melting analysis for mutation scanning of the CDKL5 gene: identification of novel mutations.

PubMed

Raymond, Laure; Diebold, Bertrand; Leroux, Céline; Maurey, Hélène; Drouin-Garraud, Valérie; Delahaye, Andre; Dulac, Olivier; Metreau, Julia; Melikishvili, Gia; Toutain, Annick; Rivier, François; Bahi-Buisson, Nadia; Bienvenu, Thierry

2013-01-01

Mutations in the cyclin-dependent kinase-like 5 gene (CDKL5) have been predominantly described in epileptic encephalopathies of female, including infantile spasms with Rett-like features. Up to now, detection of mutations in this gene was made by laborious, expensive and/or time consuming methods. Here, we decided to validate high-resolution melting analysis (HRMA) for mutation scanning of the CDKL5 gene. Firstly, using a large DNA bank consisting to 34 samples carrying different mutations and polymorphisms, we validated our analytical conditions to analyse the different exons and flanking intronic sequences of the CDKL5 gene by HRMA. Secondly, we screened CDKL5 by both HRMA and denaturing high performance liquid chromatography (dHPLC) in a cohort of 135 patients with early-onset seizures. Our results showed that point mutations and small insertions and deletions can be reliably detected by HRMA. Compared to dHPLC, HRMA profiles are more discriminated, thereby decreasing unnecessary sequencing. In this study, we identified eleven novel sequence variations including four pathogenic mutations (2.96% prevalence). HRMA appears cost-effective, easy to set up, highly sensitive, non-toxic and rapid for mutation screening, ideally suited for large genes with heterogeneous mutations located along the whole coding sequence, such as the CDKL5 gene. Copyright © 2012 Elsevier B.V. All rights reserved.
Genomic organization of the neurofibromatosis 1 gene (NF1)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Y.; O`Connell, P.; Huntsman Breidenbach, H.

Neurofibromatosis 1 maps to chromosome band 17q11.2, and the NF1 locus has been partially characterized. Even though the full-length NF1 cDNA has been sequenced, the complete genomic structure of the NF1 gene has not been elucidated. The 5{prime} end of NF1 is embedded in a CpG island containing a NotI restriction site, and the remainder of the gene lies in the adjacent 350-kb NotI fragment. In our efforts to develop a comprehensive screen for NF1 mutations, we have isolated genomic DNA clones that together harbor the entire NF1 cDNA sequence. We have identified all intron-exon boundaries of the coding regionmore » and established that it is composed of 59 exons. Furthermore, we have defined the 3{prime}-untranslated region (3{prime}-UTR) of the NF1 gene; it spans approximately 3.5 kb of genomic DNA sequence and is continuous with the stop codon. Oligonucleotide primer pairs synthesized from exon-flanking DNA sequences were used in the polymerase chain reaction with cloned, chromosome 17-specific genomic DNA as template to amplify NF1 exons 1 through 27b and the exon containing the 3{prime}-UTR separately. This information should be useful for implementing a comprehensive NF1 mutation screen using genomic DNA as template. 41 refs., 3 figs., 2 tabs.« less
Compound haplotypes at Xp11.23 and human population growth in Eurasia.

PubMed

Alonso, S; Armour, J A L

2004-09-01

To investigate patterns of diversity and the evolutionary history of Eurasians, we have sequenced a 2.8 kb region at Xp11.23 in a sample of African and Eurasian chromosomes. This region is in a long intron of CLCN5 and is immediately flanked by a highly variable minisatellite, DXS255, and a human-specific Ta0 LINE. Compared to Africans, Eurasians showed a marked reduction in sequence diversity. The main Euro-Asiatic haplotype seems to be the ancestral haplotype for the whole sample. Coalescent simulations, including recombination and exponential growth, indicate a median length of strong linkage disequilibrium, up to approximately 9 kb for this area. The Ka/Ks ratio between the coding sequence of human CLCN5 and its mouse orthologue is much less than 1. This implies that the region sequenced is unlikely to be under the strong influence of positive selective processes on CLCN5, mutations in which have been associated with disorders such as Dent's disease. In contrast, a scenario based on a population bottleneck and exponential growth seems a more likely explanation for the reduced diversity observed in Eurasians. Coalescent analysis and linked minisatellite diversity (which reaches a gene diversity value greater than 98% in Eurasians) suggest an estimated age of origin of the Euro-Asiatic diversity compatible with a recent out-of-Africa model for colonization of Eurasia by modern Homo sapiens.
Gene structure and functional characterization of growth hormone in dogfish, Squalus acanthias.

PubMed

Moriyama, Shunsuke; Oda, Mayumi; Yamazaki, Tomohide; Yamaguchi, Kiyoko; Amiya, Noriko; Takahashi, Akiyoshi; Amano, Masafumi; Goto, Tomoaki; Nozaki, Masumi; Meguro, Hiroshi; Kawauchi, Hiroshi

2008-06-01

Dogfish (Squalus acanthias) growth hormone (GH) was identified by cDNA cloning and protein purification from the pituitary gland. Dogfish GH cDNA encoded a prehormone of 210 amino acids (aa). Sequence analysis of purified GH revealed that the prehormone is composed of a signal peptide of 27 aa and a mature protein of 183 aa. Dogfish GH showed 94% sequence identity with blue shark GH, and also showed 37-66%, 26%, and 48-67% sequence identity with GH from osteichtyes, an agnathan, and tetrapods. The site of production was identified through immunocytochemistry to be cells of the proximal pars distalis of the pituitary gland. Dogfish GH stimulates both insulin-like growth factor-I and II mRNA levels in dogfish liver in vitro. The dogfish GH gene consisted of five exons and four introns, the same as in lamprey, teleosts such as cypriniforms and siluriforms, and tetrapods. The 5'-flanking region within 1082 bp of the transcription start site contained consensus sequences for the TATA box, Pit-1/GHF-1, CRE, TRE, and ERE. These results show that the endocrine mechanism for growth stimulation by the GH-IGF axis was established at an early stage of vertebrate evolution, and that the 5-exon-type gene organization might reflect the structure of the ancestral gene for the GH gene family.
Transfection and heat-inducible expression of molluscan promoter-luciferase reporter gene constructs in the Biomphalaria glabrata embryonic snail cell line.

PubMed

Yoshino, T P; Wu, X J; Liu, H D

1998-09-01

Studies were initiated to begin developing a genetic transformation system for cells derived from the freshwater gastropod, Biomphalaria glabrata, an intermediate host of the human blood fluke Schistosoma mansoni. Using a 70-kD heat-shock protein (HSP70) cDNA probe obtained from the B. glabrata embryonic (Bge) cell line, we cloned from Bge cells a complete HSP70 gene including a 1-kb genomic DNA fragment in its 5'-flanking region containing sequences indicative of a HSP promoter. Identified in the 5'-half (416 nucleotides) of this genomic fragment were TATA and CAAT boxes, two putative transcription initiation sites, and a series of palindromic DNA repeats with shared homology to the heat-shock element consensus sequence (Bge HSP70(0.5k) promoter). The 3'-half of this upstream flanking region was comprised of a 508-base intron located immediately 5' of the ATG start codon. To determine the functionality of the putative snail promoter sequence, Bge HSP promoter/luciferase (Luc) reporter gene constructs were introduced into Bge cells by N-(1-(2,3-dioleoyloxy) propyl)-N,N,N-trimethylammonium methylsulfate (DOTAP)-mediated transfection methods, and assayed for Luc activity 48 hr following a 1.5-hr heat-shock treatment (40 degrees C). Compared with control vectors or the Bge HSP70(0.5k/1.0k) promoter constructs at 26 degrees C, a 10- to 300-fold increase in Luc expression was obtained only in the Bge HSP70 promoter/Luc-transfected cells following heat-shock. Results of transfection experiments demonstrate that the Bge HSP70(0.5k) DNA segment contains appropriate promoter sequences for driving temperature-inducible gene expression in the Bge snail cell line. This report represents the first isolation and functional characterization of an inducible promoter from a freshwater gastropod mollusc. Successful transient expression of a foreign reporter gene in Bge cells using a homologous, inducible promoter sequence now paves the way for development of methods for stable integration and expression of snail genes of interest into the Bge cell line.
[Identification and phylogenetic application of unique nucleotide sequence of nad7 intron2 in Rhodiola (Crassulaceae) species].

PubMed

Deng, Ke-Jun; Yang, Zu-Jun; Liu, Cheng; Zhao, Wei; Liu, Chang; Feng, Juan; Ren, Zheng-Long

2007-03-01

Genetic characterization of 9 populations of Rhodiola crenulata, R. fastigiata and R. sachalinensis (Crassulaceae) species from Sichuan and Jilin Provinces of China, was investigated using the conserved primer of nad7 intron 2. All PCR products about 800 bp long were shorter than other Crassulaceae plants, which were used as molecular markers to identify the Rhodiola species. The sequence of the products indicated that total exon of 53 bp and intron of 738 bp exhibit only 9 nucleotide variations. Blasting the nad7 sequences to GenBank and the phylogenetic analysis showed that the sequence of Rhodiola species was clusted independently, and the length was smaller than all the registered sequences of higher plants. The result suggests that the Rhiodola species had a unique sequence in this gene region, which might be related to the special growth condition.
Mitochondrial intronic open reading frames in Podospora: Mobility and consecutive exonic sequence variations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sellem, C.H.; Rossignol, M.; Belcour, L.

1996-06-01

The mitochondrial genome of 23 wild-type strains belonging to three different species of the filamentous fungus Podospora was examined. Among the 15 optical sequences identified are two intronic reading frames, nad1-i4-orf1 and cox1-i7-orf2. We show that the presence of these sequences was strictly correlated with tightly clustered nucleotide substitutions in the adjacent exon. This correlation applies to the presence or absence of closely related open reading frames (ORFs), found at the same genetic locations, in all the Pyrenomycete genera examined. The recent gain of these optional ORFs in the evolution of the genus Podospora probably account for such sequence differences.more » In the homoplasmic progeny from heteroplasmons constructed between Podospora strains differing by the presence of these optional ORFs, nad1-i4-orf1 and cox1-i7-orf2 appeared highly invasive. Sequence comparisons in the nad1-i4 intron of various strains of the Pyrenomycete family led us to propose a scenario of its evolution that includes several events of loss and gain of intronic ORFs. These results strongly reinforce the idea that group I intronic ORFs are mobile elements and that their transfer, and comcomitant modification of the adjacent exon, could participate in the modular evolution of mitochondrial genomes. 46 refs., 5 figs., 2 tabs.« less
Phylogenetic Distribution of Intron Positions in Alpha-Amylase Genes of Bilateria Suggests Numerous Gains and Losses

PubMed Central

Da Lage, Jean-Luc; Maczkowiak, Frédérique; Cariou, Marie-Louise

2011-01-01

Most eukaryotes have at least some genes interrupted by introns. While it is well accepted that introns were already present at moderate density in the last eukaryote common ancestor, the conspicuous diversity of intron density among genomes suggests a complex evolutionary history, with marked differences between phyla. The question of the rates of intron gains and loss in the course of evolution and factors influencing them remains controversial. We have investigated a single gene family, alpha-amylase, in 55 species covering a variety of animal phyla. Comparison of intron positions across phyla suggests a complex history, with a likely ancestral intronless gene undergoing frequent intron loss and gain, leading to extant intron/exon structures that are highly variable, even among species from the same phylum. Because introns are known to play no regulatory role in this gene and there is no alternative splicing, the structural differences may be interpreted more easily: intron positions, sizes, losses or gains may be more likely related to factors linked to splicing mechanisms and requirements, and to recognition of introns and exons, or to more extrinsic factors, such as life cycle and population size. We have shown that intron losses outnumbered gains in recent periods, but that “resets” of intron positions occurred at the origin of several phyla, including vertebrates. Rates of gain and loss appear to be positively correlated. No phase preference was found. We also found evidence for parallel gains and for intron sliding. Presence of introns at given positions was correlated to a strong protosplice consensus sequence AG/G, which was much weaker in the absence of intron. In contrast, recent intron insertions were not associated with a specific sequence. In animal Amy genes, population size and generation time seem to have played only minor roles in shaping gene structures. PMID:21611157
Permanent Neonatal Diabetes Caused by Creation of an Ectopic Splice Site within the INS Gene

PubMed Central

Gastaldo, Elena; Harries, Lorna W.; Rubio-Cabezas, Oscar; Castaño, Luis

2012-01-01

Background The aim of this study was to characterize the genetic etiology in a patient who presented with permanent neonatal diabetes at 2 months of age. Methodology/Principal Findings Regulatory elements and coding exons 2 and 3 of the INS gene were amplified and sequenced from genomic and complementary DNA samples. A novel heterozygous INS mutation within the terminal intron of the gene was identified in the proband and her affected father. This mutation introduces an ectopic splice site leading to the insertion of 29 nucleotides from the intronic sequence into the mature mRNA, which results in a longer and abnormal transcript. Conclusions/Significance This study highlights the importance of routinely sequencing the exon-intron boundaries and the need to carry out additional studies to confirm the pathogenicity of any identified intronic genetic variants. PMID:22235272
Pea chloroplast tRNA(Lys) (UUU) gene: transcription and analysis of an intron-containing gene.

PubMed

Boyer, S K; Mullet, J E

1988-07-01

The pea chloroplast trnK gene which encodes tRNA(Lys) (UUU) was sequenced. TrnK is located 210 bp upstream from the promoter of psbA and immediately downstream from the 3'-end of rbcL. The gene is transcribed from the same DNA strand as psbA and rbcL. A 2447 bp intron with class II features is located in the trnK anticodon loop. The intron contains a 506 amino acid open reading frame which could encode an RNA maturase. The primary transcript of trnK is 2.9 kb long; its 5'-end was identified as a site of transcription initiation by in vitro transcription experiments. The 5'-terminus is adjacent to DNA sequences previously identified as transcription promoter elements. The most abundant trnK transcript is 2.5 kb long with termini corresponding to the 5' and 3' ends of the trnK exons. Intron specific RNAs were not detected. This suggests that RNA processing which produces tRNA(Lys) leads to rapid degradation of intron sequences.
Fractal landscape analysis of DNA walks

NASA Technical Reports Server (NTRS)

Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Sciortino, F.; Simons, M.; Stanley, H. E.

1992-01-01

By mapping nucleotide sequences onto a "DNA walk", we uncovered remarkably long-range power law correlations [Nature 356 (1992) 168] that imply a new scale invariant property of DNA. We found such long-range correlations in intron-containing genes and in non-transcribed regulatory DNA sequences, but not in cDNA sequences or intron-less genes. In this paper, we present more explicit evidences to support our findings.
[Applylication of new type combined fragments: nrDNA ITS+ nad 1-intron 2 for identification of Dendrobium species of Fengdous].

PubMed

Geng, Li-xia; Zheng, Rui; Ren, Jie; Niu, Zhi-tao; Sun, Yu-long; Xue, Qing-yun; Liu, Wei; Ding, Xiao-yu

2015-08-01

In this study, 17 kinds of Dendrobium species of Fengdous including 39 individuals were collected from 4 provinces. Mitochondrial gene sequences co I, nad 5, nad 1-intron 2 and chloroplast gene sequences rbcL, matK amd psbA-trnH were amplified from these materials, as well as nrDNA ITS. Furthermore, suitable sequences for identification of Dendrobium species of Fengdous were screened by K-2-P and P-distance. The results showed that during the mentioned 7 sequences, nrDNA ITS, nad 1-intron 2 and psbA-trnH which had a high degree of variability could be used to identify Dendrobium species of Fengdous. However, single fragment could not be used to distinguish D. moniliforme and D. huoshanense. Moreover, compared to other combined fragments, new type combined fragments nrDNA ITS+nad 1-intron 2 was more effective in identifying the original plants of Dendrobium species and could be used to identify D. huoshanense and D. moniliforme. Besides, according to the UPGMA tree constructed with nrDNA ITS+nad 1-intron 2, 3 inspected Dendrobium plants were identified as D. huoshanense, D. moniliforme and D. officinale, respectively. This study identified Dendrobium species of Fengdous by combined fragments nrDNA ITS+nad 1-intron 2 for the first time, which provided a more effective basis for identification of Dendrobium species. And this study will be helpful for regulating the market of Fengdous.

Novel methodologies for spectral classification of exon and intron sequences

NASA Astrophysics Data System (ADS)

Kwan, Hon Keung; Kwan, Benjamin Y. M.; Kwan, Jennifer Y. Y.

2012-12-01

Digital processing of a nucleotide sequence requires it to be mapped to a numerical sequence in which the choice of nucleotide to numeric mapping affects how well its biological properties can be preserved and reflected from nucleotide domain to numerical domain. Digital spectral analysis of nucleotide sequences unfolds a period-3 power spectral value which is more prominent in an exon sequence as compared to that of an intron sequence. The success of a period-3 based exon and intron classification depends on the choice of a threshold value. The main purposes of this article are to introduce novel codes for 1-sequence numerical representations for spectral analysis and compare them to existing codes to determine appropriate representation, and to introduce novel thresholding methods for more accurate period-3 based exon and intron classification of an unknown sequence. The main findings of this study are summarized as follows: Among sixteen 1-sequence numerical representations, the K-Quaternary Code I offers an attractive performance. A windowed 1-sequence numerical representation (with window length of 9, 15, and 24 bases) offers a possible speed gain over non-windowed 4-sequence Voss representation which increases as sequence length increases. A winner threshold value (chosen from the best among two defined threshold values and one other threshold value) offers a top precision for classifying an unknown sequence of specified fixed lengths. An interpolated winner threshold value applicable to an unknown and arbitrary length sequence can be estimated from the winner threshold values of fixed length sequences with a comparable performance. In general, precision increases as sequence length increases. The study contributes an effective spectral analysis of nucleotide sequences to better reveal embedded properties, and has potential applications in improved genome annotation.
Intronic splicing mutations in PTCH1 cause Gorlin syndrome.

PubMed

Bholah, Zaynab; Smith, Miriam J; Byers, Helen J; Miles, Emma K; Evans, D Gareth; Newman, William G

2014-09-01

Gorlin syndrome is an autosomal dominant disorder characterized by multiple early-onset basal cell carcinoma, odontogenic keratocysts and skeletal abnormalities. It is caused by heterozygous mutations in the tumour suppressor PTCH1. Routine clinical genetic testing, by Sanger sequencing and multiplex ligation-dependent probe amplification (MLPA) to confirm a clinical diagnosis of Gorlin syndrome, identifies a mutation in 60-90 % of cases. We undertook RNA analysis on lymphocytes from ten individuals diagnosed with Gorlin syndrome, but without known PTCH1 mutations by exonic sequencing or MLPA. Two altered PTCH1 transcripts were identified. Genomic DNA sequence analysis identified an intron 7 mutation c.1068-10T>A, which created a strong cryptic splice acceptor site, leading to an intronic insertion of eight bases; this is predicted to create a frameshift p.(His358Alafs*12). Secondly, a deep intronic mutation c.2561-2057A>G caused an inframe insertion of 78 intronic bases in the cDNA transcript, leading to a premature stop codon p.(Gly854fs*3). The mutations are predicted to cause loss of function of PTCH1, consistent with its tumour suppressor function. The findings indicate the importance of RNA analysis to detect intronic mutations in PTCH1 not identified by routine screening techniques.
Evolutionary and biogeographical implications of degraded LAGLIDADG endonuclease functionality and group I intron occurrence in stony corals (Scleractinia) and mushroom corals (Corallimorpharia).

PubMed

Celis, Juan Sebastián; Edgell, David R; Stelbrink, Björn; Wibberg, Daniel; Hauffe, Torsten; Blom, Jochen; Kalinowski, Jörn; Wilke, Thomas

2017-01-01

Group I introns and homing endonuclease genes (HEGs) are mobile genetic elements, capable of invading target sequences in intron-less genomes. LAGLIDADG HEGs are the largest family of endonucleases, playing a key role in the mobility of group I introns in a process known as 'homing'. Group I introns and HEGs are rare in metazoans, and can be mainly found inserted in the COXI gene of some sponges and cnidarians, including stony corals (Scleractinia) and mushroom corals (Corallimorpharia). Vertical and horizontal intron transfer mechanisms have been proposed as explanations for intron occurrence in cnidarians. However, the central role of LAGLIDADG motifs in intron mobility mechanisms remains poorly understood. To resolve questions regarding the evolutionary origin and distribution of group I introns and HEGs in Scleractinia and Corallimorpharia, we examined intron/HEGs sequences within a comprehensive phylogenetic framework. Analyses of LAGLIDADG motif conservation showed a high degree of degradation in complex Scleractinia and Corallimorpharia. Moreover, the two motifs lack the respective acidic residues necessary for metal-ion binding and catalysis, potentially impairing horizontal intron mobility. In contrast, both motifs are highly conserved within robust Scleractinia, indicating a fully functional endonuclease capable of promoting horizontal intron transference. A higher rate of non-synonymous substitutions (Ka) detected in the HEGs of complex Scleractinia and Corallimorpharia suggests degradation of the HEG, whereas lower Ka rates in robust Scleractinia are consistent with a scenario of purifying selection. Molecular-clock analyses and ancestral inference of intron type indicated an earlier intron insertion in complex Scleractinia and Corallimorpharia in comparison to robust Scleractinia. These findings suggest that the lack of horizontal intron transfers in the former two groups is related to an age-dependent degradation of the endonuclease activity. Moreover, they also explain the peculiar geographical patterns of introns in stony and mushroom corals.
Rare HFE variants are the most frequent cause of hemochromatosis in non-c282y homozygous patients with hemochromatosis.

PubMed

Hamdi-Rozé, Houda; Beaumont-Epinette, Marie-Pascale; Ben Ali, Zeineb; Le Lan, Caroline; Loustaud-Ratti, Véronique; Causse, Xavier; Loreal, Olivier; Deugnier, Yves; Brissot, Pierre; Jouanolle, Anne-Marie; Bardou-Jacquet, Edouard

2016-12-01

p.Cys282Tyr (C282Y) homozygosity explains most cases of HFE-related hemochromatosis, but a significant number of patients presenting with typical type I hemochromatosis phenotype remain unexplained. We sought to describe the clinical relevance of rare HFE variants in non-C282Y homozygotes. Patients referred for hemochromatosis to the National Reference Centre for Rare Iron Overload Diseases from 2004 to 2010 were studied. Sequencing was performed for coding region and intronic flanking sequences of HFE, HAMP, HFE2, TFR2, and SLC40A1. Nine private HFE variants were identified in 13 of 206 unrelated patients. Among those, five have not been previously described: p.Leu270Argfs*4, p.Ala271Valfs*25, p.Tyr52*, p.Lys166Asn, and p.Asp141Tyr. Our results show that rare HFE variants are identified more frequently than variants in the other genes associated with iron overload. Rare HFE variants are therefore the most frequent cause of hemochromatosis in non-C282Y homozygote HFE patients. Am. J. Hematol. 91:1202-1205, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Structural organization and chromosomal assignment of the mouse embryonic TEA domain-containing factor (ETF) gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Suzuki, Kazuo; Yasunami, Michio; Matsuda, Yoichi

1996-09-01

Embryonic TEA domain-containing factor (ETF) belongs to the family of proteins structurally related to transcriptional enhancer factor-1 (TEF-1) and is implicated in neural development. Isolation and characterization of the cosmid clones encoding the mouse ETF gene (Etdf) revealed that Etdf spans approximately 17.9 kb and consists of 12 exons. The exon-intron structure of Etdf closely resembles that of the Drosophila scalloped gene, indicating that these genes may have evolved from a common ancestor. Then multiple transcription initiation sites revealed by S1 protection and primer extension analyses are consistent with the absence of the canonical TATA and CAAT boxes in themore » 5{prime}-flanking region, which contains many potential regulatory sequences, such as the E-box, N-box, Sp1 element, GATA-1 element, TAATGARAT element, and B2 short interspersed element (SINE) as well as several direct and inverted repeat sequences. The Etdf locus was assigned to the proximal region of mouse chromosome 7 using fluorescence in situ hybridization and linkage mapping analyses. These results provide the molecular basis for studying the regulation, in vivo function, and evolution of Etdf. 29 refs., 5 figs., 1 tab.« less
Structural organization and chromosomal assignment of the mouse embryonic TEA domain-containing factor (ETF) gene.

PubMed

Suzuki, K; Yasunami, M; Matsuda, Y; Maeda, T; Kobayashi, H; Terasaki, H; Ohkubo, H

1996-09-01

Embryonic TEA domain-containing factor (ETF) belongs to the family of proteins structurally related to transcriptional enhancer factor-1 (TEF-1) and is implicated in neural development. Isolation and characterization of the cosmid clones encoding the mouse ETF gene (Etdf) revealed that Etdf spans approximately 17.9 kb and consists of 12 exons. The exon-intron structure of Etdf closely resembles that of the Drosophila scalloped gene, indicating that these genes may have evolved from a common ancestor. The multiple transcription initiation sites revealed by S1 protection and primer extension analyses are consistent with the absence of the canonical TATA and CAAT boxes in the 5'-flanking region, which contains many potential regulatory sequences, such as the E-box, N-box, Sp1 element, GATA-1 element, TAATGARAT element, and B2 short interspersed element (SINE) as well as several direct and inverted repeat sequences. The Etdf locus was assigned to the proximal region of mouse chromosome 7 using fluorescence in situ hybridization and linkage mapping analyses. These results provide the molecular basis for studying the regulation, in vivo function, and evolution of Etdf.
Saccharomyces cerevisiae ribosomal protein L37 is encoded by duplicate genes that are differentially expressed.

PubMed

Tornow, J; Santangelo, G M

1994-06-01

A duplicate copy of the RPL37A gene (encoding ribosomal protein L37) was cloned and sequenced. The coding region of RPL37B is very similar to that of RPL37A, with only one conservative amino-acid difference. However, the intron and flanking sequences of the two genes are extremely dissimilar. Disruption experiments indicate that the two loci are not functionally equivalent: disruption of RPL37B was insignificant, but disruption of RPL37A severely impaired the growth rate of the cell. When both RPL37 loci are disrupted, the cell is unable to grow at all, indicating that rpL37 is an essential protein. The functional disparity between the two RPL37 loci could be explained by differential gene expression. The results of two experiments support this idea: gene fusion of RPL37A to a reporter gene resulted in six-fold higher mRNA levels than was generated by the same reporter gene fused to RPL37B, and a modest increase in gene dosage of RPL37B overcame the lack of a functional RPL37A gene.
Exome capture from the spruce and pine giga-genomes.

PubMed

Suren, H; Hodgins, K A; Yeaman, S; Nurkowski, K A; Smets, P; Rieseberg, L H; Aitken, S N; Holliday, J A

2016-09-01

Sequence capture is a flexible tool for generating reduced representation libraries, particularly in species with massive genomes. We used an exome capture approach to sequence the gene space of two of the dominant species in Canadian boreal and montane forests - interior spruce (Picea glauca x engelmanii) and lodgepole pine (Pinus contorta). Transcriptome data generated with RNA-seq were coupled with draft genome sequences to design baits corresponding to 26 824 genes from pine and 28 649 genes from spruce. A total of 579 samples for spruce and 631 samples for pine were included, as well as two pine congeners and six spruce congeners. More than 50% of targeted regions were sequenced at >10× depth in each species, while ~12% captured near-target regions within 500 bp of a bait position were sequenced to a depth >10×. Much of our read data arose from off-target regions, which was likely due to the fragmented and incomplete nature of the draft genome assemblies. Capture in general was successful for the related species, suggesting that baits designed for a single species are likely to successfully capture sequences from congeners. From these data, we called approximately 10 million SNPs and INDELs in each species from coding regions, introns, untranslated and flanking regions, as well as from the intergenic space. Our study demonstrates the utility of sequence capture for resequencing in complex conifer genomes, suggests guidelines for improving capture efficiency and provides a rich resource of genetic variants for studies of selection and local adaptation in these species. © 2016 John Wiley & Sons Ltd.
Activity Enhancement of G-Quadruplex/Hemin DNAzyme by Flanking d(CCC).

PubMed

Chang, Tianjun; Gong, Hongmei; Ding, Pi; Liu, Xiangjun; Li, Weiguo; Bing, Tao; Cao, Zehui; Shangguan, Dihua

2016-03-14

G-quadruplex (G4)/hemin DNAzymes have been extensively applied in bioanalysis and molecular devices. However, their catalytic activity is still much lower than that of proteinous enzymes. The G4/hemin DNAzyme activity is correlated with the G4 conformations and the solution conditions. However, little is known about the effect of the flanking sequences on the activity, though they are important parts of G4s. Here, we report sequences containing d(CCC), flanked on both ends of the G4-core sequences remarkably enhance their DNAzyme activity. By using circular dichroism and UV-visible spectroscopy, the d(CCC) flanking sequences were demonstrated to improve the hemin binding affinity to G4s instead of increasing the parallel G4 formation, which might explain the enhanced DNAzyme activity. Meanwhile, the increased hemin binding ability promoted the degradation of hemin within the DNAzyme by H2O2. Furthermore, the DNAzyme with d(CCC) flanking sequences showed strong tolerance to pH value changes, which makes it more suitable for applications requiring wide pH conditions. The results highlight the influence of the flanking sequences on the DNAzyme activity and provide insightful information for the design of highly active DNAzymes. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Origin and evolution of spliceosomal introns

PubMed Central

2012-01-01

Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section. PMID:22507701
Intermediate introns in nuclear genes of euglenids - are they a distinct type?

PubMed

Milanowski, Rafał; Gumińska, Natalia; Karnkowska, Anna; Ishikawa, Takao; Zakryś, Bożena

2016-02-29

Nuclear genes of euglenids contain two major types of introns: conventional spliceosomal and nonconventional introns. The latter are characterized by variable non-canonical borders, RNA secondary structure that brings intron ends together, and an unknown mechanism of removal. Some researchers also distinguish intermediate introns, which combine features of both types. They form a stable RNA secondary structure and are classified into two subtypes depending on whether they contain one (intermediate/nonconventional subtype) or both (conventional/intermediate subtype) canonical spliceosomal borders. However, it has been also postulated that most introns classified as intermediate could simply be special cases of conventional or nonconventional introns. Sequences of tubB, hsp90 and gapC genes from six strains of Euglena agilis were obtained. They contain four, six, and two or three introns, respectively (the third intron in the gapC gene is unique for just one strain). Conventional introns were present at three positions: two in the tubB gene (at one position conventional/intermediate introns were also found) and one in the gapC gene. Nonconventional introns are present at ten positions: two in the tubB gene (at one position intermediate/nonconventional introns were also found), six in hsp90 (at four positions intermediate/nonconventional introns were also found), and two in the gapC gene. Sequence and RNA secondary structure analyses of nonconventional introns confirmed that their most strongly conserved elements are base pairing nucleotides at positions +4, +5 and +6/ -8, -7 and -6 (in most introns CAG/CTG nucleotides were observed). It was also confirmed that the presence of the 5' GT/C end in intermediate/nonconventional introns is not the result of kinship with conventional introns, but is due to evolutionary pressure to preserve the purine at the 5' end. However, an example of a nonconventional intron with GC-AG ends was shown, suggesting the possibility of intron type conversion between nonconventional and conventional. Furthermore, an analysis of conventional introns revealed that the ability to form a stable RNA secondary structure by some introns is probably not a result of their relationship with nonconventional introns. It was also shown that acquisition of new nonconventional introns is an ongoing process and can be observed at the level of a single species. In the recently acquired intron in the gapC gene an extended direct repeats at the intron-exon junctions are present, suggesting that double-strand break repair process could be the source of new nonconventional introns.
Functional comparison of three transformer gene introns regulating conditional female lethality

USDA-ARS?s Scientific Manuscript database

The trasformer gene plays a critical role in the sex determination pathways of many insects. We cloned two transformer gene introns from Anastrepha suspensa, the Caribbean fruit fly. These introns have sequences that putatively have a role in sex-specific splicing patterns that affect sex determinat...
The Reverse Transcriptase/RNA Maturase Protein MatR Is Required for the Splicing of Various Group II Introns in Brassicaceae Mitochondria

PubMed Central

Sultan, Laure D.; Grewe, Felix; Rolle, Katarzyna; Abudraham, Sivan; Shevtsov, Sofia; Klipcan, Liron; Barciszewski, Jan; Dietrich, André

2016-01-01

Group II introns are large catalytic RNAs that are ancestrally related to nuclear spliceosomal introns. Sequences corresponding to group II RNAs are found in many prokaryotes and are particularly prevalent within plants organellar genomes. Proteins encoded within the introns themselves (maturases) facilitate the splicing of their own host pre-RNAs. Mitochondrial introns in plants have diverged considerably in sequence and have lost their maturases. In angiosperms, only a single maturase has been retained in the mitochondrial DNA: the matR gene found within NADH dehydrogenase 1 (nad1) intron 4. Its conservation across land plants and RNA editing events, which restore conserved amino acids, indicates that matR encodes a functional protein. However, the biological role of MatR remains unclear. Here, we performed an in vivo investigation of the roles of MatR in Brassicaceae. Directed knockdown of matR expression via synthetically designed ribozymes altered the processing of various introns, including nad1 i4. Pull-down experiments further indicated that MatR is associated with nad1 i4 and several other intron-containing pre-mRNAs. MatR may thus represent an intermediate link in the gradual evolutionary transition from the intron-specific maturases in bacteria into their versatile spliceosomal descendants in the nucleus. The similarity between maturases and the core spliceosomal Prp8 protein further supports this intriguing theory. PMID:27760804
Regulatory elements of the floral homeotic gene AGAMOUS identified by phylogenetic footprinting and shadowing.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.

2003-06-01

OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally importantmore » for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.« less
Pre-Mrna Introns as a Model for Cryptographic Algorithm:. Theory and Experiments

NASA Astrophysics Data System (ADS)

Regoli, Massimo

2010-01-01

The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. In particular the RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
Finding FMR1 mosaicism in Fragile X syndrome

PubMed Central

Gonçalves, Thaís Fernandez; dos Santos, Jussara Mendonça; Gonçalves, Andressa Pereira; Tassone, Flora; Mendoza-Morales, Guadalupe; Ribeiro, Márcia Gonçalves; Kahn, Evelyn; Boy, Raquel; Pimentel, Márcia Mattos Gonçalves; Santos-Rebouças, Cíntia Barros

2016-01-01

OBJETIVE Almost all patients with Fragile X Syndrome (FXS) exhibit a CGG repeat expansion (full mutation) in the Fragile Mental Retardation 1 gene (FMR1). Here, we report five unrelated males with FXS harboring a somatic full mutation/deletion mosaicism. METHODS Mutational profiles were only elucidated by using a combination of molecular approaches (CGG-based PCR, Sanger sequencing, MS-MLPA, Southern blot and mPCR). RESULT Four patients exhibited small deletions encompassing the CGG repeats tract and flanking regions, whereas the remaining had a larger deletion comprising at least exon 1 and part of intron 1 of FMR1 gene. The presence of a 2–3 base pairs microhomology in proximal and distal non-recurrent breakpoints without scars supports the involvement of microhomology mediated induced repair (MMBIR) mechanism in three small deletions. CONCLUSION Our data highlights the importance of using different research methods to elucidate atypical FXS mutational profiles, which are clinically undistinguishable and may have been underestimated. PMID:26716517
Genetic variations in NADPH-CYP450 oxidoreductase in a Czech Slavic cohort.

PubMed

Tomková, Mária; Panda, Satya Prakash; Šeda, Ondřej; Baxová, Alice; Hůlková, Martina; Siler Masters, Bettie Sue; Martásek, Pavel

2015-01-01

Estimating polymorphic allele frequencies of the NADPH-CYP450 oxidoreductase (POR) gene in a Czech Slavic population. The POR gene was analyzed in 322 individuals from a control cohort by sequencing and high resolution melting analysis. We identified seven unreported SNP genetic variations, including two SNPs in the 5' flanking region (g.4965C>T and g.4994G>T), one intronic variant (c.1899-20C>T), one synonymous SNP (p.20Ala=) and three nonsynonymous SNPs (p.Thr29Ser, p.Pro384Leu and p.Thr529Met). The p.Pro384Leu variant exhibited reduced enzymatic activities compared with wild-type. New POR variant identification indicates the number of uncommon variants might be specific for each subpopulation being investigated, particularly germane to the singular role that POR plays in providing reducing equivalents to all CYP450s in the endoplasmic reticulum. Original submitted 15 September 2014; Revision submitted 17 November 2014.
Analysis of the ABCR (ABCA4) gene in 4-aminoquinoline retinopathy: is retinal toxicity by chloroquine and hydroxychloroquine related to Stargardt disease?

PubMed

Shroyer, N F; Lewis, R A; Lupski, J R

2001-06-01

To determine if mutations in ABCR (ABCA4) are associated with chloroquine/hydroxychloroquine retinopathy. DNA from eight patients with chloroquine or hydroxychloroquine retinopathy was studied. Controls were 80 individuals over age 65 years with normal retinal examinations. Ophthalmoscopy, color vision testing, visual fields, retinal photography, and fluorescein angiography were performed on the eight patients. Direct DNA sequencing of the exons and flanking intronic regions of the ABCR gene was completed for all patients. Clinical evaluation confirmed the diagnosis of chloroquine/hydroxychloroquine retinopathy and excluded Stargardt disease in each patient. Two patients had heterozygous ABCR missense mutations previously associated with Stargardt disease. None of the controls had these missense mutations. Three other patients had other missense polymorphisms. Some individuals who have ABCR mutations may be predisposed to develop retinal toxicity when exposed to chloroquine/hydroxychloroquine. We urge further study of a larger cohort of patients with chloroquine/hydroxychloroquine retinopathy.
Mutations in the RS1 gene in a Chinese family with X-linked juvenile retinoschisis.

PubMed

Hou, Qiaofang; Chu, Yan; Guo, Qiannan; Wu, Dong; Liao, Shixiu

2012-02-01

The purpose of our study was to identify the mutations in the retinoschisis 1 (RS1) gene, which was associated with X-linked retinoschisis (XLRS) in a four-generation Chinese family, and to provide the theoretical basis for gene diagnosis and gene therapy. Genomic DNA was extracted from peripheral leukocytes. All six exons and flanking intronic regions were amplified by polymerase chain reaction (PCR), followed by direct sequencing. Through our genetic analysis, one frameshift 573delG mutation was identified in the patients of this four-generation pedigree; however, this mutation was absent in normal or non-carrier subjects. In conclusion, this 573delG mutation is reported in the Chinese population for the first time. This mutation widens the mutational spectrum of RS1 in Asians. Identification of mutations in the RS1 gene and expanded information on clinical manifestations will facilitate early diagnosis, appropriate early therapy, and genetic counseling regarding the prognosis of XLRS.
Intron Definition Is Required for Excision of the Minute Virus of Mice Small Intron and Definition of the Upstream Exon

PubMed Central

Haut, Donald D.; Pintel, D. J.

1998-01-01

Alternative splicing of pre-mRNAs plays a critical role in maximizing the coding capacity of the small parvovirus genome. The small-intron region of minute virus of mice (MVM) pre-mRNAs undergoes an unusual pattern of overlapping alternative splicing—using two donors (D1 and D2) and two acceptors (A1 and A2) within a region of 120 nucleotides—that determines the steady-state ratios of the various viral mRNAs. In this report, we show that the determinants that govern excision of the small intron are complex and are also required for efficient definition of the upstream exon. For the MVM small intron in its natural context, the two donors appear to compete for the splicing machinery: the position of D1 favors its usage, while the primary sequence of D2 must be more like the consensus sequence than is D1 to be used efficiently. We have genetically defined the branch points that are used for generation of the major and minor spliced forms and show that recognition of components of the small-intron acceptors is likely to be the dominant determinant in alternative small-intron excision. We have also identified a G-rich intronic enhancer sequence within the small intron that is essential for splicing of the minor form (D2 to A2) but not the major form (D1 to A1) of MVM mRNAs and is required for efficient definition of the upstream NS2-specific exon. In its natural context, the small intron appears to be excised by a mechanism consistent with intron definition. When the MVM small intron is expanded, various parameters of its excision are altered, indicating that critical cis-acting signals are context dependent. Relative use of the donors and acceptors is altered, and the upstream NS2-specific exon is no longer efficiently defined. The fact that definition of the upstream NS2-specific exon can be achieved by the MVM small intron in its natural context, but not when it is expanded, suggests that the multiple determinants that govern definition and excision of the small intron are required, in concert, for upstream exon definition. Our data are consistent with a model in which alternative splicing of the MVM P4-generated pre-mRNAs is governed by a hybrid of intron- and exon-defining mechanisms. PMID:9499034

Virtual Genome Walking across the 32 Gb Ambystoma mexicanum genome; assembling gene models and intronic sequence.

PubMed

Evans, Teri; Johnson, Andrew D; Loose, Matthew

2018-01-12

Large repeat rich genomes present challenges for assembly using short read technologies. The 32 Gb axolotl genome is estimated to contain ~19 Gb of repetitive DNA making an assembly from short reads alone effectively impossible. Indeed, this model species has been sequenced to 20× coverage but the reads could not be conventionally assembled. Using an alternative strategy, we have assembled subsets of these reads into scaffolds describing over 19,000 gene models. We call this method Virtual Genome Walking as it locally assembles whole genome reads based on a reference transcriptome, identifying exons and iteratively extending them into surrounding genomic sequence. These assemblies are then linked and refined to generate gene models including upstream and downstream genomic, and intronic, sequence. Our assemblies are validated by comparison with previously published axolotl bacterial artificial chromosome (BAC) sequences. Our analyses of axolotl intron length, intron-exon structure, repeat content and synteny provide novel insights into the genic structure of this model species. This resource will enable new experimental approaches in axolotl, such as ChIP-Seq and CRISPR and aid in future whole genome sequencing efforts. The assembled sequences and annotations presented here are freely available for download from https://tinyurl.com/y8gydc6n . The software pipeline is available from https://github.com/LooseLab/iterassemble .
Phylogenomic Resolution of the Phylogeny of Laurasiatherian Mammals: Exploring Phylogenetic Signals within Coding and Noncoding Sequences.

PubMed

Chen, Meng-Yun; Liang, Dan; Zhang, Peng

2017-08-01

The interordinal relationships of Laurasiatherian mammals are currently one of the most controversial questions in mammalian phylogenetics. Previous studies mainly relied on coding sequences (CDS) and seldom used noncoding sequences. Here, by data mining public genome data, we compiled an intron data set of 3,638 genes (all introns from a protein-coding gene are considered as a gene) (19,055,073 bp) and a CDS data set of 10,259 genes (20,994,285 bp), covering all major lineages of Laurasiatheria (except Pholidota). We found that the intron data contained stronger and more congruent phylogenetic signals than the CDS data. In agreement with this observation, concatenation and species-tree analyses of the intron data set yielded well-resolved and identical phylogenies, whereas the CDS data set produced weakly supported and incongruent results. Further analyses showed that the phylogeny inferred from the intron data is highly robust to data subsampling and change in outgroup, but the CDS data produced unstable results under the same conditions. Interestingly, gene tree statistical results showed that the most frequently observed gene tree topologies for the CDS and intron data are identical, suggesting that the major phylogenetic signal within the CDS data is actually congruent with that within the intron data. Our final result of Laurasiatheria phylogeny is (Eulipotyphla,((Chiroptera, Perissodactyla),(Carnivora, Cetartiodactyla))), favoring a close relationship between Chiroptera and Perissodactyla. Our study 1) provides a well-supported phylogenetic framework for Laurasiatheria, representing a step towards ending the long-standing "hard" polytomy and 2) argues that intron within genome data is a promising data resource for resolving rapid radiation events across the tree of life. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
A KCNH2 branch point mutation causing aberrant splicing contributes to an explanation of genotype-negative long QT syndrome.

PubMed

Crotti, Lia; Lewandowska, Marzena A; Schwartz, Peter J; Insolia, Roberto; Pedrazzini, Matteo; Bussani, Erica; Dagradi, Federica; George, Alfred L; Pagani, Franco

2009-02-01

Genetic screening of long QT syndrome (LQTS) fails to identify disease-causing mutations in about 30% of patients. So far, molecular screening has focused mainly on coding sequence mutations or on substitutions at canonical splice sites. The purpose of this study was to explore the possibility that intronic variants not at canonical splice sites might affect splicing regulatory elements, lead to aberrant transcripts, and cause LQTS. Molecular screening was performed through DHPLC and sequence analysis. The role of the intronic mutation identified was assessed with a hybrid minigene splicing assay. A three-generation LQTS family was investigated. Molecular screening failed to identify an obvious disease-causing mutation in the coding sequences of the major LQTS genes but revealed an intronic A-to-G substitution in KCNH2 (IVS9-28A/G) cosegregating with the clinical phenotype in family members. In vitro analysis proved that the mutation disrupts the acceptor splice site definition by affecting the branch point (BP) sequence and promoting intron retention. We further demonstrated a tight functional relationship between the BP and the polypyrimidine tract, whose weakness is responsible for the pathological effect of the IVS9-28A/G mutation. We identified a novel BP mutation in KCNH2 that disrupts the intron 9 acceptor splice site definition and causes LQT2. The present finding demonstrates that intronic mutations affecting pre-mRNA processing may contribute to the failure of traditional molecular screening in identifying disease-causing mutations in LQTS subjects and offers a rationale strategy for the reduction of genotype-negative cases.
Base pairing between the 3' exon and an internal guide sequence increases 3' splice site specificity in the Tetrahymena self-splicing rRNA intron.

PubMed Central

Suh, E R; Waring, R B

1990-01-01

It has been proposed that recognition of the 3' splice site in many group I introns involves base pairing between the start of the 3' exon and a region of the intron known as the internal guide sequence (R. W. Davies, R. B. Waring, J. Ray, T. A. Brown, and C. Scazzocchio, Nature [London] 300:719-724, 1982). We have examined this hypothesis, using the self-splicing rRNA intron from Tetrahymena thermophila. Mutations in the 3' exon that weaken this proposed pairing increased use of a downstream cryptic 3' splice site. Compensatory mutations in the guide sequence that restore this pairing resulted in even stronger selection of the normal 3' splice site. These changes in 3' splice site usage were more pronounced in the background of a mutation (414A) which resulted in an adenine instead of a guanine being the last base of the intron. These results show that the proposed pairing (P10) plays an important role in ensuring that cryptic 3' splice sites are selected against. Surprisingly, the 414A mutation alone did not result in activation of the cryptic 3' splice site. Images PMID:2342465
Mutations in POLR3A and POLR3B are a major cause of hypomyelinating leukodystrophies with or without dental abnormalities and/or hypogonadotropic hypogonadism.

PubMed

Daoud, Hussein; Tétreault, Martine; Gibson, William; Guerrero, Kether; Cohen, Ana; Gburek-Augustat, Janina; Synofzik, Matthis; Brais, Bernard; Stevens, Cathy A; Sanchez-Carpintero, Rocio; Goizet, Cyril; Naidu, Sakkubai; Vanderver, Adeline; Bernard, Geneviève

2013-03-01

Leukodystrophies are a heterogeneous group of inherited neurodegenerative disorders characterised by abnormal central nervous system white matter. Mutations in POLR3A and POLR3B genes were recently reported to cause four clinically overlapping hypomyelinating leukodystrophy phenotypes. Our aim was to investigate the presence and frequency of POLR3A and POLR3B mutations in patients with genetically unexplained hypomyelinating leukodystrophies with typical clinical and/or radiologic features of Pol III-related leukodystrophies. The entire coding region and the flanking exon/intron boundaries of POLR3A and/or POLR3B genes were amplified and sequenced in 14 patients. Recessive mutations in POLR3A or POLR3B were uncovered in all 14 patients. Eight novel mutations were identified in POLR3A: six missenses, one nonsense, and one frameshift mutation. Seven patients carried compound heterozygous mutations in POLR3B, of whom six shared the common mutation in exon 15 (p.V523E). Seven novel mutations were identified in POLR3B: four missenses, two splice sites, and one intronic mutation. To date, our group has described 37 patients, of whom 27 have mutations in POLR3A and 10 in POLR3B, respectively. Altogether, our results further support the proposal that POLR3A and POLR3B mutations are a major cause of hypomyelinating leukodystrophies and suggest that POLR3A mutations are more frequent.
Contrasting population structure from nuclear intron sequences and mtDNA of humpback whales.

PubMed

Palumbi, S R; Baker, C S

1994-05-01

Powerful analyses of population structure require information from multiple genetic loci. To help develop a molecular toolbox for obtaining this information, we have designed universal oligonucleotide primers that span conserved intron-exon junctions in a wide variety of animal phyla. We test the utility of exon-primed, intron-crossing amplifications by analyzing the variability of actin intron sequences from humpback, blue, and bowhead whales and comparing the results with mitochondrial DNA (mtDNA) haplotype data. Humpback actin introns fall into two major clades that exist in different frequencies in different oceanic populations. It is surprising that Hawaii and California populations, which are very distinct in mtDNAs, are similar in actin intron alleles. This discrepancy between mtDNA and nuclear DNA results may be due either to differences in genetic drift in mitochondrial and nuclear genes or to preferential movement of males, which do not transmit mtDNA to offspring, between separate breeding grounds. Opposing mtDNA and nuclear DNA results can help clarify otherwise hidden patterns of structure in natural populations.
A Rapid, High-Quality, Cost-Effective, Comprehensive and Expandable Targeted Next-Generation Sequencing Assay for Inherited Heart Diseases.

PubMed

Wilson, Kitchener D; Shen, Peidong; Fung, Eula; Karakikes, Ioannis; Zhang, Angela; InanlooRahatloo, Kolsoum; Odegaard, Justin; Sallam, Karim; Davis, Ronald W; Lui, George K; Ashley, Euan A; Scharfe, Curt; Wu, Joseph C

2015-09-11

Thousands of mutations across >50 genes have been implicated in inherited cardiomyopathies. However, options for sequencing this rapidly evolving gene set are limited because many sequencing services and off-the-shelf kits suffer from slow turnaround, inefficient capture of genomic DNA, and high cost. Furthermore, customization of these assays to cover emerging targets that suit individual needs is often expensive and time consuming. We sought to develop a custom high throughput, clinical-grade next-generation sequencing assay for detecting cardiac disease gene mutations with improved accuracy, flexibility, turnaround, and cost. We used double-stranded probes (complementary long padlock probes), an inexpensive and customizable capture technology, to efficiently capture and amplify the entire coding region and flanking intronic and regulatory sequences of 88 genes and 40 microRNAs associated with inherited cardiomyopathies, congenital heart disease, and cardiac development. Multiplexing 11 samples per sequencing run resulted in a mean base pair coverage of 420, of which 97% had >20× coverage and >99% were concordant with known heterozygous single nucleotide polymorphisms. The assay correctly detected germline variants in 24 individuals and revealed several polymorphic regions in miR-499. Total run time was 3 days at an approximate cost of $100 per sample. Accurate, high-throughput detection of mutations across numerous cardiac genes is achievable with complementary long padlock probe technology. Moreover, this format allows facile insertion of additional probes as more cardiomyopathy and congenital heart disease genes are discovered, giving researchers a powerful new tool for DNA mutation detection and discovery. © 2015 American Heart Association, Inc.
Human T-cell leukemia virus type 1 Tax requires direct access to DNA for recruitment of CREB binding protein to the viral promoter.

PubMed

Lenzmeier, B A; Giebler, H A; Nyborg, J K

1998-02-01

Efficient human T-cell leukemia virus type 1 (HTLV-1) replication and viral gene expression are dependent upon the virally encoded oncoprotein Tax. To activate HTLV-1 transcription, Tax interacts with the cellular DNA binding protein cyclic AMP-responsive element binding protein (CREB) and recruits the coactivator CREB binding protein (CBP), forming a nucleoprotein complex on the three viral cyclic AMP-responsive elements (CREs) in the HTLV-1 promoter. Short stretches of dG-dC-rich (GC-rich) DNA, immediately flanking each of the viral CREs, are essential for Tax recruitment of CBP in vitro and Tax transactivation in vivo. Although the importance of the viral CRE-flanking sequences is well established, several studies have failed to identify an interaction between Tax and the DNA. The mechanistic role of the viral CRE-flanking sequences has therefore remained enigmatic. In this study, we used high resolution methidiumpropyl-EDTA iron(II) footprinting to show that Tax extended the CREB footprint into the GC-rich DNA flanking sequences of the viral CRE. The Tax-CREB footprint was enhanced but not extended by the KIX domain of CBP, suggesting that the coactivator increased the stability of the nucleoprotein complex. Conversely, the footprint pattern of CREB on a cellular CRE lacking GC-rich flanking sequences did not change in the presence of Tax or Tax plus KIX. The minor-groove DNA binding drug chromomycin A3 bound to the GC-rich flanking sequences and inhibited the association of Tax and the Tax-CBP complex without affecting CREB binding. Tax specifically cross-linked to the viral CRE in the 5'-flanking sequence, and this cross-link was blocked by chromomycin A3. Together, these data support a model where Tax interacts directly with both CREB and the minor-groove viral CRE-flanking sequences to form a high-affinity binding site for the recruitment of CBP to the HTLV-1 promoter.
Influence of flanking sequences on presentation efficiency of a CD8+ cytotoxic T-cell epitope delivered by parvovirus-like particles.

PubMed

Rueda, P; Morón, G; Sarraseca, J; Leclerc, C; Casal, J I

2004-03-01

We have previously developed an antigen-delivery system based on hybrid recombinant porcine parvovirus-like particles (PPV-VLPs) formed by the self-assembly of the VP2 protein of PPV carrying a foreign epitope at its N terminus. In this study, different constructs were made containing a CD8(+) T-cell epitope of chicken ovalbumin (OVA) to analyse the influence of the sequence inserted into VP2 on the correct processing of VLPs by antigen-presenting cells. We analysed the presentation of the OVA epitope inserted without flanking sequences or with either different natural flanking sequences or with the natural flanking sequences of a CD8(+) T-cell epitope from the lymphocytic choriomeningitis virus nucleoprotein, and as a dimer with or without linker sequences. All constructs were studied in terms of level of expression, assembly of VLPs and ability to deliver the inserted epitope into the MHC I pathway. The presentation of the OVA epitope was considerably improved by insertion of short natural flanking sequences, which indicated the relevance of the flanking sequences on the processing of PPV-VLPs. Only PPV-VLPs carrying two copies of the OVA epitope linked by two glycines were able to be properly processed, suggesting that the introduction of flexible residues between the two consecutive OVA epitopes may be necessary for the correct presentation of these dimers by PPV-VLPs. These results provide information to improve the insertion of epitopes into PPV-VLPs to facilitate their processing and presentation by MHC class I molecules.
Nucleotide sequence of the COX1 gene in Kluyveromyces lactis mitochondrial DNA: evidence for recent horizontal transfer of a group II intron.

PubMed

Hardy, C M; Clark-Walker, G D

1991-07-01

The cytochrome oxidase subunit 1 gene (COX1) in K. lactis K8 mtDNA spans 8,826 bp and contains five exons (termed E1-E5) totalling 1,602 bp that show 88% nucleotide base matching and 91% amino acid homology to the equivalent gene in S. cerevisiae. The four introns (termed K1 cox1.1-1.4) contain open reading frames encoding proteins of 786, 333, 319 and 395 amino acids respectively that potentially encode maturase enzymes. The first intron belongs to group II whereas the remaining three are group I type B. Introns K1 cox1.1, 1.3, and 1.4 are found at identical locations to introns Sc cox1.2, 1.5 a, and 1.5 b respectively from S. cerevisiae. Horizontal transfer of an intron between recent progenitors of K. lactis and S. cerevisiae is suggested by the observation that K1 cox1.1 and Sc cox1.2 show 96% base matching. Sequence comparisons between K1 cox1.3/Sc cox1.5 a and K1 cox1.4/Sc cox1.5 b suggest that these introns are likely to have been present in the ancestral COX1 gene of these yeasts. Intron K1 cox1.2 is not found in S. cerevisiae and appears at an unique location in K. lactis. A feature of the DNA sequences of the group I introns K1 cox1.2, 1.3, and 1.4 is the presence of 11 GC-rich clusters inserted into both coding and noncoding regions. Immediately downstream of the COX1 gene is the ATPase subunit 8 gene (A8) that shows 82.6% base matching to its counterpart in S. cerevisiae mtDNA.
Molecular and bioinformatical characterization of a novel superfamily of cysteine-rich peptides from arthropods.

PubMed

Zeng, Xian-Chun; Nie, Yao; Luo, Xuesong; Wu, Shifen; Shi, Wanxia; Zhang, Lei; Liu, Yichen; Cao, Hanjun; Yang, Ye; Zhou, Jianping

2013-03-01

The full-length cDNA sequences of two novel cysteine-rich peptides (referred to as HsVx1 and MmKTx1) were obtained from scorpions. The two peptides represent a novel class of cysteine-rich peptides with a unique cysteine pattern. The genomic sequence of HsVx1 is composed of three exons interrupted by two introns that are localized in the mature peptide encoding region and inserted in phase 1 and phase 2, respectively. Such a genomic organization markedly differs from those of other peptides from scorpions described previously. Genome-wide search for the orthologs of HsVx1 identified 59 novel cysteine-rich peptides from arthropods. These peptides share a consistent cysteine pattern with HsVx1. Genomic comparison revealed extensive intron length differences and intronic number and position polymorphisms among the genes of these peptides. Further analysis identified 30 cases of intron sliding, 1 case of intron gain and 22 cases of intron loss occurred with the genes of the HsVx1 and HsVx1-like peptides. It is interesting to see that three HsVx1-like peptides XP_001658928, XP_001658929 and XP_001658930 were derived from a single gene (XP gene): the former two were generated from alternative splicing; the third one was encoded by a DNA region in the reverse complementary strand of the third intron of the XP gene. These findings strongly suggest that the genes of these cysteine-rich peptides were evolved by intron sliding, intron gain/loss, gene recombination and alternative splicing events in response to selective forces without changing their cysteine pattern. The evolution of these genes is dominated by intron sliding and intron loss. Copyright © 2012 Elsevier Inc. All rights reserved.
The Reverse Transcriptase/RNA Maturase Protein MatR Is Required for the Splicing of Various Group II Introns in Brassicaceae Mitochondria.

PubMed

Sultan, Laure D; Mileshina, Daria; Grewe, Felix; Rolle, Katarzyna; Abudraham, Sivan; Głodowicz, Paweł; Niazi, Adnan Khan; Keren, Ido; Shevtsov, Sofia; Klipcan, Liron; Barciszewski, Jan; Mower, Jeffrey P; Dietrich, André; Ostersetzer-Biran, Oren

2016-11-01

Group II introns are large catalytic RNAs that are ancestrally related to nuclear spliceosomal introns. Sequences corresponding to group II RNAs are found in many prokaryotes and are particularly prevalent within plants organellar genomes. Proteins encoded within the introns themselves (maturases) facilitate the splicing of their own host pre-RNAs. Mitochondrial introns in plants have diverged considerably in sequence and have lost their maturases. In angiosperms, only a single maturase has been retained in the mitochondrial DNA: the matR gene found within NADH dehydrogenase 1 (nad1) intron 4. Its conservation across land plants and RNA editing events, which restore conserved amino acids, indicates that matR encodes a functional protein. However, the biological role of MatR remains unclear. Here, we performed an in vivo investigation of the roles of MatR in Brassicaceae. Directed knockdown of matR expression via synthetically designed ribozymes altered the processing of various introns, including nad1 i4. Pull-down experiments further indicated that MatR is associated with nad1 i4 and several other intron-containing pre-mRNAs. MatR may thus represent an intermediate link in the gradual evolutionary transition from the intron-specific maturases in bacteria into their versatile spliceosomal descendants in the nucleus. The similarity between maturases and the core spliceosomal Prp8 protein further supports this intriguing theory. © 2016 American Society of Plant Biologists. All rights reserved.
Length and sequence dependence in the association of Huntingtin protein with lipid membranes

NASA Astrophysics Data System (ADS)

Jawahery, Sudi; Nagarajan, Anu; Matysiak, Silvina

2013-03-01

There is a fundamental gap in our understanding of how aggregates of mutant Huntingtin protein (htt) with overextended polyglutamine (polyQ) sequences gain the toxic properties that cause Huntington's disease (HD). Experimental studies have shown that the most important step associated with toxicity is the binding of mutant htt aggregates to lipid membranes. Studies have also shown that flanking amino acid sequences around the polyQ sequence directly affect interactions with the lipid bilayer, and that polyQ sequences of greater than 35 glutamine repeats in htt are a characteristic of HD. The key steps that determine how flanking sequences and polyQ length affect the structure of lipid bilayers remain unknown. In this study, we use atomistic molecular dynamics simulations to study the interactions between lipid membranes of varying compositions and polyQ peptides of varying lengths and flanking sequences. We find that overextended polyQ interactions do cause deformation in model membranes, and that the flanking sequences do play a role in intensifying this deformation by altering the shape of the affected regions.
The alternative oxidase family of Vitis vinifera reveals an attractive model to study the importance of genomic design.

PubMed

Costa, José Hélio; de Melo, Dirce Fernandes; Gouveia, Zélia; Cardoso, Hélia Guerra; Peixe, Augusto; Arnholdt-Schmitt, Birgit

2009-12-01

'Genomic design' refers to the structural organization of gene sequences. Recently, the role of intron sequences for gene regulation is being better understood. Further, introns possess high rates of polymorphism that are considered as the major source for speciation. In molecular breeding, the length of gene-specific introns is recognized as a tool to discriminate genotypes with diverse traits of agronomic interest. 'Economy selection' and 'time-economy selection' have been proposed as models for explaining why highly expressed genes typically contain small introns. However, in contrast to these theories, plant-specific selection reveals that highly expressed genes contain introns that are large. In the presented research, 'wet'Aox gene identification from grapevine is advanced by a bioinformatics approach to study the species-specific organization of Aox gene structures in relation to available expressed sequence tag (EST) data. Two Aox1 and one Aox2 gene sequences have been identified in Vitis vinifera using grapevine cultivars from Portugal and Germany. Searching the complete genome sequence data of two grapevine cultivars confirmed that V. vinifera alternative oxidase (Aox) is encoded by a small multigene family composed of Aox1a, Aox1b and Aox2. An analysis of EST distribution revealed high expression of the VvAox2 gene. A relationship between the atypical long primary transcript of VvAox2 (in comparison to other plant Aox genes) and its expression level is suggested. V. vinifera Aox genes contain four exons interrupted by three introns except for Aox1a which contains an additional intron in the 3'-UTR. The lengths of primary Aox transcripts were estimated for each gene in two V. vinifera varieties: PN40024 and Pinot Noir. In both varieties, Aox1a and Aox1b contained small introns that corresponded to primary transcript lengths ranging from 1501 to 1810 bp. The Aox2 of PN40024 (12 329 bp) was longer than that from Pinot Noir (7279 bp) because of selection against a transposable-element insertion that is 5028 bp in size. An EST database basic local alignment search tool (BLAST) search of GenBank revealed the following ESTs percentages for each gene: Aox1a (26.2%), Aox1b (11.9%) and Aox2 (61.9%). Aox1a was expressed in fruits and roots, Aox1b expression was confined to flowers and Aox2 was ubiquitously expressed. These data for V. vinifera show that atypically long Aox intron lengths are related to high levels of gene expression. Furthermore, it is shown for the first time that two grapevine cultivars can be distinguished by Aox intron length polymorphism.
Selfish DNA: homing endonucleases find a home.

PubMed

Edgell, David R

2009-02-10

Self-splicing group I introns come in two flavours - those with a homing endonuclease to promote mobility of the intron, and those without an endonuclease. How homing endonucleases and self-splicing introns associate to form a composite selfish genetic element is a question of long-standing interest. Recent work has revealed that a shared characteristic of both introns and endonucleases, the targeting of conserved sequences, may provide the impetus for the evolution of composite mobile genetic elements.
Introduction of a novel 18S rDNA gene arrangement along with distinct ITS region in the saline water microalga Dunaliella

PubMed Central

2010-01-01

Comparison of 18S rDNA gene sequences is a very promising method for identification and classification of living organisms. Molecular identification and discrimination of different Dunaliella species were carried out based on the size of 18S rDNA gene and, number and position of introns in the gene. Three types of 18S rDNA structure have already been reported: the gene with a size of ~1770 bp lacking any intron, with a size of ~2170 bp consisting one intron near 5' terminus, and with a size of ~2570 bp harbouring two introns near 5' and 3' termini. Hereby, we report a new 18S rDNA gene arrangement in terms of intron localization and nucleotide sequence in a Dunaliella isolated from Iranian salt lakes (ABRIINW-M1/2). PCR amplification with genus-specific primers resulted in production of a ~2170 bp DNA band, which is similar to that of D. salina 18S rDNA gene containing only one intron near 5' terminus. Whilst, sequence composition of the gene revealed the lack of any intron near 5' terminus in our isolate. Furthermore, another alteration was observed due to the presence of a 440 bp DNA fragment near 3' terminus. Accordingly, 18S rDNA gene of the isolate is clearly different from those of D. salina and any other Dunaliella species reported so far. Moreover, analysis of ITS region sequence showed the diversity of this region compared to the previously reported species. 18S rDNA and ITS sequences of our isolate were submitted with accesion numbers of EU678868 and EU927373 in NCBI database, respectively. The optimum growth rate of this isolate occured at the salinity level of 1 M NaCl. The maximum carotenoid content under stress condition of intense light (400 μmol photon m-2 s-1), high salinity (4 M NaCl) and deficiency of nitrate and phosphate nutritions reached to 240 ng/cell after 15 days. PMID:20377865
[Polymorphisms of inhibin α gene exon 1 in buffalo (Bubalus bubalis), gayal (Bos frontalis) and yak (Bos grunniens)].

PubMed

Miao, Yong-Wang; Ha, Fu; Gao, Hua-Shan; Yuan, Feng; Li, Da-Lin; Yuan, Yue-Yun

2012-08-01

To elucidate the genetic characteristics of the bovine Inhibin α subunit (INHA) gene, the polymorphisms in exon 1 of INHA and its bilateral sequences were assayed using PCR with direct sequencing in buffalo, gayal and yak. A comparative analysis was conducted by pooled the results in this study with the published data of INHA on some mammals including some bovine species together. A synonymous substitution c.73C>A was identified in exon 1 of INHA for buffalo, which results in identical encoding product in river and swamp buffalo. In gayal, two non-synonymous but same property substitutions in exon 1 of INHA, viz. c.62 C>T and c.187 G>A, were detected, which lead to p. P21L, p. V63M changes in INHA, respectively. In yak, nucleotide substitution c.62C> T, c.129A>G were found in exon 1 of INHA, the former still causes p. P21L substitution and the latter is synonymous. For the sequence of the 5'-flanking region of INHA examined, no SNPs were found within the species, but a substitution, c. -6T>G, was found. The nucleotide in this site in gayal, yak and cattle was c. -6G, whereas in buffalo it was c. -6T. Meanwhile, a 6-bp deletion, namely c. 262+31_262+36delTCTGAC, was found in the intron of buffalo INHA gene. For this deletion, wild types (+/+) account for main part in river buffalo while mutant types (-/-) are predominant in swamp buffalo. This deletion was not found in gayal, yak and cattle, though these all have another deletion in the intron of INHA, c. 262+78_262+79delTG. The results of sequence alignment showed that the substitutions c. 43A and c. 67G in exon 1 of INHA are specific to buffalo, whereas the substitutions c. 173A and c. 255G are exclusive to gayal, yak and cattle, and c. 24C, c. 47G, c. 174T and c. 206T are specific to goat. Furthermore, there are few differences among gayal, yak and cattle, but there relatively great differences between buffalo, goat and other bovine species regarding the sequences of INHA exon 1.
Complete plastid genome sequence of the chickpea (Cicer arietinum) and the phylogenetic distribution of rps12 and clpP intron losses among legumes (Leguminosae)

PubMed Central

Jansen, Robert K.; Wojciechowski, Martin F.; Sanniyasi, Elumalai; Lee, Seung-Bum; Daniell, Henry

2008-01-01

Chickpea (Cicer arietinum, Leguminosae), an important grain legume, is widely used for food and fodder throughout the world. We sequenced the complete plastid genome of chickpea, which is 125,319 bp in size, and contains only one copy of the inverted repeat (IR). The genome encodes 108 genes, including 4 rRNAs, 29 tRNAs, and 75 proteins. The genes rps16, infA, and ycf4 are absent in the chickpea plastid genome, and ndhB has an internal stop codon in the 5′exon, similar to other legumes. Two genes have lost their introns, one in the 3′exon of the transpliced gene rps12, and the one between exons 1 and 2 of clpP; this represents the first documented case of the loss of introns from both of these genes in the same plastid genome. An extensive phylogenetic survey of these intron losses was performed on 302 taxa across legumes and the related family Polygalaceae. The clpP intron has been lost exclusively in taxa from the temperate “IR-lacking clade” (IRLC), whereas the rps12 intron has been lost in most members of the IRLC (with the exception of Wisteria, Callerya, Afgekia, and certain species of Millettia, which represent the earliest diverging lineages of this clade), and in the tribe Desmodieae, which is closely related to the tribes Phaseoleae and Psoraleeae. Data provided here suggest that the loss of the rps12 intron occurred after the loss of the IR. The two new genomic changes identified in the present study provide additional support of the monophyly of the IR-loss clade, and resolution of the pattern of the earliest-branching lineages in this clade. The availability of the complete chickpea plastid genome sequence also provides valuable information on intergenic spacer regions among legumes and endogenous regulatory sequences for plastid genetic engineering. PMID:18638561
Structural and transcription analysis of two homologous genes for the P700 chlorophyll a-apoproteins in Chlamydomonas reinhardii: evidence for in vivo trans-splicing

PubMed Central

Kück, Ulrich; Choquet, Yves; Schneider, Michel; Dron, Michel; Bennoun, Pierre

1987-01-01

The two homologous genes for the P700 chlorophyll a-apoproteins (ps1A1 and ps1A2) are encoded by the plastom in the green alga Chlamydomonas reinhardii. The structure and organization of the two genes were determined by comparison with the homologous genes from maize using data from heterologous hybridizations as well as from DNA and RNA sequencing. While the ps1A2 (736 codons) gene shows a continuous gene organization, the ps1A1 (754 codons) gene possesses some unusual features. The discontinuous gene is split into three separate exons which are scattered around the circular chloroplast genome. Exon 1 (86 bp) is separated by ∼50 kb from exon 2 (198 bp), which is located ∼ 90 kb apart from exon 3 (1984 bp). All exons are flanked by intronic sequences of group II. Transcription analysis reveals that the ps1A2 gene hybridizes with a 2.8-kb transcript, while all exon regions of the ps1A1 gene are homologous to a mature mRNA of 2.7 kb. From our data we conclude that the three distantly separated exonic sequences of the ps1A1 gene constitute a functional gene which probably operates by a trans-splicing mechanism. ImagesFig. 3.Fig. 5.Fig. 6. PMID:16453785
Draft genome of the American Eel (Anguilla rostrata).

PubMed

Pavey, Scott A; Laporte, Martin; Normandeau, Eric; Gaudin, Jérémy; Letourneau, Louis; Boisvert, Sébastien; Corbeil, Jacques; Audet, Céline; Bernatchez, Louis

2017-07-01

Freshwater eels (Anguilla sp.) have large economic, cultural, ecological and aesthetic importance worldwide, but they suffered more than 90% decline in global stocks over the past few decades. Proper genetic resources, such as sequenced, assembled and annotated genomes, are essential to help plan sustainable recoveries by identifying physiological, biochemical and genetic mechanisms that caused the declines or that may lead to recoveries. Here, we present the first sequenced genome of the American eel. This genome contained 305 043 contigs (N50 = 7397) and 79 209 scaffolds (N50 = 86 641) for a total size of 1.41 Gb, which is in the middle of the range of previous estimations for this species. In addition, protein-coding regions, including introns and flanking regions, are very well represented in the genome, as 95.2% of the 458 core eukaryotic genes and 98.8% of the 248 ultra-conserved subset were represented in the assembly and a total of 26 564 genes were annotated for future functional genomics studies. We performed a candidate gene analysis to compare three genes among all three freshwater eel species and, congruent with the phylogenetic relationships, Japanese eel (A. japanica) exhibited the most divergence. Overall, the sequenced genome presented in this study is a crucial addition to the presently available genetic tools to help guide future conservation efforts of freshwater eels. © 2016 John Wiley & Sons Ltd.

Pseudo-Bartter syndrome as the sole manifestation of cystic fibrosis in a child with 711+G>T/IVS8-5T mutation: a new face of an old disease.

PubMed

Tinsa, Faten; Hadj Fredj, Sondes; Bel Hadj, Imen; Khalsi, Fatma; Abdelhak, Sonia; Boussetta, Khadija; Messaoud, Taieb

2017-08-01

Pseudo-Bartter syndrome (PBS) describes an uncommon complication of cystic fibrosis leading to hypochloraemic, hypokalaemic metabolic alkalosis. PBS as the sole manifestation of cystic fibrosis in children is extremely rare and has never been described in patients carrying 5T variant. We report a clinical, biochemical and genetic study of a four year-old boy presenting a pseudo-Bartter syndrome as the sole manifestation of cystic fibrosis. All 27 exons and the flanking intron regions of the CFTR gene were analysed by PCR and direct sequencing. Direct sequencing was also used to analyse TG m T n and M470V polymorphisms in the patient and his parents. Two sweat tests were abnormal with elevated chloride levels at 78 and 88 mmol/L. DNA sequencing revealed a heterozygous mutation 711+1 G>T and an IVS8-T5 allele. The mutation 711+1 G>T is in trans with the IVS8-T5-TG11 allele and the child carried M470/V470 genotype. To the best of our knowledge, the genotype 711+1 G>T /IVS8-5T found in our patient is described for the first time. The role of TG11-5T-V470 allele in cases of cystic fibrosis with PB syndrome remains to be determined.
PAH mutation spectrum and correlation with PKU manifestation in north Jiangsu province population.

PubMed

Wang, Zhen-Wen; Jiang, Shi-Wen; Zhou, Bao-Cheng

2018-02-01

Phenylketonuria (PKU) is a common autosomal recessive disorder of phenylalanine metabolism and mainly results a deficiency of phenylalanine hydroxylase gene (PAH). The incidence of various PAH mutations have race and ethnicity differences. We report a spectrum of PAH mutations complied from 35 PKU children who are all Chinese Han population from north Jiangsu in this study. All 13 exons and their flanking intron sequences of PAH were determined by Ion Torrent PGM™ sequencing. The relationship of genotype and phenotype was analyzed based on the sum of the arbitrary value (AV) values of the two alleles. We identified 61 mutations, with a frequency of 87.14%, among 70 alleles of 35 patients. The most prevalent mutations were R243Q (26.23%), R241C (9.84%) and V399V (8.20%). Furthermore, the consistency between prediction of the biochemical phenotype and the observed phenotype was 81.25%, with the highest consistency observed in classic PKU (87.50%). A significant correlation was found between pretreatment levels of phenylalanine and AV sum (r = -0.87, P < 0.05). Finally, our study constructs PAH mutation spectrum by next generation sequencing (NGS), and reveals that the PAH genotypes and biochemical phenotypes were significantly correlated. These offers facilitate the provision of appropriate genetic counseling for PKU patients. Copyright © 2017. Published by Elsevier Taiwan.
Identification of human short introns

PubMed Central

Abebrese, Emmanuel L.; Arnold, Zachary R.; Armstrong, Katharine; Burns, Lindsay; Day, R. Thomas; Hsu, Daniel G.; Jarrell, Katherine; Luo, Yi; Mugayo, Daphine

2017-01-01

Canonical pre-mRNA splicing requires snRNPs and associated splicing factors to excise conserved intronic sequences, with a minimum intron length required for efficient splicing. Non-canonical splicing–intron excision without the spliceosome–has been documented; most notably, some tRNAs and the XBP1 mRNA contain short introns that are not removed by the spliceosome. There have been some efforts to identify additional short introns, but little is known about how many short introns are processed from mRNAs. Here, we report an approach to identify RNA short introns from RNA-Seq data, discriminating against small genomic deletions. We identify hundreds of short introns conserved among multiple human cell lines. These short introns are often alternatively spliced and are found in a variety of RNAs–both mRNAs and lncRNAs. Short intron splicing efficiency is increased by secondary structure, and we detect both canonical and non-canonical short introns. In many cases, splicing of these short introns from mRNAs is predicted to alter the reading frame and change protein output. Our findings imply that standard gene prediction models which often assume a lower limit for intron size fail to predict short introns effectively. We conclude that short introns are abundant in the human transcriptome, and short intron splicing represents an added layer to mRNA regulation. PMID:28520720
A trait stacking system via intra-genomic homologous recombination.

PubMed

Kumar, Sandeep; Worden, Andrew; Novak, Stephen; Lee, Ryan; Petolino, Joseph F

2016-11-01

A gene targeting method has been developed, which allows the conversion of 'breeding stacks', containing unlinked transgenes into a 'molecular stack' and thereby circumventing the breeding challenges associated with transgene segregation. A gene targeting method has been developed for converting two unlinked trait loci into a single locus transgene stack. The method utilizes intra-genomic homologous recombination (IGHR) between stably integrated target and donor loci which share sequence homology and nuclease cleavage sites whereby the donor contains a promoterless herbicide resistance transgene. Upon crossing with a zinc finger nuclease (ZFN)-expressing plant, double-strand breaks (DSB) are created in both the stably integrated target and donor loci. DSBs flanking the donor locus result in intra-genomic mobilization of a promoterless selectable marker-containing donor sequence, which can be utilized as a template for homology-directed repair of a concomitant DSB at the target locus resulting in a functional selectable marker via nuclease-mediated cassette exchange (NMCE). The method was successfully demonstrated in maize using a glyphosate tolerance gene as a donor whereby up to 3.3 % of the resulting progeny embryos cultured on selection medium regenerated plants with the donor sequence integrated into the target locus. The process could be extended to multiple cycles of trait stacking by virtue of a unique intron sequence homology for NMCE between the target and the donor loci. This is the first report that describes NMCE via IGHR, thereby enabling trait stacking using conventional crossing.
The HLA-DRB9 gene and the origin of HLA-DR haplotypes.

PubMed

Gongora, R; Figueroa, F; Klein, J

1996-11-01

HLA-DRB9 is a gene fragment consisting of exon 2 and flanking intron sequences. It is located at the extreme end of the DRB subregion, whose other end is demarcated by the DRB1 locus. We sequenced approximately 1400 base pairs of the segment encompassing the DRB9 locus from eight human haplotypes (DR1, DR10, DR2, DR3, DR5, DR6, DR8, and DR9, the DR4 and DR7 having been sequenced by others earlier), as well as two chimpanzee, five gorillas, one orangutan and one macaque haplotype. The analysis of these sequences indicates that the DRB9 locus, which we estimate to be more than 58 million years (my) old, has been coevolving with the DRB1 locus for the last 4.2 my. As a consequence of this coevolution, the human DRB9 alleles fall into groups that correlate with the DRB1 allelic groups and with the gene organization of the human haplotypes. This observation implies that the present-day HLA-DR haplotype groups (DR1, DR51, DR52, DR8, and DR53) were founded more than 4 my ago and have remained intact (barring minor internal rearrangements that did not recombine the DRB1 and DRB9 genes) for this period of time. The haplotypes have been transmitted during speciations from ancestral to emerging species just like allelic lineages at the DRB1 locus. Thus not only allelic but also haplotype polymorphism evolves trans-specifically.
Biotechnological applications of mobile group II introns and their reverse transcriptases: gene targeting, RNA-seq, and non-coding RNA analysis.

PubMed

Enyeart, Peter J; Mohr, Georg; Ellington, Andrew D; Lambowitz, Alan M

2014-01-13

Mobile group II introns are bacterial retrotransposons that combine the activities of an autocatalytic intron RNA (a ribozyme) and an intron-encoded reverse transcriptase to insert site-specifically into DNA. They recognize DNA target sites largely by base pairing of sequences within the intron RNA and achieve high DNA target specificity by using the ribozyme active site to couple correct base pairing to RNA-catalyzed intron integration. Algorithms have been developed to program the DNA target site specificity of several mobile group II introns, allowing them to be made into 'targetrons.' Targetrons function for gene targeting in a wide variety of bacteria and typically integrate at efficiencies high enough to be screened easily by colony PCR, without the need for selectable markers. Targetrons have found wide application in microbiological research, enabling gene targeting and genetic engineering of bacteria that had been intractable to other methods. Recently, a thermostable targetron has been developed for use in bacterial thermophiles, and new methods have been developed for using targetrons to position recombinase recognition sites, enabling large-scale genome-editing operations, such as deletions, inversions, insertions, and 'cut-and-pastes' (that is, translocation of large DNA segments), in a wide range of bacteria at high efficiency. Using targetrons in eukaryotes presents challenges due to the difficulties of nuclear localization and sub-optimal magnesium concentrations, although supplementation with magnesium can increase integration efficiency, and directed evolution is being employed to overcome these barriers. Finally, spurred by new methods for expressing group II intron reverse transcriptases that yield large amounts of highly active protein, thermostable group II intron reverse transcriptases from bacterial thermophiles are being used as research tools for a variety of applications, including qRT-PCR and next-generation RNA sequencing (RNA-seq). The high processivity and fidelity of group II intron reverse transcriptases along with their novel template-switching activity, which can directly link RNA-seq adaptor sequences to cDNAs during reverse transcription, open new approaches for RNA-seq and the identification and profiling of non-coding RNAs, with potentially wide applications in research and biotechnology.
Towards barcode markers in Fungi: an intron map of Ascomycota mitochondria.

PubMed

Santamaria, Monica; Vicario, Saverio; Pappadà, Graziano; Scioscia, Gaetano; Scazzocchio, Claudio; Saccone, Cecilia

2009-06-16

A standardized and cost-effective molecular identification system is now an urgent need for Fungi owing to their wide involvement in human life quality. In particular the potential use of mitochondrial DNA species markers has been taken in account. Unfortunately, a serious difficulty in the PCR and bioinformatic surveys is due to the presence of mobile introns in almost all the fungal mitochondrial genes. The aim of this work is to verify the incidence of this phenomenon in Ascomycota, testing, at the same time, a new bioinformatic tool for extracting and managing sequence databases annotations, in order to identify the mitochondrial gene regions where introns are missing so as to propose them as species markers. The general trend towards a large occurrence of introns in the mitochondrial genome of Fungi has been confirmed in Ascomycota by an extensive bioinformatic analysis, performed on all the entries concerning 11 mitochondrial protein coding genes and 2 mitochondrial rRNA (ribosomal RNA) specifying genes, belonging to this phylum, available in public nucleotide sequence databases. A new query approach has been developed to retrieve effectively introns information included in these entries. After comparing the new query-based approach with a blast-based procedure, with the aim of designing a faithful Ascomycota mitochondrial intron map, the first method appeared clearly the most accurate. Within this map, despite the large pervasiveness of introns, it is possible to distinguish specific regions comprised in several genes, including the full NADH dehydrogenase subunit 6 (ND6) gene, which could be considered as barcode candidates for Ascomycota due to their paucity of introns and to their length, above 400 bp, comparable to the lower end size of the length range of barcodes successfully used in animals. The development of the new query system described here would answer the pressing requirement to improve drastically the bioinformatics support to the DNA Barcode Initiative. The large scale investigation of Ascomycota mitochondrial introns performed through this tool, allowing to exclude the introns-rich sequences from the barcode candidates exploration, could be the first step towards a mitochondrial barcoding strategy for these organisms, similar to the standard approach employed in metazoans.
Bio—Cryptography: A Possible Coding Role for RNA Redundancy

NASA Astrophysics Data System (ADS)

Regoli, M.

2009-03-01

The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions," are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behavior in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
Horizontal transfer and gene conversion as an important driving force in shaping the landscape of mitochondrial introns.

PubMed

Wu, Baojun; Hao, Weilong

2014-04-16

Group I introns are highly dynamic and mobile, featuring extensive presence-absence variation and widespread horizontal transfer. Group I introns can invade intron-lacking alleles via intron homing powered by their own encoded homing endonuclease gene (HEG) after horizontal transfer or via reverse splicing through an RNA intermediate. After successful invasion, the intron and HEG are subject to degeneration and sequential loss. It remains unclear whether these mechanisms can fully address the high dynamics and mobility of group I introns. Here, we found that HEGs undergo a fast gain-and-loss turnover comparable with introns in the yeast mitochondrial 21S-rRNA gene, which is unexpected, as the intron and HEG are generally believed to move together as a unit. We further observed extensively mosaic sequences in both the introns and HEGs, and evidence of gene conversion between HEG-containing and HEG-lacking introns. Our findings suggest horizontal transfer and gene conversion can accelerate HEG/intron degeneration and loss, or rescue and propagate HEG/introns, and ultimately result in high HEG/intron turnover rate. Given that up to 25% of the yeast mitochondrial genome is composed of introns and most mitochondrial introns are group I introns, horizontal transfer and gene conversion could have served as an important mechanism in introducing mitochondrial intron diversity, promoting intron mobility and consequently shaping mitochondrial genome architecture.
Identification of a deep intronic mutation in the COL6A2 gene by a novel custom oligonucleotide CGH array designed to explore allelic and genetic heterogeneity in collagen VI-related myopathies

PubMed Central

2010-01-01

Background Molecular characterization of collagen-VI related myopathies currently relies on standard sequencing, which yields a detection rate approximating 75-79% in Ullrich congenital muscular dystrophy (UCMD) and 60-65% in Bethlem myopathy (BM) patients as PCR-based techniques tend to miss gross genomic rearrangements as well as copy number variations (CNVs) in both the coding sequence and intronic regions. Methods We have designed a custom oligonucleotide CGH array in order to investigate the presence of CNVs in the coding and non-coding regions of COL6A1, A2, A3, A5 and A6 genes and a group of genes functionally related to collagen VI. A cohort of 12 patients with UCMD/BM negative at sequencing analysis and 2 subjects carrying a single COL6 mutation whose clinical phenotype was not explicable by inheritance were selected and the occurrence of allelic and genetic heterogeneity explored. Results A deletion within intron 1A of the COL6A2 gene, occurring in compound heterozygosity with a small deletion in exon 28, previously detected by routine sequencing, was identified in a BM patient. RNA studies showed monoallelic transcription of the COL6A2 gene, thus elucidating the functional effect of the intronic deletion. No pathogenic mutations were identified in the remaining analyzed patients, either within COL6A genes, or in genes functionally related to collagen VI. Conclusions Our custom CGH array may represent a useful complementary diagnostic tool, especially in recessive forms of the disease, when only one mutant allele is detected by standard sequencing. The intronic deletion we identified represents the first example of a pure intronic mutation in COL6A genes. PMID:20302629
A mixed group II/group III twintron in the Euglena gracilis chloroplast ribosomal protein S3 gene: evidence for intron insertion during gene evolution.

PubMed Central

Copertino, D W; Christopher, D A; Hallick, R B

1991-01-01

The splicing of a 409 nucleotide intron from the Euglena gracilis chloroplast ribosomal protein S3 gene (rps3) was examined by cDNA cloning and sequencing, and northern hybridization. Based on the characterization of a partially spliced pre-mRNA, the intron was characterized as a 'mixed' twintron, composed of a 311 nucleotide group II intron internal to a 98 nucleotide group III intron. Twintron excision is via a 2-step sequential splicing pathway, with removal of the internal group II intron preceding excision of the external group III intron. Based on secondary structural analysis of the twintron, we propose that group III introns may represent highly degenerate versions of group II introns. The existence of twintrons is interpreted as evidence that group II introns were inserted during the evolution of Euglena chloroplast genes from a common ancestor with eubacteria, archaebacteria, cyanobacteria, and other chloroplasts. Images PMID:1721702
Another heritage from the RNA world: self-excision of intron sequence from nuclear pre-tRNAs.

PubMed

Weber, U; Beier, H; Gross, H J

1996-06-15

The intervening sequences of nuclear tRNA precursors are known to be excised by tRNA splicing endonuclease. We show here that a T7 transcript corresponding to a pre-tRNA(Tyr) from Arabidopsis thaliana has a highly specific activity for autolytic intron excision. Self-cleavage occurs precisely at the authentic 3'-splice site and at the phosphodiester bond one nucleotide downstream of the authentic 5'-splice site. The reaction results in fragments with 2',3'-cyclic phosphate and 5'-OH termini. It is resistant to proteinase K and/or SDS treatment and is not inhibited by added tRNA. The self-cleavage depends on Mg2+ and is stimulated by spermine and Triton X-100. A set of sequence variants at the cleavage sites has been analysed for autolytic intron excision and, in parallel, for enzymatic in vitro splicing in wheat germ S23 extract. Single-stranded loops are a prerequisite for both reactions. Self-cleavage not only occurs at pyrimidine-A but also at U-U bonds. Since intron self-excision is only about five times slower than the enzymatic intron excision in a wheat germ S23 extract, we propose that the splicing endonuclease may function by improving the preciseness and efficiency of an inherent pre-tRNA self-cleavage activity.
The paradox of MHC-DRB exon/intron evolution: alpha-helix and beta-sheet encoding regions diverge while hypervariable intronic simple repeats coevolve with beta-sheet codons.

PubMed

Schwaiger, F W; Weyers, E; Epplen, C; Brün, J; Ruff, G; Crawford, A; Epplen, J T

1993-09-01

Twenty-one different caprine and 13 ovine MHC-DRB exon 2 sequences were determined including part of the adjacent introns containing simple repetitive (gt)n(ga)m elements. The positions for highly polymorphic DRB amino acids vary slightly among ungulates and other mammals. From man and mouse to ungulates the basic (gt)n(ga)m structure is fixed in evolution for 7 x 10(7) years whereas ample variations exist in the tandem (gt)n and (ga)m dinucleotides and especially their "degenerated" derivatives. Phylogenetic trees for the alpha-helices and beta-pleated sheets of the ungulate DRB sequences suggest different evolutionary histories. In hoofed animals as well as in humans DRB beta-sheet encoding sequences and adjacent intronic repeats can be assembled into virtually identical groups suggesting coevolution of noncoding as well as coding DNA. In contrast alpha-helices and C-terminal parts of the first DRB domain evolve distinctly. In the absence of a defined mechanism causing specific, site-directed mutations, double-recombination or gene-conversion-like events would readily explain this fact. The role of the intronic simple (gt)n(ga)m repeat is discussed with respect to these genetic exchange mechanisms during evolution.
Conformation of Tax-response elements in the human T-cell leukemia virus type I promoter.

PubMed

Cox, J M; Sloan, L S; Schepartz, A

1995-12-01

HTLV-I Tax is believed to activate viral gene expression by binding bZIP proteins (such as CREB) and increasing their affinities for proviral TRE target sites. Each 21 bp TRE target site contains an imperfect copy of the intrinsically bent CRE target site (the TRE core) surrounded by highly conserved flanking sequences. These flanking sequences are essential for maximal increases in DNA affinity and transactivation, but they are not, apparently, contacted by protein. Here we employ non-denaturing gel electrophoresis to evaluate TRE conformation in the presence and absence of bZIP proteins, and to explore the role of DNA conformation in viral transactivation. Our results show that the TRE-1 flanking sequences modulate the structure and modestly increase the affinity of a CREB bZIP peptide for the TRE-1 core recognition sequence. These flanking sequences are also essential for a maximal increase in stability of the CREB-DNA complex in the presence of Tax. The CRE-like TRE core and the TRE flanking sequences are both essential for formation of stable CREB-TRE-1 and Tax-CREB-TRE-1 complexes. These two DNA segments may have co-evolved into a unique structure capable of recognizing Tax and a bZIP protein.
Efficient Processing of the Immunodominant, HLA-A*0201-Restricted Human Immunodeficiency Virus Type 1 Cytotoxic T-Lymphocyte Epitope despite Multiple Variations in the Epitope Flanking Sequences

PubMed Central

Brander, Christian; Yang, Otto O.; Jones, Norman G.; Lee, Yun; Goulder, Philip; Johnson, R. Paul; Trocha, Alicja; Colbert, David; Hay, Christine; Buchbinder, Susan; Bergmann, Cornelia C.; Zweerink, Hans J.; Wolinsky, Steven; Blattner, William A.; Kalams, Spyros A.; Walker, Bruce D.

1999-01-01

Immune escape from cytotoxic T-lymphocyte (CTL) responses has been shown to occur not only by changes within the targeted epitope but also by changes in the flanking sequences which interfere with the processing of the immunogenic peptide. However, the frequency of such an escape mechanism has not been determined. To investigate whether naturally occurring variations in the flanking sequences of an immunodominant human immunodeficiency virus type 1 (HIV-1) Gag CTL epitope prevent antigen processing, cells infected with HIV-1 or vaccinia virus constructs encoding different patient-derived Gag sequences were tested for recognition by HLA-A*0201-restricted, p17-specific CTL. We found that the immunodominant p17 epitope (SL9) and its variants were efficiently processed from minigene expressing vectors and from six HIV-1 Gag variants expressed by recombinant vaccinia virus constructs. Furthermore, SL9-specific CTL clones derived from multiple donors efficiently inhibited virus replication when added to HLA-A*0201-bearing cells infected with primary or laboratory-adapted strains of virus, despite the variability in the SL9 flanking sequences. These data suggest that escape from this immunodominant CTL response is not frequently accomplished by changes in the epitope flanking sequences. PMID:10559335
Fine mapping and identification of a candidate gene for the barley Un8 true loose smut resistance gene.

PubMed

Zang, Wen; Eckstein, Peter E; Colin, Mark; Voth, Doug; Himmelbach, Axel; Beier, Sebastian; Stein, Nils; Scoles, Graham J; Beattie, Aaron D

2015-07-01

The candidate gene for the barley Un8 true loose smut resistance gene encodes a deduced protein containing two tandem protein kinase domains. In North America, durable resistance against all known isolates of barley true loose smut, caused by the basidiomycete pathogen Ustilago nuda (Jens.) Rostr. (U. nuda), is under the control of the Un8 resistance gene. Previous genetic studies mapped Un8 to the long arm of chromosome 5 (1HL). Here, a population of 4625 lines segregating for Un8 was used to delimit the Un8 gene to a 0.108 cM interval on chromosome arm 1HL, and assign it to fingerprinted contig 546 of the barley physical map. The minimal tilling path was identified for the Un8 locus using two flanking markers and consisted of two overlapping bacterial artificial chromosomes. One gene located close to a marker co-segregating with Un8 showed high sequence identity to a disease resistance gene containing two kinase domains. Sequence of the candidate gene from the parents of the segregating population, and in an additional 19 barley lines representing a broader spectrum of diversity, showed there was no intron in alleles present in either resistant or susceptible lines, and fifteen amino acid variations unique to the deduced protein sequence in resistant lines differentiated it from the deduced protein sequences in susceptible lines. Some of these variations were present within putative functional domains which may cause a loss of function in the deduced protein sequences within susceptible lines.
Screening of Variations in CD22 Gene in Children with B-Precursor Acute Lymphoblastic Leukemia.

PubMed

Aslar Oner, Deniz; Akin, Dilara Fatma; Sipahi, Kadir; Mumcuoglu, Mine; Ezer, Ustun; Kürekci, A Emin; Akar, Nejat

2016-09-01

CD22 is expressed on the surface of B-cell lineage cells from the early progenitor stage of pro-B cell until terminal differentiation to mature B cells. It plays a role in signal transduction and as a regulator of B-cell receptor signaling in B-cell development. We aimed to screen exons 9-14 of the CD22 gene, which is a mutational hot spot region in B-precursor acute lymphoblastic leukemia (pre-B ALL) patients, to find possible genetic variants that could play role in the pathogenesis of pre-B ALL in Turkish children. This study included 109 Turkish children with pre-B ALL who were diagnosed at Losante Hospital for Children with Leukemia. Genomic DNA was extracted from both peripheral blood and bone marrow leukocytes. Gene amplification was performed with PCR, and all samples were screened for the variants by single strand conformation polymorphism. Samples showing band shifts were sequenced on an automated sequencer. In our patient group a total of 9 variants were identified in the CD22 gene by sequencing: a novel variant in intron 10 (T2199G); a missense variant in exon 12; 5 intronic variants between exon 12 and intron 13; a novel intronic variant (C2424T); and a synonymous in exon 13. Thirteen of 109 children (11.9%) carried the T2199G novel intronic variant located in intron 10, and 17 of 109 children (15.6%) carried the C2424T novel intronic variant. Novel variants in the CD22 gene in children with pre-B ALL in Turkey that are not present, in the Human Gene Mutation Database or NCBI SNP database, were found.
The Mitochondrial Genome of Chara vulgaris: Insights into the Mitochondrial DNA Architecture of the Last Common Ancestor of Green Algae and Land PlantsW⃞

PubMed Central

Turmel, Monique; Otis, Christian; Lemieux, Claude

2003-01-01

Mitochondrial DNA (mtDNA) has undergone radical changes during the evolution of green plants, yet little is known about the dynamics of mtDNA evolution in this phylum. Land plant mtDNAs differ from the few green algal mtDNAs that have been analyzed to date by their expanded size, long spacers, and diversity of introns. We have determined the mtDNA sequence of Chara vulgaris (Charophyceae), a green alga belonging to the charophycean order (Charales) that is thought to be the most closely related alga to land plants. This 67,737-bp mtDNA sequence, displaying 68 conserved genes and 27 introns, was compared with those of three angiosperms, the bryophyte Marchantia polymorpha, the charophycean alga Chaetosphaeridium globosum (Coleochaetales), and the green alga Mesostigma viride. Despite important differences in size and intron composition, Chara mtDNA strikingly resembles Marchantia mtDNA; for instance, all except 9 of 68 conserved genes lie within blocks of colinear sequences. Overall, our genome comparisons and phylogenetic analyses provide unequivocal support for a sister-group relationship between the Charales and the land plants. Only four introns in land plant mtDNAs appear to have been inherited vertically from a charalean algar ancestor. We infer that the common ancestor of green algae and land plants harbored a tightly packed, gene-rich, and relatively intron-poor mitochondrial genome. The group II introns in this ancestral genome appear to have spread to new mtDNA sites during the evolution of bryophytes and charalean green algae, accounting for part of the intron diversity found in Chara and land plant mitochondria. PMID:12897260
The chloroplast and mitochondrial genome sequences of the charophyte Chaetosphaeridium globosum: Insights into the timing of the events that restructured organelle DNAs within the green algal lineage that led to land plants

PubMed Central

Turmel, Monique; Otis, Christian; Lemieux, Claude

2002-01-01

The land plants and their immediate green algal ancestors, the charophytes, form the Streptophyta. There is evidence that both the chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) underwent substantial changes in their architecture (intron insertions, gene losses, scrambling in gene order, and genome expansion in the case of mtDNA) during the evolution of streptophytes; however, because no charophyte organelle DNAs have been sequenced completely thus far, the suite of events that shaped streptophyte organelle genomes remains largely unknown. Here, we have determined the complete cpDNA (131,183 bp) and mtDNA (56,574 bp) sequences of the charophyte Chaetosphaeridium globosum (Coleochaetales). At the levels of gene content (124 genes), intron composition (18 introns), and gene order, Chaetosphaeridium cpDNA is remarkably similar to land-plant cpDNAs, implying that most of the features characteristic of land-plant lineages were gained during the evolution of charophytes. Although the gene content of Chaetosphaeridium mtDNA (67 genes) closely resembles that of the bryophyte Marchantia polymorpha (69 genes), this charophyte mtDNA differs substantially from its land-plant relatives at the levels of size, intron composition (11 introns), and gene order. Our finding that it shares only one intron with its land-plant counterparts supports the idea that the vast majority of mitochondrial introns in land plants appeared after the emergence of these organisms. Our results also suggest that the events accounting for the spacious intergenic spacers found in land-plant mtDNAs took place late during the evolution of charophytes or coincided with the transition from charophytes to land plants. PMID:12161560
Exon–intron organization of genes in the slime mold Physarum polycephalum

PubMed Central

Trzcinska-Danielewicz, Joanna; Fronk, Jan

2000-01-01

The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon–intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon–intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon–intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3′-ends. PMID:10982858

SURVEY AND SUMMARY: exon-intron organization of genes in the slime mold Physarum polycephalum.

PubMed

Trzcinska-Danielewicz, J; Fronk, J

2000-09-15

The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon-intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon-intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon-intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3'-ends.
Quantitation of normal CFTR mRNA in CF patients with splice-site mutations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhou, Z.; Olsen, J.C.; Silverman, L.M.

Previously we identified two mutations in introns of the CFTR gene associated with partially active splice sites and unusual clinical phenotypes. One mutation in intron 19 (3849+10 kb C to T) is common in CF patients with normal sweat chloride values; an 84 bp sequence from intron 19, which contains a stop codon, is inserted between exon 19 and exon 20 in most nasal CFTR transcripts. The other mutation in intron 14B (2789+5 G to A) is associated with elevated sweat chloride levels, but mild pulmonary disease; exon 14B (38 bp) is spliced out of most nasal CFTR transcipts. Themore » remaining CFTR cDNA sequences, other than the 84 bp insertion of exon 14B deletion, are identical to the published sequence. To correlate genotype and phenotype, we used quantitative RT-PCR to determine the levels of normally-spliced CFTR mRNA in nasal epithelia from these patients. CFTR cDNA was amplified (25 cycles) by using primers specific for normally-spliced species, {gamma}-actin cDNA was amplified as a standard.« less
The in vivo use of alternate 3'-splice sites in group I introns.

PubMed

Sellem, C H; Belcour, L

1994-04-11

Alternative splicing of group I introns has been postulated as a possible mechanism that would ensure the translation of proteins encoded into intronic open reading frames, discontinuous with the upstream exon and lacking an initiation signal. Alternate splice sites were previously depicted according to secondary structures of several group I introns. We present here strong evidence that, in the case of Podospora anserina nad 1-i4 and cox1-i7 mitochondrial introns, alternative splicing events do occur in vivo. Indeed, by PCR experiments we have detected molecules whose sequence is precisely that expected if the predicted alternate 3'-splice sites were used.
The first missense mutation of NHS gene in a Tunisian family with clinical features of NHS syndrome including cardiac anomaly

PubMed Central

Chograni, Manèl; Rejeb, Imen; Jemaa, Lamia Ben; Châabouni, Myriam; Bouhamed, Habiba Chaabouni

2011-01-01

Nance-Horan Syndrome (NHS) or X-linked cataract-dental syndrome is a disease of unknown gene action mechanism, characterized by congenital cataract, dental anomalies, dysmorphic features and, in some cases, mental retardation. We performed linkage analysis in a Tunisian family with NHS in which affected males and obligate carrier female share a common haplotype in the Xp22.32-p11.21 region that contains the NHS gene. Direct sequencing of NHS coding exons and flanking intronic sequences allowed us to identify the first missense mutation (P551S) and a reported SNP-polymorphism (L1319F) in exon 6, a reported UTR–SNP (c.7422 C>T) and a novel one (c.8239 T>A) in exon 8. Both variations P551S and c.8239 T>A segregate with NHS phenotype in this family. Although truncations, frame-shift and copy number variants have been reported in this gene, no missense mutations have been found to segregate previously. This is the first report of a missense NHS mutation causing NHS phenotype (including cardiac defects). We hypothesize also that the non-reported UTR–SNP of the exon 8 (3′-UTR) is specific to the Tunisian population. PMID:21559051
The first missense mutation of NHS gene in a Tunisian family with clinical features of NHS syndrome including cardiac anomaly.

PubMed

Chograni, Manèl; Rejeb, Imen; Jemaa, Lamia Ben; Châabouni, Myriam; Bouhamed, Habiba Chaabouni

2011-08-01

Nance-Horan Syndrome (NHS) or X-linked cataract-dental syndrome is a disease of unknown gene action mechanism, characterized by congenital cataract, dental anomalies, dysmorphic features and, in some cases, mental retardation. We performed linkage analysis in a Tunisian family with NHS in which affected males and obligate carrier female share a common haplotype in the Xp22.32-p11.21 region that contains the NHS gene. Direct sequencing of NHS coding exons and flanking intronic sequences allowed us to identify the first missense mutation (P551S) and a reported SNP-polymorphism (L1319F) in exon 6, a reported UTR-SNP (c.7422 C>T) and a novel one (c.8239 T>A) in exon 8. Both variations P551S and c.8239 T>A segregate with NHS phenotype in this family. Although truncations, frame-shift and copy number variants have been reported in this gene, no missense mutations have been found to segregate previously. This is the first report of a missense NHS mutation causing NHS phenotype (including cardiac defects). We hypothesize also that the non-reported UTR-SNP of the exon 8 (3'-UTR) is specific to the Tunisian population.
Experimental Assessment of Splicing Variants Using Expression Minigenes and Comparison with In Silico Predictions

PubMed Central

Sharma, Neeraj; Sosnay, Patrick R.; Ramalho, Anabela S.; Douville, Christopher; Franca, Arianna; Gottschalk, Laura B.; Park, Jeenah; Lee, Melissa; Vecchio-Pagan, Briana; Raraigh, Karen S.; Amaral, Margarida D.; Karchin, Rachel; Cutting, Garry R.

2015-01-01

Assessment of the functional consequences of variants near splice sites is a major challenge in the diagnostic laboratory. To address this issue, we created expression minigenes (EMGs) to determine the RNA and protein products generated by splice site variants (n = 10) implicated in cystic fibrosis (CF). Experimental results were compared with the splicing predictions of eight in silico tools. EMGs containing the full-length Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) coding sequence and flanking intron sequences generated wild-type transcript and fully processed protein in Human Embryonic Kidney (HEK293) and CF bronchial epithelial (CFBE41o-) cells. Quantification of variant induced aberrant mRNA isoforms was concordant using fragment analysis and pyrosequencing. The splicing patterns of c.1585−1G>A and c.2657+5G>A were comparable to those reported in primary cells from individuals bearing these variants. Bioinformatics predictions were consistent with experimental results for 9/10 variants (MES), 8/10 variants (NNSplice), and 7/10 variants (SSAT and Sroogle). Programs that estimate the consequences of mis-splicing predicted 11/16 (HSF and ASSEDA) and 10/16 (Fsplice and SplicePort) experimentally observed mRNA isoforms. EMGs provide a robust experimental approach for clinical interpretation of splice site variants and refinement of in silico tools. PMID:25066652
A PCR-based survey on Phytomonas (Euglenozoa: Trypanosomatidae) in phytophagous hemipterans of the Amazon region.

PubMed

Godoi, Mara M I; Serrano, Myrna G; Teixeira, Marta M G; Camargo, Erney P

2002-01-01

We have surveyed 244 hemipterans from Western Brazilian Amazĵnia for the presence of trypanosomatids and identification of members of the genus Phytomonas. Examination by phase microscopy of squashes of insect salivary glands (SG) and digestive tubes (DT) revealed that 44% (108/244) of insects from seven families harbored trypanosomatids. Infections were 5 times more frequent in Coreidae than in all other families together. Smears of SG and DT of the dissected insects were fixed on glass slides with methanol and stained with Giemsa for morphological analysis. DNA was recovered from these preparations and submitted to a PCR assay that permitted amplification of all trypanosomatid genera using primers of conserved sequences flanking a segment of the spliced leader (SL) gene. Upon PCR amplification of the recovered DNA, amplicons were hybridized with an oligonucletide probe (SL3') complementary to a SL intron sequence specific for flagellates of the genus Phytomonas. Among the trypanosomatid-positive insects, 38.8% harbored Phytomonas spp., corresponding to an overall Phytomonas prevalence of 17.1% among phytophagous bugs, their putative vectors. Since many Phytomonas are pathogenic in plants, this high prevalence in their vectors emphasizes the permanent risk of exposure to disease by native and cultured plants of the Amazon region.
Low incidence of SCN1A genetic mutation in patients with hemiconvulsion-hemiplegia-epilepsy syndrome.

PubMed

Kim, Dong Wook; Lim, Byung Chan; Kim, Ki Joong; Chae, Jong Hee; Lee, Ran; Lee, Sang Kun

2013-10-01

Genetic mutations in SCN1A account for more than two-thirds of patients with classic Dravet syndrome. A role for SCN1A genetic mutations in the development of hemiconvulsion-hemiplegia-epilepsy (HHE) syndrome was recently suggested based on the observation that HHE syndrome and classic Dravet syndrome share many clinical features. We previously identified a 2 bp-deletion mutation in SCN1A in a Dravet patient, and we found out the patient also had HHE syndrome upon clinical re-evaluation. We subsequently screened 10 additional HHE patients for SCN1A. Among the 11 patients who were diagnosed with HHE syndrome, six patients had no other etiology with the exception of prolonged febrile illness, therefore classified as idiopathic HHE syndrome, whereas five patients were classified as symptomatic HHE syndrome. Direct sequencing of all coding exons and flanking intronic sequences of the SCN1A gene was performed, but we failed to identify additional mutations in 10 patients. The patient with SCN1A mutation had the earliest onset of febrile convulsion and hemiparesis. Our study suggests that SCN1A genetic mutation is only a rare predisposing cause of HHE syndrome. Copyright © 2013 Elsevier B.V. All rights reserved.
Tissue- and case-specific retention of intron 40 in mature dystrophin mRNA.

PubMed

Nishida, Atsushi; Minegishi, Maki; Takeuchi, Atsuko; Niba, Emma Tabe Eko; Awano, Hiroyuki; Lee, Tomoko; Iijima, Kazumoto; Takeshima, Yasuhiro; Matsuo, Masafumi

2015-06-01

The dystrophin gene, which is mutated in Duchenne muscular dystrophy (DMD), comprises 79 exons that show multiple alternative splicing events. Intron retention, a type of alternative splicing, may control gene expression. We examined intron retention in dystrophin introns by reverse-transcription PCR from skeletal muscle, focusing on the nine shortest (all <1000 bp), because these are more likely to be retained. Only one, intron 40, was retained in mRNA; sequencing revealed insertion of a complete intron 40 (851 nt) between exons 40 and 41. The intron 40 retention product accounted for 1.2% of the total product but had a premature stop codon at the fifth intronic codon. Intron 40 retention was most strongly observed in the kidney (36.6%) and was not obtained from the fetal liver, lung, spleen or placenta. This indicated that intron retention is a tissue-specific event whose level varies among tissues. In two DMD patients, intron 40 retention was observed in one patient but not in the other. Examination of splicing regulatory factors revealed that intron 40 had the highest guanine-cytosine content of all examined introns in a 30-nt segment at its 3' end. Further studies are needed to clarify the biological role of intron 40-retained dystrophin mRNA.
Elements in the transcriptional regulatory region flanking herpes simplex virus type 1 oriS stimulate origin function.

PubMed

Wong, S W; Schaffer, P A

1991-05-01

Like other DNA-containing viruses, the three origins of herpes simplex virus type 1 (HSV-1) DNA replication are flanked by sequences containing transcriptional regulatory elements. In a transient plasmid replication assay, deletion of sequences comprising the transcriptional regulatory elements of ICP4 and ICP22/47, which flank oriS, resulted in a greater than 80-fold decrease in origin function compared with a plasmid, pOS-822, which retains these sequences. In an effort to identify specific cis-acting elements responsible for this effect, we conducted systematic deletion analysis of the flanking region with plasmid pOS-822 and tested the resulting mutant plasmids for origin function. Stimulation by cis-acting elements was shown to be both distance and orientation dependent, as changes in either parameter resulted in a decrease in oriS function. Additional evidence for the stimulatory effect of flanking sequences on origin function was demonstrated by replacement of these sequences with the cytomegalovirus immediate-early promoter, resulting in nearly wild-type levels of oriS function. In competition experiments, cotransfection of cells with the test plasmid, pOS-822, and increasing molar concentrations of a competitor plasmid which contained the ICP4 and ICP22/47 transcriptional regulatory regions but lacked core origin sequences resulted in a significant reduction in the replication efficiency of pOS-822, demonstrating that factors which bind specifically to the oriS-flanking sequences are likely involved as auxiliary proteins in oriS function. Together, these studies demonstrate that trans-acting factors and the sites to which they bind play a critical role in the efficiency of HSV-1 DNA replication from oriS in transient-replication assays.
Detection of KIT Genotype in Pigs by TaqMan MGB Real-Time Quantitative Polymerase Chain Reaction.

PubMed

Li, Xiuxiu; Li, Xiaoning; Luo, Rongrong; Wang, Wenwen; Wang, Tao; Tang, Hui

2018-05-01

The dominant white phenotype in domestic pigs is caused by two mutations in the KIT gene: a 450 kb duplication containing the entire KIT gene together with flanking sequences and one splice mutation with a G:A substitution in intron 17. The purpose of this study was to establish a simple, rapid method to determine KIT genotype in pigs. First, to detect KIT copy number variation (CNV), primers for exon 2 of the KIT gene, along with a TaqMan minor groove binder (MGB) probe, were designed. The single-copy gene, estrogen receptor (ESR), was used as an internal control. A real-time fluorescence-based quantitative PCR (FQ-PCR) protocol was developed to accurately detect KIT CNVs. Second, to detect the splice mutation ratio of the G:A substitution in intron 17, a 175 bp region, including the target mutation, was amplified from genomic DNA. Based on the sequence of the resulting amplified fragment, an MGB probe set was designed to detect the ratio of splice mutation to normal using FQ-PCR. A series of parallel amplification curves with the same internal distances were obtained using gradually diluted DNA as templates. The CT values among dilutions were significantly different (p < 0.001) and the coefficients of variation from each dilution were low (from 0.13% to 0.26%). The amplification efficiencies for KIT and ESR were approximately equal, indicating ESR was an appropriate control gene. Furthermore, use of the MGB probe set resulted in detection of the target mutation at a high resolution and stability; standard curves illustrated that the amplification efficiencies of KIT1 (G) and KIT2 (A) were approximately equal (98.8% and 97.2%). In conclusion, a simple, rapid method, with high specificity and stability, for the detection of the KIT genotype in pigs was established using TaqMan MGB probe real-time quantitative PCR.
Genetic basis of human complement C8[beta] deficiency

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kaufmann, T.; Rittner, C.; Schneider, P.M.

1993-06-01

The eighth component of human complement (c8) is a serum protein consisting of three chains ([alpha], [beta], and [gamma]) and encoded by three different genes, C8A, C8B, and C8G. C8A and C8B are closely linked on chromosome 1p, whereas C8G is located on chromosome 9q. In the serum the [beta] subunit is non-covalently bound to the disulfide-linked [alpha]-[gamma] subunit. Patients with C8[beta] deficiency suffer from recurrent neisserial infections such as meningitis. Exon-specific polymerase chain reaction (PCR) amplification with primer pairs from the flanking intron sequences was used to amplify all 12 C8B exons separately. No difference regarding the exon sizesmore » was observed in a C8[beta]-deficient patient compared with a normal person. Therefore, direct sequence analysis of all exon-specific PCR products from normal and C8[beta]-deficient individuals was carried out. As a cause for C8[beta] deficiency, we found a single C-T exchange in exon 9 leading to a stop codon. An allele-specific PCR system was designed to detect the normal and the deficiency allele simultaneously. Using this approach as well as PCR typing of the Taql polymorphism located in intron 11, five families with 7 C8[beta]-deficient members were investigated. The mutation was not found to be restricted to one of the two Taql RFLP alleles. The mutant allele was observed in all families investigated and can therefore be regarded as a major cause of C8[beta] deficiency in the Caucasian population. In addition, two C8[beta]-deficient patients were found to be heterozygous for the C-T exchange. The molecular basis of the alleles without this point mutation also causing deficiency has not yet been defined. 23 refs., 4 figs., 3 tabs.« less
Diversity in the glucose transporter-4 gene (SLC2A4) in humans reflects the action of natural selection along the old-world primates evolution.

PubMed

Tarazona-Santos, Eduardo; Fabbri, Cristina; Yeager, Meredith; Magalhaes, Wagner C; Burdett, Laurie; Crenshaw, Andrew; Pettener, Davide; Chanock, Stephen J

2010-03-23

Glucose is an important source of energy for living organisms. In vertebrates it is ingested with the diet and transported into the cells by conserved mechanisms and molecules, such as the trans-membrane Glucose Transporters (GLUTs). Members of this family have tissue specific expression, biochemical properties and physiologic functions that together regulate glucose levels and distribution. GLUT4 -coded by SLC2A4 (17p13) is an insulin-sensitive transporter with a critical role in glucose homeostasis and diabetes pathogenesis, preferentially expressed in the adipose tissue, heart muscle and skeletal muscle. We tested the hypothesis that natural selection acted on SLC2A4. We re-sequenced SLC2A4 and genotyped 104 SNPs along a approximately 1 Mb region flanking this gene in 102 ethnically diverse individuals. Across the studied populations (African, European, Asian and Latin-American), all the eight common SNPs are concentrated in the N-terminal region upstream of exon 7 ( approximately 3700 bp), while the C-terminal region downstream of intron 6 ( approximately 2600 bp) harbors only 6 singletons, a pattern that is not compatible with neutrality for this part of the gene. Tests of neutrality based on comparative genomics suggest that: (1) episodes of natural selection (likely a selective sweep) predating the coalescent of human lineages, within the last 25 million years, account for the observed reduced diversity downstream of intron 6 and, (2) the target of natural selection may not be in the SLC2A4 coding sequence. We propose that the contrast in the pattern of genetic variation between the N-terminal and C-terminal regions are signatures of the action of natural selection and thus follow-up studies should investigate the functional importance of different regions of the SLC2A4 gene.
Characterization and mapping of the human rhodopsin kinase gene and screening of the gene for mutations in patients with retinitis pigmentosa

DOE Office of Scientific and Technical Information (OSTI.GOV)

Khani, S.C.; Lin, D.; Magovcevic, I.

1994-09-01

Rhodopsin kinase (RK) is a cytosolic enzyme in rod photoreceptors that initiates the deactivation of the phototransductions cascade by phosphorylating photoactivated rhodopsin. Although the cDNA sequence of bovine RK has been determined previously, no human cDNA or genomic sequence has thus far been available for genetic studies. In order to investigate the possible role of this candidate gene in retinitis pigmentosa (RP) and allied diseases, we have isolated and characterized human cDNA and genomic clones derived from the RK locus. The coding sequence of the human gene is 1692 nucleotides in length and is split into seven exons. The humanmore » and the bovine sequence show 84% identity at the nucleotide level and 92% identity at the amino acid level. Thus far, the intronic sequences flanking each exon except for one have been determined. We have also mapped the human RK gene to chromosome 13q34 using fluorescence in situ hybridization. To our knowledge, no RP gene has as yet been linked to this region. However, since the substrate for RK (rhodopsin) and other members of the phototransduction cascade have been implicated in the pathogenesis of RP, it is conceivable that defects in RK can also cause some forms of this disease. We are evaluating this possibility by screening DNA from 173 patients with autosomal recessive RP and 190 patients with autosomal dominant RP. So far, we have found 11 patients with variant bands. In one patient with autosomal dominant RP we discovered the missense change Ser536Leu. Cosegregation studies and further sequencing of the variant bands are currently underway.« less
Cloning and expression of a nuclear encoded plastid specific 33 kDa ribonucleoprotein gene (33RNP) from pea that is light stimulated.

PubMed

Reddy, M K; Nair, S; Singh, B N; Mudgil, Y; Tewari, K K; Sopory, S K

2001-01-24

We report the cloning and sequencing of both cDNA and genomic DNA of a 33 kDa chloroplast ribonucleoprotein (33RNP) from pea. The analysis of the predicted amino acid sequence of the cDNA clone revealed that the encoded protein contains two RNA binding domains, including the conserved consensus ribonucleoprotein sequences CS-RNP1 and CS-RNP2, on the C-terminus half and the presence of a putative transit peptide sequence in the N-terminus region. The phylogenetic and multiple sequence alignment analysis of pea chloroplast RNP along with RNPs reported from the other plant sources revealed that the pea 33RNP is very closely related to Nicotiana sylvestris 31RNP and 28RNP and also to 31RNP and 28RNP of Arabidopsis and spinach, respectively. The pea 33RNP was expressed in Escherichia coli and purified to homogeneity. The in vitro import of precursor protein into chloroplasts confirmed that the N-terminus putative transit peptide is a bona fide transit peptide and 33RNP is localized in the chloroplast. The nucleic acid-binding properties of the recombinant protein, as revealed by South-Western analysis, showed that 33RNP has higher binding affinity for poly (U) and oligo dT than for ssDNA and dsDNA. The steady state transcript level was higher in leaves than in roots and the expression of this gene is light stimulated. Sequence analysis of the genomic clone revealed that the gene contains four exons and three introns. We have also isolated and analyzed the 5' flanking region of the pea 33RNP gene.
Isolation and identification of gene-specific microRNAs.

PubMed

Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

2006-01-01

Prediction of microRNA (miRNA) candidates using computer programming has identified hundreds and hundreds of genomic hairpin sequences, of which, the functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene-silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem, and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. By insertion of a hairpin-like pre-miRNA structure into the intron region of a gene, this intronic miRNA biogenesis system has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA-expressing system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafish, chicken embryos, and adult mice. Based on the strand complementarity between the designed miRNA and its target gene sequence, we have also developed a miRNA isolation protocol to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proof- of-principle method, we now have the knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing system.
Isolation and identification of gene-specific microRNAs.

PubMed

Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

2013-01-01

Computer programming has identified hundreds of genomic hairpin sequences, many with functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA generation system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafishes, chicken embryos, and adult mice. We have also developed an miRNA isolation protocol, based on the complementarity between the designed miRNA and its target gene sequence, to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proven-of-principle method, we now have full knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing systems.
The complete chloroplast DNA sequence of Eleutherococcus senticosus (Araliaceae); comparative evolutionary analyses with other three asterids.

PubMed

Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong

2012-05-01

This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or intron regions. These medium size repeats may contribute to developing a cp genome-specific gene introduction vector because the region may use for specific recombination sites.
Organellar maturases: A window into the evolution of the spliceosome.

PubMed

Schmitz-Linneweber, Christian; Lampe, Marie-Kristin; Sultan, Laure D; Ostersetzer-Biran, Oren

2015-09-01

During the evolution of eukaryotic genomes, many genes have been interrupted by intervening sequences (introns) that must be removed post-transcriptionally from RNA precursors to form mRNAs ready for translation. The origin of nuclear introns is still under debate, but one hypothesis is that the spliceosome and the intron-exon structure of genes have evolved from bacterial-type group II introns that invaded the eukaryotic genomes. The group II introns were most likely introduced into the eukaryotic genome from an α-proteobacterial predecessor of mitochondria early during the endosymbiosis event. These self-splicing and mobile introns spread through the eukaryotic genome and later degenerated. Pieces of introns became part of the general splicing machinery we know today as the spliceosome. In addition, group II introns likely brought intron maturases with them to the nucleus. Maturases are found in most bacterial introns, where they act as highly specific splicing factors for group II introns. In the spliceosome, the core protein Prp8 shows homology to group II intron-encoded maturases. While maturases are entirely intron specific, their descendant of the spliceosomal machinery, the Prp8 protein, is an extremely versatile splicing factor with multiple interacting proteins and RNAs. How could such a general player in spliceosomal splicing evolve from the monospecific bacterial maturases? Analysis of the organellar splicing machinery in plants may give clues on the evolution of nuclear splicing. Plants encode various proteins which are closely related to bacterial maturases. The organellar genomes contain one maturase each, named MatK in chloroplasts and MatR in mitochondria. In addition, several maturase genes have been found in the nucleus as well, which are acting on mitochondrial pre-RNAs. All plant maturases show sequence deviation from their progenitor bacterial maturases, and interestingly are all acting on multiple organellar group II intron targets. Moreover, they seem to function in the splicing of group II introns together with a number of additional nuclear-encoded splicing factors, possibly acting as an organellar proto-spliceosome. Together, this makes them interesting models for the early evolution of nuclear spliceosomal splicing. In this review, we summarize recent advances in our understanding of the role of plant maturases and their accessory factors in plants. This article is part of a Special Issue entitled: Chloroplast Biogenesis. Copyright © 2015 Elsevier B.V. All rights reserved.
High-throughput sequencing of the entire genomic regions of CCM1/KRIT1, CCM2 and CCM3/PDCD10 to search for pathogenic deep-intronic splice mutations in cerebral cavernous malformations.

PubMed

Rath, Matthias; Jenssen, Sönke E; Schwefel, Konrad; Spiegler, Stefanie; Kleimeier, Dana; Sperling, Christian; Kaderali, Lars; Felbor, Ute

2017-09-01

Cerebral cavernous malformations (CCM) are vascular lesions of the central nervous system that can cause headaches, seizures and hemorrhagic stroke. Disease-associated mutations have been identified in three genes: CCM1/KRIT1, CCM2 and CCM3/PDCD10. The precise proportion of deep-intronic variants in these genes and their clinical relevance is yet unknown. Here, a long-range PCR (LR-PCR) approach for target enrichment of the entire genomic regions of the three genes was combined with next generation sequencing (NGS) to screen for coding and non-coding variants. NGS detected all six CCM1/KRIT1, two CCM2 and four CCM3/PDCD10 mutations that had previously been identified by Sanger sequencing. Two of the pathogenic variants presented here are novel. Additionally, 20 stringently selected CCM index cases that had remained mutation-negative after conventional sequencing and exclusion of copy number variations were screened for deep-intronic mutations. The combination of bioinformatics filtering and transcript analyses did not reveal any deep-intronic splice mutations in these cases. Our results demonstrate that target enrichment by LR-PCR combined with NGS can be used for a comprehensive analysis of the entire genomic regions of the CCM genes in a research context. However, its clinical utility is limited as deep-intronic splice mutations in CCM1/KRIT1, CCM2 and CCM3/PDCD10 seem to be rather rare. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

Remarkable interkingdom conservation of intron positions and massive, lineage-specific intron loss and gain in eukaryotic evolution.

PubMed

Rogozin, Igor B; Wolf, Yuri I; Sorokin, Alexander V; Mirkin, Boris G; Koonin, Eugene V

2003-09-02

Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.
ExDom: an integrated database for comparative analysis of the exon–intron structures of protein domains in eukaryotes

PubMed Central

Bhasi, Ashwini; Philip, Philge; Manikandan, Vinu; Senapathy, Periannan

2009-01-01

We have developed ExDom, a unique database for the comparative analysis of the exon–intron structures of 96 680 protein domains from seven eukaryotic organisms (Homo sapiens, Mus musculus, Bos taurus, Rattus norvegicus, Danio rerio, Gallus gallus and Arabidopsis thaliana). ExDom provides integrated access to exon-domain data through a sophisticated web interface which has the following analytical capabilities: (i) intergenomic and intragenomic comparative analysis of exon–intron structure of domains; (ii) color-coded graphical display of the domain architecture of proteins correlated with their corresponding exon-intron structures; (iii) graphical analysis of multiple sequence alignments of amino acid and coding nucleotide sequences of homologous protein domains from seven organisms; (iv) comparative graphical display of exon distributions within the tertiary structures of protein domains; and (v) visualization of exon–intron structures of alternative transcripts of a gene correlated to variations in the domain architecture of corresponding protein isoforms. These novel analytical features are highly suited for detailed investigations on the exon–intron structure of domains and make ExDom a powerful tool for exploring several key questions concerning the function, origin and evolution of genes and proteins. ExDom database is freely accessible at: http://66.170.16.154/ExDom/. PMID:18984624
Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

PubMed

Bergman, C M; Kreitman, M

2001-08-01

Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.
An RNAi-Enhanced Logic Circuit for Cancer Specific Detection and Destruction

DTIC Science & Technology

2013-02-01

monomeric protein secreted by Corynebacterium diphtheriae, and pro-apoptotic members of Bcl-2 family: mBax (Mus musculus), hBax ( Homo sapiens ), and its...Gata3 mStaple. Intron- feature sequences – donor site, branch point, poly- pyrimidine tract, and acceptor site – were selected based on previously...sequences found in literature our intron features were chosen according SplicePort [4], an online analyzer that detects the likelihood of splicing to
[Detection of factor VIII intron 1 inversion in severe haemophilia A].

PubMed

Liang, Yan; Yan, Zhen-yu; Yan, Mei; Hua, Bao-lai; Xiao, Bai; Zhao, Yong-qiang; Liu, Jing-zhong

2009-06-01

Screening the intron 1 inversion of factor VIII (FVIII) in the population of severe haemophilia A(HA) in China and performing carrier detection and prenatal diagnosis. Using LD-PCR to detect intron 22 inversions and multiple-PCR within two tubes to intron 1 inversions in severe HA patients. Carrier detection and prenatal diagnosis were performed in affected families. Linkage analysis and DNA sequencing were used to verify these tests. One hundred and eighteen patients were seven diagnosed as intron 22 inversions and 7 were intron 1 inversions out of 247 severe HA patients. The prevalence of the intron 1 inversion in Chinese severe haemophilia A patients was 2.8% (7/247). Six women from family A and 2 from family B were diagnosed as carriers. One fetus from family A was affected fetus. Intron 1 inversion could be detected directly by multiple-PCR within two tubes. This method made the strategy more perfective in carrier and prenatal diagnosis of haemophilia A.
Splicing of a group II intron involved in the conjugative transfer of pRS01 in lactococci.

PubMed

Mills, D A; McKay, L L; Dunny, G M

1996-06-01

Analysis of a region involved in the conjugative transfer of the lactococcal conjugative element pRS01 has revealed a bacteria] group II intron. Splicing of this lactococcal intron (designated Ll.ltrB) in vivo resulted in the ligation of two exon messages (ltrBE1 and ltrBE2) which encoded a putative conjugative relaxase essential for the transfer of pRS01. Like many group II introns, the Ll.ltrB intron possessed an open reading frame (ltrA) with homology to reverse transcriptases. Remarkably, sequence analysis of ltrA suggested a greater similarity to open reading frames encoded by eukaryotic mitochondrial group II introns than to those identified to date from other bacteria. Several insertional mutations within ltrA resulted in plasmids exhibiting a conjugative transfer-deficient phenotype. These results provide the first direct evidence for splicing of a prokaryotic group II intron in vivo and suggest that conjugative transfer is a mechanism for group II intron dissemination in bacteria.
Transposition of an intron in yeast mitochondria requires a protein encoded by that intron.

PubMed

Macreadie, I G; Scott, R M; Zinn, A R; Butow, R A

1985-06-01

The optional 1143 bp intron in the yeast mitochondrial 21S rRNA gene (omega +) is nearly quantitatively inserted in genetic crosses into 21S rRNA alleles that lack it (omega -). The intron contains an open reading frame that can encode a protein of 235 amino acids, but no function has been ascribed to this sequence. We previously found an in vivo double-strand break in omega - DNA at or close to the intron insertion site only in zygotes of omega + X omega - crosses that appears with the same kinetics as intron insertion. We now show that mutations in the intron open reading frame that would alter the translation product simultaneously inhibit nonreciprocal omega recombination and the in vivo double-strand break in omega - DNA. These results provide evidence that the open reading frame encodes a protein required for intron transposition and support the role of the double-strand break in the process.
Intron open reading frames as mobile elements and evolution of a group I intron.

PubMed

Sellem, C H; Belcour, L

1997-05-01

Group I introns are proposed to have become mobile following the acquisition of open reading frames (ORFs) that encode highly specific DNA endonucleases. This proposal implies that intron ORFs could behave as autonomously mobile entities. This was supported by abundant circumstantial evidence but no experiment of ORF transfer from an ORF-containing intron to its ORF-less counterpart has been described. In this paper we present such experiments, which demonstrate the efficient mobility of the mitochondrial nad1-i4-orf1 between two Podospora strains. The homing of this mobile ORF was accompanied by a bidirectional co-conversion that did not systematically involve the whole intron sequence. Orf1 acquisition would be the most recent step in the evolution of the nad1-i4 intron, which has resulted in many strains of Podospora having an intron with two ORFs (biorfic) and four splicing pathways. We show that two of the splicing events that operate in this biorfic intron, as evidenced by PCR experiments, are generated by a 5'-alternative splice site, which is most probably a remnant of the monoorfic ancestral form of the intron. We propose a sequential evolution model that is consistent with the four organizations of the corresponding nad1 locus that we found among various species of the Pyrenomycete family; these organizations consist of no intron, an intron alone, a monoorfic intron, and a biorfic intron.
Woot, an Active Gypsy-Class Retrotransposon in the Flour Beetle, Tribolium Castaneum, Is Associated with a Recent Mutation

PubMed Central

Beeman, R. W.; Thomson, M. S.; Clark, J. M.; DeCamillis, M. A.; Brown, S. J.; Denell, R. E.

1996-01-01

A recently isolated, lethal mutation of the homeotic Abdominal gene of the red flour beetle Tribolium castaneum is associated with an insertion of a novel retrotransposon into an intron. Sequence analysis indicates that this retrotransposon, named Woot, is a member of the gypsy family of mobile elements. Most strains of T. castaneum appear to harbor ~25-35 copies of Woot per genome. Woot is composed of long terminal repeats of unprecedented length (3.6 kb each), flanking an internal coding region 5.0 kb in length. For most copies of Woot, the internal region includes two open reading frames (ORFs) that correspond to the gag and pol genes of previously described retrotransposons and retroviruses. The copy of Woot inserted into Abdominal bears an apparent single frameshift mutation that separates the normal second ORF into two. Woot does not appear to generate infectious virions by the criterion that no envelop gene is discernible. The association of Woot with a recent mutation suggests that this retroelement is currently transpositionally active in at least some strains. PMID:8722793
Genetic variations in NADPH-CYP450 oxidoreductase in a Czech Slavic cohort

PubMed Central

Tomková, Mária; Panda, Satya Prakash; Šeda, Ondřej; Baxová, Alice; Hůlková, Martina; Masters, Bettie Sue Siler; Martásek, Pavel

2015-01-01

Background Gene polymorphisms encoding the enzyme NADPH–cytochrome P450 oxidoreductase (POR) contribute to inter-individual differences in drug response. Aim To estimate polymorphic allele frequencies of the POR gene in a Czech Slavic population. Materials & Methods The gene POR was analyzed in 322 Czech Slavic individuals from a control cohort by sequencing and HRM analysis. Results Twenty-five SNP genetic variations were identified. Of these variants, 7 were new, unreported SNPs, including two SNPs in the 5´flanking region (g.4965 C>T and g.4994 G>T), one intronic variant (c.1899 −20C>T), one synonymous SNP (p.20Ala=) and three nonsynonymous SNPs (p.Thr29Ser, p.Pro384Leu and p.Thr529Met). The p.Pro384Leu variant exhibited reduced enzymatic activities compared to wild type. Conclusion New POR variant identification indicates that the number of uncommon variants might be specific for each subpopulation being investigated, particularly germane to the singular role that POR plays in providing reducing equivalents to all CYPs in the endoplasmic reticulum. PMID:25712184
Short intronic repeat sequences facilitate circular RNA production

PubMed Central

Liang, Dongming

2014-01-01

Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
Isolation and Identification of Gene-Specific MicroRNAs.

PubMed

Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

2018-01-01

Computer programming has identified hundreds of genomic hairpin sequences, many with functions yet to be determined. Because transfection of hairpin-like microRNA precursors (pre-miRNAs) into mammalian cells is not always sufficient to trigger RNA-induced gene silencing complex (RISC) assembly, a key step for inducing RNA interference (RNAi)-related gene silencing, we have developed an intronic miRNA expression system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene, and hence successfully increase the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis mechanism has been found to depend on a coupled interaction of nascent messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA so obtained is transcribed by type-II RNA polymerases, coexpressed within a primary gene transcript, and then excised out of the gene transcript by intracellular RNA splicing and processing machineries. After that, ribonuclease III (RNaseIII) endonucleases further process the spliced introns into mature miRNAs. Using this intronic miRNA expression system, we have shown for the first time that the intron-derived miRNAs are able to elicit strong RNAi effects in not only human and mouse cells in vitro but also in zebrafishes, chicken embryos, and adult mice in vivo. We have also developed a miRNA isolation protocol, based on the complementarity between the designed miRNA and its targeted gene sequence, to purify and identify the mature miRNAs generated. As a result, several intronic miRNA identities and structures have been confirmed. According to this proof-of-principle methodology, we now have full knowledge to design various intronic pre-miRNA inserts that are more efficient and effective for inducing specific gene silencing effects in vitro and in vivo.
A novel frameshift deletion in the albumin gene causes analbuminemia in a young Turkish woman.

PubMed

Dagnino, Monica; Caridi, Gianluca; Aydin, Zeki; Ozturk, Savas; Karaali, Zeynep; Kazancioglu, Rumeyza; Cefle, Kivanc; Gursu, Meltem; Campagnoli, Monica; Galliano, Monica; Minchiotti, Lorenzo

2010-11-11

Analbuminemia is a rare autosomal recessive disorder manifested by the absence, or severe reduction, of circulating serum albumin. The analbuminemic trait was diagnosed in a young Turkish woman on the basis of her clinical symptoms (bilateral lower limb edema) and biochemical findings (minimal albumin amount and variable increases in other protein fractions). Total DNA from the analbuminemic proband and her parents was PCR-amplified using oligonucleotide primers designed to amplify the 14 exons of the albumin gene (ALB) and the flanking intron regions. The products were screened for mutations by single-strand conformation polymorphism (SSCP) and heteroduplex analyses (HA). HA allowed the identification of the mutation site in exon 12. Direct DNA sequencing of this abnormal fragment revealed that the analbuminemic trait was caused by a homozygous CA deletion at nucleotide positions c. 1614-1615 in the codons for Cys538 and Thr539. The subsequent frameshift should give rise to a putative truncated albumin variant in which the sequence Cys(538)-Thr-Leu-Ser has been changed to Cys(538)-Thr-Phe-Stop. The parents were heterozygous for the same mutation. Gel-based mutation detection and DNA sequencing substantiate the clinical diagnosis of congenital analbuminemia in our patient and show that the condition is caused by a novel mutation within the ALB gene. These results contribute to shed light on the molecular basis of this rare condition. 2010 Elsevier B.V. All rights reserved.
Structural organization of the porcine and human genes coding for a leydig cell-specific insulin-like peptide (LEY I-L) and chromosomal localization of the human gene (INSL3)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Burkhardt E.; Adham, I.M.; Brosig, B.

1994-03-01

Leydig insulin-like protein (LEY I-L) is a member of the insulin-like hormone superfamily. The LEY I-L gene (designated INSL3) is expressed exclusively in prenatal and postnatal Leydig cells. The authors report here the cloning and nucleotide sequence of porcine and human LEY I-L genes including the 5[prime] regions. Both genes consist of two exons and one intron. The organization of the LEY I-L gene is similar to that of insulin and relaxin. The transcription start site in the porcine and human LEY I-L gene is localized 13 and 14 bp upstream of the translation start site, respectively. Alignment of themore » 5[prime] flanking regions of both genes reveals that the first 107 nucleotides upstream of the transcription start site exhibit an overall sequence similarity of 80%. This conserved region contains a consensus TATAA box, a CAAT-like element (GAAT), and a consensus SP1 sequence (GGGCGG) at equivalent positions in both genes and therefore may play a role in regulation of expression of the LEY I-L gene. The porcine and human genome contains a single copy of the LEY I-L gene. By in situ hybridization, the human gene was assigned to bands p13.2-p12 of the short arm of chromosome 19. 25 refs., 6 figs.« less
Genomic cloning and promoter functional analysis of myostatin-2 in shi drum, Umbrina cirrosa: conservation of muscle-specific promoter activity.

PubMed

Nadjar-Boger, Elisabeth; Maccatrozzo, Lisa; Radaelli, Giuseppe; Funkenstein, Bruria

2013-02-01

Myostatin (MSTN) is a member of the transforming growth factor-ß superfamily, known as a negative regulator of skeletal muscle development and growth in mammals. In contrast to mammals, fish possess at least two paralogs of MSTN: MSTN-1 and MSTN-2. Here we describe the cloning and sequence analysis of spliced and precursor (unspliced) transcripts as well as the 5' flanking region of MSTN-2 from the marine fish Umbrina cirrosa (ucMSTN-2). In silico analysis revealed numerous putative cis regulatory elements including several E-boxes known as binding sites to myogenic transcription factors. Transient transfection experiments using non-muscle and muscle cell lines showed high transcriptional activity in muscle cells and in differentiated neural cells, in accordance with our previous findings in MSTN-2 promoter from Sparus aurata. Comparative informatics analysis of MSTN-2 from several fish species revealed high conservation of the predicted amino acid sequence as well as the gene structure (exon length) although intron length varied between species. The proximal promoter of MSTN-2 gene was found to be conserved among Perciforms. In conclusion, this study reinforces our conclusion that MSTN-2 promoter is a very strong promoter, especially in muscle cells. In addition, we show that the MSTN-2 gene structure is highly conserved among fishes as is the predicted amino acid sequence of the peptide. Copyright © 2012 Elsevier Inc. All rights reserved.
Development and application of microsatellites in candidate genes related to wood properties in the Chinese white poplar (Populus tomentosa Carr.).

PubMed

Du, Qingzhang; Gong, Chenrui; Pan, Wei; Zhang, Deqiang

2013-02-01

Gene-derived simple sequence repeats (genic SSRs), also known as functional markers, are often preferred over random genomic markers because they represent variation in gene coding and/or regulatory regions. We characterized 544 genic SSR loci derived from 138 candidate genes involved in wood formation, distributed throughout the genome of Populus tomentosa, a key ecological and cultivated wood production species. Of these SSRs, three-quarters were located in the promoter or intron regions, and dinucleotide (59.7%) and trinucleotide repeat motifs (26.5%) predominated. By screening 15 wild P. tomentosa ecotypes, we identified 188 polymorphic genic SSRs with 861 alleles, 2-7 alleles for each marker. Transferability analysis of 30 random genic SSRs, testing whether these SSRs work in 26 genotypes of five genus Populus sections (outgroup, Salix matsudana), showed that 72% of the SSRs could be amplified in Turanga and 100% could be amplified in Leuce. Based on genotyping of these 26 genotypes, a neighbour-joining analysis showed the expected six phylogenetic groupings. In silico analysis of SSR variation in 220 sequences that are homologous between P. tomentosa and Populus trichocarpa suggested that genic SSR variations between relatives were predominantly affected by repeat motif variations or flanking sequence mutations. Inheritance tests and single-marker associations demonstrated the power of genic SSRs in family-based linkage mapping and candidate gene-based association studies, as well as marker-assisted selection and comparative genomic studies of P. tomentosa and related species.
Geographical origin of Leucobryum boninense Sull. & Lesq. (Leucobryaceae, Musci) endemic to the Bonin Islands, Japan

PubMed Central

Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki

2013-01-01

Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5′ trnK intron, matK, partial 3′ trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense made a clade with L. scabrum from Japan, Taiwan and, Hong Kong; L. javense which is widely distributed in East and Southeast Asia, and L. pachyphyllum and L. seemannii restricted to the Hawaii Islands, as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands made a monophylic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. A percentage of the parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species. PMID:23610621
Geographical origin of Leucobryum boninense Sull. & Lesq. (Leucobryaceae, Musci) endemic to the Bonin Islands, Japan.

PubMed

Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki

2013-04-01

Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5' trnK intron, matK, partial 3' trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense made a clade with L. scabrum from Japan, Taiwan and, Hong Kong; L. javense which is widely distributed in East and Southeast Asia, and L. pachyphyllum and L. seemannii restricted to the Hawaii Islands, as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands made a monophylic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. A percentage of the parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species.
[Frequency of intron 1 inversion of factor VIII gene in Chinese hemophilia A patients with case report of a female patient with heterozygous intron 1 inversion].

PubMed

Yan, Zhen-yu; Liang, Yan; Yan, Mei; Fan, Lian-kai; Xiao, Bai; Hua, Bao-lai; Liu, Jing-zhong; Zhao, Yong-qiang

2008-10-21

To investigate the frequency of intron 1 inversion (inv1) in FVIII gene in Chinese hemophilia A (HA) patients and to investigate the mechanism of pathogenesis. Peripheral blood samples were collected from 158 unrelated HA patients, aged 20 (1 - 73), including one female HA patient, aged 5, and several family members of a patient positive in inv1. One-stage method was used to assay the FVIII activity (FVIII:C). Long distance PCR and multiple PCR in duplex reactions were used to screen for the intron 22 inversion (inv22) and inv1 of the FVIII coding gene (F8). The F8 coding sequence was amplified with PCR and sequenced with an automatic sequencer. Two unrelated patients (pedigrees) were detected as inv1 positive with a positive rate of 1.26%. A rare female HA patient with inv1 was also discovered in a positive family (3 HA cases were found in this family and regarded as one case in calculating the total detection rate). The full length of FVIII was sequenced, and no other mutation was detected. There frequency of FVIII inv1 is low in Chinese HA patients compared with other populations. Female HA patients are heterozygous for FVIII inv1 and that may be resulted from nonrandom inactivation of X chromosome.
Molecular Phylogenetic Analysis of Archaeal Intron-Containing Genes Coding for rRNA Obtained from a Deep-Subsurface Geothermal Water Pool

PubMed Central

Takai, Ken; Horikoshi, Koki

1999-01-01

Molecular phylogenetic analysis of a naturally occurring microbial community in a deep-subsurface geothermal environment indicated that the phylogenetic diversity of the microbial population in the environment was extremely limited and that only hyperthermophilic archaeal members closely related to Pyrobaculum were present. All archaeal ribosomal DNA sequences contained intron-like sequences, some of which had open reading frames with repeated homing-endonuclease motifs. The sequence similarity analysis and the phylogenetic analysis of these homing endonucleases suggested the possible phylogenetic relationship among archaeal rRNA-encoded homing endonucleases. PMID:10584021

New encoded single-indicator sequences based on physico-chemical parameters for efficient exon identification.

PubMed

Meher, J K; Meher, P K; Dash, G N; Raval, M K

2012-01-01

The first step in gene identification problem based on genomic signal processing is to convert character strings into numerical sequences. These numerical sequences are then analysed spectrally or using digital filtering techniques for the period-3 peaks, which are present in exons (coding areas) and absent in introns (non-coding areas). In this paper, we have shown that single-indicator sequences can be generated by encoding schemes based on physico-chemical properties. Two new methods are proposed for generating single-indicator sequences based on hydration energy and dipole moments. The proposed methods produce high peak at exon locations and effectively suppress false exons (intron regions having greater peak than exon regions) resulting in high discriminating factor, sensitivity and specificity.
Developing a set of strong intronic promoters for robust metabolic engineering in oleaginous Rhodotorula (Rhodosporidium) yeast species.

PubMed

Liu, Yanbin; Yap, Sihui Amy; Koh, Chong Mei John; Ji, Lianghui

2016-11-25

Red yeast species in the Rhodotorula/Rhodosporidium genus are outstanding producers of triacylglyceride and cell biomass. Metabolic engineering is expected to further enhance the productivity and versatility of these hosts for the production of biobased chemicals and fuels. Promoters with strong activity during oil-accumulation stage are critical tools for metabolic engineering of these oleaginous yeasts. The upstream DNA sequences of 6 genes involved in lipid biosynthesis or accumulation in Rhodotorula toruloides were studied by luciferase reporter assay. The promoter of perilipin/lipid droplet protein 1 gene (LDP1) displayed much stronger activity (4-11 folds) than that of glyceraldehyde-3-phosphate dehydrogenase gene (GPD1), one of the strongest promoters known in yeasts. Depending on the stage of cultivation, promoter of acetyl-CoA carboxylase gene (ACC1) and fatty acid synthase β subunit gene (FAS1) exhibited intermediate strength, displaying 50-160 and 20-90% levels of GPD1 promoter, respectively. Interestingly, introns significantly modulated promoter strength at high frequency. The incorporation of intron 1 and 2 of LDP1 (LDP1in promoter) enhanced its promoter activity by 1.6-3.0 folds. Similarly, the strength of ACC1 promoter was enhanced by 1.5-3.2 folds if containing intron 1. The intron 1 sequences of ACL1 and FAS1 also played significant regulatory roles. When driven by the intronic promoters of ACC1 and LDP1 (ACC1in and LDP1in promoter, respectively), the reporter gene expression were up-regulated by nitrogen starvation, independent of de novo oil biosynthesis and accumulation. As a proof of principle, overexpression of the endogenous acyl-CoA-dependent diacylglycerol acyltransferase 1 gene (DGA1) by LDP1in promoter was significantly more efficient than GPD1 promoter in enhancing lipid accumulation. Intronic sequences play an important role in regulating gene expression in R. toruloides. Three intronic promoters, LDP1in, ACC1in and FAS1in, are excellent promoters for metabolic engineering in the oleaginous and carotenogenic yeast, R. toruloides.
Ferritin gene organization: differences between plants and animals suggest possible kingdom-specific selective constraints.

PubMed

Proudhon, D; Wei, J; Briat, J; Theil, E C

1996-03-01

Ferritin, a protein widespread in nature, concentrates iron approximately 10(11)-10(12)-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.
Comparative analysis of the 5{prime} genomic and promoter regions between the mouse (Hdh) and human Huntington disease (HD) gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kalchman, M.; Lin, B.; Nasir, J.

1994-09-01

The mouse homologue of the Huntington disease gene (Hdh) has recently been cloned and mapped to a region of synteny with the human, on mouse chromosome 5. The two genes share a high degree of both coding (90% amino acid) and nucleotide (86.2%) identity. We have subsequently performed a detailed comparison of the genomic organization of the 5{prime} region of the two genes encompassing the promoter region and first five exons of both the human and mouse genes. The comparative sequence analysis of the promoter region between HD and Hdh reveals two highly conserved regions. One region (-56 to -118)more » (+1 is the ATG start codon), shared 84% nucleotide identity and another region (-130 to -206) had 81% nucleotide identity. Nine putative Sp1 sites appear in the human promoter region contrasted with only 3 in a similar region in the mouse. Furthermore, 17 and 20 base pair direct repeats present in the HD 5{prime} region are absent in the similar Hdh region. Although both the mouse and human intron/exon boundaries conform to the GT/AG rule, the intron sizes between HD and Hdh are markedly different. The first four introns in Hdh are 15, 7, 5 and 0.5 kb compared to sizes of 10, 15, 7 and 0.5 kb, respectively. Comparison between the mouse and human intronic sequences immediately adjacent to the first five exons (excluding exon 1) reveals only about 46 to 50% identity within the first 60 bp of intronic sequence. Furthermore, we have identified novel polymorphic di-, tri- and tetra-nucleotide repeats in Hdh introns of various mouse strains that are not present in the human. For example, polymorphic CT repeats are present in introns 2 and 4 of Hdh and a novel mouse 56 AAG trinucleotide repeat (interrupted by an AAGG) is also located within intron 2. This information concerning the promoter and genomic organization of both HD and Hdh is critical for designing appropriate gene targetting vectors for studying the normal function of the HD and Hdh genes in model systems.« less
A Report on Molecular Diagnostic Testing for Inherited Retinal Dystrophies by Targeted Genetic Analyses.

PubMed

Ramkumar, Hema L; Gudiseva, Harini V; Kishaba, Kameron T; Suk, John J; Verma, Rohan; Tadimeti, Keerti; Thorson, John A; Ayyagari, Radha

2017-02-01

To test the utility of targeted sequencing as a method of clinical molecular testing in patients diagnosed with inherited retinal degeneration (IRD). After genetic counseling, peripheral blood was drawn from 188 probands and 36 carriers of IRD. Single gene testing was performed on each patient in a Clinical Laboratory Improvement Amendment (CLIA) certified laboratory. DNA was isolated, and all exons in the gene of interest were analyzed along with 20 base pairs of flanking intronic sequence. Genetic testing was most often performed on ABCA4, CTRP5, ELOV4, BEST1, CRB1, and PRPH2. Pathogenicity of novel sequence changes was predicted by PolyPhen2 and sorting intolerant from tolerant (SIFT). Of the 225 genetic tests performed, 150 were for recessive IRD, and 75 were for dominant IRD. A positive molecular diagnosis was made in 70 (59%) of probands with recessive IRD and 19 (26%) probands with dominant IRD. Analysis confirmed 12 (34%) of individuals as carriers of familial mutations associated with IRD. Thirty-two novel variants were identified; among these, 17 sequence changes in four genes were predicted to be possibly or probably damaging including: ABCA4 (14), BEST1 (2), PRPH2 (1), and TIMP3 (1). Targeted analysis of clinically suspected genes in 225 subjects resulted in a positive molecular diagnosis in 26% of patients with dominant IRD and 59% of patients with recessive IRD. Novel damaging mutations were identified in four genes. Single gene screening is not an ideal method for diagnostic testing given the phenotypic and genetic heterogeneity among IRD cases. High-throughput sequencing of all genes associated with retinal degeneration may be more efficient for molecular diagnosis.
Molecular genetic studies of DMT1 on 12q in French-Canadian restless legs syndrome patients and families.

PubMed

Xiong, Lan; Dion, Patrick; Montplaisir, Jacques; Levchenko, Anastasia; Thibodeau, Pascale; Karemera, Liliane; Rivière, Jean-Baptiste; St-Onge, Judith; Gaspar, Claudia; Dubé, Marie-Pierre; Desautels, Alex; Turecki, Gustavo; Rouleau, Guy A

2007-10-05

Converging evidence from clinical observations, brain imaging and pathological findings strongly indicate impaired brain iron regulation in restless legs syndrome (RLS). Animal models with mutation in (DMT1) divalent metal transporter 1 gene, an important brain iron transporter, demonstrate a similar iron deficiency profile as found in RLS brain. The human DMT1 gene, mapped to chromosome 12q near the RLS1 locus, qualifies as an excellent functional and possible positional candidate for RLS. DMT1 protein levels were assessed in lymphoblastoid cell lines from RLS patients and controls. Linkage analyses were carried out with markers flanking and within the DMT1 gene. Selected patient samples from RLS families with compatible linkage to the RLS1 locus on 12q were fully sequenced in both the coding regions and the long stretches of UTR sequences. Finally, selected sequence variants were further studied in case/control and family-based association tests. A clinical association of anemia and RLS was further confirmed in this study. There was no detectable difference in DMT1 protein levels between RLS patient lymphoblastoid cell lines and normal controls. Non-parametric linkage analyses failed to identify any significant linkage signals within the DMT1 gene region. Sequencing of selected patients did not detect any sequence variant(s) compatible with DMT1 harboring RLS causative mutation(s). Further studies did not find any association between ten SNPs, spanning the whole DMT1 gene region, and RLS affection status. Finally, two DMT1 intronic SNPs showed positive association with RLS in patients with a history of anemia, when compared to RLS patients without anemia. (c) 2007 Wiley-Liss, Inc.
Differentiating Alström from Bardet-Biedl syndrome (BBS) using systematic ciliopathy genes sequencing.

PubMed

Aliferis, K; Hellé, S; Gyapay, G; Duchatelet, S; Stoetzel, C; Mandel, J L; Dollfus, H

2012-03-01

Early onset retinal degeneration associated with obesity can present a diagnostic challenge in paediatric ophthalmology practice. Clinical overlap between Bardet-Biedl syndrome (BBS) and Alström syndrome has been described, although the two entities are genetically distinct. To date, 16 genes are known to be associated with BBS (BBS1-16) and only one gene has been identified for Alström syndrome (ALMS1). In collaboration with the French National Center for Sequencing (CNS, Evry), all coding exons and flanking introns were sequenced for 27 ciliopathy genes (BBS1-12, MGC1203, TTC21b, AHI1, NPHP2-8 (NPHP6=BBS14), MKS1(BBS13), MKS3, C2ORF86, SDCCAG8, ALMS1) in 96 patients referred with a clinical diagnosis of BBS. ALMS1 gene analysis included sequencing of all coding exons. BBS known gene mutations were found in 44 patients (36 with two mutations and 8 heterozygous). ALMS1 mutations were found in four cases. The rate of ALMS1 mutations among patients suspected of having BBS was 4.2%. Clinically, all four patients presented early-onset severe retinal degeneration with congenital nystagmus associated with obesity. The difficult early differential diagnosis between the two syndromes is outlined. One mutation had already been reported (c.11310delAGAG/p.R3770fsX) and three were novel (c.2293C > T/p.Q765X, c.6823insA/p.R2275fsX, c.9046delA/p.N3016fsX). Ciliopathy genes sequencing can be very helpful in providing a timely diagnosis in this group of patients, hence appropriate genetic counselling for families and adequate medical follow-up for affected children.
Ovarian Tumors related to Intronic Mutations in DICER1: A Report from the International Ovarian and Testicular Stromal Tumor Registry

PubMed Central

Schultz, Kris Ann; Harris, Anne; Messinger, Yoav; Sencer, Susan; Baldinger, Shari; Dehner, Louis P.; Hill, D. Ashley

2015-01-01

Germline DICER1 mutations have been described in individuals with pleuropulmonary blastoma (PPB), ovarian Sertoli-Leydig cell tumor (SLCT), sarcomas, multinodular goiter, thyroid carcinoma, cystic nephroma and other neoplastic conditions. Early results from the International Ovarian and Testicular Stromal Tumor Registry show germline DICER1 mutations in 48% of girls and women with SLCT. In this report, a young woman presented with ovarian undifferentiated sarcoma. Four years later, she presented with SLCT. She was successfully treated for both malignancies. Sequence results showed a germline intronic mutation in DICER1. This mutation results in an exact duplication of the six bases at the splice site at the intron 23 and exon 24 junction. Predicted improper splicing leads to inclusion of 10 bases of intronic sequence, frameshift and premature truncation of the protein disrupting the RNase IIIb domain. A second individual with SLCT was found to have an identical germline mutation. In each of the ovarian tumors, an additional somatic mutation in the RNase IIIb domain of DICER1 was found. In rare patients, germline intronic mutations in DICER1 that are predicted to cause incorrect splicing can also contribute to the pathogenesis of SLCT. PMID:26289771
RNA editing in the anticodon of tRNA Leu (CAA) occurs before group I intron splicing in plastids of a moss Takakia lepidozioides S. Hatt. & Inoue.

PubMed

Miyata, Y; Sugita, C; Maruyama, K; Sugita, M

2008-03-01

RNA editing of cytidine (C) to uridine (U) transitions occurs in plastids and mitochondria of most land plants. In this study, we amplified and sequenced the group I intron-containing tRNA Leu gene, trnL-CAA, from Takakia lepidozioides, a moss. DNA sequence analysis revealed that the T. lepidozioides tRNA Leu gene consisted of a 35-bp 5' exon, a 469-bp group I intron and a 50-bp 3' exon. The intron was inserted between the first and second position of the tRNA Leu anticodon. In general, plastid tRNA Leu genes with a group I intron code for a TAA anticodon in most land plants. This strongly suggests that the first nucleotide of the CAA anticodon could be edited in T. lepidozioides plastids. To investigate this possibility, we analysed cDNAs derived from the trnL-CAA transcripts. We demonstrated that the first nucleotide C of the anticodon was edited to create a canonical UAA anticodon in T. lepidozioides plastids. cDNA sequencing analyses of the spliced or unspliced tRNA Leu transcripts revealed that, while the spliced tRNA was completely edited, editing in the unspliced tRNAs were only partial. This is the first experimental evidence that the anticodon editing of tRNA occurs before RNA splicing in plastids. This suggests that this editing is a prerequisite to splicing of pre-tRNA Leu.
Evaluation of non-coding variation in GLUT1 deficiency.

PubMed

Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S

2016-12-01

Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.
Mobile Bacterial Group II Introns at the Crux of Eukaryotic Evolution

PubMed Central

Lambowitz, Alan M.; Belfort, Marlene

2015-01-01

SUMMARY This review focuses on recent developments in our understanding of group II intron function, the relationships of these introns to retrotransposons and spliceosomes, and how their common features have informed thinking about bacterial group II introns as key elements in eukaryotic evolution. Reverse transcriptase-mediated and host factor-aided intron retrohoming pathways are considered along with retrotransposition mechanisms to novel sites in bacteria, where group II introns are thought to have originated. DNA target recognition and movement by target-primed reverse transcription infer an evolutionary relationship among group II introns, non-LTR retrotransposons, such as LINE elements, and telomerase. Additionally, group II introns are almost certainly the progenitors of spliceosomal introns. Their profound similarities include splicing chemistry extending to RNA catalysis, reaction stereochemistry, and the position of two divalent metals that perform catalysis at the RNA active site. There are also sequence and structural similarities between group II introns and the spliceosome’s small nuclear RNAs (snRNAs) and between a highly conserved core spliceosomal protein Prp8 and a group II intron-like reverse transcriptase. It has been proposed that group II introns entered eukaryotes during bacterial endosymbiosis or bacterial-archaeal fusion, proliferated within the nuclear genome, necessitating evolution of the nuclear envelope, and fragmented giving rise to spliceosomal introns. Thus, these bacterial self-splicing mobile elements have fundamentally impacted the composition of extant eukaryotic genomes, including the human genome, most of which is derived from close relatives of mobile group II introns. PMID:25878921
Rapid Construction of Stable Infectious Full-Length cDNA Clone of Papaya Leaf Distortion Mosaic Virus Using In-Fusion Cloning

PubMed Central

Tuo, Decai; Shen, Wentao; Yan, Pu; Li, Xiaoying; Zhou, Peng

2015-01-01

Papaya leaf distortion mosaic virus (PLDMV) is becoming a threat to papaya and transgenic papaya resistant to the related pathogen, papaya ringspot virus (PRSV). The generation of infectious viral clones is an essential step for reverse-genetics studies of viral gene function and cross-protection. In this study, a sequence- and ligation-independent cloning system, the In-Fusion® Cloning Kit (Clontech, Mountain View, CA, USA), was used to construct intron-less or intron-containing full-length cDNA clones of the isolate PLDMV-DF, with the simultaneous scarless assembly of multiple viral and intron fragments into a plasmid vector in a single reaction. The intron-containing full-length cDNA clone of PLDMV-DF was stably propagated in Escherichia coli. In vitro intron-containing transcripts were processed and spliced into biologically active intron-less transcripts following mechanical inoculation and then initiated systemic infections in Carica papaya L. seedlings, which developed similar symptoms to those caused by the wild-type virus. However, no infectivity was detected when the plants were inoculated with RNA transcripts from the intron-less construct because the instability of the viral cDNA clone in bacterial cells caused a non-sense or deletion mutation of the genomic sequence of PLDMV-DF. To our knowledge, this is the first report of the construction of an infectious full-length cDNA clone of PLDMV and the splicing of intron-containing transcripts following mechanical inoculation. In-Fusion cloning shortens the construction time from months to days. Therefore, it is a faster, more flexible, and more efficient method than the traditional multistep restriction enzyme-mediated subcloning procedure. PMID:26633465
Rapid Construction of Stable Infectious Full-Length cDNA Clone of Papaya Leaf Distortion Mosaic Virus Using In-Fusion Cloning.

PubMed

Tuo, Decai; Shen, Wentao; Yan, Pu; Li, Xiaoying; Zhou, Peng

2015-12-01

Papaya leaf distortion mosaic virus (PLDMV) is becoming a threat to papaya and transgenic papaya resistant to the related pathogen, papaya ringspot virus (PRSV). The generation of infectious viral clones is an essential step for reverse-genetics studies of viral gene function and cross-protection. In this study, a sequence- and ligation-independent cloning system, the In-Fusion(®) Cloning Kit (Clontech, Mountain View, CA, USA), was used to construct intron-less or intron-containing full-length cDNA clones of the isolate PLDMV-DF, with the simultaneous scarless assembly of multiple viral and intron fragments into a plasmid vector in a single reaction. The intron-containing full-length cDNA clone of PLDMV-DF was stably propagated in Escherichia coli. In vitro intron-containing transcripts were processed and spliced into biologically active intron-less transcripts following mechanical inoculation and then initiated systemic infections in Carica papaya L. seedlings, which developed similar symptoms to those caused by the wild-type virus. However, no infectivity was detected when the plants were inoculated with RNA transcripts from the intron-less construct because the instability of the viral cDNA clone in bacterial cells caused a non-sense or deletion mutation of the genomic sequence of PLDMV-DF. To our knowledge, this is the first report of the construction of an infectious full-length cDNA clone of PLDMV and the splicing of intron-containing transcripts following mechanical inoculation. In-Fusion cloning shortens the construction time from months to days. Therefore, it is a faster, more flexible, and more efficient method than the traditional multistep restriction enzyme-mediated subcloning procedure.
Frequencies of VNTR and RFLP polymorphisms associated with factor VIII gene in Singapore

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fong, I.; Lai, P.S.; Ouah, T.C.

1994-09-01

The allelic frequency of any polymorphism within a population determines its usefulness for genetic counselling. This is important in populations of non-Caucasian origin as RFLPs may significantly differ among ethnic groups. We report a study of five intragenic polymorphisms in factor VIII gene carried out in Singapore. The three PCR-based RFLP markers studied were Intron 18/Bcl I, Intron 19/Hind III and Intron 22/Xba I. In an analysis of 148 unrelated normal X chromosomes, the allele frequencies were found to be A1 = 0.18, A2 = 0.82 (Bcl I RFLP), A1 = 0.80, A2 = 0.20 (Hind III RFLP) and A1more » = 0.58, and A2 = 0.42 (Xba I RFLP). The heterozygosity rates of 74 females analyzed separately were 31%, 32% and 84.2%, respectively. Linkage disequilibrium was also observed to some degree between Bcl I and Hind III polymorphism in our population. We have also analyzed a sequence polymorphism in Intron 7 using hybridization with radioactive-labelled {sup 32}P allele-specific oligonucleotide probes. This polymorphism was not very polymorphic in our population with only 2% of 117 individuals analyzed being informative. However, the use of a hypervariable dinucleotide repeat sequence (VNTR) in Intron 13 showed that 25 of our of 27 (93%) females were heterozygous. Allele frequencies ranged from 1 to 55 %. We conclude that a viable strategy for molecular analysis of Hemophilia A families in our population should include the use of Intron 18/Bcl I and Intron 22/Xba I RFLP markers and the Intron 13 VNTR marker.« less
Evolution of the dispersed SUC gene family of Saccharomyces by rearrangements of chromosome telomeres.

PubMed Central

Carlson, M; Celenza, J L; Eng, F J

1985-01-01

The SUC gene family of Saccharomyces contains six structural genes for invertase (SUC1 through SUC5 and SUC7) which are located on different chromosomes. Most yeast strains do not carry all six SUC genes and instead carry natural negative (suc0) alleles at some or all SUC loci. We determined the physical structures of SUC and suc0 loci. Except for SUC2, which is an unusual member of the family, all of the SUC genes are located very close to telomeres and are flanked by homologous sequences. On the centromere-proximal side of the gene, the conserved region contains X sequences, which are sequences found adjacent to telomeres (C. S. M. Chan and B.-K. Tye, Cell 33:563-573, 1983). On the other side of the gene, the homology includes about 4 kilobases of flanking sequence and then extends into a Y' element, which is an element often found distal to the X sequence at telomeres (Chan and Tye, Cell 33:563-573, 1983). Thus, these SUC genes and flanking sequences are embedded in telomere-adjacent sequences. Chromosomes carrying suc0 alleles (except suc20) lack SUC structural genes and portions of the conserved flanking sequences. The results indicate that the dispersal of SUC genes to different chromosomes occurred by rearrangements of chromosome telomeres. Images PMID:3018485
[Bioinformatics Analysis of Clustered Regularly Interspaced Short Palindromic Repeats in the Genomes of Shigella].

PubMed

Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin

2015-04-01

This study was aimed to explore the features of clustered regularly interspaced short palindromic repeats (CRISPR) structures in Shigella by using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that the CRISPRs existed in the four groups of Shigella, and the flanking sequences of upstream CRISPRs could be classified into the same group with those of the downstream. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences had the same group with corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain "stem" and "ring". Some spacers were found to homologize with part sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.
Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome

PubMed Central

Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

2014-01-01

Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes. PMID:25482895
Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome.

PubMed

Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

2014-01-01

Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes.
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture.

PubMed

Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen; Burge, Christopher B

2017-12-27

Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning ('intron definition') or exon-spanning ('exon definition') pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila , using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60-70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.
BIALLELIC POLYMORPHISM IN THE INTRON REGION OF B-TUBULIN GENE OF CRYPTOSPORIDIUM PARASITES

EPA Science Inventory

Nucleotide sequencing of polymerase chain reaction-amplified intron region of the Cryptosporidium parvum B-tubulin gene in 26 human and 15 animal isolates revealed distinct genetic polymorphism between the human and bovine genotypes. The separation of 2 genotypes of C. parvum is...

Group I introns are inherited through common ancestry in the nuclear-encoded rRNA of Zygnematales (Charophyceae).

PubMed Central

Bhattacharya, D; Surek, B; Rüsing, M; Damberger, S; Melkonian, M

1994-01-01

Group I introns are found in organellar genomes, in the genomes of eubacteria and phages, and in nuclear-encoded rRNAs. The origin and distribution of nuclear-encoded rRNA group I introns are not understood. To elucidate their evolutionary relationships, we analyzed diverse nuclear-encoded small-subunit rRNA group I introns including nine sequences from the green-algal order Zygnematales (Charophyceae). Phylogenetic analyses of group I introns and rRNA coding regions suggest that lateral transfers have occurred in the evolutionary history of group I introns and that, after transfer, some of these elements may form stable components of the host-cell nuclear genomes. The Zygnematales introns, which share a common insertion site (position 1506 relative to the Escherichia coli small-subunit rRNA), form one subfamily of group I introns that has, after its origin, been inherited through common ancestry. Since the first Zygnematales appear in the middle Devonian within the fossil record, the "1506" group I intron presumably has been a stable component of the Zygnematales small-subunit rRNA coding region for 350-400 million years. PMID:7937917
Flanking sequence determination and event-specific detection of genetically modified wheat B73-6-1.

PubMed

Xu, Junyi; Cao, Jijuan; Cao, Dongmei; Zhao, Tongtong; Huang, Xin; Zhang, Piqiao; Luan, Fengxia

2013-05-01

In order to establish a specific identification method for genetically modified (GM) wheat, exogenous insert DNA and flanking sequence between exogenous fragment and recombinant chromosome of GM wheat B73-6-1 were successfully acquired by means of conventional polymerase chain reaction (PCR) and thermal asymmetric interlaced (TAIL)-PCR strategies. Newly acquired exogenous fragment covered the full-length sequence of transformed genes such as transformed plasmid and corresponding functional genes including marker uidA, herbicide-resistant bar, ubiquitin promoter, and high-molecular-weight gluten subunit. The flanking sequence between insert DNA revealed high similarity with Triticum turgidum A gene (GenBank: AY494981.1). A specific PCR detection method for GM wheat B73-6-1 was established on the basis of primers designed according to the flanking sequence. This specific PCR method was validated by GM wheat, GM corn, GM soybean, GM rice, and non-GM wheat. The specifically amplified target band was observed only in GM wheat B73-6-1. This method is of high specificity, high reproducibility, rapid identification, and excellent accuracy for the identification of GM wheat B73-6-1.
Efficiency of introns from various origins in fish cells.

PubMed

Bétancourt, O H; Attal, J; Théron, M C; Puissant, C; Houdebine, L M

1993-06-01

Several vectors containing (1) regulatory regions from Rous sarcoma virus (RSV), human cytomegalovirus (CMV), and herpes simplex thymidine kinase (TK); (2) introns from early or late SV40 genes and from trout growth hormone gene (tGH); (3) chloramphenicol acetyltransferase gene (CAT); and (4) transcription terminators from SV40 were transfected into carp EPC cells, salmon CHSE cells, tilapia TO2 cells, quail QT6 cells, and hamster CHO cells. CAT activity was measured in extracts from several cell lines 3 days after transfection and in the fish EPC stable clones. The CMV and RSV promoters were the most potent in all cell types. The intron from late SV40 genes (VP1 intron) worked properly in QT6 and CHO cells but not in EPC and very weakly in TO2 cells. The tGH intron was efficient in all cell types but preferentially in fish cells. The small t intron from SV40 was processed in all cell types. The small t and, to a lesser extent, the tGH introns amplified expression of cat gene in stable clones, in comparison to the transiently transfected cells. These results indicate that elements from mammalian genes may not be properly recognized by the fish cellular machinery and in an unpredictable manner. This finding suggests that vectors prepared to express foreign genes in transfected cultured fish cells and transgenic fish should preferably contain DNA sequences from fish genes or, alternatively, those sequences from mammalian genes that have been previously proved to be compatible with the fish cellular machinery.
Chloroplast genome expansion by intron multiplication in the basal psychrophilic euglenoid Eutreptiella pomquetensis

PubMed Central

Bennett, Matthew S.; Triemer, Richard E.; Preisfeld, Angelika

2017-01-01

Background Over the last few years multiple studies have been published showing a great diversity in size of chloroplast genomes (cpGenomes), and in the arrangement of gene clusters, in the Euglenales. However, while these genomes provided important insights into the evolution of cpGenomes across the Euglenales and within their genera, only two genomes were analyzed in regard to genomic variability between and within Euglenales and Eutreptiales. To better understand the dynamics of chloroplast genome evolution in early evolving Eutreptiales, this study focused on the cpGenome of Eutreptiella pomquetensis, and the spread and peculiarities of introns. Methods The Etl. pomquetensis cpGenome was sequenced, annotated and afterwards examined in structure, size, gene order and intron content. These features were compared with other euglenoid cpGenomes as well as those of prasinophyte green algae, including Pyramimonas parkeae. Results and Discussion With about 130,561 bp the chloroplast genome of Etl. pomquetensis, a basal taxon in the phototrophic euglenoids, was considerably larger than the two other Eutreptiales cpGenomes sequenced so far. Although the detected quadripartite structure resembled most green algae and plant chloroplast genomes, the gene content of the single copy regions in Etl. pomquetensis was completely different from those observed in green algae and plants. The gene composition of Etl. pomquetensis was extensively changed and turned out to be almost identical to other Eutreptiales and Euglenales, and not to P. parkeae. Furthermore, the cpGenome of Etl. pomquetensis was unexpectedly permeated by a high number of introns, which led to a substantially larger genome. The 51 identified introns of Etl. pomquetensis showed two major unique features: (i) more than half of the introns displayed a high level of pairwise identities; (ii) no group III introns could be identified in the protein coding genes. These findings support the hypothesis that group III introns are degenerated group II introns and evolved later. PMID:28852596
Isolation and characterization of full-length putative alcohol dehydrogenase genes from polygonum minus

NASA Astrophysics Data System (ADS)

Hamid, Nur Athirah Abd; Ismail, Ismanizan

2013-11-01

Polygonum minus, locally named as Kesum is an aromatic herb which is high in secondary metabolite content. Alcohol dehydrogenase is an important enzyme that catalyzes the reversible oxidation of alcohol and aldehyde with the presence of NAD(P)(H) as co-factor. The main focus of this research is to identify the gene of ADH. The total RNA was extracted from leaves of P. minus which was treated with 150 μM Jasmonic acid. Full-length cDNA sequence of ADH was isolated via rapid amplification cDNA end (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence and PCR was done on genomic DNA to determine the exon and intron organization. Two sequences of ADH, designated as PmADH1 and PmADH2 were successfully isolated. Both sequences have ORF of 801 bp which encode 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that both sequences are highly similar at the ORF region but divergent in the 3' untranslated regions (UTR). The amino acid is differ at the 107 residue; PmADH1 contains Gly (G) residue while PmADH2 contains Cys (C) residue. The intron-exon organization pattern of both sequences are also same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain "classical" short chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are the members of short chain alcohol dehydrogenase family.
Alu sequence involvement in transcriptional insulation of the keratin 18 gene in transgenic mice.

PubMed Central

Thorey, I S; Ceceña, G; Reynolds, W; Oshima, R G

1993-01-01

The human keratin 18 (K18) gene is expressed in a variety of adult simple epithelial tissues, including liver, intestine, lung, and kidney, but is not normally found in skin, muscle, heart, spleen, or most of the brain. Transgenic animals derived from the cloned K18 gene express the transgene in appropriate tissues at levels directly proportional to the copy number and independently of the sites of integration. We have investigated in transgenic mice the dependence of K18 gene expression on the distal 5' and 3' flanking sequences and upon the RNA polymerase III promoter of an Alu repetitive DNA transcription unit immediately upstream of the K18 promoter. Integration site-independent expression of tandemly duplicated K18 transgenes requires the presence of either an 825-bp fragment of the 5' flanking sequence or the 3.5-kb 3' flanking sequence. Mutation of the RNA polymerase III promoter of the Alu element within the 825-bp fragment abolishes copy number-dependent expression in kidney but does not abolish integration site-independent expression when assayed in the absence of the 3' flanking sequence of the K18 gene. The characteristics of integration site-independent expression and copy number-dependent expression are separable. In addition, the formation of the chromatin state of the K18 gene, which likely restricts the tissue-specific expression of this gene, is not dependent upon the distal flanking sequences of the 10-kb K18 gene but rather may depend on internal regulatory regions of the gene. Images PMID:7692231
Choosing and Using Introns in Molecular Phylogenetics

PubMed Central

Creer, Simon

2007-01-01

Introns are now commonly used in molecular phylogenetics in an attempt to recover gene trees that are concordant with species trees, but there are a range of genomic, logistical and analytical considerations that are infrequently discussed in empirical studies that utilize intron data. This review outlines expedient approaches for locus selection, overcoming paralogy problems, recombination detection methods and the identification and incorporation of LVHs in molecular systematics. A range of parsimony and Bayesian analytical approaches are also described in order to highlight the methods that can currently be employed to align sequences and treat indels in subsequent analyses. By covering the main points associated with the generation and analysis of intron data, this review aims to provide a comprehensive introduction to using introns (or any non-coding nuclear data partition) in contemporary phylogenetics. PMID:19461984
Comparative Analysis of Four Calypogeia Species Revealed Unexpected Change in Evolutionarily-Stable Liverwort Mitogenomes

PubMed Central

Ślipiko, Monika; Buczkowska-Chmielewska, Katarzyna; Bączkiewicz, Alina; Szczecińska, Monika; Sawicki, Jakub

2017-01-01

Liverwort mitogenomes are considered to be evolutionarily stable. A comparative analysis of four Calypogeia species revealed differences compared to previously sequenced liverwort mitogenomes. Such differences involve unexpected structural changes in the two genes, cox1 and atp1, which have lost three and two introns, respectively. The group I introns in the cox1 gene are proposed to have been lost by two-step localized retroprocessing, whereas one-step retroprocessing could be responsible for the disappearance of the group II introns in the atp1 gene. These cases represent the first identified losses of introns in mitogenomes of leafy liverworts (Jungermanniopsida) contrasting the stability of mitochondrial gene order with certain changes in the gene content and intron set in liverworts. PMID:29257096
Short intronic repeat sequences facilitate circular RNA production.

PubMed

Liang, Dongming; Wilusz, Jeremy E

2014-10-15

Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
Elongation Factor-1α Accurately Reconstructs Relationships Amongst Psyllid Families (Hemiptera: Psylloidea), with Possible Diagnostic Implications.

PubMed

Martoni, Francesco; Bulman, Simon R; Pitman, Andrew; Armstrong, Karen F

2017-12-05

The superfamily Psylloidea (Hemiptera: Sternorrhyncha) lacks a robust multigene phylogeny. This impedes our understanding of the evolution of this group of insects and, consequently, an accurate identification of individuals, of their plant host associations, and their roles as vectors of economically important plant pathogens. The conserved nuclear gene elongation factor-1 alpha (EF-1α) has been valuable as a higher-level phylogenetic marker in insects and it has also been widely used to investigate the evolution of intron/exon structure. To explore evolutionary relationships among Psylloidea, polymerase chain reaction amplification and nucleotide sequencing of a 250-bp EF-1α gene fragment was applied to psyllids belonging to five different families. Introns were detected in three individuals belonging to two families. The nine genera belonging to the family Aphalaridae all lacked introns, highlighting the possibility of using intron presence/absence as a diagnostic tool at a family level. When paired with cytochrome oxidase I gene sequences, the 250 bp EF-1α sequence appeared to be a very promising higher-level phylogenetic marker for psyllids. © The Author(s) 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Genome Analysis Reveals Interplay between 5′UTR Introns and Nuclear mRNA Export for Secretory and Mitochondrial Genes

PubMed Central

Cenik, Can; Chua, Hon Nian; Zhang, Hui; Tarnawsky, Stefan P.; Akef, Abdalla; Derti, Adnan; Tasan, Murat; Moore, Melissa J.; Palazzo, Alexander F.; Roth, Frederick P.

2011-01-01

In higher eukaryotes, messenger RNAs (mRNAs) are exported from the nucleus to the cytoplasm via factors deposited near the 5′ end of the transcript during splicing. The signal sequence coding region (SSCR) can support an alternative mRNA export (ALREX) pathway that does not require splicing. However, most SSCR–containing genes also have introns, so the interplay between these export mechanisms remains unclear. Here we support a model in which the furthest upstream element in a given transcript, be it an intron or an ALREX–promoting SSCR, dictates the mRNA export pathway used. We also experimentally demonstrate that nuclear-encoded mitochondrial genes can use the ALREX pathway. Thus, ALREX can also be supported by nucleotide signals within mitochondrial-targeting sequence coding regions (MSCRs). Finally, we identified and experimentally verified novel motifs associated with the ALREX pathway that are shared by both SSCRs and MSCRs. Our results show strong correlation between 5′ untranslated region (5′UTR) intron presence/absence and sequence features at the beginning of the coding region. They also suggest that genes encoding secretory and mitochondrial proteins share a common regulatory mechanism at the level of mRNA export. PMID:21533221
RPG: the Ribosomal Protein Gene database.

PubMed

Nakao, Akihiro; Yoshihama, Maki; Kenmochi, Naoya

2004-01-01

RPG (http://ribosome.miyazaki-med.ac.jp/) is a new database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli. Users can search the database by gene name and organism. Each record includes sequences (genomic, cDNA and amino acid sequences), intron/exon structures, genomic locations and information about orthologs. In addition, users can view and compare the gene structures of the above organisms and make multiple amino acid sequence alignments. RPG also provides information on small nucleolar RNAs (snoRNAs) that are encoded in the introns of RP genes.
RPG: the Ribosomal Protein Gene database

PubMed Central

Nakao, Akihiro; Yoshihama, Maki; Kenmochi, Naoya

2004-01-01

RPG (http://ribosome.miyazaki-med.ac.jp/) is a new database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli. Users can search the database by gene name and organism. Each record includes sequences (genomic, cDNA and amino acid sequences), intron/exon structures, genomic locations and information about orthologs. In addition, users can view and compare the gene structures of the above organisms and make multiple amino acid sequence alignments. RPG also provides information on small nucleolar RNAs (snoRNAs) that are encoded in the introns of RP genes. PMID:14681386
Refinement of the X-linked cataract locus (CXN) and gene analysis for CXN and Nance-Horan syndrome (NHS).

PubMed

Brooks, Simon; Ebenezer, Neil; Poopalasundaram, Subathra; Maher, Eamonn; Francis, Peter; Moore, Anthony; Hardcastle, Alison

2004-06-01

The X-linked congenital cataract (CXN) locus has been mapped to a 3-cM (approximately 3.5 Mb) interval on chromosome Xp22.13, which is syntenic to the mouse cataract disease locus Xcat and encompasses the recently refined Nance-Horan syndrome (NHS) locus. A positional cloning strategy has been adopted to identify the causative gene. In an attempt to refine the CXN locus, seven microsatellites were analysed within 21 individuals of a CXN family. Haplotypes were reconstructed confirming disease segregation with markers on Xp22.13. In addition, a proximal cross-over was observed between markers S3 and S4, thereby refining the CXN disease interval by approximately 400 Kb to 3.2 Mb, flanked by markers DXS9902 and S4. Two known genes (RAI2 and RBBP7) and a novel gene (TL1) were screened for mutations within an affected male from the CXN family and an NHS family by direct sequencing of coding exons and intron- exon splice sites. No mutations or polymorphisms were identified, therefore excluding them as disease-causative in CXN and NHS. In conclusion, the CXN locus has been successfully refined and excludes PPEF1 as a candidate gene. A further three candidates were excluded based on sequence analysis. Future positional cloning efforts will focus on the region of overlap between CXN, Xcat, and NHS.
A ribosomal orphon sequence from Xenopus laevis flanked by novel low copy number repetitive elements.

PubMed

Guimond, A; Moss, T

1999-02-01

We have used a differential cloning approach to isolate ribosomal/non-ribosomal frontier sequences from Xenopus laevis. A ribosomal intergenic spacer sequence (IGS) was cloned and shown not to be physically linked with the ribosomal locus. This ribosomal orphon contained the IGS sequences found immediately downstream of the 28S gene and included an array of enhancer repetitions and a non-functional spacer promoter. The orphon sequence was flanked by a member of the novel 'Frt' low copy repetitive element family. Three individual Frt repeats were sequenced and all members of this family were shown to lie clustered at two chromosomal sites, one of which contained the ribosomal orphon. One of the Frt elements contained an insertion of 297 bp that showed extensive homology to sequences within at least three other Xenopus genes. Each homology region was flanked by members of the T2 family of short interspersed repetitive elements, (SINEs), and by its target insertion sequence, suggesting multiple translocation events. The data are discussed in terms of the evolution of the ribosomal gene locus.
Hypervariable and highly divergent intron-exon organizations in the chordate Oikopleura dioica.

PubMed

Edvardsen, Rolf B; Lerat, Emmanuelle; Maeland, Anne Dorthea; Flåt, Mette; Tewari, Rita; Jensen, Marit F; Lehrach, Hans; Reinhardt, Richard; Seo, Hee-Chan; Chourrout, Daniel

2004-10-01

Oikopleura dioica is a pelagic tunicate with a very small genome and a very short life cycle. In order to investigate the intron-exon organizations in Oikopleura, we have isolated and characterized ribosomal protein EF-1alpha, Hox, and alpha-tubulin genes. Their intron positions have been compared with those of the same genes from various invertebrates and vertebrates, including four species with entirely sequenced genomes. Oikopleura genes, like Caenorhabditis genes, have introns at a large number of nonconserved positions, which must originate from late insertions or intron sliding of ancient insertions. Both species exhibit hypervariable intron-exon organization within their alpha-tubulin gene family. This is due to localization of most nonconserved intron positions in single members of this gene family. The hypervariability and divergence of intron positions in Oikopleura and Caenorhabditis may be related to the predominance of short introns, the processing of which is not very dependent upon the exonic environment compared to large introns. Also, both species have an undermethylated genome, and the control of methylation-induced point mutations imposes a control on exon size, at least in vertebrate genes. That introns placed at such variable positions in Oikopleura or C. elegans may serve a specific purpose is not easy to infer from our current knowledge and hypotheses on intron functions. We propose that new introns are retained in species with very short life cycles, because illegitimate exchanges including gene conversion are repressed. We also speculate that introns placed at gene-specific positions may contribute to suppressing these exchanges and thereby favor their own persistence.
Site directed recombination

DOEpatents

Jurka, Jerzy W.

1997-01-01

Enhanced homologous recombination is obtained by employing a consensus sequence which has been found to be associated with integration of repeat sequences, such as Alu and ID. The consensus sequence or sequence having a single transition mutation determines one site of a double break which allows for high efficiency of integration at the site. By introducing single or double stranded DNA having the consensus sequence flanking region joined to a sequence of interest, one can reproducibly direct integration of the sequence of interest at one or a limited number of sites. In this way, specific sites can be identified and homologous recombination achieved at the site by employing a second flanking sequence associated with a sequence proximal to the 3'-nick.
Occurrence of Can-SINEs and intron sequence evolution supports robust phylogeny of pinniped carnivores and their terrestrial relatives.

PubMed

Schröder, Christiane; Bleidorn, Christoph; Hartmann, Stefanie; Tiedemann, Ralph

2009-12-15

Investigating the dog genome we found 178965 introns with a moderate length of 200-1000 bp. A screening of these sequences against 23 different repeat libraries to find insertions of short interspersed elements (SINEs) detected 45276 SINEs. Virtually all of these SINEs (98%) belong to the tRNA-derived Can-SINE family. Can-SINEs arose about 55 million years ago before Carnivora split into two basal groups, the Caniformia (dog-like carnivores) and the Feliformia (cat-like carnivores). Genome comparisons of dog and cat recovered 506 putatively informative SINE loci for caniformian phylogeny. In this study we show how to use such genome information of model organisms to research the phylogeny of related non-model species of interest. Investigating a dataset including representatives of all major caniformian lineages, we analysed 24 randomly chosen loci for 22 taxa. All loci were amplifiable and revealed 17 parsimony-informative SINE insertions. The screening for informative SINE insertions yields a large amount of sequence information, in particular of introns, which contain reliable phylogenetic information as well. A phylogenetic analysis of intron- and SINE sequence data provided a statistically robust phylogeny which is congruent with the absence/presence pattern of our SINE markers. This phylogeny strongly supports a sistergroup relationship of Musteloidea and Pinnipedia. Within Pinnipedia, we see strong support from bootstrapping and the presence of a SINE insertion for a sistergroup relationship of the walrus with the Otariidae.
RNA-binding Protein Trinucleotide repeat-containing 6A Regulates the Formation of Circular RNA 0006916, with Important Functions in Lung Cancer Cells.

PubMed

Dai, Xin; Zhang, Nan; Cheng, Ying; Yang, Ti; Chen, Yingnan; Liu, Zhenzhong; Wang, Zhishan; Yang, Chengfeng; Jiang, Yiguo

2018-05-03

Circular RNAs (circRNAs) are widespread and diverse endogenous RNAs distinct from traditional linear RNAs, which may regulate gene expression in eukaryotes. However, the function of human circRNAs, including their potential role in lung cancer, remains largely unknown. We screened the circRNA circ0006916, which was evidently down-regulated in 16HBE-T cells (anti-benzopyrene-trans-7, 8-dihydrodiol-9, 10-epoxide-transformed human bronchial epithelial cells), and in A549 and H460 cell lines. Silencing of circ0006916, but not its parental gene homer scaffolding protein 1 (HOMER1), promoted cell proliferation via speeding up the cell cycle process rather than by inhibiting apoptosis; conversely, overexpression of circ0006916 had the opposite effect. Luciferase screening assay indicated that circ0006916 bound to miR-522-3p and inhibited pleckstrin homology domain and leucine rich repeat protein phosphatase 1 (PHLPP1) activity. We also explored the effect of the RNA-binding protein trinucleotide repeat-containing 6A (TNRC6A) on circ0006916 production. Circ0006916 expression was decreased after silencing TNRC6A. TNRC6A bound to the intron regions around the circRNA-forming exons of circ0006916, as shown by RNA immunoprecipitation assay combined with sequencing analysis. The association of circ0006916 with TNRC6A was further verified by RNA pull-down assays. We then constructed a carrier and confirmed that TNRC6A binding to the flanked intron region of circ0006916 was necessary for generation of circ0006916. These results demonstrate that TNRC6A regulates the biogenesis of the circRNA circ0006916, which has a regulatory role in cell growth.
An intronless form of the tobacco extensin gene terminator strongly enhances transient gene expression in plant leaves.

PubMed

Rosenthal, Sun Hee; Diamos, Andrew G; Mason, Hugh S

2018-03-01

We have found interesting features of a plant gene (extensin) 3' flanking region, including extremely efficient polyadenylation which greatly improves transient expression of transgenes when an intron is removed. Its use will greatly benefit studies of gene expression in plants, research in molecular biology, and applications for recombinant proteins. Plants are a promising platform for the production of recombinant proteins. To express high-value proteins in plants efficiently, the optimization of expression cassettes using appropriate regulatory sequences is critical. Here, we characterize the activity of the tobacco extensin (Ext) gene terminator by transient expression in Nicotiana benthamiana, tobacco, and lettuce. Ext is a member of the hydroxyproline-rich glycoprotein (HRGP) superfamily and constitutes the major protein component of cell walls. The present study demonstrates that the Ext terminator with its native intron removed increased transient gene expression up to 13.5-fold compared to previously established terminators. The enhanced transgene expression was correlated with increased mRNA accumulation and reduced levels of read-through transcripts, which could impair gene expression. Analysis of transcript 3'-ends found that the majority of polyadenylated transcripts were cleaved at a YA dinucleotide downstream from a canonical AAUAAA motif and a UG-rich region, both of which were found to be highly conserved among related extensin terminators. Deletion of either of these regions eliminated most of the activity of the terminator. Additionally, a 45 nt polypurine sequence ~ 175 nt upstream from the polyadenylation sites was found to also be necessary for the enhanced expression. We conclude that the use of Ext terminator has great potential to benefit the production of recombinant proteins in plants.

G to A substitution in 5{prime} donor splice site of introns 18 and 48 of COL1A1 gene of type I collagen results in different splicing alternatives in osteogenesis imperfecta type I cell strains

DOE Office of Scientific and Technical Information (OSTI.GOV)

Willing, M.; Deschenes, S.

We have identified a G to A substitution in the 5{prime} donor splice site of intron 18 of one COL1A1 allele in two unrelated families with osteogenesis imperfecta (OI) type I. A third OI type I family has a G to A substitution at the identical position in intron 48 of one COL1A1 allele. Both mutations abolish normal splicing and lead to reduced steady-state levels of mRNA from the mutant COL1A1 allele. The intron 18 mutation leads to both exon 18 skipping in the mRNA and to utilization of a single alternative splice site near the 3{prime} end of exonmore » 18. The latter results in deletion of the last 8 nucleotides of exon 18 from the mRNA, a shift in the translational reading-frame, and the creation of a premature termination codon in exon 19. Of the potential alternative 5{prime} splice sites in exon 18 and intron 18, the one utilized has a surrounding nucleotide sequence which most closely resembles that of the natural splice site. Although a G to A mutation was detected at the identical position in intron 48 of one COL1A1 allele in another OI type I family, nine complex alternative splicing patterns were identified by sequence analysis of cDNA clones derived from fibroblast mRNA from this cell strain. All result in partial or complete skipping of exon 48, with in-frame deletions of portions of exons 47 and/or 49. The different patterns of RNA splicing were not explained by their sequence homology with naturally occuring 5{prime} splice sites, but rather by recombination between highly homologous exon sequences, suggesting that we may not have identified the major splicing alternative(s) in this cell strain. Both G to A mutations result in decreased production of type I collagen, the common biochemical correlate of OI type I.« less
Design of retrovirus vectors for transfer and expression of the human. beta. -globin gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Miller, A.D.; Bender, M.A.; Harris, E.A.S.

1988-11-01

Regulated expression of the human ..beta..-globin gene has been demonstrated in cultured murine erythroleukemia cells and in mice after retrovirus-mediated gene transfer. However, the low titer of recombinant viruses described to date results in relatively inefficient gene transfer, which limits their usefulness for animal studies and for potential gene therapy in humans for diseases involving defective ..beta..-globin genes. The authors found regions that interfered with virus production within intron 2 of the ..beta..-globin gene and on both sides of the gene. The flanking regions could be removed, but intron 2 was required for ..beta..-globin expression. Inclusion of ..beta..-globin introns necessitatesmore » an antisense orientation of the gene within the retrovirus vector. However, they found no effect of the antisense ..beta..-globin transcription on virus production. A region downstream of the ..beta..-globin gene that stimulates expression of the gene in transgenic mice was included in the viruses without detrimental effects on virus titer. Virus titers of over 10/sup 6/ CFU/ml were obtained with the final vector design, which retained the ability to direct regulated expression of human ..beta..-globin in murine erythroleukemia cells. The vector also allowed transfer and expression of the human ..beta..-globin gene in hematopoietic cells (CFU-S cells) in mice.« less
DNA double-strand break in vivo at the 3' extremity of exons located upstream of group II introns. Senescence and circular DNA introns in Podospora mitochondria.

PubMed

Sainsard-Chanet, A; Begel, O; Belcour, L

1994-10-07

In the filamentous fungus Podospora anserina, the unavoidable phenomenon of senescence is associated with the amplification of the first intron of the mitochondrial cox1 that accumulates as circular DNA molecules consisting of tandem repeats. This group II intron (cox1-i1 or alpha) is able to transpose and contains an open reading frame with significant amino acid similarity with reverse transcriptases. The generation of these intronic circular DNA molecules, their amplification and their involvement in the senescence process are unresolved questions. We demonstrate here that: (1) another group II intron, the fourth intron of gene cox1, cox1-i4, is also able to give precise DNA end to end junctions; (2) this intronic sequence can be found amplified during senescence, although to a lesser extent than cox1-i1; (3) the amplification of the DNA multimeric cox1-i1 molecules likely does not proceed by autonomous replication; (4) the generation of the DNA intronic circles does not require efficient intron splicing; (5) a DNA double-strand break occurs in vivo at the 3' extremity of the cox1-e1 and cox1-e4 exons preceding the group II introns that form circular DNAs. On the whole, these results show that the ability to form DNA circular molecules is a property of some group II introns and they demonstrate the occurrence of a specific DNA cleavage at or near the integration site of these group II introns. The results strongly suggest that this cleavage is involved in the formation of the group II intronic DNA circles and could also be involved in the phenomenon of group II intron homing.
Branchpoint selection in the splicing of U12-dependent introns in vitro.

PubMed

McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A

2002-05-01

In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome.
Branchpoint selection in the splicing of U12-dependent introns in vitro.

PubMed Central

McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A

2002-01-01

In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome. PMID:12022225
Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence.

PubMed

Shin, Dong-Ho; Webb, Barbara M; Nakao, Miki; Smith, Sylvia L

2009-07-01

Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and -d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (
Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence

PubMed Central

Shin, Dong-Ho; Webb, Barbara M.; Nakao, Miki; Smith, Sylvia L.

2009-01-01

Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and –d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (≤) amino acid identities with each other, 35.4 ~ 39.6% and 62.8 ~ 65.9% with factor I of mammals and banded houndshark (Triakis scyllium), respectively. The modular structure of the GcIf is similar to that of mammals with one notable exception, the presence of a novel shark-specific sequence between the leader peptide (LP) and the factor I membrane attack complex (FIMAC) domain. The cDNA sequences differ only in the size and composition of the shark-specific region (SSR). Sequence analysis of each SSR has identified within the region two novel short sequences (SS1 and SS2) and three repeat sequences (RS1, 2 and 3). Genomic analysis has revealed the existence of three introns between the leader peptide and the FIMAC domain, tentatively designated intron 1, intron 2, and intron 3 which span 4067, 2293 and 2082 bp, respectively. Southern blot analysis suggests the presence of a single gene copy for each cDNA type. Phylogenetic analysis suggests that complement factor I of cartilaginous fish diverged prior to the emergence of mammals. All four GcIf cDNA species are expressed in four different tissues and the liver is the main tissue in which expression level of all four is high. This suggests that the expression of GcIf isotypes is tissue-dependent. PMID:19423168
The sequence, structure and evolutionary features of HOTAIR in mammals

PubMed Central

2011-01-01

Background An increasing number of long noncoding RNAs (lncRNAs) have been identified recently. Different from all the others that function in cis to regulate local gene expression, the newly identified HOTAIR is located between HoxC11 and HoxC12 in the human genome and regulates HoxD expression in multiple tissues. Like the well-characterised lncRNA Xist, HOTAIR binds to polycomb proteins to methylate histones at multiple HoxD loci, but unlike Xist, many details of its structure and function, as well as the trans regulation, remain unclear. Moreover, HOTAIR is involved in the aberrant regulation of gene expression in cancer. Results To identify conserved domains in HOTAIR and study the phylogenetic distribution of this lncRNA, we searched the genomes of 10 mammalian and 3 non-mammalian vertebrates for matches to its 6 exons and the two conserved domains within the 1800 bp exon6 using Infernal. There was just one high-scoring hit for each mammal, but many low-scoring hits were found in both mammals and non-mammalian vertebrates. These hits and their flanking genes in four placental mammals and platypus were examined to determine whether HOTAIR contained elements shared by other lncRNAs. Several of the hits were within unknown transcripts or ncRNAs, many were within introns of, or antisense to, protein-coding genes, and conservation of the flanking genes was observed only between human and chimpanzee. Phylogenetic analysis revealed discrete evolutionary dynamics for orthologous sequences of HOTAIR exons. Exon1 at the 5' end and a domain in exon6 near the 3' end, which contain domains that bind to multiple proteins, have evolved faster in primates than in other mammals. Structures were predicted for exon1, two domains of exon6 and the full HOTAIR sequence. The sequence and structure of two fragments, in exon1 and the domain B of exon6 respectively, were identified to robustly occur in predicted structures of exon1, domain B of exon6 and the full HOTAIR in mammals. Conclusions HOTAIR exists in mammals, has poorly conserved sequences and considerably conserved structures, and has evolved faster than nearby HoxC genes. Exons of HOTAIR show distinct evolutionary features, and a 239 bp domain in the 1804 bp exon6 is especially conserved. These features, together with the absence of some exons and sequences in mouse, rat and kangaroo, suggest ab initio generation of HOTAIR in marsupials. Structure prediction identifies two fragments in the 5' end exon1 and the 3' end domain B of exon6, with sequence and structure invariably occurring in various predicted structures of exon1, the domain B of exon6 and the full HOTAIR. PMID:21496275
Effective suppression of dengue virus using a novel group-I intron that induces apoptotic cell death upon infection through conditional expression of the Bax C-terminal domain.

PubMed

Carter, James R; Keith, James H; Fraser, Tresa S; Dawson, James L; Kucharski, Cheryl A; Horne, Kate M; Higgs, Stephen; Fraser, Malcolm J

2014-06-13

Approximately 100 million confirmed infections and 20,000 deaths are caused by Dengue virus (DENV) outbreaks annually. Global warming and rapid dispersal have resulted in DENV epidemics in formally non-endemic regions. Currently no consistently effective preventive measures for DENV exist, prompting development of transgenic and paratransgenic vector control approaches. Production of transgenic mosquitoes refractory for virus infection and/or transmission is contingent upon defining antiviral genes that have low probability for allowing escape mutations, and are equally effective against multiple serotypes. Previously we demonstrated the effectiveness of an anti-viral group I intron targeting U143 of the DENV genome in mediating trans-splicing and expression of a marker gene with the capsid coding domain. In this report we examine the effectiveness of coupling expression of ΔN Bax to trans-splicing U143 intron activity as a means of suppressing DENV infection of mosquito cells. Targeting the conserved DENV circularization sequence (CS) by U143 intron trans-splicing activity appends a 3' exon RNA encoding ΔN Bax to the capsid coding region of the genomic RNA, resulting in a chimeric protein that induces premature cell death upon infection. TCID50-IFA analyses demonstrate an enhancement of DENV suppression for all DENV serotypes tested over the identical group I intron coupled with the non-apoptotic inducing firefly luciferase as the 3' exon. These cumulative results confirm the increased effectiveness of this αDENV-U143-ΔN Bax group I intron as a sequence specific antiviral that should be useful for suppression of DENV in transgenic mosquitoes. Annexin V staining, caspase 3 assays, and DNA ladder observations confirm DCA-ΔN Bax fusion protein expression induces apoptotic cell death. This report confirms the relative effectiveness of an anti-DENV group I intron coupled to an apoptosis-inducing ΔN Bax 3' exon that trans-splices conserved sequences of the 5' CS region of all DENV serotypes and induces apoptotic cell death upon infection. Our results confirm coupling the targeted ribozyme capabilities of the group I intron with the generation of an apoptosis-inducing transcript increases the effectiveness of infection suppression, improving the prospects of this unique approach as a means of inducing transgenic refractoriness in mosquitoes for all serotypes of this important disease.
Efficient gusA transient expression in Porphyra yezoensis protoplasts mediated by endogenous beta-tubulin flanking sequences

NASA Astrophysics Data System (ADS)

Gong, Qianhong; Yu, Wengong; Dai, Jixun; Liu, Hongquan; Xu, Rifu; Guan, Huashi; Pan, Kehou

2007-01-01

Endogenous tubulin promoter has been widely used for expressing foreign genes in green algae, but the efficiency and feasibility of endogenous tubulin promoter in the economically important Porphyra yezoensis (Rhodophyta) are unknown. In this study, the flanking sequences of beta-tubulin gene from P. yezoensis were amplified and two transient expression vectors were constructed to determine their transcription promoting feasibility for foreign gene gusA. The testing vector pATubGUS was constructed by inserting 5'-and 3'-flanking regions ( Tub5' and Tub3') up-and down-stream of β-glucuronidase (GUS) gene ( gusA), respectively, into pA, a derivative of pCAT®3-enhancer vector. The control construct, pAGUSTub3, contains only gusA and Tub3'. These constructs were electroporated into P. yezoensis protoplasts and the GUS activities were quantitatively analyzed by spectrometry. The results demonstrated that gusA gene was efficiently expressed in P. yezoensis protoplasts under the regulation of 5'-flanking sequence of the beta-tubulin gene. More interestingly, the pATubGUS produced stronger GUS activity in P. yezoensis protoplasts when compared to the result from pBI221, in which the gusA gene was directed by a constitutive CaMV 35S promoter. The data suggest that the integration of P. yezoensis protoplast and its endogenous beta-tubulin flanking sequences is a potential novel system for foreign gene expression.
A sequence-based genetic map of Medicago truncatula and comparison of marker colinearity with M. sativa.

PubMed Central

Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R

2004-01-01

A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563
Sensing Self and Foreign Circular RNAs by Intron Identity.

PubMed

Chen, Y Grace; Kim, Myoungjoo V; Chen, Xingqi; Batista, Pedro J; Aoyama, Saeko; Wilusz, Jeremy E; Iwasaki, Akiko; Chang, Howard Y

2017-07-20

Circular RNAs (circRNAs) are single-stranded RNAs that are joined head to tail with largely unknown functions. Here we show that transfection of purified in vitro generated circRNA into mammalian cells led to potent induction of innate immunity genes and confers protection against viral infection. The nucleic acid sensor RIG-I is necessary to sense foreign circRNA, and RIG-I and foreign circRNA co-aggregate in cytoplasmic foci. CircRNA activation of innate immunity is independent of a 5' triphosphate, double-stranded RNA structure, or the primary sequence of the foreign circRNA. Instead, self-nonself discrimination depends on the intron that programs the circRNA. Use of a human intron to express a foreign circRNA sequence abrogates immune activation, and mature human circRNA is associated with diverse RNA binding proteins reflecting its endogenous splicing and biogenesis. These results reveal innate immune sensing of circRNA and highlight introns-the predominant output of mammalian transcription-as arbiters of self-nonself identity. Copyright © 2017 Elsevier Inc. All rights reserved.
Insights into evolution in Andean Polystichum (Dryopteridaceae) from expanded understanding of the cytosolic phosphoglucose isomerase gene.

PubMed

Lyons, Brendan M; McHenry, Monique A; Barrington, David S

2017-07-01

Cytosolic phosphoglucose isomerase (pgiC) is an enzyme essential to glycolysis found universally in eukaryotes, but broad understanding of variation in the gene coding for pgiC is lacking for ferns. We used a substantially expanded representation of the gene for Andean species of the fern genus Polystichum to characterize pgiC in ferns relative to angiosperms, insects, and an amoebozoan; assess the impact of selection versus neutral evolutionary processes on pgiC; and explore evolutionary relationships of selected Andean species. The dataset of complete sequences comprised nine accessions representing seven species and one hybrid from the Andes and Serra do Mar. The aligned sequences of the full data set comprised 3376 base pairs (70% of the entire gene) including 17 exons and 15 introns from two central areas of the gene. The exons are highly conserved relative to angiosperms and retain substantial homology to insect pgiC, but intron length and structure are unique to the ferns. Average intron size is similar to angiosperms; intron number and location in insects are unlike those of the plants we considered. The introns included an array of indels and, in intron 7, an extensive microsatellite array with potential utility in analyzing population-level histories. Bayesian and maximum-parsimony analysis of 129 variable nucleotides in the Andean polystichums revealed that 59 (1.7% of the 3376 total) were phylogenetically informative; most of these united sister accessions. The phylogenetic trees for the Andean polystichums were incongruent with previously published cpDNA trees for the same taxa, likely the result of rapid evolutionary change in the introns and contrasting stability in the exons. The exons code a total of seven amino-acid substitutions. Comparison of non-synonymous to synonymous substitutions did not suggest that the pgiC gene is under selection in the Andes. Variation in pgiC including two additional accessions represented by incomplete sequences provided new insights into reticulate relationships among Andean taxa. Copyright © 2017 Elsevier Inc. All rights reserved.
Phylogenetic Analysis of Nuclear-Encoded RNA Maturases

PubMed Central

Malik, Sunita; Upadhyaya, KC; Khurana, SM Paul

2017-01-01

Posttranscriptional processes, such as splicing, play a crucial role in gene expression and are prevalent not only in nuclear genes but also in plant mitochondria where splicing of group II introns is catalyzed by a class of proteins termed maturases. In plant mitochondria, there are 22 mitochondrial group II introns. matR, nMAT1, nMAT2, nMAT3, and nMAT4 proteins have been shown to be required for efficient splicing of several group II introns in Arabidopsis thaliana. Nuclear maturases (nMATs) are necessary for splicing of mitochondrial genes, leading to normal oxidative phosphorylation. Sequence analysis through phylogenetic tree (including bootstrapping) revealed high homology with maturase sequences of A thaliana and other plants. This study shows the phylogenetic relationship of nMAT proteins between A thaliana and other nonredundant plant species taken from BLASTP analysis. PMID:28607538
DNA pooling: a comprehensive, multi-stage association analysis of ACSL6 and SIRT5 polymorphisms in schizophrenia.

PubMed

Chowdari, K V; Northup, A; Pless, L; Wood, J; Joo, Y H; Mirnics, K; Lewis, D A; Levitt, P R; Bacanu, S-A; Nimgaonkar, V L

2007-04-01

Many candidate gene association studies have evaluated incomplete, unrepresentative sets of single nucleotide polymorphisms (SNPs), producing non-significant results that are difficult to interpret. Using a rapid, efficient strategy designed to investigate all common SNPs, we tested associations between schizophrenia and two positional candidate genes: ACSL6 (Acyl-Coenzyme A synthetase long-chain family member 6) and SIRT5 (silent mating type information regulation 2 homologue 5). We initially evaluated the utility of DNA sequencing traces to estimate SNP allele frequencies in pooled DNA samples. The mean variances for the DNA sequencing estimates were acceptable and were comparable to other published methods (mean variance: 0.0008, range 0-0.0119). Using pooled DNA samples from cases with schizophrenia/schizoaffective disorder (Diagnostic and Statistical Manual of Mental Disorders edition IV criteria) and controls (n=200, each group), we next sequenced all exons, introns and flanking upstream/downstream sequences for ACSL6 and SIRT5. Among 69 identified SNPs, case-control allele frequency comparisons revealed nine suggestive associations (P<0.2). Each of these SNPs was next genotyped in the individual samples composing the pools. A suggestive association with rs 11743803 at ACSL6 remained (allele-wise P=0.02), with diminished evidence in an extended sample (448 cases, 554 controls, P=0.062). In conclusion, we propose a multi-stage method for comprehensive, rapid, efficient and economical genetic association analysis that enables simultaneous SNP detection and allele frequency estimation in large samples. This strategy may be particularly useful for research groups lacking access to high throughput genotyping facilities. Our analyses did not yield convincing evidence for associations of schizophrenia with ACSL6 or SIRT5.
A draft fur seal genome provides insights into factors affecting SNP validation and how to mitigate them.

PubMed

Humble, E; Martinez-Barrio, A; Forcada, J; Trathan, P N; Thorne, M A S; Hoffmann, M; Wolf, J B W; Hoffman, J I

2016-07-01

Custom genotyping arrays provide a flexible and accurate means of genotyping single nucleotide polymorphisms (SNPs) in a large number of individuals of essentially any organism. However, validation rates, defined as the proportion of putative SNPs that are verified to be polymorphic in a population, are often very low. A number of potential causes of assay failure have been identified, but none have been explored systematically. In particular, as SNPs are often developed from transcriptomes, parameters relating to the genomic context are rarely taken into account. Here, we assembled a draft Antarctic fur seal (Arctocephalus gazella) genome (assembly size: 2.41 Gb; scaffold/contig N50 : 3.1 Mb/27.5 kb). We then used this resource to map the probe sequences of 144 putative SNPs genotyped in 480 individuals. The number of probe-to-genome mappings and alignment length together explained almost a third of the variation in validation success, indicating that sequence uniqueness and proximity to intron-exon boundaries play an important role. The same pattern was found after mapping the probe sequences to the Walrus and Weddell seal genomes, suggesting that the genomes of species divergent by as much as 23 million years can hold information relevant to SNP validation outcomes. Additionally, reanalysis of genotyping data from seven previous studies found the same two variables to be significantly associated with SNP validation success across a variety of taxa. Finally, our study reveals considerable scope for validation rates to be improved, either by simply filtering for SNPs whose flanking sequences align uniquely and completely to a reference genome, or through predictive modelling. © 2015 John Wiley & Sons Ltd.
Alternative splicing by participation of the group II intron ORF in extremely halotolerant and alkaliphilic Oceanobacillus iheyensis.

PubMed

Chee, Gab-Joo; Takami, Hideto

2011-01-01

Group II introns inserted into genes often undergo splicing at unexpected sites, and participate in the transcription of host genes. We identified five copies of a group II intron, designated Oi.Int, in the genome of an extremely halotolerant and alkaliphilic bacillus, Oceanobacillus iheyensis. The Oi.Int4 differs from the Oi.Int3 at four bases. The ligated exons of the Oi.Int4 could not be detected by RT-PCR assays in vivo or in vitro although group II introns can generally self-splice in vitro without the involvement of an intron-encoded open reading frame (ORF). In the Oi.Int4 mutants with base substitutions within the ORF, ligated exons were detected by in vitro self-splicing. It was clear that the ligation of exons during splicing is affected by the sequence of the intron-encoded ORF since the splice sites corresponded to the joining sites of the intron. In addition, the mutant introns showed unexpected multiple products with alternative 5' splice sites. These findings imply that alternative 5' splicing which causes a functional change of ligated exons presumably has influenced past adaptations of O. iheyensis to various environmental changes.
Dispersion of the RmInt1 group II intron in the Sinorhizobium meliloti genome upon acquisition by conjugative transfer

PubMed Central

Nisa-Martínez, Rafael; Jiménez-Zurdo, José I.; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás

2007-01-01

RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase–maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (ΔORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic ΔORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature. PMID:17158161
Dispersion of the RmInt1 group II intron in the Sinorhizobium meliloti genome upon acquisition by conjugative transfer.

PubMed

Nisa-Martínez, Rafael; Jiménez-Zurdo, José I; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás

2007-01-01

RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase-maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (DeltaORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic DeltaORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature.
Long-read sequencing of nascent RNA reveals coupling among RNA processing events.

PubMed

Herzel, Lydia; Straube, Korinna; Neugebauer, Karla M

2018-06-14

Pre-mRNA splicing is accomplished by the spliceosome, a megadalton complex that assembles de novo on each intron. Because spliceosome assembly and catalysis occur cotranscriptionally, we hypothesized that introns are removed in the order of their transcription in genomes dominated by constitutive splicing. Remarkably little is known about splicing order and the regulatory potential of nascent transcript remodeling by splicing, due to the limitations of existing methods that focus on analysis of mature splicing products (mRNAs) rather than substrates and intermediates. Here, we overcome this obstacle through long-read RNA sequencing of nascent, multi-intron transcripts in the fission yeast Schizosaccharomyces pombe Most multi-intron transcripts were fully spliced, consistent with rapid cotranscriptional splicing. However, an unexpectedly high proportion of transcripts were either fully spliced or fully unspliced, suggesting that splicing of any given intron is dependent on the splicing status of other introns in the transcript. Supporting this, mild inhibition of splicing by a temperature-sensitive mutation in prp2 , the homolog of vertebrate U2AF65, increased the frequency of fully unspliced transcripts. Importantly, fully unspliced transcripts displayed transcriptional read-through at the polyA site and were degraded cotranscriptionally by the nuclear exosome. Finally, we show that cellular mRNA levels were reduced in genes with a high number of unspliced nascent transcripts during caffeine treatment, showing regulatory significance of cotranscriptional splicing. Therefore, overall splicing of individual nascent transcripts, 3' end formation, and mRNA half-life depend on the splicing status of neighboring introns, suggesting crosstalk among spliceosomes and the polyA cleavage machinery during transcription elongation. © 2018 Herzel et al.; Published by Cold Spring Harbor Laboratory Press.

Fusion primer and nested integrated PCR (FPNI-PCR): a new high-efficiency strategy for rapid chromosome walking or flanking sequence cloning

PubMed Central

2011-01-01

Background The advent of genomics-based technologies has revolutionized many fields of biological enquiry. However, chromosome walking or flanking sequence cloning is still a necessary and important procedure to determining gene structure. Such methods are used to identify T-DNA insertion sites and so are especially relevant for organisms where large T-DNA insertion libraries have been created, such as rice and Arabidopsis. The currently available methods for flanking sequence cloning, including the popular TAIL-PCR technique, are relatively laborious and slow. Results Here, we report a simple and effective fusion primer and nested integrated PCR method (FPNI-PCR) for the identification and cloning of unknown genomic regions flanked known sequences. In brief, a set of universal primers was designed that consisted of various 15-16 base arbitrary degenerate oligonucleotides. These arbitrary degenerate primers were fused to the 3' end of an adaptor oligonucleotide which provided a known sequence without degenerate nucleotides, thereby forming the fusion primers (FPs). These fusion primers are employed in the first step of an integrated nested PCR strategy which defines the overall FPNI-PCR protocol. In order to demonstrate the efficacy of this novel strategy, we have successfully used it to isolate multiple genomic sequences namely, 21 orthologs of genes in various species of Rosaceace, 4 MYB genes of Rosa rugosa, 3 promoters of transcription factors of Petunia hybrida, and 4 flanking sequences of T-DNA insertion sites in transgenic tobacco lines and 6 specific genes from sequenced genome of rice and Arabidopsis. Conclusions The successful amplification of target products through FPNI-PCR verified that this novel strategy is an effective, low cost and simple procedure. Furthermore, FPNI-PCR represents a more sensitive, rapid and accurate technique than the established TAIL-PCR and hiTAIL-PCR procedures. PMID:22093809
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture

PubMed Central

Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen

2017-01-01

Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing. PMID:29280736
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture

DOE PAGES

Pai, Athma A.; Henriques, Telmo; McCue, Kayla; ...

2017-12-27

Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pai, Athma A.; Henriques, Telmo; McCue, Kayla

Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
Late-onset spastic paraplegia: Aberrant SPG11 transcripts generated by a novel splice site donor mutation.

PubMed

Kawarai, Toshitaka; Miyamoto, Ryosuke; Mori, Atsuko; Oki, Ryosuke; Tsukamoto-Miyashiro, Ai; Matsui, Naoko; Miyazaki, Yoshimichi; Orlacchio, Antonio; Izumi, Yuishin; Nishida, Yoshihiko; Kaji, Ryuji

2015-12-15

We identified a novel homozygous mutation in the splice site donor (SSD) of intron 30 (c.5866+1G>A) in consanguineous Japanese SPG11 siblings showing late-onset spastic paraplegia using the whole-exome sequencing. Phenotypic variability was observed, including age-at-onset, dysarthria and pes cavus. Coding DNA sequencing revealed that the mutation affected the recognition of the constitutive SSD of intron 30, splicing upstream onto a nearby cryptic SSD in exon 30. The use of constitutive splice sites of intron 29 was confirmed by sequencing. The mutant transcripts are mostly subject to degradation by the nonsense-mediated mRNA decay system. SPG11 transcripts, escaping from the nonsense-mediated mRNA decay pathway, would generate a truncated protein (p.Tyr1900Phefs5X) containing the first 1899 amino acids and followed by 4 aberrant amino acids. This study showed a successful clinical application of whole-exome sequencing in spastic paraplegia and demonstrated a further evidence of allelic heterogeneity in SPG11. The confirmation of aberrant transcript by splice site mutation is a prerequisite for a more precise molecular diagnosis. Copyright © 2015 Elsevier B.V. All rights reserved.
Typing of artiodactyl MHC-DRB genes with the help of intronic simple repeated DNA sequences.

PubMed

Schwaiger, F W; Buitkamp, J; Weyers, E; Epplen, J T

1993-02-01

An efficient oligonucleotide typing method for the highly polymorphic MHC-DRB genes is described for artiodactyls like cattle, sheep and goat. By means of the polymerase chain reaction, the second exon of MHC-DRB is amplified as well as part of the adjacent intron containing a mixed simple repeat sequence. Using this primer combination we were able to amplify the MHC-DRB exons 2 and adjacent introns from all of the investigated 10 species of the family of Bovidae and giraffes. Therefore, the DRB genes of novel artiodactyl species can also be readily studied. Oligonucleotide probes specific for the polymorphisms of ungulate DRB genes are used with which sequences differing in at least one single base can be distinguished. Exonic polymorphism was found to be correlated with the allele lengths and the patterns of the repeat structures. Hence oligonucleotide probes specific for different simple repeats and polymorphic positions serve also for typing across species barriers. The strict correlation of sequence length and exonic polymorphism permits a preselection of specific oligonucleotides for hybridization. Thus more than 20 alleles can already be differentiated from each of the three species.
[Sequence analysis of LEAFY homologous gene from Dendrobium moniliforme and application for identification of medicinal Dendrobium].

PubMed

Xing, Wen-Rui; Hou, Bei-Wei; Guan, Jing-Jiao; Luo, Jing; Ding, Xiao-Yu

2013-04-01

The LEAFY (LFY) homologous gene of Dendrobium moniliforme (L.) Sw. was cloned by new primers which were designed based on the conservative region of known sequences of orchid LEAFY gene. Partial LFY homologous gene was cloned by common PCR, then we got the complete LFY homologous gene Den LFY by Tail-PCR. The complete sequence of DenLFY gene was 3 575 bp which contained three exons and two introns. Using BLAST method, comparison analysis among the exon of LFY homologous gene indicted that the DenLFY gene had high identity with orchids LFY homologous, including the related fragment of PhalLFY (84%) in Phalaenopsis hybrid cultivar, LFY homologous gene in Oncidium (90%) and in other orchid (over 80%). Using MP analysis, Dendrobium is found to be the sister to Oncidium and Phalaenopsis. Homologous analysis demonstrated that the C-terminal amino acids were highly conserved. When the exons and introns were separately considered, exons and the sequence of amino acid were good markers for the function research of DenLFY gene. The second intron can be used in authentication research of Dendrobium based on the length polymorphism between Dendrobium moniliforme and Dendrobium officinale.
Development and utilization of novel intron length polymorphic markers in foxtail millet (Setaria italica (L.) P. Beauv.).

PubMed

Gupta, Sarika; Kumari, Kajal; Das, Jyotirmoy; Lata, Charu; Puranik, Swati; Prasad, Manoj

2011-07-01

Introns are noncoding sequences in a gene that are transcribed to precursor mRNA but spliced out during mRNA maturation and are abundant in eukaryotic genomes. The availability of codominant molecular markers and saturated genetic linkage maps have been limited in foxtail millet (Setaria italica (L.) P. Beauv.). Here, we describe the development of 98 novel intron length polymorphic (ILP) markers in foxtail millet using sequence information of the model plant rice. A total of 575 nonredundant expressed sequence tag (EST) sequences were obtained, of which 327 and 248 unique sequences were from dehydration- and salinity-stressed suppression subtractive hybridization libraries, respectively. The BLAST analysis of 98 EST sequences suggests a nearly defined function for about 64% of them, and they were grouped into 11 different functional categories. All 98 ILP primer pairs showed a high level of cross-species amplification in two millets and two nonmillets species ranging from 90% to 100%, with a mean of ∼97%. The mean observed heterozygosity and Nei's average gene diversity 0.016 and 0.171, respectively, established the efficiency of the ILP markers for distinguishing the foxtail millet accessions. Based on 26 ILP markers, a reasonable dendrogram of 45 foxtail millet accessions was constructed, demonstrating the utility of ILP markers in germplasm characterizations and genomic relationships in millets and nonmillets species.
Single-molecule DNA unzipping reveals asymmetric modulation of a transcription factor by its binding site sequence and context

PubMed Central

Rudnizky, Sergei; Khamis, Hadeel; Malik, Omri; Squires, Allison H; Meller, Amit; Melamed, Philippa

2018-01-01

Abstract Most functional transcription factor (TF) binding sites deviate from their ‘consensus’ recognition motif, although their sites and flanking sequences are often conserved across species. Here, we used single-molecule DNA unzipping with optical tweezers to study how Egr-1, a TF harboring three zinc fingers (ZF1, ZF2 and ZF3), is modulated by the sequence and context of its functional sites in the Lhb gene promoter. We find that both the core 9 bp bound to Egr-1 in each of the sites, and the base pairs flanking them, modulate the affinity and structure of the protein–DNA complex. The effect of the flanking sequences is asymmetric, with a stronger effect for the sequence flanking ZF3. Characterization of the dissociation time of Egr-1 revealed that a local, mechanical perturbation of the interactions of ZF3 destabilizes the complex more effectively than a perturbation of the ZF1 interactions. Our results reveal a novel role for ZF3 in the interaction of Egr-1 with other proteins and the DNA, providing insight on the regulation of Lhb and other genes by Egr-1. Moreover, our findings reveal the potential of small changes in DNA sequence to alter transcriptional regulation, and may shed light on the organization of regulatory elements at promoters. PMID:29253225
Novel green tissue-specific synthetic promoters and cis-regulatory elements in rice.

PubMed

Wang, Rui; Zhu, Menglin; Ye, Rongjian; Liu, Zuoxiong; Zhou, Fei; Chen, Hao; Lin, Yongjun

2015-12-11

As an important part of synthetic biology, synthetic promoter has gradually become a hotspot in current biology. The purposes of the present study were to synthesize green tissue-specific promoters and to discover green tissue-specific cis-elements. We first assembled several regulatory sequences related to tissue-specific expression in different combinations, aiming to obtain novel green tissue-specific synthetic promoters. GUS assays of the transgenic plants indicated 5 synthetic promoters showed green tissue-specific expression patterns and different expression efficiencies in various tissues. Subsequently, we scanned and counted the cis-elements in different tissue-specific promoters based on the plant cis-elements database PLACE and the rice cDNA microarray database CREP for green tissue-specific cis-element discovery, resulting in 10 potential cis-elements. The flanking sequence of one potential core element (GEAT) was predicted by bioinformatics. Then, the combination of GEAT and its flanking sequence was functionally identified with synthetic promoter. GUS assays of the transgenic plants proved its green tissue-specificity. Furthermore, the function of GEAT flanking sequence was analyzed in detail with site-directed mutagenesis. Our study provides an example for the synthesis of rice tissue-specific promoters and develops a feasible method for screening and functional identification of tissue-specific cis-elements with their flanking sequences at the genome-wide level in rice.
Characterization and Expression of the Lucina pectinata Oxygen and Sulfide Binding Hemoglobin Genes

PubMed Central

López-Garriga, Juan; Cadilla, Carmen L.

2016-01-01

The clam Lucina pectinata lives in sulfide-rich muds and houses intracellular symbiotic bacteria that need to be supplied with hydrogen sulfide and oxygen. This clam possesses three hemoglobins: hemoglobin I (HbI), a sulfide-reactive protein, and hemoglobin II (HbII) and III (HbIII), which are oxygen-reactive. We characterized the complete gene sequence and promoter regions for the oxygen reactive hemoglobins and the partial structure and promoters of the HbI gene from Lucina pectinata. We show that HbI has two mRNA variants, where the 5’end had either a sequence of 96 bp (long variant) or 37 bp (short variant). The gene structure of the oxygen reactive Hbs is defined by having 4-exons/3-introns with conservation of intron location at B12.2 and G7.0 and the presence of pre-coding introns, while the partial gene structure of HbI has the same intron conservation but appears to have a 5-exon/ 4-intron structure. A search for putative transcription factor binding sites (TFBSs) was done with the promoters for HbII, HbIII, HbI short and HbI long. The HbII, HbIII and HbI long promoters showed similar predicted TFBSs. We also characterized MITE-like elements in the HbI and HbII gene promoters and intronic regions that are similar to sequences found in other mollusk genomes. The gene expression levels of the clam Hbs, from sulfide-rich and sulfide-poor environments showed a significant decrease of expression in the symbiont-containing tissue for those clams in a sulfide-poor environment, suggesting that the sulfide concentration may be involved in the regulation of these proteins. Gene expression evaluation of the two HbI mRNA variants indicated that the longer variant is expressed at higher levels than the shorter variant in both environments. PMID:26824233
Combination of retinitis pigmentosa and hearing loss caused by a novel mutation in PRPH2 and a known mutation in GJB2: importance for differential diagnosis of Usher syndrome.

PubMed

Fakin, Ana; Zupan, Andrej; Glavač, Damjan; Hawlina, Marko

2012-12-15

Purpose of this study was to molecularly characterize a family in which two brothers (46 and 36 years) presented with a combination of retinitis pigmentosa (RP) and severe sensorineural hearing loss while father and sister (71 and 41 years) presented with isolated RP. Retinal phenotype was compared with phenotype of 17 patients with Usher syndrome type 1. Ophthalmological examination included assessment of Snellen visual acuity, color vision with Ishihara tables, Goldmann perimetry (targets II/1-4) and microperimetry. Fundus autofluorescence imaging and optical coherence tomography were performed. Direct sequencing of all coding exons and flanking intronic sequences of GJB2 (gap junction protein, beta 2) and PRPH2 (peripherin 2) genes was performed in younger brother. Other family members were analyzed with sequencing (GJB2), high resolution melt analysis (GJB2) or restriction enzymes (PRPH2). Brothers with hearing loss were found to carry a homozygous c.35 delG mutation in GJB2, the most common mutation associated with recessive hearing loss. All patients were found to carry a novel heterozygous mutation c.389T>C (p.Leu130Pro) on PRPH2. Age of onset was higher in PRPH2 than USH1 patients, however with some overlap. Differentiation from retinal phenotype of USH1 could only be made in the oldest patient, who retained good central visual function after more than three decades of disease. Copyright © 2012 Elsevier Ltd. All rights reserved.
Flanking sequence determination and specific PCR identification of transgenic wheat B102-1-2.

PubMed

Cao, Jijuan; Xu, Junyi; Zhao, Tongtong; Cao, Dongmei; Huang, Xin; Zhang, Piqiao; Luan, Fengxia

2014-01-01

The exogenous fragment sequence and flanking sequence between the exogenous fragment and recombinant chromosome of transgenic wheat B102-1-2 were successfully acquired using genome walking technology. The newly acquired exogenous fragment encoded the full-length sequence of transformed genes with transformed plasmid and corresponding functional genes including ubi, vector pBANF-bar, vector pUbiGUSPlus, vector HSP, reporter vector pUbiGUSPlus, promoter ubiquitin, and coli DH1. A specific polymerase chain reaction (PCR) identification method for transgenic wheat B102-1-2 was established on the basis of designed primers according to flanking sequence. This established specific PCR strategy was validated by using transgenic wheat, transgenic corn, transgenic soybean, transgenic rice, and non-transgenic wheat. A specifically amplified target band was observed only in transgenic wheat B102-1-2. Therefore, this method is characterized by high specificity, high reproducibility, rapid identification, and excellent accuracy for the identification of transgenic wheat B102-1-2.
Gene structure and evolution of transthyretin in the order Chiroptera.

PubMed

Khwanmunee, Jiraporn; Leelawatwattana, Ladda; Prapunpoj, Porntip

2016-02-01

Bats are mammals in the order Chiroptera. Although many extensive morphologic and molecular genetics analyses have been attempted, phylogenetic relationships of bats has not been completely resolved. The paraphyly of microbats is of particular controversy that needs to be confirmed. In this study, we attempted to use the nucleotide sequence of transthyretin (TTR) intron 1 to resolve the relationship among bats. To explore its utility, the complete sequences of TTR gene and intron 1 region of bats in Vespertilionidae: genus Eptesicus (Eptesicus fuscus) and genus Myotis (Myotis brandtii, Myotis davidii, and Myotis lucifugus), and Pteropodidae (Pteropus alecto and Pteropus vampyrus) were extracted from the retrieved sequences, whereas those of Rhinoluphus affinis and Scotophilus kuhlii were amplified and sequenced. The derived overall amino sequences of bat TTRs were found to be very similar to those in other eutherians but differed from those in other classes of vertebrates. However, missing of amino acids from N-terminal or C-terminal region was observed. The phylogenetic analysis of amino acid sequences suggested bat and other eutherian TTRs lineal descent from a single most recent common ancestor which differed from those of non-placental mammals and the other classes of vertebrates. The splicing of bat TTR precursor mRNAs was similar to those of other eutherian but different from those of marsupial, bird, reptile and amphibian. Based on TTR intron 1 sequence, the inferred evolutionary relationship within Chiroptera revealed more closely relatedness of R. affinis to megabats than to microbats. Accordingly, the paraphyly of microbats was suggested.
Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.

PubMed

Yin, Changchuan

2015-04-01

To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
The Mitochondrial Genome of the Prasinophyte Prasinoderma coloniale Reveals Two Trans-Spliced Group I Introns in the Large Subunit rRNA Gene

PubMed Central

Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude

2013-01-01

Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications of this interesting observation for trans-splicing of group I introns. PMID:24386369
[Molecular characterization of 71 cases of glucose-6-phosphate dehydrogenase deficiency in Hainan province].

PubMed

Huang, Dong-Ai; Wang, Xiao-Ying; Wang, Zheng; Zhou, Dai-Feng; Cai, Wang-Wei

2007-04-01

To molecularly analyze in Han and Li individuals of glucose-6-phosphate dehydrogenase deficiency in Hainan, China. The amplification refractory mutation system (ARMS) was employed to detect G1376T, G1388A and A95G mutations. The coding regions and flanking intronic regions from the second to the thirteenth exons of G6PD gene was analyzed by DNA sequencing to characterize the gene mutations in samples without G1376T, G1388A and A95G mutations. Among 29 Han cases of G6PD deficiency, 11 had G1376T (37.9%), 2 G1388A (6.9%), 1 G1376T and G1388A (3.4%) and 1 G1376T and A95G (3.4%) were identified. Mutations of G1376T, G1388A, A95G and their complex accounted for 51.7% of G6PD deficiency in the Han individuals. Among 42 Li cases of G6PD deficiency, 25 had G1376T (59.5%), 6 G1388A (14.3%), 2 A95G (4.8%), 4 G1376T and G1388A (9.5%), 1 G1376T and A95G (2.4% )were identified. These mutations accounted for 90.5% of the Li individuals. Gene mutation of 18 cases (14 Han and 4 Li individuals) remained unknown. Sequencing results of the 18 samples indicated that one case had a single base of T deletion at nucleotide 636 or 637 in the 5th intron (IVS-5 636 or 637 T del) and two cases had C1311T with IVS-11 T93C mutation. G6PD G1376T and G1388A are the most common mutations in the populations of the Han and Li nationalities in Hainan. The IVS-5 636 or 637 T del mutation is first reported in Chinese, and the complex mutation of G1376T/A95G is first found in the Li nationality.
A novel mutation in SOX3 polyalanine tract: a case of Kabuki syndrome with combined pituitary hormone deficiency harboring double mutations in MLL2 and SOX3.

PubMed

Takagi, Masaki; Ishii, Tomohiro; Torii, Chiharu; Kosaki, Kenjiro; Hasegawa, Tomonobu

2014-12-01

Both duplications encompassing SOX3 and loss-of function mutations in SOX3 have been reported in a minor portion of X-linked isolated growth hormone deficiency (GHD) or combined pituitary hormone deficiency (CPHD) patients with or without mental retardation. We report a Japanese male patient with molecularly confirmed Kabuki syndrome who was found to have CPHD. We analyzed all coding exons and flanking introns of currently known nine genes responsible for CPHD by PCR-based sequencing. In this CPHD patient, we identified a novel hemizygous 21-base pair deletion, resulting in the loss of 7 alanine residues from polyalanine (PA) tracts of SOX3. The clinically and endocrinologically normal mother of the patient carried the same deletion in a heterozygous manner. In vitro experiments showed that the del 7A SOX3 had increased transactivation of the HESX1 promoter. Our study provides additional evidence that deletion in PA tracts of SOX3 is associated with hypopituitarism. Female carriers of SOX3 PA tract deletions will show a broad phenotypic spectrum, ranging from clinically normal to CPHD.
Differential splicing of human androgen receptor pre-mRNA in X-linked reifenstein syndrome, because of a deletion involving a putative branch site

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ris-Stalpers, C.; Verleun-Mooijman, M.C.T.; Blaeij, T.J.P. de

1994-04-01

The analysis of the androgen receptor (AR) gene, mRNA, and protein in a subject with X-linked Reifenstein syndrome (partial androgen insensitivity) is reported. The presence of two mature AR transcripts in genital skin fibroblasts of the patient is established, and, by reverse transcriptase-PCR and RNase transcription analysis, the wild-type transcript and a transcript in which exon 3 sequences are absent without disruption of the translational reading frame are identified. Sequencing and hybridization analysis show a deletion of >6 kb in intron 2 of the human AR gene, starting 18 bp upstream of exon 3. The deletion includes the putative branch-pointmore » sequence (BPS) but not the acceptor splice site on the intron 2/exon 3 boundary. The deletion of the putative intron 2 BPS results in 90% inhibition of wild-type splicing. The mutant transcript encodes an AR protein lacking the second zinc finger of the DNA-binding domain. Western/immunoblotting analysis is used to show that the mutant AR protein is expressed in genital skin fibroblasts of the patient. The residual 10% wild-type transcript can be the result of the use of a cryptic BPS located 63 bp upstream of the intron 2/exon 3 boundary of the mutant AR gene. The mutated AR protein has no transcription-activating potential and does not influence the transactivating properties of the wild-type AR, as tested in cotransfection studies. It is concluded that the partial androgen-insensitivity syndrome of this patient is the consequence of the limited amount of wild-type AR protein expressed in androgen target cells, resulting from the deletion of the intron 2 putative BPS. 42 refs., 6 figs., 1 tab.« less
Localization, structure and polymorphism of two paralogous Xenopus laevis mitochondrial malate dehydrogenase genes.

PubMed

Tlapakova, Tereza; Krylov, Vladimir; Macha, Jaroslav

2005-01-01

Two paralogous mitochondrial malate dehydrogenase 2 (Mdh2) genes of Xenopus laevis have been cloned and sequenced, revealing 95% identity. Fluorescence in-situ hybridization (FISH) combined with tyramide amplification discriminates both genes; Mdh2a was localized into chromosome q3 and Mdh2b into chromosome q8. One kb cDNA probes detect both genes with 85% accuracy. The remaining signals were on the paralogous counterpart. Introns interrupt coding sequences at the same nucleotide as defined for mouse. Restriction polymorphism has been detected in the first intron of Mdh2a, while the individual variability in intron 6 of Mdh2b gene is represented by an insertion of incomplete retrotransposon L1Xl. Rates of nucleotide substitutions indicate that both genes are under similar evolutionary constraints. X. laevis Mdh2 genes can be used as markers for physical mapping and linkage analysis.

Developmental expression of a regulatory gene is programmed at the level of splicing.

PubMed Central

Chou, T B; Zachar, Z; Bingham, P M

1987-01-01

We report sequence and transcript structures for a 6191-base chromosomal segment containing the presumptive regulatory gene from Drosophila, suppressor-of-white-apricot [su(wa)]. Our results indicate that su(wa) expression is controlled by regulating occurrence of specific splices. Seven introns are removed from the su(wa) primary transcript during precellular blastoderm development. The sequence of this mature RNA indicates that it is a conventional messenger RNA. In contrast, after cellular blastoderm the first two of these introns cease to be efficiently removed. The mature RNAs resulting from this failure to remove the first two introns have structures quite unexpected of mRNAs. We propose that postcellular blastoderm su(wa) expression is repressed by preventing splices necessary to produce a functional mRNA. Implications and mechanisms are discussed. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. PMID:2832151
Pseudoexon activation increases phenotype severity in a Becker muscular dystrophy patient.

PubMed

Greer, Kane; Mizzi, Kayla; Rice, Emily; Kuster, Lukas; Barrero, Roberto A; Bellgard, Matthew I; Lynch, Bryan J; Foley, Aileen Reghan; O Rathallaigh, Eoin; Wilton, Steve D; Fletcher, Sue

2015-07-01

We report a dystrophinopathy patient with an in-frame deletion of DMD exons 45-47, and therefore a genetic diagnosis of Becker muscular dystrophy, who presented with a more severe than expected phenotype. Analysis of the patient DMD mRNA revealed an 82 bp pseudoexon, derived from intron 44, that disrupts the reading frame and is expected to yield a nonfunctional dystrophin. Since the sequence of the pseudoexon and canonical splice sites does not differ from the reference sequence, we concluded that the genomic rearrangement promoted recognition of the pseudoexon, causing a severe dystrophic phenotype. We characterized the deletion breakpoints and identified motifs that might influence selection of the pseudoexon. We concluded that the donor splice site was strengthened by juxtaposition of intron 47, and loss of intron 44 silencer elements, normally located downstream of the pseudoexon donor splice site, further enhanced pseudoexon selection and inclusion in the DMD transcript in this patient.
A deep intronic mutation in the SLC12A3 gene leads to Gitelman syndrome.

PubMed

Nozu, Kandai; Iijima, Kazumoto; Nozu, Yoshimi; Ikegami, Ei; Imai, Takehide; Fu, Xue Jun; Kaito, Hiroshi; Nakanishi, Koichi; Yoshikawa, Norishige; Matsuo, Masafumi

2009-11-01

Many mutations have been detected in the SLC12A3 gene of Gitelman syndrome (GS, OMIM 263800) patients. In previous studies, only one mutant allele was detected in approximately 20 to 41% of patients with GS; however, the exact reason for the nonidentification has not been established. In this study, we used RT-PCR using mRNA to investigate for the first time transcript abnormalities caused by deep intronic mutation. Direct sequencing analysis of leukocyte DNA identified one base insertion in exon 6 (c.818_819insG), but no mutation was detected in another allele. We analyzed RNA extracted from leukocytes and urine sediments and detected unknown sequence containing 238bp between exons 13 and 14. The genomic DNA analysis of intron 13 revealed a single-base substitution (c.1670-191C>T) that creates a new donor splice site within the intron resulting in the inclusion of a novel cryptic exon in mRNA. This is the first report of creation of a splice site by a deep intronic single-nucleotide change in GS and the first report to detect the onset mechanism in a patient with GS and missing mutation in one allele. This molecular onset mechanism may partly explain the poor success rate of mutation detection in both alleles of patients with GS.
Characterization of 5' end of human thromboxane receptor gene. Organizational analysis and mapping of protein kinase C--responsive elements regulating expression in platelets.

PubMed

D'Angelo, D D; Davis, M G; Houser, W A; Eubank, J J; Ritchie, M E; Dorn, G W

1995-09-01

Platelet thromboxane receptors are acutely and reversibly upregulated after acute myocardial infarction. To determine if platelet thromboxane receptors are under transcriptional control, we isolated and characterized human genomic DNA clones containing the 5' flanking region of the thromboxane receptor gene. The exon-intron structure of the 5' portion of the thromboxane receptor gene was determined initially by comparing the nucleotide sequence of the 5' flanking genomic clone with that of a novel human uterine thromboxane receptor cDNA that extended the mRNA 141 bp further upstream than the previously identified human placental cDNA. A major transcription initiation site was located in three human tissues approximately 560 bp upstream from the translation initiation codon and 380 bp upstream from any previously identified transcription initiation site. The thromboxane receptor gene has neither a TATA nor a CAAT consensus site. Promoter function of the 5' flanking region of the thromboxane receptor gene was evaluated by transfection of thromboxane receptor gene promoter/chloramphenicol acetyltransferase (CAT) chimera plasmids into platelet-like K562 cells. Thromboxane receptor promoter activity, as assessed by CAT expression, was relatively weak but was significantly enhanced by phorbol ester treatment. Functional analysis of 5' deletion constructs in transfected K562 cells and gel mobility shift localized the major phorbol ester-responsive motifs in the thromboxane receptor gene promoter to a cluster of activator protein-2 (AP-2) binding consensus sites located approximately 1.8 kb 5' from the transcription initiation site. These studies are the first to determine the structure and organization of the 5' end of the thromboxane receptor gene and demonstrate that thromboxane receptor gene expression can be regulated by activation of protein kinase C via induction of an AP-2-like nuclear factor binding to upstream promoter elements. These findings strongly suggest that the mechanism for previously described upregulation of platelet thromboxane receptors after acute myocardial infarction is increased thromboxane receptor gene transcription in platelet-progenitor cells.
Cytochrome oxidase subunit II gene in mitochondria of Oenothera has no intron

PubMed Central

Hiesel, Rudolf; Brennicke, Axel

1983-01-01

The cytochrome oxidase subunit II gene has been localized in the mitochondrial genome of Oenothera berteriana and the nucleotide sequence has been determined. The coding sequence contains 777 bp and, unlike the corresponding gene in Zea mays, is not interrupted by an intron. No TGA codon is found within the open reading frame. The codon CGG, as in the maize gene, is used in place of tryptophan codons of corresponding genes in other organisms. At position 742 in the Oenothera sequence the TGG of maize is changed into a CGG codon, where Trp is conserved as the amino acid in other organisms. Homologous sequences occur more than once in the mitochondrial genome as several mitochondrial DNA species hybridize with DNA probes of the cytochrome oxidase subunit II gene. ImagesFig. 5. PMID:16453484
Functional Analysis of Maize Silk-Specific ZmbZIP25 Promoter.

PubMed

Li, Wanying; Yu, Dan; Yu, Jingjuan; Zhu, Dengyun; Zhao, Qian

2018-03-12

ZmbZIP25 ( Zea mays bZIP (basic leucine zipper) transcription factor 25) is a function-unknown protein that belongs to the D group of the bZIP transcription factor family. RNA-seq data showed that the expression of ZmbZIP25 was tissue-specific in maize silks, and this specificity was confirmed by RT-PCR (reverse transcription-polymerase chain reaction). In situ RNA hybridization showed that ZmbZIP25 was expressed exclusively in the xylem of maize silks. A 5' RACE (rapid amplification of cDNA ends) assay identified an adenine residue as the transcription start site of the ZmbZIP25 gene. To characterize this silk-specific promoter, we isolated and analyzed a 2450 bp (from -2083 to +367) and a 2600 bp sequence of ZmbZIP25 (from -2083 to +517, the transcription start site was denoted +1). Stable expression assays in Arabidopsis showed that the expression of the reporter gene GUS driven by the 2450 bp ZmbZIP25 5'-flanking fragment occurred exclusively in the papillae of Arabidopsis stigmas. Furthermore, transient expression assays in maize indicated that GUS and GFP expression driven by the 2450 bp ZmbZIP25 5'-flanking sequences occurred only in maize silks and not in other tissues. However, no GUS or GFP expression was driven by the 2600 bp ZmbZIP25 5'-flanking sequences in either stable or transient expression assays. A series of deletion analyses of the 2450 bp ZmbZIP25 5'-flanking sequence was performed in transgenic Arabidopsis plants, and probable elements prediction analysis revealed the possible presence of negative regulatory elements within the 161 bp region from -1117 to -957 that were responsible for the specificity of the ZmbZIP25 5'-flanking sequence.
Functional Analysis of Maize Silk-Specific ZmbZIP25 Promoter

PubMed Central

Li, Wanying; Yu, Dan; Yu, Jingjuan; Zhu, Dengyun; Zhao, Qian

2018-01-01

ZmbZIP25 (Zea mays bZIP (basic leucine zipper) transcription factor 25) is a function-unknown protein that belongs to the D group of the bZIP transcription factor family. RNA-seq data showed that the expression of ZmbZIP25 was tissue-specific in maize silks, and this specificity was confirmed by RT-PCR (reverse transcription-polymerase chain reaction). In situ RNA hybridization showed that ZmbZIP25 was expressed exclusively in the xylem of maize silks. A 5′ RACE (rapid amplification of cDNA ends) assay identified an adenine residue as the transcription start site of the ZmbZIP25 gene. To characterize this silk-specific promoter, we isolated and analyzed a 2450 bp (from −2083 to +367) and a 2600 bp sequence of ZmbZIP25 (from −2083 to +517, the transcription start site was denoted +1). Stable expression assays in Arabidopsis showed that the expression of the reporter gene GUS driven by the 2450 bp ZmbZIP25 5′-flanking fragment occurred exclusively in the papillae of Arabidopsis stigmas. Furthermore, transient expression assays in maize indicated that GUS and GFP expression driven by the 2450 bp ZmbZIP25 5′-flanking sequences occurred only in maize silks and not in other tissues. However, no GUS or GFP expression was driven by the 2600 bp ZmbZIP25 5′-flanking sequences in either stable or transient expression assays. A series of deletion analyses of the 2450 bp ZmbZIP25 5′-flanking sequence was performed in transgenic Arabidopsis plants, and probable elements prediction analysis revealed the possible presence of negative regulatory elements within the 161 bp region from −1117 to −957 that were responsible for the specificity of the ZmbZIP25 5′-flanking sequence. PMID:29534529
SinEx DB: a database for single exon coding sequences in mammalian genomes.

PubMed

Jorquera, Roddy; Ortiz, Rodrigo; Ossandon, F; Cárdenas, Juan Pablo; Sepúlveda, Rene; González, Carolina; Holmes, David S

2016-01-01

Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as 'single exon genes' (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in its CDS. Whereas, many public databases of Eukaryotic multi-exon genes are available, there are only two specialized databases for SEGs. The present work addresses the need for a more extensive and diverse database by creating SinEx DB, a publicly available, searchable database of predicted SEGs from 10 completely sequenced mammalian genomes including human. SinEx DB houses the DNA and protein sequence information of these SEGs and includes their functional predictions (KOG) and the relative distribution of these functions within species. The information is stored in a relational database built with My SQL Server 5.1.33 and the complete dataset of SEG sequences and their functional predictions are available for downloading. SinEx DB can be interrogated by: (i) a browsable phylogenetic schema, (ii) carrying out BLAST searches to the in-house SinEx DB of SEGs and (iii) via an advanced search mode in which the database can be searched by key words and any combination of searches by species and predicted functions. SinEx DB provides a rich source of information for advancing our understanding of the evolution and function of SEGs.Database URL: www.sinex.cl. © The Author(s) 2016. Published by Oxford University Press.
Molecular characterization of beta-tubulin from Phakopsora pachyrhizi, the causal agent of Asian soybean rust

PubMed Central

2010-01-01

β-tubulins are structural components of microtubules and the targets of benzimidazole fungicides used to control many diseases of agricultural importance. Intron polymorphisms in the intron-rich genes of these proteins have been used in phylogeographic investigations of phytopathogenic fungi. In this work, we sequenced 2764 nucleotides of the β-tubulin gene (Pp tubB) in samples of Phakopsora pachyrhizi collected from seven soybean fields in Brazil. Pp tubB contained an open reading frame of 1341 nucleotides, including nine exons and eight introns. Exon length varied from 14 to 880 nucleotides, whereas intron length varied from 76 to 102 nucleotides. The presence of only four polymorphic sites limited the usefulness of Pp tubB for phylogeographic studies in P. pachyrhizi. The gene structures of Pp tubB and orthologous β-tubulin genes of Melampsora lini and Uromyces viciae-fabae were highly conserved. The amino acid substitutions in β-tubulin proteins associated with the onset of benzimidazole resistance in model organisms, especially at His 6 , Glu 198 and Phe 200 , were absent from the predicted sequence of the P. pachyrhizi β-tubulin protein. PMID:21637494
Seabream ghrelin: cDNA cloning, genomic organization and promoter studies.

PubMed

Yeung, Chung-Man; Chan, Chi-Bun; Woo, Norman Y S; Cheng, Christopher H K

2006-05-01

Recent studies have indicated that ghrelin stimulates growth hormone release from the pituitary via the growth hormone secretagogue receptor (GHSR). We have previously isolated two GHSR subtypes from the pituitary of the black seabream Acanthopagrus schlegeli. In the present study, we have cloned and characterized ghrelin from the same fish species at both the cDNA and gene levels. The full-length seabream ghrelin cDNA, isolated from sea-bream stomach using a novel approach by exploiting a single conserved region in the coding region, was found to encode a prepropeptide of 107 amino acids, with the predicted mature ghrelin peptide consisting of 20 amino acids (GSSFLSPSQKPQNRGKSSRV). Embedded in this full-length cDNA is a putative fish orthologue of the recently reported mammalian obestatin peptide. The ghrelin gene in black seabream, obtained by genomic PCR, was found to encompass four exons and three introns, possessing the same structural organization as in tilapia and goldfish, but different from that in rainbow trout. In addition, a 2230-bp 5'-flanking region of the seabream ghrelin gene was obtained by genome walking. Sequence analysis revealed that, as in the case of the human ghrelin gene, there is neither a GC box nor a CAAT box present in the isolated 5'-flanking region. However, a number of putative transcription factor-binding sites different from the human counterpart were found in the 5'-flanking region of the seabream ghrelin gene, suggesting that different cis- and trans-acting elements are involved in controlling their gene expression. Functional activity of this 5'-flanking region was examined by cloning it into the pGL3-Basic vector upstream of the luciferase reporter gene and transfected into various cell lines. Positive promoter activity could only be recorded in the colon-derived Caco-2 cells, suggesting that the cloned 5'-flanking region represents the functional promoter of the seabream ghrelin gene, which exhibits tissue-specific promoter activity. Using reverse transcriptase PCR analysis, expression of ghrelin was detected only in the seabream stomach, but not in the other tissues examined, including the brain, gill, intestine, kidney, liver and spleen. This stomach-specific expression of ghrelin in seabream is subject to regulation, as administration of growth hormone or ipamorelin to the fish in vivo was demonstrated to enhance its expression. Reminiscent of the homologous upregulation found in the transcriptional control of the seabream GHSR gene, a similar homologous regulatory mechanism might also exist in controlling the expression of seabream ghrelin. The identification of both GHSR and ghrelin from a single fish species would facilitate our subsequent studies on the elucidation of the physiological functions of the ghrelin/GHSR system in teleost. The possible existence of obestatin in teleost opens up new research avenues on the somatotropic axis in fish.
The legumin gene family: structure of a B type gene of Vicia faba and a possible legumin gene specific regulatory element.

PubMed Central

Bäumlein, H; Wobus, U; Pustell, J; Kafatos, F C

1986-01-01

The field bean, Vicia faba L. var. minor, possesses two sub-families of 11 S legumin genes named A and B. We isolated from a genomic library a B-type gene (LeB4) and determined its primary DNA sequence. Gene LeB4 codes for a 484 amino acid residue prepropolypeptide, encompassing a signal peptide of 22 amino acid residues, an acidic, very hydrophilic alpha-chain of 281 residues and a basic, somewhat hydrophobic beta-chain of 181 residues. The latter two coding regions are immediately contiguous, but each is interrupted by a short intron. Type A legumin genes from soybean and pea are known to have introns in the same two positions, in addition to an extra intron (within the alpha-coding sequence). Sequence comparisons of legumin genes from these three plants revealed a highly conserved sequence element of at least 28 bp, centered at approximately 100 bp upstream of each cap site. The element is absent from the equivalent position of all non-legumin and other plant and fungal genes examined. We tentatively name this element "legumin box" and suggest that it may have a function in the regulation of legumin gene expression. PMID:3960730
An RRM–ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion

PubMed Central

Collins, Katherine M.; Kainov, Yaroslav A.; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A.

2017-01-01

Abstract RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1–ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. PMID:28379442
An RRM-ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion.

PubMed

Collins, Katherine M; Kainov, Yaroslav A; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A; Makeyev, Eugene V; Ramos, Andres

2017-06-20

RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1-ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
High-throughput sequencing of human plasma RNA by using thermostable group II intron reverse transcriptases

PubMed Central

Qin, Yidan; Yao, Jun; Wu, Douglas C.; Nottingham, Ryan M.; Mohr, Sabine; Hunicke-Smith, Scott; Lambowitz, Alan M.

2016-01-01

Next-generation RNA-sequencing (RNA-seq) has revolutionized transcriptome profiling, gene expression analysis, and RNA-based diagnostics. Here, we developed a new RNA-seq method that exploits thermostable group II intron reverse transcriptases (TGIRTs) and used it to profile human plasma RNAs. TGIRTs have higher thermostability, processivity, and fidelity than conventional reverse transcriptases, plus a novel template-switching activity that can efficiently attach RNA-seq adapters to target RNA sequences without RNA ligation. The new TGIRT-seq method enabled construction of RNA-seq libraries from <1 ng of plasma RNA in <5 h. TGIRT-seq of RNA in 1-mL plasma samples from a healthy individual revealed RNA fragments mapping to a diverse population of protein-coding gene and long ncRNAs, which are enriched in intron and antisense sequences, as well as nearly all known classes of small ncRNAs, some of which have never before been seen in plasma. Surprisingly, many of the small ncRNA species were present as full-length transcripts, suggesting that they are protected from plasma RNases in ribonucleoprotein (RNP) complexes and/or exosomes. This TGIRT-seq method is readily adaptable for profiling of whole-cell, exosomal, and miRNAs, and for related procedures, such as HITS-CLIP and ribosome profiling. PMID:26554030
MitoRes: a resource of nuclear-encoded mitochondrial genes and their products in Metazoa.

PubMed

Catalano, Domenico; Licciulli, Flavio; Turi, Antonio; Grillo, Giorgio; Saccone, Cecilia; D'Elia, Domenica

2006-01-24

Mitochondria are sub-cellular organelles that have a central role in energy production and in other metabolic pathways of all eukaryotic respiring cells. In the last few years, with more and more genomes being sequenced, a huge amount of data has been generated providing an unprecedented opportunity to use the comparative analysis approach in studies of evolution and functional genomics with the aim of shedding light on molecular mechanisms regulating mitochondrial biogenesis and metabolism. In this context, the problem of the optimal extraction of representative datasets of genomic and proteomic data assumes a crucial importance. Specialised resources for nuclear-encoded mitochondria-related proteins already exist; however, no mitochondrial database is currently available with the same features of MitoRes, which is an update of the MitoNuc database extensively modified in its structure, data sources and graphical interface. It contains data on nuclear-encoded mitochondria-related products for any metazoan species for which this type of data is available and also provides comprehensive sequence datasets (gene, transcript and protein) as well as useful tools for their extraction and export. MitoRes http://www2.ba.itb.cnr.it/MitoRes/ consolidates information from publicly external sources and automatically annotates them into a relational database. Additionally, it also clusters proteins on the basis of their sequence similarity and interconnects them with genomic data. The search engine and sequence management tools allow the query/retrieval of the database content and the extraction and export of sequences (gene, transcript, protein) and related sub-sequences (intron, exon, UTR, CDS, signal peptide and gene flanking regions) ready to be used for in silico analysis. The tool we describe here has been developed to support lab scientists and bioinformaticians alike in the characterization of molecular features and evolution of mitochondrial targeting sequences. The way it provides for the retrieval and extraction of sequences allows the user to overcome the obstacles encountered in the integrative use of different bioinformatic resources and the completeness of the sequence collection allows intra- and interspecies comparison at different biological levels (gene, transcript and protein).
Development and in-house validation of the event-specific polymerase chain reaction detection methods for genetically modified soybean MON89788 based on the cloned integration flanking sequence.

PubMed

Liu, Jia; Guo, Jinchao; Zhang, Haibo; Li, Ning; Yang, Litao; Zhang, Dabing

2009-11-25

Various polymerase chain reaction (PCR) methods were developed for the execution of genetically modified organism (GMO) labeling policies, of which an event-specific PCR detection method based on the flanking sequence of exogenous integration is the primary trend in GMO detection due to its high specificity. In this study, the 5' and 3' flanking sequences of the exogenous integration of MON89788 soybean were revealed by thermal asymmetric interlaced PCR. The event-specific PCR primers and TaqMan probe were designed based upon the revealed 5' flanking sequence, and the qualitative and quantitative PCR assays were established employing these designed primers and probes. In qualitative PCR, the limit of detection (LOD) was about 0.01 ng of genomic DNA corresponding to 10 copies of haploid soybean genomic DNA. In the quantitative PCR assay, the LOD was as low as two haploid genome copies, and the limit of quantification was five haploid genome copies. Furthermore, the developed PCR methods were in-house validated by five researchers, and the validated results indicated that the developed event-specific PCR methods can be used for identification and quantification of MON89788 soybean and its derivates.
Amplification and chromosomal dispersion of human endogenous retroviral sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Steele, P.E.; Martin, M.A.; Rabson, A.B.

1986-09-01

Endogenous retroviral sequences have undergone amplification events involving both viral and flanking cellular sequences. The authors cloned members of an amplified family of full-length endogenous retroviral sequences. Genomic blotting, employing a flanking cellular DNA probe derived from a member of this family, revealed a similar array of reactive bands in both humans and chimpanzees, indicating that an amplification event involving retroviral and associated cellular DNA sequences occurred before the evolutionary separation of these two primates. Southern analyses of restricted somatic cell hybrid DNA preparations suggested that endogenous retroviral segments are widely dispersed in the human genome and that amplification andmore » dispersion events may be linked.« less
The complete chloroplast DNA sequences of the charophycean green algae Staurastrum and Zygnema reveal that the chloroplast genome underwent extensive changes during the evolution of the Zygnematales

PubMed Central

Turmel, Monique; Otis, Christian; Lemieux, Claude

2005-01-01

Background The Streptophyta comprise all land plants and six monophyletic groups of charophycean green algae. Phylogenetic analyses of four genes from three cellular compartments support the following branching order for these algal lineages: Mesostigmatales, Chlorokybales, Klebsormidiales, Zygnematales, Coleochaetales and Charales, with the last lineage being sister to land plants. Comparative analyses of the Mesostigma viride (Mesostigmatales) and land plant chloroplast genome sequences revealed that this genome experienced many gene losses, intron insertions and gene rearrangements during the evolution of charophyceans. On the other hand, the chloroplast genome of Chaetosphaeridium globosum (Coleochaetales) is highly similar to its land plant counterparts in terms of gene content, intron composition and gene order, indicating that most of the features characteristic of land plant chloroplast DNA (cpDNA) were acquired from charophycean green algae. To gain further insight into when the highly conservative pattern displayed by land plant cpDNAs originated in the Streptophyta, we have determined the cpDNA sequences of the distantly related zygnematalean algae Staurastrum punctulatum and Zygnema circumcarinatum. Results The 157,089 bp Staurastrum and 165,372 bp Zygnema cpDNAs encode 121 and 125 genes, respectively. Although both cpDNAs lack an rRNA-encoding inverted repeat (IR), they are substantially larger than Chaetosphaeridium and land plant cpDNAs. This increased size is explained by the expansion of intergenic spacers and introns. The Staurastrum and Zygnema genomes differ extensively from one another and from their streptophyte counterparts at the level of gene order, with the Staurastrum genome more closely resembling its land plant counterparts than does Zygnema cpDNA. Many intergenic regions in Zygnema cpDNA harbor tandem repeats. The introns in both Staurastrum (8 introns) and Zygnema (13 introns) cpDNAs represent subsets of those found in land plant cpDNAs. They represent 16 distinct insertion sites, only five of which are shared by the two zygnematalean genomes. Three of these insertions sites have not been identified in Chaetosphaeridium cpDNA. Conclusion The chloroplast genome experienced substantial changes in overall structure, gene order, and intron content during the evolution of the Zygnematales. Most of the features considered earlier as typical of land plant cpDNAs probably originated before the emergence of the Zygnematales and Coleochaetales. PMID:16236178
Genome-wide mapping of alternative splicing in Arabidopsis thaliana

PubMed Central

Filichkin, Sergei A.; Priest, Henry D.; Givan, Scott A.; Shen, Rongkun; Bryant, Douglas W.; Fox, Samuel E.; Wong, Weng-Keen; Mockler, Todd C.

2010-01-01

Alternative splicing can enhance transcriptome plasticity and proteome diversity. In plants, alternative splicing can be manifested at different developmental stages, and is frequently associated with specific tissue types or environmental conditions such as abiotic stress. We mapped the Arabidopsis transcriptome at single-base resolution using the Illumina platform for ultrahigh-throughput RNA sequencing (RNA-seq). Deep transcriptome sequencing confirmed a majority of annotated introns and identified thousands of novel alternatively spliced mRNA isoforms. Our analysis suggests that at least ∼42% of intron-containing genes in Arabidopsis are alternatively spliced; this is significantly higher than previous estimates based on cDNA/expressed sequence tag sequencing. Random validation confirmed that novel splice isoforms empirically predicted by RNA-seq can be detected in vivo. Novel introns detected by RNA-seq were substantially enriched in nonconsensus terminal dinucleotide splice signals. Alternative isoforms with premature termination codons (PTCs) comprised the majority of alternatively spliced transcripts. Using an example of an essential circadian clock gene, we show that intron retention can generate relatively abundant PTC+ isoforms and that this specific event is highly conserved among diverse plant species. Alternatively spliced PTC+ isoforms can be potentially targeted for degradation by the nonsense mediated mRNA decay (NMD) surveillance machinery or regulate the level of functional transcripts by the mechanism of regulated unproductive splicing and translation (RUST). We demonstrate that the relative ratios of the PTC+ and reference isoforms for several key regulatory genes can be considerably shifted under abiotic stress treatments. Taken together, our results suggest that like in animals, NMD and RUST may be widespread in plants and may play important roles in regulating gene expression. PMID:19858364
Mutational Analysis of the Rhodopsin Gene in Sector Retinitis Pigmentosa.

PubMed

Napier, Maria L; Durga, Dash; Wolsley, Clive J; Chamney, Sarah; Alexander, Sharon; Brennan, Rosie; Simpson, David A; Silvestri, Giuliana; Willoughby, Colin E

2015-01-01

To determine the role of rhodopsin (RHO) gene mutations in patients with sector retinitis pigmentosa (RP) from Northern Ireland. A case series of sector RP in a tertiary ocular genetics clinic. Four patients with sector RP were recruited from the Royal Victoria Hospital (Belfast, Northern Ireland) and Altnagelvin Hospital (Londonderry, Northern Ireland) following informed consent. The diagnosis of sector RP was based on clinical examination, International Society for Clinical Electrophysiology of Vision (ISCEV) standard electrophysiology, and visual field analysis. DNA was extracted from peripheral blood leucocytes and the coding regions and adjacent flanking intronic sequences of the RHO gene were polymerase chain reaction (PCR) amplified and cycle sequenced. Rhodopsin mutational status. A heterozygous missense mutation in RHO (c.173C > T) resulting in a non-conservative substitution of threonine to methionine (p. Thr58Met) was identified in one patient and was absent from 360 control individuals. This non-conservative substitution (p.Thr58Met) replaces a highly evolutionary conserved polar hydrophilic threonine residue with a non-polar hydrophobic methionine residue at position 58 near the cytoplasmic border of helix A of RHO. The study identified a RHO gene mutation (p.Thr58Met) not previously reported in RP in a patient with sector RP. These findings outline the phenotypic variability associated with RHO mutations. It has been proposed that the regional effects of RHO mutations are likely to result from interplay between mutant alleles and other genetic, epigenetic and environmental factors.

Identification of fibrillin 1 gene mutations in patients with bicuspid aortic valve (BAV) without Marfan syndrome

PubMed Central

2014-01-01

Background Bicuspid aortic valve (BAV) is the most frequent congenital heart disease with frequent involvement in thoracic aortic dilatation, aneurysm and dissection. Although BAV and Marfan syndrome (MFS) share some clinical features, and some MFS patients with BAV display mutations in FBN1, the gene encoding fibrillin-1, the genetic background of isolated BAV is poorly defined. Methods Ten consecutive BAV patients [8 men, age range 24–42 years] without MFS were clinically characterized. BAV phenotype and function, together with evaluation of aortic morphology, were comprehensively assessed by Doppler echocardiography. Direct sequencing of each FBN1 exon with flanking intron sequences was performed on eight patients. Results We detected three FBN1 mutations in two patients (aged 24 and 25 years) displaying aortic root aneurysm ≥50 mm and moderate aortic regurgitation. In particular, one patient had two mutations (p.Arg2726Trp and p.Arg636Gly) one of which has been previously associated with variable Marfanoid phenotypes. The other patient showed a pArg529Gln substitution reported to be associated with an incomplete MFS phenotype. Conclusions The present findings enlarge the clinical spectrum of isolated BAV to include patients with BAV without MFS who have involvement of FBN1 gene. These results underscore the importance of accurate phenotyping of BAV aortopathy and of clinical characterization of BAV patients, including investigation of systemic connective tissue manifestations and genetic testing. PMID:24564502
Identification of three novel mutations by studying the molecular genetics of Maple Syrup Urine Disease (MSUD) in the Lebanese population.

PubMed

Tabbouche, Omar; Saker, Amer; Mountain, Harry

2014-01-01

Maple Syrup Urine Disease (MSUD) is a genetically heterogeneous metabolic disorder that is transmitted in an autosomal recessive manner. According to clinical data, MSUD prevalence in Lebanon is expected to be higher than the International prevalence because of consanguineous marriage. Novel mutations are still getting detected by using DNA sequencing for mutation analysis in MSUD patients. In the current study, we have extracted DNA from Lebanese MSUD patients in order to amplify the exonic and flanking intronic regions of the genes implicated in MSUD ( BCKDHA , BCKDHB , and DBT ) and sequenced the resultant amplified products to assess the molecular genetics of MSUD in the Lebanese population studied. All of the mutations identified occurred in the homozygous state, which reflects the high rate of consanguineous marriage in Lebanon. In the current study, we have identified one previously cited mutation and three novel mutations not previously described in the scientific literature. The identified mutations were distributed as follows: three patients (60%) had two nucleotide substitutions in the DBT gene (c.224G>A and c.1430T>G), one patient (20%) had a gross deletion in the BCKDHA gene (c.488_1167+3del), and one patient (20%) had a small deletion in the BCKDHB gene (c.92_102del). The majority of the mutations identified in the Lebanese MSUD patients occurred in the DBT gene. Consanguineous marriage is a major risk factor for the prevalence of MSUD in Lebanon.
Generation and characterization of anti-MUC4 monoclonal antibodies reactive with normal and cancer cells in humans.

PubMed

Moniaux, Nicolas; Varshney, Grish Chandra; Chauhan, Subhash Chand; Copin, Marie Christine; Jain, Maneesh; Wittel, Uwe A; Andrianifahanana, Mahefatiana; Aubert, Jean-Pierre; Batra, Surinder Kumar

2004-02-01

We have previously cloned the full-length cDNA (approximately 28 Kb) and established the complete genomic organization (25 exons/introns over 100 kb) of the human MUC4 mucin. This large molecule is predicted to protrude over 2 microm above the cell surface, in which MUC4alpha is an extracellular mucin-type glycoprotein subunit and MUC4beta is the transmembrane subunit. Over two thirds of the encoded protein sequence consists of 16-amino-acid tandem repeats (TR), which are flanked by unique sequences. In this study we generated and characterized monoclonal antibodies (MAbs) directed against the TR region of MUC4. Mice were immunized with a KLH-conjugated MUC4 TR peptide, STGDTTPLPVTDTSSV. Several clones were purified by three rounds of limited dilutions and stable clones presenting a sustained antibody production were selected for subsequent characterization. Antibodies were tested for their reactivity and specificity to recognize the MUC4 peptide and further screened by enzyme-linked immunosorbent assay (ELISA) and Western blotting analyses. One of the MAbs (8G7) was strongly reactive against the MUC4 peptide and with native MUC4 from human tissues or pancreatic cancer cells in Western blotting, immunohistochemistry, and confocal analysis. Anti-MUC4 MAb may represent a powerful tool for the study of MUC4 function under normal and pathological conditions and for diagnosis of solid tumors including those in the breast, pancreas, lungs, and ovaries.
Interlaboratory transfer of a PCR multiplex method for simultaneous detection of four genetically modified maize lines: Bt11, MON810, T25, and GA21.

PubMed

Hernández, Marta; Rodríguez-Lázaro, David; Zhang, David; Esteve, Teresa; Pla, Maria; Prat, Salomé

2005-05-04

The number of cultured hectares and commercialized genetically modified organisms (GMOs) has increased exponentially in the past 9 years. Governments in many countries have established a policy of labeling all food and feed containing or produced by GMOs. Consequently, versatile, laboratory-transferable GMO detection methods are in increasing demand. Here, we describe a qualitative PCR-based multiplex method for simultaneous detection and identification of four genetically modified maize lines: Bt11, MON810, T25, and GA21. The described system is based on the use of five primers directed to specific sequences in these insertion events. Primers were used in a single optimized multiplex PCR reaction, and sequences of the amplified fragments are reported. The assay allows amplification of the MON810 event from the 35S promoter to the hsp intron yielding a 468 bp amplicon. Amplification of the Bt11 and T25 events from the 35S promoter to the PAT gene yielded two different amplicons of 280 and 177 bp, respectively, whereas amplification of the 5' flanking region of the GA21 gave rise to an amplicon of 72 bp. These fragments are clearly distinguishable in agarose gels and have been reproduced successfully in a different laboratory. Hence, the proposed method comprises a rapid, simple, reliable, and sensitive (down to 0.05%) PCR-based assay, suitable for detection of these four GM maize lines in a single reaction.
mRNA-based detection of rare CFTR mutations improves genetic diagnosis of cystic fibrosis in populations with high genetic heterogeneity.

PubMed

Felício, V; Ramalho, A S; Igreja, S; Amaral, M D

2017-03-01

Even with advent of next generation sequencing complete sequencing of large disease-associated genes and intronic regions is economically not feasible. This is the case of cystic fibrosis transmembrane conductance regulator (CFTR), the gene responsible for cystic fibrosis (CF). Yet, to confirm a CF diagnosis, proof of CFTR dysfunction needs to be obtained, namely by the identification of two disease-causing mutations. Moreover, with the advent of mutation-based therapies, genotyping is an essential tool for CF disease management. There is, however, still an unmet need to genotype CF patients by fast, comprehensive and cost-effective approaches, especially in populations with high genetic heterogeneity (and low p.F508del incidence), where CF is now emerging with new diagnosis dilemmas (Brazil, Asia, etc). Herein, we report an innovative mRNA-based approach to identify CFTR mutations in the complete coding and intronic regions. We applied this protocol to genotype individuals with a suspicion of CF and only one or no CFTR mutations identified by routine methods. It successfully detected multiple intronic mutations unlikely to be detected by CFTR exon sequencing. We conclude that this is a rapid, robust and inexpensive method to detect any CFTR coding/intronic mutation (including rare ones) that can be easily used either as primary approach or after routine DNA analysis. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Genomic structure of rat 3alpha-hydroxysteroid/dihydrodiol dehydrogenase (3alpha-HSD/DD, AKR1C9).

PubMed

Lin, H K; Hung, C F; Moore, M; Penning, T M

1999-11-01

Rat liver 3alpha-hydroxysteroid/dihydrodiol dehydrogenase (3alpha-HSD/DD) is a member of the aldo-keto reductase (AKR) superfamily. It is involved in the inactivation of steroid hormones and the metabolic activation of polycyclic aromatic hydrocarbons (PAH) by converting trans-dihydrodiols into reactive and redox-active o-quinones. The structure of the 5'-flanking region of the gene and factors involved in the constitutive and regulated expression of this gene have been reported [H.-K. Lin, T.M. Penning, Cloning, sequencing, and functional analysis of the 5'-flanking region of the rat 3alpha-hydroxysteroid/dihydrodiol dehydrogenase gene, Cancer Res. 55 (1995) 4105-4113]. We now describe the complete genomic structure of the rat type 1 3alpha-HSD/DD gene. Charon 4A and P1 genomic clones contained at least three rat genes (type 1, type 2 and type 3 3alpha-HSD/DD) each of which encoded for the same open reading frame (ORF) but differed in their exon-intron organization. 5'-RACE confirmed that the type 1 3alpha-HSD/DD gene encodes for the dominant transcript in rat liver and it was the regulation of this gene that was previously studied. The rat type 1 3alpha-HSD/DD gene is 30 kb in length and consists of nine exons and eight introns. Exon 9 encodes +931 to 966 bp of the ORF and the 1292 bp 3'-UTR implicated in mRNA stability. This genomic structure is nearly identical to the homologous human genes, type 1 3alpha-HSD (chlordecone reductase/DD4, AKR1C4), type 2 3alpha-HSD (AKR1C3) and type 3 3alpha-HSD (bile-acid binding protein, AKR1C2) genes. Three different cDNA's containing identical ORFs for 3alpha-HSD have been reported suggesting that all three genes may be expressed in rat liver. Using 5' primers corresponding to the 5'-UTR's of the three different cDNA's only one PCR fragment was obtained and corresponded to the type 1 3alpha-HSD/DD gene. These data suggested that the type 2 and type 3 3alpha-HSD/DD genes are not abundantly expressed in rat liver. It is unknown whether the type 2 and type 3 3alpha-HSD/DD genes represent pseudo-genes or whether they represent genes that are differentially expressed in other rat tissues.
Ribosomal DNA sequence divergence and group I introns within the Leucostoma species L. cinctum, L. persoonii, and L. parapersoonii sp. nov., ascomycetes that cause Cytospora canker of fruit trees.

PubMed

Adams, Gerard C; Surve-Iyer, Rupa S; Iezzoni, Amy F

2002-01-01

Leucostoma species that are the causal agents of Cytospora canker of stone and pome fruit trees were studied in detail. DNA sequence of the internal transcribed spacer regions and the 5.8S of the nuclear ribosomal DNA operon (ITS rDNA) supplied sufficient characters to assess the phylogenetic relationships among species of Leucostoma, Valsa, Valsella, and related anamorphs in Cytospora. Parsimony analysis of the aligned sequence divided Cytospora isolates from fruit trees into clades that generally agreed with the morphological species concepts, and with some of the phenetic groupings (PG 1-6) identified previously by isozyme analysis and cultural characteristics. Phylogenetic analysis inferred that isolates of L. persoonii formed two well-resolved clades distinct from isolates of L. cinctum. Phylogenetic analysis of the ITS rDNA, isozyme analysis, and cultural characteristics supported the inference that L. persoonii groups PG 2 and PG 3 were populations of a new species apparently more genetically different from L. persoonii PG 1 than from isolates representative of L. massariana, L. niveum, L. translucens, and Valsella melastoma. The new species, L. parapersoonii, was described. A diverse collection of isolates of L. cinctum, L. persoonii, and L. parapersoonii were examined for genetic variation using restriction fragment length polymorphism (RFLP) analysis of the ITS rDNA and the five prime end of the large subunit of the rDNA (LSU rDNA). HinfI and HpaII endonucleases were each useful in dividing the Leucostoma isolates into RFLP profiles corresponding to the isozyme phenetic groups, PG 1-6. RFLP analysis was more effective than isozyme analysis in uncovering variation among isolates of L. persoonii PG 1, but less effective within L. cinctum populations. Isolates representative of seven of the L. persoonii formae speciales proposed by G. Défago in 1935 were found to be genetically diverse isolates of PG 1. Two large insertions, 415 and 309 nucleotides long, in the small subunit (SSU) of the nuclear rDNA of L. cinctum were identified as Group 1 introns; intron 1 at position 943 and intron 2 at position 1199. The two introns were found to be consistently present in isolates of L. cinctum PG 4 and PG 5 and absent from L. cinctum PG 6 isolates, despite the similarity of the ITS sequence and teleomorph morphology. Intron 1 was of subgroup 1C1 whereas intron 2 was of an unknown subgroup. RFLP patterns and presence/absence of introns were useful characters for expediting the identification of cultures of Leucostoma isolated from stone and pome fruit cankers. RFLP patterns from 13 endonucleases provided an effective method for selecting an array of diverse PG 1 isolates useful in screening plant germplasm for disease-resistance.
Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics.

PubMed

Edwards, Scott V; Cloutier, Alison; Baker, Allan J

2017-11-01

Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600-∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. © The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biologists.
Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics

PubMed Central

Cloutier, Alison; Baker, Allan J.

2017-01-01

Abstract Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600–∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. PMID:28637293
The origin of introns and their role in eukaryogenesis: a compromise solution to the introns-early versus introns-late debate?

PubMed Central

Koonin, Eugene V

2006-01-01

Background Ever since the discovery of 'genes in pieces' and mRNA splicing in eukaryotes, origin and evolution of spliceosomal introns have been considered within the conceptual framework of the 'introns early' versus 'introns late' debate. The 'introns early' hypothesis, which is closely linked to the so-called exon theory of gene evolution, posits that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. Under this scenario, the absence of spliceosomal introns in prokaryotes is considered to be a result of "genome streamlining". The 'introns late' hypothesis counters that spliceosomal introns emerged only in eukaryotes, and moreover, have been inserted into protein-coding genes continuously throughout the evolution of eukaryotes. Beyond the formal dilemma, the more substantial side of this debate has to do with possible roles of introns in the evolution of eukaryotes. Results I argue that several lines of evidence now suggest a coherent solution to the introns-early versus introns-late debate, and the emerging picture of intron evolution integrates aspects of both views although, formally, there seems to be no support for the original version of introns-early. Firstly, there is growing evidence that spliceosomal introns evolved from group II self-splicing introns which are present, usually, in small numbers, in many bacteria, and probably, moved into the evolving eukaryotic genome from the α-proteobacterial progenitor of the mitochondria. Secondly, the concept of a primordial pool of 'virus-like' genetic elements implies that self-splicing introns are among the most ancient genetic entities. Thirdly, reconstructions of the ancestral state of eukaryotic genes suggest that the last common ancestor of extant eukaryotes had an intron-rich genome. Thus, it appears that ancestors of spliceosomal introns, indeed, have existed since the earliest stages of life's evolution, in a formal agreement with the introns-early scenario. However, there is no evidence that these ancient introns ever became widespread before the emergence of eukaryotes, hence, the central tenet of introns-early, the role of introns in early evolution of proteins, has no support. However, the demonstration that numerous introns invaded eukaryotic genes at the outset of eukaryotic evolution and that subsequent intron gain has been limited in many eukaryotic lineages implicates introns as an ancestral feature of eukaryotic genomes and refutes radical versions of introns-late. Perhaps, most importantly, I argue that the intron invasion triggered other pivotal events of eukaryogenesis, including the emergence of the spliceosome, the nucleus, the linear chromosomes, the telomerase, and the ubiquitin signaling system. This concept of eukaryogenesis, in a sense, revives some tenets of the exon hypothesis, by assigning to introns crucial roles in eukaryotic evolutionary innovation. Conclusion The scenario of the origin and evolution of introns that is best compatible with the results of comparative genomics and theoretical considerations goes as follows: self-splicing introns since the earliest stages of life's evolution – numerous spliceosomal introns invading genes of the emerging eukaryote during eukaryogenesis – subsequent lineage-specific loss and gain of introns. The intron invasion, probably, spawned by the mitochondrial endosymbiont, might have critically contributed to the emergence of the principal features of the eukaryotic cell. This scenario combines aspects of the introns-early and introns-late views. Reviewers this article was reviewed by W. Ford Doolittle, James Darnell (nominated by W. Ford Doolittle), William Martin, and Anthony Poole. PMID:16907971
Age at cancer onset in germline TP53 mutation carriers: association with polymorphisms in predicted G-quadruplex structures

PubMed Central

Hainaut, Pierre

2014-01-01

Germline TP53 mutations predispose to multiple cancers defining Li-Fraumeni/Li-Fraumeni-like syndrome (LFS/LFL), a disease with large individual disparities in cancer profiles and age of onset. G-quadruplexes (G4s) are secondary structural motifs occurring in guanine tracks, with regulatory effects on DNA and RNA. We analyzed 85 polymorphisms within or near five predicted G4s in TP53 in search of modifiers of penetrance of LFS/LFL in Brazilian cancer families with (n = 35) or without (n = 110) TP53 mutations. Statistical analyses stratified on family structure showed that cancer tended to occur ~15 years later in mutation carriers who also carried the variant alleles of two polymorphisms within predicted G4-forming regions, rs17878362 (TP53 PIN3, 16 bp duplication in intron 3; P = 0.082) and rs17880560 (6 bp duplication in 3′ flanking region; P = 0.067). Haplotype analysis showed that this inverse association was driven by the polymorphic status of the remaining wild-type (WT) haplotype in mutation carriers: in carriers with a WT haplotype containing at least one variant allele of rs17878362 or rs17880560, cancer occurred ~15 years later than in carriers with other WT haplotypes (P = 0.019). No effect on age of cancer onset was observed in subjects without a TP53 mutation. The G4 in intron 3 has been shown to regulate alternative p53 messenger RNA splicing, whereas the biological roles of predicted G4s in the 3′ flanking region remain to be elucidated. In conclusion, this study demonstrates that G4 polymorphisms in haplotypes of the WT TP53 allele have an impact on LFS/LFL penetrance in germline TP53 mutation carriers. PMID:24336192
Mitochondrial Group II Introns, Cytochrome c Oxidase, and Senescence in Podospora anserina†

PubMed Central

Begel, Odile; Boulay, Jocelyne; Albert, Beatrice; Dufour, Eric; Sainsard-Chanet, Annie

1999-01-01

Podospora anserina is a filamentous fungus with a limited life span. It expresses a degenerative syndrome called senescence, which is always associated with the accumulation of circular molecules (senDNAs) containing specific regions of the mitochondrial chromosome. A mobile group II intron (α) has been thought to play a prominent role in this syndrome. Intron α is the first intron of the cytochrome c oxidase subunit I gene (COX1). Mitochondrial mutants that escape the senescence process are missing this intron, as well as the first exon of the COX1 gene. We describe here the first mutant of P. anserina that has the α sequence precisely deleted and whose cytochrome c oxidase activity is identical to that of wild-type cells. The integration site of the intron is slightly modified, and this change prevents efficient homing of intron α. We show here that this mutant displays a senescence syndrome similar to that of the wild type and that its life span is increased about twofold. The introduction of a related group II intron into the mitochondrial genome of the mutant does not restore the wild-type life span. These data clearly demonstrate that intron α is not the specific senescence factor but rather an accelerator or amplifier of the senescence process. They emphasize the role that intron α plays in the instability of the mitochondrial chromosome and the link between this instability and longevity. Our results strongly support the idea that in Podospora, “immortality” can be acquired not by the absence of intron α but rather by the lack of active cytochrome c oxidase. PMID:10330149
Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae.

PubMed

Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

2014-10-01

Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3' terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species.
Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae

PubMed Central

Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

2014-01-01

Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3′ terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species. PMID:24736785
Parallel Loss of Plastid Introns and Their Maturase in the Genus Cuscuta

PubMed Central

McNeal, Joel R.; Kuehl, Jennifer V.; Boore, Jeffrey L.; Leebens-Mack, Jim; dePamphilis, Claude W.

2009-01-01

Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta. PMID:19543388
Parallel loss of plastid introns and their maturase in the genus Cuscuta.

PubMed

McNeal, Joel R; Kuehl, Jennifer V; Boore, Jeffrey L; Leebens-Mack, Jim; dePamphilis, Claude W

2009-06-19

Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta.
Expression of a polyubiquitin promoter isolated from Gladiolus.

PubMed

Joung, Young Hee; Kamo, Kathryn

2006-10-01

A polyubiquitin promoter (GUBQ1) including its 5'UTR and intron was isolated from the floral monocot Gladiolus because high levels of expression could not be obtained using publicly available promoters isolated from either cereals or dicots. Sequencing of the promoter revealed highly conserved 5' and 3' intron splicing sites for the 1.234 kb intron. The coding sequence of the first two ubiquitin genes showed the highest homology (87 and 86%, respectively) to the ubiquitin genes of Nicotiana tabacum and Oryza sativa RUBQ2. Transient expression following gene gun bombardment showed that relative levels of GUS activity with the GUBQ1 promoter were comparable to the CaMV 35S promoter in gladiolus, tobacco, rose, rice, and the floral monocot freesia. The highest levels of GUS expression with GUBQ1 were attained with Gladiolus. The full-length GUBQ1 promoter including 5'UTR and intron were necessary for maximum GUS expression in Gladiolus. The relative GUS activity for the promoter only was 9%, and the activity for the promoter with 5'UTR and 399 bp of the full-length 1.234 kb intron was 41%. Arabidopsis plants transformed with uidA under GUBQ1 showed moderate GUS expression throughout young leaves and in the vasculature of older leaves. The highest levels of transient GUS expression in Gladiolus have been achieved using the GUBQ1 promoter. This promoter should be useful for genetic engineering of disease resistance in Gladiolus, rose, and freesia, where high levels of gene expression are important.
Limited MHC class I intron 2 repertoire variation in bonobos.

PubMed

de Groot, Natasja G; Heijmans, Corrine M C; Helsen, Philippe; Otting, Nel; Pereboom, Zjef; Stevens, Jeroen M G; Bontrop, Ronald E

2017-10-01

Common chimpanzees (Pan troglodytes) experienced a selective sweep, probably caused by a SIV-like virus, which targeted their MHC class I repertoire. Based on MHC class I intron 2 data analyses, this selective sweep took place about 2-3 million years ago. As a consequence, common chimpanzees have a skewed MHC class I repertoire that is enriched for allotypes that are able to recognise conserved regions of the SIV proteome. The bonobo (Pan paniscus) shared an ancestor with common chimpanzees approximately 1.5 to 2 million years ago. To investigate whether the signature of this selective sweep is also detectable in bonobos, the MHC class I gene repertoire of two bonobo panels comprising in total 29 animals was investigated by Sanger sequencing. We identified 14 Papa-A, 20 Papa-B and 11 Papa-C alleles, of which eight, five and eight alleles, respectively, have not been reported previously. Within this pool of MHC class I variation, we recovered only 2 Papa-A, 3 Papa-B and 6 Papa-C intron 2 sequences. As compared to humans, bonobos appear to have an even more diminished MHC class I intron 2 lineage repertoire than common chimpanzees. This supports the notion that the selective sweep may have predated the speciation of common chimpanzees and bonobos. The further reduction of the MHC class I intron 2 lineage repertoire observed in bonobos as compared to the common chimpanzee may be explained by a founding effect or other subsequent selective processes.
A conserved intronic U1 snRNP-binding sequence promotes trans-splicing in Drosophila

PubMed Central

Gao, Jun-Li; Fan, Yu-Jie; Wang, Xiu-Ye; Zhang, Yu; Pu, Jia; Li, Liang; Shao, Wei; Zhan, Shuai; Hao, Jianjiang

2015-01-01

Unlike typical cis-splicing, trans-splicing joins exons from two separate transcripts to produce chimeric mRNA and has been detected in most eukaryotes. Trans-splicing in trypanosomes and nematodes has been characterized as a spliced leader RNA-facilitated reaction; in contrast, its mechanism in higher eukaryotes remains unclear. Here we investigate mod(mdg4), a classic trans-spliced gene in Drosophila, and report that two critical RNA sequences in the middle of the last 5′ intron, TSA and TSB, promote trans-splicing of mod(mdg4). In TSA, a 13-nucleotide (nt) core motif is conserved across Drosophila species and is essential and sufficient for trans-splicing, which binds U1 small nuclear RNP (snRNP) through strong base-pairing with U1 snRNA. In TSB, a conserved secondary structure acts as an enhancer. Deletions of TSA and TSB using the CRISPR/Cas9 system result in developmental defects in flies. Although it is not clear how the 5′ intron finds the 3′ introns, compensatory changes in U1 snRNA rescue trans-splicing of TSA mutants, demonstrating that U1 recruitment is critical to promote trans-splicing in vivo. Furthermore, TSA core-like motifs are found in many other trans-spliced Drosophila genes, including lola. These findings represent a novel mechanism of trans-splicing, in which RNA motifs in the 5′ intron are sufficient to bring separate transcripts into close proximity to promote trans-splicing. PMID:25838544
Mitogenome rearrangement in the cold-water scleractinian coral Lophelia pertusa (Cnidaria, Anthozoa) involves a long-term evolving group I intron.

PubMed

Emblem, Åse; Karlsen, Bård Ove; Evertsen, Jussi; Johansen, Steinar D

2011-11-01

Group I introns are genetic insertion elements that invade host genomes in a wide range of organisms. In metazoans, however, group I introns are extremely rare, so far only identified within mitogenomes of hexacorals and some sponges. We sequenced the complete mitogenome of the cold-water scleractinian coral Lophelia pertusa, the dominating deep sea reef-building coral species in the North Atlantic Ocean. The mitogenome (16,150 bp) has the same gene content but organized in a unique gene order compared to that of other known scleractinian corals. A complex group I intron (6460 bp) inserted in the ND5 gene (position 717) was found to host seven essential mitochondrial protein genes and one ribosomal RNA gene. Phylogenetic analysis supports a vertical inheritance pattern of the ND5-717 intron among hexacoral mitogenomes with no examples of intron loss. Structural assessments of the Lophelia intron revealed an unusual organization that lacks the universally conserved ωG at the 3' end, as well as a highly compact RNA core structure with overlapping ribozyme and protein coding capacities. Based on phylogenetic and structural analyses we reconstructed the evolutionary history of ND5-717, from its ancestral protist origin, through intron loss in some early metazoan lineages, and into a compulsory feature with functional implications in hexacorals. Copyright © 2011 Elsevier Inc. All rights reserved.

A novel LPL intronic variant: g.18704C>A identified by re-sequencing Kuwaiti Arab samples is associated with high-density lipoprotein, very low-density lipoprotein and triglyceride lipid levels.

PubMed

Al-Bustan, Suzanne A; Al-Serri, Ahmad; Annice, Babitha G; Alnaqeeb, Majed A; Al-Kandari, Wafa Y; Dashti, Mohammed

2018-01-01

The role interethnic genetic differences play in plasma lipid level variation across populations is a global health concern. Several genes involved in lipid metabolism and transport are strong candidates for the genetic association with lipid level variation especially lipoprotein lipase (LPL). The objective of this study was to re-sequence the full LPL gene in Kuwaiti Arabs, analyse the sequence variation and identify variants that could attribute to variation in plasma lipid levels for further genetic association. Samples (n = 100) of an Arab ethnic group from Kuwait were analysed for sequence variation by Sanger sequencing across the 30 Kb LPL gene and its flanking sequences. A total of 293 variants including 252 single nucleotide polymorphisms (SNPs) and 39 insertions/deletions (InDels) were identified among which 47 variants (32 SNPs and 15 InDels) were novel to Kuwaiti Arabs. This study is the first to report sequence data and analysis of frequencies of variants at the LPL gene locus in an Arab ethnic group with a novel "rare" variant (LPL:g.18704C>A) significantly associated to HDL (B = -0.181; 95% CI (-0.357, -0.006); p = 0.043), TG (B = 0.134; 95% CI (0.004-0.263); p = 0.044) and VLDL (B = 0.131; 95% CI (-0.001-0.263); p = 0.043) levels. Sequence variation in Kuwaiti Arabs was compared to other populations and was found to be similar with regards to the number of SNPs, InDels and distribution of the number of variants across the LPL gene locus and minor allele frequency (MAF). Moreover, comparison of the identified variants and their MAF with other reports provided a list of 46 potential variants across the LPL gene to be considered for future genetic association studies. The findings warrant further investigation into the association of g.18704C>A with lipid levels in other ethnic groups and with clinical manifestations of dyslipidemia.
Complete plastid genome sequence of Daucus carota: implications for biotechnology and phylogeny of angiosperms.

PubMed

Ruhlman, Tracey; Lee, Seung-Bum; Jansen, Robert K; Hostetler, Jessica B; Tallon, Luke J; Town, Christopher D; Daniell, Henry

2006-08-31

Carrot (Daucus carota) is a major food crop in the US and worldwide. Its capacity for storage and its lifecycle as a biennial make it an attractive species for the introduction of foreign genes, especially for oral delivery of vaccines and other therapeutic proteins. Until recently efforts to express recombinant proteins in carrot have had limited success in terms of protein accumulation in the edible tap roots. Plastid genetic engineering offers the potential to overcome this limitation, as demonstrated by the accumulation of BADH in chromoplasts of carrot taproots to confer exceedingly high levels of salt resistance. The complete plastid genome of carrot provides essential information required for genetic engineering. Additionally, the sequence data add to the rapidly growing database of plastid genomes for assessing phylogenetic relationships among angiosperms. The complete carrot plastid genome is 155,911 bp in length, with 115 unique genes and 21 duplicated genes within the IR. There are four ribosomal RNAs, 30 distinct tRNA genes and 18 intron-containing genes. Repeat analysis reveals 12 direct and 2 inverted repeats > or = 30 bp with a sequence identity > or = 90%. Phylogenetic analysis of nucleotide sequences for 61 protein-coding genes using both maximum parsimony (MP) and maximum likelihood (ML) were performed for 29 angiosperms. Phylogenies from both methods provide strong support for the monophyly of several major angiosperm clades, including monocots, eudicots, rosids, asterids, eurosids II, euasterids I, and euasterids II. The carrot plastid genome contains a number of dispersed direct and inverted repeats scattered throughout coding and non-coding regions. This is the first sequenced plastid genome of the family Apiaceae and only the second published genome sequence of the species-rich euasterid II clade. Both MP and ML trees provide very strong support (100% bootstrap) for the sister relationship of Daucus with Panax in the euasterid II clade. These results provide the best taxon sampling of complete chloroplast genomes and the strongest support yet for the sister relationship of Caryophyllales to the asterids. The availability of the complete plastid genome sequence should facilitate improved transformation efficiency and foreign gene expression in carrot through utilization of endogenous flanking sequences and regulatory elements.
A novel LPL intronic variant: g.18704C>A identified by re-sequencing Kuwaiti Arab samples is associated with high-density lipoprotein, very low-density lipoprotein and triglyceride lipid levels

PubMed Central

Al-Serri, Ahmad; Annice, Babitha G.; Alnaqeeb, Majed A.; Al-Kandari, Wafa Y.; Dashti, Mohammed

2018-01-01

The role interethnic genetic differences play in plasma lipid level variation across populations is a global health concern. Several genes involved in lipid metabolism and transport are strong candidates for the genetic association with lipid level variation especially lipoprotein lipase (LPL). The objective of this study was to re-sequence the full LPL gene in Kuwaiti Arabs, analyse the sequence variation and identify variants that could attribute to variation in plasma lipid levels for further genetic association. Samples (n = 100) of an Arab ethnic group from Kuwait were analysed for sequence variation by Sanger sequencing across the 30 Kb LPL gene and its flanking sequences. A total of 293 variants including 252 single nucleotide polymorphisms (SNPs) and 39 insertions/deletions (InDels) were identified among which 47 variants (32 SNPs and 15 InDels) were novel to Kuwaiti Arabs. This study is the first to report sequence data and analysis of frequencies of variants at the LPL gene locus in an Arab ethnic group with a novel “rare” variant (LPL:g.18704C>A) significantly associated to HDL (B = -0.181; 95% CI (-0.357, -0.006); p = 0.043), TG (B = 0.134; 95% CI (0.004–0.263); p = 0.044) and VLDL (B = 0.131; 95% CI (-0.001–0.263); p = 0.043) levels. Sequence variation in Kuwaiti Arabs was compared to other populations and was found to be similar with regards to the number of SNPs, InDels and distribution of the number of variants across the LPL gene locus and minor allele frequency (MAF). Moreover, comparison of the identified variants and their MAF with other reports provided a list of 46 potential variants across the LPL gene to be considered for future genetic association studies. The findings warrant further investigation into the association of g.18704C>A with lipid levels in other ethnic groups and with clinical manifestations of dyslipidemia. PMID:29438437
Translocation and gross deletion breakpoints in human inherited disease and cancer II: Potential involvement of repetitive sequence elements in secondary structure formation between DNA ends.

PubMed

Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N

2003-09-01

Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.
Evolutionary Dynamics of the Gametologous CTNNB1 Gene on the Z and W Chromosomes of Snakes.

PubMed

Laopichienpong, Nararat; Muangmai, Narongrit; Chanhome, Lawan; Suntrarachun, Sunutcha; Twilprawat, Panupon; Peyachoknagul, Surin; Srikulnath, Kornsorn

2017-03-01

Snakes exhibit genotypic sex determination with female heterogamety (ZZ males and ZW females), and the state of sex chromosome differentiation also varies among lineages. To investigate the evolutionary history of homologous genes located in the nonrecombining region of differentiated sex chromosomes in snakes, partial sequences of the gametologous CTNNB1 gene were analyzed for 12 species belonging to henophid (Cylindrophiidae, Xenopeltidae, and Pythonidae) and caenophid snakes (Viperidae, Elapidae, and Colubridae). Nonsynonymous/synonymous substitution ratios (Ka/Ks) in coding sequences were low (Ka/Ks < 1) between CTNNB1Z and CTNNB1W, suggesting that these 2 genes may have similar functional properties. However, frequencies of intron sequence substitutions and insertion–deletions were higher in CTNNB1Z than CTNNB1W, suggesting that Z-linked sequences evolved faster than W-linked sequences. Molecular phylogeny based on both intron and exon sequences showed the presence of 2 major clades: 1) Z-linked sequences of Caenophidia and 2) W-linked sequences of Caenophidia clustered with Z-linked sequences of Henophidia, which suggests that the sequence divergence between CTNNB1Z and CTNNB1W in Caenophidia may have occurred by the cessation of recombination after the split from Henophidia.
Identification of genes in anonymous DNA sequences. Annual performance report, February 1, 1991--January 31, 1992

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fields, C.A.

1996-06-01

The objective of this project is the development of practical software to automate the identification of genes in anonymous DNA sequences from the human, and other higher eukaryotic genomes. A software system for automated sequence analysis, gm (gene modeler) has been designed, implemented, tested, and distributed to several dozen laboratories worldwide. A significantly faster, more robust, and more flexible version of this software, gm 2.0 has now been completed, and is being tested by operational use to analyze human cosmid sequence data. A range of efforts to further understand the features of eukaryoyic gene sequences are also underway. This progressmore » report also contains papers coming out of the project including the following: gm: a Tool for Exploratory Analysis of DNA Sequence Data; The Human THE-LTR(O) and MstII Interspersed Repeats are subfamilies of a single widely distruted highly variable repeat family; Information contents and dinucleotide compostions of plant intron sequences vary with evolutionary origin; Splicing signals in Drosophila: intron size, information content, and consensus sequences; Integration of automated sequence analysis into mapping and sequencing projects; Software for the C. elegans genome project.« less
PERMANENT GENETIC RESOURCES: Consensus primers of cyp73 genes discriminate willow species and hybrids (Salix, Salicaceae).

PubMed

Trung, Le Quang; VAN Puyvelde, Karolien; Triest, Ludwig

2008-03-01

Consensus primers, based on exon sequences of the cyp73 gene family coding for cinnamate 4-hydroxylase (C4H) of the lignin biosynthesis pathway, were designed for the tetraploid willow species Salix alba and Salix fragilis. Diagnostic alleles at species level were observed among introns of three cyp73 genes and allowed unambiguous detection of the first generation and introgressed hybrids in populations. Progeny analysis of a female S. alba with a male introgressed hybrid confirmed the codominant inheritance of each intron. Sequences of the diagnostic alleles of both species were similar to those found in the hybrids. © 2007 The Authors.
The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus.

PubMed

Gurusamy, Raman; Lee, Do-Hyung; Park, SeonJoo

2016-05-01

The complete chloroplast genome (cpDNA) sequence of Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicine was reported and characterized. The cpDNA of Dianthus superbus var. longicalycinus is 149,539 bp, with 36.3% GC content. A pair of inverted repeats (IRs) of 24,803 bp is separated by a large single-copy region (LSC, 82,805 bp) and a small single-copy region (SSC, 17,128 bp). It encodes 85 protein-coding genes, 36 tRNA genes and 8 rRNA genes. Of 129 individual genes, 13 genes encoded one intron and three genes have two introns.
Influence of flanking sequences on variability in expression levels of an introduced gene in transgenic tobacco plants.

PubMed Central

Dean, C; Jones, J; Favreau, M; Dunsmuir, P; Bedbrook, J

1988-01-01

The petunia rbcS gene SSU301 was introduced into tobacco using Agrobacterium tumefaciens-mediated transformation. The time at which rbcS expression was maximal after transfer of the tobacco plants to the greenhouse was determined. The expression level of the SSU301 gene varied up to 9 fold between individual tobacco plants which had been standardized physiologically as much as possible. The presence of adjacent pUC plasmid sequences did not affect the expression of the SSU301 gene. In an attempt to reduce the between-transformant variability in expression, the SSU301 gene was introduced into tobacco surrounded by 10kb of 5' and 13 kb of 3' DNA sequences which normally flank SSU301 in petunia. The longer flanking regions did not reduce the between-transformant variability of SSU301 gene expression. Images PMID:3174450
Fractal landscapes in biological systems: long-range correlations in DNA and interbeat heart intervals

NASA Technical Reports Server (NTRS)

Stanley, H. E.; Buldyrev, S. V.; Goldberger, A. L.; Hausdorff, J. M.; Havlin, S.; Mietus, J.; Sciortino, F.; Simons, M.

1992-01-01

Here we discuss recent advances in applying ideas of fractals and disordered systems to two topics of biological interest, both topics having common the appearance of scale-free phenomena, i.e., correlations that have no characteristic length scale, typically exhibited by physical systems near a critical point and dynamical systems far from equilibrium. (i) DNA nucleotide sequences have traditionally been analyzed using models which incorporate the possibility of short-range nucleotide correlations. We found, instead, a remarkably long-range power law correlation. We found such long-range correlations in intron-containing genes and in non-transcribed regulatory DNA sequences as well as intragenomic DNA, but not in cDNA sequences or intron-less genes. We also found that the myosin heavy chain family gene evolution increases the fractal complexity of the DNA landscapes, consistent with the intron-late hypothesis of gene evolution. (ii) The healthy heartbeat is traditionally thought to be regulated according to the classical principle of homeostasis, whereby physiologic systems operate to reduce variability and achieve an equilibrium-like state. We found, however, that under normal conditions, beat-to-beat fluctuations in heart rate display long-range power law correlations.
Contribution of Mobile Group II Introns to Sinorhizobium meliloti Genome Evolution.

PubMed

Toro, Nicolás; Martínez-Abarca, Francisco; Molina-Sánchez, María D; García-Rodríguez, Fernando M; Nisa-Martínez, Rafael

2018-01-01

Mobile group II introns are ribozymes and retroelements that probably originate from bacteria. Sinorhizobium meliloti , the nitrogen-fixing endosymbiont of legumes of genus Medicago , harbors a large number of these retroelements. One of these elements, RmInt1, has been particularly successful at colonizing this multipartite genome. Many studies have improved our understanding of RmInt1 and phylogenetically related group II introns, their mobility mechanisms, spread and dynamics within S. meliloti and closely related species. Although RmInt1 conserves the ancient retroelement behavior, its evolutionary history suggests that this group II intron has played a role in the short- and long-term evolution of the S. meliloti genome. We will discuss its proposed role in genome evolution by controlling the spread and coexistence of potentially harmful mobile genetic elements, by ectopic transposition to different genetic loci as a source of early genomic variation and by generating sequence variation after a very slow degradation process, through intron remnants that may have continued to evolve, contributing to bacterial speciation.
Contribution of Mobile Group II Introns to Sinorhizobium meliloti Genome Evolution

PubMed Central

Toro, Nicolás; Martínez-Abarca, Francisco; Molina-Sánchez, María D.; García-Rodríguez, Fernando M.; Nisa-Martínez, Rafael

2018-01-01

Mobile group II introns are ribozymes and retroelements that probably originate from bacteria. Sinorhizobium meliloti, the nitrogen-fixing endosymbiont of legumes of genus Medicago, harbors a large number of these retroelements. One of these elements, RmInt1, has been particularly successful at colonizing this multipartite genome. Many studies have improved our understanding of RmInt1 and phylogenetically related group II introns, their mobility mechanisms, spread and dynamics within S. meliloti and closely related species. Although RmInt1 conserves the ancient retroelement behavior, its evolutionary history suggests that this group II intron has played a role in the short- and long-term evolution of the S. meliloti genome. We will discuss its proposed role in genome evolution by controlling the spread and coexistence of potentially harmful mobile genetic elements, by ectopic transposition to different genetic loci as a source of early genomic variation and by generating sequence variation after a very slow degradation process, through intron remnants that may have continued to evolve, contributing to bacterial speciation. PMID:29670598
The complete chloroplast genome sequence of Euonymus japonicus (Celastraceae).

PubMed

Choi, Kyoung Su; Park, SeonJoo

2016-09-01

The complete chloroplast (cp) genome sequence of the Euonymus japonicus, the first sequenced of the genus Euonymus, was reported in this study. The total length was 157 637 bp, containing a pair of 26 678 bp inverted repeat region (IR), which were separated by small single copy (SSC) region and large single copy (LSC) region of 18 340 bp and 85 941 bp, respectively. This genome contains 107 unique genes, including 74 coding genes, four rRNA genes, and 29 tRNA genes. Seventeen genes contain intron of E. japonicus, of which three genes (clpP, ycf3, and rps12) include two introns. The maximum likelihood (ML) phylogenetic analysis revealed that E. japonicus was closely related to Manihot and Populus.
Factor IX[sub Madrid 2]: A deletion/insertion in Facotr IX gene which abolishes the sequence of the donor junction at the exon IV-intron d splice site

DOE Office of Scientific and Technical Information (OSTI.GOV)

Solera, J.; Magallon, M.; Martin-Villar, J.

1992-02-01

DNA from a patient with severe hemophilia B was evaluated by RFLP analysis, producing results which suggested the existence of a partial deletion within the factor IX gene. The deletion was further localized and characterized by PCR amplification and sequencing. The altered allele has a 4,442-bp deletion which removes both the donor splice site located at the 5[prime] end of intron d and the two last coding nucleotides located at the 3[prime] end of exon IV in the normal factor IX gene; this fragment has been inserted in inverted orientation. Two homologous sequences have been discovered at the ends ofmore » the deleted DNA fragment.« less
Cloning of a CACTA transposon-like insertion in intron I of tomato invertase Lin5 gene and identification of transposase-like sequences of Solanaceae species.

PubMed

Proels, Reinhard K; Roitsch, Thomas

2006-03-01

Very few CACTA transposon-like sequences have been described in Solanaceae species. Sequence information has been restricted to partial transposase (TPase)-like fragments, and no target gene of CACTA-like transposon insertion has been described in tomato to date. In this manuscript, we report on a CACTA transposon-like insertion in intron I of tomato (Lycopersicon esculentum) invertase gene Lin5 and TPase-like sequences of several Solanaceae species. Consensus primers deduced from the TPase region of the tomato CACTA transposon-like element allowed the amplification of similar sequences from various Solanaceae species of different subfamilies including Solaneae (Solanum tuberosum), Cestreae (Nicotiana tabacum) and Datureae (Datura stramonium). This demonstrates the ubiquitous presence of CACTA-like elements in Solanaceae genomes. The obtained partial sequences are highly conserved, and allow further detection and detailed analysis of CACTA-like transposons throughout Solanaceae species. CACTA-like transposon sequences make possible the evaluation of their use for genome analysis, functional studies of genes and the evolutionary relationships between plant species.
Molecular cloning, expression pattern, and 3D structural prediction of the cold inducible RNA-binding protein (CIRP) in Japanese flounder ( Paralichthys olivaceus)

NASA Astrophysics Data System (ADS)

Yang, Xiao; Gao, Jinning; Ma, Liman; Li, Zan; Wang, Wenji; Wang, Zhongkai; Yu, Haiyang; Qi, Jie; Wang, Xubo; Wang, Zhigang; Zhang, Quanqi

2015-02-01

Cold-inducible RNA-binding protein (CIRP) is a kind of RNA binding proteins that plays important roles in many physiological processes. The CIRP has been widely studied in mammals and amphibians since it was first cloned from mammals. On the contrary, there are little reports in teleosts. In this study, the Po CIRP gene of the Japanese flounder was cloned and sequenced. The genomic sequence consists of seven exons and six introns. The putative PoCIRP protein of flounder was 198 amino acid residues long containing the RNA recognition motif (RRM). Phylogenetic analysis showed that the flounder PoCIRP is highly conserved with other teleost CIRPs. The 5' flanking sequence was cloned by genome walking and many transcription factor binding sites were identified. There is a CpGs region located in promoter and exon I region and the methylation state is low. Quantitative real-time PCR analysis uncovered that Po CIRP gene was widely expressed in adult tissues with the highest expression level in the ovary. The mRNA of the Po CIRP was maternally deposited and the expression level of the gene was regulated up during the gastrula and neurula stages. In order to gain the information how the protein interacts with mRNA, we performed the modeling of the 3D structure of the flounder PoCIRP. The results showed a cleft existing the surface of the molecular. Taken together, the results indicate that the CIRP is a multifunctional molecular in teleosts and the findings about the structure provide valuable information for understanding the basis of this protein's function.
Genome-wide association study identifies phospholipase C zeta 1 (PLCz1) as a stallion fertility locus in Hanoverian warmblood horses.

PubMed

Schrimpf, Rahel; Dierks, Claudia; Martinsson, Gunilla; Sieme, Harald; Distl, Ottmar

2014-01-01

A consistently high level of stallion fertility plays an economically important role in modern horse breeding. We performed a genome-wide association study for estimated breeding values of the paternal component of the pregnancy rate per estrus cycle (EBV-PAT) in Hanoverian stallions. A total of 228 Hanoverian stallions were genotyped using the Equine SNP50 Beadchip. The most significant association was found on horse chromosome 6 for a single nucleotide polymorphism (SNP) within phospholipase C zeta 1 (PLCz1). In the close neighbourhood to PLCz1 is located CAPZA3 (capping protein (actin filament) muscle Z-line, alpha 3). The gene PLCz1 encodes a protein essential for spermatogenesis and oocyte activation through sperm induced Ca2+-oscillation during fertilization. We derived equine gene models for PLCz1 and CAPZA3 based on cDNA and genomic DNA sequences. The equine PLCz1 had four different transcripts of which two contained a premature termination codon. Sequencing all exons and their flanking sequences using genomic DNA samples from 19 Hanoverian stallions revealed 47 polymorphisms within PLCz1 and one SNP within CAPZA3. Validation of these 48 polymorphisms in 237 Hanoverian stallions identified three intronic SNPs within PLCz1 as significantly associated with EBV-PAT. Bioinformatic analysis suggested regulatory effects for these SNPs via transcription factor binding sites or microRNAs. In conclusion, non-coding polymorphisms within PLCz1 were identified as conferring stallion fertility and PLCz1 as candidate locus for male fertility in Hanoverian warmblood. CAPZA3 could be eliminated as candidate gene for fertility in Hanoverian stallions.
Genome-Wide Association Study Identifies Phospholipase C zeta 1 (PLCz1) as a Stallion Fertility Locus in Hanoverian Warmblood Horses

PubMed Central

Schrimpf, Rahel; Dierks, Claudia; Martinsson, Gunilla; Sieme, Harald; Distl, Ottmar

2014-01-01

A consistently high level of stallion fertility plays an economically important role in modern horse breeding. We performed a genome-wide association study for estimated breeding values of the paternal component of the pregnancy rate per estrus cycle (EBV-PAT) in Hanoverian stallions. A total of 228 Hanoverian stallions were genotyped using the Equine SNP50 Beadchip. The most significant association was found on horse chromosome 6 for a single nucleotide polymorphism (SNP) within phospholipase C zeta 1 (PLCz1). In the close neighbourhood to PLCz1 is located CAPZA3 (capping protein (actin filament) muscle Z-line, alpha 3). The gene PLCz1 encodes a protein essential for spermatogenesis and oocyte activation through sperm induced Ca2+-oscillation during fertilization. We derived equine gene models for PLCz1 and CAPZA3 based on cDNA and genomic DNA sequences. The equine PLCz1 had four different transcripts of which two contained a premature termination codon. Sequencing all exons and their flanking sequences using genomic DNA samples from 19 Hanoverian stallions revealed 47 polymorphisms within PLCz1 and one SNP within CAPZA3. Validation of these 48 polymorphisms in 237 Hanoverian stallions identified three intronic SNPs within PLCz1 as significantly associated with EBV-PAT. Bioinformatic analysis suggested regulatory effects for these SNPs via transcription factor binding sites or microRNAs. In conclusion, non-coding polymorphisms within PLCz1 were identified as conferring stallion fertility and PLCz1 as candidate locus for male fertility in Hanoverian warmblood. CAPZA3 could be eliminated as candidate gene for fertility in Hanoverian stallions. PMID:25354211
Method to amplify variable sequences without imposing primer sequences

DOEpatents

Bradbury, Andrew M.; Zeytun, Ahmet

2006-11-14

The present invention provides methods of amplifying target sequences without including regions flanking the target sequence in the amplified product or imposing amplification primer sequences on the amplified product. Also provided are methods of preparing a library from such amplified target sequences.
Tissue- and Time-Specific Expression of Otherwise Identical tRNA Genes

PubMed Central

Adir, Idan; Dahan, Orna; Broday, Limor; Pilpel, Yitzhak; Rechavi, Oded

2016-01-01

Codon usage bias affects protein translation because tRNAs that recognize synonymous codons differ in their abundance. Although the current dogma states that tRNA expression is exclusively regulated by intrinsic control elements (A- and B-box sequences), we revealed, using a reporter that monitors the levels of individual tRNA genes in Caenorhabditis elegans, that eight tryptophan tRNA genes, 100% identical in sequence, are expressed in different tissues and change their expression dynamically. Furthermore, the expression levels of the sup-7 tRNA gene at day 6 were found to predict the animal’s lifespan. We discovered that the expression of tRNAs that reside within introns of protein-coding genes is affected by the host gene’s promoter. Pairing between specific Pol II genes and the tRNAs that are contained in their introns is most likely adaptive, since a genome-wide analysis revealed that the presence of specific intronic tRNAs within specific orthologous genes is conserved across Caenorhabditis species. PMID:27560950

A mutation in yeast mitochondrial DNA results in a precise excision of the terminal intron of the cytochrome b gene.

PubMed

Hill, J; McGraw, P; Tzagoloff, A

1985-03-25

The yeast nuclear gene CBP2 was previously proposed to code for a protein necessary for processing of the terminal intron in the cytochrome b pre-mRNA (McGraw, P., and Tzagoloff, A. (1983) J. Biol. Chem. 258, 9459-9468). In the present study we describe a mitochondrial mutation capable of suppressing the respiratory deficiency of cbp2 mutants. The mitochondrial suppressor mutation has been shown to be the result of a precise excision of the last intervening sequence from the cytochrome b gene. Strains with the altered mitochondrial DNA have normal levels of mature cytochrome b mRNA and of cytochrome b and exhibit wild type growth on glycerol. These results confirm that CBP2 codes for a protein specifically required for splicing of the cytochrome b intron and further suggest that absence of the intervening sequence does not noticeably affect the expression of respiratory function in mitochondria.
Intriguing Balancing Selection on the Intron 5 Region of LMBR1 in Human Population

PubMed Central

He, Fang; Wu, Dong-Dong; Kong, Qing-Peng; Zhang, Ya-Ping

2008-01-01

Background The intron 5 of gene LMBR1 is the cis-acting regulatory module for the sonic hedgehog (SHH) gene. Mutation in this non-coding region is associated with preaxial polydactyly, and may play crucial roles in the evolution of limb and skeletal system. Methodology/Principal Findings We sequenced a region of the LMBR1 gene intron 5 in East Asian human population, and found a significant deviation of Tajima's D statistics from neutrality taking human population growth into account. Data from HapMap also demonstrated extended linkage disequilibrium in the region in East Asian and European population, and significantly low degree of genetic differentiation among human populations. Conclusion/Significance We proposed that the intron 5 of LMBR1 was presumably subject to balancing selection during the evolution of modern human. PMID:18698406
Evolutionary conservation and regulation of particular alternative splicing events in plant SR proteins

PubMed Central

Kalyna, Maria; Lopato, Sergiy; Voronin, Viktor; Barta, Andrea

2006-01-01

Alternative splicing is an important mechanism for fine tuning of gene expression at the post-transcriptional level. SR proteins govern splice site selection and spliceosome assembly. The Arabidopsis genome encodes 19 SR proteins, several of which have no orthologues in metazoan. Three of the plant specific subfamilies are characterized by the presence of a relatively long alternatively spliced intron located in their first RNA recognition motif, which potentially results in an extremely truncated protein. In atRSZ33, a member of the RS2Z subfamily, this alternative splicing event was shown to be autoregulated. Here we show that atRSp31, a member of the RS subfamily, does not autoregulate alternative splicing of its similarily positioned intron. Interestingly, this alternative splicing event is regulated by atRSZ33. We demonstrate that the positions of these long introns and their capability for alternative splicing are conserved from green algae to flowering plants. Moreover, in particular alternative splicing events the splicing signals are embedded into highly conserved sequences. In different taxa, these conserved sequences occur in at least one gene within a subfamily. The evolutionary preservation of alternative splice forms together with highly conserved intron features argues for additional functions hidden in the genes of these plant-specific SR proteins. PMID:16936312
Gene replacements and insertions in rice by intron targeting using CRISPR-Cas9.

PubMed

Li, Jun; Meng, Xiangbing; Zong, Yuan; Chen, Kunling; Zhang, Huawei; Liu, Jinxing; Li, Jiayang; Gao, Caixia

2016-09-12

Sequence-specific nucleases have been exploited to create targeted gene knockouts in various plants(1), but replacing a fragment and even obtaining gene insertions at specific loci in plant genomes remain a serious challenge. Here, we report efficient intron-mediated site-specific gene replacement and insertion approaches that generate mutations using the non-homologous end joining (NHEJ) pathway using the clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated protein 9 (Cas9) system. Using a pair of single guide RNAs (sgRNAs) targeting adjacent introns and a donor DNA template including the same pair of sgRNA sites, we achieved gene replacements in the rice endogenous gene 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) at a frequency of 2.0%. We also obtained targeted gene insertions at a frequency of 2.2% using a sgRNA targeting one intron and a donor DNA template including the same sgRNA site. Rice plants harbouring the OsEPSPS gene with the intended substitutions were glyphosate-resistant. Furthermore, the site-specific gene replacements and insertions were faithfully transmitted to the next generation. These newly developed approaches can be generally used to replace targeted gene fragments and to insert exogenous DNA sequences into specific genomic sites in rice and other plants.
In vitro mapping of Myotonic Dystrophy (DM) gene promoter

DOE Office of Scientific and Technical Information (OSTI.GOV)

Storbeck, C.J.; Sabourin, L.; Baird, S.

1994-09-01

The Myotonic Dystrophy Kinase (DMK) gene has been cloned and shared homology to serine/threonine protein kinases. Overexpression of this gene in stably transfected mouse myoblasts has been shown to inhibit fusion into myotubes while myoblasts stably transfected with an antisense construct show increased fusion potential. These experiments, along with data showing that the DM gene is highly expressed in muscle have highlighted the possibility of DMK being involved in myogenesis. The promoter region of the DM gene lacks a consensus TATA box and CAAT box, but harbours numerous transcription binding sites. Clones containing extended 5{prime} upstream sequences (UPS) of DMKmore » only weakly drive the reporter gene chloramphenicol acetyl transferase (CAT) when transfected into C2C12 mouse myoblasts. However, four E-boxes are present in the first intron of the DM gene and transient assays show increased expression of the CAT gene when the first intron is present downstream of these 5{prime} UPS in an orientation dependent manner. Comparison between mouse and human sequence reveals that the regions in the first intron where the E-boxes are located are highly conserved. The mapping of the promoter and the importance of the first intron in the control of DMK expression will be presented.« less
Regional centromeres in the yeast Candida lusitaniae lack pericentromeric heterochromatin

PubMed Central

Kapoor, Shivali; Zhu, Lisha; Froyd, Cara; Liu, Tao; Rusche, Laura N.

2015-01-01

Point centromeres are specified by a short consensus sequence that seeds kinetochore formation, whereas regional centromeres lack a conserved sequence and instead are epigenetically inherited. Regional centromeres are generally flanked by heterochromatin that ensures high levels of cohesin and promotes faithful chromosome segregation. However, it is not known whether regional centromeres require pericentromeric heterochromatin. In the yeast Candida lusitaniae, we identified a distinct type of regional centromere that lacks pericentromeric heterochromatin. Centromere locations were determined by ChIP-sequencing of two key centromere proteins, Cse4 and Mif2, and are consistent with bioinformatic predictions. The centromeric DNA sequence was unique for each chromosome and spanned 4–4.5 kbp, consistent with regional epigenetically inherited centromeres. However, unlike other regional centromeres, there was no evidence of pericentromeric heterochromatin in C. lusitaniae. In particular, flanking genes were expressed at a similar level to the rest of the genome, and a URA3 reporter inserted adjacent to a centromere was not repressed. In addition, regions flanking the centromeric core were not associated with hypoacetylated histones or a sirtuin deacetylase that generates heterochromatin in other yeast. Interestingly, the centromeric chromatin had a distinct pattern of histone modifications, being enriched for methylated H3K79 and H3R2 but lacking methylation of H3K4, which is found at other regional centromeres. Thus, not all regional centromeres require flanking heterochromatin. PMID:26371315
A study of the relationships of cultivated peanut (Arachis hypogaea) and its most closely related wild species using intron sequences and microsatellite markers

PubMed Central

Moretzsohn, Márcio C.; Gouvea, Ediene G.; Inglis, Peter W.; Leal-Bertioli, Soraya C. M.; Valls, José F. M.; Bertioli, David J.

2013-01-01

Background and Aims The genus Arachis contains 80 described species. Section Arachis is of particular interest because it includes cultivated peanut, an allotetraploid, and closely related wild species, most of which are diploids. This study aimed to analyse the genetic relationships of multiple accessions of section Arachis species using two complementary methods. Microsatellites allowed the analysis of inter- and intraspecific variability. Intron sequences from single-copy genes allowed phylogenetic analysis including the separation of the allotetraploid genome components. Methods Intron sequences and microsatellite markers were used to reconstruct phylogenetic relationships in section Arachis through maximum parsimony and genetic distance analyses. Key Results Although high intraspecific variability was evident, there was good support for most species. However, some problems were revealed, notably a probable polyphyletic origin for A. kuhlmannii. The validity of the genome groups was well supported. The F, K and D genomes grouped close to the A genome group. The 2n = 18 species grouped closer to the B genome group. The phylogenetic tree based on the intron data strongly indicated that A. duranensis and A. ipaënsis are the ancestors of A. hypogaea and A. monticola. Intron nucleotide substitutions allowed the ages of divergences of the main genome groups to be estimated at a relatively recent 2·3–2·9 million years ago. This age and the number of species described indicate a much higher speciation rate for section Arachis than for legumes in general. Conclusions The analyses revealed relationships between the species and genome groups and showed a generally high level of intraspecific genetic diversity. The improved knowledge of species relationships should facilitate the utilization of wild species for peanut improvement. The estimates of speciation rates in section Arachis are high, but not unprecedented. We suggest these high rates may be linked to the peculiar reproductive biology of Arachis. PMID:23131301
Extremely hypomorphic and severe deep intronic variants in the ABCA4 locus result in varying Stargardt disease phenotypes.

PubMed

Zernant, Jana; Lee, Winston; Nagasaki, Takayuki; Collison, Frederick T; Fishman, Gerald A; Bertelsen, Mette; Rosenberg, Thomas; Gouras, Peter; Tsang, Stephen H; Allikmets, Rando

2018-05-30

Autosomal recessive Stargardt disease (STGD1, MIM 248200) is caused by mutations in the ABCA4 gene. Complete sequencing of the ABCA4 locus in STGD1 patients identifies two expected disease-causing alleles in ~75% of patients and only one mutation in ~15% of patients. Recently, many possibly pathogenic variants in deep intronic sequences of ABCA4 have been identified in the latter group. We extended our analyses of deep intronic ABCA4 variants and determined that one of these, c.4253+43G>A (rs61754045), is present in 29/1155 (2.6%) of STGD1 patients. The variant is found at statistically significantly higher frequency in patients with only one pathogenic ABCA4 allele, 23/160 (14.38%), MAF=0.072, compared to MAF=0.013 in all STGD1 cases and MAF=0.006 in the matching general population (P<1x10-7). The variant, which is not predicted to have any effect on splicing, is the first reported intronic "extremely hypomorphic allele" in the ABCA4 locus; i.e., it is pathogenic only when in trans with a loss-of-function ABCA4 allele. It results in a distinct clinical phenotype characterized by late-onset of symptoms and foveal sparing. In ~70% of cases the variant was allelic with the c.6006-609T>A (rs575968112) variant, which was deemed non-pathogenic. Another rare deep intronic variant, c.5196+1056A>G (rs886044749), found in 5/834 (0.6%) of STGD1 cases is, conversely, a severe allele. This study determines pathogenicity for three non-coding variants in STGD1 patients of European descent accounting for ~3% of the disease. Defining disease-associated alleles in the non-coding sequences of the ABCA4 locus can be accomplished by integrated clinical and genetic analyses. Cold Spring Harbor Laboratory Press.
Loss of a Trans-Splicing nad1 Intron from Geraniaceae and Transfer of the Maturase Gene matR to the Nucleus in Pelargonium

PubMed Central

Grewe, Felix; Zhu, Andan; Mower, Jeffrey P.

2016-01-01

The mitochondrial nad1 gene of seed plants has a complex structure, including four introns in cis or trans configurations and a maturase gene (matR) hosted within the final intron. In the geranium family (Geraniaceae), however, sequencing of representative species revealed that three of the four introns, including one in a trans configuration and another that hosts matR, were lost from the nad1 gene in their common ancestor. Despite the loss of the host intron, matR has been retained as a freestanding gene in most genera of the family, indicating that this maturase has additional functions beyond the splicing of its host intron. In the common ancestor of Pelargonium, matR was transferred to the nuclear genome, where it was split into two unlinked genes that encode either its reverse transcriptase or maturase domain. Both nuclear genes are transcribed and contain predicted mitochondrial targeting signals, suggesting that they express functional proteins that are imported into mitochondria. The nuclear localization and split domain structure of matR in the Pelargonium nuclear genome offers a unique opportunity to assess the function of these two domains using transgenic approaches. PMID:27664178
Malonyl CoA decarboxylase deficiency: C to T transition in intron 2 of the MCD gene.

PubMed

Surendran, S; Sacksteder, K A; Gould, S J; Coldwell, J G; Rady, P L; Tyring, S K; Matalon, R

2001-09-15

Malonyl CoA decarboxylase (MCD) is an enzyme involved in the metabolism of fatty acids synthesis. Based on reports of MCD deficiency, this enzyme is particular important in muscle and brain metabolism. Mutations in the MCD gene result in a deficiency of MCD activity, that lead to psychomotor retardation, cardiomyopathy and neonatal death. To date however, only a few patients have been reported with defects in MCD. We report here studies of a patient with MCD deficiency, who presented with hypotonia, cardiomyopathy and psychomotor retardation. DNA sequencing of MCD revealed a homozygous intronic mutation, specifically a -5 C to T transition near the acceptor site for exon 3. RT-PCR amplification of exons 2 and 3 revealed that although mRNA from a normal control sample yielded one major DNA band, the mutant mRNA sample resulted in two distinct DNA fragments. Sequencing of the patient's two RT-PCR products revealed that the larger molecular weight fragments contained exons 2 and 3 as well as the intervening intronic sequence. The smaller size band from the patient contained the properly spliced exons, similar to the normal control. Western blotting analysis of the expressed protein showed only a faint band in the patient sample in contrast to a robust band in the control. In addition, the enzyme activity of the mutant protein was lower than that of the control protein. The data indicate that homozygous mutation in intron 2 disrupt normal splicing of the gene, leading to lower expression of the MCD protein and MCD deficiency. Copyright 2001 Wiley-Liss, Inc.
Nonsynonymous substitution in abalone sperm fertilization genes exceeds substitution in introns and mitochondrial DNA

PubMed Central

Metz, Edward C.; Robles-Sikisaka, Refugio; Vacquier, Victor D.

1998-01-01

Strong positive Darwinian selection acts on two sperm fertilization proteins, lysin and 18-kDa protein, from abalone (Haliotis). To understand the phylogenetic context for this dramatic molecular evolution, we obtained sequences of mitochondrial cytochrome c oxidase subunit I (mtCOI), and genomic sequences of lysin, 18-kDa, and a G protein subunit. Based on mtDNA differentiation, four north Pacific abalone species diverged within the past 2 million years (Myr), and remaining north Pacific species diverged over a period of 4–20 Myr. Between-species nonsynonymous differences in lysin and 18-kDa exons exceed nucleotide differences in introns by 3.5- to 24-fold. Remarkably, in some comparisons nonsynonymous substitutions in lysin and 18-kDa genes exceed synonymous substitutions in mtCOI. Lysin and 18-kDa intron/exon segments were sequenced from multiple red abalone individuals collected over a 1,200-km range. Only two nucleotide changes and two sites of slippage variation were detected in a total of >29,000 nucleotides surveyed. However, polymorphism in mtCOI and a G protein intron was found in this species. This finding suggests that positive selection swept one lysin allele and one 18-kDa allele to fixation. Similarities between mtCOI and lysin gene trees indicate that rapid adaptive evolution of lysin has occurred consistently through the history of the group. Comparisons with mtCOI molecular clock calibrations suggest that nonsynonymous substitutions accumulate 2–50 times faster in lysin and 18-kDa genes than in rapidly evolving mammalian genes. PMID:9724763
A base substitution in the donor site of intron 12 of KIT gene is responsible for the dominant white coat colour of blue fox (Alopex lagopus).

PubMed

Yan, S Q; Hou, J N; Bai, C Y; Jiang, Y; Zhang, X J; Ren, H L; Sun, B X; Zhao, Z H; Sun, J H

2014-04-01

The dominant white coat colour of farmed blue fox is inherited as a monogenic autosomal dominant trait and is suggested to be embryonic lethal in the homozygous state. In this study, the transcripts of KIT were identified by RT-PCR for a dominant white fox and a normal blue fox. Sequence analysis showed that the KIT transcript in normal blue fox contained the full-length coding sequence of 2919 bp (GenBank Acc. No KF530833), but in the dominant white individual, a truncated isoform lacking the entire exon 12 specifically co-expressed with the normal transcript. Genomic DNA sequencing revealed that a single nucleotide polymorphism (c.1867+1G>T) in intron 12 appeared only in the dominant white individuals and a 1-bp ins/del polymorphism in the same intron showed in individuals representing two different coat colours. Genotyping results of the SNP with PCR-RFLP in 185 individuals showed all 90 normal blue foxes were homozygous for the G allele, and all dominant white individuals were heterozygous. Due to the truncated protein with a deletion of 35 amino acids and an amino acid replacement (p.Pro623Ala) located in the conserved ATP binding domain, we propose that the mutant receptor had absent tyrosine kinase activity. These findings reveal that the base substitution at the first nucleotide of intron 12 of KIT gene, resulting in skipping of exon 12, is a causative mutation responsible for the dominant white phenotype of blue fox. © 2013 Stichting International Foundation for Animal Genetics.
Plastid and mitochondrion genomic sequences from Arctic Chlorella sp. ArM0029B.

PubMed

Jeong, Haeyoung; Lim, Jong-Min; Park, Jihye; Sim, Young Mi; Choi, Han-Gu; Lee, Jungho; Jeong, Won-Joong

2014-04-16

Chorella is the representative taxon of Chlorellales in Trebouxiophyceae, and its chloroplast (cp) genomic information has been thought to depend only on studies concerning Chlorella vulgaris and GenBank information of C. variablis. Mitochondrial (mt) genomic information regarding Chlorella is currently unavailable. To elucidate the evolution of organelle genomes and genetic information of Chlorella, we have sequenced and characterized the cp and mt genomes of Arctic Chlorella sp. ArM0029B. The 119,989-bp cp genome lacking inverted repeats and 65,049-bp mt genome were sequenced. The ArM0029B cp genome contains 114 conserved genes, including 32 tRNA genes, 3 rRNA genes, and 79 genes encoding proteins. Chlorella cp genomes are highly rearranged except for a Chlorella-specific six-gene cluster, and the ArM0029B plastid resembles that of Chlorella variabilis except for a 15-kb gene cluster inversion. In the mt genome, 62 conserved genes, including 27 tRNA genes, 3 rRNA genes, and 32 genes encoding proteins were determined. The mt genome of ArM0029B is similar to that of the non-photosynthetic species Prototheca and Heicosporidium. The ArM0029B mt genome contains a group I intron, with an ORF containing two LAGLIDADG motifs, in cox1. The intronic ORF is shared by C. vulgaris and Prototheca. The phylogeny of the plastid genome reveals that ArM0029B showed a close relationship of Chlorella to Parachlorella and Oocystis within Chlorellales. The distribution of the cox1 intron at 721 support membership in the order Chlorellales. Mitochondrial phylogenomic analyses, however, indicated that ArM0029B shows a greater affinity to MX-AZ01 and Coccomyxa than to the Helicosporidium-Prototheca clade, although the detailed phylogenetic relationships among the three taxa remain to be resolved. The plastid genome of ArM0029B is similar to that of C. variabilis. The mt sequence of ArM0029B is the first genome to be reported for Chlorella. Chloroplast genome phylogeny supports monophyly of the seven investigated members of Chlorellales. The presence of the cox1 intron at 721 in all four investigated Chlorellales taxa indicates that the cox1 intron had been introduced in early Chorellales as a cis-splice form and that the cis-splicing intron was inherited to recent Chlorellales and was recently trans-spliced in Helicosporidium.
Plastid and mitochondrion genomic sequences from Arctic Chlorella sp. ArM0029B

PubMed Central

2014-01-01

Background Chorella is the representative taxon of Chlorellales in Trebouxiophyceae, and its chloroplast (cp) genomic information has been thought to depend only on studies concerning Chlorella vulgaris and GenBank information of C. variablis. Mitochondrial (mt) genomic information regarding Chlorella is currently unavailable. To elucidate the evolution of organelle genomes and genetic information of Chlorella, we have sequenced and characterized the cp and mt genomes of Arctic Chlorella sp. ArM0029B. Results The 119,989-bp cp genome lacking inverted repeats and 65,049-bp mt genome were sequenced. The ArM0029B cp genome contains 114 conserved genes, including 32 tRNA genes, 3 rRNA genes, and 79 genes encoding proteins. Chlorella cp genomes are highly rearranged except for a Chlorella-specific six-gene cluster, and the ArM0029B plastid resembles that of Chlorella variabilis except for a 15-kb gene cluster inversion. In the mt genome, 62 conserved genes, including 27 tRNA genes, 3 rRNA genes, and 32 genes encoding proteins were determined. The mt genome of ArM0029B is similar to that of the non-photosynthetic species Prototheca and Heicosporidium. The ArM0029B mt genome contains a group I intron, with an ORF containing two LAGLIDADG motifs, in cox1. The intronic ORF is shared by C. vulgaris and Prototheca. The phylogeny of the plastid genome reveals that ArM0029B showed a close relationship of Chlorella to Parachlorella and Oocystis within Chlorellales. The distribution of the cox1 intron at 721 support membership in the order Chlorellales. Mitochondrial phylogenomic analyses, however, indicated that ArM0029B shows a greater affinity to MX-AZ01 and Coccomyxa than to the Helicosporidium-Prototheca clade, although the detailed phylogenetic relationships among the three taxa remain to be resolved. Conclusions The plastid genome of ArM0029B is similar to that of C. variabilis. The mt sequence of ArM0029B is the first genome to be reported for Chlorella. Chloroplast genome phylogeny supports monophyly of the seven investigated members of Chlorellales. The presence of the cox1 intron at 721 in all four investigated Chlorellales taxa indicates that the cox1 intron had been introduced in early Chorellales as a cis-splice form and that the cis-splicing intron was inherited to recent Chlorellales and was recently trans-spliced in Helicosporidium. PMID:24735464
Biothems: Sequence stratigraphic units and their implications for regional tectono-stratigraphic interpretations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lane, H.R.; Frye, M.W.; Couples, G.D.

1992-01-01

Biothems are regional wedge- or lens-shaped bodies of strata that are bounded shelfward or cratonward by paleontologically recognizable unconformities; generally thicken on marine shelves, where they are typically conformable with underlying and overlying biothems; are commonly thinner or represent starved sequences further basinward; and in their most basinward extent, are either bounded by biostratigraphically recognizable unconformities or are conformable with underlying and overlying biothems. As recognized to date, biothems have a logical distribution of faunal and floral components, as well as facies groupings that represent internally consistent and logical sequences of depositional environments. A west-to-east transect within the North Americanmore » Mississippian System which extends from the Basin and Range Province, across the Transcontinental Arch (TA), into the Anadarko Basin, was constructed to demonstrate the regional distribution and tectono-stratigraphic significance of biothems relative to the axis of the TA. The relationships portrayed on the transect, tied to an understanding of North American Mississippian paleogeography, imply that biothems deposited during relative highstand events on one flank of the TA are time-equivalent to biothems deposited during relative lowstand events on the opposite flank of the TA. This distribution is interpreted to have been controlled by intraplate tectonic events that formed piano key basins along the flanks of the TA. The spatial patterns of these basins are not consistent with published models of basin evolution. A further conclusion is that the lack of coincident, transgressive or regressive Mississippian biothems on either flank of the TA suggests that it is inadvisable to impose the Mississippi Valley-derived eustasy curve on western flank depositional sequences.« less
Genes encoding Xenopus laevis Ig L chains: Implications for the evolution of [kappa] and [lambda] chains

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zezza, D.J.; Stewart, S.E.; Steiner, L.A.

1992-12-15

Xenopus laevis Ig contain two distinct types of L chains, designated [rho] or L1 and [sigma] or L2. The authors have analyzed Xenopus genomic DNA by Southern blotting with cDNA probes specific for L1 V and C regions. Many fragments hybridized to the V probe, but only one or two fragments hybridized to the C probe. Corresponding C, J, and V gene segments were identified on clones isolated from a genomic library prepared from the same DNA. One clone contains a C gene segment separated from a J gene segment by an intron of 3.4 kb. The J and Cmore » gene segments are nearly identical in sequence to cDNA clones analyzed previously. The C segment is somewhat more similar and the J segment considerably more similar in sequence to the corresponding segments of mammalian [kappa] chains than to those of mammalian [lambda] chains. Upstream of the J segment is a typical recombination signal sequence with a spacer of 23 bp, as in J[kappa]. A second clone from the library contains four V gene segments, separated by 2.1 to 3.6 kb. Two of these, V1 and V3, have the expected structural and regulatory features of V genes, and are very similar in sequence to each other and to mammalian V[kappa]. A third gene segment, V2, resembles V1 and V3 in its coding region and nearby 5[prime]-flanking region, but diverges in sequence 5[prime] to position [minus]95 with loss of the octamer promoter element. The fourth V-like segment is similar to the others at the 3[prime]-end, but upstream of codon 64 bears no resemblance in sequence to any Ig V region. All four V segments have typical recombination signal sequences with 12-bp spacers at their 3[prime]-ends, as in V[kappa]. Taken together, the data suggest that Xenopus L1 L chain genes are members of the [kappa] gene family. 80 refs., 9 figs.« less
Bipolar localization of the group II intron Ll.LtrB is maintained in Escherichia coli deficient in nucleoid condensation, chromosome partitioning and DNA replication.

PubMed

Beauregard, Arthur; Chalamcharla, Venkata R; Piazza, Carol Lyn; Belfort, Marlene; Coros, Colin J

2006-11-01

Group II introns are mobile genetic elements that invade their cognate intron-minus alleles via an RNA intermediate, in a process known as retrohoming. They can also retrotranspose to ectopic sites at low frequency. In Escherichia coli, retrotransposition of the lactococcal group II intron, Ll.LtrB, occurs preferentially within the Ori and Ter macrodomains of the E. coli chromosome. These macrodomains migrate towards the poles of the cell, where the intron-encoded protein, LtrA, localizes. Here we investigate whether alteration of nucleoid condensation, chromosome partitioning and replication affect retrotransposition frequencies, as well as bipolar localization of the Ll.LtrB intron integration and LtrA distribution in E. coli. We thus examined these properties in the absence of the nucleoid-associated proteins H-NS, StpA and MukB, in variants of partitioning functions including the centromere-like sequence migS and the actin homologue MreB, as well as in the replication mutants DeltaoriC, seqA, tus and topoIV (ts). Although there were some dramatic fluctuations in retrotransposition levels in these hosts, bipolar localization of integration events was maintained. LtrA was consistently found in nucleoid-free regions, with its localization to the cellular poles being largely preserved in these hosts. Together, these results suggest that bipolar localization of group II intron retrotransposition results from the residence of the intron-encoded protein at the poles of the cell.
The complete chloroplast genome sequence of American bird pepper (Capsicum annuum var. glabriusculum).

PubMed

Zeng, Fan-chun; Gao, Cheng-wen; Gao, Li-zhi

2016-01-01

The complete chloroplast genome sequence of American bird pepper (Capsicum annuum var. glabriusculum) is reported and characterized in this study. The genome size is 156,612 bp, containing a pair of inverted repeats (IRs) of 25,776 bp separated by a large single-copy region of 87,213 bp and a small single-copy region of 17,851 bp. The chloroplast genome harbors 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes, and 37 tRNA genes. A total of 18 of these genes are duplicated in the inverted repeat regions, 16 genes contain 1 intron, and 2 genes and one ycf have 2 introns.
Microhomology-mediated end joining induces hypermutagenesis at breakpoint junctions

PubMed Central

Li, Fuyang; Villarreal, Diana; Shim, Jae Hoon; Myung, Kyungjae; Shim, Eun Yong; Lee, Sang Eun

2017-01-01

Microhomology (MH) flanking a DNA double-strand break (DSB) drives chromosomal rearrangements but its role in mutagenesis has not yet been analyzed. Here we determined the mutation frequency of a URA3 reporter gene placed at multiple locations distal to a DSB, which is flanked by different sizes (15-, 18-, or 203-bp) of direct repeat sequences for efficient repair in budding yeast. Induction of a DSB accumulates mutations in the reporter gene situated up to 14-kb distal to the 15-bp MH, but more modestly to those carrying 18- and 203-bp or no homology. Increased mutagenesis in MH-mediated end joining (MMEJ) appears coupled to its slower repair kinetics and the extensive resection occurring at flanking DNA. Chromosomal translocations via MMEJ also elevate mutagenesis of the flanking DNA sequences 7.1 kb distal to the breakpoint junction as compared to those without MH. The results suggest that MMEJ could destabilize genomes by triggering structural alterations and increasing mutation burden. PMID:28419093
Molecular screening of the CYP4V2 gene in Bietti crystalline dystrophy that is associated with choroidal neovascularization

PubMed Central

Mamatha, Gandra; Umashankar, Vetrivel; Kasinathan, Nachiappan; Krishnan, Tandava; Sathyabaarathi, Ravichandran; Karthiyayini, Thirumalai; Amali, John; Rao, Chetan

2011-01-01

Purpose Bietti crystalline dystrophy (BCD) is an autosomal recessive disease characterized by intraretinal deposits of multiple small crystals, with or without associated crystal deposits in the cornea. The disease is caused by mutation in the cytochrome p450, family 4, subfamily v, polypeptide 2 (CYP4V2) gene. Choroidal neovascularization (CNV) is a rare event in BCD. We report two cases of BCD associated with CNV. CYP4V2 and exon 5 of tissue inhibitor of metalloproteinase 3 (TIMP3) were screened in both cases. A patient with BCD, but without CNV, was also screened to identify pathogenic variations. Methods Three BCD families of Asian Indian origin were recruited after a comprehensive ophthalmic examination. Genomic DNA was isolated from blood leukocytes, and coding exons and flanking introns of CYP4V2 and exon 5 of TIMP3 were amplified via polymerase chain reaction (PCR) and were sequenced. Family segregation, control screening, and bioinformatics tools were used to assess the pathogenicity of the novel variations. Results Of the three BCD patients, two had parafoveal CNV. The patient with BCD, but without CNV had novel single base-pair duplication (c.1062_1063dupA). This mutation results in a structurally defective and unstable protein with impaired protein function. Four novel benign variations (three in exons and one in an intron) were observed in the cohort. Screening of exon 5 of TIMP3 did not reveal any variation in these families. Conclusions A novel mutation was found in a patient with BCD but without CNV, while patients with BCD and CNV did not show any pathogenic variation. The modifier role of TIMP3 in the pathogenesis of CNV in BCD was partly ruled out, as no variation was observed in exon 5 of the gene. A larger BCD cohort with CNV needs to be studied and screened to understand the genetics of CNV in BCD. PMID:21850171

Chloroplast Genome Evolution in Early Diverged Leptosporangiate Ferns

PubMed Central

Kim, Hyoung Tae; Chung, Myong Gi; Kim, Ki-Joong

2014-01-01

In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnV-GCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of co-dons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarity between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data is in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns. PMID:24823358
Chloroplast genome evolution in early diverged leptosporangiate ferns.

PubMed

Kim, Hyoung Tae; Chung, Myong Gi; Kim, Ki-Joong

2014-05-01

In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnVGCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of codons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarity between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data is in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns.
Large Diversity of Nonstandard Genes and Dynamic Evolution of Chloroplast Genomes in Siphonous Green Algae (Bryopsidales, Chlorophyta)

PubMed Central

Leliaert, Frederik; Marcelino, Vanessa R

2018-01-01

Abstract Chloroplast genomes have undergone tremendous alterations through the evolutionary history of the green algae (Chloroplastida). This study focuses on the evolution of chloroplast genomes in the siphonous green algae (order Bryopsidales). We present five new chloroplast genomes, which along with existing sequences, yield a data set representing all but one families of the order. Using comparative phylogenetic methods, we investigated the evolutionary dynamics of genomic features in the order. Our results show extensive variation in chloroplast genome architecture and intron content. Variation in genome size is accounted for by the amount of intergenic space and freestanding open reading frames that do not show significant homology to standard plastid genes. We show the diversity of these nonstandard genes based on their conserved protein domains, which are often associated with mobile functions (reverse transcriptase/intron maturase, integrases, phage- or plasmid-DNA primases, transposases, integrases, ligases). Investigation of the introns showed proliferation of group II introns in the early evolution of the order and their subsequent loss in the core Halimedineae, possibly through RT-mediated intron loss. PMID:29635329
DIP1 modulates stem cell homeostasis in Drosophila through regulation of sisR-1.

PubMed

Wong, Jing Ting; Akhbar, Farzanah; Ng, Amanda Yunn Ee; Tay, Mandy Li-Ian; Loi, Gladys Jing En; Pek, Jun Wei

2017-10-02

Stable intronic sequence RNAs (sisRNAs) are by-products of splicing and regulate gene expression. How sisRNAs are regulated is unclear. Here we report that a double-stranded RNA binding protein, Disco-interacting protein 1 (DIP1) regulates sisRNAs in Drosophila. DIP1 negatively regulates the abundance of sisR-1 and INE-1 sisRNAs. Fine-tuning of sisR-1 by DIP1 is important to maintain female germline stem cell homeostasis by modulating germline stem cell differentiation and niche adhesion. Drosophila DIP1 localizes to a nuclear body (satellite body) and associates with the fourth chromosome, which contains a very high density of INE-1 transposable element sequences that are processed into sisRNAs. DIP1 presumably acts outside the satellite bodies to regulate sisR-1, which is not on the fourth chromosome. Thus, our study identifies DIP1 as a sisRNA regulatory protein that controls germline stem cell self-renewal in Drosophila.Stable intronic sequence RNAs (sisRNAs) are by-products of splicing from introns with roles in embryonic development in Drosophila. Here, the authors show that the RNA binding protein DIP1 regulates sisRNAs in Drosophila, which is necessary for germline stem cell homeostasis.
Allelic association of sequence variants in the herpes virus entry mediator-B gene (PVRL2) with the severity of multiple sclerosis.

PubMed

Schmidt, S; Pericak-Vance, M A; Sawcer, S; Barcellos, L F; Hart, J; Sims, J; Prokop, A M; van der Walt, J; DeLoa, C; Lincoln, R R; Oksenberg, J R; Compston, A; Hauser, S L; Haines, J L; Gregory, S G

2006-07-01

Discrepant findings have been reported regarding an association of the apolipoprotein E (APOE) gene with the clinical course of multiple sclerosis (MS). To resolve these discrepancies, we examined common sequence variation in six candidate genes residing in a 380-kb genomic region surrounding and including the APOE locus for an association with MS severity. We genotyped at least three polymorphisms in each of six candidate genes in 1,540 Caucasian MS families (729 single-case and multiple-case families from the United States, 811 single-case families from the UK). By applying the quantitative transmission/disequilibrium test to a recently proposed MS severity score, the only statistically significant (P=0.003) association with MS severity was found for an intronic variant in the Herpes Virus Entry Mediator-B Gene PVRL2. Additional genotyping extended the association to a 16.6 kb block spanning intron 1 to intron 2 of the gene. Sequencing of PVRL2 failed to identify variants with an obvious functional role. In conclusion, the analysis of a very large data set suggests that genetic polymorphisms in PVRL2 may influence MS severity and supports the possibility that viral factors may contribute to the clinical course of MS, consistent with previous reports.
Cloning and Genomic Organization of a Rhamnogalacturonase Gene from Locally Isolated Strain of Aspergillus niger.

PubMed

Damak, Naourez; Abdeljalil, Salma; Taeib, Noomen Hadj; Gargouri, Ali

2015-08-01

The rhg gene encoding a rhamnogalacturonase was isolated from the novel strain A1 of Aspergillus niger. It consists of an ORF of 1.505 kb encoding a putative protein of 446 amino acids with a predicted molecular mass of 47 kDa, belonging to the family 28 of glycosyl hydrolases. The nature and position of amino acids comprising the active site as well as the three-dimensional structure were well conserved between the A. niger CTM10548 and fungal rhamnogalacturonases. The coding region of the rhg gene is interrupted by three short introns of 56 (introns 1 and 3) and 52 (intron 2) bp in length. The comparison of the peptide sequence with A. niger rhg sequences revealed that the A1 rhg should be an endo-rhamnogalacturonases, more homologous to rhg A than rhg B A. niger known enzymes. The comparison of rhg nucleotide sequence from A. niger A1 with rhg A from A. niger shows several base changes. Most of these changes (59 %) are located at the third base of codons suggesting maintaining the same enzyme function. We used the rhamnogalacturonase A from Aspergillus aculeatus as a template to build a structural model of rhg A1 that adopted a right-handed parallel β-helix.
Widespread alternative and aberrant splicing revealed by lariat sequencing

PubMed Central

Stepankiw, Nicholas; Raghavan, Madhura; Fogarty, Elizabeth A.; Grimson, Andrew; Pleiss, Jeffrey A.

2015-01-01

Alternative splicing is an important and ancient feature of eukaryotic gene structure, the existence of which has likely facilitated eukaryotic proteome expansions. Here, we have used intron lariat sequencing to generate a comprehensive profile of splicing events in Schizosaccharomyces pombe, amongst the simplest organisms that possess mammalian-like splice site degeneracy. We reveal an unprecedented level of alternative splicing, including alternative splice site selection for over half of all annotated introns, hundreds of novel exon-skipping events, and thousands of novel introns. Moreover, the frequency of these events is far higher than previous estimates, with alternative splice sites on average activated at ∼3% the rate of canonical sites. Although a subset of alternative sites are conserved in related species, implying functional potential, the majority are not detectably conserved. Interestingly, the rate of aberrant splicing is inversely related to expression level, with lowly expressed genes more prone to erroneous splicing. Although we validate many events with RNAseq, the proportion of alternative splicing discovered with lariat sequencing is far greater, a difference we attribute to preferential decay of aberrantly spliced transcripts. Together, these data suggest the spliceosome possesses far lower fidelity than previously appreciated, highlighting the potential contributions of alternative splicing in generating novel gene structures. PMID:26261211
Sequence analysis of three mitochondrial DNA molecules reveals interesting differences among Saccharomyces yeasts

PubMed Central

Langkjær, R. B.; Casaregola, S.; Ussery, D. W.; Gaillardin, C.; Piškur, J.

2003-01-01

The complete sequences of mitochondrial DNA (mtDNA) from the two budding yeasts Saccharomyces castellii and Saccharomyces servazzii, consisting of 25 753 and 30 782 bp, respectively, were analysed and compared to Saccharomyces cerevisiae mtDNA. While some of the traits are very similar among Saccharomyces yeasts, others have highly diverged. The two mtDNAs are much more compact than that of S.cerevisiae and contain fewer introns and intergenic sequences, although they have almost the same coding potential. A few genes contain group I introns, but group II introns, otherwise found in S.cerevisiae mtDNA, are not present. Surprisingly, four genes (ATP6, COX2, COX3 and COB) in the mtDNA of S.servazzii contain, in total, five +1 frameshifts. mtDNAs of S.castellii, S.servazzii and S.cerevisiae contain all genes on the same strand, except for one tRNA gene. On the other hand, the gene order is very different. Several gene rearrangements have taken place upon separation of the Saccharomyces lineages, and even a part of the transcription units have not been preserved. It seems that the mechanism(s) involved in the generation of the rearrangements has had to ensure that all genes stayed encoded by the same DNA strand. PMID:12799436
Deep sequencing with intronic capture enables identification of an APC exon 10 inversion in a patient with polyposis.

PubMed

Shirts, Brian H; Salipante, Stephen J; Casadei, Silvia; Ryan, Shawnia; Martin, Judith; Jacobson, Angela; Vlaskin, Tatyana; Koehler, Karen; Livingston, Robert J; King, Mary-Claire; Walsh, Tom; Pritchard, Colin C

2014-10-01

Single-exon inversions have rarely been described in clinical syndromes and are challenging to detect using Sanger sequencing. We report the case of a 40-year-old woman with adenomatous colon polyps too numerous to count and who had a complex inversion spanning the entire exon 10 in APC (the gene encoding for adenomatous polyposis coli), causing exon skipping and resulting in a frameshift and premature protein truncation. In this study, we employed complete APC gene sequencing using high-coverage next-generation sequencing by ColoSeq, analysis with BreakDancer and SLOPE software, and confirmatory transcript analysis. ColoSeq identified a complex small genomic rearrangement consisting of an inversion that results in translational skipping of exon 10 in the APC gene. This mutation would not have been detected by traditional sequencing or gene-dosage methods. We report a case of adenomatous polyposis resulting from a complex single-exon inversion. Our report highlights the benefits of large-scale sequencing methods that capture intronic sequences with high enough depth of coverage-as well as the use of informatics tools-to enable detection of small pathogenic structural rearrangements.
A functional study of proximal goat β-casein promoter and intron 1 in immortalized goat mammary epithelial cells.

PubMed

Kung, M H; Lee, Y J; Hsu, J T; Huang, M C; Ju, Y T

2015-06-01

Goat β-casein (CSN2) promoter has been extensively used to derive expression of recombinant therapeutic protein in transgenic goats; however, little direct evidence exists for signaling molecules and the cis-elements of goat CSN2 promoter in response to lactogenic hormone stimulation in goat mammary epithelial cells. Here, we use an immortalized caprine mammary epithelial cell line (CMC) to search for evidence of the above. Serial 5'-flanking regions deleted of promoter and intron 1 in goat CSN2 (-4,047 to +2,054) driven by firefly luciferase reporter gene were constructed and applied to measure promoter activity in CMC. The intron 1 region (+393 to +501) significantly decreased basal activity of the promoter. This finding contradicts other studies of the role of intron 1. The signal transducer and activator of transcription (STAT)5a played a significant role in activating promoter activity by prolactin stimulation. Hydrocortisone enhanced and prolonged the activity of STAT5a and promoter in CMC, but was independent of the glucocorticoid receptor response element. The minimum length of the CSN2 promoter segment in response to lactogenic stimulation was confirmed by 5' serial deletions. A cis-element located from -300 to -90 in proximal goat CSN2 promoter that is absent in bovine and human CSN2 promoter was newly identified. We demonstrated the presence of a STAT5a binding site (-102 to -82) and preservation of the guanosine nucleotide at position -90 based on responses to the presence of lactogenic hormone using internal deletions and point mutations of the predicted STAT5a binding site, and chromatin immunoprecipitation assay. Together, these findings demonstrate that the proximal -300 bp of goat CSN2 promoter containing the STAT5a binding site (-102 to -82) is the response element for lactogenic hormone stimulation. Additionally, intron 1 may be required for tissue or developmental stage-specific expression in mammary gland. The role of the far-distal regions of goat CSN2 promoter in high-level lactogenic hormone induction and specific expression require further examination. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Comparative genomic survey, exon-intron annotation and phylogenetic analysis of NAT-homologous sequences in archaea, protists, fungi, viruses, and invertebrates

USDA-ARS?s Scientific Manuscript database

We have previously published extensive genomic surveys [1-3], reporting NAT-homologous sequences in hundreds of sequenced bacterial, fungal and vertebrate genomes. We present here the results of our latest search of 2445 genomes, representing 1532 (70 archaeal, 1210 bacterial, 43 protist, 97 fungal,...
Structural features of diverse Pin-II proteinase inhibitor genes from Capsicum annuum.

PubMed

Mahajan, Neha S; Dewangan, Veena; Lomate, Purushottam R; Joshi, Rakesh S; Mishra, Manasi; Gupta, Vidya S; Giri, Ashok P

2015-02-01

The proteinase inhibitor (PI) genes from Capsicum annuum were characterized with respect to their UTR, introns and promoter elements. The occurrence of PIs with circularly permuted domain organization was evident. Several potato inhibitor II (Pin-II) type proteinase inhibitor (PI) genes have been analyzed from Capsicum annuum (L.) with respect to their differential expression during plant defense response. However, complete gene characterization of any of these C. annuum PIs (CanPIs) has not been carried out so far. Complete gene architectures of a previously identified CanPI-7 (Beads-on-string, Type A) and a member of newly isolated Bracelet type B, CanPI-69 are reported in this study. The 5' UTR (untranslated region), 3'UTR, and intronic sequences of both the CanPI genes were obtained. The genomic sequence of CanPI-7 exhibited, exon 1 (49 base pair, bp) and exon 2 (740 bp) interrupted by a 294-bp long type I intron. We noted the occurrence of three multi-domain PIs (CanPI-69, 70, 71) with circularly permuted domain organization. CanPI-69 was found to possess exon 1 (49 bp), exon 2 (551 bp) and a 584-bp long type I intron. The upstream sequence analysis of CanPI-7 and CanPI-69 predicted various transcription factor-binding sites including TATA and CAAT boxes, hormone-responsive elements (ABRELATERD1, DOFCOREZM, ERELEE4), and a defense-responsive element (WRKY71OS). Binding of transcription factors such as zinc finger motif MADS-box and MYB to the promoter regions was confirmed using electrophoretic mobility shift assay followed by mass spectrometric identification. The 3' UTR analysis for 25 CanPI genes revealed unique/distinct 3' UTR sequence for each gene. Structures of three domain CanPIs of type A and B were predicted and further analyzed for their attributes. This investigation of CanPI gene architecture will enable the better understanding of the genetic elements present in CanPIs.
Isolation of a promoter region in mouse cytochrome P450 3A (Cyp3A16) gene and its transcriptional control.

PubMed

Itoh, S; Abe, Y; Kubo, A; Okuda, M; Shimoji, M; Nakayama, K; Kamataki, T

1997-02-07

An 11.5 kb fragment of the mouse Cyp3a16 gene containing the 5' flanking region was isolated from the lambda DASHII mouse genomic library. A part of the 5' flanking region and the first exon of Cyp3a16 gene were sequenced. S1 mapping analysis showed the presence of two transcriptional initiation sites. The first exon was completely identical to Cyp3a16 cDNA. The identity of 5' flanking sequences between Cyp3a16 and Cyp3a11 genes was about 69%. A typical TATA box and a basic transcription element (BTE) were found as seen with other CYP3A genes from various animal species Moreover, some putative transcriptional regulatory elements were also found in addition to the sequence motif seen for the formation of Z-type DNA. To examine the transcriptional activity of Cyp3a11 gene, DNA fragments in the 5'-flanking region of the gene were inserted front of the luciferase structural gene, and the constructs were transfected in primary hepatocytes. The analysis of the luciferase activity indicated that the region between -146 and -56 was necessary for the transcription of CYP3a16 gene.
Intronic L1 Retrotransposons and Nested Genes Cause Transcriptional Interference by Inducing Intron Retention, Exonization and Cryptic Polyadenylation

PubMed Central

Kaer, Kristel; Branovets, Jelena; Hallikma, Anni; Nigumann, Pilvi; Speek, Mart

2011-01-01

Background Transcriptional interference has been recently recognized as an unexpectedly complex and mostly negative regulation of genes. Despite a relatively few studies that emerged in recent years, it has been demonstrated that a readthrough transcription derived from one gene can influence the transcription of another overlapping or nested gene. However, the molecular effects resulting from this interaction are largely unknown. Methodology/Principal Findings Using in silico chromosome walking, we searched for prematurely terminated transcripts bearing signatures of intron retention or exonization of intronic sequence at their 3′ ends upstream to human L1 retrotransposons, protein-coding and noncoding nested genes. We demonstrate that transcriptional interference induced by intronic L1s (or other repeated DNAs) and nested genes could be characterized by intron retention, forced exonization and cryptic polyadenylation. These molecular effects were revealed from the analysis of endogenous transcripts derived from different cell lines and tissues and confirmed by the expression of three minigenes in cell culture. While intron retention and exonization were comparably observed in introns upstream to L1s, forced exonization was preferentially detected in nested genes. Transcriptional interference induced by L1 or nested genes was dependent on the presence or absence of cryptic splice sites, affected the inclusion or exclusion of the upstream exon and the use of cryptic polyadenylation signals. Conclusions/Significance Our results suggest that transcriptional interference induced by intronic L1s and nested genes could influence the transcription of the large number of genes in normal as well as in tumor tissues. Therefore, this type of interference could have a major impact on the regulation of the host gene expression. PMID:22022525
The genome sequence of the colonial chordate, Botryllus schlosseri

PubMed Central

Voskoboynik, Ayelet; Neff, Norma F; Sahoo, Debashis; Newman, Aaron M; Pushkarev, Dmitry; Koh, Winston; Passarelli, Benedetto; Fan, H Christina; Mantalas, Gary L; Palmeri, Karla J; Ishizuka, Katherine J; Gissi, Carmela; Griggio, Francesca; Ben-Shlomo, Rachel; Corey, Daniel M; Penland, Lolita; White, Richard A; Weissman, Irving L; Quake, Stephen R

2013-01-01

Botryllus schlosseri is a colonial urochordate that follows the chordate plan of development following sexual reproduction, but invokes a stem cell-mediated budding program during subsequent rounds of asexual reproduction. As urochordates are considered to be the closest living invertebrate relatives of vertebrates, they are ideal subjects for whole genome sequence analyses. Using a novel method for high-throughput sequencing of eukaryotic genomes, we sequenced and assembled 580 Mbp of the B. schlosseri genome. The genome assembly is comprised of nearly 14,000 intron-containing predicted genes, and 13,500 intron-less predicted genes, 40% of which could be confidently parceled into 13 (of 16 haploid) chromosomes. A comparison of homologous genes between B. schlosseri and other diverse taxonomic groups revealed genomic events underlying the evolution of vertebrates and lymphoid-mediated immunity. The B. schlosseri genome is a community resource for studying alternative modes of reproduction, natural transplantation reactions, and stem cell-mediated regeneration. DOI: http://dx.doi.org/10.7554/eLife.00569.001 PMID:23840927
The Anopheles stephensi odorant binding protein 1 (AsteObp1) gene: a new molecular marker for biological forms diagnosis.

PubMed

Gholizadeh, S; Firooziyan, S; Ladonni, H; Hajipirloo, H Mohammadzadeh; Djadid, N Dinparast; Hosseini, A; Raz, A

2015-06-01

Anopheles (Cellia) stephensi Liston 1901 is known as an Asian malaria vector. Three biological forms, namely "mysorensis", "intermediate", and "type" have been earlier reported in this species. Nevertheless, the present morphological and molecular information is insufficient to diagnose these forms. During this investigation, An. stephensi biological forms were morphologically identified and sequenced for odorant-binding protein 1 (Obp1) gene. Also, intron I sequences were used to construct phylogenetic trees. Despite nucleotide sequence variation in exon of AsteObp1, nearly 100% identity was observed at the amino acid level among the three biological forms. In order to overcome difficulties in using egg morphology characters, intron I sequences of An. stephensi Obp1 opens new molecular way to the identification of the main Asian malaria vector biological forms. However, multidisciplinary studies are needed to establish the taxonomic status of An. stephensi. Copyright © 2015 Elsevier B.V. All rights reserved.
Observation of c.260A > G mutation in superoxide dismutase 1 that causes p.Asn86Ser in Iranian amyotrophic lateral sclerosis patient and absence of genotype/phenotype correlation.

PubMed

Khani, Marzieh; Alavi, Afagh; Nafissi, Shahriar; Elahi, Elahe

2015-07-06

Amyotrophic lateral sclerosis (ALS) is the most common motor neuron disorder in European populations. ALS can be sporadic ALS (SALS) or familial ALS (FALS). Among 20 known ALS genes, mutations in C9orf72 and superoxide dismutase 1 (SOD1) are the most common genetic causes of the disease. Whereas C9orf72 mutations are more common in Western populations, the contribution of SOD1 to ALS in Iran is more than C9orf72. At present, a clear genotype/phenotype correlation for ALS has not been identified. We aimed to perform mutation screening of SOD1 in a newly identified Iranian FALS patient and to assess whether a genotype/phenotype correlation for the identified mutation exists. The five exons of SOD1 and flanking intronic sequences of a FALS proband were screened for mutations by direct sequencing. The clinical features of the proband were assessed by a neuromuscular specialist (SN). The phenotypic presentations were compared to previously reported patients with the same mutation. Heterozygous c.260A > G mutation in SOD1 that causes Asn86Ser was identified in the proband. Age at onset was 34 years and site of the first presentation was in the lower extremities. Comparisons of clinical features of different ALS patients with the same mutation evidenced variable presentations. The c.260A > G mutation in SOD1 that causes Asn86Ser appears to cause ALS with variable clinical presentations.
Identification of a Novel HADHB Gene Mutation in an Iranian Patient with Mitochondrial Trifunctional Protein Deficiency.

PubMed

Shahrokhi, Mahdiyeh; Shafiei, Mohammad; Galehdari, Hamid; Shariati, Gholamreza

2017-01-01

Mitochondrial trifunctional protein (MTP) is a hetero-octamer composed of eight parts (subunits): four α-subunits containing LCEH (long-chain 2,3-enoyl-CoA hydratase) and LCHAD (long-chain 3-hydroxyacyl CoA dehydrogenase) activity, and four β-subunits that possess LCKT (long-chain 3-ketoacyl-CoA thiolase) activity which catalyzes three out of four steps in β-oxidation spiral of long-chain fatty acid. Its deficiency is an autosomal recessive disorder that causes a clinical spectrum of diseases. A blood spot was collected from the patient's original newborn screening card with parental informed consent. A newborn screening test and quantity plasma acylcarnitine profile analysis by MS/MS were performed. After isolation of DNA and Amplification of all exons of the HADHA and HADHB, directly Sequence analyses of all exons and the flanking introns both of genes were performed. Here, we report a novel mutation in a patient with MTP deﬁciency diagnosed with newborn screening test and quantity plasma acylcarnitine profile analysis by MS/MS and then confirmed by enzyme analysis in cultured fibroblasts and direct sequencing of the HADHA and HADHB genes. Molecular analysis of causative genes showed a missense mutation (p.Q385P) c.1154A > C in exon 14 of HADHB gene. Since this mutation was not found in 50 normal control cases; so it was concluded that c.1154A > C mutation was a causative mutation. Phenotype analysis of this mutation predicted pathogenesis which reduces the stability of the MTP protein complex.
Divergence of Gene Body DNA Methylation and Evolution of Plant Duplicate Genes

PubMed Central

Wang, Jun; Marowsky, Nicholas C.; Fan, Chuanzhu

2014-01-01

It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes. PMID:25310342
Genome-wide DNase hypersensitivity, and occupancy of RUNX2 and CTCF reveal a highly dynamic gene regulome during MC3T3 pre-osteoblast differentiation.

PubMed

Tai, Phillip W L; Wu, Hai; van Wijnen, André J; Stein, Gary S; Stein, Janet L; Lian, Jane B

2017-01-01

The ability to discover regulatory sequences that control bone-related genes during development has been greatly improved by massively parallel sequencing methodologies. To expand our understanding of cis-regulatory regions critical to the control of gene expression during osteoblastogenesis, we probed the presence of open chromatin states across the osteoblast genome using global DNase hypersensitivity (DHS) mapping. Our profiling of MC3T3 mouse pre-osteoblasts during differentiation has identified more than 224,000 unique DHS sites. Approximately 65% of these sites are dynamic during temporal stages of osteoblastogenesis, and a majority of them are located within non-promoter (intergenic and intronic) regions. Nearly half of all DHS sites (both constitutive and dynamic) overlap binding events of the bone-essential RUNX2 and/or the chromatin-related CTCF transcription factors. This finding reinforces the role of these regulatory proteins as essential components of the bone gene regulome. We observe a reduction in chromatin accessibility throughout the genome between pre-osteoblast and early osteoblasts. Our analysis also defined a class of differentially expressed genes that harbor DHS peaks centered within 1 kb downstream of transcriptional end sites (TES). These DHSs at the 3'-flanks of genes exhibit dynamic changes during differentiation that may impact regulation of the osteoblast genome. Taken together, the distribution of DHS regions within non-promoter locations harboring osteoblast and chromatin related transcription factor binding motifs, reflect novel cis-regulatory requirements to support temporal gene expression in differentiating osteoblasts.

Genotype-phenotype correlation of xeroderma pigmentosum in a Chinese Han population.

PubMed

Sun, Z; Zhang, J; Guo, Y; Ni, C; Liang, J; Cheng, R; Li, M; Yao, Z

2015-04-01

Xeroderma pigmentosum (XP) is a rare autosomal recessive disorder characterized by extreme sensitivity to sunlight, freckle-like pigmentation and a greatly increased incidence of skin cancers. Genetic mutation detection and genotype-phenotype analysis of XP are rarely reported in the Chinese Han population. To investigate the mutational spectrum of XP in a Chinese Han population, to discover any genotype-phenotype correlation and, consequently, to propose a simple and effective tool for the molecular diagnosis of XP. This study was carried out on 12 unrelated Chinese families that included 13 patients with clinically suspected XP. Genomic DNA was extracted from peripheral blood samples. Mutation screening was performed by direct sequencing of exons and flanking intron-exon boundaries for the entire coding region of eight XP genes. In 12 patients, direct sequencing of the whole coding region of eight XP genes revealed pathogenic mutations, including seven compound heterozygous mutations, three homozygous mutations and a Japanese founder mutation. Thirteen mutations have not been previously identified. This cohort was composed of four patients with XP-C (XPC), two with XP-G (ERCC5), three with XP-A (XPA) and three with XP-V (POLH). This study identified 13 novel mutations and extended the mutation spectrum of XP in the Chinese Han population. In this cohort, we found that patients with XP-G have no neurological symptoms, and patients with XP-A and XP-V have a high incidence of malignancy. Furthermore, lack of stringent protection against sunlight, late diagnosis and long duration of disease play an important role. © 2014 British Association of Dermatologists.
Genome-Wide Classification and Evolutionary and Expression Analyses of Citrus MYB Transcription Factor Families in Sweet Orange

PubMed Central

Hou, Xiao-Jin; Li, Si-Bei; Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

2014-01-01

MYB family genes are widely distributed in plants and comprise one of the largest transcription factors involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed region of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis forcitrus. PMID:25375352
[Study of gene mutation in 62 hemophilia A children].

PubMed

Hu, Q; Liu, A G; Zhang, L Q; Zhang, A; Wang, Y Q; Wang, S M; Lu, Y J; Wang, X

2017-11-02

Objective: To analyze the mutation type of FⅧ gene in children with hemophilia A and to explore the relationship among hemophilia gene mutation spectrum, gene mutation and clinical phenotype. Method: Sixty-two children with hemophilia A from Department of Pediatric Hematology, Tongji Hospital of Tongji Medical College, Huazhong University of Science and Technology between January 2015 and March 2017 were enrolled. All patients were male, aged from 4 months to 7 years and F Ⅷ activity ranged 0.2%-11.0%. Fifty cases had severe, 10 cases had moderate and 2 cases had mild hemophilia A. DNA was isolated from peripheral blood in hemophilia A children and the target gene fragment was amplified by PCR, in combination with the second generation sequencing, 22 and 1 introns were detected. Negative cases were detected by the second generation sequencing and results were compared with those of the international FⅧ gene mutation database. Result: There were 20 cases (32%) of intron 22 inversion, 2 cases (3%) of intron 1 inversion, 18 cases (29%) of missense mutation, 5 cases (8%) of nonsense mutation, 7 cases (11%) of deletion mutation, 1 case(2%)of splice site mutation, 2 cases (3%) of large fragment deletion and 1 case of insertion mutation (2%). No mutation was detected in 2 cases (3%), and 4 cases (7%) failed to amplify. The correlation between phenotype and genotype showed that the most common gene mutation in severe hemophilia A was intron 22 inversion (20 cases), accounting for 40% of severe patients, followed by 11 cases of missense mutation (22%). The most common mutation in moderate hemophilia A was missense mutation (6 cases), accounting for 60% of moderate patients. Conclusion: The most frequent mutation type in hemophilia A was intron 22 inversion, followed by missense mutation, again for missing mutation. The relationship between phenotype and genotype: the most frequent gene mutation in severe hemophilia A is intron 22 inversion, followed by missense mutation; the most frequent gene mutation in medium hemophilia A is missense mutation.
Gene Deletion in Barley Mediated by LTR-retrotransposon BARE

PubMed Central

Shang, Yi; Yang, Fei; Schulman, Alan H.; Zhu, Jinghuan; Jia, Yong; Wang, Junmei; Zhang, Xiao-Qi; Jia, Qiaojun; Hua, Wei; Yang, Jianming; Li, Chengdao

2017-01-01

A poly-row branched spike (prbs) barley mutant was obtained from soaking a two-rowed barley inflorescence in a solution of maize genomic DNA. Positional cloning and sequencing demonstrated that the prbs mutant resulted from a 28 kb deletion including the inflorescence architecture gene HvRA2. Sequence annotation revealed that the HvRA2 gene is flanked by two LTR (long terminal repeat) retrotransposons (BARE) sharing 89% sequence identity. A recombination between the integrase (IN) gene regions of the two BARE copies resulted in the formation of an intact BARE and loss of HvRA2. No maize DNA was detected in the recombination region although the flanking sequences of HvRA2 gene showed over 73% of sequence identity with repetitive sequences on 10 maize chromosomes. It is still unknown whether the interaction of retrotransposons between barley and maize has resulted in the recombination observed in the present study. PMID:28252053
EvolMarkers: a database for mining exon and intron markers for evolution, ecology and conservation studies.

PubMed

Li, Chenhong; Riethoven, Jean-Jack M; Naylor, Gavin J P

2012-09-01

Recent innovations in next-generation sequencing have lowered the cost of genome projects. Nevertheless, sequencing entire genomes for all representatives in a study remains expensive and unnecessary for most studies in ecology, evolution and conservation. It is still more cost-effective and efficient to target and sequence single-copy nuclear gene markers for such studies. Many tools have been developed for identifying nuclear markers, but most of these have focused on particular taxonomic groups. We have built a searchable database, EvolMarkers, for developing single-copy coding sequence (CDS) and exon-primed-intron-crossing (EPIC) markers that is designed to work across a broad range of phylogenetic divergences. The database is made up of single-copy CDS derived from BLAST searches of a variety of metazoan genomes. Users can search the database for different types of markers (CDS or EPIC) that are common to different sets of input species with different divergence characteristics. EvolMarkers can be applied to any taxonomic group for which genome data are available for two or more species. We included 82 genomes in the first version of EvolMarkers and have found the methods to be effective across Placozoa, Cnidaria, Arthropod, Nematoda, Annelida, Mollusca, Echinodermata, Hemichordata, Chordata and plants. We demonstrate the effectiveness of searching for CDS markers within annelids and show how to find potentially useful intronic markers within the lizard Anolis. © 2012 Blackwell Publishing Ltd.
Structural analysis of the 5{prime} region of mouse and human Huntington disease genes reveals conservation of putative promoter region and Di- and trinucleotide polymorphisms

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lin, Biaoyang; Nasir, J.; Kalchman, M.A.

1995-02-10

We have previously cloned and characterized the murine homologue of the Huntington disease (HD) gene and shown that it maps to mouse chromosome 5 within a region of conserved synteny with human chromosome 4p16.3. Here we present a detailed comparison of the sequence of the putative promoter and the organization of the 5{prime} genomic region of the murine (Hdh) and human HD genes encompassing the first five exons. We show that in this region these two genes share identical exon boundaries, but have different-size introns. Two dinucleotide (CT) and one trinucleotide intronic polymorphism in Hdh and an intronic CA polymorphismmore » in the HD gene were identified. Comparison of 940-bp sequence 5{prime} to the putative translation start site reveals a highly conserved region (78.8% nucleotide identity) between Hdh and the HD gene from nucleotide -56 to -206 (of Hdh). Neither Hdh nor the HD gene have typical TATA or CCAAT elements, but both show one putative AP2 binding site and numerous potential Sp1 binding sites. The high sequence identity between Hdh and the HD gene for approximately 200 bp 5{prime} to the putative translation start site indicates that these sequences may play a role in regulating expression of the Huntington disease gene. 30 refs., 4 figs., 2 tabs.« less
zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs.

PubMed

Parekh, Swati; Ziegenhain, Christoph; Vieth, Beate; Enard, Wolfgang; Hellmann, Ines

2018-06-01

Single-cell RNA-sequencing (scRNA-seq) experiments typically analyze hundreds or thousands of cells after amplification of the cDNA. The high throughput is made possible by the early introduction of sample-specific bar codes (BCs), and the amplification bias is alleviated by unique molecular identifiers (UMIs). Thus, the ideal analysis pipeline for scRNA-seq data needs to efficiently tabulate reads according to both BC and UMI. zUMIs is a pipeline that can handle both known and random BCs and also efficiently collapse UMIs, either just for exon mapping reads or for both exon and intron mapping reads. If BC annotation is missing, zUMIs can accurately detect intact cells from the distribution of sequencing reads. Another unique feature of zUMIs is the adaptive downsampling function that facilitates dealing with hugely varying library sizes but also allows the user to evaluate whether the library has been sequenced to saturation. To illustrate the utility of zUMIs, we analyzed a single-nucleus RNA-seq dataset and show that more than 35% of all reads map to introns. Also, we show that these intronic reads are informative about expression levels, significantly increasing the number of detected genes and improving the cluster resolution. zUMIs flexibility makes if possible to accommodate data generated with any of the major scRNA-seq protocols that use BCs and UMIs and is the most feature-rich, fast, and user-friendly pipeline to process such scRNA-seq data.
Human intron-encoded Alu RNAs are processed and packaged into Wdr79-associated nucleoplasmic box H/ACA RNPs

PubMed Central

Jády, Beáta E.; Ketele, Amandine; Kiss, Tamás

2012-01-01

Alu repetitive sequences are the most abundant short interspersed DNA elements in the human genome. Full-length Alu elements are composed of two tandem sequence monomers, the left and right Alu arms, both derived from the 7SL signal recognition particle RNA. Since Alu elements are common in protein-coding genes, they are frequently transcribed into pre-mRNAs. Here, we demonstrate that the right arms of nascent Alu transcripts synthesized within pre-mRNA introns are processed into metabolically stable small RNAs. The intron-encoded Alu RNAs, termed AluACA RNAs, are structurally highly reminiscent of box H/ACA small Cajal body (CB) RNAs (scaRNAs). They are composed of two hairpin units followed by the essential H (AnAnnA) and ACA box motifs. The mature AluACA RNAs associate with the four H/ACA core proteins: dyskerin, Nop10, Nhp2, and Gar1. Moreover, the 3′ hairpin of AluACA RNAs carries two closely spaced CB localization motifs, CAB boxes (UGAG), which bind Wdr79 in a cumulative fashion. In contrast to canonical H/ACA scaRNPs, which concentrate in CBs, the AluACA RNPs accumulate in the nucleoplasm. Identification of 348 human AluACA RNAs demonstrates that intron-encoded AluACA RNAs represent a novel, large subgroup of H/ACA RNAs, which are apparently confined to human or primate cells. PMID:22892240
Partial androgen insensitivity syndrome caused by a deep intronic mutation creating an alternative splice acceptor site of the AR gene.

PubMed

Ono, Hiroyuki; Saitsu, Hirotomo; Horikawa, Reiko; Nakashima, Shinichi; Ohkubo, Yumiko; Yanagi, Kumiko; Nakabayashi, Kazuhiko; Fukami, Maki; Fujisawa, Yasuko; Ogata, Tsutomu

2018-02-02

Although partial androgen insensitivity syndrome (PAIS) is caused by attenuated responsiveness to androgens, androgen receptor gene (AR) mutations on the coding regions and their splice sites have been identified only in <25% of patients with a diagnosis of PAIS. We performed extensive molecular studies including whole exome sequencing in a Japanese family with PAIS, identifying a deep intronic variant beyond the branch site at intron 6 of AR (NM_000044.4:c.2450-42 G > A). This variant created the splice acceptor motif that was accompanied by pyrimidine-rich sequence and two candidate branch sites. Consistent with this, reverse transcriptase (RT)-PCR experiments for cycloheximide-treated lymphoblastoid cell lines revealed a relatively large amount of aberrant mRNA produced by the newly created splice acceptor site and a relatively small amount of wildtype mRNA produced by the normal splice acceptor site. Furthermore, most of the aberrant mRNA was shown to undergo nonsense mediated decay (NMD) and, if a small amount of aberrant mRNA may have escaped NMD, such mRNA was predicted to generate a truncated AR protein missing some functional domains. These findings imply that the deep intronic mutation creating an alternative splice acceptor site resulted in the production of a relatively small amount of wildtype AR mRNA, leading to PAIS.
Targeted Deep Resequencing Identifies Coding Variants in the PEAR1 Gene That Play a Role in Platelet Aggregation

PubMed Central

Kim, Yoonhee; Suktitipat, Bhoom; Yanek, Lisa R.; Faraday, Nauder; Wilson, Alexander F.; Becker, Diane M.; Becker, Lewis C.; Mathias, Rasika A.

2013-01-01

Platelet aggregation is heritable, and genome-wide association studies have detected strong associations with a common intronic variant of the platelet endothelial aggregation receptor1 (PEAR1) gene both in African American and European American individuals. In this study, we used a sequencing approach to identify additional exonic variants in PEAR1 that may also determine variability in platelet aggregation in the GeneSTAR Study. A 0.3 Mb targeted region on chromosome 1q23.1 including the entire PEAR1 gene was Sanger sequenced in 104 subjects (45% male, 49% African American, age = 52±13) selected on the basis of hyper- and hypo- aggregation across three different agonists (collagen, epinephrine, and adenosine diphosphate). Single-variant and multi-variant burden tests for association were performed. Of the 235 variants identified through sequencing, 61 were novel, and three of these were missense variants. More rare variants (MAF<5%) were noted in African Americans compared to European Americans (108 vs. 45). The common intronic GWAS-identified variant (rs12041331) demonstrated the most significant association signal in African Americans (p = 4.020×10−4); no association was seen for additional exonic variants in this group. In contrast, multi-variant burden tests indicated that exonic variants play a more significant role in European Americans (p = 0.0099 for the collective coding variants compared to p = 0.0565 for intronic variant rs12041331). Imputation of the individual exonic variants in the rest of the GeneSTAR European American cohort (N = 1,965) supports the results noted in the sequenced discovery sample: p = 3.56×10−4, 2.27×10−7, 5.20×10−5 for coding synonymous variant rs56260937 and collagen, epinephrine and adenosine diphosphate induced platelet aggregation, respectively. Sequencing approaches confirm that a common intronic variant has the strongest association with platelet aggregation in African Americans, and show that exonic variants play an additional role in platelet aggregation in European Americans. PMID:23704978
Strong Signature of Natural Selection within an FHIT Intron Implicated in Prostate Cancer Risk

PubMed Central

Ding, Yan; Larson, Garrett; Rivas, Guillermo; Lundberg, Cathryn; Geller, Louis; Ouyang, Ching; Weitzel, Jeffrey; Archambeau, John; Slater, Jerry; Daly, Mary B.; Benson, Al B.; Kirkwood, John M.; O'Dwyer, Peter J.; Sutphen, Rebecca; Stewart, James A.; Johnson, David; Nordborg, Magnus; Krontiris, Theodore G.

2008-01-01

Previously, a candidate gene linkage approach on brother pairs affected with prostate cancer identified a locus of prostate cancer susceptibility at D3S1234 within the fragile histidine triad gene (FHIT), a tumor suppressor that induces apoptosis. Subsequent association tests on 16 SNPs spanning approximately 381 kb surrounding D3S1234 in Americans of European descent revealed significant evidence of association for a single SNP within intron 5 of FHIT. In the current study, re-sequencing and genotyping within a 28.5 kb region surrounding this SNP further delineated the association with prostate cancer risk to a 15 kb region. Multiple SNPs in sequences under evolutionary constraint within intron 5 of FHIT defined several related haplotypes with an increased risk of prostate cancer in European-Americans. Strong associations were detected for a risk haplotype defined by SNPs 138543, 142413, and 152494 in all cases (Pearson's χ2 = 12.34, df 1, P = 0.00045) and for the homozygous risk haplotype defined by SNPs 144716, 142413, and 148444 in cases that shared 2 alleles identical by descent with their affected brothers (Pearson's χ2 = 11.50, df 1, P = 0.00070). In addition to highly conserved sequences encompassing SNPs 148444 and 152413, population studies revealed strong signatures of natural selection for a 1 kb window covering the SNP 144716 in two human populations, the European American (π = 0.0072, Tajima's D = 3.31, 14 SNPs) and the Japanese (π = 0.0049, Fay & Wu's H = 8.05, 14 SNPs), as well as in chimpanzees (Fay & Wu's H = 8.62, 12 SNPs). These results strongly support the involvement of the FHIT intronic region in an increased risk of prostate cancer. PMID:18953408
Deep developmental transcriptome sequencing uncovers numerous new genes and enhances gene annotation in the sponge Amphimedon queenslandica.

PubMed

Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M

2015-05-15

The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.
Mitochondrion-to-Chloroplast DNA Transfers and Intragenomic Proliferation of Chloroplast Group II Introns in Gloeotilopsis Green Algae (Ulotrichales, Ulvophyceae).

PubMed

Turmel, Monique; Otis, Christian; Lemieux, Claude

2016-09-19

To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G planctonica and 262,888-bp G sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Mitochondrion-to-Chloroplast DNA Transfers and Intragenomic Proliferation of Chloroplast Group II Introns in Gloeotilopsis Green Algae (Ulotrichales, Ulvophyceae)

PubMed Central

Turmel, Monique; Otis, Christian; Lemieux, Claude

2016-01-01

Abstract To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G. planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G. planctonica and 262,888-bp G. sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G. sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. PMID:27503298
Genomic organization, sequence characterization and expression analysis of Tenebrio molitor apolipophorin-III in response to an intracellular pathogen, Listeria monocytogenes.

PubMed

Noh, Ju Young; Patnaik, Bharat Bhusan; Tindwa, Hamisi; Seo, Gi Won; Kim, Dong Hyun; Patnaik, Hongray Howrelia; Jo, Yong Hun; Lee, Yong Seok; Lee, Bok Luel; Kim, Nam Jung; Han, Yeon Soo

2014-01-25

Apolipophorin III (apoLp-III) is a well-known hemolymph protein having a functional role in lipid transport and immune response of insects. We cloned full-length cDNA encoding putative apoLp-III from larvae of the coleopteran beetle, Tenebrio molitor (TmapoLp-III), by identification of clones corresponding to the partial sequence of TmapoLp-III, subsequently followed with full length sequencing by a clone-by-clone primer walking method. The complete cDNA consists of 890 nucleotides, including an ORF encoding 196 amino acid residues. Excluding a putative signal peptide of the first 20 amino acid residues, the 176-residue mature apoLp-III has a calculated molecular mass of 19,146Da. Genomic sequence analysis with respect to its cDNA showed that TmapoLp-III was organized into four exons interrupted by three introns. Several immune-related transcription factor binding sites were discovered in the putative 5'-flanking region. BLAST and phylogenetic analyses reveal that TmapoLp-III has high sequence identity (88%) with Tribolium castaneum apoLp-III but shares little sequence homologies (<26%) with other apoLp-IIIs. Homology modeling of Tm apoLp-III shows a bundle of five amphipathic alpha helices, including a short helix 3'. The 'helix-short helix-helix' motif was predicted to be implicated in lipid binding interactions, through reversible conformational changes and accommodating the hydrophobic residues to the exterior for stability. Highest level of TmapoLp-III mRNA was detected at late pupal stages, albeit it is expressed in the larval and adult stages at lower levels. The tissue specific expression of the transcripts showed significantly higher numbers in larval fat body and adult integument. In addition, TmapoLp-III mRNA was found to be highly upregulated in late stages of L. monocytogenes or E. coli challenge. These results indicate that TmapoLp-III may play an important role in innate immune responses against bacterial pathogens in T. molitor. Copyright © 2013 Elsevier B.V. All rights reserved.
Variations of Human DNA Polymerase Genes as Biomarkers of Prostate Cancer Progression

DTIC Science & Technology

2011-07-01

Forward sequence Reverse sequence Sequence contextb 1 g.39835C4Tc P169S 15 25 gTG GGG TC CTT g.39897C4T Intronic 22 15 AGA T GGt TA AAT g.39985T4C...Intronic 34 25 AGA TT tAA AAG g.40051C4Tc P184S 19 34 TGt CT GGA ATT 4 g.39835C4Tc P169S 19 29 gTG GGG TC CTT g.40051C4Tc P184S 23 34 TGt CT GGA ATT 6 g...39835C4Tc P169S 14 24 gTG GGG TC CTT g.40051C4Tc P184S 21 32 TGt CT GGA ATT 11 g.40055A4G D185G 28 35 TTC C AGA C AAG g.40073A4G Y191C 28 20 gGA T AtG CC
Chromosomal localization and partial genomic structure of the human peroxisome proliferator activated receptor-gamma (hPPAR gamma) gene.

PubMed

Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R

1997-04-28

We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.
Exon Shuffling and Origin of Scorpion Venom Biodiversity

PubMed Central

Wang, Xueli; Gao, Bin; Zhu, Shunyi

2016-01-01

Scorpion venom is a complex combinatorial library of peptides and proteins with multiple biological functions. A combination of transcriptomic and proteomic techniques has revealed its enormous molecular diversity, as identified by the presence of a large number of ion channel-targeted neurotoxins with different folds, membrane-active antimicrobial peptides, proteases, and protease inhibitors. Although the biodiversity of scorpion venom has long been known, how it arises remains unsolved. In this work, we analyzed the exon-intron structures of an array of scorpion venom protein-encoding genes and unexpectedly found that nearly all of these genes possess a phase-1 intron (one intron located between the first and second nucleotides of a codon) near the cleavage site of a signal sequence despite their mature peptides remarkably differ. This observation matches a theory of exon shuffling in the origin of new genes and suggests that recruitment of different folds into scorpion venom might be achieved via shuffling between body protein-coding genes and ancestral venom gland-specific genes that presumably contributed tissue-specific regulatory elements and secretory signal sequences. PMID:28035955
Exon Shuffling and Origin of Scorpion Venom Biodiversity.

PubMed

Wang, Xueli; Gao, Bin; Zhu, Shunyi

2016-12-26

Scorpion venom is a complex combinatorial library of peptides and proteins with multiple biological functions. A combination of transcriptomic and proteomic techniques has revealed its enormous molecular diversity, as identified by the presence of a large number of ion channel-targeted neurotoxins with different folds, membrane-active antimicrobial peptides, proteases, and protease inhibitors. Although the biodiversity of scorpion venom has long been known, how it arises remains unsolved. In this work, we analyzed the exon-intron structures of an array of scorpion venom protein-encoding genes and unexpectedly found that nearly all of these genes possess a phase-1 intron (one intron located between the first and second nucleotides of a codon) near the cleavage site of a signal sequence despite their mature peptides remarkably differ. This observation matches a theory of exon shuffling in the origin of new genes and suggests that recruitment of different folds into scorpion venom might be achieved via shuffling between body protein-coding genes and ancestral venom gland-specific genes that presumably contributed tissue-specific regulatory elements and secretory signal sequences.
Genomic organization and expression of the human MSH3 gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Watanabe, Atsushi; Ikejima, Miyoko; Suzuki, Noriko

1996-02-01

We have studied the expression and genomic organization of the human MSH3 gene, which encodes a human homologue of the bacterial DNA mismatch repair protein MutS. This gene is located upstream of the dihydrofolate reductase (DHFR) gene. Northern analysis has demonstrated that the hMSH3 gene is expressed in a variety of human tissues at low levels, like the DHFR gene. Characterization of cosmid clones has shown that the hMSH3 gene consists of 24 exons spanning at least 160 kb. All exon-intron junction sequences match the classical GT/AG rule, except that intron 6 has AT and AA at the ends. Twomore » major transcripts of 5.0 and 3.8 kb have been shown to be derived from the differential use of two polyadenylation sites. Elucidation of the complete genomic organization and the nucleotide sequences of the introns of the hMSH3 gene should be useful for studying the function of this gene and the possible involvement of specific mutations of the hMSH3 gene in some diseases. 34 refs., 5 figs., 1 tab.« less

Some links on this page may take you to non-federal websites. Their policies may differ from this site.