Alu repeats: A source for the genesis of primate microsatellites
DOE Office of Scientific and Technical Information (OSTI.GOV)
Arcot, S.S.; Batzer, M.A.; Wang, Zhenyuan
1995-09-01
As a result of their abundance, relatively uniform distribution, and high degree of polymorphism, microsatellites and minisatellites have become valuable tools in genetic mapping, forensic identity testing, and population studies. In recent years, a number of microsatellite repeats have been found to be associated with Alu interspersed repeated DNA elements. The association of an Alu element with a microsatellite repeat could result from the integration of an Alu element within a preexisting microsatellite repeat. Alternatively, Alu elements could have a direct role in the origin of microsatellite repeats. Errors introduced during reverse transcription of the primary transcript derived from an Alu "master" gene or the accumulation of random mutations in the middle A-rich regions and oligo(dA)-rich tails of Alu elements after insertion and subsequent expansion and contraction of these sequences could result in the genesis of a microsatellite repeat. We have tested these hypotheses by a direct evolutionary comparison of the sequences of some recent Alu elements that are found only in humans and are absent from nonhuman primates, as well as some older Alu elements that are present at orthologous positions in a number of nonhuman primates. The origin of "young" Alu insertions, absence of sequences that resemble microsatellite repeats at the orthologous loci in chimpanzees, and the gradual expansion of microsatellite repeats in some old Alu repeats at orthologous positions within the genomes of a number of nonhuman primates suggest that Alu elements are a source for the genesis of primate microsatellite repeats. 48 refs., 5 figs., 3 tabs.
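To make the idea of a microsatellite arising in an A-rich tail concrete, the following is a minimal sketch of how simple tandem repeats could be located in a DNA string. It is an illustration only, not the method used in the study above; the function name `find_simple_repeats` and the thresholds `max_unit` and `min_copies` are assumptions chosen for the example.

```python
import re

def find_simple_repeats(seq, max_unit=6, min_copies=4):
    """Return (unit, copies, start) for each tandem run of a short unit.

    max_unit and min_copies are illustrative thresholds (assumptions),
    not values taken from the abstract.
    """
    # Lazy unit of 1..max_unit bases, followed by at least
    # (min_copies - 1) further exact copies of that same unit.
    pattern = re.compile(r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_copies - 1))
    return [
        (m.group(1), len(m.group(0)) // len(m.group(1)), m.start())
        for m in pattern.finditer(seq.upper())
    ]

# Hypothetical sequence: an oligo(dA) run and a (CA)n run in unique flanks.
example = "GGCC" + "A" * 8 + "TT" + "CA" * 5 + "GG"
repeats = find_simple_repeats(example)
print(repeats)  # [('A', 8, 4), ('CA', 5, 14)]
```

Only perfect repeats are reported here. Real microsatellite scans usually tolerate interruptions, which this sketch deliberately leaves out.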
Alu repeated DNAs are differentially methylated in primate germ cells.
Rubin, C M; VandeVoort, C A; Teplitz, R L; Schmid, C W
1994-01-01
A significant fraction of Alu repeats in human sperm DNA, previously found to be unmethylated, is nearly completely methylated in DNA from many somatic tissues. A similar fraction of unmethylated Alus is observed here in sperm DNA from rhesus monkey. However, Alus are almost completely methylated at the restriction sites tested in monkey follicular oocyte DNA. The Alu methylation patterns in mature male and female monkey germ cells are consistent with Alu methylation in human germ cell tumors. Alu sequences are hypomethylated in seminoma DNAs and more methylated in a human ovarian dysgerminoma. These results contrast with methylation patterns reported for germ cell single-copy, CpG island, satellite, and L1 sequences. The function of Alu repeats is not known, but differential methylation of Alu repeats in the male and female germ lines suggests that they may serve as markers for genomic imprinting or in maintaining differences in male and female meiosis. PMID:7800508
Fitzpatrick, Terry; Huang, Sui
2012-01-01
Alu repeats within human genes may potentially alter gene expression. Here, we show that 3′-UTR-located inverted Alu repeats significantly reduce expression of an AcGFP reporter gene. Mutational analysis demonstrates that the secondary structure, but not the primary nucleotide sequence, of the inverted Alu repeats is critical for repression. The expression levels and nucleocytoplasmic distribution of reporter mRNAs with or without 3′-UTR inverted Alu repeats are similar, suggesting that reporter gene repression is not due to changes in mRNA levels or mRNA nuclear sequestration. Instead, reporter gene mRNAs harboring 3′-UTR inverted Alu repeats accumulate in cytoplasmic stress granules. These findings suggest a novel mechanism whereby 3′-UTR-located inverted Alu repeats regulate human gene expression through sequestration of mRNAs within stress granules. PMID:22688648
DOE Office of Scientific and Technical Information (OSTI.GOV)
Onda, M.; Kudo, S.; Fukuda, M.
Human glycophorin A, B, and E (GPA, GPB, and GPE) genes belong to a gene family located on the long arm of chromosome 4. These three genes are homologous from the 5'-flanking sequence to the Alu sequence, which is 1 kb downstream from the exon encoding the transmembrane domain. Analysis of the Alu sequence and flanking direct repeat sequences suggested that the GPA gene most closely resembles the ancestral gene, whereas the GPB and GPE genes arose by homologous recombination within the Alu sequence, acquiring 3' sequences from an unrelated precursor genomic segment. Here the authors describe the identification of this putative precursor genomic segment. A human genomic library was screened by using the sequence of the 3' region of the GPB gene as a probe. The genomic clones isolated were found to contain an Alu sequence that appeared to be involved in the recombination. Downstream from the Alu sequence, the nucleotide sequence of the precursor genomic segment is almost identical to that of the GPB or GPE gene. In contrast, the upstream sequence of the genomic segment differs entirely from that of the GPA, GPB, and GPE genes. Conservation of the direct repeats flanking the Alu sequence of the genomic segment strongly suggests that the sequence of this genomic segment has been maintained during evolution. This identified genomic segment was found to reside downstream from the GPA gene by both gene mapping and in situ chromosomal localization. The precursor genomic segment was also identified in the orangutan genome, which is known to lack GPB and GPE genes. These results indicate that one of the duplicated ancestral glycophorin genes acquired a unique 3' sequence by unequal crossing-over through its Alu sequence and the further downstream Alu sequence present in the duplicated gene. Further duplication and divergence of this gene yielded the GPB and GPE genes. 37 refs., 5 figs.
2010-01-01
Background Adenosine to inosine (A-to-I) RNA-editing is an essential post-transcriptional mechanism that occurs in numerous sites in the human transcriptome, mainly within Alu repeats. Editing levels have been shown to be consistent across individuals for a few targets in the human brain and to be altered in several human pathologies. However, the variability across human individuals of editing levels in other tissues has not been studied so far. Results Here, we analyzed 32 skin samples, looking at A-to-I editing level in three genes within coding sequences and in the Alu repeats of six different genes. We observed highly consistent editing levels across different individuals as well as across tissues, not only in coding targets but, surprisingly, also in the non-evolutionarily conserved Alu repeats. Conclusions Our findings suggest that A-to-I RNA-editing of Alu elements is a tightly regulated process and, as such, might have been recruited in the course of primate evolution for post-transcriptional regulatory mechanisms. PMID:21029430
Alu repeat discovery and characterization within human genomes
Hormozdiari, Fereydoun; Alkan, Can; Ventura, Mario; Hajirasouliha, Iman; Malig, Maika; Hach, Faraz; Yorukoglu, Deniz; Dao, Phuong; Bakhshi, Marzieh; Sahinalp, S. Cenk; Eichler, Evan E.
2011-01-01
Human genomes are now being rapidly sequenced, but not all forms of genetic variation are routinely characterized. In this study, we focus on Alu retrotransposition events and seek to characterize differences in the pattern of mobile insertion between individuals based on the analysis of eight human genomes sequenced using next-generation sequencing. Applying a rapid read-pair analysis algorithm, we discover 4342 Alu insertions not found in the human reference genome and show that 98% of a selected subset (63/64) experimentally validate. Of these new insertions, 89% correspond to AluY elements, suggesting that they arose by retrotransposition. Eighty percent of the Alu insertions have not been previously reported and more novel events were detected in Africans when compared with non-African samples (76% vs. 69%). Using these data, we develop an experimental and computational screen to identify ancestry informative Alu retrotransposition events among different human populations. PMID:21131385
RNA editing of non-coding RNA and its role in gene regulation.
Daniel, Chammiran; Lagergren, Jens; Öhman, Marie
2015-10-01
It has long been known that repetitive elements, particularly Alu sequences in human, are edited by the adenosine deaminases acting on RNA (ADAR) family. The functional interpretation of these events has been even more difficult than that of editing events in coding sequences, but today there is an emerging understanding of their downstream effects. A surprisingly large fraction of the human transcriptome contains inverted Alu repeats, often forming long double stranded structures in RNA transcripts, typically occurring in introns and UTRs of protein coding genes. Alu repeats are also common in other primates, and similar inverted repeats can frequently be found in non-primates, although the latter are less prone to duplex formation. In human, as many as 700,000 Alu elements have been identified as substrates for RNA editing, of which many are edited at several sites. In fact, recent advancements in transcriptome sequencing techniques and bioinformatics have revealed that the human editome comprises at least a hundred million adenosine to inosine (A-to-I) editing sites in Alu sequences. Although substantial additional efforts are required in order to map the editome, already present knowledge provides an excellent starting point for studying cis-regulation of editing. In this review, we will focus on editing of long stem loop structures in the human transcriptome and how it can affect gene expression. Copyright © 2015 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kass, D.H.; Batzer, M.A.; Deininger, P.L.
The Alu repetitive family of short interspersed elements (SINEs) in primates can be subdivided into distinct subfamilies by specific diagnostic nucleotide changes. The older subfamilies are generally very abundant, while the younger subfamilies have fewer copies. Some of the youngest Alu elements are absent in the orthologous loci of nonhuman primates, indicative of recent retroposition events, the primary mode of SINE evolution. PCR analysis of one young Alu subfamily (Sb2) member found in the low-density lipoprotein receptor gene apparently revealed the presence of this element in the green monkey, orangutan, gorilla, and chimpanzee genomes, as well as the human genome. However, sequence analysis of these genomes revealed that a highly mutated, older, primate-specific Alu element was present at this position in the nonhuman primates. Comparison of the flanking DNA sequences upstream of this Alu insertion corresponded to evolution expected for standard primate phylogeny, but comparison of the Alu repeat sequences revealed that the human element departed from this phylogeny. The change in the human sequence apparently occurred by a gene conversion event only within the Alu element itself, converting it from one of the oldest to one of the youngest Alu subfamilies. Although gene conversions of Alu elements are clearly very rare, this finding shows that such events can occur and contribute to specific cases of SINE subfamily evolution.
Alu elements shape the primate transcriptome by cis-regulation of RNA editing
2014-01-01
Background RNA editing by adenosine to inosine deamination is a widespread phenomenon, particularly frequent in the human transcriptome, largely due to the presence of inverted Alu repeats and their ability to form double-stranded structures – a requisite for ADAR editing. While several hundred thousand editing sites have been identified within these primate-specific repeats, the function of Alu-editing has yet to be elucidated. Results We show that inverted Alu repeats, expressed in the primate brain, can induce site-selective editing in cis on sites located several hundred nucleotides from the Alu elements. Furthermore, a computational analysis, based on available RNA-seq data, finds that site-selective editing occurs significantly closer to edited Alu elements than expected. These targets are poorly edited upon deletion of the editing inducers, as well as in homologous transcripts from organisms lacking Alus. Sequences surrounding sites near edited Alus in UTRs, have been subjected to a lesser extent of evolutionary selection than those far from edited Alus, indicating that their editing generally depends on cis-acting Alus. Interestingly, we find an enrichment of primate-specific editing within encoded sequence or the UTRs of zinc finger-containing transcription factors. Conclusions We propose a model whereby primate-specific editing is induced by adjacent Alu elements that function as recruitment elements for the ADAR editing enzymes. The enrichment of site-selective editing with potentially functional consequences on the expression of transcription factors indicates that editing contributes more profoundly to the transcriptomic regulation and repertoire in primates than previously thought. PMID:24485196
Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm
Glunčić, Matko; Paar, Vladimir
2013-01-01
The main feature of global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram. In this way, we obtain very fast, efficient and highly automatized repeat finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of ‘magnifying glass’ effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes). PMID:22977183
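The notion of repeats showing up as peaks at particular lengths can be illustrated with a much simpler stand-in. The sketch below is not the GRM algorithm and makes no claim about how GRM is implemented. It only counts the distances between successive occurrences of each k-mer, so that a tandem repeat of period p produces a peak at distance p. The function name `kmer_distance_histogram` and the toy sequence are assumptions made for the example.

```python
from collections import Counter

def kmer_distance_histogram(seq, k):
    """Histogram of distances between consecutive occurrences of each k-mer.

    A simplified illustration (an assumption, not the GRM method):
    a peak at distance p hints at a repeat unit of length p.
    """
    last_seen = {}       # k-mer -> index of its most recent occurrence
    histogram = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in last_seen:
            histogram[i - last_seen[kmer]] += 1
        last_seen[kmer] = i
    return histogram

# Toy input: a perfect tandem repeat with a 5-base unit.
toy = "ACGTT" * 20
hist = kmer_distance_histogram(toy, k=4)
dominant_period = hist.most_common(1)[0][0]
print(dominant_period)  # 5
```

On real chromosomes the histogram would be noisy and would need a tolerance for substitutions and indels, which the abstract says GRM handles and which this sketch does not attempt.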
Alu Sb2 subfamily is present in all higher primates but was most successfully amplified in humans
DOE Office of Scientific and Technical Information (OSTI.GOV)
Richer, C.; Zietkiewicz, E.; Labuda, D.
Alu repeats can be classified into subfamilies which amplified in primate genomes at different evolutionary time periods. A young Alu subfamily, Sb2, with a characteristic 7-nucleotide duplication at position 256, has been described in seven human loci. An Sb2 insertion found near the HD gene was unique to two HD families, indicating that Sb2 was still retropositionally active. Here, we have shown that the Sb2 insertion in the CHOL locus was similarly rare, being absent in 120 individuals of Caucasian, Oriental and Black origin. In contrast, Sb2 inserts in five other loci were found fixed (non-polymorphic), based on measurements in the same population sample, but absent from orthologous positions in higher apes. This suggests that Sb2 repeats spread relatively early in the human lineage following divergence from other primates and that these elements may be human-specific. By quantitative PCR, we investigated the presence of Sb2 sequences in different primate DNA, using one PCR primer anchored at the 5′ Alu-end and the other complementary to the duplicated Sb2-specific segment. With an Sb2-containing plasmid as a standard, we estimated the number of Sb2 repeats at 1500-1800 copies per human haploid equivalent; corresponding numbers in chimpanzee and gorilla were almost two orders of magnitude lower, while the signal observed in orangutan and gibbon DNAs was consistent with the presence of a single copy. The analysis of 22 human, 11 chimpanzee and 10 gorilla sequences indicates that the Alu Sb2 dispersed independently in these three primate lineages; gorilla consensus differs from the human Sb2 sequence by one position, while all chimpanzee repeats have their linker expanded by up to eight A-residues. Should they be thus considered as separate subfamilies? It is possible that sequence modifications with respect to the human consensus are responsible for poor retroposition of Sb2 in apes.
Tiazhelova, T V; Ivanov, D V; Makeeva, N V; Kapanadze, B I; Nikitin, E A; Semov, A B; Sangfeldt, O; Grander, D; Vorob'ev, A I; Einhorn, S; Iankovskiĭ, N K; Baranova, A V
2001-11-01
Deletions in the region located between the STS markers D13S1168 and D13S25 on chromosome 13 are the most frequent genomic changes in patients with B-cell chronic lymphocytic leukemia (B-CLL). After sequencing of this region, two novel candidate genes were identified: C13orf1 (chromosome 13 open reading frame 1) and PLCC (putative large CLL candidate). Analysis of the repeat distribution revealed two subregions differing in composition of repetitious DNA and gene organization. The interval D13S1168-D13S319 contains 131 Alu repeats accounting for 24.8% of its length, whereas the interval GCT16C05-D13S25, which is no more than 180 kb away from the former one is extremely poor in Alu repeats (4.1% of the total length). Both intervals contain almost the same amount of the LINE-type repeats L1 and L2 (20.3 and 21.24%, respectively). In the chromosomal region studied, 29 Alu repeats were found to belong to the evolutionary young subfamily Y, which is still capable of amplifying. A considerable proportion of repeats of this type with similar nucleotide sequences may contribute to the recombinational activity of the chromosomal region 13q14.3, which is responsible for its rearrangements in some tumors in humans.
Transcriptional activation of short interspersed elements by DNA-damaging agents.
Rudin, C M; Thompson, C B
2001-01-01
Short interspersed elements (SINEs), typified by the human Alu repeat, are RNA polymerase III (pol III)-transcribed sequences that replicate within the genome through an RNA intermediate. Replication of SINEs has been extensive in mammalian evolution: an estimated 5% of the human genome consists of Alu repeats. The mechanisms regulating transcription, reverse transcription, and reinsertion of SINE elements in genomic DNA are poorly understood. Here we report that expression of murine SINE transcripts of both the B1 and B2 classes is strongly upregulated after prolonged exposure to cisplatin, etoposide, or gamma radiation. A similar induction of Alu transcripts in human cells occurs under these conditions. This induction is not due to a general upregulation of pol III activity in either species. Genotoxic treatment of murine cells containing an exogenous human Alu element induced Alu transcription. Concomitant with the increased expression of SINEs, an increase in cellular reverse transcriptase was observed after exposure to these same DNA-damaging agents. These findings suggest that genomic damage may be an important activator of SINEs, and that SINE mobility may contribute to secondary malignancy after exposure to DNA-damaging chemotherapy.
Bazak, Lily; Haviv, Ami; Barak, Michal; Jacob-Hirsch, Jasmine; Deng, Patricia; Zhang, Rui; Isaacs, Farren J; Rechavi, Gideon; Li, Jin Billy; Eisenberg, Eli; Levanon, Erez Y
2014-03-01
RNA molecules transmit the information encoded in the genome and generally reflect its content. Adenosine-to-inosine (A-to-I) RNA editing by ADAR proteins converts a genomically encoded adenosine into inosine. It is known that most RNA editing in human takes place in the primate-specific Alu sequences, but the extent of this phenomenon and its effect on transcriptome diversity are not yet clear. Here, we analyzed large-scale RNA-seq data and detected ∼1.6 million editing sites. As detection sensitivity increases with sequencing coverage, we performed ultradeep sequencing of selected Alu sequences and showed that the scope of editing is much larger than anticipated. We found that virtually all adenosines within Alu repeats that form double-stranded RNA undergo A-to-I editing, although most sites exhibit editing at only low levels (<1%). Moreover, using high coverage sequencing, we observed editing of transcripts resulting from residual antisense expression, doubling the number of edited sites in the human genome. Based on bioinformatic analyses and deep targeted sequencing, we estimate that there are over 100 million human Alu RNA editing sites, located in the majority of human genes. These findings set the stage for exploring how this primate-specific massive diversification of the transcriptome is utilized.
The contribution of alu elements to mutagenic DNA double-strand break repair.
Morales, Maria E; White, Travis B; Streva, Vincent A; DeFreece, Cecily B; Hedges, Dale J; Deininger, Prescott L
2015-03-01
Alu elements make up the largest family of human mobile elements, numbering 1.1 million copies and comprising 11% of the human genome. As a consequence of evolution and genetic drift, Alu elements of various sequence divergence exist throughout the human genome. Alu/Alu recombination has been shown to cause approximately 0.5% of new human genetic diseases and contribute to extensive genomic structural variation. To begin understanding the molecular mechanisms leading to these rearrangements in mammalian cells, we constructed Alu/Alu recombination reporter cell lines containing Alu elements ranging in sequence divergence from 0%-30% that allow detection of both Alu/Alu recombination and large non-homologous end joining (NHEJ) deletions that range from 1.0 to 1.9 kb in size. Introduction of as little as 0.7% sequence divergence between Alu elements resulted in a significant reduction in recombination, which indicates even small degrees of sequence divergence reduce the efficiency of homology-directed DNA double-strand break (DSB) repair. Further reduction in recombination was observed in a sequence divergence-dependent manner for diverged Alu/Alu recombination constructs with up to 10% sequence divergence. With greater levels of sequence divergence (15%-30%), we observed a significant increase in DSB repair due to a shift from Alu/Alu recombination to variable-length NHEJ which removes sequence between the two Alu elements. This increase in NHEJ deletions depends on the presence of Alu sequence homeology (similar but not identical sequences). Analysis of recombination products revealed that Alu/Alu recombination junctions occur more frequently in the first 100 bp of the Alu element within our reporter assay, just as they do in genomic Alu/Alu recombination events. 
This is the first extensive study characterizing the influence of Alu element sequence divergence on DNA repair, which will inform predictions regarding the effect of Alu element sequence divergence on both the rate and nature of DNA repair events.
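Sequence divergence between two Alu elements, as discussed in the abstract above, can be pictured as the percentage of positions that differ between two aligned sequences. The following is a minimal sketch under that assumption. The abstract does not state how divergence was computed, and the function name `percent_divergence` and the toy sequences are hypothetical. Gaps are ignored, so both inputs must already be aligned and of equal length.

```python
def percent_divergence(seq_a, seq_b):
    """Percent of mismatched positions between two equal-length sequences.

    Assumes a gapless, pre-aligned pair (a simplification for illustration).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    mismatches = sum(a != b for a, b in zip(seq_a, seq_b))
    return 100.0 * mismatches / len(seq_a)

# Toy pair differing at one of ten positions.
divergence = percent_divergence("ACGTACGTAC", "ACGTACGTAT")
print(divergence)  # 10.0
```

By the same arithmetic, the 0.7% divergence mentioned in the abstract would correspond to roughly two mismatches across a 300 bp element, taking 300 bp as an approximate Alu length.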
Chillón, Isabel; Pyle, Anna M.
2016-01-01
LincRNA-p21 is a long intergenic non-coding RNA (lincRNA) involved in the p53-mediated stress response. We sequenced the human lincRNA-p21 (hLincRNA-p21) and found that it has a single exon that includes inverted repeat Alu elements (IRAlus). Sense and antisense Alu elements fold independently of one another into a secondary structure that is conserved in lincRNA-p21 among primates. Moreover, the structures formed by IRAlus are involved in the localization of hLincRNA-p21 in the nucleus, where hLincRNA-p21 colocalizes with paraspeckles. Our results underscore the importance of IRAlus structures for the function of hLincRNA-p21 during the stress response. PMID:27378782
Alu Elements as Novel Regulators of Gene Expression in Type 1 Diabetes Susceptibility Genes?
Kaur, Simranjeet; Pociot, Flemming
2015-07-13
Despite numerous studies implicating Alu repeat elements in various diseases, there is sparse information available with respect to the potential functional and biological roles of the repeat elements in Type 1 diabetes (T1D). Therefore, we performed a genome-wide sequence analysis of T1D candidate genes to identify embedded Alu elements within these genes. We observed significant enrichment of Alu elements within the T1D genes (p-value < 10e-16), which highlights their importance in T1D. Functional annotation of T1D genes harboring Alus revealed significant enrichment for immune-mediated processes (p-value < 10e-6). We also identified eight T1D genes harboring inverted Alus (IRAlus) within their 3' untranslated regions (UTRs) that are known to regulate the expression of host mRNAs by generating double stranded RNA duplexes. Our in silico analysis predicted the formation of duplex structures by IRAlus within the 3'UTRs of T1D genes. We propose that IRAlus might be involved in regulating the expression levels of the host T1D genes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hoefler, G.; Forstner, M.; Hulla, W.
1994-01-01
Enoyl-CoA hydratase:3-hydroxyacyl-CoA dehydrogenase bifunctional enzyme is one of the four enzymes of the peroxisomal β-oxidation pathway. Here, the authors report the full-length human cDNA sequence and the localization of the corresponding gene on chromosome 3q26.3-3q28. The cDNA sequence spans 3779 nucleotides with an open reading frame of 2169 nucleotides. The tripeptide SKL at the carboxy terminus, known to serve as a peroxisomal targeting signal, is present. DNA sequence comparison of the coding region showed an 80% homology between human and rat bifunctional enzyme cDNA. The 3′ noncoding sequence contains 117 nucleotides homologous to an Alu repeat. Based on sequence comparison, they propose that these nucleotides are a free left Alu arm with 86% homology to the Alu-J family. RNA analysis shows one band with highest intensity in liver and kidney. This cDNA will allow in-depth studies of molecular defects in patients with defective peroxisomal bifunctional enzyme. Moreover, it will also provide a means for studying the regulation of peroxisomal β-oxidation in humans. 33 refs., 5 figs.
Fukami, Maki; Dateki, Sumito; Kato, Fumiko; Hasegawa, Yukihiro; Mochizuki, Hiroshi; Horikawa, Reiko; Ogata, Tsutomu
2008-01-01
Although short-stature homeobox-containing gene (SHOX) haploinsufficiency is responsible for Léri-Weill dyschondrosteosis (LWD), the molecular defect has not been identified in approximately 20% of Japanese LWD patients. Furthermore, although high prevalence of microdeletions affecting SHOX is primarily ascribed to the presence of repeat sequences such as Alu elements around SHOX, it remains to be determined whether microdeletions are actually mediated by repeat sequences. We performed multiple ligation probe amplification (MLPA) assay in six Japanese LWD patients with apparently normal SHOX, followed by fluorescent in situ hybridization (FISH) analysis and sequencing for polymerase chain reaction (PCR) products encompassing the deletion junctions in patients with abnormal MLPA patterns. Consequently, heterozygous intragenic deletions were identified in three cases, i.e., a 5,906-bp deletion involving exons 4-5 in case 1, a 5,594-bp deletion involving exons 4-6a in case 2, and a 50,199-bp deletion involving exons 4-6b in case 3. The deletion breakpoints of cases 1 and 2 were present in nonrepeat sequences, whereas those of case 3 resided within Alu elements. The results suggest that cryptic SHOX intragenic deletions account for a small fraction of LWD and that microdeletions affecting SHOX can be generated by repeat-sequence-mediated aberrant recombinations and by nonhomologous end joining.
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity
2013-01-01
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution. PMID:23984183
DOE Office of Scientific and Technical Information (OSTI.GOV)
Panning, B.; Smiley, J.R.
1993-06-01
Alu elements are the single most abundant class of dispersed repeated sequences in the human genome, comprising 5-10% of the mass of human DNA. This report demonstrates that Ad5 infection strongly stimulates Pol III transcription of human Alu elements in HeLa and 293 cells. In contrast to the cases of Ad5-induced Pol III transcriptional activation, this process requires the E1b 58-kDa protein and the products of E4 open reading frames (ORFs) 3 and 6 in addition to the E1a 289-residue product. These findings suggest novel regulatory properties of the Ad5 E1b and E4 proteins and raise the possibility that analogous cellular trans-acting factors serve to modulate Alu expression in vivo.
Konkel, Miriam K; Walker, Jerilyn A; Hotard, Ashley B; Ranck, Megan C; Fontenot, Catherine C; Storer, Jessica; Stewart, Chip; Marth, Gabor T; Batzer, Mark A
2015-08-29
The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic "young" Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Rates and patterns of great ape retrotransposition.
Hormozdiari, Fereydoun; Konkel, Miriam K; Prado-Martinez, Javier; Chiatante, Giorgia; Herraez, Irene Hernando; Walker, Jerilyn A; Nelson, Benjamin; Alkan, Can; Sudmant, Peter H; Huddleston, John; Catacchio, Claudia R; Ko, Arthur; Malig, Maika; Baker, Carl; Marques-Bonet, Tomas; Ventura, Mario; Batzer, Mark A; Eichler, Evan E
2013-08-13
We analyzed 83 fully sequenced great ape genomes for mobile element insertions, predicting a total of 49,452 fixed and polymorphic Alu and long interspersed element 1 (L1) insertions not present in the human reference assembly and assigning each retrotransposition event to a different time point during great ape evolution. We used these homoplasy-free markers to construct a mobile element insertions-based phylogeny of humans and great apes and demonstrate their differential power to discern ape subspecies and populations. Within this context, we find a good correlation between L1 diversity and single-nucleotide polymorphism heterozygosity (r(2) = 0.65) in contrast to Alu repeats, which show little correlation (r(2) = 0.07). We estimate that the "rate" of Alu retrotransposition has differed by a factor of 15-fold in these lineages. Humans, chimpanzees, and bonobos show the highest rates of Alu accumulation--the latter two since divergence 1.5 Mya. The L1 insertion rate, in contrast, has remained relatively constant, with rates differing by less than a factor of three. We conclude that Alu retrotransposition has been the most variable form of genetic variation during recent human-great ape evolution, with increases and decreases occurring over very short periods of evolutionary time.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Paule Roth, M.; Malfroy, L.; Offer, C.
1995-07-20
Human myelin oligodendrocyte glycoprotein (MOG), a myelin component of the central nervous system, is a candidate target antigen for autoimmune-mediated demyelination. We have isolated and sequenced part of a cosmid clone that contains the entire human MOG gene. The primary nuclear transcript, extending from the putative start of transcription to the site of poly(A) addition, is 15,561 nucleotides in length. The human MOG gene contains 8 exons, separated by 7 introns; canonical intron/exon boundary sites are observed at each junction. The introns vary in size from 242 to 6484 bp and contain numerous repetitive DNA elements, including 14 Alu sequences within 3 introns. Another Alu element is located in the 3′-untranslated region of the gene. Alu sequences were classified with respect to subfamily assignment. Seven hundred sixty-three nucleotides 5′ of the transcription start and 1214 nucleotides 3′ of the poly(A) addition sites were also sequenced. The 5′-flanking region revealed the presence of several consensus sequences that could be relevant in the transcription of the MOG gene, in particular binding sites in common with other myelin gene promoters. Two polymorphic intragenic dinucleotide (CA)n and tetranucleotide (TAAA)n repeats were identified and may provide genetic marker tools for association and linkage studies. 50 refs., 3 figs., 3 tabs.
Locus-specific oligonucleotide probes increase the usefulness of inter-Alu polymorphisms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jarnik, M.; Tang, J.Q.; Korab-Laskowska, M.
1994-09-01
Most of the mapping approaches are based on single-locus codominant markers of known location. Their multiplex ratio, defined as the number of loci that can be simultaneously tested, is typically one. An increased multiplex ratio was obtained by typing anonymous polymorphisms using PCR primers anchored in ubiquitous Alu repeats. These so-called alumorphs are revealed by inter-Alu-PCR and seen as the presence or absence of an amplified band of a given length. We decided to map alumorphs and to develop locus-specific oligonucleotide (LSO) probes to facilitate their use and transfer among different laboratories. We studied the segregation of alumorphs in eight CEPH families, using two distinct Alu-primers, both directing PCR between the repeats in a tail-to-tail orientation. The segregating bands were assigned to chromosomal locations by two-point linkage analysis with CEPH markers (V6.0). They were excised from dried gels, reamplified, cloned and sequenced. The resulting LSOs were used as hybridization probes (i) to confirm chromosomal assignments in a human/hamster somatic cell hybrid panel, and (ii) to group certain allelic length variants, originally coded as separate dominant markers, into more informative codominant loci. These codominants were then placed by multipoint analysis on a microsatellite Genethon map. Finally, the LSO probes were used as polymorphic STSs, to identify by hybridization the corresponding markers among products of inter-Alu-PCR. The use of LSOs converts alumorphs into a system of non-anonymous, often multiallelic codominant markers which can be simultaneously typed, thus achieving the goal of high multiplex ratio.
Discovery of rare, diagnostic AluYb8/9 elements in diverse human populations.
Feusier, Julie; Witherspoon, David J; Scott Watkins, W; Goubert, Clément; Sasani, Thomas A; Jorde, Lynn B
2017-01-01
Polymorphic human Alu elements are excellent tools for assessing population structure, and new retrotransposition events can contribute to disease. Next-generation sequencing has greatly increased the potential to discover Alu elements in human populations, and various sequencing and bioinformatics methods have been designed to tackle the problem of detecting these highly repetitive elements. However, current techniques for Alu discovery may miss rare, polymorphic Alu elements. Combining multiple discovery approaches may provide a better profile of the polymorphic Alu mobilome. Alu Yb8/9 elements have been a focus of our recent studies as they are young subfamilies (~2.3 million years old) that contribute ~30% of recent polymorphic Alu retrotransposition events. Here, we update our ME-Scan methods for detecting Alu elements and apply these methods to discover new insertions in a large set of individuals with diverse ancestral backgrounds. We identified 5,288 putative Alu insertion events, including several hundred novel Alu Yb8/9 elements from 213 individuals from 18 diverse human populations. Hundreds of these loci were specific to continental populations, and 23 non-reference population-specific loci were validated by PCR. We provide high-quality sequence information for 68 rare Alu Yb8/9 elements, of which 11 have hallmarks of an active source element. Our subfamily distribution of rare Alu Yb8/9 elements is consistent with previous datasets, and may be representative of rare loci. We also find that while ME-Scan and low-coverage, whole-genome sequencing (WGS) detect different Alu elements in 41 1000 Genomes individuals, the two methods yield similar population structure results. Current in-silico methods for Alu discovery may miss rare, polymorphic Alu elements. Therefore, using multiple techniques can provide a more accurate profile of Alu elements in individuals and populations. 
We improved our false-negative rate as an indicator of sample quality for future ME-Scan experiments. In conclusion, we demonstrate that ME-Scan is a good supplement for next-generation sequencing methods and is well-suited for population-level analyses.
Repetitive Elements May Comprise Over Two-Thirds of the Human Genome
de Koning, A. P. Jason; Gu, Wanjun; Castoe, Todd A.; Batzer, Mark A.; Pollock, David D.
2011-01-01
Transposable elements (TEs) are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo “clouds”). We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%–69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM), to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (∼25 bp). Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed “element-specific” P-clouds (ESPs) to identify novel Alu and MIR SINE elements, and using it we identified ∼100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed. PMID:22144907
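The abstract above describes P-clouds only at a high level: it searches for clusters of high-abundance oligonucleotides that are related in sequence space. The following is a minimal sketch of that general idea, not the published P-clouds implementation. The function names, the two thresholds (`core_min`, `outer_min`), and the one-substitution neighbourhood are all assumptions chosen for illustration.

```python
from collections import Counter


def count_kmers(seq, k):
    """Count every overlapping k-mer (oligonucleotide of length k) in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def neighbors(kmer, alphabet="ACGT"):
    """Yield all k-mers that differ from kmer at exactly one position."""
    for i, base in enumerate(kmer):
        for alt in alphabet:
            if alt != base:
                yield kmer[:i] + alt + kmer[i + 1:]


def build_cloud(counts, core_min, outer_min):
    """Group abundant oligos with their observed one-substitution relatives.

    core_min and outer_min are hypothetical thresholds: an oligo seen at
    least core_min times seeds the cloud, and any one-substitution neighbour
    seen at least outer_min times is added to it.
    """
    cores = {kmer for kmer, n in counts.items() if n >= core_min}
    cloud = set(cores)
    for core in cores:
        for nb in neighbors(core):
            if counts.get(nb, 0) >= outer_min:
                cloud.add(nb)
    return cloud
```

In this sketch a genomic window would be called repeat-derived when enough of its k-mers fall inside a cloud. How that decision is actually made in P-clouds is not stated in the abstract and is left out here.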
Alu expression in human cell lines and their retrotranspositional potential.
Oler, Andrew J; Traina-Dorge, Stephen; Derbes, Rebecca S; Canella, Donatella; Cairns, Brad R; Roy-Engel, Astrid M
2012-06-20
The vast majority of the 1.1 million Alu elements are retrotranspositionally inactive, where only a few loci referred to as 'source elements' can generate new Alu insertions. The first step in identifying the active Alu sources is to determine the loci transcribed by RNA polymerase III (pol III). Previous genome-wide analyses from normal and transformed cell lines identified multiple Alu loci occupied by pol III factors, making them candidate source elements. Analysis of the data from these genome-wide studies determined that the majority of pol III-bound Alus belonged to the older subfamilies Alu S and Alu J, which varied between cell lines from 62.5% to 98.7% of the identified loci. The pol III-bound Alus were further scored for estimated retrotransposition potential (ERP) based on the absence or presence of selected sequence features associated with Alu retrotransposition capability. Our analyses indicate that most of the pol III-bound Alu loci candidates identified lack the sequence characteristics important for retrotransposition. These data suggest that Alu expression likely varies by cell type, growth conditions and transformation state. This variation could extend to where the same cell lines in different laboratories present different Alu expression patterns. The vast majority of Alu loci potentially transcribed by RNA pol III lack important sequence features for retrotransposition and the majority of potentially active Alu loci in the genome (scored high ERP) belong to young Alu subfamilies. Our observations suggest that in an in vivo scenario, the contribution of Alu activity on somatic genetic damage may significantly vary between individuals and tissues.
Dynamic Alu Methylation during Normal Development, Aging, and Tumorigenesis
Lu, Xuemei
2014-01-01
DNA methylation primarily occurs on CpG dinucleotides and plays an important role in transcriptional regulations during tissue development and cell differentiation. Over 25% of CpG dinucleotides in the human genome reside within Alu elements, the most abundant human repeats. The methylation of Alu elements is an important mechanism to suppress Alu transcription and subsequent retrotransposition. Decades of studies revealed that Alu methylation is highly dynamic during early development and aging. Recently, many environmental factors were shown to have a great impact on Alu methylation. In addition, aberrant Alu methylation has been documented to be an early event in many tumors and Alu methylation levels have been associated with tumor aggressiveness. The assessment of the Alu methylation has become an important approach for early diagnosis and/or prognosis of cancer. This review focuses on the dynamic Alu methylation during development, aging, and tumor genesis. The cause and consequence of Alu methylation changes will be discussed. PMID:25243180
Wagstaff, Bradley J; Kroutter, Emily N; Derbes, Rebecca S; Belancio, Victoria P; Roy-Engel, Astrid M
2013-01-01
Non-long terminal repeat retroelements continue to impact the human genome through cis-activity of long interspersed element-1 (LINE-1 or L1) and trans-mobilization of Alu. Current activity is dominated by modern subfamilies of these elements, leaving behind an evolutionary graveyard of extinct Alu and L1 subfamilies. Because Alu is a nonautonomous element that relies on L1 to retrotranspose, there is the possibility that competition between these elements has driven selection and antagonistic coevolution between Alu and L1. Through analysis of synonymous versus nonsynonymous codon evolution across L1 subfamilies, we find that the C-terminal ORF2 cys domain experienced a dramatic increase in amino acid substitution rate in the transition from L1PA5 to L1PA4 subfamilies. This observation coincides with the previously reported rapid evolution of ORF1 during the same transition period. Ancestral Alu sequences have been previously reconstructed, as their short size and ubiquity have made it relatively easy to retrieve consensus sequences from the human genome. In contrast, creating constructs of extinct L1 copies is a more laborious task. Here, we report our efforts to recreate and evaluate the retrotransposition capabilities of two ancestral L1 elements, L1PA4 and L1PA8 that were active ~18 and ~40 Ma, respectively. Relative to the modern L1PA1 subfamily, we find that both elements are similarly active in a cell culture retrotransposition assay in HeLa, and both are able to efficiently trans-mobilize Alu elements from several subfamilies. Although we observe some variation in Alu subfamily retrotransposition efficiency, any coevolution that may have occurred between LINEs and SINEs is not evident from these data. Population dynamics and stochastic variation in the number of active source elements likely play an important role in individual LINE or SINE subfamily amplification. 
If coevolution also contributes to changing retrotransposition rates and the progression of subfamilies, cell factors are likely to play an important mediating role in changing LINE-SINE interactions over evolutionary time. PMID:22918960
Jurka, Jerzy W.
1997-01-01
Enhanced homologous recombination is obtained by employing a consensus sequence which has been found to be associated with integration of repeat sequences, such as Alu and ID. The consensus sequence or sequence having a single transition mutation determines one site of a double break which allows for high efficiency of integration at the site. By introducing single or double stranded DNA having the consensus sequence flanking region joined to a sequence of interest, one can reproducibly direct integration of the sequence of interest at one or a limited number of sites. In this way, specific sites can be identified and homologous recombination achieved at the site by employing a second flanking sequence associated with a sequence proximal to the 3'-nick.
A-to-I editing of coding and non-coding RNAs by ADARs
Nishikura, Kazuko
2016-01-01
Adenosine deaminases acting on RNA (ADARs) convert adenosine to inosine in double-stranded RNA. This A-to-I editing occurs not only in protein-coding regions of mRNAs, but also frequently in non-coding regions that contain inverted Alu repeats. Editing of coding sequences can result in the expression of functionally altered proteins that are not encoded in the genome, whereas the significance of Alu editing remains largely unknown. Certain microRNA (miRNA) precursors are also edited, leading to reduced expression or altered function of mature miRNAs. Conversely, recent studies indicate that ADAR1 forms a complex with Dicer to promote miRNA processing, revealing a new function of ADAR1 in the regulation of RNA interference. PMID:26648264
Jády, Beáta E.; Ketele, Amandine; Kiss, Tamás
2012-01-01
Alu repetitive sequences are the most abundant short interspersed DNA elements in the human genome. Full-length Alu elements are composed of two tandem sequence monomers, the left and right Alu arms, both derived from the 7SL signal recognition particle RNA. Since Alu elements are common in protein-coding genes, they are frequently transcribed into pre-mRNAs. Here, we demonstrate that the right arms of nascent Alu transcripts synthesized within pre-mRNA introns are processed into metabolically stable small RNAs. The intron-encoded Alu RNAs, termed AluACA RNAs, are structurally highly reminiscent of box H/ACA small Cajal body (CB) RNAs (scaRNAs). They are composed of two hairpin units followed by the essential H (AnAnnA) and ACA box motifs. The mature AluACA RNAs associate with the four H/ACA core proteins: dyskerin, Nop10, Nhp2, and Gar1. Moreover, the 3′ hairpin of AluACA RNAs carries two closely spaced CB localization motifs, CAB boxes (UGAG), which bind Wdr79 in a cumulative fashion. In contrast to canonical H/ACA scaRNPs, which concentrate in CBs, the AluACA RNPs accumulate in the nucleoplasm. Identification of 348 human AluACA RNAs demonstrates that intron-encoded AluACA RNAs represent a novel, large subgroup of H/ACA RNAs, which are apparently confined to human or primate cells. PMID:22892240
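The H box consensus (AnAnnA) and the ACA box named in the abstract above are simple sequence patterns, so candidate positions can be located with a regular expression. The sketch below only reports where the patterns occur in an RNA string. It does not check the hairpin structure or the spacing that a real H/ACA RNA requires, and the function names are hypothetical.

```python
import re

# H box consensus from the abstract: A n A n n A, where n is any nucleotide.
# The lookahead makes overlapping candidates visible as well.
H_BOX = re.compile(r"(?=(A[ACGU]A[ACGU]{2}A))")
ACA_BOX = re.compile(r"(?=(ACA))")


def find_h_boxes(rna):
    """Return (start, motif) for every H box candidate in an RNA string."""
    return [(m.start(), m.group(1)) for m in H_BOX.finditer(rna.upper())]


def find_aca_boxes(rna):
    """Return the start index of every ACA trinucleotide in an RNA string."""
    return [m.start() for m in ACA_BOX.finditer(rna.upper())]
```

A sequence-only scan like this will over-report, because short motifs occur by chance. It is meant as a first filter before any structural check.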
The role of recombination in the origin and evolution of Alu subfamilies.
Teixeira-Silva, Ana; Silva, Raquel M; Carneiro, João; Amorim, António; Azevedo, Luísa
2013-01-01
Alus are the most abundant and successful short interspersed nuclear elements found in primate genomes. In humans, they represent about 10% of the genome, although few are retrotransposition-competent and are clustered into subfamilies according to the source gene from which they evolved. Recombination between them can lead to genomic rearrangements of clinical and evolutionary significance. In this study, we have addressed the role of recombination in the origin of chimeric Alu source genes by the analysis of all known consensus sequences of human Alus. From the allelic diversity of Alu consensus sequences, validated in extant elements resulting from whole genome searches, distinct events of recombination were detected in the origin of particular subfamilies of AluS and AluY source genes. These results demonstrate that at least two subfamilies are likely to have emerged from ectopic Alu-Alu recombination, which stimulates further research regarding the potential of chimeric active Alus to punctuate the genome.
Gaspar, Paulo; Seixas, Susana; Rocha, Jorge
2004-04-01
The genetic variation at a compound nonrecombining haplotype system, consisting of the previously reported SB19.3 Alu insertion polymorphism and a newly identified adjacent short tandem repeat (STR), was studied in population samples from Portugal and São Tomé (Gulf of Guinea, West Africa). Age estimates based on the linked microsatellite variation suggest that the Alu insertion occurred about 190,000 years ago. In accordance with the global patterns of distribution of human genetic variation, the highest haplotype diversity was found in the African sample. This excess in African diversity was due to both a substantial reduction in heterozygosity at the Alu polymorphism and a lower STR variability associated with the predominant Alu insertion allele in the Portuguese sample. The high level of interpopulation differentiation observed at the Alu locus (F(ST) = 0.43) was interpreted under alternative selective and demographic scenarios. The need for compatibility between patterns of variation at the STR and Alu loci could be used to restrict the range of selection coefficients in selection-driven genetic hitchhiking frameworks and to favor demographic scenarios dominated by larger pre-expansion African population sizes. Taken together, the data show that the SB19.3 Alu-STR system is an informative marker that can be included in more extended batteries of compound haplotypes used in human evolutionary studies.
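The abstract above reports interpopulation differentiation as F(ST). As a reminder of what that statistic measures, here is a minimal sketch of Wright's F(ST) for one biallelic locus, such as the presence or absence of an Alu insertion, in two equally weighted populations. The allele frequencies used with it are hypothetical, and the estimator the authors actually applied is not stated in the abstract.

```python
def expected_heterozygosity(p):
    """Expected heterozygosity 2p(1 - p) at a biallelic locus."""
    return 2.0 * p * (1.0 - p)


def fst_two_populations(p1, p2):
    """Wright's F_ST = (H_T - H_S) / H_T for two equally weighted populations.

    p1 and p2 are the frequencies of the same allele (for example, the Alu
    insertion allele) in each population.
    """
    h_s = (expected_heterozygosity(p1) + expected_heterozygosity(p2)) / 2.0
    p_bar = (p1 + p2) / 2.0
    h_t = expected_heterozygosity(p_bar)
    if h_t == 0.0:
        raise ValueError("locus is monomorphic in the pooled sample")
    return (h_t - h_s) / h_t
```

With hypothetical frequencies of 0.1 and 0.9, `fst_two_populations(0.1, 0.9)` gives 0.64. Identical frequencies in both populations give 0.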
NASA Astrophysics Data System (ADS)
Nallaseth, Ferez Soli
The Y-chromosome presents a unique cytogenetic framework for the evolution of nucleotide sequences. Alignment of nine Y-chromosomal fragments in their increasing Y-specific/non Y-specific (male/female) sequence divergence ratios was directly and inversely related to their interspersion on these two respective genomic fractions. Sequence analysis confirmed a direct relationship between divergence ratios and the Alu, LINE-1, Satellite and their derivative oligonucleotide contents. Thus their relocation on the Y-chromosome is followed by sequence divergence rather than the well documented concerted evolution of these non-coding progenitor repeated sequences. Five of the nine Y-chromosomal fragments are non-pseudoautosomal and transcribed into heterogeneous PolyA+ RNA and thus can be retrotransposed. Evolutionary and computer analysis identified homologous oligonucleotide tracts in several human loci suggesting common and random mechanistic origins. Dysgenic genomes represent the accelerated evolution driving sequence divergence (McClintock, 1984). Sex reversal and sterility characterizing dysgenesis occurs in C57BL/6JY^Pos but not in 129/SvY^Pos derivative strains. High frequency, random, multi-locus deletion products of the feral Y^Pos-chromosome are generated in the germlines of F1(C57BL/6J X 129/SvY^Pos)(male) and C57BL/6JY^Pos(male) but not in 129/SvY^Pos(male). Equal, 10^-1, 10^-2, and 0 copies (relative to males) of Y^Pos-specific deletion products respectively characterize C57BL/6JY^Pos (HC), (LC), (T) and (F) females. The testes determining loci of inactive Y^Pos-chromosomes in C57BL/6JY^Pos HC females are the preferentially deleted/rearranged Y^Pos-sequences. Disruption of regulation of plasma testosterone and hepatic MUP-A mRNA levels, TRD of a 4.7 Kbp EcoR1 fragment suggest disruption of autosomal/X-chromosomal sequences.
These data and the highly repeated progenitor (Alu, GATA, LINE-1) sequence content of deletion products confirmed the previously unidentified loss of genetic control of mammalian chromosome biology and hybrid dysgenesis.
Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences
Sheinman, Michael; Ramisch, Anna; Massip, Florian; Arndt, Peter F.
2016-01-01
Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf’s law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes. PMID:27488939
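The empirical observation in the abstract above concerns how many distinct subsequences of a given length occur with each abundance. A minimal sketch of how that distribution could be tabulated, and its tail slope crudely estimated on log-log axes, follows. The function names are hypothetical, and an unweighted least-squares fit is an assumption made for brevity. It is not the authors' procedure and is known to be a rough estimator for power laws.

```python
import math
from collections import Counter


def kmer_abundances(seq, k):
    """Map each length-k subsequence of seq to the number of times it occurs."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def abundance_histogram(counts):
    """Map abundance n to the number of distinct k-mers occurring n times."""
    return Counter(counts.values())


def loglog_slope(hist):
    """Least-squares slope of log(number of k-mers) against log(abundance).

    For a power-law tail N(n) ~ n^(-alpha) the slope approximates -alpha.
    """
    pts = [(math.log(n), math.log(m)) for n, m in hist.items() if n > 0 and m > 0]
    if len(pts) < 2:
        raise ValueError("need at least two distinct abundances")
    mean_x = sum(x for x, _ in pts) / len(pts)
    mean_y = sum(y for _, y in pts) / len(pts)
    sxx = sum((x - mean_x) ** 2 for x, _ in pts)
    if sxx == 0.0:
        raise ValueError("all abundances are identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pts)
    return sxy / sxx
```

Repeating the fit for several values of `k` would show whether the exponent changes with subsequence length, which is the trend the abstract reports for Alu elements.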
DOE Office of Scientific and Technical Information (OSTI.GOV)
Boccaccio, C.; Deshatrette, J.; Meunier-Rotival, M.
1994-05-01
The genomic fragment carrying the human activator of liver function, previously described as an episome capable of inducing differentiation upon transfection into a dedifferentiated rat hepatoma cell line, was mapped on human chromosome 12q24.2-12q24.3. This chromosomal location was indistinguishable by in situ hybridization from that of the gene coding for the hepatic transcription factor HNF1. The sequence of the integrated form of the episome as well as its flanking sequences show that it is rich in retroposons. It contains a human ribosomal protein L21 processed pseudogene, one truncated L1Hs sequence, and 10 Alu repeats, which belong to different subfamilies.
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
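The abstract above states that the flanking intronic repeats must base-pair with one another, which at the sequence level means one intron contains a stretch whose reverse complement appears in the other. A minimal sketch of that check on DNA strings follows. The function names and the default 30-nucleotide window (taken from the lower end of the range quoted in the abstract) are illustrative assumptions. Perfect Watson-Crick matching is a simplification that ignores mismatches, G-U pairs, and hairpin stability, which the abstract notes also matter.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(_COMPLEMENT)[::-1]


def has_inverted_repeat(upstream_intron, downstream_intron, min_len=30):
    """Report whether the two introns share a perfect inverted repeat.

    True when some window of min_len nucleotides in the upstream intron has
    its reverse complement somewhere in the downstream intron.
    """
    up = upstream_intron.upper()
    down = downstream_intron.upper()
    for i in range(len(up) - min_len + 1):
        if reverse_complement(up[i:i + min_len]) in down:
            return True
    return False
```

A positive result here only says that pairing is possible in principle. The abstract is explicit that not all repeats support circularization.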
Sheikh, Faruk G; Mukhopadhyay, Sudit S; Gupta, Prabhakar
2002-02-01
The PstI family of elements are short, highly repetitive DNA sequences interspersed throughout the genome of the Bovidae. We have cloned and sequenced some members of the PstI family from cattle, goat, and buffalo. These elements are approximately 500 bp, have a copy number of 2 x 10(5) - 4 x 10(5), and comprise about 4% of the haploid genome. Studies of nucleotide sequence homology indicate that the buffalo and goat PstI repeats (type II) are similar types of short interspersed nucleotide element (SINE) sequences, but the cattle PstI repeat (type I) is considerably more divergent. Additionally, the goat PstI sequence showed significant sequence homology with bovine serine tRNA, and is therefore likely derived from serine tRNA. Interestingly, Southern hybridization suggests that both types of SINEs (I and II) are present in all the species of Bovidae. Dendrogram analysis indicates that cattle PstI SINE is similar to bovine Alu-like SINEs. Goat and buffalo SINEs formed a separate cluster, suggesting that these two types of SINEs evolved separately in the genome of the Bovidae.
Orangutan Alu quiescence reveals possible source element: support for ancient backseat drivers
2012-01-01
Background: Sequence analysis of the orangutan genome revealed that recent proliferative activity of Alu elements has been uncharacteristically quiescent in the Pongo (orangutan) lineage, compared with all previously studied primate genomes. With relatively few young polymorphic insertions, the genomic landscape of the orangutan seemed like the ideal place to search for a driver, or source element, of Alu retrotransposition. Results: Here we report the identification of a nearly pristine insertion possessing all the known putative hallmarks of a retrotranspositionally competent Alu element. It is located in an intronic sequence of the DGKB gene on chromosome 7 and is highly conserved in Hominidae (the great apes), but absent from Hylobatidae (gibbon and siamang). We provide evidence for the evolution of a lineage-specific subfamily of this shared Alu insertion in orangutans and possibly the lineage leading to humans. In the orangutan genome, this insertion contains three orangutan-specific diagnostic mutations which are characteristic of the youngest polymorphic Alu subfamily, AluYe5b5_Pongo. In the Homininae lineage (human, chimpanzee and gorilla), this insertion has acquired three different mutations which are also found in a single human-specific Alu insertion. Conclusions: This seemingly stealth-like amplification, ongoing at a very low rate over millions of years of evolution, suggests that this shared insertion may represent an ancient backseat driver of Alu element expansion. PMID:22541534
Orangutan Alu quiescence reveals possible source element: support for ancient backseat drivers.
Walker, Jerilyn A; Konkel, Miriam K; Ullmer, Brygg; Monceaux, Christopher P; Ryder, Oliver A; Hubley, Robert; Smit, Arian Fa; Batzer, Mark A
2012-04-30
Sequence analysis of the orangutan genome revealed that recent proliferative activity of Alu elements has been uncharacteristically quiescent in the Pongo (orangutan) lineage, compared with all previously studied primate genomes. With relatively few young polymorphic insertions, the genomic landscape of the orangutan seemed like the ideal place to search for a driver, or source element, of Alu retrotransposition. Here we report the identification of a nearly pristine insertion possessing all the known putative hallmarks of a retrotranspositionally competent Alu element. It is located in an intronic sequence of the DGKB gene on chromosome 7 and is highly conserved in Hominidae (the great apes), but absent from Hylobatidae (gibbon and siamang). We provide evidence for the evolution of a lineage-specific subfamily of this shared Alu insertion in orangutans and possibly the lineage leading to humans. In the orangutan genome, this insertion contains three orangutan-specific diagnostic mutations which are characteristic of the youngest polymorphic Alu subfamily, AluYe5b5_Pongo. In the Homininae lineage (human, chimpanzee and gorilla), this insertion has acquired three different mutations which are also found in a single human-specific Alu insertion. This seemingly stealth-like amplification, ongoing at a very low rate over millions of years of evolution, suggests that this shared insertion may represent an ancient backseat driver of Alu element expansion.
von Both, Ulrich; Berk, Maurice; Agapow, Paul-Michael; Wright, Joseph D; Git, Anna; Hamilton, Melissa Shea; Goldgof, Greg; Siddiqui, Nazneen; Bellos, Evangelos; Wright, Victoria J; Coin, Lachlan J; Newton, Sandra M; Levin, Michael
2018-01-12
Mycobacterium tuberculosis (M. tuberculosis) survives and multiplies inside human macrophages by subversion of immune mechanisms. Although these immune evasion strategies are well characterised functionally, the underlying molecular mechanisms are poorly understood. Here we show that during infection of human whole blood with M. tuberculosis, host gene transcriptional suppression, rather than activation, is the predominant response. Spatial, temporal and functional characterisation of repressed genes revealed their involvement in pathogen sensing and phagocytosis, degradation within the phagolysosome and antigen processing and presentation. To identify mechanisms underlying suppression of multiple immune genes we undertook epigenetic analyses. We identified significantly differentially expressed microRNAs with known targets in suppressed genes. In addition, after searching regions upstream of the start of transcription of suppressed genes for common sequence motifs, we discovered novel enriched composite sequence patterns, which corresponded to Alu repeat elements, transposable elements known to have wide ranging influences on gene expression. Our findings suggest that to survive within infected cells, mycobacteria exploit a complex immune "molecular off switch" controlled by both microRNAs and Alu regulatory elements.
Gómez-Pérez, Luis; Alfonso-Sánchez, Miguel A; Dipierri, José E; Sánchez, Dora; Espinosa, Ibone; De Pancorbo, Marian M; Peña, José A
2013-01-01
Genetic heterogeneity of two Amerindian populations (Jujuy province, Argentina, and Waorani tribe, Ecuador) was characterized by analyzing data on polymorphic Alu insertions within the human major histocompatibility complex (MHC) class I region (6p21.31), which are completely nonexistent in Native Americans. We further evaluated the haplotype distribution and genetic diversity among continental ancestry groups and their potential implications for the dating of the origin of MHC-Alus. Five MHC-Alu elements (AluMicB, AluTF, AluHJ, AluHG, and AluHF) were typed in samples from Jujuy (N = 108) and Waorani (N = 36). Allele and haplotype frequency data on worldwide populations were compiled to explore spatial structuring of the MHC-Alu diversity through AMOVA tests. We utilized the median-joining network approach to illustrate the continental distribution of the MHC-Alu haplotypes and their phylogenetic relationships. Allele and haplotype distributions differed significantly between Jujuy and Waorani. The Waorani featured a low average heterozygosity attributable to strong population isolation. Overall, Alu markers showed great genetic heterogeneity both within and among populations. The haplotype distribution was distinctive of each continental ancestry group. Contrary to expectations, Africans showed the lowest MHC-Alu diversity. Genetic drift, mainly associated with population bottlenecks, seems to be reflected in the low MHC-Alu diversity of the Amerindians, mainly in Waorani. Geographical structuring of the haplotype distribution supports the efficiency of the MHC-Alu loci as lineage (ancestry) markers. The markedly low Alu diversity of African populations relative to other continental clusters suggests that these MHC-Alus might have arisen after the anatomically modern humans expanded out of Africa. Copyright © 2013 Wiley Periodicals, Inc.
A SINE in the genome of the cephalochordate amphioxus is an Alu element
Holland, Linda Z.
2006-01-01
Transposable elements of about 300 bp, termed “short interspersed nucleotide elements” or SINEs, are common in eukaryotes. However, Alu elements, SINEs containing restriction sites for the AluI enzyme, have been known only from primates. Here I report the first SINE found in the genome of the cephalochordate, amphioxus. It is an Alu element of 375 bp that does not share substantial identity with any genomic sequences in vertebrates. It was identified because it was located in the FoxD regulatory region in a cosmid derived from one individual, but absent from the two FoxD alleles of BACs from a second individual. However, searches of sequences of BACs and genomic traces from this second individual gave an estimate of 50-100 copies in the amphioxus genome. The finding of an Alu element in amphioxus raises the question of whether Alu elements in amphioxus and primates arose by convergent evolution or by inheritance from a common ancestor. Genome-wide analyses of transposable elements in amphioxus and other chordates such as tunicates, agnathans and cartilaginous fishes could well provide the answer. PMID:16733535
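The defining feature named in this abstract, a restriction site for the AluI enzyme, can be illustrated with a minimal sketch. It assumes the standard AluI recognition sequence AGCT with a blunt cut between G and C; the function names and the example sequence are hypothetical and are not taken from the amphioxus element.

```python
from typing import List


def find_alui_sites(seq: str, site: str = "AGCT") -> List[int]:
    """Return the 0-based start position of every occurrence of the site."""
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        # Resume one base later so that no occurrence is skipped.
        start = seq.find(site, start + 1)
    return positions


def alui_fragments(seq: str, site: str = "AGCT") -> List[str]:
    """Split the sequence at each blunt cut (assumed to be AG^CT)."""
    seq = seq.upper()
    cuts = [p + 2 for p in find_alui_sites(seq, site)]
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    example = "GGAGCTTTAGCTAA"  # made-up sequence with two AGCT sites
    print(find_alui_sites(example))
    print(alui_fragments(example))
```

A site count of this kind only shows that a sequence can be cut by AluI; it says nothing about whether the sequence is homologous to primate Alu elements.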
Short intronic repeat sequences facilitate circular RNA production.
Liang, Dongming; Wilusz, Jeremy E
2014-10-15
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
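The requirement that intronic repeats base-pair with one another can be pictured with a small sketch that looks for a short sequence upstream and its exact reverse complement downstream. This is only an illustration under simplifying assumptions (exact matches, DNA alphabet, no mismatches or G-U wobble pairs); it is not the mutagenesis approach used in the study, and the function names and example sequence are hypothetical.

```python
from typing import List, Tuple

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_inverted_repeats(seq: str, k: int = 6) -> List[Tuple[int, int]]:
    """Return (i, j) pairs where the k-mer at j is the reverse complement
    of the k-mer at i and lies entirely downstream of it (j >= i + k)."""
    seq = seq.upper()
    # Index every k-mer by its start positions.
    index = {}
    for j in range(len(seq) - k + 1):
        index.setdefault(seq[j:j + k], []).append(j)
    hits = []
    for i in range(len(seq) - k + 1):
        partner = reverse_complement(seq[i:i + k])
        for j in index.get(partner, []):
            if j >= i + k:
                hits.append((i, j))
    return hits


if __name__ == "__main__":
    # Made-up example: ACGTTG, a spacer, then its reverse complement CAACGT.
    example = "ACGTTG" + "AAAAA" + "CAACGT"
    print(find_inverted_repeats(example, k=6))
```

As the abstract stresses, pairing potential alone does not predict circularization, so hits from a search like this would be candidates at most.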
Lesmana, Harry; Dyer, Lisa; Li, Xia; Denton, James; Griffiths, Jenna; Chonat, Satheesh; Seu, Katie G; Heeney, Matthew M; Zhang, Kejian; Hopkin, Robert J; Kalfa, Theodosia A
2018-03-01
Pyruvate kinase deficiency (PKD) is the most frequent red blood cell enzyme abnormality of the glycolytic pathway and the most common cause of hereditary nonspherocytic hemolytic anemia. Over 250 PKLR-gene mutations have been described, including missense/nonsense, splicing and regulatory mutations, small insertions, small and gross deletions, causing PKD and hemolytic anemia of variable severity. Alu retrotransposons are the most abundant mobile DNA sequences in the human genome, contributing to almost 11% of its mass. Alu insertions have been associated with a number of human diseases by disrupting either a coding region or a splice signal. Here, we report on two unrelated Middle Eastern patients, both born from consanguineous parents, with transfusion-dependent hemolytic anemia, where sequence analysis revealed a homozygous insertion of AluYb9 within exon 6 of the PKLR gene, causing precipitous decrease of PKLR RNA levels. This Alu element insertion constitutes a previously unrecognized mechanism underlying pathogenesis of PKD. © 2017 Wiley Periodicals, Inc.
The domain structure and distribution of Alu elements in long noncoding RNAs and mRNAs
Kim, Eugene Z.; Wespiser, Adam R.; Caffrey, Daniel R.
2016-01-01
Approximately 75% of the human genome is transcribed and many of these spliced transcripts contain primate-specific Alu elements, the most abundant mobile element in the human genome. The majority of exonized Alu elements are located in long noncoding RNAs (lncRNAs) and the untranslated regions of mRNA, with some performing molecular functions. To further assess the potential for Alu elements to be repurposed as functional RNA domains, we investigated the distribution and evolution of Alu elements in spliced transcripts. Our analysis revealed that Alu elements are underrepresented in mRNAs and lncRNAs, suggesting that most exonized Alu elements arising in the population are rare or deleterious to RNA function. When mRNAs and lncRNAs retain exonized Alu elements, they have a clear preference for Alu dimers, left monomers, and right monomers. mRNAs often acquire Alu elements when their genes are duplicated within Alu-rich regions. In lncRNAs, reverse-oriented Alu elements are significantly enriched and are not restricted to the 3′ and 5′ ends. Both lncRNAs and mRNAs primarily contain the Alu J and S subfamilies that were amplified relatively early in primate evolution. Alu J subfamilies are typically overrepresented in lncRNAs, whereas the Alu S dimer is overrepresented in mRNAs. The sequences of Alu dimers tend to be constrained in both lncRNAs and mRNAs, whereas the left and right monomers are constrained within particular Alu subfamilies and classes of RNA. Collectively, these findings suggest that Alu-containing RNAs are capable of forming stable structures and that some of these Alu domains might have novel biological functions. PMID:26654912
Alu distribution and mutation types of cancer genes
2011-01-01
Background: Alu elements are the most abundant retrotransposable elements comprising ~11% of the human genome. Many studies have highlighted the role that Alu elements have in genetic instability and how their contribution to the assortment of mutagenic events can lead to cancer. As of yet, little has been done to quantitatively assess the association between Alu distribution and genes that are causally implicated in oncogenesis. Results: We have investigated the effect of various Alu densities on the mutation type based classifications of cancer genes. In order to establish the direct relationship between Alus and the cancer genes of interest, genome wide Alu-related densities were measured using genes rather than the sliding windows of fixed length as the units. Several novel genomic features, such as the density of the adjacent Alu pairs and the number of Alu-Exon-Alu triplets, were developed in order to extend the investigation via the multivariate statistical analysis toward more advanced biological insight. In addition, we characterized the genome-wide intron Alu distribution with a mixture model that distinguished genes containing Alu elements from those with no Alus, and evaluated the gene-level effect of the 5'-TTAAAA motif associated with Alu insertion sites using a two-step regression analysis method. Conclusions: The study resulted in several novel findings worthy of further investigation. They include: (1) Recessive cancer genes (tumor suppressor genes) are enriched with Alu elements (p < 0.01) compared to dominant cancer genes (oncogenes) and the entire set of genes in the human genome; (2) Alu-related genomic features can be used to cluster cancer genes into biological meaningful groups; (3) The retention of exon Alus has been restricted in the human genome development, and an upper limit to the chromosome-level exon Alu densities is suggested by the distribution profile; (4) For the genes with at least one intron Alu repeat in individual chromosomes, the intron Alu densities can be well fitted by a Gamma distribution; (5) The effect of the 5'-TTAAAA motif on Alu densities varies across different chromosomes. PMID:21429208
Genomic characterization of two large Alu-mediated rearrangements of the BRCA1 gene.
Peixoto, Ana; Pinheiro, Manuela; Massena, Lígia; Santos, Catarina; Pinto, Pedro; Rocha, Patrícia; Pinto, Carla; Teixeira, Manuel R
2013-02-01
To determine whether a large genomic rearrangement is actually novel and to gain insight into the mutational mechanism responsible for its occurrence, molecular characterization with breakpoint identification is mandatory. We here report the characterization of two large deletions involving the BRCA1 gene. The first rearrangement harbored an 89,664-bp deletion comprising exon 7 of the BRCA1 gene to exon 11 of the NBR1 gene (c.441+1724_oNBR1:c.1073+480del). Two highly homologous Alu elements were found in the genomic sequences flanking the deletion breakpoints. Furthermore, a 20-bp overlapping sequence at the breakpoint junction was observed, suggesting that the most likely mechanism for the occurrence of this rearrangement was nonallelic homologous recombination. The second rearrangement fully characterized at the nucleotide level was a BRCA1 exons 11-15 deletion (c.671-319_4677-578delinsAlu). The case harbored a 23,363-bp deletion with an Alu element inserted at the breakpoints of the deleted region. As the Alu element inserted belongs to a still active AluY family, the observed rearrangement could be due to an insertion-mediated deletion mechanism caused by Alu retrotransposition. To conclude, we describe the breakpoints of two novel large deletions involving the BRCA1 gene, and analysis of their genomic context allowed us to gain insight into the respective mutational mechanisms.
Liu, W M; Chu, W M; Choudary, P V; Schmid, C W
1995-01-01
The abundance of Alu RNA is transiently increased by heat shock in human cell lines. This effect is specific to Alu repeats among Pol III transcribed genes, since the abundance of 7SL, 7SK, 5S and U6 RNAs is essentially unaffected by heat shock. The rapid induction of Alu expression precedes the heat shock induction of mRNAs for the ubiquitin and HSP 70 heat shock genes. Heat shock mimetics also transiently induce Alu expression indicating that increased Alu expression is a general cell-stress response. Cycloheximide treatment rapidly and transiently increases the abundance of Alu RNA. Again, compared with other genes transcribed by Pol III, this increase is specific to Alu. However, as distinguished from the cell stress response, cycloheximide does not induce expression of HSP 70 and ubiquitin mRNAs. Puromycin also increases Alu expression, suggesting that this response is generally caused by translational inhibition. The response of mammalian SINEs to cell stress and translational inhibition is not limited to SINEs which are Alu homologues. Heat shock and cycloheximide each transiently induce Pol III directed expression of B1 and B2 RNAs in mouse cells and C-element RNA in rabbit cells. Together, these three species exemplify the known SINE composition of placental mammals, suggesting that mammalian SINEs are similarly regulated and may serve a common function. PMID:7784180
An Alu-based, MGB Eclipse real-time PCR method for quantitation of human DNA in forensic samples.
Nicklas, Janice A; Buel, Eric
2005-09-01
The forensic community needs quick, reliable methods to quantitate human DNA in crime scene samples to replace the laborious and imprecise slot blot method. A real-time PCR based method has the possibility of allowing development of a faster and more quantitative assay. Alu sequences are primate-specific and are found in many copies in the human genome, making these sequences an excellent target or marker for human DNA. This paper describes the development of a real-time Alu sequence-based assay using MGB Eclipse primers and probes. The advantages of this assay are simplicity, speed, less hands-on-time and automated quantitation, as well as a large dynamic range (128 ng/µL to 0.5 pg/µL).
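To put the reported dynamic range in perspective, the arithmetic can be sketched as follows. Only the two concentration limits come from the abstract; the function name is hypothetical and the calculation is a plain unit conversion, not part of the published assay.

```python
import math
from typing import Tuple


def dynamic_range(upper_ng_per_ul: float, lower_pg_per_ul: float) -> Tuple[float, float]:
    """Return (fold range, orders of magnitude) between two concentrations.

    The upper limit is given in ng/µL and the lower limit in pg/µL,
    matching the units quoted in the abstract.
    """
    upper_pg_per_ul = upper_ng_per_ul * 1000.0  # 1 ng = 1000 pg
    fold = upper_pg_per_ul / lower_pg_per_ul
    return fold, math.log10(fold)


if __name__ == "__main__":
    fold, orders = dynamic_range(128.0, 0.5)
    print(f"{fold:,.0f}-fold, about {orders:.1f} orders of magnitude")
```

With the quoted limits this gives a 256,000-fold span, a little over five orders of magnitude.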
Association between growth hormone receptor AluI polymorphism and fertility of Holstein cows.
Schneider, A; Corrêa, M N; Butler, W R
2013-12-01
The aim of this work was to determine the effects of a growth hormone receptor (GHR) AluI polymorphism on the reproductive performance of Holstein cows. The cows (n = 94) were on the study from 3 weeks prepartum until 210 days in milk (DIM). Blood samples were collected at -21, 0, 7, 21, and 60 DIM. For GHR genotyping, DNA was extracted from blood and the presence of the alleles determined after polymerase chain reaction and digestion with the restriction enzyme AluI. Milk samples were collected for progesterone analysis and detection of ovulation until first breeding. Cows were submitted to an OvSynch-TAI protocol at 55 DIM that was repeated for cows diagnosed as not pregnant. Data were analyzed with SAS for polynomial effects of the presence of 0, 1, or 2 GHR AluI (-) alleles. Among the cows, 37% had the AluI(+/+) genotype, 51% had AluI(-/+), and 12% were AluI(-/-). Interval from calving to first ovulation was not different among genotypes (P > 0.05). Cows carrying at least one GHR AluI(-) allele required fewer services per conception (P = 0.02). In addition, there was a linear reduction (P = 0.02) in the calving to conception interval among genotypes with fewest days for GHR AluI(-/-) cows. GHR AluI(-/-) cows also had the highest serum IGF-I concentrations (P = 0.03). Milk production and composition were not different among genotypes (P > 0.05). The presence of one or two GHR AluI(-) alleles in Holstein cows was associated with a linear reduction in the calving to conception interval and a reduction in the number of AI/conception. Copyright © 2013 Elsevier Inc. All rights reserved.
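The genotype proportions reported above (37%, 51% and 12%) imply allele frequencies that can be worked out directly. The sketch below shows that calculation and, as an added illustration that the abstract does not report, the genotype proportions expected under Hardy-Weinberg equilibrium; the function names are hypothetical.

```python
from typing import Tuple


def allele_frequencies(f_plus_plus: float, f_minus_plus: float,
                       f_minus_minus: float) -> Tuple[float, float]:
    """Return (frequency of +, frequency of -) from genotype proportions."""
    total = f_plus_plus + f_minus_plus + f_minus_minus
    # Each heterozygote contributes half of its alleles to each class.
    p_plus = (f_plus_plus + 0.5 * f_minus_plus) / total
    return p_plus, 1.0 - p_plus


def hardy_weinberg_expected(p_plus: float) -> Tuple[float, float, float]:
    """Return expected (+/+, -/+, -/-) proportions under Hardy-Weinberg."""
    q_minus = 1.0 - p_plus
    return p_plus ** 2, 2.0 * p_plus * q_minus, q_minus ** 2


if __name__ == "__main__":
    p, q = allele_frequencies(0.37, 0.51, 0.12)
    print(f"AluI(+) = {p:.3f}, AluI(-) = {q:.3f}")
    print([round(x, 3) for x in hardy_weinberg_expected(p)])
```

For the reported proportions this gives an AluI(+) frequency of 0.625 and an AluI(-) frequency of 0.375. Comparing observed and expected genotype proportions formally would need the underlying counts, which the abstract gives only as rounded percentages.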
Alu elements mediate large SPG11 gene rearrangements: further spatacsin mutations.
Conceição Pereira, Maria; Loureiro, José Leal; Pinto-Basto, Jorge; Brandão, Eva; Margarida Lopes, Ana; Neves, Georgina; Dias, Pureza; Geraldes, Ruth; Martins, Isabel Pavão; Cruz, Vitor Tedim; Kamsteeg, Erik-Jan; Brunner, Han G; Coutinho, Paula; Sequeiros, Jorge; Alonso, Isabel
2012-01-01
Hereditary spastic paraplegias compose a group of neurodegenerative disorders with a large clinical and genetic heterogeneity. Among the autosomal recessive forms, spastic paraplegia type 11 is the most common. To better understand the spastic paraplegia type 11 mutation spectrum, we studied a group of 54 patients with hereditary spastic paraplegia. Mutation screening was performed by PCR amplification of SPG11 coding regions and intron boundaries, followed by sequencing. For the detection of large gene rearrangements, we performed multiplex ligation-dependent probe amplification. We report 13 families with spastic paraplegia type 11 carrying either novel or previously identified mutations. We describe a complex entire SPG11 rearrangement and show that large gene rearrangements are frequent among patients with spastic paraplegia type 11. Moreover, we mapped the deletion breakpoints of three different large SPG11 deletions and provide evidence for Alu microhomology-mediated exon deletion. Our analysis shows that the high number of repeated elements in SPG11 together with the presence of recombination hotspots and the high intrinsic instability of the 15q locus all contribute toward making this genomic region more prone to large gene rearrangements. These findings enlarge the amount of data relating repeated elements with neurodegenerative disorders and highlight their importance in human disease and genome evolution.
Adeno-associated virus inverted terminal repeats stimulate gene editing.
Hirsch, M L
2015-02-01
Advancements in genome editing have relied on technologies to specifically damage DNA which, in turn, stimulates DNA repair including homologous recombination (HR). As off-target concerns complicate the therapeutic translation of site-specific DNA endonucleases, an alternative strategy to stimulate gene editing based on fragile DNA was investigated. To do this, an episomal gene-editing reporter was generated by a disruptive insertion of the adeno-associated virus (AAV) inverted terminal repeat (ITR) into the egfp gene. Compared with a non-structured DNA control sequence, the ITR induced DNA damage as evidenced by increased gamma-H2AX and Mre11 foci formation. As local DNA damage stimulates HR, ITR-mediated gene editing was investigated using DNA oligonucleotides as repair substrates. The AAV ITR stimulated gene editing >1000-fold in a replication-independent manner and was not biased by the polarity of the repair oligonucleotide. Analysis of additional human DNA sequences demonstrated stimulation of gene editing to varying degrees. In particular, inverted yet not direct, Alu repeats induced gene editing, suggesting a role for DNA structure in the repair event. Collectively, the results demonstrate that inverted DNA repeats stimulate gene editing via double-strand break repair in an episomal context and allude to efficient gene editing of the human chromosome using fragile DNA sequences.
Circular RNAs are abundant, conserved, and associated with ALU repeats
Jeck, William R.; Sorrentino, Jessica A.; Wang, Kai; Slevin, Michael K.; Burd, Christin E.; Liu, Jinze; Marzluff, William F.; Sharpless, Norman E.
2013-01-01
Circular RNAs composed of exonic sequence have been described in a small number of genes. Thought to result from splicing errors, circular RNA species possess no known function. To delineate the universe of endogenous circular RNAs, we performed high-throughput sequencing (RNA-seq) of libraries prepared from ribosome-depleted RNA with or without digestion with the RNA exonuclease, RNase R. We identified >25,000 distinct RNA species in human fibroblasts that contained non-colinear exons (a “backsplice”) and were reproducibly enriched by exonuclease degradation of linear RNA. These RNAs were validated as circular RNA (ecircRNA), rather than linear RNA, and were more stable than associated linear mRNAs in vivo. In some cases, the abundance of circular molecules exceeded that of associated linear mRNA by >10-fold. By conservative estimate, we identified ecircRNAs from 14.4% of actively transcribed genes in human fibroblasts. Application of this method to murine testis RNA identified 69 ecircRNAs in precisely orthologous locations to human circular RNAs. Of note, paralogous kinases HIPK2 and HIPK3 produce abundant ecircRNA from their second exon in both humans and mice. Though HIPK3 circular RNAs contain an AUG translation start, it and other ecircRNAs were not bound to ribosomes. Circular RNAs could be degraded by siRNAs and, therefore, may act as competing endogenous RNAs. Bioinformatic analysis revealed shared features of circularized exons, including long bordering introns that contained complementary ALU repeats. These data show that ecircRNAs are abundant, stable, conserved and nonrandom products of RNA splicing that could be involved in control of gene expression. PMID:23249747
Pal, Arnab; Srivastava, Tapasya; Sharma, Manish K; Mehndiratta, Mohit; Das, Prerna; Sinha, Subrata; Chattopadhyay, Parthaprasad
2010-11-01
Hypoxia is an integral part of tumorigenesis and contributes extensively to the neoplastic phenotype including drug resistance and genomic instability. It has also been reported that hypoxia results in global demethylation. Because a majority of the cytosine-phosphate-guanine (CpG) islands are found within the repeat elements of DNA, and are usually methylated under normoxic conditions, we suggested that retrotransposable Alu or short interspersed nuclear elements (SINEs), which show altered methylation and associated changes of gene expression during hypoxia, could be associated with genomic instability. U87MG glioblastoma cells were cultured in 0.1% O₂ for 6 weeks and compared with cells cultured in 21% O₂ for the same duration. Real-time PCR analysis showed a significant increase in SINE and reverse transcriptase coding long interspersed nuclear element (LINE) transcripts during hypoxia. Sequencing of bisulphite treated DNA as well as the Combined Bisulfite Restriction Analysis (COBRA) assay showed that the SINE loci studied underwent significant hypomethylation though there was patchy hypermethylation at a few sites. The inter-Alu PCR profiles of DNA from cells cultured under hypoxia for 6 weeks, cells returned to normoxia for 4 weeks, and cells kept in normoxia for 6 weeks showed several changes in the band pattern, indicating increased Alu-mediated genomic alteration. Our results show that aberrant methylation leading to increased transcription of SINE and reverse transcriptase associated LINE elements could lead to increased genomic instability in hypoxia. This might be a cause of genetic heterogeneity in tumours, especially in a variegated hypoxic environment, and lead to the development of foci of more aggressive tumour cells. © 2009 The Authors Journal compilation © 2010 Foundation for Cellular and Molecular Medicine/Blackwell Publishing Ltd.
Hueso, Miguel; Cruzado, Josep M; Torras, Joan; Navarro, Estanislao
2018-06-12
Atherosclerosis (ATH) and coronary artery disease (CAD) are chronic inflammatory diseases with an important genetic background; they derive from the cumulative effect of multiple common risk alleles, most of which are located in genomic noncoding regions. These complex diseases behave as nonlinear dynamical systems that show a high dependence on their initial conditions; thus, long-term predictions of disease progression are unreliable. One likely possibility is that the nonlinear nature of ATH could be dependent on nonlinear correlations in the structure of the human genome. In this review, we show how chaos theory analysis has highlighted genomic regions that have shared specific structural constraints, which could have a role in ATH progression. These regions were shown to be enriched with repetitive sequences of the Alu family, genomic parasites that have colonized the human genome, which show a particular secondary structure and are involved in the regulation of gene expression. Here, we show the impact of Alu elements on the mechanisms that regulate gene expression, especially highlighting the molecular mechanisms via which the Alu elements alter the inflammatory response. We devote special attention to their relationship with the long noncoding RNA (lncRNA) antisense noncoding RNA in the INK4 locus (ANRIL), a risk factor for ATH; their role as microRNA (miRNA) sponges; and their ability to interfere with the regulatory circuitry of the nuclear factor kappa B (NF-κB) response. We aim to characterize ATH as a nonlinear dynamic system, in which small initial alterations in the expression of a number of repetitive elements are somehow amplified to reach phenotypic significance.
Stacey, Simon N.; Kehr, Birte; Gudmundsson, Julius; Zink, Florian; Jonasdottir, Aslaug; Gudjonsson, Sigurjon A.; Sigurdsson, Asgeir; Halldorsson, Bjarni V.; Agnarsson, Bjarni A.; Benediktsdottir, Kristrun R.; Aben, Katja K.H.; Vermeulen, Sita H.; Cremers, Ruben G.; Panadero, Angeles; Helfand, Brian T.; Cooper, Phillip R.; Donovan, Jenny L.; Hamdy, Freddie C.; Jinga, Viorel; Okamoto, Ichiro; Jonasson, Jon G.; Tryggvadottir, Laufey; Johannsdottir, Hrefna; Kristinsdottir, Anna M.; Masson, Gisli; Magnusson, Olafur T.; Iordache, Paul D.; Helgason, Agnar; Helgason, Hannes; Sulem, Patrick; Gudbjartsson, Daniel F.; Kong, Augustine; Jonsson, Eirikur; Barkardottir, Rosa B.; Einarsson, Gudmundur V.; Rafnar, Thorunn; Thorsteinsdottir, Unnur; Mates, Ioan N.; Neal, David E.; Catalona, William J.; Mayordomo, José I.; Kiemeney, Lambertus A.; Thorleifsson, Gudmar; Stefansson, Kari
2016-01-01
Transcriptional and splicing anomalies have been observed in intron 8 of the CASP8 gene (encoding procaspase-8) in association with cutaneous basal-cell carcinoma (BCC) and linked to a germline SNP rs700635. Here, we show that the rs700635[C] allele, which is associated with increased risk of BCC and breast cancer, is protective against prostate cancer [odds ratio (OR) = 0.91, P = 1.0 × 10⁻⁶]. rs700635[C] is also associated with failures to correctly splice out CASP8 intron 8 in breast and prostate tumours and in corresponding normal tissues. Investigation of rs700635[C] carriers revealed that they have a human-specific short interspersed element-variable number of tandem repeat-Alu (SINE-VNTR-Alu), subfamily-E retrotransposon (SVA-E) inserted into CASP8 intron 8. The SVA-E shows evidence of prior activity, because it has transduced some CASP8 sequences during subsequent retrotransposition events. Whole-genome sequence (WGS) data were used to tag the SVA-E with a surrogate SNP rs1035142[T] (r² = 0.999), which showed associations with both the splicing anomalies (P = 6.5 × 10⁻³²) and with protection against prostate cancer (OR = 0.91, P = 3.8 × 10⁻⁷). PMID:26740556
SINE sequences detect DNA fingerprints in salmonid fishes.
Spruell, P; Thorgaard, G H
1996-04-01
DNA probes homologous to two previously described salmonid short interspersed nuclear elements (SINEs) detected DNA fingerprint patterns in 14 species of salmonid fishes. The probes showed more homology to some species than to others and little homology to three nonsalmonid fishes. The DNA fingerprint patterns derived from the SINE probes are individual-specific and inherited in a Mendelian manner. Probes derived from different regions of the same SINE detect only partially overlapping banding patterns, reflecting a more complex SINE structure than has been previously reported. Like the human Alu sequence, the SINEs found in salmonids could provide useful genetic markers and primer sites for PCR-based techniques. These elements may be more desirable for some applications than traditional DNA fingerprinting probes that detect tandemly repeated arrays.
Gonzalez-Perez, E; Moral, P; Via, M; Vona, G; Varesi, L; Santamaria, J; Gaya-Vidal, M; Esteban, E
2007-01-01
The islands of the West Mediterranean have played a central role in numerous archaeological, historical and anthropological studies owing to their active participation in the history of the main Mediterranean civilisations. However, genetic data have not yielded a consistent picture of either their degree of internal differentiation or their relationships. A set of 18 Alu markers and three short tandem repeats (STRs) closely linked to the CD4, F13B and DM Alu have been analysed in seven samples from Majorca, Corsica, Sardinia and Sicily to explore some of these issues. Our samples show high genetic heterogeneity within and among islands for the Alu data. Global differentiation among islands (F_ST = 2.2%) is slightly higher than that described for Europeans and North Africans. Both the estimated divergence times among samples and the high population heterogeneity revealed by Alu data are compatible with population differences dating from the first settlement of the islands in the Paleolithic period. However, the high within-population diversities and the remarkable homogeneity observed in both STR and Alu/STR haplotype variation indicate that, at least since Neolithic times, gene flow has been acting in the west Mediterranean. Genetic drift in west-coast Sardinia and gene flow in west Sicily have contributed to their general differentiation, whereas Corsica, Majorca and east Sicily seem to reflect more recent historical relationships with continental south Europe.
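The differentiation statistic quoted above can be illustrated with a minimal sketch. It assumes Wright's heterozygosity-based definition of F_ST for a single biallelic locus (Alu insertion present or absent) with equally weighted populations; the abstract does not say which estimator the authors used, and the frequencies below are illustrative values, not data from the study.

```python
def fst_biallelic(insertion_freqs):
    """Wright's F_ST = (H_T - H_S) / H_T for one biallelic locus.

    insertion_freqs: per-population frequency of the Alu insertion allele.
    Populations are weighted equally (an assumption of this sketch).
    """
    if not insertion_freqs:
        raise ValueError("need at least one population")
    n = len(insertion_freqs)
    p_bar = sum(insertion_freqs) / n
    # Expected heterozygosity of the pooled population.
    h_total = 2 * p_bar * (1 - p_bar)
    if h_total == 0:
        return 0.0  # locus is fixed everywhere, so no differentiation is measurable
    # Mean expected heterozygosity within populations.
    h_within = sum(2 * p * (1 - p) for p in insertion_freqs) / n
    return (h_total - h_within) / h_total


# Illustrative frequencies only: two populations at 0.4 and 0.6 give about 0.04.
example = fst_biallelic([0.4, 0.6])
```

A multi-locus value such as the 2.2% reported would normally combine many markers; averaging per-locus components is one common choice, but that step is outside this sketch.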
NASA Astrophysics Data System (ADS)
Lozynskyi, Rostyslav; Lozynska, Maria
2006-04-01
Cytogenetic study of lymphocytes by light microscopy can reveal a large number of chromosomal abnormalities that determine corresponding hereditary disorders. However, geneticists sometimes observe cases in which the same chromosomal rearrangement, as seen under the light microscope, produces quite different phenotypes (from normal to abnormal) in relatives. The aim of the study was to explain the mechanisms behind the different phenotypes in family members carrying the same reciprocal translocations. Standard chromosome analysis was carried out in 12 families in which some relatives had reciprocal translocations. Chromosomes were differentially stained using the G-method. The samples were analysed under an optical microscope (×1000). Using the OMIM gene map, UCSC Genome Browser, eGenome Release v2.3 and UniGene databases, transposons and transposon derivatives were identified in the chromosome regions involved in the translocations. We suppose that the variability of clinical manifestations in translocation-bearing patients is caused by the influence of transposons, such as Hsmar2, Alu elements or others. We propose the following mechanisms of transposon action in these patients. The first may rest on recombination between two specific DNA-transposon-containing sites on different chromosomes, resulting in a balanced reciprocal translocation with no significant influence on the activity of most genes in the corresponding regions. The weakening of transposase repression, which may follow in gametes, increases transposase activity and thereby the probability of transposon dislocation. Dislocation can change the activity of groups of genes, because transposons often carry regulatory sequences. This can induce multiple congenital disorders in the progeny of phenotypically healthy parents carrying the translocation. According to the second mechanism, the reciprocal translocation is caused by recombination between two Alu repeats.
These repeats can undergo reverse transcription, and the DNA product formed during this process can insert into a new chromosome region in gametes. As Alu repeats contain CpG islands, they can change gene activity, resulting in a disorder. Understanding such genetic disorders might help to predict the appearance of progeny with a pathological karyotype, making light microscopy more informative in the diagnosis of these diseases.
Small Deletion Variants Have Stable Breakpoints Commonly Associated with Alu Elements
Coin, Lachlan J. M.; Steinfeld, Israel; Yakhini, Zohar; Sladek, Rob; Froguel, Philippe; Blakemore, Alexandra I. F.
2008-01-01
Copy number variants (CNVs) contribute significantly to human genomic variation, with over 5000 loci reported, covering more than 18% of the euchromatic human genome. Little is known, however, about the origin and stability of variants of different size and complexity. We investigated the breakpoints of 20 small, common deletions, representing a subset of those originally identified by array CGH, using Agilent microarrays, in 50 healthy French Caucasian subjects. By sequencing PCR products amplified using primers designed to span the deleted regions, we determined the exact size and genomic position of the deletions in all affected samples. For each deletion studied, all individuals carrying the deletion share identical upstream and downstream breakpoints at the sequence level, suggesting that the deletion event occurred just once and later became common in the population. This is supported by linkage disequilibrium (LD) analysis, which has revealed that most of the deletions studied are in moderate to strong LD with surrounding SNPs, and have conserved long-range haplotypes. Analysis of the sequences flanking the deletion breakpoints revealed an enrichment of microhomology at the breakpoint junctions. More significantly, we found an enrichment of Alu repeat elements, the overwhelming majority of which intersected deletion breakpoints at their poly-A tails. We found no enrichment of LINE elements or segmental duplications, in contrast to other reports. Sequence analysis revealed enrichment of a conserved motif in the sequences surrounding the deletion breakpoints, although whether this motif has any mechanistic role in the formation of some deletions has yet to be determined. Considered together with existing information on more complex inherited variant regions, and reports of de novo variants associated with autism, these data support the presence of different subgroups of CNV in the genome which may have originated through different mechanisms. PMID:18769679
Insertion and deletion polymorphisms of the ancient AluS family in the human genome.
Kryatova, Maria S; Steranka, Jared P; Burns, Kathleen H; Payer, Lindsay M
2017-01-01
Polymorphic Alu elements account for 17% of structural variants in the human genome. The majority of these belong to the youngest AluY subfamilies, and most structural variant discovery efforts have focused on identifying Alu polymorphisms from these currently retrotranspositionally active subfamilies. In this report we analyze polymorphisms from the evolutionarily older AluS subfamily, whose peak activity was tens of millions of years ago. We annotate the AluS polymorphisms, assess their likely mechanism of origin, and evaluate their contribution to structural variation in the human genome. Of 52 previously reported polymorphic AluS elements ascertained for this study, 48 were confirmed to belong to the AluS subfamily using high stringency subfamily classification criteria. Of these, the majority (77%, 37/48) appear to be deletion polymorphisms. Two polymorphic AluS elements (4%) have features of non-classical Alu insertions and one polymorphic AluS element (2%) likely inserted by a mechanism involving internal priming. Seven AluS polymorphisms (15%) appear to have arisen by the classical target-primed reverse transcription (TPRT) retrotransposition mechanism. These seven TPRT products are 3' intact with 3' poly-A tails, and are flanked by target site duplications; L1 ORF2p endonuclease cleavage sites were also observed, providing additional evidence that these are L1 ORF2p endonuclease-mediated TPRT insertions. Further sequence analysis showed strong conservation of both the RNA polymerase III promoter and SRP9/14 binding sites, important for mediating transcription and interaction with retrotransposition machinery, respectively. This conservation of functional features implies that some of these are fairly recent insertions since they have not diverged significantly from their respective retrotranspositionally competent source elements. 
Of the polymorphic AluS elements evaluated in this report, 15% (7/48) have features consistent with TPRT-mediated insertion, thus suggesting that some AluS elements have been more active recently than previously thought, or that fixation of AluS insertion alleles remains incomplete. These data expand the potential significance of polymorphic AluS elements in contributing to structural variation in the human genome. Future discovery efforts focusing on polymorphic AluS elements are likely to identify more such polymorphisms, and approaches tailored to identify deletion alleles may be warranted.
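The category percentages quoted in this abstract can be rechecked with a short tally. This is only an arithmetic sketch: the category labels are shorthand chosen here, and the counts are the ones stated in the abstract. The four stated categories sum to 47, so one of the 48 confirmed elements is not assigned a category in the abstract text itself.

```python
# Counts as stated in the abstract; the dictionary keys are shorthand labels.
CONFIRMED_ALUS = 48
CATEGORY_COUNTS = {
    "deletion polymorphism": 37,
    "non-classical insertion": 2,
    "internal priming": 1,
    "classical TPRT insertion": 7,
}


def category_percentages(counts, total):
    """Return each category as a whole-number percentage of `total`."""
    return {name: round(100 * n / total) for name, n in counts.items()}


percentages = category_percentages(CATEGORY_COUNTS, CONFIRMED_ALUS)
# Elements among the 48 that the abstract does not place in any listed category.
uncategorised = CONFIRMED_ALUS - sum(CATEGORY_COUNTS.values())
```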
Alu-derived cis-element regulates tumorigenesis-dependent gastric expression of GASDERMIN B (GSDMB).
Komiyama, Hiromitsu; Aoki, Aya; Tanaka, Shigekazu; Maekawa, Hiroshi; Kato, Yoriko; Wada, Ryo; Maekawa, Takeo; Tamura, Masaru; Shiroishi, Toshihiko
2010-02-01
GASDERMIN B (GSDMB) belongs to the novel gene family GASDERMIN (GSDM). All GSDM family members are located in amplicons, genomic regions often amplified during cancer development. Given that GSDMB is highly expressed in cancerous cells and the locus resides in an amplicon, GSDMB may be involved in cancer development and/or progression. However, only limited information is available on GSDMB expression in tissues, normal and cancerous, from cancer patients. Furthermore, the molecular mechanisms that regulate GSDMB expression in gastric tissues are poorly understood. We investigated the spatiotemporal expression patterns of GSDMB in gastric cancer patients and the 5' regulatory sequences upstream of GSDMB. GSDMB was not expressed in the majority of normal gastric-tissue samples, and the expression level was very low in the few normal samples with GSDMB expression. Most pre-cancer samples showed moderate GSDMB expression, and most cancerous samples showed augmented GSDMB expression. Analysis of genome sequences revealed that an Alu element resides in the 5' region upstream of GSDMB. Reporter assays using intact, deleted, and mutated Alu elements clearly showed that this Alu element positively regulates GSDMB expression and that a putative IKZF binding motif in this element is crucial to upregulate GSDMB expression.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hori, Toshinori; Tomatsu, Shunji; Fukuda, Seiji
1995-04-10
Mucopolysaccharidosis IVA (MPS IVA) is an autosomal recessive disorder caused by a deficiency in N-acetylgalactosamine-6-sulfatase (GALNS). We found two separate deletions of nearly 8.0 and 6.0 kb in the GALNS gene, including some exons. There are Alu repetitive elements near the breakpoints of the 8.0-kb deletion, and this deletion resulted from an Alu-Alu recombination. The other 6.0-kb deletion involved illegitimate recombinational events between incomplete short direct repeats of 8 bp at deletion breakpoints. The same rearrangement has been observed in a heteroallelic state in four unrelated patients. This is the first documentation of a common double deletion in a gene that is not a member of a gene cluster. 39 refs., 5 figs.
Engel, Pablo; Angulo, Ana
2016-01-01
Since the discovery of the high abundance of Alu elements in the human genome, interest in the functional significance of these retrotransposons has been increasing. Primate Alu and rodent Alu-like elements are retrotransposed by a mechanism driven by the LINE1 (L1) encoded proteins, the same machinery that generates the L1 repeats, the processed pseudogenes (PPs), and other retroelements. Apart from free Alu RNAs, Alus are also transcribed and retrotranscribed as part of cellular gene transcripts, generally embedded inside 3’ untranslated regions (UTRs). Despite different proposed hypotheses, the functional implication of the presence of Alus inside 3’UTRs remains elusive. In this study we hypothesized that Alu elements in 3’UTRs could be involved in the genesis of PPs. By analyzing human genome data we discovered that 3’UTR-embedded Alu elements are overrepresented in genes that are sources of PPs. In contrast, the presence of other retrotransposable elements in 3’UTRs does not show this PP-linked overrepresentation. This research was extended to mouse and rat genomes, and the results accordingly reveal overrepresentation of 3’UTR-embedded B1 (Alu-like) elements in PP parent genes. Interestingly, we also demonstrated that the overrepresentation of 3’UTR-embedded Alus is particularly significant in PP parent genes with low germline gene expression level. Finally, we provide data that support the hypothesis that the L1 machinery is also the system that herpesviruses, and possibly other large DNA viruses, use to capture host genes expressed in germline or somatic cells. Altogether our results suggest a novel role for Alu or Alu-like elements inside 3’UTRs as facilitators of the genesis of PPs, particularly in lowly expressed genes. Moreover, we propose that this L1-driven mechanism, aided by the presence of 3’UTR-embedded Alus, may also be exploited by DNA viruses to incorporate host genes into their viral genomes. PMID:28033411
Existence of host-related DNA sequences in the schistosome genome.
Iwamura, Y; Irie, Y; Kominami, R; Nara, T; Yasuraoka, K
1991-06-01
DNA sequences homologous to the mouse intracisternal A particle and endogenous type C retrovirus were detected in the DNAs of Schistosoma japonicum adults and S. mansoni eggs. Furthermore, other kinds of repetitive sequences in the host genome such as mouse type 1 Alu sequence (B1), mouse type 2 Alu sequence (B2) and mo-2 sequence, a mouse mini-satellite, were also detected in the DNAs from adults and eggs of S. japonicum and eggs of S. mansoni. Almost all of the sequences described above were absent in the DNAs of S. mansoni adults. The DNA fingerprints of schistosomes, using the mo-2 sequence, were indistinguishable from each other and resembled those of their murine hosts. Moreover, the mo-2 sequence was hypermethylated in the DNAs of schistosomes and its amount varied among them. These facts indicate that host-related sequences are actually present in schistosomes and that the mo-2 repetitive sequence probably exists in an extrachromosomal form.
Willett-Brozick, J E; Savul, S A; Richey, L E; Baysal, B E
2001-08-01
Constitutional chromosomal translocations are relatively common causes of human morbidity, yet the DNA double-strand break (DSB) repair mechanisms that generate them are incompletely understood. We cloned, sequenced and analyzed the breakpoint junctions of a familial constitutional reciprocal translocation t(9;11)(p24;q23). Within the 10-kb region flanking the breakpoints, chromosome 11 had 25% repeat elements, whereas chromosome 9 had 98% repeats, 95% of which were L1-type LINE elements. The breakpoints occurred within an L1-type repeat element at 9p24 and at the 3'-end of an Alu sequence at 11q23. At the breakpoint junction of derivative chromosome 9, we discovered an unusually large 41-bp insertion, which showed 100% identity to 12S mitochondrial DNA (mtDNA) between nucleotides 896 and 936 of the mtDNA sequence. Analysis of the human genome failed to show the preexistence of the inserted sequence at normal chromosomes 9 and 11 breakpoint junctions or elsewhere in the genome, strongly suggesting that the insertion was derived from human mtDNA and captured into the junction during the DSB repair process. To our knowledge, these findings represent the first observation of spontaneous germ line insertion of modern human mtDNA sequences and suggest that DSB repair may play a role in inter-organellar gene transfer in vivo. Our findings also provide evidence for a previously unrecognized insertional mechanism in human, by which non-mobile extra-chromosomal fragments can be inserted into the genome at DSB repair junctions.
Alu-mediated large deletion of the CDSN gene as a cause of peeling skin disease.
Wada, T; Matsuda, Y; Muraoka, M; Toma, T; Takehara, K; Fujimoto, M; Yachie, A
2014-10-01
Peeling skin disease (PSD) is an autosomal recessive skin disorder caused by mutations in CDSN and is characterized by superficial peeling of the upper epidermis. Corneodesmosin (CDSN) is a major component of corneodesmosomes that plays an important role in maintaining epidermis integrity. Herein, we report a patient with PSD caused by a novel homozygous large deletion in the 6p21.3 region encompassing the CDSN gene, which abrogates CDSN expression. Several genes including C6orf15, PSORS1C1, PSORS1C2, CCHCR1, and TCF19 were also deleted, however, the patient showed only clinical features typical of PSD. The deletion size was 59.1 kb. Analysis of the sequence surrounding the breakpoint showed that both telomeric and centromeric breakpoints existed within Alu-S sequences that were oriented in opposite directions. These results suggest an Alu-mediated recombination event as the mechanism underlying the deletion in our patient. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Liu, Gang; Ma, Dingyuan; Hu, Ping; Wang, Wen; Luo, Chunyu; Wang, Yan; Sun, Yun; Zhang, Jingjing; Jiang, Tao; Xu, Zhengfeng
2018-01-01
Maple syrup urine disease (MSUD) is an autosomal recessive inherited metabolic disorder caused by mutations in the BCKDHA, BCKDHB, DBT, and DLD genes. Among the wide range of disease-causing mutations in BCKDHB, only one large deletion has been associated with MSUD. Compound heterozygous mutations in BCKDHB were identified in a Chinese patient with typical MSUD using next-generation sequencing, quantitative PCR, and array comparative genomic hybridization. One allele presented a missense mutation (c.391G > A), while the other allele had a large deletion; both were inherited from the patient’s unaffected parents. The deletion breakpoints were characterized using long-range PCR and sequencing. A novel 383,556 bp deletion (chr6: g.80811266_81194921del) was determined, which encompassed the entire BCKDHB gene. The junction site of the deletion was localized within a homologous sequence in two AluYa5 elements. Hence, Alu-mediated non-allelic homologous recombination is speculated as the mutational event underlying the large deletion. In summary, this study reports a recombination mechanism in the BCKDHB gene causing a whole gene deletion in a newborn with MSUD. PMID:29740478
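The size of a deletion written in HGVS genomic notation follows from its coordinates, because both endpoints of a `g.start_enddel` range are deleted. The sketch below assumes that inclusive convention; the function name is hypothetical. Applied to the coordinates as printed in the abstract it gives 383,656 bp, which is 100 bp more than the 383,556 bp stated, and the abstract alone does not show which of the two figures carries the discrepancy.

```python
import re


def deletion_span(hgvs: str) -> int:
    """Length in bp of a deletion written as 'g.<start>_<end>del'.

    Both endpoints are counted, since an HGVS range names the first and
    last deleted positions.
    """
    match = re.search(r"g\.(\d+)_(\d+)del", hgvs)
    if match is None:
        raise ValueError(f"not a g.start_enddel description: {hgvs!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1


# Coordinates exactly as printed in the abstract.
span = deletion_span("chr6: g.80811266_81194921del")  # 383656 with both endpoints counted
```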
Sakkhachornphop, Supachai; Barbas, Carlos F; Keawvichit, Rassamee; Wongworapat, Kanlaya; Tayapiwatana, Chatchai
2012-09-01
Integration of the human immunodeficiency virus type 1 (HIV-1) genome into the host chromosome is a vital step in the HIV life cycle. The highly conserved cytosine-adenine (CA) dinucleotide sequence immediately upstream of the cleavage site is crucial for integrase (IN) activity. As this viral enzyme has an important role early in the HIV-1 replication cycle, interference with the IN substrate has become an attractive strategy for therapeutic intervention. We demonstrated that a designed zinc finger protein (ZFP) fused to green fluorescent protein (GFP) targets the 2-long terminal repeat (2-LTR) circle junctions of HIV-1 DNA with nanomolar affinity. We report now that 2LTRZFP-GFP stably transduced into 293T cells interfered with the expression of vesicular stomatitis virus glycoprotein (VSV-G)-pseudotyped lentiviral red fluorescent protein (RFP), as shown by the suppression of RFP expression. We also used a third-generation lentiviral vector and pCEP4 expression vector to deliver the 2LTRZFP-GFP transgene into human T-lymphocytic cells, and a stable cell line for long-term expression studies was selected for HIV-1 challenge. HIV-1 integration and replication were inhibited as measured by Alu-gag real-time PCR and p24 antigen assay. In addition, the molecular activity of 2LTRZFP-GFP was evaluated in peripheral blood mononuclear cells. The results were confirmed by Alu-gag real-time PCR for integration interference. We suggest that the expression of 2LTRZFP-GFP limited viral integration on intracellular immunization, and that it has potential for use in HIV gene therapy in the future.
Sequence-Level Mechanisms of Human Epigenome Evolution
Prendergast, James G.D.; Chambers, Emily V.; Semple, Colin A.M.
2014-01-01
DNA methylation and chromatin states play key roles in development and disease. However, the extent of recent evolutionary divergence in the human epigenome and the influential factors that have shaped it are poorly understood. To determine the links between genome sequence and human epigenome evolution, we examined the divergence of DNA methylation and chromatin states following segmental duplication events in the human lineage. Chromatin and DNA methylation states were found to have been generally well conserved following a duplication event, with the evolution of the epigenome largely uncoupled from the total number of genetic changes in the surrounding DNA sequence. However, the epigenome at tissue-specific, distal regulatory regions was observed to be unusually prone to diverge following duplication, with particular sequence differences, altering known sequence motifs, found to be associated with divergence in patterns of DNA methylation and chromatin. Alu elements were found to have played a particularly prominent role in shaping human epigenome evolution, and we show that human-specific AluY insertion events are strongly linked to the evolution of the DNA methylation landscape and gene expression levels, including at key neurological genes in the human brain. Studying paralogous regions within the same sample enables the study of the links between genome and epigenome evolution while controlling for biological and technical variation. We show DNA methylation and chromatin divergence between duplicated regions are linked to the divergence of particular genetic motifs, with Alu elements having played a disproportionate role in the evolution of the epigenome in the human lineage. PMID:24966180
Huen, Karen; Calafat, Antonia M.; Bradman, Asa; Yousefi, Paul; Eskenazi, Brenda; Holland, Nina
2016-01-01
Phthalates are frequently used in personal care products and plasticizers and phthalate exposure is ubiquitous in the US population. Exposure to phthalates during critical periods in utero has been associated with a variety of adverse health outcomes but the biological mechanisms linking these exposures with disease are not well characterized. In this study, we examined the relationship of in utero phthalate exposure with repetitive element DNA methylation, an epigenetic marker of genome instability, in children from the longitudinal birth cohort CHAMACOS. Methylation of Alu and long interspersed nucleotide elements (LINE-1) was determined using pyrosequencing of bisulfite-treated DNA isolated from whole blood samples collected from newborns and 9 year old children (n=355). Concentrations of eleven phthalate metabolites were measured in urine collected from pregnant mothers at 13 and 26 weeks gestation. We found a consistent inverse association between prenatal concentrations of monoethyl phthalate, the most frequently detected urinary metabolite, with cord blood methylation of Alu repeats (β(95%CI):−0.14(−0.28,0.00) and −0.16(−0.31,−0.02)) for early and late pregnancy, respectively, and a similar but weaker association with LINE-1 methylation. Additionally, increases in urinary concentrations of di-(2-ethylhexyl) phthalate metabolites during late pregnancy were associated with lower levels of methylation of Alu repeats in 9 year old blood (significant p-values ranged from 0.003 to 0.03). Our findings suggest that prenatal exposure to some phthalates may influence differences in repetitive element methylation, highlighting epigenetics as a plausible biological mechanism through which phthalates may affect health. PMID:27019040
Alu sequence involvement in transcriptional insulation of the keratin 18 gene in transgenic mice.
Thorey, I S; Ceceña, G; Reynolds, W; Oshima, R G
1993-01-01
The human keratin 18 (K18) gene is expressed in a variety of adult simple epithelial tissues, including liver, intestine, lung, and kidney, but is not normally found in skin, muscle, heart, spleen, or most of the brain. Transgenic animals derived from the cloned K18 gene express the transgene in appropriate tissues at levels directly proportional to the copy number and independently of the sites of integration. We have investigated in transgenic mice the dependence of K18 gene expression on the distal 5' and 3' flanking sequences and upon the RNA polymerase III promoter of an Alu repetitive DNA transcription unit immediately upstream of the K18 promoter. Integration site-independent expression of tandemly duplicated K18 transgenes requires the presence of either an 825-bp fragment of the 5' flanking sequence or the 3.5-kb 3' flanking sequence. Mutation of the RNA polymerase III promoter of the Alu element within the 825-bp fragment abolishes copy number-dependent expression in kidney but does not abolish integration site-independent expression when assayed in the absence of the 3' flanking sequence of the K18 gene. The characteristics of integration site-independent expression and copy number-dependent expression are separable. In addition, the formation of the chromatin state of the K18 gene, which likely restricts the tissue-specific expression of this gene, is not dependent upon the distal flanking sequences of the 10-kb K18 gene but rather may depend on internal regulatory regions of the gene. PMID:7692231
Shewale, Jaiprakash G; Schneida, Elaine; Wilson, Jonathan; Walker, Jerilyn A; Batzer, Mark A; Sinha, Sudhir K
2007-03-01
The human DNA quantification (H-Quant) system, developed for use in human identification, enables quantitation of human genomic DNA in biological samples. The assay is based on real-time amplification of AluYb8 insertions in hominoid primates. The relatively high copy number of subfamily-specific Alu repeats in the human genome enables quantification of very small amounts of human DNA. The oligonucleotide primers present in H-Quant are specific for DNA from humans and closely related great apes. During the real-time PCR, the SYBR Green I dye binds to the DNA that is synthesized from the AluYb8 oligonucleotide primers. The fluorescence of the bound SYBR Green I dye is measured at the end of each PCR cycle. The cycle at which the fluorescence crosses the chosen threshold correlates with the quantity of amplifiable DNA in that sample. The minimal sensitivity of the H-Quant system is 7.6 pg/µL of human DNA. The amplicon generated in the H-Quant assay is 216 bp, which is within the same size range as the commonly amplified short tandem repeat (STR) amplicons. An amplicon of this size enables quantitation of amplifiable DNA as opposed to quantitation of degraded or nonamplifiable DNA of smaller sizes. Development and validation studies were performed on the 7500 real-time PCR system following the Quality Assurance Standards for Forensic DNA Testing Laboratories.
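The relationship between threshold cycle and DNA quantity described above is usually exploited through a standard curve. The following is a generic sketch of that approach, not the H-Quant software: it assumes a log-linear model Ct = slope × log10(quantity) + intercept fitted by least squares, and the function names and standard dilutions are hypothetical.

```python
import math


def fit_standard_curve(quantities, cts):
    """Least-squares fit of Ct against log10(quantity); returns (slope, intercept)."""
    if len(quantities) != len(cts) or len(quantities) < 2:
        raise ValueError("need at least two matched standards")
    xs = [math.log10(q) for q in quantities]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(cts) / len(cts)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, cts))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def quantity_from_ct(ct, slope, intercept):
    """Invert the standard curve to estimate the quantity of an unknown sample."""
    return 10 ** ((ct - intercept) / slope)


def amplification_efficiency(slope):
    """Efficiency implied by the slope; 1.0 means perfect doubling per cycle."""
    return 10 ** (-1 / slope) - 1


# Hypothetical tenfold dilution series (ng/µL) and threshold cycles, for illustration only.
standards = [10.0, 1.0, 0.1, 0.01]
standard_cts = [20.0, 23.32, 26.64, 29.96]
slope, intercept = fit_standard_curve(standards, standard_cts)
unknown = quantity_from_ct(25.0, slope, intercept)
```

A slope near −3.32 corresponds to roughly one doubling per cycle, which is why the slope is commonly reported alongside the curve as a quality check.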
Sakkhachornphop, Supachai; Barbas, Carlos F.; Keawvichit, Rassamee; Wongworapat, Kanlaya
2012-01-01
Integration of the human immunodeficiency virus type 1 (HIV-1) genome into the host chromosome is a vital step in the HIV life cycle. The highly conserved cytosine–adenine (CA) dinucleotide sequence immediately upstream of the cleavage site is crucial for integrase (IN) activity. As this viral enzyme has an important role early in the HIV-1 replication cycle, interference with the IN substrate has become an attractive strategy for therapeutic intervention. We demonstrated that a designed zinc finger protein (ZFP) fused to green fluorescent protein (GFP) targets the 2-long terminal repeat (2-LTR) circle junctions of HIV-1 DNA with nanomolar affinity. We report now that 2LTRZFP-GFP stably transduced into 293T cells interfered with the expression of vesicular stomatitis virus glycoprotein (VSV-G)-pseudotyped lentiviral red fluorescent protein (RFP), as shown by the suppression of RFP expression. We also used a third-generation lentiviral vector and pCEP4 expression vector to deliver the 2LTRZFP-GFP transgene into human T-lymphocytic cells, and a stable cell line for long-term expression studies was selected for HIV-1 challenge. HIV-1 integration and replication were inhibited as measured by Alu-gag real-time PCR and p24 antigen assay. In addition, the molecular activity of 2LTRZFP-GFP was evaluated in peripheral blood mononuclear cells. The results were confirmed by Alu-gag real-time PCR for integration interference. We suggest that the expression of 2LTRZFP-GFP limited viral integration on intracellular immunization, and that it has potential for use in HIV gene therapy in the future. PMID:22429108
An Alu-Based Phylogeny of Lemurs (Infraorder: Lemuriformes)
McLain, Adam T.; Meyer, Thomas J.; Faulk, Christopher; Herke, Scott W.; Oldenburg, J. Michael; Bourgeois, Matthew G.; Abshire, Camille F.
2012-01-01
Lemurs (infraorder: Lemuriformes) are a radiation of strepsirrhine primates endemic to the island of Madagascar. As of 2012, 101 lemur species, divided among five families, have been described. Genetic and morphological evidence indicates all species are descended from a common ancestor that arrived in Madagascar ∼55–60 million years ago (mya). Phylogenetic relationships in this species-rich infraorder have been the subject of debate. Here we use Alu elements, a family of primate-specific Short INterspersed Elements (SINEs), to construct a phylogeny of infraorder Lemuriformes. Alu elements are particularly useful SINEs for the purpose of phylogeny reconstruction because they are identical by descent and confounding events between loci are easily resolved by sequencing. The genome of the grey mouse lemur (Microcebus murinus) was computationally assayed for synapomorphic Alu elements. Those that were identified as Lemuriformes-specific were analyzed against other available primate genomes for orthologous sequence in which to design primers for PCR (polymerase chain reaction) verification. A primate phylogenetic panel of 24 species, including 22 lemur species from all five families, was examined for the presence/absence of 138 Alu elements via PCR to establish relationships among species. Of these, 111 were phylogenetically informative. A phylogenetic tree was generated based on the results of this analysis. We demonstrate strong support for the monophyly of Lemuriformes to the exclusion of other primates, with Daubentoniidae, the aye-aye, as the basal lineage within the infraorder. Our results also suggest Lepilemuridae as a sister lineage to Cheirogaleidae, and Indriidae as sister to Lemuridae. Among the Cheirogaleidae, we show strong support for Microcebus and Mirza as sister genera, with Cheirogaleus the sister lineage to both. Our results also support the monophyly of the Lemuridae. 
Within Lemuridae we place Lemur and Hapalemur together to the exclusion of Eulemur and Varecia, with Varecia the sister lineage to the other three genera. PMID:22937148
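The presence/absence scoring described in this abstract can be sketched as a small matrix filter. This is a minimal illustration only, not the authors' pipeline: the locus names and calls below are hypothetical, and the rule used for "phylogenetically informative" (insertion present in at least two scored taxa and absent in at least one) is an assumption, since the abstract does not define the criterion.

```python
from typing import Dict, List, Optional

# 1 = insertion present (filled site), 0 = absent (empty site),
# None = no PCR amplification (missing data). All calls are hypothetical.
Matrix = Dict[str, Dict[str, Optional[int]]]


def informative_loci(matrix: Matrix, min_shared: int = 2) -> List[str]:
    """Return loci whose insertion is shared by some, but not all, scored taxa."""
    informative = []
    for locus, calls in matrix.items():
        scored = [c for c in calls.values() if c is not None]
        present = sum(scored)
        absent = len(scored) - present
        # Assumed criterion: shared by >= min_shared taxa and absent in >= 1 taxon.
        if present >= min_shared and absent >= 1:
            informative.append(locus)
    return informative


toy = {
    "locus_A": {"Microcebus": 1, "Mirza": 1, "Cheirogaleus": 0, "Daubentonia": 0},
    "locus_B": {"Microcebus": 1, "Mirza": 0, "Cheirogaleus": 0, "Daubentonia": 0},
    "locus_C": {"Microcebus": 1, "Mirza": 1, "Cheirogaleus": 1, "Daubentonia": 1},
    "locus_D": {"Microcebus": 1, "Mirza": 1, "Cheirogaleus": 1, "Daubentonia": None},
}

print(informative_loci(toy))
```

Under the assumed rule, an insertion found in a single taxon (locus_B) or in every scored taxon (locus_C, locus_D) carries no grouping information for the panel, so only locus_A is kept.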
Kouprina, Natalay; Noskov, Vladimir N.; Waterfall, Joshua J.; Walker, Robert L.; Meltzer, Paul S.; Topol, Eric J.; Larionov, Vladimir
2018-01-01
Tandem segmental duplications (SDs) greater than 10 kb are widespread in complex genomes. They provide material for gene divergence and evolutionary adaptation, while formation of specific de novo SDs is a hallmark of cancer and some human diseases. Most SDs map to distinct genomic regions termed ‘duplication blocks’. SD organization within these blocks is often poorly characterized as they are mosaics of ancestral duplicons juxtaposed with younger duplicons arising from more recent duplication events. Structural and functional analysis of SDs is further hampered as long repetitive DNA structures are underrepresented in existing BAC and YAC libraries. We applied Transformation-Associated Recombination (TAR) cloning, a versatile technique for large DNA manipulation, to selectively isolate the coronary artery disease (CAD) interval sequence within the 9p21.3 chromosome locus from a patient with coronary artery disease and normal individuals. Four tandem head-to-tail duplicons, each ∼50 kb long, were recovered in the patient but not in normal individuals. Sequence analysis revealed that the repeats differed from one another by 10-15 SNPs and from the human genome sequence (version hg19) by 82 SNPs. SNP polymorphism within the junctions between repeats allowed two junction types to be distinguished, Type 1 and Type 2, which were found at a 2:1 ratio. The junction sequences contained an Alu element, a sequence previously shown to play a role in duplication. Knowledge of structural variation in the CAD interval from more patients could help link this locus to cardiovascular disease susceptibility, and may be relevant to other cases of regional amplification, including cancer. PMID:29632643
Yu, Qichao; Zhang, Wei; Zhang, Xiaolong; Zeng, Yongli; Wang, Yeming; Wang, Yanhui; Xu, Liqin; Huang, Xiaoyun; Li, Nannan; Zhou, Xinlan; Lu, Jie; Guo, Xiaosen; Li, Guibo; Hou, Yong; Liu, Shiping; Li, Bo
2017-09-01
Active retrotransposons play important roles during evolution and continue to shape our genomes today, especially in genetic polymorphisms underlying a diverse set of diseases. However, studies of human retrotransposon insertion polymorphisms (RIPs) based on whole-genome deep sequencing at the population level have not been sufficiently undertaken, despite the obvious need for a thorough characterization of RIPs in the general population. Herein, we present a novel and efficient computational tool called Specific Insertions Detector (SID) for the detection of non-reference RIPs. We demonstrate that SID is suitable for high-depth whole-genome sequencing data using paired-end reads obtained from simulated and real datasets. We construct a comprehensive RIP database using a large population of 90 Han Chinese individuals with a mean ×68 depth per individual. In total, we identify 9342 recent RIPs, and 8433 of these RIPs are novel compared with dbRIP, including 5826 Alu, 2169 long interspersed nuclear element 1 (L1), 383 SVA, and 55 long terminal repeats. Among the 9342 RIPs, 4828 were located in gene regions and 5 were located in protein-coding regions. We demonstrate that RIPs can, in principle, be an informative resource to perform population evolution and phylogenetic analyses. Taking the demographic effects into account, we identify a weak negative selection on SVA and L1 but an approximately neutral selection for Alu elements based on the frequency spectrum of RIPs. SID is a powerful open-source program for the detection of non-reference RIPs. We built a non-reference RIP dataset that greatly enhanced the diversity of RIPs detected in the general population, and it should be invaluable to researchers interested in many aspects of human evolution, genetics, and disease. As a proof of concept, we demonstrate that the RIPs can be used as biomarkers in a similar way as single nucleotide polymorphisms. © The Authors 2017. Published by Oxford University Press.
2018-01-01
FAM230C, a long intergenic non-coding RNA (lincRNA) gene on human chromosome 13 (chr13), is a member of the lincRNA gene family termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences, is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are located specifically in regions of low copy repeats (LCR22s), in or close to the 22q11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. PMID:29668722
Sokol, Martin; Jessen, Karen Margrethe; Pedersen, Finn Skou
2016-01-01
Several studies have shown that human endogenous retroviruses and endogenous retrovirus-like repeats (here collectively HERVs) impose direct regulation on human genes through enhancer and promoter motifs present in their long terminal repeats (LTRs). Although chimeric transcription, in which novel gene isoforms containing retroviral and human sequence are transcribed from viral promoters, is commonly associated with disease, regulation by HERVs is beneficial in other settings; for example, in human testis chimeric isoforms of TP63 induced by an ERV9 LTR protect the male germ line upon DNA damage by inducing apoptosis, whereas in the human globin locus the γ- and β-globin switch during normal hematopoiesis is mediated by complex interactions of an ERV9 LTR and surrounding human sequence. The advent of deep sequencing or next-generation sequencing (NGS) has revolutionized the way researchers solve important scientific questions and develop novel hypotheses in relation to human genome regulation. We recently applied next-generation paired-end RNA-sequencing (RNA-seq) together with chromatin immunoprecipitation with sequencing (ChIP-seq) to examine ERV9 chimeric transcription in human reference cell lines from the Encyclopedia of DNA Elements (ENCODE). This led to the discovery of advanced regulation mechanisms by ERV9s and other HERVs across numerous human loci including transcription of large gene-unannotated genomic regions, as well as cooperative regulation by multiple HERVs and non-LTR repeats such as Alu elements. In this article, well-established examples of human gene regulation by HERVs are reviewed, followed by a description of paired-end RNA-seq and its application in identifying chimeric transcription genome-wide. Based on integrative analyses of RNA-seq and ChIP-seq data, we then present novel examples of regulation by ERV9s of tumor suppressor genes CADM2 and SEMA3A, as well as transcription of an unannotated region. Taken together, this article highlights the high suitability of contemporary sequencing methods in future analyses of human biology in relation to evolutionarily acquired retroviruses in the human genome. © 2016 APMIS. Published by John Wiley & Sons Ltd.
LOU, XIAOLI; HOU, YANQIANG; LIANG, DONGYU; PENG, LIANG; CHEN, HONGWEI; MA, SHANYUAN; ZHANG, LURONG
2015-01-01
In the present study, we aimed to develop and validate a rapid and sensitive Alu-based real-time PCR method for the detection of circulating cell-free DNA (cfDNA). This method targeted the repetitive Alu elements in the human genome, followed by signal amplification using fluorescence quantification. Standard Alu-puc57 vectors were constructed and 5 pairs of specific primers were designed. The assay was evaluated for linearity, variation and recovery. We found 5 linear responses (R1–5=0.998–0.999). The average intra- and inter-assay coefficients of variation were 12.98 and 10.75%, respectively. The recovery was 82.33–114.01%, with a mean recovery index of 101.26%. This Alu-based assay was reliable, accurate and sensitive for the quantitative detection of cfDNA. Plasma samples from normal controls and patients with myocardial infarction (MI) were analyzed, and the baseline levels of cfDNA were higher in the MI group. The area under the receiver operating characteristic (ROC) curve for Alu1, Alu2, Alu3, Alu4, Alu5 and Alu (Alu1 + Alu2 + Alu3 + Alu4 + Alu5) was 0.887, 0.758, 0.857, 0.940, 0.968 and 0.933, respectively. The optimal cut-off value for Alu1, Alu2, Alu3, Alu4, Alu5 and Alu to predict MI was 3.71, 1.93, 0.22, 3.73, 6.13 and 6.40 log copies/ml, respectively. We demonstrate that this new method is a reliable, accurate and sensitive method for the quantitative detection of cfDNA and that it is useful for studying the regulation of cfDNA in certain pathological conditions. Alu4, Alu5 and Alu showed better sensitivity and specificity for the diagnosis of MI compared with cardiac troponin I (cTnI), creatine kinase MB (CK-MB) isoenzyme and lactate dehydrogenase (LDH). Alu5 had the best prognostic ability. PMID:25374065
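The ROC area and cut-off values reported in this abstract can be illustrated with a short sketch. It is a minimal example under stated assumptions: the abstract does not say how the optimal cut-off was chosen, so Youden's J statistic is assumed here, and the log copies/ml values are made-up numbers, not data from the study.

```python
from typing import List, Tuple


def roc_auc(cases: List[float], controls: List[float]) -> float:
    """Area under the ROC curve as the probability that a case outranks a control."""
    wins = 0.0
    for x in cases:
        for y in controls:
            if x > y:
                wins += 1.0
            elif x == y:
                wins += 0.5  # ties count as half a win
    return wins / (len(cases) * len(controls))


def youden_cutoff(cases: List[float], controls: List[float]) -> Tuple[float, float]:
    """Return (threshold, J) maximising J = sensitivity + specificity - 1.

    A sample is called positive when its value is >= threshold.
    The lowest threshold is returned when several share the maximum J.
    """
    best_t, best_j = float("nan"), float("-inf")
    for t in sorted(set(cases) | set(controls)):
        sensitivity = sum(x >= t for x in cases) / len(cases)
        specificity = sum(y < t for y in controls) / len(controls)
        j = sensitivity + specificity - 1.0
        if j > best_j:
            best_t, best_j = t, j
    return best_t, best_j


# Hypothetical cfDNA levels in log copies/ml (illustrative only).
mi_patients = [4.1, 5.0, 3.9, 6.2]
normal_controls = [2.0, 3.5, 4.0, 1.8]

print(roc_auc(mi_patients, normal_controls))
print(youden_cutoff(mi_patients, normal_controls))
```

The pairwise form of the AUC is quadratic in sample size, which is fine for small panels; a rank-based implementation would be preferable for larger cohorts.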
Molecular analysis of the glucocerebrosidase gene locus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Winfield, S.L.; Martin, B.M.; Fandino, A.
1994-09-01
Gaucher disease is due to a deficiency in the activity of the lysosomal enzyme glucocerebrosidase. Both the functional gene for this enzyme and a pseudogene are located in close proximity on chromosome 1q21. Analysis of the mutations present in patient samples has suggested interaction between the functional gene and the pseudogene in the origin of mutant genotypes. To investigate the involvement of regions flanking the functional gene and pseudogene in the origin of mutations found in Gaucher disease, a YAC clone containing DNA from this locus has been subcloned and characterized. The original YAC containing approximately 360 kb was truncated with the use of fragmentation plasmids to about 85 kb. A lambda library derived from this YAC was screened to obtain clones containing glucocerebrosidase sequences. PCR amplification was used to identify subclones containing 5′, central, or 3′ sequences of the functional gene or of the pseudogene. Clones spanning the entire distance from the last exon of the functional gene to intron 1 of the pseudogene, the 5′ end of the functional gene and 16 kb of 5′ flanking region and approximately 15 kb of 3′ flanking region of the pseudogene were sequenced. Sequence data from 48 kb of intergenic and flanking regions of the glucocerebrosidase gene and its pseudogene has been generated. A large number of Alu sequences and several simple repeats have been found. Two of these repeats exhibit fragment length polymorphism. There is almost 100% homology between the 3′ flanking regions of the functional gene and the pseudogene, extending to about 4 kb past the termination codons. A much lower degree of homology is observed in the 5′ flanking region. Patient samples are currently being screened for polymorphisms in these flanking regions.
The human genome: a multifractal analysis
2011-01-01
Background: Several studies have shown that genomes can be studied via a multifractal formalism. Recently, we used a multifractal approach to study the genetic information content of the Caenorhabditis elegans genome. Here we investigate the possibility that the human genome shows a similar behavior to that observed in the nematode. Results: We report here multifractality in the human genome sequence. This behavior correlates strongly with the presence of Alu elements and to a lesser extent with CpG islands and (G+C) content. In contrast, little or no relationship was found for LINE, MIR, MER and LTR elements or for DNA regions poor in genetic information. Gene function, cluster of orthologous genes, metabolic pathways, and exons tended to increase their frequencies with ranges of multifractality, and large gene families were located in genomic regions with varied multifractality. Additionally, a multifractal map and classification for human chromosomes are proposed. Conclusions: Based on these findings, we propose a descriptive non-linear model for the structure of the human genome, with some biological implications. This model reveals 1) a multifractal regionalization where many regions coexist that are far from equilibrium and 2) that this non-linear organization has significant molecular and medical genetic implications for understanding the role of Alu elements in genome stability and structure of the human genome. Given the role of Alu sequences in gene regulation, genetic diseases, human genetic diversity, adaptation and phylogenetic analyses, these quantifications are especially useful. PMID:21999602
Genomic patterns associated with paternal/maternal distribution of transposable elements
NASA Astrophysics Data System (ADS)
Jurka, Jerzy
2003-03-01
Transposable elements (TEs) are specialized DNA or RNA fragments capable of surviving in intragenomic niches. They are commonly, perhaps unjustifiably, referred to as "selfish" or "parasitic" elements. TEs can be divided into two major classes: retroelements and DNA transposons. The former include non-LTR retrotransposons and retrovirus-like elements, using reverse transcriptase for their reproduction prior to integration into host DNA. The latter depend mostly on host DNA replication, with the possible exception of rolling-circle transposons recently discovered by our team. I will review basic information on TEs, with emphasis on human Alu and L1 retroelements discussed in the context of genomic organization. TEs are non-randomly distributed in chromosomal DNA. In particular, human Alu elements tend to prefer GC-rich regions, whereas L1 accumulate in AT-rich regions. Current explanations of this phenomenon focus on the so-called "target effects" and post-insertional selection. However, the proposed models appear to be unsatisfactory and alternative explanations invoking "channeling" to different chromosomal regions will be a major focus of my presentation. Transposable elements (TEs) can be expressed and integrated into host DNA in the male or female germlines, or both. Different models of expression and integration imply different proportions of TEs on sex chromosomes and autosomes. The density of recently retroposed human Alu elements is around three times higher on chromosome Y than on chromosome X, and over two times higher than the average density for all human autosomes. This implies Alu activity in paternal germlines. Analogous inter-chromosomal proportions for other repeat families should determine their compatibility with one of the three basic models describing the inheritance of TEs. Published evidence indicates that maternally and paternally imprinted genes roughly correspond to GC-rich and AT-rich DNA. This may explain the observed chromosomal distribution of Alu and L1 elements. Finally, paternal models of inheritance predict rapid accumulation of active TEs on chromosome Y. I will discuss potential implications of this phenomenon for evolution of chromosome Y and transposable elements.
Effects of Particulate Matter on Genomic DNA Methylation Content and iNOS Promoter Methylation
Tarantini, Letizia; Bonzini, Matteo; Apostoli, Pietro; Pegoraro, Valeria; Bollati, Valentina; Marinelli, Barbara; Cantone, Laura; Rizzo, Giovanna; Hou, Lifang; Schwartz, Joel; Bertazzi, Pier Alberto; Baccarelli, Andrea
2009-01-01
Background Altered patterns of gene expression mediate the effects of particulate matter (PM) on human health, but mechanisms through which PM modifies gene expression are largely undetermined. Objectives We aimed at identifying short- and long-term effects of PM exposure on DNA methylation, a major genomic mechanism of gene expression control, in workers in an electric furnace steel plant with well-characterized exposure to PM with aerodynamic diameters < 10 μm (PM10). Methods We measured global genomic DNA methylation content estimated in Alu and long interspersed nuclear element-1 (LINE-1) repeated elements, and promoter DNA methylation of iNOS (inducible nitric oxide synthase), a gene suppressed by DNA methylation and induced by PM exposure in blood leukocytes. Quantitative DNA methylation analysis was performed through bisulfite PCR pyrosequencing on blood DNA obtained from 63 workers on the first day of a work week (baseline, after 2 days off work) and after 3 days of work (postexposure). Individual PM10 exposure was between 73.4 and 1,220 μg/m3. Results Global methylation content estimated in Alu and LINE-1 repeated elements did not show changes in postexposure measures compared with baseline. PM10 exposure levels were negatively associated with methylation in both Alu [β = −0.19 %5-methylcytosine (%5mC); p = 0.04] and LINE-1 [β = −0.34 %5mC; p = 0.04], likely reflecting long-term PM10 effects. iNOS promoter DNA methylation was significantly lower in postexposure blood samples compared with baseline (difference = −0.61 %5mC; p = 0.02). Conclusions We observed changes in global and gene specific methylation that should be further characterized in future investigations on the effects of PM. PMID:19270791
Effects of age, sex, and persistent organic pollutants on DNA methylation in children
Huen, Karen; Yousefi, Paul; Bradman, Asa; Yan, Liying; Harley, Kim G.; Kogut, Katherine; Eskenazi, Brenda; Holland, Nina
2015-01-01
Epigenetic changes such as DNA methylation may be a molecular mechanism through which environmental exposures affect health. Methylation of Alu and long interspersed nucleotide elements (LINE-1) is a well-established measure of DNA methylation often used in epidemiologic studies. Yet, few studies have examined the effects of host factors on LINE-1 and Alu methylation in children. We characterized the relationship of age, sex, and prenatal exposure to persistent organic pollutants (POPs), dichlorodiphenyl trichloroethane (DDT), dichlorodiphenyldichloroethylene (DDE), and polybrominated diphenyl ethers (PBDEs), with DNA methylation in a birth cohort of Mexican-American children participating in the CHAMACOS study. We measured Alu and LINE-1 methylation by pyrosequencing bisulfite-treated DNA isolated from whole blood samples collected from newborns and 9-year old children (n=358). POPs were measured in maternal serum during late pregnancy. Levels of DNA methylation were lower in 9-year olds compared to newborns and were higher in boys compared to girls. Higher prenatal DDT/E exposure was associated with lower Alu methylation at birth, particularly after adjusting for cell type composition (p=0.02 for o,p′ -DDT). Associations of POPs with LINE-1 methylation were only identified after examining the co-exposure of DDT/E with PBDEs simultaneously. Our data suggest that repeat element methylation can be an informative marker of epigenetic differences by age and sex and that prenatal exposure to POPs may be linked to hypomethylation in fetal blood. Accounting for co-exposure to different types of chemicals and adjusting for blood cell types may increase sensitivity of epigenetic analyses for epidemiological studies. PMID:24375655
Klawitter, Sabine; Fuchs, Nina V.; Upton, Kyle R.; Muñoz-Lopez, Martin; Shukla, Ruchi; Wang, Jichang; Garcia-Cañadas, Marta; Lopez-Ruiz, Cesar; Gerhardt, Daniel J.; Sebe, Attila; Grabundzija, Ivana; Merkert, Sylvia; Gerdes, Patricia; Pulgarin, J. Andres; Bock, Anja; Held, Ulrike; Witthuhn, Anett; Haase, Alexandra; Sarkadi, Balázs; Löwer, Johannes; Wolvetang, Ernst J.; Martin, Ulrich; Ivics, Zoltán; Izsvák, Zsuzsanna; Garcia-Perez, Jose L.; Faulkner, Geoffrey J.; Schumann, Gerald G.
2016-01-01
Human induced pluripotent stem cells (hiPSCs) are capable of unlimited proliferation and can differentiate in vitro to generate derivatives of the three primary germ layers. Genetic and epigenetic abnormalities have been reported by Wissing and colleagues to occur during hiPSC derivation, including mobilization of engineered LINE-1 (L1) retrotransposons. However, incidence and functional impact of endogenous retrotransposition in hiPSCs are yet to be established. Here we apply retrotransposon capture sequencing to eight hiPSC lines and three human embryonic stem cell (hESC) lines, revealing endogenous L1, Alu and SINE-VNTR-Alu (SVA) mobilization during reprogramming and pluripotent stem cell cultivation. Surprisingly, 4/7 de novo L1 insertions are full length and 6/11 retrotransposition events occurred in protein-coding genes expressed in pluripotent stem cells. We further demonstrate that an intronic L1 insertion in the CADPS2 gene is acquired during hiPSC cultivation and disrupts CADPS2 expression. These experiments elucidate endogenous retrotransposition, and its potential consequences, in hiPSCs and hESCs. PMID:26743714
A mobile threat to genome stability: The impact of non-LTR retrotransposons upon the human genome
Konkel, Miriam K.; Batzer, Mark A.
2010-01-01
It is now commonly agreed that the human genome is not the stable entity originally presumed. Deletions, duplications, inversions, and insertions are common, and contribute significantly to genomic structural variations (SVs). Their collective impact generates much of the inter-individual genomic diversity observed among humans. Not only do these variations change the structure of the genome; they may also have functional implications, e.g. altered gene expression. Some SVs have been identified as the cause of genetic disorders, including cancer predisposition. Cancer cells are notorious for their genomic instability, and often show genomic rearrangements at the microscopic and submicroscopic level to which transposable elements (TEs) contribute. Here, we review the role of TEs in genome instability, with particular focus on non-LTR retrotransposons. Currently, three non-LTR retrotransposon families – long interspersed element 1 (L1), SVA (short interspersed element (SINE-R), variable number of tandem repeats (VNTR), and Alu), and Alu (a SINE) elements – mobilize in the human genome, and cause genomic instability through both insertion- and post-insertion-based mutagenesis. Due to the abundance and high sequence identity of TEs, they frequently mislead the homologous recombination repair pathway into non-allelic homologous recombination, causing deletions, duplications, and inversions. While less comprehensively studied, non-LTR retrotransposon insertions and TE-mediated rearrangements are probably more common in cancer cells than in healthy tissue. This may be at least partially attributed to the commonly seen global hypomethylation as well as general epigenetic dysfunction of cancer cells. Where possible, we provide examples that impact cancer predisposition and/or development. PMID:20307669
Kang, Seung-Hui; Park, Chan Hee; Jeung, Hei Cheul; Kim, Ki-Yeol; Rha, Sun Young; Chung, Hyun Cheol
2007-06-01
In array-CGH, various factors may act as variables influencing the result of experiments. Among them, Cot-1 DNA, which has been used as a repetitive sequence-blocking agent, may become an artifact-inducing factor in BAC array-CGH. To identify the effect of Cot-1 DNA on Microarray-CGH experiments, Cot-1 DNA was labeled directly and Microarray-CGH experiments were performed. The results confirmed that probes which hybridized more completely with Cot-1 DNA had a higher sequence similarity to the Alu element. Further, in the sex-mismatched Microarray-CGH experiments, the variation and intensity in the fluorescent signal were reduced in the high intensity probe group in which probes were better hybridized with Cot-1 DNA. Otherwise, those of the low intensity probe group showed no alterations regardless of Cot-1 DNA. These results confirmed by in silico methods that Cot-1 DNA could block repetitive sequences in gDNA and probes. In addition, it was confirmed biologically that the blocking effect of Cot-1 DNA could be presented via its repetitive sequences, especially Alu elements. Thus, in contrast to BAC-array CGH, the use of Cot-1 DNA is advantageous in controlling experimental variation in Microarray-CGH.
Geoffroy, Véronique; Stoetzel, Corinne; Scheidecker, Sophie; Schaefer, Elise; Perrault, Isabelle; Bär, Séverine; Kröll, Ariane; Delbarre, Marion; Antin, Manuela; Leuvrey, Anne-Sophie; Henry, Charline; Blanché, Hélène; Decker, Eva; Kloth, Katja; Klaus, Günter; Mache, Christoph; Martin-Coignard, Dominique; McGinn, Steven; Boland, Anne; Deleuze, Jean-François; Friant, Sylvie; Saunier, Sophie; Rozet, Jean-Michel; Bergmann, Carsten; Dollfus, Hélène; Muller, Jean
2018-04-24
Ciliopathies represent a wide spectrum of rare diseases with overlapping phenotypes and a high genetic heterogeneity. Among those, IFT140 is implicated in a variety of phenotypes ranging from isolated retinitis pigmentosa to more syndromic cases. Using whole-genome sequencing in patients with uncharacterized ciliopathies, we identified a novel recurrent tandem duplication of exons 27-30 (6.7 kb) in IFT140, c.3454-488_4182+2588dup p.(Tyr1152_Thr1394dup), missed by whole-exome sequencing. Pathogenicity of the mutation was assessed on the patients' skin fibroblasts. Several hundred patients with a ciliopathy phenotype were screened and biallelic mutations were identified in 11 families representing 12 pathogenic variants, of which seven are novel. Among those unrelated families, especially those with Mainzer-Saldino syndrome, eight carried the same tandem duplication (two in the homozygous state and six in the heterozygous state). In conclusion, we demonstrated the implication of structural variations in IFT140-related diseases, expanding its mutation spectrum. We also provide evidence for a unique genomic event mediated by an Alu-Alu recombination occurring on a shared haplotype. We confirm that whole-genome sequencing can be instrumental in the ability to detect structural variants for genomic disorders. © 2018 Wiley Periodicals, Inc.
Alu-mediated deletion of SOX10 regulatory elements in Waardenburg syndrome type 4
Bondurand, Nadége; Fouquet, Virginie; Baral, Viviane; Lecerf, Laure; Loundon, Natalie; Goossens, Michel; Duriez, Benedicte; Labrune, Philippe; Pingault, Veronique
2012-01-01
Waardenburg syndrome type 4 (WS4) is a rare neural crest disorder defined by the combination of Waardenburg syndrome (sensorineural hearing loss and pigmentation defects) and Hirschsprung disease (intestinal aganglionosis). Three genes are known to be involved in this syndrome, that is, EDN3 (endothelin-3), EDNRB (endothelin receptor type B), and SOX10. However, 15–35% of WS4 remains unexplained at the molecular level, suggesting that other genes could be involved and/or that mutations within known genes may have escaped previous screenings. Here, we searched for deletions within recently identified SOX10 regulatory sequences and describe the first characterization of a WS4 patient presenting with a large deletion encompassing three of these enhancers. Analysis of the breakpoint region suggests a complex rearrangement involving three Alu sequences that could be mediated by a FosTes/MMBIR replication mechanism. Taken together with recent reports, our results demonstrate that the disruption of highly conserved non-coding elements located within or at a long distance from the coding sequences of key genes can result in several neurocristopathies. This opens up new routes to the molecular dissection of neural crest disorders. PMID:22378281
Saeliw, Thanit; Tangsuwansri, Chayanin; Thongkorn, Surangrat; Chonchaiya, Weerasak; Suphapeetiporn, Kanya; Mutirangura, Apiwat; Tencomnao, Tewin; Hu, Valerie W; Sarachana, Tewarit
2018-01-01
Alu elements are a group of repetitive elements that can influence gene expression through CpG residues and transcription factor binding. Altered gene expression and methylation profiles have been reported in various tissues and cell lines from individuals with autism spectrum disorder (ASD). However, the role of Alu elements in ASD remains unclear. We thus investigated whether Alu elements are associated with altered gene expression profiles in ASD. We obtained five blood-based gene expression profiles from the Gene Expression Omnibus database and human Alu-inserted gene lists from the TranspoGene database. Differentially expressed genes (DEGs) in ASD were identified from each study and overlapped with the human Alu-inserted genes. The biological functions and networks of Alu-inserted DEGs were then predicted by Ingenuity Pathway Analysis (IPA). A combined bisulfite restriction analysis of lymphoblastoid cell lines (LCLs) derived from 36 ASD and 20 sex- and age-matched unaffected individuals was performed to assess the global DNA methylation levels within Alu elements, and the Alu expression levels were determined by quantitative RT-PCR. In ASD blood or blood-derived cells, 320 Alu-inserted genes were reproducibly differentially expressed. Biological function and pathway analysis showed that these genes were significantly associated with neurodevelopmental disorders and neurological functions involved in ASD etiology. Interestingly, estrogen receptor and androgen signaling pathways implicated in the sex bias of ASD, as well as IL-6 signaling and neuroinflammation signaling pathways, were also highlighted. Alu methylation was not significantly different between the ASD and sex- and age-matched control groups. However, significantly altered Alu methylation patterns were observed in ASD cases sub-grouped based on Autism Diagnostic Interview-Revised scores compared with matched controls. 
Quantitative RT-PCR analysis of Alu expression also showed significant differences between ASD subgroups. Interestingly, Alu expression was correlated with methylation status in one phenotypic ASD subgroup. Alu methylation and expression were altered in LCLs from ASD subgroups. Our findings highlight the association of Alu elements with gene dysregulation in ASD blood samples and warrant further investigation. Moreover, the classification of ASD individuals into subgroups based on phenotypes may be beneficial and could provide insights into the still unknown etiology and the underlying mechanisms of ASD.
NASA Astrophysics Data System (ADS)
Oztekin, Halit; Temurtas, Feyzullah; Gulbag, Ali
The Arithmetic and Logic Unit (ALU) design is one of the important topics in the Computer Architecture and Organization course in Computer and Electrical Engineering departments. Some existing ALU designs used as educational tools are non-modular in nature. As programmable logic technology has developed rapidly, it has become feasible to implement an ALU design based on a Field Programmable Gate Array (FPGA) in this course. In this paper, we adopt a modular approach to FPGA-based ALU design. All the modules in the ALU design are realized as schematics on Altera's Cyclone II Development board. Under this model, the ALU is divided into four distinct modules: an arithmetic unit (excluding multiplication and division), a logic unit, a multiplication unit, and a division unit. Because the approach is modular, the user can easily design an ALU of any size. This approach was then applied to the microcomputer architecture design named BZK.SAU.FPGA10.0 in place of its current ALU.
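The four-module decomposition described in this abstract can be illustrated in software. The following is only a minimal sketch in Python, not the authors' schematic design: the opcode names, the `make_alu` helper, and the choice to truncate results to the word width are all assumptions made for illustration.

```python
def make_alu(width=8):
    """Build a toy ALU of the given bit width from four independent modules.

    Hypothetical illustration only: opcode names and wrap-around behaviour
    are assumptions, not taken from the BZK.SAU.FPGA10.0 design.
    """
    mask = (1 << width) - 1  # keeps every result inside `width` bits

    # Module 1: arithmetic unit (no multiplication or division)
    arithmetic = {
        "add": lambda a, b: (a + b) & mask,
        "sub": lambda a, b: (a - b) & mask,
    }
    # Module 2: logic unit
    logic = {
        "and": lambda a, b: a & b,
        "or": lambda a, b: a | b,
        "xor": lambda a, b: a ^ b,
    }
    # Module 3: multiplication unit (low-order bits of the product)
    multiplication = {"mul": lambda a, b: (a * b) & mask}
    # Module 4: division unit (integer quotient; b == 0 raises ZeroDivisionError)
    division = {"div": lambda a, b: (a // b) & mask}

    modules = (arithmetic, logic, multiplication, division)

    def alu(op, a, b):
        """Dispatch `op` to whichever module implements it."""
        a, b = a & mask, b & mask
        for module in modules:
            if op in module:
                return module[op](a, b)
        raise ValueError(f"unknown opcode: {op}")

    return alu


alu8 = make_alu(8)
alu16 = make_alu(16)
print(alu8("add", 250, 10), alu16("add", 250, 10))
```

Changing only the `width` argument yields an ALU of a different size, which is the software analogue of the "any size" property the abstract attributes to the modular approach.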
Assembly of ordered contigs of cosmids selected with YACs of human chromosome 13
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fischer, S.G.; Cayanis, E.; Boukhgalter, B.
1994-06-01
The authors have developed an efficient method for assembling ordered cosmid contigs aligned to mega-YACs and midi-YACs (average insert sizes of 1.0 and 0.35 Mb, respectively) and used this general method to initiate high-resolution physical mapping of human chromosome 13 (Chr 13). Chr 13-enriched midi-YAC (mYAC) and mega-YAC (MYAC) sublibraries were obtained from corresponding CEPH total human YAC libraries by selecting colonies with inter-Alu PCR probes derived from Chr 13 monochromosomal cell hybrid DNA. These sublibraries were arrayed on filters at high density. In this approach, the MYAC 13 sublibrary is screened by hybridization with cytogenetically assigned Chr 13 DNA probes to select one or a small subset of MYACs. Inter-Alu PCR products from each mYAC are then hybridized to the MYAC and mYAC sublibraries to identify overlapping YACs and to an arrayed Chr 13-specific cosmid library to select corresponding cosmids. The set of selected cosmids, gridded on filters at high density, is hybridized with inter-Alu PCR products from each of the overlapping YACs to identify subsets of cosmids and also with riboprobes from each cosmid of the arrayed set ("cosmid matrix cross-hybridization"). From these data, cosmid contigs are assembled by a specifically designed computer program. Application of this method generates cosmid contigs spanning the length of a MYAC with few gaps. To provide a high-resolution map, ends of cosmids are sequenced at preselected sites to position densely spaced sequence-tagged sites. 33 refs., 7 figs., 1 tab.
A mobile threat to genome stability: The impact of non-LTR retrotransposons upon the human genome.
Konkel, Miriam K; Batzer, Mark A
2010-08-01
It is now commonly agreed that the human genome is not the stable entity originally presumed. Deletions, duplications, inversions, and insertions are common, and contribute significantly to genomic structural variations (SVs). Their collective impact generates much of the inter-individual genomic diversity observed among humans. Not only do these variations change the structure of the genome; they may also have functional implications, e.g. altered gene expression. Some SVs have been identified as the cause of genetic disorders, including cancer predisposition. Cancer cells are notorious for their genomic instability, and often show genomic rearrangements at the microscopic and submicroscopic level to which transposable elements (TEs) contribute. Here, we review the role of TEs in genome instability, with particular focus on non-LTR retrotransposons. Currently, three non-LTR retrotransposon families - long interspersed element 1 (L1), SVA (short interspersed element (SINE-R), variable number of tandem repeats (VNTR), and Alu), and Alu (a SINE) elements - mobilize in the human genome, and cause genomic instability through both insertion- and post-insertion-based mutagenesis. Due to the abundance and high sequence identity of TEs, they frequently mislead the homologous recombination repair pathway into non-allelic homologous recombination, causing deletions, duplications, and inversions. While less comprehensively studied, non-LTR retrotransposon insertions and TE-mediated rearrangements are probably more common in cancer cells than in healthy tissue. This may be at least partially attributed to the commonly seen global hypomethylation as well as general epigenetic dysfunction of cancer cells. Where possible, we provide examples that impact cancer predisposition and/or development. Copyright © 2010 Elsevier Ltd. All rights reserved.
Weber, Andreas Daniel; Pontiggia, Luca; Biedermann, Thomas; Schiestl, Clemens; Meuli, Martin; Reichmann, Ernst
2011-03-01
Definitive and high-quality coverage of large and, in particular, massive skin defects remains a significant challenge in burn as well as plastic and reconstructive surgery because of donor site shortage. A novel and promising approach to overcome these problems is tissue engineering of skin. Clearly, before eventual clinical application, engineered skin substitutes of human origin must be grafted and then evaluated in animal models. For the various tests to be conducted it is indispensable to be able to identify human cells as such in culture and also to distinguish between graft and recipient tissue after transplantation. Here we describe a tool to identify human cells in vitro and in vivo. In situ hybridization allows for the detection and localization of specific DNA or RNA sequences in morphologically preserved cells in culture or tissue sections, respectively. We used digoxigenin-labeled DNA probes corresponding to human-specific Alu repeats in order to identify human keratinocytes grown in culture together with rat cells, and also to label split and full thickness skin grafts of human origin after transplantation on immuno-incompetent rats. Digoxigenin-labeled DNA probing resulted in an intensive nuclear staining of human cells, both in culture and after transplantation onto recipient animals, while recipient animal cells (rat cells) did not stain. In situ hybridization using primate-specific Alu probes reliably allows distinguishing between cells of human and non-human origin both in culture as well as in histological sections. This method is an essential tool for those preclinical experiments (performed on non-primate animals) that must be conducted before novel tissue engineered skin substitutes might be introduced into clinical practice.
Pseudomonas specific 16S rDNA PCR amplification and multiple enzyme restriction fragment length polymorphism (MERFLP) analysis using a single digestion mixture of Alu I, Hinf I, Rsa I, and Tru 9I distinguished 150 published sequences and reference strains of authentic Pseudomonas...
2011-01-01
Background: Sequence variants in genes functioning in folate-mediated one-carbon metabolism are hypothesized to lead to changes in levels of homocysteine and DNA methylation, which, in turn, are associated with risk of cardiovascular disease. Methods: 330 SNPs in 52 genes were studied in relation to plasma homocysteine and global genomic DNA methylation. SNPs were selected based on functional effects and gene coverage, and assays were completed on the Illumina GoldenGate platform. Age-, smoking-, and nutrient-adjusted genotype-phenotype associations were estimated in regression models. Results: Using a nominal P ≤ 0.005 threshold for statistical significance, 20 SNPs were associated with plasma homocysteine, 8 with Alu methylation, and 1 with LINE-1 methylation. Using a more stringent false discovery rate threshold, SNPs in FTCD, SLC19A1, and SLC19A3 genes remained associated with plasma homocysteine. Gene by vitamin B-6 interactions were identified for both Alu and LINE-1 methylation, and epistatic interactions with the MTHFR rs1801133 SNP were identified for the plasma homocysteine phenotype. Pleiotropy involving the MTHFD1L and SARDH genes for both plasma homocysteine and Alu methylation phenotypes was identified. Conclusions: No single gene was associated with all three phenotypes, and the set of the most statistically significant SNPs predictive of homocysteine or Alu or LINE-1 methylation was unique to each phenotype. Genetic variation in folate-mediated one-carbon metabolism, other than the well-known effects of the MTHFR c.665C>T (known as c.677 C>T, rs1801133, p.Ala222Val), is predictive of cardiovascular disease biomarkers. PMID:22103680
Human Alu insertion polymorphisms in North African populations.
Cherni, Loth; Frigi, Sabeh; Ennafaa, Hajer; Mtiraoui, Nabil; Mahjoub, Touhami; Benammar-Elgaaied, Amel
2011-10-01
Several features make Alu insertions a powerful tool in population genetic studies: the polymorphic nature of many Alu insertions, the stability of an Alu insertion event and, furthermore, the fact that the ancestral state is known to be the absence of the Alu element at a particular locus, so that the presence of an Alu insertion at the site represents the forward mutational change. This study analyses seven Alu insertion polymorphisms in a sample of 297 individuals from the autochthonous population of Tunisia (Thala, Smar, Zarzis, and Bou Salem) and Libya with the aim of studying their genetic structure with respect to the populations of North Africa and Western, Eastern and Central Europe. The comparative analyses carried out using the MDS and AMOVA methods reveal the existence of spatial heterogeneity, and identify four population groups. Study populations (Libya, Smar, Zarzis, and Bou Salem) are closest to North African populations whereas Thala is isolated and is closest to Western European populations. In conclusion, the results of the present study support the important role that migratory movements have played in the North African gene pool, at least since the Neolithic period.
Effects of rare sugar d-allulose on heat-induced gelation of surimi prepared from marine fish.
Ogawa, Masahiro; Inoue, Masaki; Hayakawa, Shigeru; O'Charoen, Siwaporn; Ogawa, Makiko
2017-11-01
d-Allulose (Alu), the C3-epimer of d-fructose, is a non-caloric sweetener (0.39 kcal g⁻¹) with a suppressive effect on postprandial blood glucose elevation. The aim of this study was to investigate the effects of Alu used as a sweetener and gel improver instead of sucrose on heat-induced gelation of surimi. The puncture test of a heat-induced surimi gel showed that with 50 g kg⁻¹ Alu the gel had 15% and 6% higher gel strength than the corresponding gel with sucrose (Suc) and with sorbitol (Sor), respectively. In addition, Alu-gel had 26% and 25% higher water-holding capacity (WHC) than Suc- and Sor-gel. Heating of myofibrillar protein with Alu, unlike Suc and Sor, facilitated the formation of both disulfide and non-disulfide crosslinks that might be associated with the mechanical properties and WHC of Alu-gel. Alu improves the mechanical properties and WHC of the heat-induced surimi gel. Furthermore, Alu is low in calories compared with Suc (4.0 kcal g⁻¹) and Sor (3.0 kcal g⁻¹). Thus Alu will be an alternative to Suc or Sor for developing surimi-based products with health benefits. © 2017 Society of Chemical Industry.
Oshima, Junko; Lee, Jennifer A; Breman, Amy M; Fernandes, Priscilla H; Babovic-Vuksanovic, Dusica; Ward, Patricia A; Wolfe, Lynne A; Eng, Christine M; Del Gaudio, Daniela
2011-07-01
Mucopolysaccharidosis type II (MPS II) is caused by mutations in the IDS gene, which encodes the lysosomal enzyme iduronate-2-sulfatase. In ∼20% of MPS II patients the disorder is caused by gross IDS structural rearrangements. We identified two male cases harboring complex rearrangements involving the IDS gene and the nearby pseudogene, IDSP1, which has been annotated as a low-copy repeat (LCR). In both cases the rearrangement included a partial deletion of IDS and an inverted insertion of the neighboring region. In silico analyses revealed the presence of repetitive elements as well as LCRs at the junctions of rearrangements. Our models illustrate two alternative consequences of rearrangements initiated by non-allelic homologous recombination of LCRs: resolution by a second recombination event (that is, Alu-mediated recombination), or resolution by non-homologous end joining repair. These complex rearrangements have the potential to be recurrent and may be present among those MPS II cases with previously uncharacterized aberrations involving IDS.
Chromosomal Inversions between Human and Chimpanzee Lineages Caused by Retrotransposons
Lee, Jungnam; Han, Kyudong; Meyer, Thomas J.; Kim, Heui-Soo; Batzer, Mark A.
2008-01-01
The long interspersed element-1 (LINE-1 or L1) and Alu elements are the most abundant mobile elements comprising 21% and 11% of the human genome, respectively. Since the divergence of human and chimpanzee lineages, these elements have vigorously created chromosomal rearrangements causing genomic differences between humans and chimpanzees by either increasing or decreasing the size of the genome. Here, we report an exotic mechanism, retrotransposon recombination-mediated inversion (RRMI), that usually does not alter the amount of genomic material present. Through the comparison of the human and chimpanzee draft genome sequences, we identified 252 inversions whose respective inversion junctions can clearly be characterized. Our results suggest that L1 and Alu elements cause chromosomal inversions by either forming a secondary structure or providing a fragile site for double-strand breaks. The detailed analysis of the inversion breakpoints showed that L1 and Alu elements are responsible for at least 44% of the 252 inversion loci between human and chimpanzee lineages, including 49 RRMI loci. Among them, three RRMI loci inverted exonic regions in known genes, which implicates this mechanism in generating the genomic and phenotypic differences between human and chimpanzee lineages. This study is the first comprehensive analysis of mobile element-based inversion breakpoints between human and chimpanzee lineages, and highlights their role in primate genome evolution. PMID:19112500
Virts, Elizabeth L; Jankowska, Anna; Mackay, Craig; Glaas, Marcel F; Wiek, Constanze; Kelich, Stephanie L; Lottmann, Nadine; Kennedy, Felicia M; Marchal, Christophe; Lehnert, Erik; Scharf, Rüdiger E; Dufour, Carlo; Lanciotti, Marina; Farruggia, Piero; Santoro, Alessandra; Savasan, Süreyya; Scheckenbach, Kathrin; Schipper, Jörg; Wagenmann, Martin; Lewis, Todd; Leffak, Michael; Farlow, Janice L; Foroud, Tatiana M; Honisch, Ellen; Niederacher, Dieter; Chakraborty, Sujata C; Vance, Gail H; Pruss, Dmitry; Timms, Kirsten M; Lanchbury, Jerry S; Alpi, Arno F; Hanenberg, Helmut
2015-09-15
Fanconi anemia (FA) is a rare inherited disorder clinically characterized by congenital malformations, progressive bone marrow failure and cancer susceptibility. At the cellular level, FA is associated with hypersensitivity to DNA-crosslinking genotoxins. Eight of 17 known FA genes assemble the FA E3 ligase complex, which catalyzes monoubiquitination of FANCD2 and is essential for replicative DNA crosslink repair. Here, we identify the first FA patient with biallelic germline mutations in the ubiquitin E2 conjugase UBE2T. Both mutations were AluY-mediated: a paternal deletion and maternal duplication of exons 2-6. These loss-of-function mutations in UBE2T induced a cellular phenotype similar to biallelic defects in early FA genes with the absence of FANCD2 monoubiquitination. The maternal duplication produced a mutant mRNA that could encode a functional protein but was degraded by nonsense-mediated mRNA decay. In the patient's hematopoietic stem cells, the maternal allele with the duplication of exons 2-6 spontaneously reverted to a wild-type allele by monoallelic recombination at the duplicated AluY repeat, thereby preventing bone marrow failure. Analysis of germline DNA of 814 normal individuals and 850 breast cancer patients for deletion or duplication of UBE2T exons 2-6 identified the deletion in only two controls, suggesting AluY-mediated recombinations within the UBE2T locus are rare and not associated with an increased breast cancer risk. Finally, a loss-of-function germline mutation in UBE2T was detected in a high-risk breast cancer patient with wild-type BRCA1/2. Cumulatively, we identified UBE2T as a bona fide FA gene (FANCT) that also may be a rare cancer susceptibility gene. © The Author 2015. Published by Oxford University Press.
Tajaddod, Mansoureh; Tanzer, Andrea; Licht, Konstantin; Wolfinger, Michael T; Badelt, Stefan; Huber, Florian; Pusch, Oliver; Schopoff, Sandy; Janisiw, Michael; Hofacker, Ivo; Jantsch, Michael F
2016-10-25
Short interspersed elements (SINEs) represent the most abundant group of non-long-terminal repeat transposable elements in mammalian genomes. In primates, Alu elements are the most prominent and homogenous representatives of SINEs. Due to their frequent insertion within or close to coding regions, SINEs have been suggested to play a crucial role during genome evolution. Moreover, Alu elements within mRNAs have also been reported to control gene expression at different levels. Here, we undertake a genome-wide analysis of insertion patterns of human Alus within transcribed portions of the genome. Multiple, nearby insertions of SINEs within one transcript are more abundant in tandem orientation than in inverted orientation. Indeed, analysis of transcriptome-wide expression levels of 15 ENCODE cell lines suggests a cis-repressive effect of inverted Alu elements on gene expression. Using reporter assays, we show that the negative effect of inverted SINEs on gene expression is independent of known sensors of double-stranded RNAs. Instead, transcriptional elongation seems impaired, leading to reduced mRNA levels. Our study suggests that there is a bias against multiple SINE insertions that can promote intramolecular base pairing within a transcript. Moreover, at a genome-wide level, mRNAs harboring inverted SINEs are less expressed than mRNAs harboring single or tandemly arranged SINEs. Finally, we demonstrate a novel mechanism by which inverted SINEs can impact on gene expression by interfering with RNA polymerase II.
Shintani, Tomoya; Sakoguchi, Hirofumi; Yoshihara, Akihide; Izumori, Ken; Sato, Masashi
2017-12-02
Dietary restriction (DR) is an effective intervention known to increase lifespan in a wide variety of organisms. DR also delays the onset of aging-associated diseases. DR mimetics, compounds that can mimic the effects of DR, have been intensively explored. d-Allulose (d-Alu), the C3-epimer of d-fructose, is a rare sugar that has various health benefits, including anti-hyperglycemia and anti-obesity effects. Here, we report that d-Alu increased the lifespan of Caenorhabditis elegans both under monoxenic and axenic culture conditions. d-Alu did not further extend the lifespan of the long-lived DR model eat-2 mutant, strongly indicating that the effect is related to DR. However, d-Alu did not reduce the food intake of wild-type C. elegans. To explore the mechanisms of the d-Alu longevity effect, we examined the lifespan of d-Alu-treated mutants deficient for nutrient sensing pathway-related genes daf-16, sir-2.1, aak-2, and skn-1. As a result, d-Alu increased the lifespan of the daf-16, sir-2.1, and skn-1 mutants, but not the aak-2 mutant, indicating that the lifespan extension was dependent on the energy sensor, AMP-activated protein kinase (AMPK). d-Alu also enhanced the mRNA expression and enzyme activities of superoxide dismutase (SOD) and catalase. From these findings, we conclude that d-Alu extends lifespan by increasing oxidative stress resistance through a DR mechanism, making it a candidate DR mimetic. Copyright © 2017 Elsevier Inc. All rights reserved.
NASA Technical Reports Server (NTRS)
Hendry, David F. (Inventor)
1993-01-01
In a data system having a memory, plural input/output (I/O) devices, and a bus connecting each of the I/O devices to the memory, a direct memory access (DMA) controller regulates access of each of the I/O devices to the bus. The controller includes a priority register storing priorities of bus access requests from the I/O devices, an interrupt register storing bus access requests of the I/O devices, a resolver for selecting one of the I/O devices to have access to the bus, a pointer register storing addresses of locations in the memory for communication with the one I/O device via the bus, a sequence register storing the address of the memory location containing the channel program instruction to be executed next, and an ALU for incrementing and decrementing addresses stored in the pointer register, computing the next address to be stored in the sequence register, and computing the initial contents of each of the registers. The memory contains a sequence of channel program instructions defining a set-up operation, in which the contents of each of the registers is initialized in accordance with the initial contents computed by the ALU, and an access operation, in which data is transferred on the bus between the memory location whose address is currently stored in the pointer register and the one I/O device enabled by the resolver.
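The interplay of the priority register, interrupt register, resolver, pointer register, and ALU can be sketched as a small software model. This is an illustrative Python sketch under stated assumptions, not the patented circuit: the class and method names are hypothetical, a higher number is assumed to mean higher priority, ties are assumed to go to the lowest device id, and the sequence register and channel program are omitted.

```python
from dataclasses import dataclass, field


@dataclass
class DmaController:
    """Toy model of the registers named in the abstract (names are hypothetical)."""

    priority: dict                               # priority register: device id -> priority
    pending: set = field(default_factory=set)    # interrupt register: devices requesting the bus
    pointer: dict = field(default_factory=dict)  # pointer register: device id -> memory address

    def request(self, device):
        """Record a bus access request from an I/O device."""
        self.pending.add(device)

    def resolve(self):
        """Resolver: pick the pending device with the highest priority.

        Assumption: larger number wins; ties go to the lowest device id.
        """
        if not self.pending:
            return None
        return max(self.pending, key=lambda d: (self.priority[d], -d))

    def transfer_word(self, memory, word=None):
        """One access operation for the device chosen by the resolver.

        If `word` is given it is written to memory (device -> memory);
        otherwise the word at the pointer address is read (memory -> device).
        """
        device = self.resolve()
        if device is None:
            return None
        addr = self.pointer[device]
        if word is not None:
            memory[addr] = word
        value = memory[addr]
        self.pointer[device] = addr + 1  # the ALU's role: step the pointer
        self.pending.discard(device)     # request has been serviced
        return device, addr, value


memory = [0] * 8
dma = DmaController(priority={0: 1, 1: 5}, pointer={0: 0, 1: 4})
dma.request(0)
dma.request(1)
print(dma.transfer_word(memory, word=99))  # device 1 wins the bus first
```

In this model the set-up operation corresponds to constructing the controller with initial register contents, and each call to `transfer_word` corresponds to one access operation.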
Signal recognition particle RNA in dinoflagellates and the Perkinsid Perkinsus marinus.
Zhang, Huan; Campbell, David A; Sturm, Nancy R; Rosenblad, Magnus A; Dungan, Christopher F; Lin, Senjie
2013-09-01
In dinoflagellates and perkinsids, the molecular structure of the protein translocating machinery is unclear. Here, we identified several types of full-length signal recognition particle (SRP) RNA genes from Karenia brevis (dinoflagellate) and Perkinsus marinus (perkinsid). We also identified the four SRP S-domain proteins, but not the two Alu domain proteins, from P. marinus and several dinoflagellates. We mapped both ends of SRP RNA transcripts from K. brevis and P. marinus, and obtained the 3' end from four other dinoflagellates. The lengths of SRP RNA are predicted to be ∼260-300 nt in dinoflagellates and 280-285 nt in P. marinus. Although these SRP RNA sequences are substantially variable, the predicted structures are similar. The genomic organization of the SRP RNA gene differs among species. In K. brevis, this gene is located downstream of the spliced leader (SL) RNA, either as SL RNA-SRP RNA-tRNA gene tandem repeats, or within a SL RNA-SRP RNA-tRNA-U6-5S rRNA gene cluster. In other dinoflagellates, SRP RNA does not cluster with SL RNA or 5S rRNA genes. The majority of P. marinus SRP RNA genes array as tandem repeats without the above-mentioned small RNA genes. Our results capture a snapshot of a potentially complex evolutionary history of SRP RNA in alveolates. Copyright © 2013 Elsevier GmbH. All rights reserved.
Chang, S C; Macêdo, D P C; Souza-Motta, C M; Oliveira, N T
2013-08-12
Fusarium verticillioides is a pathogen of agriculturally important crops, especially maize. It is considered one of the most important pathogens responsible for fumonisin contamination of food products, which causes severe, chronic, and acute intoxication in humans and animals. Moreover, it is recognized as a cause of localized infections in immunocompetent patients and disseminated infections among severely immunosuppressed patients. Several molecular tools have been used to analyze the intraspecific variability of fungi. The objective of this study was to use molecular markers to compare pathogenic isolates of F. verticillioides and isolates of the same species obtained from clinical samples of patients with Fusarium mycoses. The molecular markers that we used were inter-simple sequence repeat markers (primers GTG5 and GACA4), intron splice site primer (primer EI1), random amplified polymorphic DNA marker (primer OPW-6), and restriction fragment length polymorphism-internal transcribed spacer (ITS) from rDNA. From the data obtained, clusters were generated based on the UPGMA clustering method. The amplification products obtained using primers ITS4 and ITS5 and loci ITS1-5.8S-ITS2 of the rDNA yielded fragments of approximately 600 bp for all the isolates. Digestion of the ITS region fragment using restriction enzymes such as EcoRI, DraI, BshI, AluI, HaeIII, HinfI, MspI, and PstI did not permit differentiation among pathogenic and clinical isolates. The inter-simple sequence repeat, intron splice site primer, and random amplified polymorphic DNA markers presented high genetic homogeneity among clinical isolates in contrast to the high variability found among the phytopathogenic isolates of F. verticillioides.
Onnby, Linda; Svensson, Christian; Mbundi, Lubinda; Busquets, Rosa; Cundy, Andrew; Kirsebom, Harald
2014-03-01
The generation and development of effective adsorption materials for arsenic removal are urgently needed due to acute arsenic contamination of water sources in many regions around the world. In the search for these new adsorbents, the application of nanomaterials or nanocomposites, and especially the use of nanoparticles (NPs), has proven increasingly attractive. While the adsorptive performance of a range of nanocomposite and nanomaterial-based systems has been extensively reviewed in previously-published literature, the stability of these systems in terms of NP release, i.e. the ability of the nanomaterial or nanocomposite to retain incorporated NPs, is less well understood. Here we examine the performance of nanocomposites comprised of aluminium oxide nanoparticles (AluNPs) incorporated in macroporous polyacrylamide-based cryogels (n-Alu-cryo, where n indicates the percentage of AluNPs in the polymer material (n=0-6%, w/v)) for As(V) adsorption, and evaluate AluNP leakage before and after the use of these materials. A range of techniques is utilised and assessed (SEM, TEM, mass weight change, PIXE and in vitro toxicity studies). The 4-Alu-cryo nanocomposite was shown to be optimal for minimising AluNP losses while maximising As(V) removal. From the same nanocomposite we were further able to show that NP losses were not detectable at the AluNP concentrations used in the study. Toxicity tests revealed that no cytotoxic effects could be observed. The cryogel-AluNPs composites were not only effective in As(V) removal but also in immobilising the AluNPs. More challenging flow-through conditions for the evaluation of NP leakage could be included as a next step in a continued study assessing particle loss and subsequent toxicity. Copyright © 2013 Elsevier B.V. All rights reserved.
De Spiegelaere, Ward; Malatinkova, Eva; Lynch, Lindsay; Van Nieuwerburgh, Filip; Messiaen, Peter; O'Doherty, Una; Vandekerckhove, Linos
2014-06-01
Quantification of integrated proviral HIV DNA by repetitive-sampling Alu-HIV PCR is a candidate virological tool to monitor the HIV reservoir in patients. However, the experimental procedures and data analysis of the assay are complex and hinder its widespread use. Here, we provide an improved and simplified data analysis method by adopting binomial and Poisson statistics. A modified analysis method on the basis of Poisson statistics was used to analyze the binomial data of positive and negative reactions from a 42-replicate Alu-HIV PCR by use of dilutions of an integration standard and on samples of 57 HIV-infected patients. Results were compared with the quantitative output of the previously described Alu-HIV PCR method. Poisson-based quantification of the Alu-HIV PCR was linearly correlated with the standard dilution series, indicating that absolute quantification with the Poisson method is a valid alternative for data analysis of repetitive-sampling Alu-HIV PCR data. Quantitative outputs of patient samples assessed by the Poisson method correlated with the previously described Alu-HIV PCR analysis, indicating that this method is a valid alternative for quantifying integrated HIV DNA. Poisson-based analysis of the Alu-HIV PCR data enables absolute quantification without the need of a standard dilution curve. Implementation of the CI estimation permits improved qualitative analysis of the data and provides a statistical basis for the required minimal number of technical replicates. © 2014 The American Association for Clinical Chemistry.
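The Poisson idea behind this kind of replicate-based quantification can be illustrated with the standard zero-class estimator: if template copies are Poisson-distributed across replicate reactions, the fraction of negative reactions estimates exp(-λ), so λ = -ln(negatives / replicates). The Python sketch below is an assumption-laden illustration, not the authors' published procedure: the function names and the `cells_per_reaction` parameter are hypothetical, and no correction for detection efficiency or confidence-interval estimation is included.

```python
import math


def mean_copies_per_reaction(n_positive, n_replicates):
    """Estimate the mean detectable template copies per reaction.

    Uses the Poisson zero class: P(negative) = exp(-lam),
    so lam = -ln(n_negative / n_replicates).
    """
    if not 0 <= n_positive <= n_replicates:
        raise ValueError("n_positive must lie between 0 and n_replicates")
    n_negative = n_replicates - n_positive
    if n_negative == 0:
        # With no negative reactions the estimate is unbounded.
        raise ValueError("all replicates positive: dilute the sample and repeat")
    return -math.log(n_negative / n_replicates)


def copies_per_million_cells(n_positive, n_replicates, cells_per_reaction):
    """Scale the per-reaction estimate to copies per 10^6 cells.

    `cells_per_reaction` is a hypothetical input: the number of cell
    equivalents of DNA loaded into each replicate.
    """
    lam = mean_copies_per_reaction(n_positive, n_replicates)
    return lam / cells_per_reaction * 1_000_000


# Example: 21 of 42 replicates positive, 10,000 cell equivalents per reaction.
print(round(copies_per_million_cells(21, 42, 10_000), 2))
```

Because the estimate depends only on the count of positive and negative reactions, it needs no standard dilution curve, which is the property the abstract highlights.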
Costa, Elísio; Duque, Frederico; Oliveira, Jorge; Garcia, Paula; Gonçalves, Isabel; Diogo, Luísa; Santos, Rosário
2007-01-01
Shwachman-Diamond syndrome (SDS) is caused by mutations in the SBDS gene, most of which are the result of gene conversion events involving its highly homologous pseudogene SBDSP. Here we describe the molecular characterization of the first documented gross deletion in the SBDS gene, in a 4-year-old Portuguese girl with SDS. The clinical diagnosis was based on the presence of hematological symptoms (severe anemia and cyclic neutropenia), pancreatic exocrine insufficiency and skeletal abnormalities. Routine molecular screening revealed heterozygosity for the common splicing mutation c.258+2T>C, and a further step-wise approach led to the detection of a large deletion encompassing exon 3, the endpoints of which were subsequently delineated at the gDNA level. This novel mutation (c.258+374_459+250del), predictably giving rise to an internally deleted polypeptide (p.Ile87_Gln153del), appears to have arisen from an excision event mediated by AluSx elements which are present in introns 2 and 3. Our case illustrates the importance of including gross deletion screening in the SDS diagnostic setting, especially in cases where only one deleterious mutation is detected by routine screening methods. In particular, deletional rearrangements involving exon 3 should be considered, since Alu sequences are known to be an important cause of recurrent mutations.
Effects of Temperature and Relative Humidity on DNA Methylation
Bind, Marie-Abele; Zanobetti, Antonella; Gasparrini, Antonio; Peters, Annette; Coull, Brent; Baccarelli, Andrea; Tarantini, Letizia; Koutrakis, Petros; Vokonas, Pantel; Schwartz, Joel
2014-01-01
Background: Previous studies have found relationships between DNA methylation and various environmental contaminant exposures. Associations with weather have not been examined. Because temperature and humidity are related to mortality even on non-extreme days, we hypothesized that temperature and relative humidity may affect methylation. Methods: We repeatedly measured methylation on long interspersed nuclear elements (LINE-1), Alu, and 9 candidate genes in blood samples from 777 elderly men participating in the Normative Aging Study (1999–2009). We assessed whether ambient temperature and relative humidity are related to methylation on LINE-1 and Alu, as well as on genes controlling coagulation, inflammation, cortisol, DNA repair, and metabolic pathway. We examined intermediate-term associations of temperature, relative humidity, and their interaction with methylation, using distributed lag models. Results: Temperature or relative humidity levels were associated with methylation on tissue factor (F3), intercellular adhesion molecule 1 (ICAM-1), toll-like receptor 2 (TLR-2), carnitine O-acetyltransferase (CRAT), interferon gamma (IFN-γ), inducible nitric oxide synthase (iNOS), and glucocorticoid receptor, LINE-1, and Alu. For instance, a 5°C increase in 3-week average temperature was associated with a 9% increase in ICAM-1 methylation (95% confidence interval: 3% to 15%), whereas a 10% increase in 3-week average relative humidity was associated with a 5% decrease (−8% to −1%). The relative humidity association with ICAM-1 methylation was stronger on hot days than mild days. Conclusions: DNA methylation in blood cells may reflect biological effects of temperature and relative humidity. Temperature and relative humidity may also interact to produce stronger effects. PMID:24809956
Effects of temperature and relative humidity on DNA methylation.
Bind, Marie-Abele; Zanobetti, Antonella; Gasparrini, Antonio; Peters, Annette; Coull, Brent; Baccarelli, Andrea; Tarantini, Letizia; Koutrakis, Petros; Vokonas, Pantel; Schwartz, Joel
2014-07-01
Previous studies have found relationships between DNA methylation and various environmental contaminant exposures. Associations with weather have not been examined. Because temperature and humidity are related to mortality even on non-extreme days, we hypothesized that temperature and relative humidity may affect methylation. We repeatedly measured methylation on long interspersed nuclear elements (LINE-1), Alu, and 9 candidate genes in blood samples from 777 elderly men participating in the Normative Aging Study (1999-2009). We assessed whether ambient temperature and relative humidity are related to methylation on LINE-1 and Alu, as well as on genes controlling coagulation, inflammation, cortisol, DNA repair, and metabolic pathways. We examined intermediate-term associations of temperature, relative humidity, and their interaction with methylation, using distributed lag models. Temperature or relative humidity levels were associated with methylation on tissue factor (F3), intercellular adhesion molecule 1 (ICAM-1), toll-like receptor 2 (TLR-2), carnitine O-acetyltransferase (CRAT), interferon gamma (IFN-γ), inducible nitric oxide synthase (iNOS), glucocorticoid receptor, LINE-1, and Alu. For instance, a 5°C increase in 3-week average temperature was associated with a 9% increase in ICAM-1 methylation (95% confidence interval: 3% to 15%), whereas a 10% increase in 3-week average relative humidity was associated with a 5% decrease (-8% to -1%). The relative humidity association with ICAM-1 methylation was stronger on hot days than on mild days. DNA methylation in blood cells may reflect biological effects of temperature and relative humidity. Temperature and relative humidity may also interact to produce stronger effects.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schriner, J.E.; Yi, W.; Hofmann, S.L.
Palmitoyl-protein thioesterase (PPT) is a small glycoprotein that removes palmitate groups from cysteine residues in lipid-modified proteins. We recently reported mutations in PPT in patients with infantile neuronal ceroid lipofuscinosis (INCL), a severe neurodegenerative disorder. INCL is characterized by the accumulation of proteolipid storage material in brain and other tissues, suggesting that the disease is a consequence of abnormal catabolism of acylated proteins. In the current paper, we report the sequence of the human PPT cDNA and the structure of the human PPT gene. The cDNA predicts a protein of 306 amino acids that contains a 25-amino-acid signal peptide, three N-linked glycosylation sites, and consensus motifs characteristic of thioesterases. Northern analysis of a human tissue blot revealed ubiquitous expression of a single 2.5-kb mRNA, with highest expression in lung, brain, and heart. The human PPT gene spans 25 kb and is composed of seven coding exons and a large eighth exon, containing the entire 3′-untranslated region of 1388 bp. An Alu repeat and promoter elements corresponding to putative binding sites for several general transcription factors were identified in the 1060 nucleotides upstream of the transcription start site. The human PPT cDNA sequence and gene structure will provide the means for the identification of further causative mutations in INCL and facilitate genetic screening in selected high-risk populations. 31 refs., 5 figs., 1 tab.
ERIC Educational Resources Information Center
Elwess, Nancy L.; Duprey, Stephen L.; Harney, Lindesay A.; Langman, Jessie E.; Marino, Tara C.; Martinez, Carolina; McKeon, Lauren L.; Moss, Chantel I. E.; Myrie, Sasha S.; Taylor, Luke Ryan
2008-01-01
"Alu"-insertion polymorphisms were used by an undergraduate Bioinformatics class to study how these insertion sites could be the basis for an investigation in human population genetics. Based on the students' investigation, both allele and genotype "Alu" frequencies were determined for African-American and Japanese populations as well as a…
Usmanova, N M; Kazakov, V I; Tomilin, N V
2008-01-01
Using computer-based methods, we determined the global distribution of short interspersed nuclear elements (SINEs) in the human and mouse X chromosomes. It has been shown that this distribution is similar to the distributions of CpG islands and genes but differs from the distribution of LINE1 elements. Since SINEs (human Alu and mouse B2) may have binding sites for the Polycomb protein YY1, we suggest that these repeats can serve as additional signals ("boosters") in Polycomb-dependent silencing of gene-rich segments during X inactivation.
Soundararajan, Ramani; Stearns, Timothy M.; Griswold, Anthony J.; Mehta, Arpit; Czachor, Alexander; Fukumoto, Jutaro; Lockey, Richard F.; King, Benjamin L.; Kolliputi, Narasaiah
2015-01-01
RNA editing is a post-transcriptional modification of RNA. The majority of these changes result from adenosine deaminases acting on RNA (ADARs) catalyzing the conversion of adenosine residues to inosine in double-stranded RNAs (dsRNAs). Massively parallel sequencing has enabled the identification of RNA editing sites in human transcriptomes. In this study, we sequenced DNA and RNA from human lungs and identified RNA editing sites with high confidence via a computational pipeline utilizing stringent analysis thresholds. We identified a total of 3,447 editing sites that overlapped in three human lung samples, with 50% of these sites having canonical A-to-G base changes. Approximately 27% of the edited sites overlapped with Alu repeats and showed A-to-G clustering (>3 clusters in 100 bp). The majority of edited sites mapped to either 3′ untranslated regions (UTRs) or introns close to splice sites, whereas only a few sites were in exons, resulting in non-synonymous amino acid changes. Interestingly, we identified 652 A-to-G editing events in the 3′ UTR of 205 target genes that mapped to 932 potential miRNA target binding sites. Several of these miRNA edited sites were validated in silico. Additionally, we validated several A-to-G edited sites by Sanger sequencing. Altogether, our study suggests a role for RNA editing in miRNA-mediated gene regulation and splicing in human lungs. In this study, we have generated an RNA editome of human lung tissue that can be compared with other RNA editomes across different lung tissues to delineate a role for RNA editing in normal and diseased states. PMID:26486088
Soundararajan, Ramani; Stearns, Timothy M; Griswold, Anthony L; Mehta, Arpit; Czachor, Alexander; Fukumoto, Jutaro; Lockey, Richard F; King, Benjamin L; Kolliputi, Narasaiah
2015-11-03
RNA editing is a post-transcriptional modification of RNA. The majority of these changes result from adenosine deaminases acting on RNA (ADARs) catalyzing the conversion of adenosine residues to inosine in double-stranded RNAs (dsRNAs). Massively parallel sequencing has enabled the identification of RNA editing sites in human transcriptomes. In this study, we sequenced DNA and RNA from human lungs and identified RNA editing sites with high confidence via a computational pipeline utilizing stringent analysis thresholds. We identified a total of 3,447 editing sites that overlapped in three human lung samples, with 50% of these sites having canonical A-to-G base changes. Approximately 27% of the edited sites overlapped with Alu repeats and showed A-to-G clustering (>3 clusters in 100 bp). The majority of edited sites mapped to either 3' untranslated regions (UTRs) or introns close to splice sites, whereas only a few sites were in exons, resulting in non-synonymous amino acid changes. Interestingly, we identified 652 A-to-G editing events in the 3' UTR of 205 target genes that mapped to 932 potential miRNA target binding sites. Several of these miRNA edited sites were validated in silico. Additionally, we validated several A-to-G edited sites by Sanger sequencing. Altogether, our study suggests a role for RNA editing in miRNA-mediated gene regulation and splicing in human lungs. In this study, we have generated an RNA editome of human lung tissue that can be compared with other RNA editomes across different lung tissues to delineate a role for RNA editing in normal and diseased states.
NASA Astrophysics Data System (ADS)
Pandey, Rajesh; Bhattacharya, Aniket; Bhardwaj, Vivek; Jha, Vineet; Mandal, Amit K.; Mukerji, Mitali
2016-09-01
Primate-specific Alus harbor different regulatory features, including miRNA targets. In this study, we provide evidence for miRNA-mediated modulation of transcript isoform levels during heat-shock response through exaptation of Alu-miRNA sites in mature mRNA. We performed genome-wide expression profiling coupled with functional validation of miRNA target sites within exonized Alus, and analyzed conservation of these targets across primates. We observed that two miRNAs (miR-15a-3p and miR-302d-3p) elevated in stress response target RAD1, GTSE1, NR2C1, FKBP9 and UBE2I exclusively within Alu. These genes map onto the p53 regulatory network. Ectopic overexpression of miR-15a-3p downregulates GTSE1 and RAD1 at the protein level and enhances cell survival. This Alu-mediated fine-tuning seems to be unique to humans, as is evident from the absence of orthologous sites in other primate lineages. We further analyzed signatures of selection on Alu-miRNA targets in the genome, using 1000 Genomes Phase-I data. We found that 198 out of 3177 Alu-exonized genes exhibit signatures of selection within Alu-miRNA sites, with 60 of them containing SNPs supported by multiple lines of evidence (global-FST > 0.3, pair-wise-FST > 0.5, Fay-Wu’s H < -20, iHS > 2.0, high ΔDAF) and implicated in the p53 network. We propose that by affecting multiple genes, Alu-miRNA interactions have the potential to facilitate population-level adaptations in response to environmental challenges.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Herzog, R.; Lutz, S.; Blin, N.
1991-02-05
Heparin cofactor II (HCII) is a 66-kDa plasma glycoprotein that inhibits thrombin rapidly in the presence of dermatan sulfate or heparin. Clones comprising the entire HCII gene were isolated from a human leukocyte genomic library in EMBL-3 λ phage. The sequence of the gene was determined on both strands of DNA (15,849 bp) and included 1,749 bp of 5′-flanking sequence, five exons, four introns, and 476 bp of DNA 3′ to the polyadenylation site. Ten complete and one partial Alu repeats were identified in the introns and 5′-flanking region. The HCII gene was regionally mapped on chromosome 22 using rodent-human somatic cell hybrids, carrying only parts of human chromosome 22, and the chronic myelogenous leukemia cell line K562. With the cDNA probe HCII7.2, containing the entire coding region of the gene, the HCII gene was shown to be amplified 10-20-fold in K562 cells by Southern analysis and in situ hybridization. From these data, the authors concluded that the HCII gene is localized on the chromosomal band 22q11 proximal to the breakpoint cluster region (BCR). Analysis by pulsed-field gel electrophoresis indicated that the amplified HCII gene in K562 cells maps at least 2 Mbp proximal to BCR-1. Furthermore, the HCII7.2 cDNA probe detected two frequent restriction fragment length polymorphisms with the restriction enzymes BamHI and HindIII.
Loke, Y J; Galati, J C; Saffery, R; Craig, J M
2015-04-01
In vitro fertilization (IVF) and its subset, intracytoplasmic sperm injection (ICSI), are widely used medical treatments for conception. There has been controversy over whether IVF is associated with adverse short- and long-term health outcomes of offspring. As with other prenatal factors, epigenetic change is thought to be a molecular mediator of any in utero programming effects. Most studies have focused on DNA methylation at the gene-specific and genomic levels, with only a few on associations between DNA methylation and IVF. Using buccal epithelium from 208 twin pairs from the Peri/Postnatal Epigenetic Twin Study (PETS), we investigated associations between IVF and DNA methylation on a global level, using the proxies of Alu and LINE-1 interspersed repeats in addition to two locus-specific regulatory regions within IGF2/H19, controlling for 13 potentially confounding factors. After correction for multiple testing, we found strong evidence that IVF-conceived twins have lower DNA methylation in Alu, and weak evidence of lower methylation in one of the two IGF2/H19 regulatory regions and LINE-1, compared with naturally conceived twins. Weak evidence of a relationship between ICSI and DNA methylation within an IGF2/H19 regulatory region was found, suggesting that one or more of the processes associated with IVF/ICSI may contribute to these methylation differences. Lower within- and between-pair DNA methylation variation was also found in IVF-conceived twins for LINE-1, Alu and one IGF2/H19 regulatory region. Although larger sample sizes are needed, our results provide additional insight into the possible influence of IVF and ICSI on DNA methylation. To our knowledge, this is the largest study to date investigating the association of IVF and DNA methylation.
[Analysis of ethnogeographic groups of Kazakhs based on nuclear genome DNA polymorphism].
Salimova, A Z; Kutuev, I A; Khusainova, R I; Akhmetova, V L; Sviatova, G S; Berezina, G M; Khusnutdinova, E K
2005-07-01
Eight nuclear DNA loci, including six Alu insertions (ACE, APOA1, PV92, TPA25, Ya5NBC27, and Ya5NBC148), 32-bp deletion in the CCR5 gene, and VNTR locus at the eNOS gene, were examined in three ethnogeographic groups of Kazakhs (342 individuals). The individuals examined lived in southeastern, central, and southwestern regions of Kazakhstan, and according to their tribal attribution, belonged to the Senior, Middle, and Junior Zhuzes. The Alu insertions appeared to be polymorphic in all populations examined: the insertion frequency varied from 0.264 in the populations of the Senior and Middle Zhuzes at the Ya5NBC27 and Ya5NBC148 loci, to 0.827 in Kazakhs of the Middle Zhuz at the APOA1 locus. In Kazakh groups examined only two alleles of the eNOS VNTR locus were detected with the number of repeats constituting four (A) and five (B) copies. The highest frequency of A allele was found in Kazakhs from the Junior Zhuz (0.113), while the highest frequency of B allele was detected in population of the Senior Zhuz (0.893). The frequency of the 32-bp deletion in the chemokine receptor CCR5 gene varied from 0.027 in the Junior Zhuz to 0.045 in the Senior Zhuz. Kazakhs showed high genetic diversity (Hex = 0.376). In general, in three ethnogeographic groups of Kazakhs, the coefficient of gene differentiation (G(ST)) over eight diallelic markers of nuclear genome constituted 1.1%. The differences in the Alu insertions made the highest contribution to the among-population diversity (G(ST) = 1.2%).
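The diversity statistics quoted in the abstract above can be illustrated with a minimal Python sketch. It assumes the textbook definitions for a diallelic locus (expected heterozygosity H = 2p(1 − p), and Nei's G_ST = (H_T − H_S) / H_T with equally weighted subpopulations and no sample-size correction); the abstract does not state which estimator the authors used. The function names are hypothetical, and only the two insertion frequencies 0.264 and 0.827 come from the abstract.

```python
def expected_heterozygosity(p: float) -> float:
    """Expected heterozygosity of a diallelic locus with insertion frequency p."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("allele frequency must lie in [0, 1]")
    return 2.0 * p * (1.0 - p)


def gst(subpop_freqs: list[float]) -> float:
    """Nei's G_ST for one diallelic locus.

    Assumption: subpopulations are weighted equally and no correction
    for sample size is applied.
    """
    n = len(subpop_freqs)
    # Mean within-subpopulation heterozygosity (H_S).
    h_s = sum(expected_heterozygosity(p) for p in subpop_freqs) / n
    # Heterozygosity of the pooled population (H_T).
    h_t = expected_heterozygosity(sum(subpop_freqs) / n)
    return 0.0 if h_t == 0.0 else (h_t - h_s) / h_t


if __name__ == "__main__":
    # The lowest and highest insertion frequencies reported in the abstract.
    for p in (0.264, 0.827):
        print(f"p = {p}: H = {expected_heterozygosity(p):.3f}")
```

Under these definitions, identical frequencies in all groups give G_ST = 0, and fixation of different alleles in different groups gives G_ST = 1, so a value near 1% corresponds to very weak differentiation among the groups.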
Ichiyanagi, Kenji
2013-01-01
Short interspersed elements (SINEs) are a class of retrotransposons, which amplify their copy numbers in their host genomes by retrotransposition. More than a million copies of SINEs are present in a mammalian genome, constituting over 10% of the total genomic sequence. In contrast to the other two classes of retrotransposons, long interspersed elements (LINEs) and long terminal repeat (LTR) elements, SINEs are transcribed by RNA polymerase III. However, like LINEs and LTR elements, SINE transcription is likely regulated by epigenetic mechanisms such as DNA methylation, at least for human Alu and mouse B1. Whereas SINEs and other transposable elements have long been thought of as selfish or junk DNA, recent studies have revealed that they play functional roles at their genomic locations, for example, as distal enhancers, chromatin boundaries and binding sites of many transcription factors. These activities imply that SINE retrotransposition has shaped the regulatory network and chromatin landscape of their hosts. Whereas it is thought that the epigenetic mechanisms originated as a host defense system against proliferation of parasitic elements, this review discusses the possibility that the same mechanisms are also used to regulate the SINE-derived functions.
Kralovicova, Jana; Moreno, Pedro M D; Cross, Nicholas C P; Pêgo, Ana Paula; Vorechovsky, Igor
2016-12-01
ATM (ataxia-telangiectasia, mutated) is an important cancer susceptibility gene that encodes a key apical kinase in the DNA damage response pathway. ATM mutations in the germ line result in ataxia-telangiectasia (A-T), a rare genetic syndrome associated with hypersensitivity to double-strand DNA breaks and predisposition to lymphoid malignancies. ATM expression is limited by a tightly regulated nonsense-mediated RNA decay (NMD) switch exon (termed NSE) located in intron 28. In this study, we identify antisense oligonucleotides that modulate NSE inclusion in mature transcripts by systematically targeting the entire 3.1-kb-long intron. Their identification was assisted by a segmental deletion analysis of transposed elements, revealing NSE repression upon removal of a distant antisense Alu and NSE activation upon elimination of a long terminal repeat transposon MER51A. Efficient NSE repression was achieved by delivering optimized splice-switching oligonucleotides to embryonic and lymphoblastoid cells using chitosan-based nanoparticles. Together, these results provide a basis for possible sequence-specific radiosensitization of cancer cells, highlight the power of intronic antisense oligonucleotides to modify gene expression, and demonstrate transposon-mediated regulation of NSEs.
FAST TRACK COMMUNICATION: Reversible arithmetic logic unit for quantum arithmetic
NASA Astrophysics Data System (ADS)
Kirkedal Thomsen, Michael; Glück, Robert; Axelsen, Holger Bock
2010-09-01
This communication presents the complete design of a reversible arithmetic logic unit (ALU) that can be part of a programmable reversible computing device such as a quantum computer. The presented ALU is garbage free and uses reversible updates to combine the standard reversible arithmetic and logical operations in one unit. Combined with a suitable control unit, the ALU permits the construction of an r-Turing complete computing device. The garbage-free ALU developed in this communication requires only 6n elementary reversible gates for five basic arithmetic-logical operations on two n-bit operands and does not use ancillae. This remarkable low resource consumption was achieved by generalizing the V-shape design first introduced for quantum ripple-carry adders and nesting multiple V-shapes in a novel integrated design. This communication shows that the realization of an efficient reversible ALU for a programmable computing device is possible and that the V-shape design is a very versatile approach to the design of quantum networks.
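As a small illustration of what "reversible updates" means in the abstract above, the following Python sketch checks that two standard reversible gates (CNOT and Toffoli) undo themselves, and that an in-place n-bit addition can be undone by the matching subtraction without storing any extra ("garbage") bits. This is a classical simulation under stated assumptions, not the V-shape circuit from the communication; all function names are hypothetical.

```python
from itertools import product


def cnot(a: int, b: int) -> tuple[int, int]:
    """Controlled-NOT: flips b when a is 1; applying it twice restores the input."""
    return a, a ^ b


def toffoli(a: int, b: int, c: int) -> tuple[int, int, int]:
    """Toffoli gate: flips c when both a and b are 1; also self-inverse."""
    return a, b, c ^ (a & b)


def add_in_place(a: int, b: int, n: int) -> tuple[int, int]:
    """Reversible update b <- (a + b) mod 2**n; a is kept, so nothing is lost."""
    return a, (a + b) % (1 << n)


def sub_in_place(a: int, b: int, n: int) -> tuple[int, int]:
    """Inverse update b <- (b - a) mod 2**n, which undoes add_in_place."""
    return a, (b - a) % (1 << n)


if __name__ == "__main__":
    # Every gate applied twice is the identity on all inputs.
    assert all(cnot(*cnot(a, b)) == (a, b) for a, b in product((0, 1), repeat=2))
    assert all(toffoli(*toffoli(*bits)) == bits for bits in product((0, 1), repeat=3))
    # The in-place adder is a bijection on pairs of 4-bit operands.
    assert all(
        sub_in_place(*add_in_place(a, b, 4), 4) == (a, b)
        for a in range(16)
        for b in range(16)
    )
    print("all reversibility checks passed")
```

Because one operand is preserved and the other is overwritten with the sum, the input pair can always be recovered from the output pair, which is the property that lets such a unit avoid garbage outputs.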
Exploring the Feasibility of a DNA Computer: Design of an ALU Using Sticker-Based DNA Model.
Sarkar, Mayukh; Ghosal, Prasun; Mohanty, Saraju P
2017-09-01
Since its inception, DNA computing has advanced to offer an extremely powerful, energy-efficient emerging technology for solving hard computational problems with its inherent massive parallelism and extremely high data density. This would be much more powerful and general purpose when combined with well-known algorithmic solutions that exist for conventional computing architectures, using a suitable ALU. Thus, a specifically designed DNA Arithmetic and Logic Unit (ALU) that can address operations suitable for both domains can narrow the gap between the two. An ALU must be able to perform all possible logic operations, including NOT, OR, AND, XOR, NOR, NAND, and XNOR; compare and shift operations; and integer and floating-point arithmetic operations (addition, subtraction, multiplication, and division). In this paper, the design of an ALU is proposed using the sticker-based DNA model, with an experimental feasibility analysis. The novelties of this paper are manifold. First, the integer arithmetic operations performed here are 2's complement arithmetic, and the floating-point operations follow the IEEE 754 floating-point format, closely resembling a conventional ALU. Also, the output of each operation can be reused for any subsequent operation, so any algorithm or program logic that users can think of can be implemented directly on the DNA computer without modification. Second, once the basic operations of the sticker model can be automated, the implementations proposed in this paper become highly suitable for designing a fully automated ALU. Third, the proposed approaches are easy to implement. Finally, these approaches can work on sufficiently large binary numbers.
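The integer arithmetic mentioned in the abstract above can be sketched in conventional software. The Python example below builds n-bit two's complement addition and subtraction from the same bitwise logic operations an ALU provides (XOR, AND, OR, NOT). It is an ordinary ripple-carry sketch, not the sticker-model DNA implementation from the paper; the function names and the 8-bit width are assumptions chosen for illustration.

```python
def to_twos_complement(value: int, bits: int) -> int:
    """Encode a signed integer as an unsigned bit pattern of the given width."""
    if not -(1 << (bits - 1)) <= value < (1 << (bits - 1)):
        raise ValueError(f"{value} does not fit in {bits} bits")
    return value & ((1 << bits) - 1)


def from_twos_complement(pattern: int, bits: int) -> int:
    """Decode an unsigned bit pattern back into a signed integer."""
    sign_bit = 1 << (bits - 1)
    return (pattern ^ sign_bit) - sign_bit


def ripple_add(a: int, b: int, bits: int) -> int:
    """Add two bit patterns with a chain of full adders; the final carry is dropped."""
    carry, result = 0, 0
    for i in range(bits):
        x, y = (a >> i) & 1, (b >> i) & 1
        result |= (x ^ y ^ carry) << i            # sum bit of the full adder
        carry = (x & y) | (carry & (x ^ y))       # carry out of the full adder
    return result


def ripple_sub(a: int, b: int, bits: int) -> int:
    """Compute a - b as a + (NOT b) + 1, the usual two's complement identity."""
    mask = (1 << bits) - 1
    negated_b = ripple_add(~b & mask, 1, bits)
    return ripple_add(a, negated_b, bits)


if __name__ == "__main__":
    width = 8  # illustrative word size
    total = ripple_add(to_twos_complement(5, width), to_twos_complement(-3, width), width)
    diff = ripple_sub(to_twos_complement(5, width), to_twos_complement(7, width), width)
    print(from_twos_complement(total, width), from_twos_complement(diff, width))
```

Because subtraction reduces to bitwise NOT followed by two additions, a single adder circuit serves both operations, which is one reason two's complement is the conventional choice for ALU integer arithmetic.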
Detailed Mapping of the Alu Volcano, Ethiopia
NASA Astrophysics Data System (ADS)
Agrain, Guillaume; Buso, Roxane; Carlier, Jean; van Wyk de Vries, Benjamin
2017-04-01
The Alu volcano in the Danakil Depression is interpreted as a forced-fold uplift, related to progressive intrusions of sills or similar tabular intrusions. Alu is in a very isolated and difficult-to-access area, but Google Earth provides high-resolution images that can be used for mapping the structure and volcanic features. We use the imagery to map in as much detail as possible all the morphological features of Alu, which we separate into primary volcanic features and secondary structural features. The mapping has been undertaken by a group of undergraduates, graduates and researchers. The group has checked and validated the interpretation of each feature mapped. The data set is available as a kmz and has been imported into QGIS. The detailed mapping reveals a complex history of multiple lava fields and fissure eruptions, some of which pre-date uplift, while others occurred during uplift and were subsequently deformed. Similarly, there are cross-cutting structures, and we are able to set up a chronology of events. This shows that uplift grew in an area which was already covered by lavas, that some lava has probably been erupted from Alu's flanks, while most eruptions have been from around the base of Alu. Early in the deformation, thrust faults developed on the lower flanks, similar to those described near the Grosmanaux uplift (van Wyk de Vries et al., 2014). These are cut by the larger faults and by minor fissures. The mapping provides an accessible way of preparing dedicated fieldwork for an eventual field expedition to Alu, while extracting the most from remote sensing data.
Callén, E; Tischkowitz, M D; Creus, A; Marcos, R; Bueren, J A; Casado, J A; Mathew, C G; Surrallés, J
2004-01-01
Fanconi anaemia is an autosomal recessive disease characterized by chromosome fragility, multiple congenital abnormalities, progressive bone marrow failure and a high predisposition to develop malignancies. Most Fanconi anaemia patients belong to complementation group FA-A due to mutations in the FANCA gene. This gene contains 43 exons along a 4.3-kb coding sequence with a very heterogeneous mutational spectrum that makes the mutation screening of FANCA a difficult task. In addition, as the FANCA gene is rich in Alu sequences, it was reported that Alu-mediated recombination led to large intragenic deletions that cannot be detected in the heterozygous state by conventional PCR, SSCP analysis, or DNA sequencing. To overcome this problem, a method based on quantitative fluorescent multiplex PCR was proposed to detect intragenic deletions in FANCA involving the most frequently deleted exons (exons 5, 11, 17, 21 and 31). Here we apply the proposed method to detect intragenic deletions in 25 Spanish FA-A patients previously assigned to complementation group FA-A by FANCA cDNA retroviral transduction. A total of eight heterozygous deletions involving from one to more than 26 exons were detected. Thus, one third of the patients carried a large intragenic deletion that would not have been detected by conventional methods. These results are in agreement with previously published data and indicate that large intragenic deletions are one of the most frequent mutations leading to Fanconi anaemia. Consequently, this technology should be applied in future studies on FANCA to improve the mutation detection rate. Copyright 2003 S. Karger AG, Basel
Mobile Interspersed Repeats Are Major Structural Variants in the Human Genome
Huang, Cheng Ran Lisa; Schneider, Anna M.; Lu, Yunqi; Niranjan, Tejasvi; Shen, Peilin; Robinson, Matoya A.; Steranka, Jared P.; Valle, David; Civin, Curt I.; Wang, Tao; Wheelan, Sarah J.; Ji, Hongkai; Boeke, Jef D.; Burns, Kathleen H.
2010-01-01
Summary Characterizing structural variants in the human genome is of great importance, but a genome-wide analysis to detect interspersed repeats has not been done. Thus, the degree to which mobile DNAs contribute to genetic diversity, heritable disease, and oncogenesis remains speculative. We perform transposon insertion profiling by microarray (TIP-chip) to map human L1(Ta) retrotransposons (LINE-1s) genome-wide. This identified numerous novel human L1(Ta) insertional polymorphisms with highly variant allelic frequencies. We also explored TIP-chip's usefulness to identify candidate alleles associated with different phenotypes in clinical cohorts. Our data suggest that the occurrence of new insertions is twice as high as previously estimated, and that these repeats are under-recognized as sources of human genomic and phenotypic diversity. We have just begun to probe the universe of human L1(Ta) polymorphisms, and as TIP-chip is applied to other insertions such as Alu SINEs, it will expand the catalog of genomic variants even further. PMID:20602999
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sharma, V.; Bonnycastle, L.; Poorkai, P.
1994-09-01
We have constructed a yeast artificial chromosome (YAC) contig of chromosome 14q24.3 which encompasses the chromosome 14 Alzheimer's disease locus (AD3). Determined by linkage analysis of early-onset Alzheimer's disease kindreds, this interval is bounded by the genetic markers D14S61-D14S63 and spans approximately 15 centimorgans. The contig consists of 29 markers and 74 YACs of which 57 are defined by one or more sequence tagged sites (STSs). The STS markers comprise 5 genes, 16 short tandem repeat polymorphisms and 8 cDNA clones. An additional number of genes, expressed sequence tags and cDNA fragments have been identified and localized to the contig by hybridization and sequence analysis of anonymous clones isolated by cDNA direct selection techniques. A minimal contig of about 15 YACs averaging 0.5-1.5 megabases in length will span this interval and is, at first approximation, in rough agreement with the genetic map. For two regions of the contig, our coverage has relied on L1/THE fingerprint and Alu-PCR hybridization data of YACs provided by CEPH/Genethon. We are currently developing sequence tagged sites from these to confirm the overlaps revealed by the fingerprint data. Among the genes which map to the contig are transforming growth factor beta 3, c-fos, and heat shock protein 2A (HSPA2). C-fos is not a candidate gene for AD3 based on the sequence analysis of affected and unaffected individuals. HSPA2 maps to the proximal edge of the contig and Calmodulin 1, a candidate gene from 4q24.3, maps outside of the region. The YAC contig is a framework physical map from which cosmid or P1 clone contigs can be constructed. As more genes and cDNAs are mapped, a highly resolved transcription map will emerge, a necessary step towards positionally cloning the AD3 gene.
Viral Impact in Autoimmune Diseases: Expanding the “X Chromosome–Nucleolus Nexus” Hypothesis
Brooks, Wesley H.
2017-01-01
Viruses are suspected of significant roles in autoimmune diseases but the mechanisms are unclear. We get some insight by considering demands a virus places on host cells. Viruses not only require production of their own proteins, RNA and/or DNA, but also production of additional cellular machinery, such as ribosomes, to handle the increased demands. Since the nucleolus is a major site of RNA processing and ribonucleoprotein assembly, nucleoli are targeted by viruses, directly when viral RNA and proteins enter the nucleolus and indirectly when viruses induce increased expression of cellular polyamine genes. Polyamines are at high levels in nucleoli to assist in RNA folding. The size and activity of nucleoli increase directly with increases in polyamines. Nucleolar expansion due to abnormal increases in polyamines could disrupt nearby chromatin, such as the inactive X chromosome, leading to expression of previously sequestered DNA. Sudden expression of a large concentration of Alu elements from the disrupted inactive X can compete with RNA transcripts containing intronic Alu sequences that normally maintain nucleolar structural integrity. Such disruption of nucleolar activity can lead to misfolded RNAs, misassembled ribonucleoprotein complexes, and fragmentation of the nucleolus. Many autoantigens in lupus are, at least transiently, components of the nucleolus. Considering these effects of viruses, the “X chromosome–nucleolus nexus” hypothesis, which proposed disruption of the inactive X by the nucleolus during stress, is now expanded here to propose subsequent disruption of the nucleolus by previously sequestered Alu elements, which can fragment the nucleolus, leading to generation of autoantigens. PMID:29234321
Effect of d-allulose on rheological properties of chicken breast sausage.
Hadipernata, M; Ogawa, M; Hayakawa, S
2016-09-01
d-Allulose (Alu), a rare sugar, was applied to chicken breast sausage as a sucrose (Suc) substitute. The ratio (w/w) of Alu to Suc in sugar that was added to the sausage batter was 0/1 (A0S1), 3/7 (A3S7), 7/3 (A7S3), and 1/0 (A1S0). The total amount of Suc used was 2.5% of the weight of minced chicken breast meat. Substituting Suc with Alu did not affect water content, cooking loss, breaking stress, breaking strain, and modulus of elasticity of chicken breast sausage, but a 100% substitution with Alu caused a 10% decrease in viscosity and a 31% decrease in expressible water. A significant difference appeared in the rheological properties of elasticity, viscosity, and water-holding capacity of chicken breast sausage frozen-stored (-20°C) for 90 d. Particularly, the modulus of elasticity for A1S0 chicken breast sausage was 19% higher than that of the control A0S1 chicken breast sausage, suggesting that Alu appreciably reduced the deterioration in elasticity that is caused by long-term frozen storage of sausage. The quality improvement of frozen-stored chicken breast sausage demonstrates the feasibility and benefits of the application of Alu to frozen foods. © 2016 Poultry Science Association Inc.
[The Russian gene pool: gene geography of Alu-insertions (ACE, APOA1, B65, PV92, TPA25)].
Solov'eva, D S; Balanovskaia, E V; Kuznetsova, M A; Vasinskaia, O A; Frolova, S A; Pocheshkhova, E A; Evseeva, I V; Boldyreva, M N; Balanovskiĭ, O P
2010-01-01
The analysis of five Alu insertion loci (ACE, APOA1, B65, PV92, TPA25) has been carried out for the first time in 10 Russian populations (1088 individuals), covering all parts of the historical area of the Russian ethnos. Depending on the locus, Russian populations exhibit similarity with their western neighbors (European populations) or with their eastern neighbors (populations of the Ural region). Considering the frequencies of the studied Alu insertions, the Russian gene pool exhibits low variation: the average difference between populations is d = 0.007, whereas for classical markers, mtDNA and the Y chromosome, the heterogeneity of the Russian gene pool is substantially higher (0.013, 0.033 and 0.142, respectively). Therefore, this set of five Alu insertions has lower variability at the intra-ethnic level. However, in inter-ethnic comparisons a clear pattern was obtained: 13 Eastern European ethnic groups formed three clusters in accordance with their historical and geographical position: East Slavic, Caucasian and South Ural clusters. The obtained data confirm the efficiency of using Alu insertions for studying the genetic differentiation and history of the gene pool of Eastern European populations.
Panning, B; Smiley, J R
1993-01-01
We found that transcription of endogenous human Alu elements by RNA polymerase III was strongly stimulated following infection of HeLa cells with adenovirus type 5, leading to the accumulation of high levels of Alu transcripts initiated from Alu polymerase III promoters. In contrast to previously reported cases of adenovirus-induced activation of polymerase III transcription, induction required the E1b 58-kDa protein and the products of E4 open reading frames 3 and 6 in addition to the 289-residue E1a protein. In addition, E1a function was not required at high multiplicities of infection, suggesting that E1a plays an indirect role in Alu activation. These results suggest previously unsuspected regulatory properties of the adenovirus E1b and E4 gene products and provide a novel approach to the study of the biology of the most abundant class of dispersed repetitive DNA in the human genome. PMID:7684492
Atomic interaction of the MEAM type for the study of intermetallics in the Al-U alloy
NASA Astrophysics Data System (ADS)
Pascuet, M. I.; Fernández, J. R.
2015-12-01
Interactions of the MEAM type are developed for both pure Al and Al-U alloys. The obtained Al interatomic potential assures its compatibility with the details of the framework presently adopted. The Al-U interaction fits various properties of the Al2U, Al3U and Al4U intermetallics. The potential verifies the stability of the intermetallic structures in a temperature range compatible with that observed in the phase diagram, and also takes into account the greater stability of these structures relative to others that are competitive in energy. The intermetallics are characterized by calculating elastic and thermal properties and point defect parameters. Molecular dynamics simulations show growth of the Al3U intermetallic at the Al/U interface, in agreement with experimental evidence.
Khan, Imran A; Jahan, Parveen; Hasan, Qurratulain; Rao, Pragna
2014-12-01
Gestational diabetes mellitus (GDM) is defined as glucose intolerance first recognized during pregnancy. Insertion/deletion (I/D) polymorphism of a 287 bp Alu repetitive sequence in intron 16 of the angiotensin-converting enzyme (ACE) gene has been widely investigated in Asian Indian populations with different ethnic origins. The present study examined the possible association between I/D polymorphism of the ACE gene and GDM in Asian Indian pregnant women. A total of 200 pregnant women (100 GDM and 100 non-GDM) were recruited in this study, and I/D polymorphism of a 287 bp Alu1 element inside intron 16 of the ACE gene was examined by polymerase chain reaction (PCR)-based gel electrophoresis. The distribution of the II, ID, and DD genotypes of the ACE gene differed between GDM and non-GDM subjects, and the frequency of the ID+DD vs. II genotype was significant (p=0.0002) in the GDM group. ACE gene polymorphism was associated with GDM in Asian Indian pregnant women. © The Author(s) 2013.
Gene Transfer and Molecular Cloning of the Human NGF Receptor
NASA Astrophysics Data System (ADS)
Chao, Moses V.; Bothwell, Mark A.; Ross, Alonzo H.; Koprowski, Hilary; Lanahan, Anthony A.; Buck, C. Randall; Sehgal, Amita
1986-04-01
Nerve growth factor (NGF) and its receptor are important in the development of cells derived from the neural crest. Mouse L cell transformants have been generated that stably express the human NGF receptor following gene transfer with total human DNA. Affinity cross-linking, metabolic labeling and immunoprecipitation, and equilibrium binding with 125I-labeled NGF revealed that this NGF receptor had the same size and binding characteristics as the receptor from human melanoma cells and rat PC12 cells. The sequences encoding the NGF receptor were molecularly cloned using the human Alu repetitive sequence as a probe. A cosmid clone that contained the human NGF receptor gene allowed efficient transfection and expression of the receptor.
Design of Improved Arithmetic Logic Unit in Quantum-Dot Cellular Automata
NASA Astrophysics Data System (ADS)
Heikalabad, Saeed Rasouli; Gadim, Mahya Rahimpour
2018-06-01
Quantum-dot cellular automata (QCA) can replace CMOS technology to overcome its limitations. An arithmetic logic unit (ALU) is a basic structure of any computing device. This paper presents an improved design of a single-bit arithmetic logic unit in quantum-dot cellular automata. The proposed ALU structure supports AND, OR, XOR and ADD operations. A unique 2:1 multiplexer, an ultra-efficient two-input XOR and a low-complexity full adder are used in the proposed structure. An extended design of this structure is also provided for a two-bit ALU. The proposed ALU structure is simulated in QCADesigner and the simulation results are evaluated. The evaluation shows that the proposed design performs best in terms of area, complexity and delay compared with previous designs.
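The logical behaviour of such a single-bit ALU can be sketched in software. The following is a minimal illustration only, not the authors' QCA layout: the opcode encoding (0 = AND, 1 = OR, 2 = XOR, 3 = ADD) and the function names are assumptions made for the example, and the selection tree of 2:1 multiplexers is just one plausible arrangement.

```python
def full_adder(a, b, cin):
    """One-bit full adder: returns (sum, carry_out)."""
    s = a ^ b ^ cin
    cout = (a & b) | (cin & (a ^ b))
    return s, cout


def mux2(sel, x0, x1):
    """2:1 multiplexer: passes x0 when sel is 0, x1 when sel is 1."""
    return x1 if sel else x0


def alu_1bit(a, b, cin, op):
    """Single-bit ALU. Assumed (hypothetical) opcode encoding:
    0 = AND, 1 = OR, 2 = XOR, 3 = ADD. Returns (result, carry_out)."""
    s, cout = full_adder(a, b, cin)
    s0, s1 = op & 1, (op >> 1) & 1
    low = mux2(s0, a & b, a | b)    # op 0 or 1
    high = mux2(s0, a ^ b, s)       # op 2 or 3
    result = mux2(s1, low, high)
    carry = cout if op == 3 else 0  # carry is only meaningful for ADD
    return result, carry
```

A two-bit version could be obtained by chaining two such units and feeding the carry of the first into the second, which is the usual ripple-carry arrangement.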
Design of Improved Arithmetic Logic Unit in Quantum-Dot Cellular Automata
NASA Astrophysics Data System (ADS)
Heikalabad, Saeed Rasouli; Gadim, Mahya Rahimpour
2018-03-01
Quantum-dot cellular automata (QCA) can replace CMOS technology to overcome its limitations. An arithmetic logic unit (ALU) is a basic structure of any computing device. This paper presents an improved design of a single-bit arithmetic logic unit in quantum-dot cellular automata. The proposed ALU structure supports AND, OR, XOR and ADD operations. A unique 2:1 multiplexer, an ultra-efficient two-input XOR and a low-complexity full adder are used in the proposed structure. An extended design of this structure is also provided for a two-bit ALU. The proposed ALU structure is simulated in QCADesigner and the simulation results are evaluated. The evaluation shows that the proposed design performs best in terms of area, complexity and delay compared with previous designs.
[Analysis of Alu-insertion polymorphism in three subethnic groups of Kalmyks].
Khusainova, R I; Balinova, N V; Kutuev, I A; Spitsina, N Kh; Akhmetova, V L; Valiev, R R; Spitsyn, V A; Khusnutdinova, E K
2009-03-01
Eight Alu insertions at the NBC27, TPA25, NBC148, NBC123, ACE, APOA1, NBC51, and PV92 loci were examined in three subethnic groups of Kalmyks (Torgouds, Derbets, and Buzava). In general, the pattern of allele frequencies in Kalmyks was consistent with that in Asian populations of the world, and was similar to the pattern of Alu insertion frequencies in Turkic populations of the Volga-Ural region and Central Asia. Pairwise comparisons of the three Kalmyk subpopulations with respect to the frequency distributions of the eight Alu insertions revealed differences between the groups examined. The coefficient of gene differentiation, F(st), was 1.37%, pointing to the common origin of the groups of interest, as well as to the uniformity of the gene pools of the subethnic groups of Kalmyks examined.
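For readers unfamiliar with the statistic, a differentiation coefficient of this kind can be sketched for a single biallelic insertion locus as F_ST = (H_T - H_S) / H_T, where H_S is the mean expected heterozygosity within subpopulations and H_T is the expected heterozygosity of the pooled population. The snippet below is a minimal sketch under stated assumptions: equal weighting of subpopulations, no sample-size correction, and a hypothetical function name. It is not necessarily the estimator the authors used.

```python
def fst_biallelic(insertion_freqs):
    """Simple F_ST for one biallelic locus.

    insertion_freqs: Alu insertion frequency in each subpopulation.
    Assumes equal weights and no sample-size correction (an assumption
    of this sketch, not taken from the abstract).
    """
    n = len(insertion_freqs)
    p_bar = sum(insertion_freqs) / n
    # Expected heterozygosity of the pooled population.
    h_total = 2 * p_bar * (1 - p_bar)
    # Mean expected heterozygosity within subpopulations.
    h_sub = sum(2 * p * (1 - p) for p in insertion_freqs) / n
    if h_total == 0:
        return 0.0  # locus is fixed everywhere; no differentiation defined
    return (h_total - h_sub) / h_total
```

A value near zero, such as the 1.37% reported above, corresponds to subpopulations whose insertion frequencies are nearly identical.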
Optimized 4-bit Quantum Reversible Arithmetic Logic Unit
NASA Astrophysics Data System (ADS)
Ayyoub, Slimani; Achour, Benslama
2017-08-01
Reversible logic has received great attention in recent years due to its ability to reduce power dissipation. The main purposes of designing reversible logic are to decrease the quantum cost, the depth of the circuits and the number of garbage outputs. The arithmetic logic unit (ALU) is an important part of the central processing unit (CPU), serving as the execution unit. This paper presents a complete design of a new reversible arithmetic logic unit (ALU) that can be part of a programmable reversible computing device such as a quantum computer. The proposed ALU is based on a reversible low-power control unit and a full adder with small performance parameters, named the double Peres gate. The presented ALU can produce the largest number (28) of arithmetic and logic functions and has the lowest quantum cost and delay compared with existing designs.
Servín-Villegas, Rosalía; Caamal-Chan, Maria Goretty; Chavez-Medina, Alicia; Loera-Muro, Abraham; Barraza, Aarón; Medina-Hernández, Diana; Holguín-Peña, Ramón Jaime
2018-04-11
Phytoplasma bacteria of the 16SrXIII group were identified in salivary glands from Homalodisca liturata collected in El Comitán on the Baja California peninsula in Mexico. We were able to positively identify 15 16S rRNA gene sequences with the corresponding signature sequence of 'Candidatus Phytoplasma' (CAAGAYBATKATGTKTAGCYGGDCT) and used in silico restriction fragment length polymorphism (RFLP) profiles (F value estimations) coupled with a phylogenetic analysis to confirm their relatedness to 'Candidatus Phytoplasma hispanicum', which in turn belongs to the 16SrXIII group. A restriction analysis was carried out with AluI and EcoRI to confirm that five of the sequences belong to subgroup D. The rest of the sequences did not exhibit any known RFLP profile related to a subgroup reported in the 16SrXIII group.
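An in silico restriction digest of the kind mentioned above can be sketched in a few lines. The example assumes a linear molecule read on the top strand, uses the standard recognition sites AG^CT for AluI (blunt cut) and G^AATTC for EcoRI, and returns fragment lengths only. The function and dictionary names are hypothetical, and this is an illustration rather than the software used in the study.

```python
# Recognition site and cut offset (top strand) for each enzyme.
SITES = {
    "AluI": ("AGCT", 2),     # AG^CT, blunt ends
    "EcoRI": ("GAATTC", 1),  # G^AATTC
}


def digest(seq, enzyme):
    """Return fragment lengths (5'->3' order) after cutting a linear
    sequence with a single enzyme. Sketch only: no ambiguity codes,
    no circular molecules, no methylation sensitivity."""
    site, cut = SITES[enzyme]
    seq = seq.upper()
    cuts = []
    start = 0
    while True:
        i = seq.find(site, start)
        if i == -1:
            break
        cuts.append(i + cut)
        start = i + 1
    bounds = [0] + cuts + [len(seq)]
    return [bounds[k + 1] - bounds[k] for k in range(len(bounds) - 1)]
```

Comparing the sorted fragment lengths from two sequences gives a crude RFLP profile comparison; published subgroup assignments rely on a larger enzyme panel and similarity coefficients.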
Łabno, Anna; Warkocki, Zbigniew; Kuliński, Tomasz; Krawczyk, Paweł Szczepan; Bijata, Krystian; Tomecki, Rafał; Dziembowski, Andrzej
2016-01-01
The exosome-independent exoribonuclease DIS3L2 is mutated in Perlman syndrome. Here, we used extensive global transcriptomic and targeted biochemical analyses to identify novel DIS3L2 substrates in human cells. We show that DIS3L2 regulates pol II transcripts, comprising selected canonical and histone-coding mRNAs, and a novel FTL_short RNA from the ferritin mRNA 5′ UTR. Importantly, DIS3L2 contributes to surveillance of maturing snRNAs during their cytoplasmic processing. Among pol III transcripts, DIS3L2 particularly targets vault and Y RNAs and an Alu-like element BC200 RNA, but not Alu repeats, which are removed by exosome-associated DIS3. Using 3′ RACE-Seq, we demonstrate that all novel DIS3L2 substrates are uridylated in vivo by TUT4/TUT7 poly(U) polymerases. Uridylation-dependent DIS3L2-mediated decay can be recapitulated in vitro, thus reinforcing the tight cooperation between DIS3L2 and TUTases. Together these results indicate that catalytically inactive DIS3L2, characteristic of Perlman syndrome, can lead to deregulation of its target RNAs to disturb transcriptome homeostasis. PMID:27431325
Placental Hypomethylation Is More Pronounced in Genomic Loci Devoid of Retroelements
Chatterjee, Aniruddha; Macaulay, Erin C.; Rodger, Euan J.; Stockwell, Peter A.; Parry, Matthew F.; Roberts, Hester E.; Slatter, Tania L.; Hung, Noelyn A.; Devenish, Celia J.; Morison, Ian M.
2016-01-01
The human placenta is hypomethylated compared to somatic tissues. However, the degree and specificity of placental hypomethylation across the genome is unclear. We assessed genome-wide methylation of the human placenta and compared it to that of the neutrophil, a representative homogeneous somatic cell. We observed global hypomethylation in placenta (relative reduction of 22%) compared to neutrophils. Placental hypomethylation was pronounced in intergenic regions and gene bodies, while the unmethylated state of the promoter remained conserved in both tissues. For every class of repeat elements, the placenta showed lower methylation but the degree of hypomethylation differed substantially between these classes. However, some retroelements, especially the evolutionarily younger Alu elements, retained high levels of placental methylation. Surprisingly, nonretrotransposon-containing sequences showed a greater degree of placental hypomethylation than retrotransposons in every genomic element (intergenic, introns, and exons) except promoters. The differentially methylated fragments (DMFs) in placenta and neutrophils were enriched in gene-poor and CpG-poor regions. The placentally hypomethylated DMFs were enriched in genomic regions that are usually inactive, whereas hypermethylated DMFs were enriched in active regions. Hypomethylation of the human placenta is not specific to retroelements, indicating that the evolutionary advantages of placental hypomethylation go beyond those provided by expression of retrotransposons and retrogenes. PMID:27172225
Zannis-Hadjopoulos, M; Kaufmann, G; Wang, S S; Lechner, R L; Karawya, E; Hesse, J; Martin, R G
1985-07-01
Twelve clones of monkey DNA obtained by a procedure that enriches 10(3)- to 10(4)-fold for nascent sequences activated early in S phase (G. Kaufmann, M. Zannis-Hadjopoulos, and R. G. Martin, Mol. Cell. Biol. 5:721-727, 1985) have been examined. Only 2 of the 12 ors sequences (origin-enriched sequences) are unique (ors1 and ors8). Three contain the highly reiterated Alu family (ors3, ors9, and ors11). One contains the highly reiterated alpha-satellite family (ors12), but none contain the Kpn family. Those remaining contain middle repetitive sequences. Two examples of the same middle repetitive sequence were found (ors2 and ors6). Three of the middle repetitive sequences (the ors2-ors6 pair, ors5, and ors10) are moderately dispersed; one (ors4) is highly dispersed. The last, ors7, has been mapped to the bona fide replication origin of the D loop of mitochondrial DNA. Of the nine ors sequences tested, half possess snapback (intrachain reannealing) properties.
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene.
Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil
2007-11-29
Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication; all three copies of the gene are transcriptionally active, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein-coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single-copy genes, with duplicate genes free from such constraints.
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene
Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil
2007-01-01
Background Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Results Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication; all three copies of the gene are transcriptionally active, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein-coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. Conclusion The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single-copy genes, with duplicate genes free from such constraints. PMID:18047649
Hamilton, Natasha A; Tammen, Imke; Raadsma, Herman W
2013-01-01
Angiotensin converting enzyme (ACE) is essential for control of blood pressure. The human ACE gene contains an intronic Alu indel (I/D) polymorphism that has been associated with variation in serum enzyme levels, although the functional mechanism has not been identified. The polymorphism has also been associated with cardiovascular disease, type II diabetes, renal disease and elite athleticism. We have characterized the ACE gene in horses of breeds selected for differing physical abilities. The equine gene has a similar structure to that of all known mammalian ACE genes. Nine common single nucleotide polymorphisms (SNPs) discovered in pooled DNA were found to be inherited in nine haplotypes. Three of these SNPs were located in intron 16, homologous to that containing the Alu polymorphism in the human. A highly conserved 18 bp sequence, also within that intron, was identified as being a potential binding site for the transcription factors Oct-1, HFH-1 and HNF-3β, and lies within a larger area of higher than normal homology. This putative regulatory element may contribute to regulation of the documented inter-individual variation in human circulating enzyme levels, for which a functional mechanism is yet to be defined. Two equine SNPs occurred within the conserved area in intron 16, although neither of them disrupted the putative binding site. We propose a possible regulatory mechanism of the ACE gene in mammalian species which was previously unknown. This advance will allow further analysis leading to a better understanding of the mechanisms underpinning the associations seen between the human Alu polymorphism and enzyme levels, cardiovascular disease states and elite athleticism.
Hamilton, Natasha A.; Tammen, Imke; Raadsma, Herman W.
2013-01-01
Angiotensin converting enzyme (ACE) is essential for control of blood pressure. The human ACE gene contains an intronic Alu indel (I/D) polymorphism that has been associated with variation in serum enzyme levels, although the functional mechanism has not been identified. The polymorphism has also been associated with cardiovascular disease, type II diabetes, renal disease and elite athleticism. We have characterized the ACE gene in horses of breeds selected for differing physical abilities. The equine gene has a similar structure to that of all known mammalian ACE genes. Nine common single nucleotide polymorphisms (SNPs) discovered in pooled DNA were found to be inherited in nine haplotypes. Three of these SNPs were located in intron 16, homologous to that containing the Alu polymorphism in the human. A highly conserved 18 bp sequence, also within that intron, was identified as being a potential binding site for the transcription factors Oct-1, HFH-1 and HNF-3β, and lies within a larger area of higher than normal homology. This putative regulatory element may contribute to regulation of the documented inter-individual variation in human circulating enzyme levels, for which a functional mechanism is yet to be defined. Two equine SNPs occurred within the conserved area in intron 16, although neither of them disrupted the putative binding site. We propose a possible regulatory mechanism of the ACE gene in mammalian species which was previously unknown. This advance will allow further analysis leading to a better understanding of the mechanisms underpinning the associations seen between the human Alu polymorphism and enzyme levels, cardiovascular disease states and elite athleticism. PMID:23408978
Alegría-Torres, Jorge Alejandro; Carrizales-Yánez, Leticia; Díaz-Barriga, Fernando; Rosso-Camacho, Fernando; Motta, Valeria; Tarantini, Letizia; Bollati, Valentina
2016-12-01
Arsenic is a carcinogen and epimutagen that threatens the health of exposed populations worldwide. In this study, we examined the methylation status of Alu and long interspersed nucleotide elements (LINE-1) and their association with levels of urinary arsenic in 84 Mexican children between 6 and 12 years old from two historic mining areas in the State of San Luis Potosí, Mexico. Urinary arsenic levels were determined by atomic absorption spectrophotometry and DNA methylation analysis was performed in peripheral blood leukocytes by bisulfite-pyrosequencing. The geometric mean of urinary arsenic was 26.44 µg/g Cr (range 1.93-139.35). No significant differences in urinary arsenic or methylation patterns due to gender were observed. A positive correlation was found between urinary arsenic and the mean percentage of methylated cytosines in Alu sequences (Spearman correlation coefficient r = 0.532, P < 0.001), and a trend of LINE-1 hypomethylation was also observed (Spearman correlation coefficient r = -0.232, P = 0.038) after adjustment for sex and age. A linear regression model showed an association with log-normalized urinary arsenic for Alu (β = 1.05, 95% CI: 0.67; 1.43, P < 0.001) and LINE-1 (β = -0.703, 95% CI: -1.36; -0.38, P = 0.038). Despite the low-level arsenic exposure, a subtle epigenetic imbalance measured as DNA methylation was detected in the leukocytes of Mexican children living in two historic mining areas. Environ. Mol. Mutagen. 57:717-723, 2016. © 2016 Wiley Periodicals, Inc.
Alu elements in primates are preferentially lost from areas of high GC content
Brookfield, John FY
2013-01-01
The currently-accepted dogma when analysing human Alu transposable elements is that ‘young’ Alu elements are found in low GC regions and ‘old’ Alus in high GC regions. The correlation between high GC regions and high gene frequency regions make this observation particularly difficult to explain. Although a number of studies have tackled the problem, no analysis has definitively explained the reason for this trend. These observations have been made by relying on the subfamily as a proxy for age of an element. In this study, we suggest that this is a misleading assumption and instead analyse the relationship between the taxonomic distribution of an individual element and its surrounding GC environment. An analysis of 103906 Alu elements across 6 human chromosomes was carried out, using the presence of orthologous Alu elements in other primate species as a proxy for age. We show that the previously-reported effect of GC content correlating with subfamily age is not reflected by the ages of the individual elements. Instead, elements are preferentially lost from areas of high GC content over time. The correlation between GC content and subfamily may be due to a change in insertion bias in the young subfamilies. The link between Alu subfamily age and GC region was made due to an over-simplification of the data and is incorrect. We suggest that use of subfamilies as a proxy for age is inappropriate and that the analysis of ortholog presence in other primate species provides a deeper insight into the data. PMID:23717800
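The "surrounding GC environment" of an element can be made concrete with a small sketch. The code below computes the GC fraction of the sequence flanking an element on both sides. The 1000 bp default window, the coordinate convention (0-based, end-exclusive) and the function names are assumptions for illustration; the abstract does not specify how the study defined the flanking region.

```python
def gc_content(seq):
    """Fraction of G/C among unambiguous bases (A, C, G, T).
    Returns 0.0 if the sequence has no unambiguous bases."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


def flanking_gc(chrom_seq, start, end, flank=1000):
    """GC content of the regions flanking an element at [start, end).
    The element itself is excluded so that its own composition does
    not bias the estimate. The window size is a hypothetical choice."""
    left = chrom_seq[max(0, start - flank):start]
    right = chrom_seq[end:end + flank]
    return gc_content(left + right)
```

Grouping such values by whether an orthologous element is present in other primates, rather than by subfamily, mirrors the comparison the abstract describes.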
A novel sodium bicarbonate cotransporter-like gene in an ancient duplicated region: SLC4A9 at 5q31
Lipovich, Leonard; Lynch, Eric D; Lee, Ming K; King, Mary-Claire
2001-01-01
Background: Sodium bicarbonate cotransporter (NBC) genes encode proteins that execute coupled Na+ and HCO3- transport across epithelial cell membranes. We report the discovery, characterization, and genomic context of a novel human NBC-like gene, SLC4A9, on chromosome 5q31. Results: SLC4A9 was initially discovered by genomic sequence annotation and further characterized by sequencing of long-insert cDNA library clones. The predicted protein of 990 amino acids has 12 transmembrane domains and high sequence similarity to other NBCs. The 23-exon gene has 14 known mRNA isoforms. In three regions, mRNA sequence variation is generated by the inclusion or exclusion of portions of an exon. Noncoding SLC4A9 cDNAs were recovered multiple times from different libraries. The 3' untranslated region is fragmented into six alternatively spliced exons and contains expressed Alu, LINE and MER repeats. SLC4A9 has two alternative stop codons and six polyadenylation sites. Its expression is largely restricted to the kidney. In silico approaches were used to characterize two additional novel SLC4A genes and to place SLC4A9 within the context of multiple paralogous gene clusters containing members of the epidermal growth factor (EGF), ankyrin (ANK) and fibroblast growth factor (FGF) families. Seven human EGF-SLC4A-ANK-FGF clusters were found. Conclusion: The novel sodium bicarbonate cotransporter-like gene SLC4A9 demonstrates abundant alternative mRNA processing. It belongs to a growing class of functionally diverse genes characterized by inefficient highly variable splicing. The evolutionary history of the EGF-SLC4A-ANK-FGF gene clusters involves multiple rounds of duplication, apparently followed by large insertions and deletions at paralogous loci and genome-wide gene shuffling. PMID:11305939
Khitrinskaia, I Iu; Khar'kov, V N; Voevoda, M I; Stepanova, V A
2014-01-01
For the first time, we have examined the autosomal gene pool of Siberian, Central Asian, and Far Eastern populations (27 populations of 12 ethnic groups) using a set of polymorphic Alu insertions in the human genome. The results indicate (i) a significant level of genetic diversity in Northern Eurasian populations and (ii) considerable differentiation of the gene pool in the populations of this region. At the CD4 locus, the frequency of Alu (-) is inversely related to the Mongoloid component of the population: the lowest and highest frequencies of the Alu deletion at the CD4 locus were recorded in Eskimos (0.012) and in Russians and Ukrainians (0.35), respectively. The analysis of gene flow showed that Caucasoid populations (Russian, Tajik, and Uzbek), Turkic ethnic groups from Southern Siberia (Altaians and Tuvinians), and the Khanty and Mansy populations received considerable gene flow from outside the population system under study, compared with the East Siberian and Far Eastern ethnic groups. Correlation analysis based on the polymorphic Alu insertions shows that genetic distances correlate most strongly with the anthropological characteristics of the populations.
Transport of 5-aminolevulinic acid by the dipeptide permease in Salmonella typhimurium.
Elliott, T
1993-01-01
In a previous search for mutants of Salmonella typhimurium that are defective in heme synthesis, one class that is apparently defective in 5-aminolevulinic acid (ALA) uptake (alu) was found. Here, I describe the characterization of these mutations. The mutations all map to a single locus near 77.5 min on the genetic map, which is transcribed counterclockwise. Nutritional tests, genetic and physical mapping, and partial DNA sequence analysis revealed that alu mutants are defective in a periplasmic binding protein-dependent permease that also transports dipeptides, encoded by the dpp operon. The uptake of labeled ALA is defective in dpp mutants and is markedly increased in a strain that has elevated transcription of the dpp locus. Unlabeled L-leucyl-glycine competes with labeled ALA for uptake. In a strain carrying both a dpp-lac operon fusion and a functional copy of the dpp locus, the expression of beta-galactosidase is not induced by ALA, nor, in a hemL mutant, does expression of dpp change substantially during starvation for ALA. The dipeptide permease displays a relaxed substrate specificity that allows transport of the important nonpeptide nutrient ALA, whose structure is closely related to that of glycyl-glycine.
Di Francesco, Cristina E; Di Francesco, Daniela; Di Martino, Barbara; Speranza, Roberto; Santori, Domenico; Boari, Andrea; Marsilio, Fulvio
2012-01-01
A new highly sensitive and specific hemi-nested reverse transcription polymerase chain reaction (RT-PCR) assay was applied to detect the nucleoprotein (NP) gene of Canine distemper virus (CDV) in samples collected from dogs showing respiratory, gastrointestinal, and neurological signs. Thirty-eight out of 86 samples were positive, suggesting that despite vaccination, canine distemper may still represent a high risk to the canine population. The 968 base pair (bp) fragments from the hemagglutinin (H) gene of 10 viral strains detected in positive samples were amplified and analyzed by restriction fragment length polymorphism (RFLP) using the AluI and PsiI enzymes in order to differentiate between vaccine and wild-type CDV strains and to characterize the field viral strains. The products of both enzymatic digestions allowed identification of all viruses as wild strains of CDV. In addition, the RFLP analysis with AluI provided further information about the level of identity among the strains analyzed, based on the positions of the cleavage sites in the nucleotide sequences of the H gene. The method could be a more useful and simpler approach for molecular studies of CDV strains.
The (r)evolution of SINE versus LINE distributions in primate genomes: Sex chromosomes are important
Kvikstad, Erika M.; Makova, Kateryna D.
2010-01-01
The densities of transposable elements (TEs) in the human genome display substantial variation both within individual chromosomes and among chromosome types (autosomes and the two sex chromosomes). Finding an explanation for this variability has been challenging, especially in light of genome landscapes unique to the sex chromosomes. Here, using a multiple regression framework, we investigate primate Alu and L1 densities shaped by regional genome features and location on a particular chromosome type. As a result of our analysis, first, we build statistical models explaining up to 79% and 44% of variation in Alu and L1 element density, respectively. Second, we analyze sex chromosome versus autosome TE densities corrected for regional genomic effects. We discover that sex-chromosome bias in Alu and L1 distributions not only persists after accounting for these effects, but even presents differences in patterns, confirming preferential Alu integration in the male germline, yet likely integration of L1s in both male and female germlines or in early embryogenesis. Additionally, our models reveal that local base composition (measured by GC content and density of L1 target sites) and natural selection (inferred via density of most conserved elements) are significant to predicting densities of L1s. Interestingly, measurements of local double-stranded breaks (a 13-mer associated with genome instability) strongly correlate with densities of Alu elements; little evidence was found for the role of recombination-driven deletion in driving TE distributions over evolutionary time. Thus, Alu and L1 densities have been influenced by the combination of distinct local genome landscapes and the unique evolutionary dynamics of sex chromosomes. PMID:20219940
Gene flow and genetic structure in the Galician population (NW Spain) according to Alu insertions
Varela, Tito A; Fariña, José; Diéguez, Lois Pérez; Lodeiro, Rosa
2008-01-01
Background The most recent Alu insertions reveal different degrees of polymorphism in human populations, and a series of characteristics that make them particularly suitable genetic markers for Human Biology studies. This has led these polymorphisms to be used to analyse the origin and phylogenetic relationships between contemporary human groups. This study analyses twelve Alu sequences in a sample of 216 individuals from the autochthonous population of Galicia (NW Spain), with the aim of studying their genetic structure and phylogenetic position with respect to the populations of Western and Central Europe and North Africa, research that is of special interest in revealing European population dynamics, given the peculiarities of the Galician population due to its geographical situation in western Europe, and its historical vicissitudes. Results The insertion frequencies of eleven of the Alu elements analysed were within the variability range of European populations, while the frequency of Yb8NBC125 proved to be the lowest recorded to date in Europe. Taking the twelve polymorphisms into account, the GD value for the Galician population was 0.268. The comparative analyses carried out using the MDS, NJ and AMOVA methods reveal the existence of spatial heterogeneity, and identify three population groups that correspond to the geographic areas of Western-Central Europe, Eastern Mediterranean Europe and North Africa. Galicia is shown to be included in the Western-Central European cluster, together with other Spanish populations. When only considering populations from Mediterranean Europe, the Galician population revealed a degree of gene flow similar to that of the majority of the populations from this geographic area.
Conclusion The results of this study reveal that the Galician population, despite its geographic situation on the western edge of the European continent, occupies an intermediate position in relation to other European populations in general, and Iberian populations in particular. This confirms the important role that migratory movements have had in the European gene pool, at least since Neolithic times. In turn, the MDS and NJ analyses place Galicia within the group composed of Western-Central European populations, which is justified by the influence of Germanic peoples on the Galician population during the Middle Ages. However, it should also be noted that some of the markers analysed have a certain degree of differentiation, possibly due to the region's position as a 'cul-de-sac' in terms of Iberian population dynamics. PMID:19055739
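The GD value quoted in the record above can be illustrated with a minimal Python sketch. It assumes that GD denotes average gene diversity (expected heterozygosity) and uses Nei's unbiased estimator for a biallelic insertion/absence marker; the abstract does not state which estimator the authors used, and the insertion frequencies below are invented for illustration, not data from the study. Only the sample size of 216 individuals comes from the abstract.

```python
def gene_diversity(p_insertion, n_individuals):
    """Nei's unbiased gene diversity for one biallelic locus.

    p_insertion   -- frequency of the Alu insertion allele (0..1)
    n_individuals -- number of diploid individuals sampled
    """
    chromosomes = 2 * n_individuals
    # Probability that two randomly drawn alleles are identical
    homozygosity = p_insertion ** 2 + (1.0 - p_insertion) ** 2
    # Small-sample correction factor 2n / (2n - 1)
    return chromosomes / (chromosomes - 1) * (1.0 - homozygosity)


def mean_gene_diversity(insertion_freqs, n_individuals):
    """Average the per-locus gene diversity over all loci."""
    values = [gene_diversity(p, n_individuals) for p in insertion_freqs]
    return sum(values) / len(values)


# Hypothetical insertion frequencies for twelve loci (NOT the study's data)
example_freqs = [0.05, 0.12, 0.20, 0.33, 0.41, 0.50,
                 0.58, 0.66, 0.75, 0.83, 0.90, 0.97]
print(round(mean_gene_diversity(example_freqs, 216), 3))
```

A locus fixed for either allele contributes zero diversity, and a locus with an insertion frequency of 0.5 contributes the maximum, so the average summarises how polymorphic the marker panel is in the sampled population.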
El-Maarri, Osman; Singer, Heike; Diaz-Lacava, Amalia; Nüsgen, Nicole; Niemann, Barbara; Watzka, Matthias; Reinsberg, Jochen; van der Ven, Hans; Wienker, Thomas; Stoffel-Wagner, Birgit; Schwaab, Rainer; Oldenburg, Johannes
2011-01-01
Previously, we reported on inter-individual and gender specific variations of LINE-1 methylation in healthy individuals. In this study, we investigated whether this variability could be influenced by age or sex hormones in humans. To this end, we studied LINE-1 methylation in vivo in blood-derived DNA from individuals aged 18 to 64 years and from young healthy females at various hormone levels during the menstrual cycle. Our results show that no significant association with age was observed. However, the previously reported increase of LINE-1 methylation in males was reconfirmed. In females, although no correlation between LINE-1 or Alu methylation and hormone levels was observed, a significant stable individual specific level of methylation was noted. In vitro results largely confirmed these findings, as neither estrogen nor dihydrotestosterone affected LINE-1 or Alu methylation in Hek293T, HUVEC, or MDA-kb2 cell lines. In contrast, a decrease in methylation was observed in estrogen-treated T47-Kbluc cell lines strongly expressing estrogen receptor. The very low expression of estrogen receptor in blood cells could explain the observed insensitivity of methylation at LINE-1 to natural hormonal variations in females. In conclusion, neither natural cycle of hormones nor age has a detectable effect on the LINE-1 methylation in peripheral blood cells, while gender remains an important factor. PMID:21311577
Fukami, Maki; Naiki, Yasuhiro; Muroya, Koji; Hamajima, Takashi; Soneda, Shun; Horikawa, Reiko; Jinno, Tomoko; Katsumi, Momori; Nakamura, Akie; Asakura, Yumi; Adachi, Masanori; Ogata, Tsutomu; Kanzaki, Susumu
2015-09-01
Pseudoautosomal region 1 (PAR1) contains SHOX, in addition to seven highly conserved non-coding DNA elements (CNEs) with cis-regulatory activity. Microdeletions involving SHOX exons 1-6a and/or the CNEs result in idiopathic short stature (ISS) and Leri-Weill dyschondrosteosis (LWD). Here, we report six rare copy-number variations (CNVs) in PAR1 identified through copy-number analyses of 245 ISS/LWD patients and 15 unaffected individuals. The six CNVs consisted of three microduplications encompassing SHOX and some of the CNEs, two microduplications in the SHOX 3'-region affecting one or four of the downstream CNEs, and a microdeletion involving SHOX exon 6b and its neighboring CNE. The amplified DNA fragments of two SHOX-containing duplications were detected at chromosomal regions adjacent to the original positions. The breakpoints of a SHOX-containing duplication resided within Alu repeats. A microduplication encompassing four downstream CNEs was identified in an unaffected father-daughter pair, whereas the other five CNVs were detected in ISS patients. These results suggest that microduplications involving SHOX cause ISS by disrupting the cis-regulatory machinery of this gene and that at least some of the microduplications in PAR1 arise from Alu-mediated non-allelic homologous recombination. The pathogenicity of other rare PAR1-linked CNVs, such as CNE-containing microduplications and exon 6b-flanking microdeletions, merits further investigation.
Sun, Zhifu; Wu, Yanhong; Ordog, Tamas; Baheti, Saurabh; Nie, Jinfu; Duan, Xiaohui; Hojo, Kaori; Kocher, Jean-Pierre; Dyck, Peter J; Klein, Christopher J
2014-08-01
DNA methyltransferase 1 (DNMT1) is essential for DNA methylation, gene regulation and chromatin stability. We previously discovered that DNMT1 mutations cause hereditary sensory and autonomic neuropathy type 1 with dementia and hearing loss (HSAN1E; OMIM 614116). HSAN1E is the first adult-onset neurodegenerative disorder caused by a defect in a methyltransferase gene. HSAN1E patients appear clinically normal until young adulthood, then begin developing the characteristic symptoms involving central and peripheral nervous systems. Some HSAN1E patients also develop narcolepsy and it has recently been suggested that HSAN1E is allelic to autosomal dominant cerebellar ataxia, deafness, with narcolepsy (ADCA-DN; OMIM 604121), which is also caused by mutations in DNMT1. A hotspot mutation Y495C within the targeting sequence domain of DNMT1 has been identified among HSAN1E patients. The mutant DNMT1 protein shows premature degradation and reduced DNA methyltransferase activity. Herein, we investigate genome-wide DNA methylation at single-base resolution through whole-genome bisulfite sequencing of germline DNA in 3 pairs of HSAN1E patients and their gender- and age-matched siblings. Over 1 billion 75-bp single-end reads were generated for each sample. In the 3 affected siblings, overall methylation loss was consistently found in all chromosomes with X and 18 being most affected. Paired sample analysis identified 564,218 differentially methylated CpG sites (DMCs; P < 0.05), of which 300,134 were intergenic and 264,084 genic CpGs. Hypomethylation was predominant in both genic and intergenic regions, including promoters, exons, most CpG islands, L1, L2, Alu, and satellite repeats and simple repeat sequences. In some CpG islands, hypermethylated CpGs outnumbered hypomethylated CpGs. In 201 imprinted genes, there were more DMCs than in non-imprinted genes and most were hypomethylated.
Differentially methylated region (DMR) analysis identified 5649 hypomethylated and 1872 hypermethylated regions. Importantly, pathway analysis revealed that 1693 genes associated with the identified DMRs were highly associated with diverse neurological disorders, and that NAD+/NADH metabolism pathways are implicated in the pathogenesis. Our results provide novel insights into the epigenetic mechanism of neurodegeneration arising from a hotspot DNMT1 mutation and reveal pathways potentially important in a broad category of neurological and psychological disorders.
Evaluating the protein coding potential of exonized transposable element sequences
Piriyapongsa, Jittima; Rutledge, Mark T; Patel, Sanil; Borodovsky, Mark; Jordan, I King
2007-01-01
Background Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analyses of complete genome sequences have turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: i-by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and ii-through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. Results We compared the ability of three classes of sequence similarity search methods to detect TE-derived sequences among data sets of experimentally characterized proteins: 1-a profile-based hidden Markov model (HMM) approach, 2-BLAST methods and 3-RepeatMasker. Profile-based methods are more sensitive and more selective than the other methods evaluated. However, the application of profile-based search methods to the detection of TE-derived sequences among well-curated experimentally characterized protein data sets did not turn up many more cases than had been previously detected and nowhere near as many cases as recent genome-wide searches have. We observed that the different search methods used were complementary in the sense that they yielded largely non-overlapping sets of hits and differed in their ability to recover known cases of TE-derived CDS. The probabilistic analysis of TE-derived exon sequences indicates that these sequences have low protein coding potential on average.
In particular, non-autonomous TEs that do not encode protein sequences, such as Alu elements, are frequently exonized but unlikely to encode protein sequences. Conclusion The exaptation of the numerous TE sequences found in exons as bona fide protein coding sequences may prove to be far less common than has been suggested by the analysis of complete genomes. We hypothesize that many exonized TE sequences actually function as post-transcriptional regulators of gene expression, rather than coding sequences, which may act through a variety of double stranded RNA related regulatory pathways. Indeed, their relatively high copy numbers and similarity to sequences dispersed throughout the genome suggests that exonized TE sequences could serve as master regulators with a wide scope of regulatory influence. Reviewers: This article was reviewed by Itai Yanai, Kateryna D. Makova, Melissa Wilson (nominated by Kateryna D. Makova) and Cedric Feschotte (nominated by John M. Logsdon Jr.). PMID:18036258
Aberrant DNA methylation patterns of spermatozoa in men with unexplained infertility.
Urdinguio, Rocío G; Bayón, Gustavo F; Dmitrijeva, Marija; Toraño, Estela G; Bravo, Cristina; Fraga, Mario F; Bassas, Lluís; Larriba, Sara; Fernández, Agustín F
2015-05-01
Are there DNA methylation alterations in sperm that could explain the reduced biological fertility of male partners from couples with unexplained infertility? DNA methylation patterns, not only at specific loci but also at Alu Yb8 repetitive sequences, are altered in infertile individuals compared with fertile controls. Aberrant DNA methylation of sperm has been associated with human male infertility in patients demonstrating either deficiencies in the process of spermatogenesis or low semen quality. Case and control prospective study. This study compares 46 sperm samples obtained from 17 normospermic fertile men and 29 normospermic infertile patients. Illumina Infinium HD Human Methylation 450K arrays were used to identify genomic regions showing differences in sperm DNA methylation patterns between five fertile and seven infertile individuals. Additionally, global DNA methylation of sperm was measured using the Methylamp Global DNA Methylation Quantification Ultra kit (Epigentek) in 14 samples, and DNA methylation at several repetitive sequences (LINE-1, Alu Yb8, NBL2, D4Z4) measured by bisulfite pyrosequencing in 44 sperm samples. A sperm-specific DNA methylation pattern was obtained by comparing the sperm methylomes with the DNA methylomes of differentiated somatic cells using data obtained from methylation arrays (Illumina 450 K) of blood, neural and glial cells deposited in public databases. In this study we conduct, for the first time, a genome-wide study to identify alterations of sperm DNA methylation in individuals with unexplained infertility that may account for the differences in their biological fertility compared with fertile individuals. We have identified 2752 CpGs showing aberrant DNA methylation patterns, and more importantly, these differentially methylated CpGs were significantly associated with CpG sites which are specifically methylated in sperm when compared with somatic cells. 
We also found statistically significant (P < 0.001) associations between DNA hypomethylation and regions corresponding to those which, in somatic cells, are enriched in the repressive histone mark H3K9me3, and between DNA hypermethylation and regions enriched in H3K4me1 and CTCF, suggesting that the relationship between chromatin context and aberrant DNA methylation of sperm in infertile men could be locus-dependent. Finally, we also show that DNA methylation levels, not only at specific loci but also at several repetitive sequences (LINE-1, Alu Yb8, NBL2, D4Z4), were lower in sperm than in somatic cells. Interestingly, Alu Yb8 repetitive sequences in sperm samples from infertile patients showed significantly lower DNA methylation levels than those from controls. Our results are descriptive and further studies would be needed to elucidate the functional effects of aberrant DNA methylation on male fertility. Overall, our data suggest that aberrant sperm DNA methylation might contribute to fertility impairment in couples with unexplained infertility and they provide a promising basis for future research. This work has been financially supported by Fundación Cientifica de la AECC (to R.G.U.); IUOPA (to G.F.B.); FICYT (to E.G.T.); the Spanish National Research Council (CSIC; 200820I172 to M.F.F.); Fundación Ramón Areces (to M.F.F); the Plan Nacional de I+D+I 2008-2011/2013-2016/FEDER (PI11/01728 to A.F.F., PI12/01080 to M.F.F. and PI12/00361 to S.L.); the PN de I+D+I 2008-2011 and the Generalitat de Catalunya (2009SGR01490). A.F.F. is sponsored by ISCIII-Subdirección General de Evaluación y Fomento de la Investigación (CP11/00131). S.L. is sponsored by the Researchers Stabilization Program from the Spanish National Health System (CES09/020). The IUOPA is supported by the Obra Social Cajastur, Spain. © The Author 2015. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved.
Zehner, R; Zimmermann, S; Mebs, D
1998-01-01
To identify common animal species by analysis of the cytochrome b gene, a method has been developed to obtain PCR products of a large domain of the cytochrome b gene (981 bp out of 1140 bp) in humans, selected mammals and birds using the same specifically designed primers. Species-specific RFLP patterns are generated by co-restriction with the restriction endonucleases AluI and NcoI. The RFLP patterns obtained are conclusive even in mixtures of two or more species. The results were confirmed by sequence analysis, which in addition explained intraspecies variations in the RFLP patterns. The method has been applied to forensic casework studies where the origin of roasted meat, stomach contents and a bone sample has been successfully identified.
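The co-restriction step described in the record above can be sketched as an in silico digest in Python. This is an illustrative sketch only: the two 30-nt sequences are invented, not cytochrome b amplicons, and the function names are hypothetical. The recognition sites used (AluI cutting AG^CT, NcoI cutting C^CATGG) are the standard ones for these enzymes.

```python
import re

# Recognition site and the offset within the site after which the top
# strand is cut. AluI: AG^CT (blunt); NcoI: C^CATGG.
ENZYMES = {"AluI": ("AGCT", 2), "NcoI": ("CCATGG", 1)}


def cut_positions(seq, site, offset):
    """Return 0-based cut coordinates for every occurrence of the site."""
    # A lookahead lets overlapping occurrences be found as well.
    return [m.start() + offset for m in re.finditer(f"(?={site})", seq)]


def digest(seq, enzyme_names):
    """Fragment lengths after a complete linear co-digestion, largest first."""
    seq = seq.upper()
    cuts = set()
    for name in enzyme_names:
        site, offset = ENZYMES[name]
        cuts.update(cut_positions(seq, site, offset))
    bounds = [0] + sorted(cuts) + [len(seq)]
    return sorted((b - a for a, b in zip(bounds, bounds[1:])), reverse=True)


# Two made-up sequences that differ by one base inside the AluI site.
amplicon_a = "GATTACAAGCTGGTTCCATGGTTGACTGAA"
amplicon_b = "GATTACAAGTTGGTTCCATGGTTGACTGAA"
print(digest(amplicon_a, ["AluI", "NcoI"]))
print(digest(amplicon_b, ["AluI", "NcoI"]))
```

A single substitution that destroys a recognition site merges two fragments into one, which is why the resulting band pattern can distinguish sequences of similar overall length.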
Isolation of mini- and microsatellite loci from chromosome 19 library
DOE Office of Scientific and Technical Information (OSTI.GOV)
Prosnyak, M.I.; Belajeva, O.V.; Polukarova, L.G.
Mini- and microsatellite sequences are abundant in the human genome and are very useful as genetic markers. We report the isolation of a panel of clones containing marker sequences from chromosome 19. We screened 10,000 clones from the chromosome 19 cosmid library for the presence of di-(CA)n, tri-(TCC)n, (CAC)n microsatellites and M13-like minisatellite sequences. For this we have used synthetic oligonucleotides and polynucleotides, including micro- (CA, TCC, CAC) and minisatellite (M13 core) sequences. Preliminary results indicated that the chromosome 19 cosmid library contained both human and hamster clones. In order to identify human sequences from this library we have developed the technique of colony and blot hybridization with Alu-PCR, L1-PCR and B1-PCR probes. Dozens of clones have been selected, some of which were analyzed by conventional Southern blot analysis and non-radioactive in situ hybridization of chromosomes. Highly informative markers derived from these clones will be used for physical and genetic mapping of chromosome 19.
Maharaj, Ariana; Rampersad, Sephra N
2012-03-01
Members of the genus Colletotrichum include some of the most economically important fungal pathogens in the world. Accurate diagnosis is critical to devising disease management strategies. Two species, Colletotrichum gloeosporioides and C. truncatum, are responsible for anthracnose disease in papaya (Carica papaya L.) and bell pepper (Capsicum annuum L.) in Trinidad. The ITS1-5.8S-ITS2 region of 48 Colletotrichum isolates was sequenced, and the ITS PCR products were analyzed by PCR-RFLP analysis. Restriction site polymorphisms generated from 11 restriction enzymes enabled the identification of specific enzymes that were successful in distinguishing between C. gloeosporioides and C. truncatum isolates. Species-specific restriction fragment length polymorphisms generated by the enzymes AluI, HaeIII, PvuII, RsaI, and Sau3A were used to consistently resolve C. gloeosporioides and C. truncatum isolates from papaya. AluI, ApaI, PvuII, RsaI, and SmaI reliably separated isolates of C. gloeosporioides and C. truncatum from bell pepper. PvuII, RsaI, and Sau3A were also capable of distinguishing among the C. gloeosporioides isolates from papaya based on the different restriction patterns that were obtained as a result of intra-specific variation in restriction enzyme recognition sites in the ITS1-5.8S-ITS2 rDNA region. Of all the isolates tested, C. gloeosporioides from papaya also had the highest number of PCR-RFLP haplotypes. Cluster analysis of sequence and PCR-RFLP data demonstrated that all C. gloeosporioides and C. truncatum isolates clustered separately into species-specific clades regardless of host species. Phylograms also revealed consistent topologies which suggested that the genetic distances for PCR-RFLP-generated data were comparable to that of ITS sequence data. ITS PCR-RFLP fingerprinting is a rapid and reliable method to identify and differentiate between Colletotrichum species.
Favoino, E; Favia, E I; Digiglio, L; Racanelli, V; Shoenfeld, Y; Perosa, F
2014-01-01
The safety of four different adjuvants was assessed in lupus-prone New Zealand black/New Zealand white (BW)F1 mice. Four groups of mice were injected intraperitoneally with incomplete Freund's adjuvant (IFA), complete Freund's adjuvant (CFA), squalene (SQU) or aluminium hydroxide (ALU). An additional group received plain phosphate-buffered saline (PBS) (UNT group). Mice were primed at week 9 and boosted every other week up to week 15. Proteinuria became detectable at weeks 17 (IFA group), 24 (CFA group), 28 (SQU and ALU groups) and 32 (UNT group). Different mean values were obtained among the groups from weeks 17 to 21 [week 17: one-way analysis of variance (ANOVA) P = 0·016; weeks 18 and 19: P = 0·048; weeks 20 and 21: P = 0·013], with values higher in the IFA group than in the others [Tukey's honestly significant difference (HSD) post-test P < 0·05]. No differences in anti-DNA antibody levels were observed among groups. Anti-RNP/Sm antibody developed at week 19 in only one CFA-treated mouse. Mean mouse weight at week 18 was lower in the ALU group than the IFA (Tukey's HSD post-test P = 0·04), CFA (P = 0·01) and SQU (P < 0·0001) groups, while the mean weight in the SQU group was higher than in the IFA (P = 0·009), CFA (P = 0·013) and UNT (P = 0·005) groups. The ALU group weight decreased by almost half between weeks 29 and 31, indicating some toxic effect of ALU in the late post-immunization period. Thus, SQU was the least toxic adjuvant as it did not (i) accelerate proteinuria onset compared to IFA; (ii) induce toxicity compared to ALU or (iii) elicit anti-RNP/Sm autoantibody, as occurred in the CFA group. © 2013 British Society for Immunology.
Önnby, L; Pakade, V; Mattiasson, B; Kirsebom, H
2012-09-01
Removal of As(V) by adsorption from water solutions was studied using three different synthetic adsorbents. The adsorbents, (a) aluminium nanoparticles (Alu-NPs, <50 nm) incorporated in amine rich cryogels (Alu-cryo), (b) molecular imprinted polymers (<38 μm) in polyacrylamide cryogels (MIP-cryo) and (c) thiol functionalised cryogels (SH-cryo) were evaluated regarding material characteristics and arsenic removal in batch test and continuous mode. Results revealed that a composite design with particles incorporated in cryogels was a successful means for applying small particles (nano- and micro- scale) in water solutions with maintained adsorption capacity and kinetics. Low capacity was obtained from SH-cryo and this adsorbent was hence excluded from the study. The adsorption capacities for the composites were 20.3 ± 0.8 mg/g adsorbent (Alu-cryo) and 7.9 ± 0.7 mg/g adsorbent (MIP-cryo) respectively. From SEM images it was seen that particles were homogeneously distributed in Alu-cryo and heterogeneously distributed in MIP-cryo. The particle incorporation increased the mechanical stability and the polymer backbones of pure polyacrylamide (MIP-cryo) were of better stability than the amine containing polymer backbone (Alu-cryo). Both composites worked well in the studied pH range of pH 2-8. Adsorption tested in real wastewater spiked with arsenic showed that co-ions (nitrate, sulphate and phosphate) affected arsenic removal for Alu-cryo more than for MIP-cryo. Both composites still adsorbed well in the presence of counter-ions (copper and zinc) present at low concentrations (μg/l). The unchanged and selective adsorption in realistic water observed for MIP-cryo was concluded to be due to a successful imprinting, here controlled using a non-imprinted polymer (NIP). A development of MIP-cryo is needed, considering its low adsorption capacity. Copyright © 2012 Elsevier Ltd. All rights reserved.
The PROGINS polymorphism of the human progesterone receptor diminishes the response to progesterone.
Romano, Andrea; Delvoux, Bert; Fischer, Dagmar-Christiane; Groothuis, Patrick
2007-02-01
The human progesterone receptor (PR) is a ligand-dependent transcription factor and two isoforms (PRA and PRB) can be distinguished. PROGINS, a PR polymorphic variant, affects PRA and PRB and acts as a risk-modulating factor in several gynaecological disorders. Little is known about the functional consequences of this variant. Here, we characterise the properties of PROGINS with respect to transcription, mRNA maturation, protein activity and proliferation. PROGINS is characterised by a 320 bp PV/HS-1 Alu insertion in intron G and two point mutations, V660L in exon 4 and H770H (silent substitution) in exon 5. The Alu element contains a half oestrogen-response element/Sp1-binding site (Alu-ERE/Sp1), which acts as an in-cis intronic enhancer leading to increased transcription of the PROGINS allele in response to 17beta-oestradiol. Moreover, Alu insertions in the human genome are frequently methylated. Our data indicate that the PROGINS-Alu does not affect gene transcription due to DNA methylation. However, the Alu element reduced the stability of the PROGINS transcript compared with the CP allele and does not generate splice variants. The amino acid substitution (V660L) in exon 4 leads to differences in PR phosphorylation and degradation in the two PR variants upon ligand binding, most likely as a result of differences in the three-dimensional structures of the two PR variants. As a consequence, the PR-L660 (PROGINS) variant (1) displays decreased transactivation activity in a luciferase reporter system and (2) is less efficient in opposing cell proliferation in hamster ovarian cells expressing human PRA, when compared with the PR-V660 (most common variant). Taken together, our results indicate that the PROGINS variant of PR is less responsive to progestin compared with the most common PR because of (i) reduced amounts of gene transcript and (ii) decreased protein activity.
Web-Based Analysis for Student-Generated Complex Genetic Profiles
ERIC Educational Resources Information Center
Kass, David H.; LaRoe, Robert
2007-01-01
A simple, rapid method for generating complex genetic profiles using Alu-based markers was recently developed for students primarily at the undergraduate level to learn more about forensics and paternity analysis. On the basis of the Cold Spring Harbor Allele Server, which provides an excellent tool for analyzing a single Alu variant, we present a…
Sciamanna, Ilaria; Gualtieri, Alberto; Cossetti, Cristina; Osimo, Emanuele Felice; Ferracin, Manuela; Macchia, Gianfranco; Aricò, Eleonora; Prosseda, Gianni; Vitullo, Patrizia; Misteli, Tom; Spadafora, Corrado
2013-01-01
LINE-1 elements make up the most abundant retrotransposon family in the human genome. Full-length LINE-1 elements encode a reverse transcriptase (RT) activity required for their own retrotransposition as well as that of non-autonomous Alu elements. LINE-1 elements are poorly expressed in normal cells and abundantly expressed in cancer cells. Decreasing RT activity in cancer cells, by either LINE-1-specific RNA interference, or by RT inhibitory drugs, was previously found to reduce proliferation and promote differentiation and to antagonize tumor growth in animal models. Here we have investigated how RT exerts these global regulatory functions. We report that the RT inhibitor efavirenz (EFV) selectively downregulates proliferation of transformed cell lines, while exerting only mild effects on non-transformed cells; this differential sensitivity matches a differential RT abundance, which is high in the former and undetectable in the latter. Using CsCl density gradients, we selectively identify Alu and LINE-1 containing DNA:RNA hybrid molecules in cancer but not in normal cells. Remarkably, hybrid molecules fail to form in tumor cells treated with EFV under the same conditions that repress proliferation and induce the reprogramming of expression profiles of coding genes, microRNAs (miRNAs) and ultraconserved regions (UCRs). The RT-sensitive miRNAs and UCRs are significantly associated with Alu sequences. The results suggest that LINE-1-encoded RT governs the balance between single-stranded and double-stranded RNA production. In cancer cells the abundant RT reverse-transcribes retroelement-derived mRNAs forming RNA:DNA hybrids. We propose that this impairs the formation of double-stranded RNAs and the ensuing production of small regulatory RNAs, with a direct impact on gene expression. RT inhibition restores the ‘normal’ small RNA profile and the regulatory networks that depend on them.
Thus, the retrotransposon-encoded RT drives a previously unrecognized mechanism crucial to the transformed state in tumor cells. PMID:24345856
Analysis of sequence repeats of proteins in the PDB.
Mary Rajathei, David; Selvaraj, Samuel
2013-12-01
Internal repeats in protein sequences play a significant role in the evolution of protein structure and function. Applications of different bioinformatics tools help in the identification and characterization of these repeats. In the present study, we analyzed sequence repeats in a non-redundant set of proteins available in the Protein Data Bank (PDB). We used RADAR for detecting internal repeats in a protein, PDBeFOLD for assessing structural similarity, PDBsum for finding functional involvement and Pfam for domain assignment of the repeats in a protein. Through the analysis of sequence repeats, we found that the identity of the sequence repeats falls in the range of 20-40% and that the superimposed structures of most of the sequence repeats maintain a similar overall fold. Analysis of sequence repeats at the functional level reveals that most of the sequence repeats are involved in the function of the protein through functionally involved residues in the repeat regions. We also found that sequence repeats in single and two domain proteins often contained conserved sequence motifs for the function of the domain. Copyright © 2013 Elsevier Ltd. All rights reserved.
Design and implementation of low power clock gated 64-bit ALU on ultra scale FPGA
NASA Astrophysics Data System (ADS)
Gupta, Ashutosh; Murgai, Shruti; Gulati, Anmol; Kumar, Pradeep
2016-03-01
A 64-bit energy-efficient Arithmetic and Logic Unit (ALU) using a negative-latch-based clock gating technique is designed in this paper. The 64-bit ALU is designed using a multiplexer-based full adder cell. We have designed a 64-bit ALU with a gated clock, using a negative-latch-based circuit to generate the gated clock. This gated clock is used to control the multiplexer-based 64-bit ALU. The circuit has been synthesized on a Kintex FPGA through Xilinx ISE Design Suite 14.7 using 28 nm technology in Verilog HDL. The circuit has been simulated on ModelSim 10.3c. The design is verified using SystemVerilog on QuestaSim in a UVM environment. We have achieved 74.07%, 92.93% and 95.53% reduction in total clock power, 89.73%, 91.35% and 92.85% reduction in I/O power, 67.14%, 62.84% and 74.34% reduction in dynamic power and 25.47%, 29.05% and 46.13% reduction in total supply power at 20 MHz, 200 MHz and 2 GHz frequency respectively. The power has been calculated using the XPower Analyzer tool of Xilinx ISE Design Suite 14.3.
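The negative-latch-based gating described above can be illustrated with a small behavioral model. This is only a sketch under stated assumptions, not the authors' Verilog design: the function names `simulate_gated_clock` and `count_rising_edges` are hypothetical, signals are sampled as lists of 0/1 values, and the rising-edge count is used merely as a rough proxy for clock switching activity.

```python
def simulate_gated_clock(clk, enable):
    """Behavioral model of a negative-latch-based clock gate (illustrative only).

    The latch is transparent while clk is 0 and holds its value while clk is 1,
    so changes on `enable` during the high phase cannot glitch the output.
    The gated clock is clk AND latched enable.
    """
    latched = 0
    gated = []
    for c, en in zip(clk, enable):
        if c == 0:          # latch transparent on the low phase
            latched = en
        gated.append(c & latched)
    return gated


def count_rising_edges(signal):
    """Count 0 -> 1 transitions, a crude proxy for clock switching activity."""
    return sum(1 for prev, cur in zip(signal, signal[1:]) if prev == 0 and cur == 1)


# Hypothetical stimulus: 8 clock cycles, ALU enabled for the first 2 and last 2.
clk = [0, 1] * 8
enable = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
gated = simulate_gated_clock(clk, enable)
print(count_rising_edges(clk), count_rising_edges(gated))
```

In this toy stimulus the gated clock toggles only while the enable is asserted, which is the mechanism by which clock gating is expected to cut clock power; the actual savings reported in the abstract come from the authors' FPGA power analysis, not from a model like this.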
Chromosomal arrangement of leghemoglobin genes in soybean.
Lee, J S; Brown, G G; Verma, D P
1983-01-01
A cluster of four different leghemoglobin (Lb) genes was isolated from AluI-HaeIII and EcoRI genomic libraries of soybean in a set of overlapping clones which together include 45 kilobases (kb) of contiguous DNA. These four genes, including a pseudogene, are present in the same orientation and are arranged in the order: 5'-Lba-Lbc1-Lb psi-Lbc3-3'. The intergenic regions average 2.5 kb. In addition to this main Lb locus, there are other Lb genes which do not appear to be contiguous to this locus. A sequence probably common to the 3' region of Lb loci was found flanking the Lbc3 gene. The 3' flanking region of the main Lb locus also contains a sequence that appears to be expressed more abundantly in root tissue. Another sequence which is primarily expressed in root and leaf is found 5' to two Lb loci. Overall, the main leghemoglobin locus is similar in structure to the mammalian globin gene loci. PMID:6310504
Dynamic ASXL1 Exon Skipping and Alternative Circular Splicing in Single Human Cells
Natarajan, Sivaraman; Carter, Robert; Brown, Patrick O.
2016-01-01
Circular RNAs comprise a poorly understood new class of noncoding RNA. In this study, we used a combination of targeted deletion, high-resolution splicing detection, and single-cell sequencing to deeply probe ASXL1 circular splicing. We found that efficient circular splicing required the canonical transcriptional start site and inverted AluSx elements. Sequencing-based interrogation of isoforms after ASXL1 overexpression identified promiscuous linear splicing between all exons, with the two most abundant non-canonical linear products skipping the exons that produced the circular isoforms. Single-cell sequencing revealed a strong preference for either the linear or circular ASXL1 isoforms in each cell, and found the predominant exon skipping product is frequently co-expressed with its reciprocal circular isoform. Finally, absolute quantification of ASXL1 isoforms confirmed our findings and suggests that standard methods overestimate circRNA abundance. Taken together, these data reveal a dynamic new view of circRNA genesis, providing additional framework for studying their roles in cellular biology. PMID:27736885
Brookfield, John F. Y.; Johnson, Louise J.
2006-01-01
Some families of mammalian interspersed repetitive DNA, such as the Alu SINE sequence, appear to have evolved by the serial replacement of one active sequence with another, consistent with there being a single source of transposition: the “master gene.” Alternative models, in which multiple source sequences are simultaneously active, have been called “transposon models.” Transposon models differ in the proportion of elements that are active and in whether inactivation occurs at the moment of transposition or later. Here we examine the predictions of various types of transposon model regarding the patterns of sequence variation expected at an equilibrium between transposition, inactivation, and deletion. Under the master gene model, all bifurcations in the true tree of elements occur in a single lineage. We show that this property will also hold approximately for transposon models in which most elements are inactive and where at least some of the inactivation events occur after transposition. Such tree shapes are therefore not conclusive evidence for a single source of transposition. PMID:16790583
Huebner, K L; Kunkel, A K; McConnel, C S; Callan, R J; Dinsmore, R P; Caixeta, L S
2017-05-01
Pain management during and following disbudding procedures has been studied extensively, though few studies have evaluated wound healing following cautery disbudding in dairy calves. The purpose of this study was to observe wound healing following cautery disbudding with or without treatment using a topical aluminum-based aerosol bandage (ALU) in preweaned dairy calves. Dairy calves were disbudded within the first 3 wk of life using a standard cautery disbudding protocol. The ALU treatment was randomly allocated to the right or left horn bud within each animal. The outcomes measured were lesion score (LS) and wound diameter (WD). The LS was evaluated on a scale of 1 to 3, with LS = 1 representing normal healing without a scab or exudate, LS = 2 having the presence of a scab, and LS = 3 showing the presence of wound exudate. Lesion score and WD were evaluated on a weekly basis following dehorning for 3 wk. A total of 209 animals completed the study. No difference was observed in LS between groups during the first 2 wk postdisbudding, but the proportion of LS = 3 on wk 3 postdisbudding was greater for the control group when compared with ALU (17 vs. 8%, respectively). During wk 1 and 2 postdisbudding, the odds of having delayed healing, or a LS ≥2, were similar for both groups. However, the odds tended to be different at wk 3 postdisbudding with control disbudding sites being 1.42 times more likely to have delayed healing than ALU. In wk 3, WD was 1 mm smaller in the treatment group compared with the control, and treatment decreased diameter over time compared with controls. Overall, once abnormal wound healing was observed, the likelihood of having abnormal wound healing the following week was increased. However, treatment with ALU diminished this effect on delayed healing during the follow-up period. Based on these results, the use of ALU improved wound healing following cautery disbudding of preweaned dairy calves. 
Copyright © 2017 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Naimi, Ashley Isaac; Paquet, Catherine; Gauvin, Lise; Daniel, Mark
2009-12-01
Cardiovascular Disease (CVD) has been linked to "neighbourhood" socioeconomic status (nSES), often operationalized as a composite index of aggregate income, occupation and education within predefined administrative boundaries. The role of specific, non-composite socioeconomic markers has not been clearly explained. It is also unclear whether the relationship between nSES and CVD varies according to sex. We sought to determine whether area-level unemployment (ALU) was associated with CVD risk, and whether this association differed by sex. 342 individuals from the Montreal Neighbourhood Survey of Lifestyle and Health provided self-reported behavioural and socioeconomic information. A nurse collected biochemical and anthropometric data. ALU, a weighted average of the proportion of persons 15-years and older available for but without work, was measured using a Geographic Information System for a 250 m buffer centred on individual residence. Generalized Estimating Equations were used to estimate the associations between ALU, body mass index (BMI) and a cumulative score for total cardiometabolic risk (TCR). After confounder adjustments, the mean 4th minus 1st quartile difference in BMI was 3.19 kg/m² (95% CI: 2.39, 3.99), while the prevalence ratio for the 4th relative to 1st quartile for TCR was 2.20 (95% CI: 1.53, 3.17). Sex interacted with ALU; women relative to men had greater mean 3.97 kg/m² (95% CI: 2.08, 5.85) BMI and greater mean TCR 1.51 (95% CI: 0.78, 2.90), contrasted at mean ALU. Area-level unemployment is associated with greater CVD risk, and this association is stronger for women.
NASA Astrophysics Data System (ADS)
Spencer, S.; Ogle, S. M.; Wirth, T. C.; Sivakami, G.
2016-12-01
The Intergovernmental Panel on Climate Change (IPCC) provides methods and guidance for estimating anthropogenic greenhouse gas emissions for reporting to the United Nations Framework Convention on Climate Change. The methods are comprehensive and require extensive data compilation, management, aggregation, documentation and calculations of source and sink categories to achieve robust emissions estimates. IPCC Guidelines describe three estimation tiers that require increasing levels of country-specific data and method complexity. Use of higher tiers should improve overall accuracy and reduce uncertainty in estimates. The AFOLU sector represents a complex set of methods for estimating greenhouse gas emissions and carbon sinks. Major AFOLU emissions and sinks include carbon dioxide (CO2) from carbon stock change in biomass, dead organic matter and soils, urea or lime application to soils, and oxidation of carbon in drained organic soils; nitrous oxide (N2O) and methane (CH4) emissions from livestock management and biomass burning; N2O from organic amendments and fertilizer application to soils; and CH4 emissions from rice cultivation. To assist inventory compilers with calculating AFOLU-sector estimates, the Agriculture and Land Use Greenhouse Gas Inventory Tool (ALU) was designed to implement Tier 1 and 2 methods using IPCC Good Practice Guidance. It guides the compiler through activity data entry, emission factor assignment, and emissions calculations while carefully maintaining data integrity. ALU also provides IPCC defaults and can estimate uncertainty. ALU was designed to simplify the AFOLU inventory compilation process at regional or national scales; disaggregating the process into a series of steps reduces the potential for errors in the compilation process. An example application has been developed using ALU to estimate methane emissions from rice production in the United States.
Aires-de-Sousa, João; Aires-de-Sousa, Luisa
2003-01-01
We propose representing individual positions in DNA sequences by virtual potentials generated by other bases of the same sequence. This is a compact representation of the neighbourhood of a base. The distribution of the virtual potentials over the whole sequence can be used as a representation of the entire sequence (SEQREP code). It is a flexible code, with a length independent of the sequence size, does not require previous alignment, and is convenient for processing by neural networks or statistical techniques. To evaluate its biological significance, the SEQREP code was used for training Kohonen self-organizing maps (SOMs) in two applications: (a) detection of Alu sequences, and (b) classification of sequences encoding for HIV-1 envelope glycoprotein (env) into subtypes A-G. It was demonstrated that SOMs clustered sequences belonging to different classes into distinct regions. For independent test sets, very high rates of correct predictions were obtained (97% in the first application, 91% in the second). Possible areas of application of SEQREP codes include functional genomics, phylogenetic analysis, detection of repetitions, database retrieval, and automatic alignment. Software for representing sequences by SEQREP code, and for training Kohonen SOMs is made freely available from http://www.dq.fct.unl.pt/qoa/jas/seqrep. Supplementary material is available at http://www.dq.fct.unl.pt/qoa/jas/seqrep/bioinf2002
Peculiarities of RFLP of highly repetitive DNA in crow genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chelomina, G.N.; Kryukov, A.P.; Ivanov, S.V.
1995-02-01
We present a study of the structural organization of highly repetitive DNA in genomes of hooded crow Corvus cornix L., carrion crow C. corone L., and jungle crow C. macrorhynchos Wagl. RFLP and blot-hybridization with ³²P-labeled Msp I fragment from hooded crow nDNA suggest the interspecific structural conservatism of the most repetitive DNA. The family of repeats we studied had tandem organization and the same (210 bp) period of reiteration for a set of restriction enzymes. However, in parallel to the general similarity of restriction patterns, there are species-specific peculiarities. The repetitive family revealed (Alu I, BsuR I, and Msp I fragments) has quantitative RFLP of nDNA and interspecific differences in the extent of the multimer "ladder" pattern of Msp I fragments. The latter is more pronounced in nDNA of carrion crow than in that of phylogenetically distant jungle crow and closely related hooded crow. This suggests a recent amplification event for highly organized homological repeats in crow genomes. 10 refs., 2 figs.
Alexandrou, Angelos; Papaevripidou, Ioannis; Tsangaras, Kyriakos; Alexandrou, Ioanna; Tryfonidis, Marios; Christophidou-Anastasiadou, Violetta; Zamba-Papanicolaou, Eleni; Koumbaris, George; Neocleous, Vassos; Phylactou, Leonidas A; Skordis, Nicos; Tanteles, George A; Sismani, Carolina
2016-12-01
Haploinsufficiency of the short stature homeobox-containing (SHOX) gene has been shown to result in a spectrum of phenotypes ranging from Leri-Weill dyschondrosteosis (LWD) at the more severe end to SHOX-related short stature at the milder end of the spectrum. Most alterations are whole gene deletions, point mutations within the coding region, or microdeletions in its flanking sequences. Here, we present the clinical and molecular data as well as the potential molecular mechanism underlying a novel microdeletion, causing a variable SHOX-related haploinsufficiency disorder in a three-generation family. The phenotype resembles that of LWD in females; in males, however, the phenotypic expression is milder. The 15523-bp SHOX intragenic deletion, encompassing exons 3-6, was initially detected by array-CGH, followed by MLPA analysis. Sequencing of the breakpoints indicated an Alu recombination-mediated deletion (ARMD) as the potential causative mechanism.
LINE dancing in the human genome: transposable elements and disease.
Belancio, Victoria P; Deininger, Prescott L; Roy-Engel, Astrid M
2009-10-27
Transposable elements (TEs) have been consistently underestimated in their contribution to genetic instability and human disease. TEs can cause human disease by creating insertional mutations in genes, and also contributing to genetic instability through non-allelic homologous recombination and introduction of sequences that evolve into various cis-acting signals that alter gene expression. Other outcomes of TE activity, such as their potential to cause DNA double-strand breaks or to modulate the epigenetic state of chromosomes, are less fully characterized. The currently active human transposable elements are members of the non-LTR retroelement families, LINE-1, Alu (SINE), and SVA. The impact of germline insertional mutagenesis by TEs is well established, whereas the rate of post-insertional TE-mediated germline mutations and all forms of somatic mutations remain less well quantified. The number of human diseases discovered to be associated with non-allelic homologous recombination between TEs, and particularly between Alu elements, is growing at an unprecedented rate. Improvement in the technology for detection of such events, as well as the mounting interest in the research and medical communities in resolving the underlying causes of the human diseases with unknown etiology, explain this increase. Here, we focus on the most recent advances in understanding of the impact of the active human TEs on the stability of the human genome and its relevance to human disease.
Sequence repeats and protein structure
NASA Astrophysics Data System (ADS)
Hoang, Trinh X.; Trovato, Antonio; Seno, Flavio; Banavar, Jayanth R.; Maritan, Amos
2012-11-01
Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.
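The two-letter (H/P) reduction and the notion of an imperfect repeat mentioned above can be made concrete with a short sketch. It is an illustration only, not the coarse-grained folding model used in the paper: the hydrophobic residue set and the function names `to_hp` and `repeat_identity` are assumptions introduced here.

```python
# Assumed hydrophobic residue set; other classifications exist and the
# paper's own mapping is not stated in the abstract.
HYDROPHOBIC = set("AVLIMFWC")


def to_hp(seq):
    """Reduce an amino acid sequence to a two-letter H (hydrophobic) / P (polar) string."""
    return "".join("H" if aa in HYDROPHOBIC else "P" for aa in seq.upper())


def repeat_identity(hp, period):
    """Fraction of positions i where hp[i] == hp[i + period].

    A value of 1.0 means the string is a perfect repeat with that period;
    lower values indicate imperfection in the repetition.
    """
    pairs = len(hp) - period
    if period <= 0 or pairs <= 0:
        raise ValueError("period must be positive and shorter than the sequence")
    matches = sum(1 for i in range(pairs) if hp[i] == hp[i + period])
    return matches / pairs


print(to_hp("AKLEAKLE"))                 # HP pattern of a toy peptide
print(repeat_identity("HPHPHPHP", 2))    # perfect period-2 repeat
print(repeat_identity("HPHPHHHP", 2))    # same repeat with one substitution
```

A score like this only quantifies how regular a sequence is; relating that regularity to folding temperature requires the structural model described in the abstract.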
Greally, John M
2002-01-08
To test whether regions undergoing genomic imprinting have unique genomic characteristics, imprinted and nonimprinted human loci were compared for nucleotide and retroelement composition. Maternally and paternally expressed subgroups of imprinted genes were found to differ in terms of guanine and cytosine, CpG, and retroelement content, indicating a segregation into distinct genomic compartments. Imprinted regions have been normally permissive to L1 long interspersed transposable element retroposition during mammalian evolution but universally and significantly lack short interspersed transposable elements (SINEs). The primate-specific Alu SINEs, as well as the more ancient mammalian-wide interspersed repeat SINEs, are found at significantly low densities in imprinted regions. The latter paleogenomic signature indicates that the sequence characteristics of currently imprinted regions existed before the mammalian radiation. Transitions from imprinted to nonimprinted genomic regions in cis are characterized by a sharp inflection in SINE content, demonstrating that this genomic characteristic can help predict the presence and extent of regions undergoing imprinting. During primate evolution, SINE accumulation in imprinted regions occurred at a decreased rate compared with control loci. The constraint on SINE accumulation in imprinted regions may be mediated by an active selection process. This selection could be because of SINEs attracting and spreading methylation, as has been found at other loci. Methylation-induced silencing could lead to deleterious consequences at imprinted loci, where inactivation of one allele is already established, and expression is often essential for embryonic growth and survival.
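As a rough illustration of the kind of nucleotide-composition comparison described above, the following sketch computes G+C fraction and an observed/expected CpG ratio for a DNA string. The abstract does not say which exact measures were used, so the observed/expected formula here (CpG count times length, divided by the product of C and G counts) and the function names are assumptions, chosen because this ratio is a commonly used CpG statistic.

```python
def _acgt_length(seq):
    """Number of unambiguous bases (A, C, G, T) in the sequence."""
    return sum(seq.count(b) for b in "ACGT")


def gc_fraction(seq):
    """Fraction of unambiguous bases that are G or C."""
    seq = seq.upper()
    n = _acgt_length(seq)
    if n == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return (seq.count("G") + seq.count("C")) / n


def cpg_obs_exp(seq):
    """Observed/expected CpG ratio: (#CG * N) / (#C * #G).

    Returns 0.0 when the sequence has no C or no G, since no CpG is possible.
    """
    seq = seq.upper()
    c, g = seq.count("C"), seq.count("G")
    if c == 0 or g == 0:
        return 0.0
    return seq.count("CG") * _acgt_length(seq) / (c * g)


# Toy sequences standing in for two loci; not real genomic data.
for name, locus in [("locus_a", "ACGTCGCGAT"), ("locus_b", "ATTTAACTGA")]:
    print(name, round(gc_fraction(locus), 2), round(cpg_obs_exp(locus), 2))
```

Applied to windows along imprinted and control regions, statistics of this kind would give per-locus values that can then be compared between groups; retroelement content would additionally need repeat annotation, which is outside this sketch.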
PMID:11756672
Goodrich, James A; Kugel, Jennifer F
2010-01-01
With eukaryotic non-coding RNAs (ncRNAs) now established as critical regulators of cellular transcription, the true diversity with which they can elicit biological effects is beginning to be appreciated. Two ncRNAs, mouse B2 RNA and human Alu RNA, have been found to repress mRNA transcription in response to heat shock. They do so by binding directly to RNA polymerase II, assembling into complexes on promoter DNA, and disrupting contacts between the polymerase and the DNA. Such a mechanism of repression had not previously been observed for a eukaryotic ncRNA; however, there are examples of eukaryotic protein domains that repress transcription by blocking essential protein-DNA interactions. Comparing the mechanism of transcriptional repression utilized by these protein domains to that used by B2 and Alu RNAs raises intriguing questions regarding transcriptional control, and how B2 and Alu RNAs might themselves be regulated.
Study of behaviors of aluminum overlayers deposited on uranium via AES, EELS, and XPS
NASA Astrophysics Data System (ADS)
Liu, Kezhao; Luo, Lizhu; Zhou, Wei; Yang, Jiangrong; Xiao, Hong; Hong, Zhanglian; Yang, Hui
2013-04-01
Aluminum overlayers on uranium were prepared by sputtering at room temperature in an ultra-high vacuum chamber. The growth mode of aluminum overlayers and behaviors of the Al/U interface reaction were studied in situ by Auger electron spectroscopy, electron energy loss spectroscopy, and X-ray photoelectron spectroscopy. The results suggested that interdiffusion took place at the Al/U interface during the initial stage of deposition. The U4f spectra of the Al/U interface showed strong correlation satellites at binding energies of 380.4 and 392.7 eV and plasma loss features at 404.2 eV, respectively. The interactions between aluminum and uranium yielded the intermetallic compound UAlx, inducing a shift to lower binding energy for the Al2p peaks. The results indicated that aluminum overlayers were formed on the uranium by sputtering in an island growth mode.
Balajee, A S; Sharma, T
1994-01-01
In situ digestion of metaphase chromosomes with AluI revealed differences in the distribution of Mus musculus-like AT-rich heterochromatin in the complements of the Indian pygmy field mice, M. booduga and M. dunni. In M. booduga, although the banding pattern was almost comparable to that of M. musculus, AluI-resistant bands were much reduced in size at the centromeric regions. In all three chromosome types of the M. dunni complex, M. musculus-like AT-rich heterochromatin was found to be confined mainly to two small segments on the short arm of the X chromosome. This AT-rich heterochromatin varied greatly in both position and quantity in the two X chromosomes. In addition to the polymorphism, a whole block of M. musculus-like AT-rich heterochromatin was found at the centromeric region of an autosome in one individual of M. dunni.
Rocañín-Arjó, Ares; Rodríguez-Botigué, Laura; Esteban, Esther; Theves, Catherine; Evdokimova, Larissa E; Fedorova, Sardana A; Gibert, Morgane; Crubezy, Eric; Moral, Pedro
2013-01-01
Twelve autosomal and 8 X chromosome Alu markers were genotyped for the first time in 161 Central and West Yakuts to test their ability to reconstruct the genetic history of these populations, the northernmost Turkic-speaking ethnic group living in Siberia. Autosomal data revealed that both groups showed extremely close genetic distances to other populations of Siberian origins that occupied areas from Lake Baikal, the ancestral place of origin of Yakuts, to North Siberia, their current territories. Autosomal and X chromosome data revealed some discrepancies in the genetic differentiation and the effective sizes of Central and West Yakuts. Such discrepancies could be related to the patrilineal and occasionally polygamous structure of these populations. Autosomal and X Alu markers are informative for reconstructing past population demography and history, but their utility is limited by the available data. This study represents a contribution for further investigations on these populations.
Rangku Alu - A Traditional East Nusa Tenggara Game in Android Platform
NASA Astrophysics Data System (ADS)
Rahmat, R. F.; Ramadhan, R.; Arisandi, D.; Syahputra, M. F.; Sheta, O.
2018-03-01
Rangku Alu is a traditional Indonesian game originating from Manggarai, East Nusa Tenggara, which is played by moving two pairs of bamboo poles or sticks until the opponent's foot is caught between them. Nowadays the game is rarely played; with the rapid development of technology, however, it can be played individually by anyone as an online game on devices such as mobile phones or PCs. Rangku Alu is a game in which the moves of a dancer or player vary in each dance. In this research, the Fisher-Yates shuffle algorithm was used as a randomization method to determine the next moves and to prevent the tap areas from appearing at the same place more than once in a row. The results show that the tap areas never appeared at the same place two or more times in succession.
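The randomization idea described above can be sketched as follows. This is a minimal illustration, not the game's actual implementation: the tap-area labels, the function names, and the boundary rule (swapping the first element of a new round when it equals the last tap of the previous round) are assumptions made here, and the tap areas are assumed to be distinct.

```python
import random


def fisher_yates_shuffle(items, rng):
    """Return a shuffled copy of items using the Fisher-Yates algorithm."""
    out = list(items)
    for i in range(len(out) - 1, 0, -1):
        j = rng.randint(0, i)            # inclusive on both ends
        out[i], out[j] = out[j], out[i]
    return out


def next_round(tap_areas, previous_last, rng):
    """Shuffle one round of distinct tap areas, avoiding a repeat at the boundary.

    Within a round every area appears once, so no repeat can occur there.
    If the round would start with the area that ended the previous round,
    the first element is swapped with a randomly chosen later one.
    """
    order = fisher_yates_shuffle(tap_areas, rng)
    if previous_last is not None and len(order) > 1 and order[0] == previous_last:
        k = rng.randint(1, len(order) - 1)
        order[0], order[k] = order[k], order[0]
    return order


def tap_sequence(tap_areas, rounds, seed=0):
    """Concatenate several shuffled rounds into one tap sequence."""
    rng = random.Random(seed)
    sequence, last = [], None
    for _ in range(rounds):
        sequence.extend(next_round(tap_areas, last, rng))
        last = sequence[-1]
    return sequence


# Hypothetical tap areas on the screen.
print(tap_sequence(["top-left", "top-right", "bottom-left", "bottom-right"], 3))
```

Shuffling whole rounds guarantees each area is used equally often, and the boundary swap is one simple way to obtain the "never twice in a row" property the abstract reports.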
Alba, Sandra; Hetzel, Manuel W; Goodman, Catherine; Dillip, Angel; Liana, Jafari; Mshinda, Hassan; Lengeler, Christian
2010-06-15
To improve access to treatment in the private retail sector a new class of outlets known as accredited drug dispensing outlets (ADDO) was created in Tanzania. Tanzania changed its first-line treatment for malaria from sulphadoxine-pyrimethamine (SP) to artemether-lumefantrine (ALu) in 2007. Subsidized ALu was made available in both health facilities and ADDOs. The effect of these interventions on access to malaria treatment was studied in rural Tanzania. The study was carried out in the villages of Kilombero and Ulanga Demographic Surveillance System (DSS) and in Ifakara town. Data collection consisted of: 1) yearly censuses of shops selling drugs; 2) collection of monthly data on availability of anti-malarials in public health facilities; and 3) retail audits to measure anti-malarial sales volumes in all public, mission and private outlets. The data were complemented with DSS population data. Between 2004 and 2008 access to malaria treatment greatly improved and the number of anti-malarial treatment doses dispensed increased by 78%. Particular improvements were observed in the availability (from 0.24 shops per 1,000 people in 2004 to 0.39 in 2008) and accessibility (from 71% of households within 5 km of a shop in 2004 to 87% in 2008) of drug shops. Despite no improvements in affordability this resulted in an increase of the market share from 49% of anti-malarial sales in 2005 to 59% in 2008. The change of treatment policy from SP to ALu led to severe stock-outs of SP in health facilities in the months leading up to the introduction of ALu (only 40% of months in stock), but these were compensated by the wide availability of SP in shops. After the introduction of ALu, stock levels of the drug were relatively high in public health facilities (over 80% of months in stock), but the drug could only be found in 30% of drug shops and in no general shops. This resulted in a low overall utilization of the drug (19% of all anti-malarial sales). The public health and private retail sector are important complementary sources of treatment in rural Tanzania. Ensuring the availability of ALu in the private retail sector is important for its successful uptake.
PMID:20550654
Functional impact of the human mobilome.
Babatz, Timothy D; Burns, Kathleen H
2013-06-01
The human genome is replete with interspersed repetitive sequences derived from the propagation of mobile DNA elements. Three families of human retrotransposons remain active today: LINE1, Alu, and SVA elements. Since 1988, de novo insertions at previously recognized disease loci have been shown to generate highly penetrant alleles in Mendelian disorders. Only recently has the extent of germline-transmitted retrotransposon insertion polymorphism (RIP) in human populations been fully realized. Also exciting are recent studies of somatic retrotransposition in human tissues and reports of tumor-specific insertions, suggesting roles in tissue heterogeneity and tumorigenesis. Here we discuss mobile elements in human disease with an emphasis on exciting developments from the last several years. Copyright © 2013 Elsevier Ltd. All rights reserved.
SINE Retrotransposition: Evaluation of Alu Activity and Recovery of De Novo Inserts.
Ade, Catherine; Roy-Engel, Astrid M
2016-01-01
Mobile element activity is of great interest due to its impact on genomes. However, the types of mobile elements that inhabit any given genome are remarkably varied. Among the different varieties of mobile elements, the Short Interspersed Elements (SINEs) populate many genomes, including many mammalian species. Although SINEs are parasites of Long Interspersed Elements (LINEs), SINEs have been highly successful in both the primate and rodent genomes. When comparing copy numbers in mammals, SINEs have been vastly more successful than other nonautonomous elements, such as the retropseudogenes and SVA. Interestingly, in the human genome, copies of Alu (a primate SINE) outnumber LINE-1 (L1) copies 2 to 1. Estimates suggest that the retrotransposition rate for Alu is tenfold higher than that of LINE-1, with about 1 insert in every twenty births. Furthermore, Alu-induced mutagenesis is responsible for the majority of the documented instances of human retroelement insertion-induced disease. However, little is known about what contributes to these observed differences between SINEs and LINEs. The development of an assay to monitor SINE retrotransposition in culture has become an important tool for the elucidation of some of these differences. In this chapter, we present details of the SINE retrotransposition assay and the recovery of de novo inserts. We also focus on the nuances that are unique to the SINE assay.
[Utility of chromosome banding with ALU I enzyme for identifying methylated areas in breast cancer].
Rojas-Atencio, Alicia; Yamarte, Leonard; Urdaneta, Karelis; Soto-Alvarez, Marisol; Alvarez Nava, Francisco; Cañizalez, Jenny; Quintero, Maribel; Atencio, Raquel; González, Richard
2012-12-01
Cancer is a group of disorders characterized by uncontrolled cell growth which is produced by two successive events: increased cell proliferation (tumor or neoplasia) and the invasive capacity of these cells (metastasis). DNA methylation is an epigenetic process which has been implicated as an important pathogenic factor in cancer. DNA methylation participates in the regulation of gene expression, directly, by preventing the binding of transcription factors, and indirectly, by promoting the "closed" structure of the chromatin. The objectives of this study were to identify hypermethylated chromosomal regions through the use of the restriction endonuclease Alu I, and to relate these regions cytogenetically with tumor suppressor gene loci. Sixty peripheral blood samples of females with breast cancer were analyzed. Cell cultures were performed and cytogenetic spreads, previously digested with Alu I enzyme, were stained with Giemsa. Chromosomal centromeric and non-centromeric regions were stained in 37% of cases. About 96% of stained hypermethylated chromosomal regions (1q, 2q, 6q) were linked with methylated genes associated with breast cancer. In addition, centromeric regions in chromosomes 3, 4, 8, 13, 14, 15 and 17, usually unstained, were found positive to digestion with Alu I enzyme and Giemsa staining. We suggest the importance of this technique for the global visualization of the genome, which can find methylated genes related to breast cancer, and thus lead to a specific therapy, and therefore a better therapeutic response.
Khusainova, R I; Akhmetova, V L; Kutuev, I A; Salimova, A Z; Korshunova, T Iu; Lebedev, Iu B; Khusnutdinova, E K
2004-04-01
Nine Alu loci (Ya5NBC5, Ya5NBC27, Ya5NBC148, Ya5NBC182, Ya5NBC361, ACE, ApoA1, PV92, TPA25) were analyzed in six ethnic populations (Trans-Ural Bashkirs, Tatars-Mishars, Mordovians-Moksha, Mountain Maris, Udmurts, and Komi-Permyaks) of the Volga-Ural region and in three Central Asian populations (Uzbeks, Kazakhs, and Uigurs). All Alu insertions analyzed appeared to be polymorphic in all populations examined. The frequency of insertion varied from 0.110 in Mountain Maris at the Ya5NBC5 locus to 0.914 in Tatars at the ApoA1 locus. The data on the allele frequency distribution at nine loci point to the existence of substantial genetic diversity in the populations examined. The value of the observed heterozygosity averaged over nine Alu insertions varied from 0.326 in Mountain Maris to 0.445 in Kazakhs and Uigurs. The level of the interpopulation genetic differences for the Volga-Ural population (Fst = 0.061) was higher than for the populations of Central Asia (Fst = 0.024), Europe (Fst = 0.02), and Southeastern Asia (Fst = 0.018). The populations examined were highly differentiated with respect to both linguistic characteristics and geographical position. The data obtained confirmed the effectiveness of the marker system used for the assessment of genetic differentiation and the relationships between the ethnic groups.
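As a rough illustration of how an insertion frequency relates to heterozygosity at a biallelic Alu locus, the sketch below computes the expected heterozygosity 2p(1-p) under Hardy-Weinberg assumptions. This is an assumption made for illustration: the abstract reports observed heterozygosity averaged over nine loci, which is a related but different quantity, and the function name `expected_heterozygosity` is hypothetical. The two frequencies are the extremes quoted in the abstract.

```python
def expected_heterozygosity(p):
    """Expected heterozygosity 2p(1-p) for a biallelic locus.

    p is the insertion-allele frequency; Hardy-Weinberg proportions are
    assumed here, which the abstract itself does not state.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("allele frequency must lie in [0, 1]")
    return 2.0 * p * (1.0 - p)


# Extreme insertion frequencies quoted in the abstract.
for label, p in [("Ya5NBC5, Mountain Maris", 0.110), ("ApoA1, Tatars", 0.914)]:
    print(f"{label}: p = {p:.3f}, expected H = {expected_heterozygosity(p):.4f}")
```

Under this model the value peaks at 0.5 when p = 0.5, so loci with very low or very high insertion frequencies contribute little to the average.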
Polymorphic Alu Insertion/Deletion in Different Caste and Tribal Populations from South India
Chinniah, Rathika; Vijayan, Murali; Thirunavukkarasu, Manikandan; Mani, Dhivakar; Raju, Kamaraj; Ravi, Padma Malini; Sivanadham, Ramgopal; C, Kandeepan; N, Mahalakshmi; Karuppiah, Balakrishnan
2016-01-01
Seven human-specific Alu markers were studied in 574 unrelated individuals from 10 endogamous groups and 2 hill tribes of Tamil Nadu and Kerala states. DNA was isolated, amplified by PCR-SSP, and subjected to agarose gel electrophoresis, and genotypes were assigned for various Alu loci. Average heterozygosity among caste populations was in the range of 0.292–0.468. Among tribes, the average heterozygosity was higher for Paliyan (0.3759) than for Kani (0.2915). Frequency differences were prominent in all loci studied except Alu CD4. For Alu CD4, the frequency was 0.0363 in Yadavas, a traditional pastoral and herd maintaining population, and 0.2439 in Narikuravars, a nomadic gypsy population. The overall genetic difference (Gst) of 12 populations (castes and tribes) studied was 3.6%, which corresponds to the Gst values of 3.6% recorded earlier for Western Asian populations. Thus, our study confirms the genetic similarities between West Asian populations and South Indian castes and tribes and supported the large scale coastal migrations from Africa into India through West Asia. However, the average genetic difference (Gst) of Kani and Paliyan tribes with other South Indian tribes studied earlier was 8.3%. The average Gst of combined South and North Indian Tribes (CSNIT) was 9.5%. Neighbor joining tree constructed showed close proximity of Kani and Paliyan tribal groups to the other two South Indian tribes, Toda and Irula of Nilgiri hills studied earlier. Further, the analysis revealed the affinities among populations and confirmed the presence of North and South India specific lineages. Our findings have documented the highly diverse (micro differentiated) nature of South Indian tribes, predominantly due to isolation, than the endogamous population groups of South India. 
Thus, our study firmly established the genetic relationship of South Indian castes and tribes and supported the proposed large scale ancestral migrations from Africa, particularly into South India through West Asian corridor. PMID:27315142
Comparison of simple sequence repeats in 19 Archaea.
Trivedi, S
2006-12-05
All organisms that have been studied until now have been found to have differential distribution of simple sequence repeats (SSRs), with more SSRs in intergenic than in coding sequences. SSR distribution was investigated in Archaea genomes where complete chromosome sequences of 19 Archaea were analyzed with the program SPUTNIK to find di- to penta-nucleotide repeats. The number of repeats was determined for the complete chromosome sequences and for the coding and non-coding sequences. Different from what has been found for other groups of organisms, there is an abundance of SSRs in coding regions of the genome of some Archaea. Dinucleotide repeats were rare and CG repeats were found in only two Archaea. In general, trinucleotide repeats are the most abundant SSR motifs; however, pentanucleotide repeats are abundant in some Archaea. Some of the tetranucleotide and pentanucleotide repeat motifs are organism specific. In general, repeats are short and CG-rich repeats are present in Archaea having a CG-rich genome. Among the 19 Archaea, SSR density was not correlated with genome size or with optimum growth temperature. Pentanucleotide density had an inverse correlation with the CG content of the genome.
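The kind of di- to penta-nucleotide scan described above can be illustrated with a minimal sketch. This is not the SPUTNIK algorithm used in the study; it is a plain regular-expression search for perfect tandem repeats only, and the function name `find_ssrs` and the `min_repeats` threshold are assumptions chosen for illustration.

```python
import re


def find_ssrs(seq, min_unit=2, max_unit=5, min_repeats=3):
    """Return sorted (start, unit, copies) tuples for perfect tandem repeats.

    Only exact repeats are reported; the thresholds are illustrative
    assumptions, not values taken from the study.
    """
    seq = seq.upper()
    hits = []
    for k in range(min_unit, max_unit + 1):
        # A k-mer followed by at least (min_repeats - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for m in pattern.finditer(seq):
            unit = m.group(1)
            # Skip units that are themselves repeats of a shorter motif (e.g. "CACA").
            if any(unit == unit[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), unit, len(m.group(0)) // k))
    return sorted(hits)


if __name__ == "__main__":
    # Toy sequence containing a (CA)4 and a (CAG)3 repeat.
    for start, unit, copies in find_ssrs("GTCACACACAGTTCAGCAGCAGT"):
        print(start, unit, copies)
```

Counting such hits separately for coding and non-coding coordinates would give the per-region tallies the abstract compares, although a dedicated tool would also handle imperfect repeats.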
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sugaya, K.; Fukagawa, T.; Matsumoto, K.
Cosmid walking of about 250 kb from MHC class III gene CYP21 to class II was conducted. The gene for receptor of advanced glycosylation end products of proteins (RAGE, a member of immunoglobulin super-family molecules), the PBX2 homeobox gene designated HOX12, and the human counterpart of the mouse mammary tumor gene int-3 were found. The contiguous RAGE and HOX12 genes were completely sequenced, and the human int-3 counterpart was partially sequenced and assigned to a Notch homolog. This human Notch homolog, designated NOTCH3, showed both the intracellular portion present in the mouse int-3 sequence and the extracellular portion absent in the int-3. It thus corresponds to the intact form of a Notch-type transmembrane protein. About 20 kb of dense Alu clustering was found just centromeric to the NOTCH3. 48 refs., 9 figs., 2 tabs.
[Mutation Analysis of 19 STR Loci in 20 723 Cases of Paternity Testing].
Bi, J; Chang, J J; Li, M X; Yu, C Y
2017-06-01
To observe and analyze the confirmed cases of paternity testing, and to explore the mutation rules of STR loci. The mutant STR loci were screened from 20 723 confirmed cases of paternity testing by Goldeneye 20A system. The mutation rates, and the sources, fragment length, steps and increased or decreased repeat sequences of mutant alleles were counted for the analysis of the characteristics of mutation-related factors. A total of 548 mutations were found on 19 STR loci, and 557 mutation events were observed. The loci mutation rate was 0.07‰-2.23‰. The ratio of paternal to maternal mutant events was 3.06:1. One step mutation was the main mutation, and the number of the increased repeat sequences was almost the same as the decreased repeat sequences. The repeat sequences were more likely to decrease in two steps mutation and above. Mutation mainly occurred in the medium allele, and the number of the increased repeat sequences was almost the same as the decreased repeat sequences. In long allele mutations, the decreased repeat sequences were significantly more than the increased repeat sequences. The number of the increased repeat sequences was almost the same as the decreased repeat sequences in paternal mutation, while the decreased repeat sequences were more than the increased in maternal mutation. There are significant differences in the mutation rate of each locus. When one or two loci do not conform to the genetic law, other detection systems should be added, and the PI value should be calculated in combination with the information on the mutant STR loci in order to further clarify the identification opinions. Copyright © by the Editorial Department of Journal of Forensic Medicine
Evolutionary history of 7SL RNA-derived SINEs in Supraprimates.
Kriegs, Jan Ole; Churakov, Gennady; Jurka, Jerzy; Brosius, Jürgen; Schmitz, Jürgen
2007-04-01
The evolutionary relationships of 7SL RNA-derived SINEs such as the primate Alu or the rodent B1 elements have hitherto been obscure. We established an unambiguous phylogenetic tree for Supraprimates, and derived intraordinal relationships of the 7SL RNA-derived SINEs. As well as new elements in Tupaia and primates, we also found that the purported ancestral fossil Alu monomer was restricted to Primates, and provide here the first description of a potential chimeric promoter box region in SINEs.
IC Piracy Protection by APUF and Logic Obfuscation
2014-01-01
Name of responsible person: Garrett S. Rose. ...activation energy respectively. A and B are technology dependent constants. As shown in Equation 2, the Vth shift heavily depends on temperature (T) and...shown in Figure 2, two 4-bit operands (operands A and B) are fed into each ALU and a 4-bit output (S1~S4) can be obtained from each ALU. For delay
Increased complexity of circRNA expression during species evolution.
Dong, Rui; Ma, Xu-Kai; Chen, Ling-Ling; Yang, Li
2017-08-03
Circular RNAs (circRNAs) are broadly identified from precursor mRNA (pre-mRNA) back-splicing across various species. Recent studies have suggested a cell-/tissue-specific manner of circRNA expression. However, the distinct expression pattern of circRNAs among species and its underlying mechanism still remain to be explored. Here, we systematically compared circRNA expression from human and mouse, and found that only a small portion of human circRNAs could be determined in parallel mouse samples. The conserved circRNA expression between human and mouse is correlated with the existence of orientation-opposite complementary sequences in introns that flank back-spliced exons in both species, but not the circRNA sequences themselves. Quantification of RNA pairing capacity of orientation-opposite complementary sequences across circRNA-flanking introns by Complementary Sequence Index (CSI) identifies that among all types of complementary sequences, SINEs, especially Alu elements in human, contribute the most to circRNA formation and that their diverse distribution across species leads to the increased complexity of circRNA expression during species evolution. Together, our integrated and comparative reference catalog of circRNAs in different species reveals a species-specific pattern of circRNA expression and suggests a previously under-appreciated impact of fast-evolved SINEs on the regulation of (circRNA) gene expression.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-01-01
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363
NASA Astrophysics Data System (ADS)
Li, Qi; Akihiro, Kijima
2007-01-01
A microsatellite-enriched library was constructed using the magnetic bead hybridization selection method, and the microsatellite DNA sequences were analyzed in Pacific abalone Haliotis discus hannai. Three hundred and fifty white colonies were screened using a PCR-based technique, and 84 clones were identified as potentially containing a microsatellite repeat motif. The 84 clones were sequenced, and 42 microsatellites and 4 minisatellites with a minimum of five repeats were found (13.1% of white colonies screened). Besides the CA motif contained in the oligoprobe, we also found 16 other types of microsatellite repeats including a dinucleotide repeat, two tetranucleotide repeats, twelve pentanucleotide repeats and a hexanucleotide repeat. According to Weber (1990), the microsatellite sequences obtained could be categorized structurally into perfect repeats (73.3%), imperfect repeats (13.3%), and compound repeats (13.4%). Among the microsatellite repeats, relatively short arrays (<20 repeats) were most abundant, accounting for 75.0%. The longest microsatellite had 48 repeats, and the average number of repeats was 13.4. The data on the composition and length distribution of microsatellites obtained in the present study can be useful for choosing the repeat motifs for microsatellite isolation in other abalone species.
Iron toxicity in the retina requires Alu RNA and the NLRP3 inflammasome
Gelfand, Bradley D.; Wright, Charles B.; Kim, Younghee; Yasuma, Tetsuhiro; Yasuma, Reo; Li, Shengjian; Fowler, Benjamin J.; Bastos-Carvalho, Ana; Kerur, Nagaraj; Uittenbogaard, Annette; Han, Youn Seon; Lou, Dingyuan; Kleinman, Mark E.; McDonald, W. Hayes; Núñez, Gabriel; Georgel, Philippe; Dunaief, Joshua L.; Ambati, Jayakrishna
2015-01-01
Excess iron induces tissue damage and is implicated in age-related macular degeneration (AMD). Iron toxicity is widely attributed to hydroxyl radical formation through Fenton's reaction. We report that excess iron, but not other Fenton catalytic metals, induces activation of the NLRP3 inflammasome, a pathway also implicated in AMD. Additionally, iron-induced degeneration of the retinal pigmented epithelium (RPE) is suppressed in mice lacking inflammasome components Caspase-1/11 or Nlrp3 or by inhibition of Caspase-1. Iron overload increases abundance of RNAs transcribed from short interspersed nuclear elements (SINEs): Alu RNAs and the rodent equivalent B1 and B2 RNAs, which are inflammasome agonists. Targeting Alu or B2 RNA prevents iron-induced inflammasome activation and RPE degeneration. Iron-induced SINE RNA accumulation is due to suppression of DICER1 via sequestration of the co-factor poly(C)-binding protein 2 (PCBP2). These findings reveal an unexpected mechanism of iron toxicity, with implications for AMD and neurodegenerative diseases associated with excess iron. PMID:26074074
Control of myogenesis by rodent SINE-containing lncRNAs
Wang, Jiashi; Gong, Chenguang; Maquat, Lynne E.
2013-01-01
Staufen1-mediated mRNA decay (SMD) degrades mRNAs that harbor a Staufen1-binding site (SBS) in their 3′ untranslated regions (UTRs). Human SBSs can form by intermolecular base-pairing between a 3′ UTR Alu element and an Alu element within a long noncoding RNA (lncRNA) called a ½-sbsRNA. Since Alu elements are confined to primates, it was unclear how SMD occurs in rodents. Here we identify mouse mRNA 3′ UTRs and lncRNAs that contain a B1, B2, B4, or identifier (ID) element. We show that SMD occurs in mouse cells via mRNA–lncRNA base-pairing of partially complementary elements and that mouse ½-sbsRNA (m½-sbsRNA)-triggered SMD regulates C2C12 cell myogenesis. Our findings define new roles for lncRNAs as well as B and ID short interspersed elements (SINEs) in mice that undoubtedly influence many developmental and homeostatic pathways. PMID:23558772
Kim, Younghee; Tarallo, Valeria; Kerur, Nagaraj; Yasuma, Tetsuhiro; Gelfand, Bradley D.; Bastos-Carvalho, Ana; Hirano, Yoshio; Yasuma, Reo; Mizutani, Takeshi; Fowler, Benjamin J.; Li, Shengjian; Kaneko, Hiroki; Bogdanovich, Sasha; Ambati, Balamurali K.; Hinton, David R.; Hauswirth, William W.; Hakem, Razqallah; Wright, Charles; Ambati, Jayakrishna
2014-01-01
Geographic atrophy, an advanced form of age-related macular degeneration (AMD) characterized by death of the retinal pigmented epithelium (RPE), causes untreatable blindness in millions worldwide. The RPE of human eyes with geographic atrophy accumulates toxic Alu RNA in response to a deficit in the enzyme DICER1, which in turn leads to activation of the NLRP3 inflammasome and elaboration of IL-18. Despite these recent insights, it is still unclear how RPE cells die during the course of the disease. In this study, we implicate the involvement of Caspase-8 as a critical mediator of RPE degeneration. Here we show that DICER1 deficiency, Alu RNA accumulation, and IL-18 up-regulation lead to RPE cell death via activation of Caspase-8 through a Fas ligand-dependent mechanism. Coupled with our observation of increased Caspase-8 expression in the RPE of human eyes with geographic atrophy, our findings provide a rationale for targeting this apoptotic pathway in this disease. PMID:25349431
Cochran, Susan A.; Gibbs, Ann E.; D'Antonio, Nicole L.; Storlazzi, Curt D.
2016-05-18
The coral reef in Faga‘alu Bay, Tutuila, American Samoa, has suffered numerous natural and anthropogenic stresses. Areas once dominated by live coral are now mostly rubble surfaces covered with turf or macroalgae. In an effort to improve the health and resilience of the coral reef system, the U.S. Coral Reef Task Force selected Faga‘alu Bay as a priority study area. To support these efforts, the U.S. Geological Survey mapped nearly 1 km² of seafloor to depths of about 60 m. Unconsolidated sediment (predominantly sand) constitutes slightly greater than 50 percent of the seafloor in the mapped area; reef and other hardbottom potentially available for coral recruitment constitute nearly 50 percent of the mapped area. Of this potentially available hardbottom, only slightly greater than 37 percent is covered with at least 10 percent coral, which is fairly evenly distributed between the reef flat, fore reef, and offshore bank/shelf.
Negishi, Masamitsu; Wongpalee, Somsakul P.; Sarkar, Sukumar; Park, Jonghoon; Lee, Kyung Yong; Shibata, Yoshiyuki; Reon, Brian J.; Abounader, Roger; Suzuki, Yutaka; Sugano, Sumio; Dutta, Anindya
2014-01-01
Long noncoding RNAs (lncRNAs) have emerged as major regulators of cell physiology, but many have no known function. CDKN1A/p21 is an important inhibitor of the cell-cycle, regulator of the DNA damage response and effector of the tumor suppressor p53, playing a crucial role in tumor development and prevention. In order to identify a regulator for tumor progression, we performed an siRNA screen of human lncRNAs required for cell proliferation, and identified a novel lncRNA, APTR, that acts in trans to repress the CDKN1A/p21 promoter independent of p53 to promote cell proliferation. APTR associates with the promoter of CDKN1A/p21 and this association requires a complementary-Alu sequence encoded in APTR. A different module of APTR associates with and recruits the Polycomb repressive complex 2 (PRC2) to epigenetically repress the p21 promoter. A decrease in APTR is necessary for the induction of p21 after heat stress and DNA damage by doxorubicin, and the levels of APTR and p21 are anti-correlated in human glioblastomas. Our data identify a new regulator of the cell-cycle inhibitor CDKN1A/p21 that acts as a proliferative factor in cancer cell lines and in glioblastomas and demonstrate that Alu elements present in lncRNAs can contribute to targeting regulatory lncRNAs to promoters. PMID:24748121
Comparative and demographic analysis of orangutan genomes
Locke, Devin P.; Hillier, LaDeana W.; Warren, Wesley C.; Worley, Kim C.; Nazareth, Lynne V.; Muzny, Donna M.; Yang, Shiaw-Pyng; Wang, Zhengyuan; Chinwalla, Asif T.; Minx, Pat; Mitreva, Makedonka; Cook, Lisa; Delehaunty, Kim D.; Fronick, Catrina; Schmidt, Heather; Fulton, Lucinda A.; Fulton, Robert S.; Nelson, Joanne O.; Magrini, Vincent; Pohl, Craig; Graves, Tina A.; Markovic, Chris; Cree, Andy; Dinh, Huyen H.; Hume, Jennifer; Kovar, Christie L.; Fowler, Gerald R.; Lunter, Gerton; Meader, Stephen; Heger, Andreas; Ponting, Chris P.; Marques-Bonet, Tomas; Alkan, Can; Chen, Lin; Cheng, Ze; Kidd, Jeffrey M.; Eichler, Evan E.; White, Simon; Searle, Stephen; Vilella, Albert J.; Chen, Yuan; Flicek, Paul; Ma, Jian; Raney, Brian; Suh, Bernard; Burhans, Richard; Herrero, Javier; Haussler, David; Faria, Rui; Fernando, Olga; Darré, Fleur; Farré, Domènec; Gazave, Elodie; Oliva, Meritxell; Navarro, Arcadi; Roberto, Roberta; Capozzi, Oronzo; Archidiacono, Nicoletta; Valle, Giuliano Della; Purgato, Stefania; Rocchi, Mariano; Konkel, Miriam K.; Walker, Jerilyn A.; Ullmer, Brygg; Batzer, Mark A.; Smit, Arian F. A.; Hubley, Robert; Casola, Claudio; Schrider, Daniel R.; Hahn, Matthew W.; Quesada, Victor; Puente, Xose S.; Ordoñez, Gonzalo R.; López-Otín, Carlos; Vinar, Tomas; Brejova, Brona; Ratan, Aakrosh; Harris, Robert S.; Miller, Webb; Kosiol, Carolin; Lawson, Heather A.; Taliwal, Vikas; Martins, André L.; Siepel, Adam; RoyChoudhury, Arindam; Ma, Xin; Degenhardt, Jeremiah; Bustamante, Carlos D.; Gutenkunst, Ryan N.; Mailund, Thomas; Dutheil, Julien Y.; Hobolth, Asger; Schierup, Mikkel H.; Chemnick, Leona; Ryder, Oliver A.; Yoshinaga, Yuko; de Jong, Pieter J.; Weinstock, George M.; Rogers, Jeffrey; Mardis, Elaine R.; Gibbs, Richard A.; Wilson, Richard K.
2011-01-01
“Orangutan” is derived from the Malay term “man of the forest” and aptly describes the Southeast Asian great apes native to Sumatra and Borneo. The orangutan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orangutan draft genome assembly and short read sequence data from five Sumatran and five Bornean orangutan genomes. Our analyses reveal that, compared to other primates, the orangutan genome has many unique features. Structural evolution of the orangutan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe the first primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orangutan genome structure. Orangutans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400k years ago (ya), is more recent than most previous studies and underscores the complexity of the orangutan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (Ne) expanded exponentially relative to the ancestral Ne after the split, while Bornean Ne declined over the same period. 
Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts. PMID:21270892
Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L
2013-01-01
Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705
Larsen, Svend Arild; Mogensen, Line; Dietz, Rune; Baagøe, Hans Jørgen; Andersen, Mogens; Werge, Thomas; Rasmussen, Henrik Berg
2005-12-01
In this study we have identified and characterized dopamine receptor D4 (DRD4) exon III tandem repeats in 33 publicly available nucleotide sequences from different mammalian species. We found that the tandem repeat in canids could be described in a novel and simple way, namely, as a structure composed of 15- and 12-bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, and donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp tandem repeat, while a tandem repeat consisting of 27-bp modules was identified in a sequence from European badger. Both these tandem repeats were composed of 9-bp basic units, which were closely related to the 9-bp repeat modules identified in the mink and ferret. Tandem repeats could not be identified in sequences from rodents. All tandem repeats possessed a high GC content with a strong bias for C. On phylogenetic analysis of the tandem repeats, evolutionarily related species were clustered into the same groups. The degree of conservation of the tandem repeats varied significantly between species. The deduced amino acid sequences of most of the tandem repeats exhibited a high propensity for disorder. This was also the case with an amino acid sequence of the human DRD4 exon III tandem repeat, which was included in the study for comparative purposes. We identified proline-containing motifs for SH3 and WW domain binding proteins, potential phosphorylation sites, PDZ domain binding motifs, and FHA domain binding motifs in the amino acid sequences of the tandem repeats. The numbers of potential functional sites varied markedly between species. Our observations provide a platform for future studies of the architecture and evolution of the DRD4 exon III tandem repeat, and they suggest that differences in the structure of this tandem repeat contribute to specialization and generation of diversity in receptor function.
Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.
2010-01-01
Location analysis for estrogen receptor-α (ERα)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERα-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10–20% nucleotide deviation from the canonical ERE sequence. We demonstrate that ∼50% of all ERα-bound loci do not have a discernable ERE and show that most ERα-bound EREs are not perfect consensus EREs. Approximately one-third of all ERα-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERα-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERα binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers. PMID:20047966
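The two detection stringencies described above can be illustrated with a small sketch. It assumes the widely cited canonical ERE, GGTCAnnnTGACC, and it assumes that "nucleotide deviation" is the fraction of mismatches over the 10 half-site positions; the abstract does not state how the authors computed it, so the function names and the scoring rule here are illustrative only.

```python
# Canonical ERE: two inverted half-sites separated by a 3-bp spacer (N = any base).
CANONICAL_ERE = "GGTCANNNTGACC"


def ere_deviation(site: str) -> float:
    """Fraction of half-site positions that differ from the canonical ERE.

    Assumption: only the 10 non-spacer positions are scored.
    """
    site = site.upper()
    if len(site) != len(CANONICAL_ERE):
        raise ValueError("site must be 13 nt long")
    scored = [(c, s) for c, s in zip(CANONICAL_ERE, site) if c != "N"]
    mismatches = sum(c != s for c, s in scored)
    return mismatches / len(scored)


def spacer_is_cwg(site: str) -> bool:
    """True if the 3-bp spacer matches C(A/T)G, i.e. CAG or CTG."""
    return site.upper()[5:8] in {"CAG", "CTG"}


if __name__ == "__main__":
    for candidate in ("GGTCACAGTGACC", "GGTCACTGTGACA", "AGTCATTTTGACC"):
        print(candidate, ere_deviation(candidate), spacer_is_cwg(candidate))
```

Under this scoring rule a site with no half-site mismatch falls in the <10% class and a site with one or two mismatches falls in the 10–20% class.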
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-11-16
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Liscovitch, Noa; Bazak, Lily; Levanon, Erez Y; Chechik, Gal
2014-01-01
A-to-I RNA editing by adenosine deaminases acting on RNA is a post-transcriptional modification that is crucial for normal life and development in vertebrates. RNA editing has been shown to be very abundant in the human transcriptome, specifically at the primate-specific Alu elements. The functional role of this wide-spread effect is still not clear; it is believed that editing of transcripts is a mechanism for their down-regulation via processes such as nuclear retention or RNA degradation. Here we combine 2 neural gene expression datasets with genome-level editing information to examine the relation between the expression of ADAR genes with the expression of their target genes. Specifically, we computed the spatial correlation across structures of post-mortem human brains between ADAR and a large set of targets that were found to be edited in their Alu repeats. Surprisingly, we found that a large fraction of the edited genes are positively correlated with ADAR, opposing the assumption that editing would reduce expression. When considering the correlations between ADAR and its targets over development, 2 gene subsets emerge, positively correlated and negatively correlated with ADAR expression. Specifically, in embryonic time points, ADAR is positively correlated with many genes related to RNA processing and regulation of gene expression. These findings imply that the suggested mechanism of regulation of expression by editing is probably not a global one; ADAR expression does not have a genome wide effect reducing the expression of editing targets. It is possible, however, that RNA editing by ADAR in non-coding regions of the gene might be a part of a more complex expression regulation mechanism. PMID:25692240
Prenatal Arsenic Exposure and DNA Methylation in Maternal and Umbilical Cord Blood Leukocytes
Baccarelli, Andrea; Hoffman, Elaine; Tarantini, Letizia; Quamruzzaman, Quazi; Rahman, Mahmuder; Mahiuddin, Golam; Mostofa, Golam; Hsueh, Yu-Mei; Wright, Robert O.; Christiani, David C.
2012-01-01
Background: Arsenic is an epigenetic toxicant and could influence fetal developmental programming. Objectives: We evaluated the association between arsenic exposure and DNA methylation in maternal and umbilical cord leukocytes. Methods: Drinking-water and urine samples were collected when women were at ≤ 28 weeks gestation; the samples were analyzed for arsenic using inductively coupled plasma mass spectrometry. DNA methylation at CpG sites in p16 (n = 7) and p53 (n = 4), and in LINE-1 and Alu repetitive elements (3 CpG sites in each), was quantified using pyrosequencing in 113 pairs of maternal and umbilical blood samples. We used general linear models to evaluate the relationship between DNA methylation and tertiles of arsenic exposure. Results: Mean (± SD) drinking-water arsenic concentration was 14.8 ± 36.2 μg/L (range: < 1–230 μg/L). Methylation in LINE-1 increased by 1.36% [95% confidence interval (CI): 0.52, 2.21%] and 1.08% (95% CI: 0.07, 2.10%) in umbilical cord and maternal leukocytes, respectively, in association with the highest versus lowest tertile of total urinary arsenic per gram creatinine. Arsenic exposure was also associated with higher methylation of some of the tested CpG sites in the promoter region of p16 in umbilical cord and maternal leukocytes. No associations were observed for Alu or p53 methylation. Conclusions: Exposure to higher levels of arsenic was positively associated with DNA methylation in LINE-1 repeated elements, and to a lesser degree at CpG sites within the promoter region of the tumor suppressor gene p16. Associations were observed in both maternal and fetal leukocytes. Future research is needed to confirm these results and determine if these small increases in methylation are associated with any health effects. PMID:22466225
Okimoto, R; Chamberlin, H M; Macfarlane, J L; Wolstenholme, D R
1991-01-01
Within a 7 kb segment of the mtDNA molecule of the root knot nematode, Meloidogyne javanica, that lacks standard mitochondrial genes, are three sets of strictly tandemly arranged, direct repeat sequences: approximately 36 copies of a 102 ntp sequence that contains a TaqI site; 11 copies of a 63 ntp sequence, and 5 copies of an 8 ntp sequence. The 7 kb repeat-containing segment is bounded by putative tRNAasp and tRNAf-met genes and the arrangement of sequences within this segment is: the tRNAasp gene; a unique 1,528 ntp segment that contains two highly stable hairpin-forming sequences; the 102 ntp repeat set; the 8 ntp repeat set; a unique 1,068 ntp segment; the 63 ntp repeat set; and the tRNAf-met gene. The nucleotide sequences of the 102 ntp copies and the 63 ntp copies have been conserved among the species examined. Data from Southern hybridization experiments indicate that 102 ntp and 63 ntp repeats occur in the mtDNAs of three, two and two races of M.incognita, M.hapla and M.arenaria, respectively. Nucleotide sequences of the M.incognita Race-3 102 ntp repeat were found to be either identical or highly similar to those of the M.javanica 102 ntp repeat. Differences in migration distance and number of 102 ntp repeat-containing bands seen in Southern hybridization autoradiographs of restriction-digested mtDNAs of M.javanica and the different host races of M.incognita, M.hapla and M.arenaria are sufficient to distinguish the different host races of each species. PMID:2027769
Chemical Approaches for Structure and Function of RNA in Postgenomic Era
Ro-Choi, Tae Suk; Choi, Yong Chun
2012-01-01
In the study of cellular RNA chemistry, a major thrust of research focused for decades on sequence determination. Structures of snRNAs (4.5S RNA I (Alu), U1, U2, U3, U4, U5, and U6) were determined at Baylor College of Medicine, Houston, Tex, early in the pregenomic era. They show novel modifications including base methylation, sugar methylation, 5′-cap structures (types 0–III) and sequence heterogeneity. This work posed the exciting problem of posttranscriptional modification and advanced significantly through technological revolutions during the pregenomic, genomic, and postgenomic eras. Presently, snRNA research is progressing in the enzymology of snRNA modifications, molecular evolution, the mechanism of spliceosome assembly, the chemical mechanism of intron removal, the higher-order structure of snRNA in the spliceosome, and the pathology of splicing. These efforts are converging on a final goal, the “Function and Structure of Spliceosome,” in addition to exciting new exploitation of other noncoding RNAs in all aspects of regulatory function. PMID:22347623
Staufen1 senses overall transcript secondary structure to regulate translation
Ricci, Emiliano P; Kucukural, Alper; Cenik, Can; Mercier, Blandine C; Singh, Guramrit; Heyer, Erin E; Ashar-Patel, Ami; Peng, Lingtao; Moore, Melissa J
2015-01-01
Human Staufen1 (Stau1) is a double-stranded RNA (dsRNA)-binding protein implicated in multiple post-transcriptional gene-regulatory processes. Here we combined RNA immunoprecipitation in tandem (RIPiT) with RNase footprinting, formaldehyde cross-linking, sonication-mediated RNA fragmentation and deep sequencing to map Staufen1-binding sites transcriptome wide. We find that Stau1 binds complex secondary structures containing multiple short helices, many of which are formed by inverted Alu elements in annotated 3′ untranslated regions (UTRs) or in ‘strongly distal’ 3′ UTRs. Stau1 also interacts with actively translating ribosomes and with mRNA coding sequences (CDSs) and 3′ UTRs in proportion to their GC content and propensity to form internal secondary structure. On mRNAs with high CDS GC content, higher Stau1 levels lead to greater ribosome densities, thus suggesting a general role for Stau1 in modulating translation elongation through structured CDS regions. Our results also indicate that Stau1 regulates translation of transcription-regulatory proteins. PMID:24336223
Variation, Repetition, And Choice
Abreu-Rodrigues, Josele; Lattal, Kennon A; dos Santos, Cristiano V; Matos, Ricardo A
2005-01-01
Experiment 1 investigated the controlling properties of variability contingencies on choice between repeated and variable responding. Pigeons were exposed to concurrent-chains schedules with two alternatives. In the REPEAT alternative, reinforcers in the terminal link depended on a single sequence of four responses. In the VARY alternative, a response sequence in the terminal link was reinforced only if it differed from the n previous sequences (lag criterion). The REPEAT contingency generated low, constant levels of sequence variation whereas the VARY contingency produced levels of sequence variation that increased with the lag criterion. Preference for the REPEAT alternative tended to increase directly with the degree of variation required for reinforcement. Experiment 2 examined the potential confounding effects in Experiment 1 of immediacy of reinforcement by yoking the interreinforcer intervals in the REPEAT alternative to those in the VARY alternative. Again, preference for REPEAT was a function of the lag criterion. Choice between varying and repeating behavior is discussed with respect to obtained behavioral variability, probability of reinforcement, delay of reinforcement, and switching within a sequence. PMID:15828592
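The lag criterion of the VARY alternative can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: it assumes that every emitted sequence enters the comparison history whether or not it was reinforced, and the function names are hypothetical.

```python
from collections import deque


def make_vary_schedule(lag: int):
    """Build a checker for a lag-n variability contingency.

    A four-response sequence (e.g. "LRLR") meets the criterion only if it
    differs from each of the `lag` most recent sequences.
    Assumption: all sequences, reinforced or not, are stored in the history.
    """
    history = deque(maxlen=lag)  # holds only the `lag` most recent sequences

    def meets_criterion(sequence: str) -> bool:
        ok = sequence not in history
        history.append(sequence)
        return ok

    return meets_criterion


if __name__ == "__main__":
    check = make_vary_schedule(lag=2)
    for seq in ("LRLR", "LRLR", "RRLL", "LLLL", "LRLR"):
        print(seq, check(seq))
```

With a lag of 0 the history stays empty and every sequence qualifies, which corresponds to no variability requirement at all.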
Assessment of Azorean ancestry by Alu insertion polymorphisms.
Branco, Claudia C; Palla, Raquel; Lino, Sílvia; Pacheco, Paula R; Cabral, Rita; De Fez, Laura; Peixoto, Bernardo R; Mota-Vieira, Luisa
2006-01-01
Knowledge of population ancestry from genetic markers is essential, for example, to understand the history of human migration and to carry out admixture and association studies. Here we assess the genome ancestry of the Azorean population through analysis of six Alu polymorphic sites (TPA-25, ACE, APO, B65, PV92, and D1) in 65 Azoreans and 30 Portuguese unrelated blood donors and compare data for the Y-chromosome and mtDNA. Allele frequencies were calculated by direct counting. Statistical analysis was performed using Arlequin 2.0. Nei's genetic distance was calculated with DISPAN software, and trees were constructed by neighbor joining (NJ) using PHYLIP 3.63. The results show that all Alu insertions were polymorphic. APO is the closest to fixation. The less frequent insertions are PV92 and D1 in the Azores and Portugal, respectively. ACE and TPA-25 show the highest values of heterozygosity in both populations. Allele frequencies are very similar to those obtained in European populations. These results are validated by the Y-chromosome and mtDNA data, where the majority of the maternal and paternal lineages are European. Overall, these data are reflected in the phylogenetic tree, in which the Azoreans and the Portuguese branch with Catalans, Andalusians, Moroccans, and Algerians. We conclude that the population of the Azores shows no significant genetic differences from that of mainland Portugal and that it is an outbred population. Moreover, the data validate the use of Alu insertion polymorphisms to assess the origin and history of human populations. Am. J. Hum. Biol. 18:223-226, 2006. (c) 2006 Wiley-Liss, Inc.
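Allele frequency by direct counting, as mentioned above, can be illustrated for a biallelic Alu insertion locus. The sketch below is not the Arlequin computation used in the study; the genotype counts are invented for illustration, and the heterozygosity shown is the simple Hardy-Weinberg expectation 2pq without a small-sample correction.

```python
def insertion_frequency(n_ins_ins: int, n_ins_del: int, n_del_del: int) -> float:
    """Frequency of the insertion allele by direct counting of gene copies."""
    n_individuals = n_ins_ins + n_ins_del + n_del_del
    if n_individuals == 0:
        raise ValueError("no genotypes supplied")
    # Each homozygote carries two insertion alleles, each heterozygote one.
    return (2 * n_ins_ins + n_ins_del) / (2 * n_individuals)


def expected_heterozygosity(p: float) -> float:
    """Expected heterozygosity 2pq for a biallelic locus (uncorrected)."""
    return 2 * p * (1 - p)


if __name__ == "__main__":
    # Hypothetical genotype counts for 65 individuals (not data from the study).
    p = insertion_frequency(n_ins_ins=20, n_ins_del=30, n_del_del=15)
    print(round(p, 4), round(expected_heterozygosity(p), 4))
```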
Boyne, Devon J; Friedenreich, Christine M; McIntyre, John B; Stanczyk, Frank Z; Courneya, Kerry S; King, Will D
2017-12-01
Epigenetic mechanisms may help to explain the complex and heterogeneous relation between sex hormones and cancer. Few studies have investigated the effects of sex hormones on epigenetic markers related to cancer risk such as levels of methylation within repetitive DNA elements. Our objective was to describe the association between endogenous sex hormone exposure and levels of LINE-1 and Alu methylation in healthy postmenopausal women. We nested a cross-sectional study within the Alberta Physical Activity and Breast Cancer Prevention Trial (2003-2006). Study participants consisted of healthy postmenopausal women who had never been diagnosed with cancer (n = 289). Sex hormone exposures included serum concentrations of estradiol, estrone, testosterone, androstenedione, and sex hormone-binding globulin. We estimated the participants' lifetime number of menstrual cycles (LNMC) as a proxy for cumulative exposure to ovarian sex hormones. Buffy coat samples were assessed for DNA methylation. Linear regression was used to model the associations of interest and to control for confounding. Both estradiol and estrone had a significant positive dose-response association with LINE-1 methylation. LNMC was associated with both LINE-1 and Alu methylation. Specifically, LNMC had a non-linear "U-shaped" association with LINE-1 methylation regardless of folate intake and a negative linear association with Alu methylation, but only amongst low folate consumers. Androgen exposure was not associated with either outcome. Current and cumulative estrogen exposure was associated with repetitive element DNA methylation in a group of healthy postmenopausal women. LINE-1 and Alu methylation may be epigenetic mechanisms through which estrogen exposure impacts cancer risk.
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.
Rehm, Charlotte; Wurmthaler, Lena A.; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S.
2015-01-01
In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1–5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6–9 nt are rare. In particular, G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes shows putative G4-forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed, with repeat numbers ranging from two up to 26 units. However, a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria. PMID:26695179
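A scan for tandem runs of the GGGNATC unit can be sketched with a regular expression. This is an illustrative search only, not the authors' pipeline: it scans a single strand, requires perfect units, and the minimum of two units follows the range reported above. The example sequence is made up.

```python
import re

UNIT = r"GGG[ACGT]ATC"                    # heptameric unit, N = any base
RUN = re.compile(rf"(?:{UNIT}){{2,}}")   # two or more units in tandem


def repeat_runs(seq: str) -> list[tuple[int, int]]:
    """Return (0-based start, number of units) for each tandem GGGNATC run.

    Assumption: only the given strand is scanned; a real analysis would also
    scan the reverse complement.
    """
    seq = seq.upper()
    return [(m.start(), len(m.group()) // 7) for m in RUN.finditer(seq)]


if __name__ == "__main__":
    example = "TT" + "GGGAATC" * 4 + "ACGT" + "GGGTATCGGGCATC" + "A"
    print(repeat_runs(example))
```

Counting units per run in this way is what would reveal a preference for four-unit runs, the number of G-tracts needed for a quadruplex.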
Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Fang, Zhide; Deininger, Prescott; Zhang, Kun
2013-08-28
The exonization of transposable elements (TEs) has proven to be a significant mechanism for the creation of novel exons. Existing knowledge of the retention patterns of TE exons in mRNAs was mainly established by the analysis of Expressed Sequence Tag (EST) data and microarray data. This study seeks to validate and extend previous studies on the expression of TE exons by an integrative statistical analysis of high throughput RNA sequencing data. We collected 26 RNA-seq datasets spanning multiple tissues and cancer types. The exon-level digital expressions (indicating retention rates in mRNAs) were quantified by a double normalized measure, called the rescaled RPKM (Reads Per Kilobase of exon model per Million mapped reads). We analyzed the distribution profiles and the variability (across samples and between tissue/disease groups) of TE exon expressions, and compared them with those of other constitutive or cassette exons. We inferred the effects of four genomic factors, including the location, length, cognate TE family and TE nucleotide proportion (RTE, see Methods section) of a TE exon, on the exons' expression level and expression variability. We also investigated the biological implications of an assembly of highly-expressed TE exons. Our analysis confirmed prior studies from the following four aspects. First, with relatively high expression variability, most TE exons in mRNAs, especially those without exact counterparts in the UCSC RefSeq (Reference Sequence) gene tables, demonstrate low but still detectable expression levels in most tissue samples. Second, the TE exons in coding DNA sequences (CDSs) are less highly expressed than those in 3' (5') untranslated regions (UTRs). Third, the exons derived from chronologically ancient repeat elements, such as MIRs, tend to be highly expressed in comparison with those derived from younger TEs. 
Fourth, the previously observed negative relationship between the lengths of exons and the inclusion levels in transcripts is also true for exonized TEs. Furthermore, our study resulted in several novel findings. They include: (1) for the TE exons with non-zero expression and as shown in most of the studied biological samples, a high TE nucleotide proportion leads to their lower retention rates in mRNAs; (2) the considered genomic features (i.e. a continuous variable such as the exon length or a category indicator such as 3'UTR) influence the expression level and the expression variability (CV) of TE exons in an inverse manner; (3) not only the exons derived from Alu elements but also the exons from the TEs of other families were preferentially established in zinc finger (ZNF) genes.
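The standard RPKM definition and a coefficient of variation (CV) across samples can be written out as a short sketch. The abstract's "rescaled RPKM" adds a further normalization that is not described here, so only plain RPKM is shown; the sample standard deviation used for the CV is an assumption, and the numbers are invented.

```python
from statistics import mean, stdev


def rpkm(exon_reads: int, exon_length_bp: int, total_mapped_reads: int) -> float:
    """Reads Per Kilobase of exon model per Million mapped reads."""
    if exon_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and library size must be positive")
    return exon_reads * 1e9 / (exon_length_bp * total_mapped_reads)


def coefficient_of_variation(values: list[float]) -> float:
    """CV = standard deviation / mean (sample standard deviation assumed)."""
    return stdev(values) / mean(values)


if __name__ == "__main__":
    # Hypothetical exon: 500 reads, 2,000 bp, library of 20 million mapped reads.
    print(rpkm(500, 2_000, 20_000_000))
    print(coefficient_of_variation([10.0, 12.5, 15.0]))
```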
Clayton, William; Eaton, Carla Jane; Dupont, Pierre-Yves; Gillanders, Tim; Cameron, Nick; Saikia, Sanjay; Scott, Barry
2017-01-01
Epichloë grass endophytes comprise a group of filamentous fungi of both sexual and asexual species. Because these endophytes are known for the beneficial characteristics they endow upon their grass hosts, their identification has been of great agronomic and scientific interest. Simple sequence repeat loci and the variation in their repeat elements have been used to rapidly identify endophyte species and strains; however, little is known of how the structure of repeat elements changes between species and strains, or where these repeat elements are located in the fungal genome. We report on an in-depth analysis of the structure and genomic location of the simple sequence repeat locus B10, commonly used for Epichloë endophyte species identification. The B10 repeat was found to be located within an exon of a putative bZIP transcription factor, suggesting possible impacts on polypeptide sequence and thus protein function. Analysis of this repeat in the asexual endophyte hybrid Epichloë uncinata revealed that the structure of B10 alleles reflects the ancestral species that hybridized to give rise to this species. Understanding the structure and sequence of these simple sequence repeats provides a useful set of tools for readily distinguishing strains and for gaining insights into the ancestral species that have undergone hybridization events.
TRAP: automated classification, quantification and annotation of tandemly repeated sequences.
Sobreira, Tiago José P; Durham, Alan M; Gruber, Arthur
2006-02-01
TRAP, the Tandem Repeats Analysis Program, is a Perl program that provides a unified set of analyses for the selection, classification, quantification and automated annotation of tandemly repeated sequences. TRAP uses the results of the Tandem Repeats Finder program to perform a global analysis of the satellite content of DNA sequences, permitting researchers to easily assess the tandem repeat content for both individual sequences and whole genomes. The results can be generated in convenient formats such as HTML and comma-separated values. TRAP can also be used to automatically generate annotation data in the format of feature table and GFF files.
Faragher, S G; Dalgarno, L
1986-07-20
The 3' untranslated (UT) sequences of the genomic RNAs of five geographic variants of the alphavirus Ross River virus (RRV) were determined and compared with the 3' UT sequence of RRV T48, the prototype strain. Part of the 3' UT region of Getah virus, a close serological relative of RRV, was also sequenced. The RRV 3' UT region varies markedly in length between variants. Large deletions or insertions, sequence rearrangements and single nucleotide substitutions are observed. A sequence tract of 49 to 58 nucleotides, which is repeated as four blocks in the RRV T48 3' UT region, occurs only once in the 3' UT region of one RRV strain (NB5092), indicating that the existence of repeat sequence blocks is not essential for RRV replication. However, the precise sequence of the 3' proximal copy of the repeat block and its position relative to the poly(A) tail were identical in all RRV isolates examined, suggesting that it has an important role in RRV replication. Nucleotide substitutions between RRV variants are distributed non-randomly along the length of the 3' UT region. The sequence of 120 to 130 nucleotides adjacent to the poly(A) tail is strongly conserved. Getah virus RNA contains three repeat sequence blocks in the 3' UT region. These are similar in sequence to those in RRV RNA but differ in their arrangement. Homology between the RRV and Getah 3' UT sequences is greatest in the 3' proximal repeat sequence block that shows three differences in 49 nucleotides. The 3' proximal repeat in Getah RNA occurs at the same position, relative to the poly(A) tail, as in all RRV variants. The RRV and Getah virus 3' UT sequences show extensive homology in the region between the 3' proximal repeat and the poly(A) tail but, apart from the repeat blocks themselves, they show no significant homology elsewhere.
Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K
2017-04-01
There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra- and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
DNA motifs associated with aberrant CpG island methylation.
Feltus, F Alex; Lee, Eva K; Costello, Joseph F; Plass, Christoph; Vertino, Paula M
2006-05-01
Epigenetic silencing involving the aberrant methylation of promoter region CpG islands is widely recognized as a tumor suppressor silencing mechanism in cancer. However, the molecular pathways underlying aberrant DNA methylation remain elusive. Recently we showed that, on a genome-wide level, CpG island loci differ in their intrinsic susceptibility to aberrant methylation and that this susceptibility can be predicted based on underlying sequence context. These data suggest that there are sequence/structural features that contribute to the protection from or susceptibility to aberrant methylation. Here we use motif elicitation coupled with classification techniques to identify DNA sequence motifs that selectively define methylation-prone or methylation-resistant CpG islands. Motifs common to 28 methylation-prone or 47 methylation-resistant CpG island-containing genomic fragments were determined using the MEME and MAST algorithms. The five most discriminatory motifs derived from methylation-prone sequences were found to be associated with CpG islands in general and were nonrandomly distributed throughout the genome. In contrast, the eight most discriminatory motifs derived from the methylation-resistant CpG islands were randomly distributed throughout the genome. Interestingly, this latter group tended to associate with Alu and other repetitive sequences. Used together, the frequency of occurrence of these motifs successfully discriminated methylation-prone and methylation-resistant CpG island groups with an accuracy of 87% after 10-fold cross-validation. The motifs identified here are candidate methylation-targeting or methylation-protection DNA sequences.
A Comprehensive Strategy for Accurate Mutation Detection of the Highly Homologous PMS2.
Li, Jianli; Dai, Hongzheng; Feng, Yanming; Tang, Jia; Chen, Stella; Tian, Xia; Gorman, Elizabeth; Schmitt, Eric S; Hansen, Terah A A; Wang, Jing; Plon, Sharon E; Zhang, Victor Wei; Wong, Lee-Jun C
2015-09-01
Germline mutations in the DNA mismatch repair gene PMS2 underlie the cancer susceptibility syndrome, Lynch syndrome. However, accurate molecular testing of PMS2 is complicated by a large number of highly homologous sequences. To establish a comprehensive approach for mutation detection of PMS2, we have designed a strategy combining targeted capture next-generation sequencing (NGS), multiplex ligation-dependent probe amplification, and long-range PCR followed by NGS to simultaneously detect point mutations and copy number changes of PMS2. Exonic deletions (E2 to E9, E5 to E9, E8, E10, E14, and E1 to E15), duplications (E11 to E12), and a nonsense mutation, p.S22*, were identified. Traditional multiplex ligation-dependent probe amplification and Sanger sequencing approaches cannot differentiate the origin of the exonic deletions in the 3' region when PMS2 and PMS2CL share identical sequences as a result of gene conversion. Our approach allows unambiguous identification of mutations in the active gene with a straightforward long-range-PCR/NGS method. Breakpoint analysis of multiple samples revealed that recurrent exon 14 deletions are mediated by homologous Alu sequences. Our comprehensive approach provides a reliable tool for accurate molecular analysis of genes containing multiple copies of highly homologous sequences and should improve PMS2 molecular analysis for patients with Lynch syndrome. Copyright © 2015 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Suhasini, G; Sonaa, E; Shila, S; Srikumari, C R; Jayaraman, G; Ramesh, A
2011-08-01
We analysed the genetic structure of ≈ 1000 samples representing 27 ethnic groups settled in Tamil Nadu, south India, derived from two linguistic families (Dravidians and Indo-Europeans) representing four religious groups (Hinduism, Islam, Christianity and Jainism) using 11 mtDNA markers. Out of 27 ethnic groups, four are in situ populations (Anglo-Indian, Labbai Muslim, Nadar Christian and south Indian Jain) and two are migrants (Gypsy and north Indian Jain) from north India to Tamil Nadu, and 21 are native ethnic groups. Six of the markers we used were monomorphic (HaeIII663, HpaI3592, AluI5176, AluI7025, AluI13262, 9-bp deletion) and five markers were polymorphic (DdeI10394, AluI10397, HinfI12308, HincII13259 and HaeIII16517). Haplogroup frequencies, genetic affinities and admixture analysis are based on the genotype data of polymorphic markers observed in these populations. Haplogroup frequencies indicate that various ethnic groups entered Tamil Nadu during different time periods. Genetic affinities and admixture estimates revealed that the ethnic groups possessing advanced knowledge of farming cluster in one branch (C) and could be late-arriving settlers, as agriculture was introduced to this region about 5 to 3 thousand years ago. In situ ethnic groups appear to have arisen at various times as a result of the prevailing dominant socio-cultural forces. The hierarchical Hindu caste system created many ethnic groups over the course of its history; some of them became isolated for considerable periods of time. Overall, among Tamil ethnic groups, in spite of the caste system's rigidity, the flexibility built into the system in the form of hypergamy and hypogamy has allowed maternal gene flow between them.
NASA Astrophysics Data System (ADS)
Inkoom, J. N.; Fürst, C.
2014-12-01
The relationship between agricultural land uses (ALU) and their impact on ecosystem services (ES), including biodiversity conservation, is complex. This complexity has been augmented by isolated research on the impact of ALU on the landscape's capacity to provide ES in most climatically vulnerable areas of Sub-Saharan Africa. Though a considerable number of studies emphasise the nexus between specific land use types and their impact on ES, a sufficient modelling basis for an empirical consideration of spatial interactions between different agricultural land uses at the landscape scale within peri-urban areas in Sub-Saharan Africa is consistently missing. The need to assess and address significant issues regarding size, shape, spatial location, and interactivity of different land use patches in assessing land use interactions and their impact on ecosystem service provision necessitated this investigation. To formulate a methodology that corresponds to this complexity, ES obtained from characteristically agricultural and urbanizing landscapes were mapped using analytical hierarchical processes and management expert approaches. Further, landscape metrics and mean enrichment factor approaches are explored as neighbourhood assessment tools aimed at assessing the mutual impact gradient of agricultural and adjacent urban land uses on ES provision. Implementation is undertaken in GISCAME using a 2012 RapidEye image classification and primary data collected on selected ES from local farmers within the VEA catchment of Upper East, Ghana. The outcome aims to provide an understanding of the expected trade-offs and synergies that varying ALU could pose to current and potential ES provision within urbanizing landscapes. Policy implications for observed trade-offs and synergies of ALU interaction on ES, rural livelihoods, and food security are communicated to farmers and decision makers. Keywords: Agricultural land use, neighbourhood interaction, ecosystem services, livelihoods, GISCAME.
Genetic Differentiation of North-East Argentina Populations Based on 30 Binary X Chromosome Markers.
Di Santo Meztler, Gabriela P; Del Palacio, Santiago; Esteban, María E; Armoa, Isaías; Argüelles, Carina F; Catanesi, Cecilia I
2018-01-01
Alu insertions, INDELs, and SNPs in the X chromosome can be useful not only for revealing relationships among populations but also for identification purposes. We present data on 10 Alu insertions, 5 INDELs, and 15 SNPs of the X chromosome from three cities in north-east Argentina in order to gain insight into the genetic diversity of the X chromosome within this region of the country. A total of 198 unrelated individuals from the cities of Posadas, Corrientes, and Eldorado were genotyped for Ya5DP62, Yb8DP49, Ya5DP3, Ya5NBC37, Ya5DP77, Ya5NBC491, Ya5DP4, Ya5DP13, Yb8NBC634, and Yb8NBC102 Alu insertions, for MID193, MID1705, MID3754, MID3756 and MID1540 INDELs and for rs6639398, rs5986751, rs5964206, rs9781645, rs2209420, rs1299087, rs318173, rs933315, rs1991961, rs4825889, rs1781116, rs1937193, rs1781104, rs149910, and rs652 SNPs. No deviations from Hardy-Weinberg equilibrium were observed for Posadas and Corrientes. However, Eldorado showed significant values, and it was found to have an internal substructuring with two groups of different origin, one showing higher similarity with European countries, and the other with more similarities to Posadas and Corrientes. Fst pairwise genetic distances emerged for some markers among the studied populations and also between our data and those from other countries and continents. Of particular interest, Alu insertions demonstrated the most differences, and could be of use in ancestry studies for these populations, while INDEL and SNP variation was informative for differentiation within the country.
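The Hardy-Weinberg check mentioned in this abstract can be illustrated with a textbook chi-square test for one biallelic marker. This is only a minimal sketch: the abstract does not say which test or software the authors used, and the function name `hwe_chi_square` and the genotype counts in the usage note are hypothetical. For X-linked markers only females carry two copies, so the counts are assumed to come from female genotypes.

```python
# Minimal sketch (assumption: a plain chi-square goodness-of-fit test;
# the abstract does not state the authors' actual method).

CRITICAL_0_05_DF1 = 3.841  # chi-square critical value, alpha = 0.05, 1 df


def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int) -> float:
    """Chi-square statistic for Hardy-Weinberg proportions at a biallelic locus.

    n_aa, n_ab, n_bb are observed genotype counts (hypothetical inputs).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    # Allele frequency of A estimated from the genotype counts.
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expectation (monomorphic marker).
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


def deviates_from_hwe(n_aa: int, n_ab: int, n_bb: int) -> bool:
    """True when the statistic exceeds the 5% critical value (1 df)."""
    return hwe_chi_square(n_aa, n_ab, n_bb) > CRITICAL_0_05_DF1
```

With made-up counts, `deviates_from_hwe(25, 50, 25)` is false (the counts match the expected proportions exactly), whereas `deviates_from_hwe(50, 0, 50)` is true because heterozygotes are entirely missing, the pattern one would expect when two differentiated groups are pooled.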
Methods for sequencing GC-rich and CCT repeat DNA templates
Robinson, Donna L.
2007-02-20
The present invention is directed to a PCR-based method of cycle sequencing DNA and other polynucleotide sequences having high CG content and regions of high GC content, and includes for example DNA strands with a high Cytosine and/or Guanosine content and repeated motifs such as CCT repeats.
The Contribution of Short Repeats of Low Sequence Complexity to Large Conifer Genomes
A. Schmidt; R.L. Doudrick; J.S. Heslop-Harrison; T. Schmidt
2000-01-01
Abstract: The abundance and genomic organization of six simple sequence repeats, consisting of di-, tri-, and tetranucleotide sequence motifs, and a minisatellite repeat have been analyzed in different gymnosperms by Southern hybridization. Within the gymnosperm genomes investigated, the abundance and genomic organization of micro- and...
USDA-ARS's Scientific Manuscript database
Simple sequence repeat (SSR) markers are widely used tools for inferences about genetic diversity, phylogeography and spatial genetic structure. Their applications assume that variation among alleles is essentially caused by an expansion or contraction of the number of repeats and that, accessorily,...
Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin
2015-04-01
This study aimed to explore the features of clustered regularly interspaced short palindromic repeats (CRISPR) structures in Shigella using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that CRISPRs existed in all four groups of Shigella, and that the upstream flanking sequences of the CRISPRs could be classified into the same groups as the downstream ones. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences fell into the same groups as their corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain a "stem" and a "ring". Some spacers were found to be homologous to partial sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and that the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.
Kapila, R; Das, S; Srivastava, P S; Lakshmikumaran, M
1996-08-01
DNA sequences representing a tandemly repeated DNA family of the Sinapis arvensis genome were cloned and characterized. The 700-bp tandem repeat family is represented by two clones, pSA35 and pSA52, which are 697 and 709 bp in length, respectively. Dot matrix analysis of the sequences indicates the presence of repeated elements within each monomeric unit. Sequence analysis of the repetitive region of clones pSA35 and pSA52 shows that there are several copies of a 7-bp repeat element organized in tandem. The consensus sequence of this repeat element is 5'-TTTAGGG-3'. These elements are highly mutated and the difference in length between the two clones is due to different copy numbers of these elements. The repetitive region of clone pSA35 has 26 copies of the element TTTAGGG, whereas clone pSA52 has 28 copies. The repetitive region in both clones is flanked on either side by inverted repeats that may be footprints of a transposition event. Sequence comparison indicates that the element TTTAGGG is identical to telomeric repeats present in Arabidopsis, maize, tomato, and other plants. However, Bal31 digestion kinetics indicates non-telomeric localization of the 700-bp tandem repeats. The clones represent a novel repeat family as (i) they contain telomere-like motifs as subrepeats within each unit; and (ii) they do not hybridize to related crucifers and are species-specific in nature.
Stability of Tandem Repeats in the Drosophila Melanogaster HSR-Omega Nuclear RNA
Hogan, N. C.; Slot, F.; Traverse, K. L.; Garbe, J. C.; Bendena, W. G.; Pardue, M. L.
1995-01-01
The Drosophila melanogaster Hsr-omega locus produces a nuclear RNA containing >5 kb of tandem repeat sequences. These repeats are unique to Hsr-omega and show concerted evolution similar to that seen with classical satellite DNAs. In D. melanogaster the monomer is ~280 bp. Sequences of 19 1/2 monomers differ by 8 +/- 5% (mean +/- SD), when all pairwise comparisons are considered. Differences are single nucleotide substitutions and 1-3 nucleotide deletions/insertions. Changes appear to be randomly distributed over the repeat unit. Outer repeats do not show the decrease in monomer homogeneity that might be expected if homogeneity is maintained by recombination. However, just outside the last complete repeat at each end, there are a few fragments of sequence similar to the monomer. The sequences in these flanking regions are not those predicted for sequences decaying in the absence of recombination. Instead, the fragmentation of the sequence homology suggests that flanking regions have undergone more severe disruptions, possibly during an insertion or amplification event. Hsr-omega alleles differing in the number of repeats are detected and appear to be stable over a few thousand generations; however, both increases and decreases in repeat numbers have been observed. The new alleles appear to be as stable as their predecessors. No alleles of less than ~5 kb nor more than ~16 kb of repeats were seen in any stocks examined. The evidence that there is a limit on the minimum number of repeats is consistent with the suggestion that these repeats are important in the function of the unusual Hsr-omega nuclear RNA. PMID:7540581
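The "mean +/- SD over all pairwise comparisons" statistic quoted in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the monomers are assumed to be already aligned to equal length (with `-` for gaps), each gap column is counted as one difference, and the function names and toy sequences are hypothetical.

```python
# Minimal sketch: percent difference for every pair of aligned monomers,
# then the mean and sample standard deviation of those values.
from itertools import combinations
from statistics import mean, stdev


def percent_difference(a: str, b: str) -> float:
    """Percentage of aligned columns at which two sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    diffs = sum(1 for x, y in zip(a, b) if x != y)
    return 100.0 * diffs / len(a)


def pairwise_summary(monomers: list[str]) -> tuple[float, float]:
    """Mean and sample SD of percent difference over all monomer pairs.

    Needs at least three monomers so that there are two or more pairs.
    """
    values = [percent_difference(a, b) for a, b in combinations(monomers, 2)]
    return mean(values), stdev(values)
```

For three toy 10-bp "monomers" such as `["ACGTACGTAC", "ACGTACGTAA", "ACGTACGAAA"]`, the pairwise differences are 10%, 20% and 10%, so `pairwise_summary` returns a mean of about 13.3% with an SD of about 5.8%.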
Typing Clostridium difficile strains based on tandem repeat sequences
2009-01-01
Background: Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results: This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion: We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124
Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian
2009-11-01
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to analyze, genome-wide, the CRISPRs of extremely halophilic archaea for which whole genome sequences are currently available. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were highly conserved, and that two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as a recognition site by having palindromic structures in flanking regions, and that the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.
Development of swine-specific DNA markers for biosensor-based halal authentication.
Ali, M E; Hashim, U; Kashif, M; Mustafa, S; Che Man, Y B; Abd Hamid, S B
2012-06-29
The pig (Sus scrofa) mitochondrial genome was targeted to design short (15-30 nucleotides) DNA markers that would be suitable for biosensor-based hybridization detection of target DNA. Short DNA markers are reported to survive harsh conditions in which longer ones are degraded into smaller fragments. The whole swine mitochondrial-genome was in silico digested with AluI restriction enzyme. Among 66 AluI fragments, five were selected as potential markers because of their convenient lengths, high degree of interspecies polymorphism and intraspecies conservatism. These were confirmed by NCBI blast analysis and ClustalW alignment analysis with 11 different meat-providing animal and fish species. Finally, we integrated a tetramethyl rhodamine-labeled 18-nucleotide AluI fragment into a 3-nm diameter citrate-tannate coated gold nanoparticle to develop a swine-specific hybrid nanobioprobe for the determination of pork adulteration in 2.5-h autoclaved pork-beef binary mixtures. This hybrid probe detected as low as 1% pork in deliberately contaminated autoclaved pork-beef binary mixtures and no cross-species detection was recorded, demonstrating the feasibility of this type of probe for biosensor-based detection of pork adulteration of halal and kosher foods.
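The in silico AluI digestion step described above can be sketched in a few lines. AluI recognizes AGCT and cuts bluntly between G and C (AG^CT); everything else here is an assumption for illustration: the function names, the toy sequence, and the use of the abstract's 15-30 nucleotide range as a length filter. The abstract does not say which software the authors used.

```python
# Minimal sketch of an in silico AluI digest (AG^CT, blunt cut).
# Assumption: input is a plain DNA string; mitochondrial genomes are
# circular, so an optional circular mode joins the fragment spanning
# the sequence origin.

SITE = "AGCT"
CUT_OFFSET = 2  # cut falls after "AG"


def alui_digest(seq: str, circular: bool = False) -> list[str]:
    """Return the fragments produced by cutting at every AG^CT site."""
    seq = seq.upper()
    n = len(seq)
    # In circular mode, also look for sites that span the origin.
    search = seq + seq[: len(SITE) - 1] if circular else seq
    cuts = set()
    start = search.find(SITE)
    while start != -1 and start < n:
        cuts.add((start + CUT_OFFSET) % n if circular else start + CUT_OFFSET)
        start = search.find(SITE, start + 1)
    cuts = sorted(cuts)
    if not cuts:
        return [seq]
    if circular:
        inner = [seq[a:b] for a, b in zip(cuts, cuts[1:])]
        return inner + [seq[cuts[-1]:] + seq[: cuts[0]]]
    bounds = [0] + cuts + [n]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


def short_markers(fragments: list[str], lo: int = 15, hi: int = 30) -> list[str]:
    """Keep fragments whose length lies in the 15-30 nt range quoted above."""
    return [f for f in fragments if lo <= len(f) <= hi]
```

On the toy sequence `"AAAGCTTTAGCTGG"`, the linear digest gives `AAAG`, `CTTTAG` and `CTGG`; in circular mode the two end pieces are joined, giving `CTTTAG` and `CTGGAAAG`. Candidate fragments would still need the conservation and cross-species checks the abstract describes.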
Jing, Rong-Rong; Wang, Hui-Min; Cui, Ming; Fang, Meng-Kang; Qiu, Xiao-Jun; Wu, Xin-Hua; Qi, Jin; Wang, Yue-Guo; Zhang, Lu-Rong; Zhu, Jian-Hua; Ju, Shao-Qing
2011-09-01
Human cell-free circulating DNA (cf-DNA) derived mainly from cell apoptosis and necrosis can be measured by a variety of laboratory techniques, but almost all of these methods require sample preparation. We have developed a branched DNA (bDNA)-based Alu assay for quantifying cf-DNA in myocardial infarction (MI) patients. A total of 82 individuals were included in the study; 22 MI and 60 normal controls. cf-DNA was quantified using a bDNA-based Alu assay. cf-DNA was higher in serum compared to plasma and there was a difference between genders. cf-DNA was significantly higher in MI patients compared to the controls. There was no correlation between cf-DNA and creatine kinase-MB (CK-MB), troponin I (cTnI) or myoglobin (MYO). In serial specimens, cf-DNA was sensitive and peaked earlier than cTnI. The bDNA-based Alu assay is a novel method for quantifying human cf-DNA. Increased cf-DNA in MI patients might complement cTnI, CK-MB and MYO in a multiple marker format. Copyright © 2011 The Canadian Society of Clinical Chemists. All rights reserved.
Genome Wide Characterization of Simple Sequence Repeats in Cucumber
USDA-ARS's Scientific Manuscript database
The whole genome sequence of the cucumber cultivar Gy14 was recently sequenced at 15× coverage with the Roche 454 Titanium technology. The microsatellite DNA sequences (simple sequence repeats, SSRs) in the assembled scaffolds were computationally explored and characterized. A total of 112,073 SSRs ...
Lee, Michael; Hills, Mark; Conomos, Dimitri; Stutz, Michael D.; Dagg, Rebecca A.; Lau, Loretta M.S.; Reddel, Roger R.; Pickett, Hilda A.
2014-01-01
Telomeres are terminal repetitive DNA sequences on chromosomes, and are considered to comprise almost exclusively hexameric TTAGGG repeats. We have evaluated telomere sequence content in human cells using whole-genome sequencing followed by telomere read extraction in a panel of mortal cell strains and immortal cell lines. We identified a wide range of telomere variant repeats in human cells, and found evidence that variant repeats are generated by mechanistically distinct processes during telomerase- and ALT-mediated telomere lengthening. Telomerase-mediated telomere extension resulted in biased repeat synthesis of variant repeats that differed from the canonical sequence at positions 1 and 3, but not at positions 2, 4, 5 or 6. This indicates that telomerase is most likely an error-prone reverse transcriptase that misincorporates nucleotides at specific positions on the telomerase RNA template. In contrast, cell lines that use the ALT pathway contained a large range of variant repeats that varied greatly between lines. This is consistent with variant repeats spreading from proximal telomeric regions throughout telomeres in a stochastic manner by recombination-mediated templating of DNA synthesis. The presence of unexpectedly large numbers of variant repeats in cells utilizing either telomere maintenance mechanism suggests a conserved role for variant sequences at human telomeres. PMID:24225324
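The position-specific analysis described in this abstract (variants differing from TTAGGG at positions 1 and 3, but not at 2, 4, 5 or 6) can be illustrated with a simple hexamer scan. This is a hedged sketch rather than the authors' read-extraction pipeline: it assumes a single forward-strand read, sets the reading frame at the first canonical TTAGGG, ignores indels and the CCCTAA strand, and uses hypothetical names throughout.

```python
# Minimal sketch: tally canonical repeats and, for hexamers with exactly
# one mismatch, the 1-based position that differs from TTAGGG.
from collections import Counter

CANONICAL = "TTAGGG"


def variant_positions(read: str) -> tuple[int, Counter]:
    """Return (number of canonical hexamers, Counter of mismatch positions)."""
    read = read.upper()
    start = read.find(CANONICAL)  # frame is set by the first canonical repeat
    canonical = 0
    positions: Counter = Counter()
    if start == -1:
        return canonical, positions
    for i in range(start, len(read) - len(CANONICAL) + 1, len(CANONICAL)):
        hexamer = read[i : i + len(CANONICAL)]
        mismatches = [
            pos + 1 for pos, (a, b) in enumerate(zip(hexamer, CANONICAL)) if a != b
        ]
        if not mismatches:
            canonical += 1
        elif len(mismatches) == 1:
            positions[mismatches[0]] += 1
        # Hexamers with two or more mismatches are left unclassified here.
    return canonical, positions
```

For a made-up read built from three canonical repeats followed by `GTAGGG`, `TTCGGG` and one more `TTAGGG`, the function reports four canonical repeats and one variant each at positions 1 and 3. Summing such counters over many reads would give the kind of per-position profile the abstract refers to.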
Srivastava, Deepika; Shanker, Asheesh
2016-12-01
Basal angiosperms, or Magnoliids, form a clade of commercially important plants, which mainly include spices and edible fruits. In this study, 17 chloroplast genome sequences belonging to the clade Magnoliids were screened for the identification of chloroplast simple sequence repeats (cpSSRs). Simple sequence repeats, or microsatellites, are short stretches of DNA with repeat units of 1-6 base pairs in length. These repeats are ubiquitous and play an important role in the development of molecular markers and in mapping traits of economic, medical or ecological interest. A total of 479 SSRs were detected, showing an average density of 1 SSR/6.91 kb. Depending on the repeat units, the length of SSRs ranged from 12 to 24 bp for mono-, 12 to 18 bp for di-, 12 to 26 bp for tri-, 12 to 24 bp for tetra-, 15 bp for penta- and 18 bp for hexanucleotide repeats. Mononucleotide repeats were the most frequent (207, 43.21%), followed by tetranucleotide repeats (130, 27.13%). Penta- and hexanucleotide repeats were least frequent or absent in these chloroplast genomes.
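A regular-expression scan is one common way to perform this kind of SSR screen. The sketch below is not the tool used in the study (the abstract does not name one). The minimum repeat counts (12, 6, 4, 3, 3, 3 for mono- to hexanucleotide motifs) are an assumption inferred from the minimum lengths quoted above (12 bp for mono- to tetra-, 15 bp for penta-, 18 bp for hexanucleotide repeats), and all names are hypothetical.

```python
# Minimal sketch of a perfect-SSR finder using back-references.
import re

# Motif length -> minimum number of tandem copies (assumed thresholds).
MIN_REPEATS = {1: 12, 2: 6, 3: 4, 4: 3, 5: 3, 6: 3}


def is_primitive(motif: str) -> bool:
    """False if the motif is itself a repetition of a shorter unit (e.g. 'ATAT')."""
    k = len(motif)
    return not any(
        k % d == 0 and motif == motif[:d] * (k // d) for d in range(1, k)
    )


def find_ssrs(seq: str) -> list[tuple[int, str, int]]:
    """Return (0-based start, motif, total length) for each perfect SSR found."""
    seq = seq.upper()
    hits = []
    for k, min_rep in MIN_REPEATS.items():
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        for m in pattern.finditer(seq):
            if is_primitive(m.group(1)):
                hits.append((m.start(), m.group(1), len(m.group(0))))
    return sorted(hits)


def density_kb_per_ssr(total_bp: int, n_ssrs: int) -> float:
    """Average spacing in kb, i.e. the '1 SSR per X kb' figure."""
    return total_bp / 1000.0 / n_ssrs
```

On a toy sequence consisting of `GGGGG`, twelve `A`s, `CCGG`, six copies of `AT` and `GGC`, `find_ssrs` reports a 12-bp mononucleotide repeat at position 5 and a 12-bp dinucleotide repeat at position 21. Taking the abstract's figures at face value, 479 SSRs at 1 SSR/6.91 kb implies roughly 3,300 kb of screened sequence in total across the 17 genomes.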
Park, Eonyoung; Maquat, Lynne E.
2013-01-01
Staufen1 (STAU1)-mediated mRNA decay (SMD) is an mRNA degradation process in mammalian cells that is mediated by the binding of STAU1 to a STAU1-binding site (SBS) within the 3'-untranslated region (3'UTR) of target mRNAs. During SMD, STAU1, a double-stranded (ds) RNA-binding protein, recognizes dsRNA structures formed either by intramolecular base-pairing of 3'UTR sequences or by intermolecular base-pairing of 3'UTR sequences with a long noncoding RNA (lncRNA) via partially complementary Alu elements. Recently, STAU2, a paralog of STAU1, has also been reported to mediate SMD. Both STAU1 and STAU2 interact directly with the ATP-dependent RNA helicase UPF1, a key SMD factor, enhancing its helicase activity to promote effective SMD. Moreover, STAU1 and STAU2 form homodimeric and heterodimeric interactions via domain-swapping. Since both SMD and the mechanistically related nonsense-mediated mRNA decay (NMD) employ UPF1, SMD and NMD are competitive pathways. Competition contributes to cellular differentiation processes, such as myogenesis and adipogenesis, placing SMD at the heart of various physiologically important mechanisms. PMID:23681777
IL-6, a central acute-phase mediator, as an early biomarker for exposure to zinc-based metal fumes.
Baumann, R; Joraslafsky, S; Markert, A; Rack, I; Davatgarbenam, S; Kossack, V; Gerhards, B; Kraus, T; Brand, P; Gube, M
2016-12-12
Systemic C-reactive protein (CRP) increases 1 day after short-term inhalation of welding fumes containing zinc and/or copper. The aim of the current study was to find further, possibly earlier systemic biomarkers after inhalation of different welding fumes containing zinc and traces of aluminum, with or without copper, as these metal combinations become more common in modern joining technology. The study group consisted of 15 non-smoking male volunteers with healthy lung function data and without any occupational metal fume exposure. On 4 different exposure days, the members of the study group were exposed under controlled conditions to ambient air or 3 different welding fumes for 6 h. Spirometric and impulse oscillometric measurements and differential blood counts were performed and serum samples were collected before exposure and 6, 10 and 29 h after start of exposure. The biomarker concentrations in serum were measured by electrochemiluminescent assays. Systemic increases of IL-6 peaked significantly at 10 h compared to baseline ("ZincZinc": P=0.0005 (median increase (m. incr.)=1.36 pg/mL); "ZincAlu": P=0.0012 (m. incr.=1.48 pg/mL); "AluBronze": P=0.0005 (m. incr.=2.66 pg/mL)). At 29 h, CRP and serum amyloid A (SAA) increased distinctively ("ZincZinc": P=0.032 (m. incr.=0.65 μg/mL) [CRP], 0.077 (m. incr.=0.61 μg/mL) [SAA]; "ZincAlu": P=0.001 (m. incr.=1.15 μg/mL) [CRP], 0.0024 (m. incr.=0.94 μg/mL) [SAA]; "AluBronze": P=0.002 (m. incr.=2.5 μg/mL) [CRP], 0.002 (m. incr.=0.97 μg/mL) [SAA]). The median increases of CRP and IL-6 were most pronounced for the welding fume that contained copper in addition to zinc (AluBronze). For differentiating AluBronze from control exposure, receiver operating characteristic (ROC) curve analysis was performed and the area under the ROC curve (AUC) for the IL-6 increases (10 h versus 0 h) was 0.931.
The additional inflammatory mediators [vascular cell adhesion molecule-1 (VCAM-1), intercellular adhesion molecule-1 (ICAM-1), interferon-γ (IFN-γ), cell counts] and the lung function parameters did not show any significant changes after exposure. Consistent with its role in mediating the acute-phase response, systemic increases of IL-6 after welding fume exposure peak at 10 h, before the increases of the acute-phase reactants CRP and SAA at 29 h. IL-6 may represent a highly sensitive and early biomarker for the exposure to metal fumes containing zinc and copper. As IL-6, CRP and SAA are independent, strong risk markers for future cardiovascular diseases, these data may particularly be important for long-term welders with respect to their cardiovascular health. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Molecular characterization and distribution of a 145-bp tandem repeat family in the genus Populus.
Rajagopal, J; Das, S; Khurana, D K; Srivastava, P S; Lakshmikumaran, M
1999-10-01
This report aims to describe the identification and molecular characterization of a 145-bp tandem repeat family that accounts for nearly 1.5% of the Populus genome. Three members of this repeat family were cloned and sequenced from Populus deltoides and P. ciliata. The dimers of the repeat were sequenced in order to confirm the head-to-tail organization of the repeat. Hybridization-based analysis using the 145-bp tandem repeat as a probe on genomic DNA gave rise to ladder patterns which were identified to be a result of methylation and (or) sequence heterogeneity. Analysis of the methylation pattern of the repeat family using methylation-sensitive isoschizomers revealed variable methylation of the C residues and lack of methylation of the A residues. Sequence comparisons between the monomers revealed a high degree of sequence divergence that ranged between 6% and 11% in P. deltoides and between 4.2% and 8.3% in P. ciliata. This indicated the presence of sub-families within the 145-bp tandem family of repeats. Divergence was mainly due to the accumulation of point mutations and was concentrated in the central region of the repeat. The 145-bp tandem repeat family did not show significant homology to known tandem repeats from plants. A short stretch of 36 bp was found to show homology of 66.7% to a centromeric repeat from Chironomus plumosus. Dot-blot analysis and Southern hybridization data revealed the presence of the repeat family in 13 of the 14 Populus species examined. The absence of the 145-bp repeat from P. euphratica suggested that this species is relatively distant from other members of the genus, which correlates with taxonomic classifications. The widespread occurrence of the tandem family in the genus indicated that this family may be of ancient origin.
Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.
Benslimane, A A; Dron, M; Hartmann, C; Rode, A
1986-01-01
Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammals. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. PMID:3774553
Santini, A C; Magalhães, J T; Cascardo, J C M; Corrêa, R X
2016-04-28
Chromobacterium violaceum is a free-living Gram-negative bacillus usually found in the water and soil in tropical regions, which causes infections in humans. Chromobacteriosis is characterized by rapid dissemination and high mortality. The aim of this study was to detect the genetic variability among C. violaceum type strain ATCC 12472, seven isolates from the environment, and one from a pulmonary secretion from a chromobacteriosis patient from Ilhéus, Bahia. The molecular characterization of all samples was performed by polymerase chain reaction (PCR) sequencing and 16S rDNA analysis. Primers specific for two ATCC 12472 pathogenicity genes, hilA and yscD, as well as random amplified polymorphic DNA (RAPD), were used for PCR amplification and comparative sequencing of the products. For a more specific approach, the PCR products of 16S rDNA were digested with restriction enzymes. Seven of the samples, including type strain ATCC 12472, were amplified by the hilA primers; these were subsequently sequenced. Gene yscD was amplified only in type strain ATCC 12472. MspI and AluI digestion revealed 16S rDNA polymorphisms. These data allowed the generation of a dendrogram for each analysis. The isolates of C. violaceum have variability in random genomic regions, as demonstrated by RAPD. These isolates also have variability in pathogenicity genes, as demonstrated by sequencing and restriction enzyme digestion.
Functions of the RNA Editing Enzyme ADAR1 and Their Relevance to Human Diseases.
Song, Chunzi; Sakurai, Masayuki; Shiromoto, Yusuke; Nishikura, Kazuko
2016-12-17
Adenosine deaminases acting on RNA (ADARs) convert adenosine to inosine in double-stranded RNA (dsRNA). Among the three types of mammalian ADARs, ADAR1 has long been recognized as an essential enzyme for normal development. The interferon-inducible ADAR1p150 is involved in immune responses to both exogenous and endogenous triggers, whereas the functions of the constitutively expressed ADAR1p110 are variable. Recent findings that ADAR1 is involved in the recognition of self versus non-self dsRNA provide potential explanations for its links to hematopoiesis, type I interferonopathies, and viral infections. Editing in both coding and noncoding sequences results in diseases ranging from cancers to neurological abnormalities. Furthermore, editing of noncoding sequences, like microRNAs, can regulate protein expression, while editing of Alu sequences can affect translational efficiency and editing of proximal sequences. Novel identifications of long noncoding RNA and retrotransposons as editing targets further expand the effects of A-to-I editing. Besides editing, ADAR1 also interacts with other dsRNA-binding proteins in editing-independent manners. Elucidating the disease-specific patterns of editing and/or ADAR1 expression may be useful in making diagnoses and prognoses. In this review, we relate the mechanisms of ADAR1's actions to its pathological implications, and suggest possible mechanisms for the unexplained associations between ADAR1 and human diseases.
Functions of the RNA Editing Enzyme ADAR1 and Their Relevance to Human Diseases
Song, Chunzi; Sakurai, Masayuki; Shiromoto, Yusuke; Nishikura, Kazuko
2016-01-01
Adenosine deaminases acting on RNA (ADARs) convert adenosine to inosine in double-stranded RNA (dsRNA). Among the three types of mammalian ADARs, ADAR1 has long been recognized as an essential enzyme for normal development. The interferon-inducible ADAR1p150 is involved in immune responses to both exogenous and endogenous triggers, whereas the functions of the constitutively expressed ADAR1p110 are variable. Recent findings that ADAR1 is involved in the recognition of self versus non-self dsRNA provide potential explanations for its links to hematopoiesis, type I interferonopathies, and viral infections. Editing in both coding and noncoding sequences results in diseases ranging from cancers to neurological abnormalities. Furthermore, editing of noncoding sequences, like microRNAs, can regulate protein expression, while editing of Alu sequences can affect translational efficiency and editing of proximal sequences. Novel identifications of long noncoding RNA and retrotransposons as editing targets further expand the effects of A-to-I editing. Besides editing, ADAR1 also interacts with other dsRNA-binding proteins in editing-independent manners. Elucidating the disease-specific patterns of editing and/or ADAR1 expression may be useful in making diagnoses and prognoses. In this review, we relate the mechanisms of ADAR1′s actions to its pathological implications, and suggest possible mechanisms for the unexplained associations between ADAR1 and human diseases. PMID:27999332
Two tandemly repeated telomere-associated sequences in Nicotiana plumbaginifolia.
Chen, C M; Wang, C T; Wang, C J; Ho, C H; Kao, Y Y; Chen, C C
1997-12-01
Two tandemly repeated telomere-associated sequences, NP3R and NP4R, have been isolated from Nicotiana plumbaginifolia. The length of a repeating unit for NP3R and NP4R is 165 and 180 nucleotides, respectively. The abundance of NP3R, NP4R and telomeric repeats is, respectively, 8.4 × 10^4, 6 × 10^3 and 1.5 × 10^6 copies per haploid genome of N. plumbaginifolia. Fluorescence in situ hybridization revealed that NP3R is located at the ends and/or in interstitial regions of all 10 chromosomes and NP4R on the terminal regions of three chromosomes in the haploid genome of N. plumbaginifolia. Sequence homology search revealed that not only are NP3R and NP4R homologous to HRS60 and GRS, respectively, two tandem repeats isolated from N. tabacum, but that NP3R and NP4R are also related to each other, suggesting that they originated from a common ancestral sequence. The role of these repeated sequences in chromosome healing is discussed based on the observation that two to three copies of a telomere-similar sequence were present in each repeating unit of NP3R and NP4R.
Development of Pineapple Microsatellite Markers and Germplasm Genetic Diversity Analysis
Tong, Helin; Chen, You; Wang, Jingyi; Chen, Yeyuan; Sun, Guangming; He, Junhu; Wu, Yaoting
2013-01-01
Two methods were used to develop pineapple microsatellite markers. Genomic library-based SSR development: using selectively amplified microsatellite assay, 86 sequences were generated from pineapple genomic library. 91 (96.8%) of the 94 Simple Sequence Repeat (SSR) loci were dinucleotide repeats (39 AC/GT repeats and 52 GA/TC repeats, accounting for 42.9% and 57.1%, resp.), and the other three were mononucleotide repeats. Thirty-six pairs of SSR primers were designed; 24 of them generated clear bands of expected sizes, and 13 of them showed polymorphism. EST-based SSR development: 5659 pineapple EST sequences obtained from NCBI were analyzed; among 1397 nonredundant EST sequences, 843 were found containing 1110 SSR loci (217 of them contained more than one SSR locus). Frequency of SSRs in pineapple EST sequences is 1SSR/3.73 kb, and 44 types were found. Mononucleotide, dinucleotide, and trinucleotide repeats dominate, accounting for 95.6% in total. AG/CT and AGC/GCT were the dominant type of dinucleotide and trinucleotide repeats, accounting for 83.5% and 24.1%, respectively. Thirty pairs of primers were designed for each of randomly selected 30 sequences; 26 of them generated clear and reproducible bands, and 22 of them showed polymorphism. Eighteen pairs of primers obtained by the one or the other of the two methods above that showed polymorphism were selected to carry out germplasm genetic diversity analysis for 48 breeds of pineapple; similarity coefficients of these breeds were between 0.59 and 1.00, and they can be divided into four groups accordingly. Amplification products of five SSR markers were extracted and sequenced, corresponding repeat loci were found and locus mutations are mainly in copy number of repeats and base mutations in the flanking region. PMID:24024187
A novel tandem repeat sequence located on human chromosome 4p: isolation and characterization.
Kogi, M; Fukushige, S; Lefevre, C; Hadano, S; Ikeda, J E
1997-06-01
In an effort to analyze the genomic region of the distal half of human chromosome 4p, to where Huntington disease and other diseases have been mapped, we have isolated the cosmid clone (CRS447) that was likely to contain a region with specific repeat sequences. Clone CRS447 was subjected to detailed analysis, including chromosome mapping, restriction mapping, and DNA sequencing. Chromosome mapping by both a human-CHO hybrid cell panel and FISH revealed that CRS447 was predominantly located in the 4p15.1-15.3 region. CRS447 was shown to consist of tandem repeats of 4.7-kb units present on chromosome 4p. A single EcoRI unit was subcloned (pRS447), and the complete sequence was determined as 4752 nucleotides. When pRS447 was used as a probe, the number of copies of this repeat per haploid genome was estimated to be 50-70. Sequence analysis revealed that it contained two internal CA repeats and one putative ORF. Database search established that this sequence was unreported. However, two homologous STS markers were found in the database. We concluded that CRS447/pRS447 is a novel tandem repeat sequence that is mainly specific to human chromosome 4p.
Genome-wide characterization of centromeric satellites from multiple mammalian genomes.
Alkan, Can; Cardone, Maria Francesca; Catacchio, Claudia Rita; Antonacci, Francesca; O'Brien, Stephen J; Ryder, Oliver A; Purgato, Stefania; Zoli, Monica; Della Valle, Giuliano; Eichler, Evan E; Ventura, Mario
2011-01-01
Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.
Survey and Analysis of Microsatellites in the Silkworm, Bombyx mori
Prasad, M. Dharma; Muthulakshmi, M.; Madhu, M.; Archak, Sunil; Mita, K.; Nagaraju, J.
2005-01-01
We studied microsatellite frequency and distribution in 21.76-Mb random genomic sequences, 0.67-Mb BAC sequences from the Z chromosome, and 6.3-Mb EST sequences of Bombyx mori. We mined microsatellites of ≥15 bases of mononucleotide repeats and ≥5 repeat units of other classes of repeats. We estimated that microsatellites account for 0.31% of the genome of B. mori. Microsatellite tracts of A, AT, and ATT were the most abundant whereas their number drastically decreased as the length of the repeat motif increased. In general, tri- and hexanucleotide repeats were overrepresented in the transcribed sequences except TAA, GTA, and TGA, which were in excess in genomic sequences. The Z chromosome sequences contained shorter repeat types than the rest of the chromosomes in addition to a higher abundance of AT-rich repeats. Our results showed that base composition of the flanking sequence has an influence on the origin and evolution of microsatellites. Transitions/transversions were high in microsatellites of ESTs, whereas the genomic sequence had an equal number of substitutions and indels. The average heterozygosity value for 23 polymorphic microsatellite loci surveyed in 13 diverse silkmoth strains having 2–14 alleles was 0.54. Only 36 (18.2%) of 198 microsatellite loci were polymorphic between the two divergent silkworm populations and 10 (5%) loci revealed null alleles. The microsatellite map generated using these polymorphic markers resulted in 8 linkage groups. B. mori microsatellite loci were the most conserved in its immediate ancestor, B. mandarina, followed by the wild saturniid silkmoth, Antheraea assama. PMID:15371363
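The mining thresholds stated above (mononucleotide tracts of ≥15 bases, and ≥5 repeat units for other classes) can be sketched with a regular-expression scan. This is a minimal illustration, not the authors' pipeline: it assumes perfect (uninterrupted) repeats, motif lengths of 1 to 6 bases, and a hypothetical function name `find_microsatellites`.

```python
import re


def find_microsatellites(seq, max_motif_len=6, min_mono_len=15, min_units=5):
    """Return sorted (start, motif, n_units) tuples for perfect tandem repeats.

    Thresholds follow the abstract: >=15 bases for mononucleotide tracts and
    >=5 repeat units for longer motifs. The motif-length range 1..6 is an assumption.
    """
    seq = seq.upper()
    hits = []
    for k in range(1, max_motif_len + 1):
        min_rep = min_mono_len if k == 1 else min_units
        # A k-base motif followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are just a shorter unit repeated (e.g. "AA", "ATAT").
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


# Toy sequence (made-up): a 15-base A tract, an (AT)5 tract, and a 14-base A tract
# that falls just below the mononucleotide threshold.
toy = "GGC" + "A" * 15 + "CG" + "AT" * 5 + "GC" + "A" * 14
found = find_microsatellites(toy)
```

Because the scan is non-overlapping within each motif length, tracts that directly abut one another may be under-reported; a production tool would handle that case explicitly.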
Zhang, Yang; Peng, Xueqin; Tang, Yunlian; Gan, Xiaoning; Wang, Chengkun; Xie, Lu; Xie, Xiaoli; Gan, Runliang; Wu, Yimou
2016-10-01
Epstein-Barr virus (EBV) is a human oncogenic herpesvirus associated with lymphoma and nasopharyngeal carcinoma. Because the susceptible hosts of EBV are limited to humans and cotton-top tamarins (Saguinus oedipus), there were no appropriate animal models until the lymphoma model induced by EBV in human peripheral blood lymphocyte (hu-PBL)/SCID chimeric mice was reported. However, it is still controversial whether the EBV-associated lymphoma induced in hu-PBL/SCID mice is a monoclonal tumor. In this study, we transplanted normal human peripheral blood lymphocytes (hu-PBL) from six donors infected with EBV into SCID mice to construct hu-PBL/SCID chimeric mice. The induced tumors were found in the mediastinum or abdominal cavity of SCID mice. Microscopic observation showed tumor cells that were large and had a plasmablastic, centroblastic or immunoblastic-like appearance. Immunophenotyping assays showed the induced tumors were LCA-positive, CD20/CD79a-positive (markers of B cells), and CD3/CD45RO-negative (markers of T cells). A human-specific Alu sequence could be amplified by Alu-PCR. This confirmed that the induced tumors were B-cell lymphomas originating from the transplanted human lymphocytes rather than mouse cells. EBER in situ hybridization detected positive signals in the nuclei of the tumor cells. Expression of EBV-encoded LMP1, EBNA-1, and EBNA-2 in the tumors was significantly positive. PCR-based capillary electrophoresis analysis of IgH gene rearrangement revealed a monoclonal peak and a single amplification product in all six cases of induced tumors. This indicated that EBV can induce monoclonal proliferation of human B lymphocytes and promote the development of lymphoma. J. Med. Virol. 88:1804-1813, 2016. © 2016 Wiley Periodicals, Inc.
2012-01-01
Background The aim of this study was to clarify the role of global hypomethylation of repetitive elements in determining the genetic and clinical features of multiple myeloma (MM). Methods We assessed global methylation levels using four repetitive elements (long interspersed nuclear element-1 (LINE-1), Alu Ya5, Alu Yb8, and Satellite-α) in clinical samples comprising 74 MM samples and 11 benign control samples (7 cases of monoclonal gammopathy of undetermined significance (MGUS) and 4 samples of normal plasma cells (NPC)). We also evaluated copy-number alterations using array-based comparative genomic hybridization, and performed methyl-CpG binding domain sequencing (MBD-seq). Results Global levels of the repetitive-element methylation declined with the degree of malignancy of plasma cells (NPC>MGUS>MM), and there was a significant inverse correlation between the degree of genomic loss and the LINE-1 methylation levels. We identified 80 genomic loci as common breakpoints (CBPs) around commonly lost regions, which were significantly associated with increased LINE-1 densities. MBD-seq analysis revealed that average DNA-methylation levels at the CBP loci and relative methylation levels in regions with higher LINE-1 densities also declined during the development of MM. We confirmed that levels of methylation of the 5' untranslated region of respective LINE-1 loci correlated strongly with global LINE-1 methylation levels. Finally, there was a significant association between LINE-1 hypomethylation and poorer overall survival (hazard ratio 2.8, P = 0.015). Conclusion Global hypomethylation of LINE-1 is associated with the progression of and poorer prognosis for MM, possibly due to frequent copy-number loss. PMID:23259664
NASA Astrophysics Data System (ADS)
Dominguez, L. A.; Taira, T.; Hjorleifsdottir, V.; Santoyo, M. A.
2015-12-01
Repeating earthquake sequences are sets of events that are thought to rupture the same area on the plate interface and thus produce nearly identical waveforms. We systematically analyzed seismic records from 2001 through 2014 to identify repeating earthquakes with highly correlated waveforms occurring along the subduction zone of the Cocos plate. Using the correlation coefficient (cc) and spectral coherency (coh) of the vertical components as selection criteria, we found a set of 214 sequences whose waveforms satisfy cc≥95% and coh≥95%. Spatial clustering along the trench shows large variations in repeating earthquake activity. In particular, the rupture zone of the M8.1, 1985 earthquake shows an almost complete absence of characteristic repeating earthquakes, whereas the Guerrero Gap zone and the segment of the trench close to the Guerrero-Oaxaca border show a significantly larger number of repeating earthquake sequences. Furthermore, temporal variations associated with stress changes due to major earthquakes show episodes of unlocking and healing of the interface. Understanding the different components that control the location and recurrence time of characteristic repeating sequences is a key factor in pinpointing areas where large megathrust earthquakes may nucleate and, consequently, in improving seismic hazard assessment.
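The cc≥95% selection criterion can be illustrated with a zero-lag Pearson correlation between two equal-length traces. This is a simplified sketch under stated assumptions: the traces are already windowed and aligned, the spectral-coherency criterion is not implemented, and the function names and synthetic waveforms are hypothetical.

```python
import math


def correlation_coefficient(x, y):
    """Zero-lag Pearson correlation between two equal-length waveform windows."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("traces must have the same length (>= 2 samples)")
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = math.sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))
    if den == 0:
        raise ValueError("a constant trace has no defined correlation")
    return num / den


def is_candidate_repeater(x, y, cc_min=0.95):
    """Apply only the cc threshold from the abstract; coherency is not checked here."""
    return correlation_coefficient(x, y) >= cc_min


# Synthetic traces (made-up): the second is a scaled and offset copy of the first,
# so the two are perfectly correlated; the third is an unrelated oscillation.
trace_a = [math.sin(0.3 * i) * math.exp(-0.02 * i) for i in range(200)]
trace_b = [2.0 * v + 1.0 for v in trace_a]
trace_c = [math.cos(1.7 * i) for i in range(200)]
cc_ab = correlation_coefficient(trace_a, trace_b)
```

In practice the correlation would typically be maximised over a range of time lags, which this sketch omits.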
Kumar, Deepak; Singh, S P; Karabasanavar, Nagappa S; Singh, Rashmi; Umapathi, V
2014-11-01
Authentication of meat assumes significance in view of religious, quality assurance, food safety, public health, conservation and legal concerns. Here, we describe a PCR-RFLP (Polymerase Chain Reaction-Restriction Fragment Length Polymorphism) assay targeting the mitochondrial cytochrome-b gene for the identification of meats of the five most common food animals, namely cattle, buffalo, goat, sheep and pig. A pair of forward and reverse primers (VPH-F & VPH-R) amplifying a conserved region (168-776 bp) of the mitochondrial cytochrome-b (cytb) gene of the targeted species was designed, which yielded a 609 bp PCR amplicon. Further, digestion of the amplicons with the AluI and TaqI restriction enzymes resulted in a distinctive digestion pattern that was able to discriminate each species. The repeatability of the PCR-RFLP assay was validated ten times, with consistent results. The developed assay can be used in routine diagnostic laboratories to differentiate the meats of closely related domestic livestock species, namely cattle from buffalo and sheep from goat.
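Two pieces of the assay above lend themselves to a short worked sketch: the amplicon length implied by the primer coordinates (positions 168 through 776 inclusive give 776 − 168 + 1 = 609 bp) and an in-silico restriction digest. The recognition sites used are the standard ones for AluI (AG^CT) and TaqI (T^CGA); the toy sequence and the function name `digest` are hypothetical and are not the cytb amplicon or the authors' code.

```python
def digest(seq, site, cut_offset):
    """Fragment lengths after cutting a linear sequence at every occurrence of site.

    cut_offset is the number of bases of the site left on the upstream fragment
    (2 for AluI, AG^CT; 1 for TaqI, T^CGA). Only the top strand is considered.
    """
    seq = seq.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


# Amplicon length from the primer coordinates quoted in the abstract.
amplicon_length = 776 - 168 + 1

# Toy 44-bp sequence (made-up) carrying one AluI site and one TaqI site.
toy = "A" * 10 + "AGCT" + "G" * 20 + "TCGA" + "C" * 6
alu_fragments = digest(toy, "AGCT", 2)
taq_fragments = digest(toy, "TCGA", 1)
```

Comparing such fragment-length lists across species is what produces the distinctive banding patterns an RFLP assay relies on.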
HIV-1 integration landscape during latent and active infection
Cohn, Lillian; Silva, Israel T.; Oliveira, Thiago Y.; Rosales, Rafael A.; Parrish, Erica H.; Learn, Gerald H.; Hahn, Beatrice H.; Czartoski, Julie L.; McElrath, M. Juliana; Lehmann, Clara; Klein, Florian; Caskey, Marina; Walker, Bruce D.; Siliciano, Janet D.; Siliciano, Robert F.; Jankovic, Mila; Nussenzweig, Michel C.
2015-01-01
SUMMARY The barrier to curing HIV-1 is thought to reside primarily in CD4+ T cells containing silent proviruses. To characterize these latently infected cells, we studied the integration profile of HIV-1 in viremic progressors, individuals receiving antiretroviral therapy, and viremic controllers. Clonally expanded T cells represented the majority of all integrations and increased during therapy. However, none of the 75 expanded T cell clones assayed contained intact virus. In contrast, the cells bearing single integration events decreased in frequency over time on therapy, and the surviving cells were enriched for HIV-1 integration in silent regions of the genome. Finally, there was a strong preference for integration into, or in close proximity to Alu repeats, which were also enriched in local hotspots for integration. The data indicate that dividing clonally expanded T cells contain defective proviruses, and that the replication competent reservoir is primarily found in CD4+ T cells that remain relatively quiescent. PMID:25635456
Algorithm to find distant repeats in a single protein sequence
Banerjee, Nirjhar; Sarani, Rangarajan; Ranjani, Chellamuthu Vasuki; Sowmiya, Govindaraj; Michael, Daliah; Balakrishnan, Narayanasamy; Sekar, Kanagaraj
2008-01-01
Distant repeats in protein sequences play an important role in various aspects of protein analysis. A careful analysis of distant repeats would make it possible to establish a firm relation between the repeats and their function and three-dimensional structure during the evolutionary process. Further, it sheds light on the diversity of duplication during evolution. To this end, an algorithm has been developed to find all distant repeats in a protein sequence. The scores from the Point Accepted Mutation (PAM) matrix have been used to identify amino acid substitutions while detecting the distant repeats. Due to the biological importance of distant repeats, the proposed algorithm will be of importance to structural biologists, molecular biologists, biochemists and researchers involved in phylogenetic and evolutionary studies. PMID:19052663
2013-01-01
Background Exposure to pollutants including metals and particulate air pollution can alter DNA methylation. Yet little is known about intra-individual changes in DNA methylation over time in relationship to environmental exposures. Therefore, we evaluated the effects of acute and chronic metal-rich PM2.5 exposures on DNA methylation. Methods Thirty-eight male boilermaker welders participated in a panel study for a total of 54 person days. Whole blood was collected prior to any welding activities (pre-shift) and immediately after the exposure period (post-shift). The percentage of methylated cytosines (%mC) in LINE-1, Alu, and the inducible nitric oxide synthase gene (iNOS) was quantified using pyrosequencing. Personal PM2.5 (particulate matter with an aerodynamic diameter ≤ 2.5 μm) was measured over the work-shift. A questionnaire assessed job history and years worked as a boilermaker. Linear mixed models with repeated measures evaluated associations between DNA methylation, PM2.5 concentration (acute exposure), and years worked as a boilermaker (chronic exposure). Results PM2.5 exposure was associated with increased methylation in the promoter region of the iNOS gene (β = 0.25, SE: 0.11, p-value = 0.04). Additionally, the number of years worked as a boilermaker was associated with increased iNOS methylation (β = 0.03, SE: 0.01, p-value = 0.03). No associations were observed for Alu or LINE-1. Conclusions Acute and chronic exposure to PM2.5 generated from welding activities was associated with a modest change in DNA methylation of the iNOS gene. Future studies are needed to confirm this association and determine whether the observed small increase in iNOS methylation is associated with changes in NO production or any adverse health effect. PMID:23758843
Characterization of (CA)n microsatellite repeats from large-insert clones.
Litt, M; Browne, D
2001-05-01
The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit determination of sequences flanking the microsatellites. When cosmids or large-insert phage clones are used as primary sources of (CA)n repeat markers, they have traditionally been subcloned into plasmid vectors such as pUC18 or M13 mp 18/19 cloning vectors to obtain fragments of suitable size for DNA sequencing. This unit presents an alternative approach whereby a set of degenerate sequencing primers that anneal directly to (CA)n microsatellites can be used to determine sequences that are inaccessible with vector-derived primers. Because the primers anneal to the repeat and not to the vector, they can be used with subclones containing inserts of several kilobases and should, in theory, always give sequence in the regions directly flanking the repeat. Degeneracy at the 3′ end of each of these primers prevents elongation of primers that have annealed out-of-register.
Bhatia, S; Singh Negi, M; Lakshmikumaran, M
1996-11-01
EcoRI restriction of the B. nigra rDNA recombinants, isolated from a lambda genomic library, showed that the 3.9-kb fragment corresponded to the Intergenic Spacer (IGS), which was sequenced and found to be 3,928 bp in size. Sequence and dot-matrix analyses showed that the organization of the B. nigra rDNA IGS was typical of most rDNA spacers, consisting of a central repetitive region and flanking unique sequences on either side. The repetitive region was composed of two repeat families: RF 'A' and RF 'B'. The B. nigra RF 'A' consisted of a tandem array of three full-length copies of a 106-bp sequence element. RF 'B' was composed of 66 tandemly repeated elements. Each 'B' element was only 21 bp in size, and this is the smallest repeat unit identified in plant rDNA to date. The putative transcription initiation site (TIS) was identified as nucleotide position 3,110. Based on the sequence analysis it was suggested that the present organization of the repeat families was generated by successive cycles of deletions and amplifications and was being maintained by homogenization processes such as gene conversion and crossing-over. A detailed comparison of the rDNA IGS sequences of the three diploid Brassica species, namely B. nigra, B. campestris, and B. oleracea, was carried out. First, comparisons revealed that B. campestris and B. oleracea were close to each other, as the repeat families in both showed high sequence homology to each other. Second, the repeat elements in both species were organized in an interspersed manner. Third, a 52-bp sequence, present just downstream of the repeats in B. campestris, was found to be identical to the B. oleracea repeats, thereby suggesting a common progenitor. On the other hand, in B. nigra no interspersion pattern of organization of repeats was observed. Further, the B. nigra RF 'A' was identified as distinct from the repeat families of B. campestris and B. oleracea.
Based on this analysis, it was suggested that during speciation B. campestris and B. oleracea evolved in one lineage whereas B. nigra diverged into a separate lineage. The comparative analysis of the IGS helped in identifying not only conserved ancestral sequence motifs of possible functional significance such as promoters and enhancers, but also sequences which showed variation between the three diploid species and were therefore identified as species-specific sequences.
Bolzán, Alejandro D
2017-07-01
By definition, telomeric sequences are located at the very ends or terminal regions of chromosomes. However, several vertebrate species show blocks of (TTAGGG)n repeats present in non-terminal regions of chromosomes, the so-called interstitial telomeric sequences (ITSs), interstitial telomeric repeats or interstitial telomeric bands, which include those intrachromosomal telomeric-like repeats located near (pericentromeric ITSs) or within the centromere (centromeric ITSs) and those telomeric repeats located between the centromere and the telomere (i.e., truly interstitial telomeric sequences) of eukaryotic chromosomes. According to their sequence organization, localization and flanking sequences, ITSs can be classified into four types: 1) short ITSs, 2) subtelomeric ITSs, 3) fusion ITSs, and 4) heterochromatic ITSs. The first three types have been described mainly in the human genome, whereas heterochromatic ITSs have been found in several vertebrate species but not in humans. Several lines of evidence suggest that ITSs play a significant role in genome instability and evolution. This review aims to summarize our current knowledge about the origin, function, instability and evolution of these telomeric-like repeats in vertebrate chromosomes. Copyright © 2017 Elsevier B.V. All rights reserved.
Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine
2009-01-01
Clustered regularly interspaced short palindromic repeats (CRISPRs) are DNA sequences composed of a succession of repeats (23- to 47-bp long) separated by unique sequences called spacers. Polymorphism can be observed in different strains of a species and may be used for genotyping. We describe protocols and bioinformatics tools that allow the identification of CRISPRs from sequenced genomes, their comparison, and their component determination (the direct repeats and the spacers). A schematic representation of the spacer organization can be produced, allowing an easy comparison between strains.
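The component determination described above (separating direct repeats from spacers, then comparing strains) can be sketched as follows. This is a toy illustration under strong assumptions: the direct repeat sequence is already known and perfectly conserved within the locus, and all sequences and function names here are made up rather than taken from the described tools.

```python
def extract_spacers(locus, repeat):
    """Return the spacers lying between consecutive copies of a known direct repeat.

    Assumption: every repeat copy in the locus is an exact match to `repeat`.
    Sequence before the first copy and after the last copy is treated as flanking.
    """
    parts = locus.upper().split(repeat.upper())
    return parts[1:-1]


def shared_spacers(spacers_a, spacers_b):
    """Spacers present in both strains, in the order they appear in strain A."""
    in_b = set(spacers_b)
    return [s for s in spacers_a if s in in_b]


# Hypothetical 24-bp direct repeat (within the 23- to 47-bp range quoted above).
REPEAT = "GTTCCATCGACCTGAAAGGTCAAC"
strain_a = "TT" + REPEAT + "ACGTACGTAC" + REPEAT + "GGGTTTCCCAAA" + REPEAT + "AA"
strain_b = "CC" + REPEAT + "GGGTTTCCCAAA" + REPEAT + "TTTTGGGGCCCC" + REPEAT + "GG"

spacers_a = extract_spacers(strain_a, REPEAT)
spacers_b = extract_spacers(strain_b, REPEAT)
common = shared_spacers(spacers_a, spacers_b)
```

Listing each strain's spacers in order, as here, is one simple way to obtain the kind of schematic spacer comparison the abstract mentions.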
Mapping Simple Repeated DNA Sequences in Heterochromatin of Drosophila Melanogaster
Lohe, A. R.; Hilliker, A. J.; Roberts, P. A.
1993-01-01
Heterochromatin in Drosophila has unusual genetic, cytological and molecular properties. Highly repeated DNA sequences (satellites) are the principal component of heterochromatin. Using probes from cloned satellites, we have constructed a chromosome map of 10 highly repeated, simple DNA sequences in heterochromatin of mitotic chromosomes of Drosophila melanogaster. Despite extensive sequence homology among some satellites, chromosomal locations could be distinguished by stringent in situ hybridizations for each satellite. Only two of the localizations previously determined using gradient-purified bulk satellite probes are correct. Eight new satellite localizations are presented, providing a megabase-level chromosome map of one-quarter of the genome. Five major satellites each exhibit a multichromosome distribution, and five minor satellites hybridize to single sites on the Y chromosome. Satellites closely related in sequence are often located near one another on the same chromosome. About 80% of Y chromosome DNA is composed of nine simple repeated sequences, in particular (AAGAC)n (8 Mb), (AAGAG)n (7 Mb) and (AATAT)n (6 Mb). Similarly, more than 70% of the DNA in chromosome 2 heterochromatin is composed of five simple repeated sequences. We have also generated a high resolution map of satellites in chromosome 2 heterochromatin, using a series of translocation chromosomes whose breakpoints in heterochromatin were ordered by N-banding. Finally, staining and banding patterns of heterochromatic regions are correlated with the locations of specific repeated DNA sequences. The basis for the cytochemical heterogeneity in banding appears to depend exclusively on the different satellite DNAs present in heterochromatin. PMID:8375654
Gayà-Vidal, Magdalena; Dugoujon, Jean-Michel; Esteban, Esther; Athanasiadis, Georgios; Rodríguez, Armando; Villena, Mercedes; Vasquez, René; Moral, Pedro
2010-01-01
Thirty-two polymorphic Alu insertions (18 autosomal and 14 from the X chromosome) were studied in 192 individuals from two Amerindian populations of the Bolivian Altiplano (Aymara and Quechua speakers: the two main Andean linguistic groups), to provide relevant information about their genetic relationships and demographic processes. The main objective was to determine from genetic data whether the expansion of the Quechua language into Bolivia could be associated with demographic (Inca migration of Quechua-speakers from Peru into Bolivia) or cultural (language imposition by the Inca Empire) processes. Allele frequencies were used to assess the genetic relationships between these two linguistic groups. Our results indicated that the two Bolivian samples showed a high genetic similarity for both sets of markers and were clearly differentiated from the two Peruvian Quechua samples available in the literature. Additionally, our data were compared with the available literature to determine the genetic and linguistic structure, and East-West differentiation in South America. The close genetic relationship between the two Bolivian samples and their differentiation from the Quechua-speakers from Peru suggests that the Quechua language expansion in Bolivia took place without any important demographic contribution. Moreover, no clear geographical or linguistic structure was found for the Alu variation among South Amerindians. (c) 2009 Wiley-Liss, Inc.
Cech, Jennifer N; Peichel, Catherine L
2015-12-01
Centromere sequences exist as gaps in many genome assemblies due to their repetitive nature. Here we take an unbiased approach utilizing centromere protein A (CENP-A) chromatin immunoprecipitation followed by high-throughput sequencing to identify the centromeric repeat sequence in the threespine stickleback fish (Gasterosteus aculeatus). A 186-bp, AT-rich repeat was validated as centromeric using both fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on interphase nuclei and metaphase spreads. This repeat hybridizes strongly to the centromere on all chromosomes, with the exception of weak hybridization to the Y chromosome. Together, our work provides the first validated sequence information for the threespine stickleback centromere.
2012-01-01
Background Staphylococcus aureus Repeat (STAR) elements are a type of interspersed intergenic direct repeat. In this study the conservation and variation in these elements was explored by bioinformatic analyses of published staphylococcal genome sequences and through sequencing of specific STAR element loci from a large set of S. aureus isolates. Results Using bioinformatic analyses, we found that the STAR elements were located in different genomic loci within each staphylococcal species. There was no correlation between the number of STAR elements in each genome and the evolutionary relatedness of staphylococcal species, however higher levels of repeats were observed in both S. aureus and S. lugdunensis compared to other staphylococcal species. Unexpectedly, sequencing of the internal spacer sequences of individual repeat elements from multiple isolates showed conservation at the sequence level within deep evolutionary lineages of S. aureus. Whilst individual STAR element loci were demonstrated to expand and contract, the sequences associated with each locus were stable and distinct from one another. Conclusions The high degree of lineage and locus-specific conservation of these intergenic repeat regions suggests that STAR elements are maintained due to selective or molecular forces with some of these elements having an important role in cell physiology. The high prevalence in two of the more virulent staphylococcal species is indicative of a potential role for STAR elements in pathogenesis. PMID:23020678
Repeatless and repeat-based centromeres in potato: implications for centromere evolution.
Gong, Zhiyun; Wu, Yufeng; Koblízková, Andrea; Torres, Giovana A; Wang, Kai; Iovene, Marina; Neumann, Pavel; Zhang, Wenli; Novák, Petr; Buell, C Robin; Macas, Jirí; Jiang, Jiming
2012-09-01
Centromeres in most higher eukaryotes are composed of long arrays of satellite repeats. By contrast, most newly formed centromeres (neocentromeres) do not contain satellite repeats and instead include DNA sequences representative of the genome. An unknown question in centromere evolution is how satellite repeat-based centromeres evolve from neocentromeres. We conducted a genome-wide characterization of sequences associated with CENH3 nucleosomes in potato (Solanum tuberosum). Five potato centromeres (Cen4, Cen6, Cen10, Cen11, and Cen12) consisted primarily of single- or low-copy DNA sequences. No satellite repeats were identified in these five centromeres. At least one transcribed gene was associated with CENH3 nucleosomes. Thus, these five centromeres structurally resemble neocentromeres. By contrast, six potato centromeres (Cen1, Cen2, Cen3, Cen5, Cen7, and Cen8) contained megabase-sized satellite repeat arrays that are unique to individual centromeres. The satellite repeat arrays likely span the entire functional cores of these six centromeres. At least four of the centromeric repeats were amplified from retrotransposon-related sequences and were not detected in Solanum species closely related to potato. The presence of two distinct types of centromeres, coupled with the boom-and-bust cycles of centromeric satellite repeats in Solanum species, suggests that repeat-based centromeres can rapidly evolve from neocentromeres by de novo amplification and insertion of satellite repeats in the CENH3 domains.
Repeatless and Repeat-Based Centromeres in Potato: Implications for Centromere Evolution
Gong, Zhiyun; Wu, Yufeng; Koblížková, Andrea; Torres, Giovana A.; Wang, Kai; Iovene, Marina; Neumann, Pavel; Zhang, Wenli; Novák, Petr; Buell, C. Robin; Macas, Jiří; Jiang, Jiming
2012-01-01
Centromeres in most higher eukaryotes are composed of long arrays of satellite repeats. By contrast, most newly formed centromeres (neocentromeres) do not contain satellite repeats and instead include DNA sequences representative of the genome. An unknown question in centromere evolution is how satellite repeat-based centromeres evolve from neocentromeres. We conducted a genome-wide characterization of sequences associated with CENH3 nucleosomes in potato (Solanum tuberosum). Five potato centromeres (Cen4, Cen6, Cen10, Cen11, and Cen12) consisted primarily of single- or low-copy DNA sequences. No satellite repeats were identified in these five centromeres. At least one transcribed gene was associated with CENH3 nucleosomes. Thus, these five centromeres structurally resemble neocentromeres. By contrast, six potato centromeres (Cen1, Cen2, Cen3, Cen5, Cen7, and Cen8) contained megabase-sized satellite repeat arrays that are unique to individual centromeres. The satellite repeat arrays likely span the entire functional cores of these six centromeres. At least four of the centromeric repeats were amplified from retrotransposon-related sequences and were not detected in Solanum species closely related to potato. The presence of two distinct types of centromeres, coupled with the boom-and-bust cycles of centromeric satellite repeats in Solanum species, suggests that repeat-based centromeres can rapidly evolve from neocentromeres by de novo amplification and insertion of satellite repeats in the CENH3 domains. PMID:22968715
Mlinarec, Jelena; Chester, Mike; Siljak-Yakovlev, Sonja; Papes, Drazena; Leitch, Andrew R; Besendorfer, Visnja
2009-01-01
The structure, abundance and location of repetitive DNA sequences on chromosomes can characterize the nature of higher plant genomes. Here we report on three new repeat DNA families isolated from Anemone hortensis L.; (i) AhTR1, a family of satellite DNA (stDNA) composed of a 554-561 bp long EcoRV monomer; (ii) AhTR2, a stDNA family composed of a 743 bp long HindIII monomer and; (iii) AhDR, a repeat family composed of a 945 bp long HindIII fragment that exhibits some sequence similarity to Ty3/gypsy-like retroelements. Fluorescence in-situ hybridization (FISH) to metaphase chromosomes of A. hortensis (2n = 16) revealed that both AhTR1 and AhTR2 sequences co-localized with DAPI-positive AT-rich heterochromatic regions. AhTR1 sequences occur at intercalary DAPI bands while AhTR2 sequences occur at 8-10 terminally located heterochromatic blocks. In contrast AhDR sequences are dispersed over all chromosomes as expected of a Ty3/gypsy-like element. AhTR2 and AhTR1 repeat families include polyA- and polyT-tracks, AT/TA-motifs and a pentanucleotide sequence (CAAAA) that may have consequences for chromatin packing and sequence homogeneity. AhTR2 repeats also contain TTTAGGG motifs and degenerate variants. We suggest that they arose by interspersion of telomeric repeats with subtelomeric repeats, before hybrid unit(s) amplified through the heterochromatic domain. The three repetitive DNA families together occupy approximately 10% of the A. hortensis genome. Comparative analyses of eight Anemone species revealed that the divergence of the A. hortensis genome was accompanied by considerable modification and/or amplification of repeats.
Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using
Weier, H.U.G.; Gray, J.W.
1995-06-27
A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.
Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using
Weier, Heinz-Ulrich G.; Gray, Joe W.
1995-01-01
A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.
NASA Astrophysics Data System (ADS)
Welch, E.; Dulai, H.; El-Kadi, A. I.; Shuler, C. K.
2017-12-01
To examine contaminant transport paths, groundwater and surface water interactions were investigated as a vector of pesticide migration on the island of Tutuila in American Samoa. During a field campaign in summer 2016, water from wells, springs, and streams was collected across the island to analyze for selected pesticides. In addition, a detailed watershed study, involving sampling along the mountain-to-ocean gradient, was conducted in Faga`alu, a U.S. Coral Reef Task Force priority watershed that drains into the Pago Pago Harbor. Samples were screened at the University of Hawai`i for multiple agricultural chemicals using the ELISA method. The pesticides analyzed include glyphosate, azoxystrobin, imidacloprid and DDT/DDE. Field data were integrated into a MODFLOW-based groundwater model of the Faga`alu watershed to reconstruct flow paths, solute concentrations, and dispersion of the analytes. In combination with land-use maps, these tools were used to identify potential pesticide sources and their contaminant contributions. Across the island, pesticide concentrations were well below EPA regulated limits and azoxystrobin was absent. Glyphosate was detected in 56% of collected groundwater samples and 62% of collected stream samples. Imidacloprid was detected in 72% and 36% of these samples, respectively, and DDT/DDE in 98% and 97%. The highest observed concentration of glyphosate was 0.3 ppb, of imidacloprid was 0.17 ppb, and of DDT was 3.7 ppb. The persistence and ubiquity of DDT/DDE in surface and groundwater since its last island-wide application decades ago is notable. Groundwater flow paths modeled by MODFLOW imply that glyphosate sources match documented agricultural land-use areas. Groundwater-derived pesticide fluxes to the reef in Faga`alu are 977 mg/d of glyphosate and 1642 mg/d of DDT/DDE. 
Our study shows that pesticides are transported not only via surface runoff, but also via groundwater through the stream's base flow and are exiting the aquifer via submarine groundwater discharge (SGD) in the coastal region as well.
De novo identification of highly diverged protein repeats by probabilistic consistency.
Biegert, A; Söding, J
2008-03-15
An estimated 25% of all eukaryotic proteins contain repeats, which underlines the importance of duplication for evolving new protein functions. Internal repeats often correspond to structural or functional units in proteins. Methods capable of identifying diverged repeated segments or domains at the sequence level can therefore assist in predicting domain structures, inferring hypotheses about function and mechanism, and investigating the evolution of proteins from smaller fragments. We present HHrepID, a method for the de novo identification of repeats in protein sequences. It is able to detect the sequence signature of structural repeats in many proteins that have not yet been known to possess internal sequence symmetry, such as outer membrane beta-barrels. HHrepID uses HMM-HMM comparison to exploit evolutionary information in the form of multiple sequence alignments of homologs. In contrast to a previous method, the new method (1) generates a multiple alignment of repeats; (2) utilizes the transitive nature of homology through a novel merging procedure with fully probabilistic treatment of alignments; (3) improves alignment quality through an algorithm that maximizes the expected accuracy; (4) is able to identify different kinds of repeats within complex architectures by a probabilistic domain boundary detection method and (5) improves sensitivity through a new approach to assess statistical significance. Server: http://toolkit.tuebingen.mpg.de/hhrepid; Executables: ftp://ftp.tuebingen.mpg.de/pub/protevo/HHrepID
Pineda, Gina M; Montgomery, Anne H; Thompson, Robyn; Indest, Brooke; Carroll, Marion; Sinha, Sudhir K
2014-11-01
There is a constant need in forensic casework laboratories for an improved way to increase the first-pass success rate of forensic samples. The recent advances in mini STR analysis, SNP, and Alu marker systems have now made it possible to analyze highly compromised samples, yet few tools are available that can simultaneously provide an assessment of quantity, inhibition, and degradation in a sample prior to genotyping. Currently there are several different approaches used for fluorescence-based quantification assays which provide a measure of quantity and inhibition. However, a system which can also assess the extent of degradation in a forensic sample will be a useful tool for DNA analysts. Possessing this information prior to genotyping will allow an analyst to more informatively make downstream decisions for the successful typing of a forensic sample without unnecessarily consuming DNA extract. Real-time PCR provides a reliable method for determining the amount and quality of amplifiable DNA in a biological sample. Alu are Short Interspersed Elements (SINE), approximately 300bp insertions which are distributed throughout the human genome in large copy number. The use of an internal primer to amplify a segment of an Alu element allows for human specificity as well as high sensitivity when compared to a single copy target. The advantage of an Alu system is the presence of a large number (>1000) of fixed insertions in every human genome, which minimizes the individual specific variation possible when using a multi-copy target quantification system. This study utilizes two independent retrotransposon genomic targets to obtain quantification of an 80bp "short" DNA fragment and a 207bp "long" DNA fragment in a degraded DNA sample in the multiplex system InnoQuant™. The ratio of the two quantitation values provides a "Degradation Index", or a qualitative measure of a sample's extent of degradation. 
The Degradation Index was found to be predictive of the observed loss of STR markers and alleles as degradation increases. Use of a synthetic target as an internal positive control (IPC) provides an additional assessment for the presence of PCR inhibitors in the test sample. In conclusion, a DNA based qualitative/quantitative/inhibition assessment system that accurately predicts the status of a biological sample, will be a valuable tool for deciding which DNA test kit to utilize and how much target DNA to use, when processing compromised forensic samples for DNA testing. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
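The ratio described in the abstract above can be illustrated with a minimal sketch. The abstract does not state which quantity is the numerator; the sketch assumes short-target concentration divided by long-target concentration, so that a larger value indicates more degradation. The function name, parameter names, and units are hypothetical and are not taken from the InnoQuant kit.

```python
def degradation_index(short_conc, long_conc):
    """Ratio of the short-target to the long-target quantitation value.

    Assumption: both values are DNA concentrations in the same units
    (for example ng/uL), and the index is short / long.
    """
    if short_conc < 0 or long_conc < 0:
        raise ValueError("concentrations must be non-negative")
    if long_conc == 0:
        # No amplifiable long fragments: treat the sample as fully degraded.
        return float("inf")
    return short_conc / long_conc


# Hypothetical sample: 1.0 ng/uL from the short target, 0.25 ng/uL from the long target.
example_index = degradation_index(1.0, 0.25)
```

Under this assumed convention, an intact sample would give a value near 1, and the value rises as long fragments are lost.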
Detecting and Characterizing Repeating Earthquake Sequences During Volcanic Eruptions
NASA Astrophysics Data System (ADS)
Tepp, G.; Haney, M. M.; Wech, A.
2017-12-01
A major challenge in volcano seismology is forecasting eruptions. Repeating earthquake sequences often precede volcanic eruptions or lava dome activity, providing an opportunity for short-term eruption forecasting. Automatic detection of these sequences can lead to timely eruption notification and aid in continuous monitoring of volcanic systems. However, repeating earthquake sequences may also occur after eruptions or along with magma intrusions that do not immediately lead to an eruption. This additional challenge requires a better understanding of the processes involved in producing these sequences to distinguish those that are precursory. Calculation of the inverse moment rate and concepts from the material failure forecast method can lead to such insights. The temporal evolution of the inverse moment rate is observed to differ for precursory and non-precursory sequences, and multiple earthquake sequences may occur concurrently. These observations suggest that sequences may occur in different locations or through different processes. We developed an automated repeating earthquake sequence detector and near real-time alarm to send alerts when an in-progress sequence is identified. Near real-time inverse moment rate measurements can further improve our ability to forecast eruptions by allowing for characterization of sequences. We apply the detector to eruptions of two Alaskan volcanoes: Bogoslof in 2016-2017 and Redoubt Volcano in 2009. The Bogoslof eruption produced almost 40 repeating earthquake sequences between its start in mid-December 2016 and early June 2017, 21 of which preceded an explosive eruption, and 2 sequences in the months before eruptive activity. Three of the sequences occurred after the implementation of the alarm in late March 2017 and successfully triggered alerts. The nearest seismometers to Bogoslof are over 45 km away, requiring a detector that can work with few stations and a relatively low signal-to-noise ratio. 
During the Redoubt eruption, earthquake sequences were observed in the months leading up to the eruptive activity beginning in March 2009 as well as immediately preceding 7 of the 19 explosive events. In contrast to Bogoslof, Redoubt has a local monitoring network which allows for better detection and more detailed analysis of the repeating earthquake sequences.
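As a rough illustration of how an inverse-rate measurement can feed a forecast, the sketch below uses the simplest form of the material failure forecast method: if the inverse rate falls approximately linearly with time, the time at which the fitted line reaches zero is taken as the forecast. This is an assumption made for illustration, not the detector or alarm described in the abstract; the function name and the sample numbers are hypothetical.

```python
def forecast_failure_time(times, inverse_rates):
    """Fit a least-squares line to inverse rate vs. time and return its zero crossing.

    Assumption: the inverse rate decays linearly, which is the simplest
    case of the material failure forecast method.
    """
    n = len(times)
    if n < 2 or n != len(inverse_rates):
        raise ValueError("need two or more paired samples")
    mean_t = sum(times) / n
    mean_y = sum(inverse_rates) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("times must not all be equal")
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, inverse_rates))
    slope = sxy / sxx
    if slope >= 0:
        # Inverse rate is flat or rising: no acceleration, so no forecast.
        raise ValueError("inverse rate is not decreasing")
    intercept = mean_y - slope * mean_t
    return -intercept / slope


# Hypothetical measurements: inverse rate dropping by one unit per time step.
predicted = forecast_failure_time([0.0, 1.0, 2.0, 3.0], [4.0, 3.0, 2.0, 1.0])
```

A sequence whose inverse rate does not trend toward zero raises an error here, which loosely mirrors the distinction the abstract draws between precursory and non-precursory sequences.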
Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine
2007-01-01
Background In Archaea and Bacteria, the repeated elements called CRISPRs for "clustered regularly interspaced short palindromic repeats" are believed to participate in the defence against viruses. Short sequences called spacers are stored in-between repeated elements. In the current model, motifs comprising spacers and repeats may target an invading DNA and lead to its degradation through a proposed mechanism similar to RNA interference. Analysis of intra-species polymorphism shows that new motifs (one spacer and one repeated element) are added in a polarised fashion. Although their principal characteristics have been described, much remains to be discovered about how CRISPRs are created and evolve. As new genome sequences become available it appears necessary to develop automated scanning tools to make available CRISPRs related information and to facilitate additional investigations. Description We have produced a program, CRISPRFinder, which identifies CRISPRs and extracts the repeated and unique sequences. Using this software, a database is constructed which is automatically updated monthly from newly released genome sequences. Additional tools were created to allow the alignment of flanking sequences in search for similarities between different loci and to build dictionaries of unique sequences. To date, almost six hundred CRISPRs have been identified in 475 published genomes. Two Archaea out of thirty-seven and about half of Bacteria do not possess a CRISPR. Fine analysis of repeated sequences strongly supports the current view that new motifs are added at one end of the CRISPR adjacent to the putative promoter. Conclusion It is hoped that availability of a public database, regularly updated and which can be queried on the web will help in further dissecting and understanding CRISPR structure and flanking sequences evolution. Subsequent analyses of the intra-species CRISPR polymorphism will be facilitated by CRISPRFinder and the dictionary creator. 
CRISPRdb is accessible at PMID:17521438
2013-01-01
Background Candida albicans is a ubiquitous opportunistic fungal pathogen that afflicts immunocompromised human hosts. With rare and transient exceptions the yeast is diploid, yet despite its clinical relevance the respective sequences of its two homologous chromosomes have not been completely resolved. Results We construct a phased diploid genome assembly by deep sequencing a standard laboratory wild-type strain and a panel of strains homozygous for particular chromosomes. The assembly has 700-fold coverage on average, allowing extensive revision and expansion of the number of known SNPs and indels. This phased genome significantly enhances the sensitivity and specificity of allele-specific expression measurements by enabling pooling and cross-validation of signal across multiple polymorphic sites. Additionally, the diploid assembly reveals pervasive and unexpected patterns in allelic differences between homologous chromosomes. Firstly, we see striking clustering of indels, concentrated primarily in the repeat sequences in promoters. Secondly, both indels and their repeat-sequence substrate are enriched near replication origins. Finally, we reveal an intimate link between repeat sequences and indels, which argues that repeat length is under selective pressure for most eukaryotes. This connection is described by a concise one-parameter model that explains repeat-sequence abundance in C. albicans as a function of the indel rate, and provides a general framework to interpret repeat abundance in species ranging from bacteria to humans. Conclusions The phased genome assembly and insights into repeat plasticity will be valuable for better understanding allele-specific phenomena and genome evolution. PMID:24025428
Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies.
Wincker, P; Jubier-Maurin, V; Roizès, G
1987-01-01
Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies. PMID:3684566
Plant chromosomes from end to end: telomeres, heterochromatin and centromeres.
Lamb, Jonathan C; Yu, Weichang; Han, Fangpu; Birchler, James A
2007-04-01
Recent evidence indicates that heterochromatin in plants is composed of heterogeneous sequences, which are usually composed of transposable elements or tandem repeat arrays. These arrays are associated with chromatin modifications that produce a closed configuration that limits transcription. Centromere sequences in plants are usually composed of tandem repeat arrays that are homogenized across the genome. Analysis of such arrays in closely related taxa suggests a rapid turnover of the repeat unit that is typical of a particular species. In addition, two lines of evidence for an epigenetic component of centromere specification have been reported, namely an example of a neocentromere formed over sequences without the typical repeat array and examples of centromere inactivation. Although the telomere repeat unit is quite prevalent in the plant kingdom, unusual repeats have been found in some families. Recently, it was demonstrated that the introduction of telomere sequences into plant cells causes truncation of the chromosomes, and that this technique can be used to produce artificial chromosome platforms.
SSRscanner: a program for reporting distribution and exact location of simple sequence repeats.
Anwar, Tamanna; Khan, Asad U
2006-02-20
Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They are also found at longer lengths (> 6 repeating units) of tracts. Most of the computer programs that find SSRs do not report their exact positions. A computer program, SSRscanner, was written to report the distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search repeats of any length and produce outputs with their exact position on the chromosome and their frequency of occurrence in the sequence. This program has been written in PERL and is freely available for non-commercial users by request from the authors. Please contact the authors by E-mail: huzzi99@hotmail.com.
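The idea of reporting each SSR together with its exact position can be sketched in a few lines. This is not the authors' PERL program: it is a minimal Python illustration that finds only perfect repeats, and the function name, the 1-based inclusive coordinates, and the default thresholds (unit length 1 to 3, six or more copies) are assumptions chosen to match the ranges mentioned in the abstract.

```python
import re


def find_ssrs(seq, min_unit=1, max_unit=3, min_repeats=6):
    """Return (start, end, unit, copies) for each perfect tandem repeat.

    Coordinates are 1-based and inclusive. Only perfect repeats over
    the letters A, C, G and T are reported.
    """
    seq = seq.upper()
    hits = []
    for unit_len in range(min_unit, max_unit + 1):
        # One captured unit followed by at least (min_repeats - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_repeats - 1))
        for match in pattern.finditer(seq):
            unit = match.group(1)
            # Skip units that are copies of a shorter motif, such as "AA",
            # so the same tract is not reported under two unit lengths.
            if any(unit == unit[:k] * (unit_len // k)
                   for k in range(1, unit_len) if unit_len % k == 0):
                continue
            copies = (match.end() - match.start()) // unit_len
            hits.append((match.start() + 1, match.end(), unit, copies))
    return sorted(hits)


# Hypothetical test sequence with one dinucleotide and one mononucleotide tract.
example_hits = find_ssrs("TTG" + "CA" * 7 + "GTCC" + "A" * 8 + "GT")
```

Counting the returned tuples per unit gives the frequency of each motif, while the first two fields give its location in the input sequence.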
2010-01-01
Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. 
These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the repeat may be disseminated by HGT and intra-genomic shuffling. Conclusions We describe novel features of PARCELs (Palindromic Amphipathic Repeat Coding ELements), a set of widely distributed repeat protein domains and coding sequences that were likely acquired through HGT by diverse unicellular microbes, further mobilized and diversified within genomes, and co-opted for expression in the membrane proteome of some taxa. Disseminated by multiple gene-centric vehicles, ORFs harboring these elements enhance accessory gene pools as part of the "mobilome" connecting genomes of various clades, in taxa sharing common niches. PMID:20626840
A TALE-inspired computational screen for proteins that contain approximate tandem repeats.
Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias
2017-01-01
TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen.
A TALE-inspired computational screen for proteins that contain approximate tandem repeats
Krwawicz, Joanna
2017-01-01
TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen. PMID:28617832
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing.
Hribová, Eva; Neumann, Pavel; Matsumoto, Takashi; Roux, Nicolas; Macas, Jirí; Dolezel, Jaroslav
2010-09-16
Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. 
It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection.
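The coverage and repeat figures quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below assumes that coverage is simply sequenced bases divided by genome size, and it treats "almost 100 Mb" as exactly 100 Mb, so the implied genome size is an upper estimate rather than a value reported by the authors. All variable names are illustrative.

```python
# Figures taken from the abstract; "almost 100 Mb" is rounded up to 100 Mb here.
sequenced_mb = 100.0   # total sequence data, in Mb
coverage = 0.15        # reported genome coverage (0.15x)

# Assumption: coverage = sequenced bases / genome size.
implied_genome_mb = sequenced_mb / coverage

# Retroelement share of the genome, in percent, as reported.
ty1_copia_pct = 16
ty3_gypsy_pct = 7
retroelement_pct = ty1_copia_pct + ty3_gypsy_pct

print(f"Implied genome size: about {implied_genome_mb:.0f} Mb (upper estimate)")
print(f"Ty1/copia + Ty3/gypsy: {retroelement_pct}% of the genome")
```

Under these assumptions the two retroelement superfamilies account for most of the roughly 30% repetitive fraction described in the abstract.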
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing
2010-01-01
Background Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. Results In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. Conclusion A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. 
It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection. PMID:20846365
Alu polymorphic insertions reveal genetic structure of north Indian populations.
Tripathi, Manorama; Tripathi, Piyush; Chauhan, Ugam Kumari; Herrera, Rene J; Agrawal, Suraksha
2008-10-01
The Indian subcontinent is characterized by the ancestral and cultural diversity of its people. Genetic input from several unique source populations and from the unique social architecture provided by the caste system has shaped the current genetic landscape of India. In the present study 200 individuals each from three upper-caste and four middle-caste Hindu groups and from two Muslim populations in North India were examined for 10 polymorphic Alu insertions (PAIs). The investigated PAIs exhibit high levels of polymorphism and average heterozygosity. Limited interpopulation variance and genetic flow in the present study suggest admixture. The results of this study demonstrate that, contrary to common belief, the caste system has not provided an impermeable barrier to genetic exchange among Indian groups.
Ribeiro, Guilherme Galvarros Bueno Lobo; De Lima, Reginaldo Ramos; Wiezel, Cláudia Emília Vieira; Ferreira, Luzitano Brandão; Sousa, Sandra Mara Bispo; Rocha, Dulce Maria Sucena; Canas, Maria do Carmo Tomitão; Nardelli-Costa, Juliana; Klautau-Guimarães, Maria De Nazaré; Simões, Aguinaldo Luiz; Oliveira, Silviene Fabiana
2009-01-01
The genetic constitution of Afro-derived Brazilian populations is barely studied. To improve that knowledge, we investigated the AluYAP element and five Y-chromosome STRs (DYS19, DYS390, DYS391, DYS392, and DYS393) to estimate the ethnic male contribution to the constitution of four Brazilian quilombo remnants: Mocambo, Rio das Rãs, Kalunga, and Riacho de Sacutiaba. Results indicated significant differences among communities, corroborating historical information about the Brazilian settlement. We concluded that besides the African contribution, there was a great European participation in the constitution of these four populations and that the observed haplotype variability could be explained by gene flow to quilombo remnants and mutational events in microsatellites (STRs). (c) 2009 Wiley-Liss, Inc.
Polymorphic Alu insertions among Mayan populations.
Herrera, R J; Rojas, D P; Terreros, M C
2007-01-01
The Mayan homeland within Mesoamerica spans five countries: Belize, El Salvador, Guatemala, Honduras and Mexico. There are indications that the people we call the Maya migrated from the north to the highlands of Guatemala as early as 4000 B.C. Their existence was village-based and agricultural. The culture of these Preclassic Mayans owes much to the earlier Olmec civilization, which flourished in the southern portion of North America. In this study, four different Mayan groups were examined to assess their genetic variability. Ten polymorphic Alu insertion (PAI) loci were employed to ascertain the genetic affinities among these Mayan groups. North American, African, European and Asian populations were also examined as reference populations. Our results suggest that the Mayan groups examined in this study are not genetically homogeneous.
Optimization of sequence alignment for simple sequence repeat regions.
Jighly, Abdulqader; Hamwieh, Aladdin; Ogbonnaya, Francis C
2011-07-20
Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSRs have been used as molecular markers because they are easy to detect and are used in a range of applications, including genetic diversity, genome mapping, and marker-assisted selection. They are also very mutable because of slippage of the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDEL) mutation frequency to a high ratio, more than for other types of molecular markers such as single nucleotide polymorphisms (SNPs). SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units, which result in false evolutionary relationships. To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using a PERL script with a Tk graphical interface. This program is based on aligning sequences after first determining the repeated units and the last SSR nucleotide positions. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenetic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different lineages was observed after applying the new algorithm. The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different lineages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenetic relationship.
Do, Hoang Dang Khoa; Kim, Joo-Hwan
2017-01-01
Chloroplast genomes (cpDNA) are highly valuable resources for evolutionary studies of angiosperms, since they are highly conserved, are small in size, and play critical roles in plants. Slipped-strand mispairing (SSM) was assumed to be a mechanism for generating repeat units in cpDNA. However, research on the employment of different small repeated sequences through SSM events, which may induce the accumulation of distinct types of repeats within the same region in cpDNA, has not been documented. Here, we sequenced two chloroplast genomes from the endemic species Heloniopsis tubiflora (Korea) and Xerophyllum tenax (USA) to cover the gap between molecular data and explore "hot spots" for genomic events in Melanthiaceae. Comparative analysis of 23 complete cpDNA sequences revealed that there were different stages of deletion in the rps16 region across the Melanthiaceae. Based on the partial or complete loss of the rps16 gene in cpDNA, we report for the first time potential molecular markers for recognizing two sections (Veratrum and Fuscoveratrum) of Veratrum. Melanthiaceae exhibits a significant change in the junction between the large single copy and inverted repeat regions, ranging from trnH_GUG to a part of rps3. Our results show an accumulation of tandem repeats in the rpl23-ycf2 regions of cpDNAs. Small conserved sequences exist and flank tandem repeats in further observation of this region across most of the examined taxa of Liliales. Therefore, we propose three scenarios in which different small repeated sequences were used during SSM events to generate newly distinct types of repeats. Occasionally, prior to the SSM process, a point mutation event and double-strand break repair occurred and induced the formation of initial repeat units, which are indispensable in the SSM process. SSM has likely occurred more frequently for short repeats than for long repeat sequences in tribe Parideae (Melanthiaceae, Liliales). 
Collectively, these findings add new evidence of dynamic results from SSM in chloroplast genomes which can be useful for further evolutionary studies in angiosperms. Additionally, genomics events in cpDNA are potential resources for mining molecular markers in Liliales.
Molecular and bioinformatic analysis of the FB-NOF transposable element.
Badal, Martí; Portela, Anna; Xamena, Noel; Cabré, Oriol
2006-04-12
The Drosophila melanogaster transposable element FB-NOF is known to play a role in genome plasticity through the generation of all sorts of genomic rearrangements. Moreover, several insertional mutants due to FB mobilizations have been reported. Its structure and sequence, however, have been poorly studied, mainly as a consequence of the long, complex and repetitive sequence of FB inverted repeats. This repetitive region is composed of several 154 bp blocks, each with five almost identical repeats. In this paper, we report the sequencing process of 2 kb long FB inverted repeats of a complete FB-NOF element, with high precision and reliability. This achievement has been possible using a new map of the FB repetitive region, which identifies unambiguously each repeat with new features that can be used as landmarks. With this new vision of the element, a list of FB-NOF elements in the D. melanogaster genomic clones has been compiled, improving on previous works that used only bioinformatic algorithms. The availability of many FB and FB-NOF sequences allowed an analysis of the FB insertion sequences that showed no sequence specificity, but a preference for A/T rich sequences. The position of NOF within FB is also studied, revealing that it is always located after a second repeat in a random block. With the results of this analysis, we propose a model of transposition in which NOF jumps from FB to FB, using an unidentified transposase enzyme that should specifically recognize the second repeat end of the FB blocks.
The repetitive landscape of the chicken genome.
Wicker, Thomas; Robertson, Jon S; Schulze, Stefan R; Feltus, F Alex; Magrini, Vincent; Morrison, Jason A; Mardis, Elaine R; Wilson, Richard K; Peterson, Daniel G; Paterson, Andrew H; Ivarie, Robert
2005-01-01
Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7 x coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available.
The repetitive landscape of the chicken genome
Wicker, Thomas; Robertson, Jon S.; Schulze, Stefan R.; Feltus, F. Alex; Magrini, Vincent; Morrison, Jason A.; Mardis, Elaine R.; Wilson, Richard K.; Peterson, Daniel G.; Paterson, Andrew H.; Ivarie, Robert
2005-01-01
Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7× coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available. PMID:15256510
Danilowicz, Claudia; Hermans, Laura; Coljee, Vincent; Prévost, Chantal
2017-01-01
During DNA recombination and repair, RecA family proteins must promote rapid joining of homologous DNA. Repeated sequences with >100 base pair lengths occupy more than 1% of bacterial genomes; however, commitment to strand exchange was believed to occur after testing ∼20–30 bp. If that were true, pairings between different copies of long repeated sequences would usually become irreversible. Our experiments reveal that in the presence of ATP hydrolysis even 75 bp sequence-matched strand exchange products remain quite reversible. Experiments also indicate that when ATP hydrolysis is present, flanking heterologous dsDNA regions increase the reversibility of sequence matched strand exchange products with lengths up to ∼75 bp. Results of molecular dynamics simulations provide insight into how ATP hydrolysis destabilizes strand exchange products. These results inspired a model that shows how pairings between long repeated sequences could be efficiently rejected even though most homologous pairings form irreversible products. PMID:28854739
Trinh, T. Q.; Sinden, R. R.
1993-01-01
We describe a system to measure the frequency of both deletions and duplications between direct repeats. Short 17- and 18-bp palindromic and nonpalindromic DNA sequences were cloned into the EcoRI site within the chloramphenicol acetyltransferase gene of plasmids pBR325 and pJT7. This creates an insert between direct repeated EcoRI sites and results in a chloramphenicol-sensitive phenotype. Selection for chloramphenicol resistance was utilized to select chloramphenicol resistant revertants that included those with precise deletion of the insert from plasmid pBR325 and duplication of the insert in plasmid pJT7. The frequency of deletion or duplication varied more than 500-fold depending on the sequence of the short sequence inserted into the EcoRI site. For the nonpalindromic inserts, multiple internal direct repeats and the length of the direct repeats appear to influence the frequency of deletion. Certain palindromic DNA sequences with the potential to form DNA hairpin structures that might stabilize the misalignment of direct repeats had a high frequency of deletion. Other DNA sequences with the potential to form structures that might destabilize misalignment of direct repeats had a very low frequency of deletion. Duplication mutations occurred at the highest frequency when the DNA between the direct repeats contained no direct or inverted repeats. The presence of inverted repeats dramatically reduced the frequency of duplications. The results support the slippage-misalignment model, suggesting that misalignment occurring during DNA replication leads to deletion and duplication mutations. The results also support the idea that the formation of DNA secondary structures during DNA replication can facilitate and direct specific mutagenic events. PMID:8325478
2014-01-01
Background DNA repeats, such as transposable elements, minisatellites and palindromic sequences, are abundant in sequences and have been shown to have significant and functional roles in the evolution of the host genomes. In a previous study, we introduced the concept of a repeat DNA module, a flexible motif present in at least two occurrences in the sequences. This concept was embedded into ModuleOrganizer, a tool allowing the detection of repeat modules in a set of sequences. However, its implementation remains difficult for larger sequences. Results Here we present Visual ModuleOrganizer, a Java graphical interface that enables a new and optimized version of the ModuleOrganizer tool. To implement this version, it was recoded in C++ with compressed suffix tree data structures. This leads to less memory usage (at least a 120-fold decrease on average) and decreases the computation time by a factor of at least four during the module detection process in large sequences. The Visual ModuleOrganizer interface allows users to easily choose ModuleOrganizer parameters and to graphically display the results. Moreover, Visual ModuleOrganizer dynamically handles graphical results through four main parameters: gene annotations, overlapping modules with known annotations, location of the module in a minimal number of sequences, and the minimal length of the modules. As a case study, the analysis of FoldBack4 sequences clearly demonstrated that our tools can be extended to comparative and evolutionary analyses of any repeat sequence elements in a set of genomic sequences. With the increasing number of sequences available in public databases, it is now possible to perform comparative analyses of repeated DNA modules in a graphic and friendly manner within a reasonable time period. Availability The Visual ModuleOrganizer interface and the new version of the ModuleOrganizer tool are freely available at: http://lcb.cnrs-mrs.fr/spip.php?rubrique313. PMID:24678954
Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.
Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V
1985-09-01
The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this.
USDA-ARS's Scientific Manuscript database
Expressed sequence tag (EST) simple sequence repeats (SSRs) in Prunus were mined, and flanking primers designed and used for genome-wide characterization and selection of primers to optimize marker distribution and reliability. A total of 12,618 contigs were assembled from 84,727 ESTs, along with 34...
Microsatellite analysis in the genome of Acanthaceae: An in silico approach.
Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar
2015-01-01
Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future.
Alcivar-Warren, Acacia; Meehan-Meola, Dawn; Wang, Yongping; Guo, Ximing; Zhou, Linghua; Xiang, Jianhai; Moss, Shaun; Arce, Steve; Warren, William; Xu, Zhenkang; Bell, Kireina
2006-01-01
To develop genetic and physical maps for shrimp, accurate information on the actual number of chromosomes and a large number of genetic markers is needed. Previous reports have shown two different chromosome numbers for the Pacific whiteleg shrimp, Penaeus vannamei, the most important penaeid shrimp species cultured in the Western hemisphere. Preliminary results obtained by direct sequencing of clones from a Sau3A-digested genomic library of P. vannamei ovary identified a large number of (TAACC/GGTTA)-containing SSRs. The objectives of this study were to (1) examine the frequency of (TAACC)n repeats in 662 P. vannamei genomic clones that were directly sequenced, and perform homology searches of these clones, (2) confirm the number of chromosomes in testis of P. vannamei, and (3) localize the TAACC repeats in P. vannamei chromosome spreads using fluorescence in situ hybridization (FISH). Results for objective 1 showed that 395 out of the 662 clones sequenced contained single or multiple SSRs with three or more repeat motifs, 199 of which contained variable tandem repeats of the pentanucleotide (TAACC/GGTTA)n, with 3 to 14 copies per sequence. The frequency of (TAACC)n repeats in P. vannamei is 4.68 kb for SSRs with five or more repeat motifs. Sequence comparisons using the BLASTN nonredundant and expressed sequence tag (EST) databases indicated that most of the TAACC-containing clones were similar to either the core pentanucleotide repeat in PVPENTREP locus (GenBank accession no. X82619) or portions of 28S rRNA. Transposable elements (transposase for Tn1000 and reverse transcriptase family members), hypothetical or unnamed protein products, and genes of known function such as 18S and 28S rRNAs, heat shock protein 70, and thrombospondin were identified in non-TAACC-containing clones. For objective 2, the meiotic chromosome number of P. vannamei was confirmed as N = 44. 
For objective 3, four FISH probes (P1 to P4) containing different numbers of TAACC repeats produced positive signals on telomeres of P. vannamei chromosomes. A few chromosomes had positive signals interstitially. Probe signal strength and chromosome coverage differed in the general order of P1>P2>P3>P4, which correlated with the length of TAACC repeats within the probes: 83, 66, 35, and 30 bp, respectively, suggesting that the TAACC repeats, and not the flanking sequences, produced the TAACC signals at chromosome ends and TAACC is likely the telomere sequence for P. vannamei.
Kim, Min Jee; Im, Hyun Hwak; Lee, Kwang Youll; Han, Yeon Soo; Kim, Iksoo
2014-06-01
The complete nucleotide sequence of the mitochondrial genome from the white-spotted flower chafer, Protaetia brevitarsis (Coleoptera: Scarabaeidae), was determined. The 20,319-bp long circular genome is the longest among completely sequenced Coleoptera. As is typical in animals, the P. brevitarsis genome consisted of two ribosomal RNAs, 22 transfer RNAs, 13 protein-coding genes and one A + T-rich region. Although the size of the coding genes was typical, the non-coding A + T-rich region was 5654 bp, which is the longest in insects. The extraordinary length of this region was composed of 28 117-bp tandem repeats and seven 82-bp tandem repeats. These repeat sequences were encompassed by three non-repeat sequences constituting 1804 bp.
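The size of the A + T-rich region can be checked against its stated components. The sketch below reads the repeat description as 28 copies of a 117-bp unit plus seven copies of an 82-bp unit; this reading is an interpretation of the abstract, supported by the fact that it reproduces the reported 5654 bp exactly. Variable names are illustrative.

```python
# Component sizes, reading the abstract as 28 x 117-bp and 7 x 82-bp repeats
# (an interpretation, not a figure confirmed from the full paper).
long_repeat_bp = 28 * 117    # 28 copies of the 117-bp unit
short_repeat_bp = 7 * 82     # seven copies of the 82-bp unit
non_repeat_bp = 1804         # three non-repeat segments combined

at_rich_region_bp = long_repeat_bp + short_repeat_bp + non_repeat_bp
print(at_rich_region_bp)     # compare with the reported 5654 bp
```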
Parson, Walther; Ballard, David; Budowle, Bruce; Butler, John M; Gettings, Katherine B; Gill, Peter; Gusmão, Leonor; Hares, Douglas R; Irwin, Jodi A; King, Jonathan L; Knijff, Peter de; Morling, Niels; Prinz, Mechthild; Schneider, Peter M; Neste, Christophe Van; Willuweit, Sascha; Phillips, Christopher
2016-05-01
The DNA Commission of the International Society for Forensic Genetics (ISFG) is reviewing factors that need to be considered ahead of the adoption by the forensic community of short tandem repeat (STR) genotyping by massively parallel sequencing (MPS) technologies. MPS produces sequence data that provide a precise description of the repeat allele structure of a STR marker and variants that may reside in the flanking areas of the repeat region. When a STR contains a complex arrangement of repeat motifs, the level of genetic polymorphism revealed by the sequence data can increase substantially. As repeat structures can be complex and include substitutions, insertions, deletions, variable tandem repeat arrangements of multiple nucleotide motifs, and flanking region SNPs, established capillary electrophoresis (CE) allele descriptions must be supplemented by a new system of STR allele nomenclature, which retains backward compatibility with the CE data that currently populate national DNA databases and that will continue to be produced for the coming years. Thus, there is a pressing need to produce a standardized framework for describing complex sequences that enable comparison with currently used repeat allele nomenclature derived from conventional CE systems. It is important to discern three levels of information in hierarchical order (i) the sequence, (ii) the alignment, and (iii) the nomenclature of STR sequence data. We propose a sequence (text) string format the minimal requirement of data storage that laboratories should follow when adopting MPS of STRs. We further discuss the variant annotation and sequence comparison framework necessary to maintain compatibility among established and future data. This system must be easy to use and interpret by the DNA specialist, based on a universally accessible genome assembly, and in place before the uptake of MPS by the general forensic community starts to generate sequence data on a large scale. 
While the established nomenclature for CE-based STR analysis will remain unchanged in the future, the nomenclature of sequence-based STR genotypes will need to follow updated rules and be generated by expert systems that translate MPS sequences to match CE conventions in order to guarantee compatibility between the different generations of STR data. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
SSRscanner: a program for reporting distribution and exact location of simple sequence repeats
Anwar, Tamanna; Khan, Asad U
2006-01-01
Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker-assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They are also found at longer lengths (> 6 repeating units) of tracts. Most of the computer programs that find SSRs do not report their exact positions. A computer program, SSRscanner, was written to find the distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search repeats of any length and produce outputs with their exact position on the chromosome and their frequency of occurrence in the sequence. Availability This program has been written in PERL and is freely available for non-commercial users by request from the authors. Please contact the authors by E-mail: huzzi99@hotmail.com PMID:17597863
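The idea of reporting the exact position of each SSR can be illustrated with a short regular-expression scan. This is a minimal Python sketch, not the SSRscanner program itself (which the abstract says is written in PERL); the function name `find_ssrs`, the six-base unit limit, and the three-copy minimum are all assumptions chosen for illustration, and only perfect (uninterrupted) repeats are detected.

```python
import re

def find_ssrs(seq, max_unit=6, min_repeats=3):
    """Return (start, end, unit, copies) for each perfect tandem repeat.

    start is 1-based and end is inclusive. Hypothetical helper for illustration.
    """
    seq = seq.upper()
    # The lazy group captures the shortest candidate unit; the backreference
    # then requires at least (min_repeats - 1) further adjacent copies.
    pattern = re.compile(r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_repeats - 1))
    hits = []
    for m in pattern.finditer(seq):
        unit = m.group(1)
        copies = len(m.group(0)) // len(unit)
        hits.append((m.start() + 1, m.end(), unit, copies))
    return hits

# Example: a dinucleotide tract followed by a pentanucleotide tract.
example = "GGC" + "AT" * 4 + "GG" + "TAACC" * 3 + "G"
for start, end, unit, copies in find_ssrs(example):
    print(f"({unit}){copies} at {start}-{end}")
```

Because the scan is non-overlapping and left to right, each base is assigned to at most one reported repeat; a real tool would also need to handle imperfect repeats and both strands.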
Structure and stability of the ankyrin domain of the Drosophila Notch receptor.
Zweifel, Mark E; Leahy, Daniel J; Hughson, Frederick M; Barrick, Doug
2003-11-01
The Notch receptor contains a conserved ankyrin repeat domain that is required for Notch-mediated signal transduction. The ankyrin domain of Drosophila Notch contains six ankyrin sequence repeats previously identified as closely matching the ankyrin repeat consensus sequence, and a putative seventh C-terminal sequence repeat that exhibits lower similarity to the consensus sequence. To better understand the role of the Notch ankyrin domain in Notch-mediated signaling and to examine how structure is distributed among the seven ankyrin sequence repeats, we have determined the crystal structure of this domain to 2.0 angstroms resolution. The seventh, C-terminal, ankyrin sequence repeat adopts a regular ankyrin fold, but the first, N-terminal ankyrin repeat, which contains a 15-residue insertion, appears to be largely disordered. The structure reveals a substantial interface between ankyrin polypeptides, showing a high degree of shape and charge complementarity, which may be related to homotypic interactions suggested from indirect studies. However, the Notch ankyrin domain remains largely monomeric in solution, demonstrating that this interface alone is not sufficient to promote tight association. Using the structure, we have classified reported mutations within the Notch ankyrin domain that are known to disrupt signaling into those that affect buried residues and those restricted to surface residues. We show that the buried substitutions greatly decrease protein stability, whereas the surface substitutions have only a marginal effect on stability. The surface substitutions are thus likely to interfere with Notch signaling by disrupting specific Notch-effector interactions and map the sites of these interactions.
Park, Eonyoung; Maquat, Lynne E
2013-01-01
Staufen1 (STAU1)-mediated mRNA decay (SMD) is an mRNA degradation process in mammalian cells that is mediated by the binding of STAU1 to a STAU1-binding site (SBS) within the 3'-untranslated region (3'-UTR) of target mRNAs. During SMD, STAU1, a double-stranded (ds) RNA-binding protein, recognizes dsRNA structures formed either by intramolecular base pairing of 3'-UTR sequences or by intermolecular base pairing of 3'-UTR sequences with a long-noncoding RNA (lncRNA) via partially complementary Alu elements. Recently, STAU2, a paralog of STAU1, has also been reported to mediate SMD. Both STAU1 and STAU2 interact directly with the ATP-dependent RNA helicase UPF1, a key SMD factor, enhancing its helicase activity to promote effective SMD. Moreover, STAU1 and STAU2 form homodimeric and heterodimeric interactions via domain-swapping. Because both SMD and the mechanistically related nonsense-mediated mRNA decay (NMD) employ UPF1, SMD and NMD are competitive pathways. Competition contributes to cellular differentiation processes, such as myogenesis and adipogenesis, placing SMD at the heart of various physiologically important mechanisms. Copyright © 2013 John Wiley & Sons, Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Micklos, David A.
2006-10-30
This project achieved its goal of implementing a nationwide training program to introduce high school biology teachers to the key uses and societal implications of human DNA polymorphisms. The 2.5-day workshop introduced high school biology faculty to a laboratory-based unit on human DNA polymorphisms – which provides a uniquely personal perspective on the science and Ethical, Legal and Social Implications (ELSI) of the Human Genome Project. As proposed, 12 workshops were conducted at venues across the United States. The workshops were attended by 256 high school faculty, exceeding proposed attendance of 240 by 7%. Each workshop mixed theoretical, laboratory, and computer work with practical and ethical implications. Program participants learned simplified lab techniques for amplifying three types of chromosomal polymorphisms: an Alu insertion (PV92), a VNTR (pMCT118/D1S80), and single nucleotide polymorphisms (SNPs) in the mitochondrial control region. These polymorphisms illustrate the use of DNA variations in disease diagnosis, forensic biology, and identity testing - and provide a starting point for discussing the uses and potential abuses of genetic technology. Participants also learned how to use their Alu and mitochondrial data as an entrée to human population genetics and evolution. Our work to simplify lab techniques for amplifying human DNA polymorphisms in educational settings culminated with the release in 1998 of three Advanced Technology (AT) PCR kits by Carolina Biological Supply Company, the nation’s oldest educational science supplier. The kits use a simple 30-minute method to isolate template DNA from hair sheaths or buccal cells and streamlined PCR chemistry based on Pharmacia Ready-To-Go Beads, which incorporate Taq polymerase, deoxynucleotide triphosphates, and buffer in a freeze-dried pellet. These kits have greatly simplified teacher implementation of human PCR labs, and their use is growing at a rapid pace. 
Sales of human polymorphism kits by Carolina Biological rose from 700 units in 1999 to 1,132 in 2000 – a 62% increase. Competing kits using the Alu system, and based substantially on our earlier work, are also marketed by Biorad and Edvotek. In parallel with the lab experiments, we developed a suite of database/statistical applications and easy-to-use interfaces that allow students to use their own DNA data to explore human population genetics and to test theories of human evolution. Database searches and statistical analyses are launched from a centralized workspace. Workshop participants were introduced to these and other resources available at the DNALC WWW site (http://vector.cshl.org/bioserver/): 1) Allele Server tests Hardy-Weinberg equilibrium and statistically compares PV92 data from world populations. 2) Sequence Server uses DNA sequence data to search Genbank using BLASTN, compare sequences using CLUSTALW, and create phylogenetic trees using PHYLIP. 3) Simulation Server uses a Monte Carlo generator to model the long-term effects of drift, selection, and population bottlenecks. By targeting motivated and innovative biology faculty, we believe that this project offered a cost-effective means to bring high school biology education up-to-the-minute with genomic biology. The workshop reached a target audience of highly professional faculty who have already implemented hands-on labs in molecular genetics and many of whom offer laboratory electives in biotechnology. Many attend professional meetings, develop curriculum, collaborate with scientists, teach faculty workshops, and manage equipment-sharing programs. These individuals are life-long learners, anxious for deeper insight and additional training to further extend their leadership. This contention was supported by data from a mail survey, conducted in February-March 2000 and 2001, of 256 faculty who participated in workshops conducted during the current term of DOE support. 
Seventy percent of participants responded, providing direct reports on how their teaching behavior had changed since taking the DOE workshop. About nine of ten respondents said they had provided new classroom materials and first-hand accounts of DNA typing, sequencing, or PCR. Three-fourths had introduced new units on human molecular genetics. Most strikingly, half had students use PCR to amplify their own insertion polymorphisms (PV92), and better than one-fourth amplified a VNTR polymorphism and the mitochondrial control region. One in five had mitochondrial DNA sequenced by the DNALC Sequencing Service. A majority (58%) used online materials at the DNALC WWW site, and 28% analyzed student polymorphism data with Bioservers at the DNALC site. A majority (58%) assisted other faculty with student labs on polymorphisms, reaching an additional 786 teachers.
The Zinc-Finger Antiviral Protein ZAP Inhibits LINE and Alu Retrotransposition
Moldovan, John B.; Moran, John V.
2015-01-01
Long INterspersed Element-1 (LINE-1 or L1) is the only active autonomous retrotransposon in the human genome. To investigate the interplay between the L1 retrotransposition machinery and the host cell, we used co-immunoprecipitation in conjunction with liquid chromatography and tandem mass spectrometry to identify cellular proteins that interact with the L1 first open reading frame-encoded protein, ORF1p. We identified 39 ORF1p-interacting candidate proteins including the zinc-finger antiviral protein (ZAP or ZC3HAV1). Here we show that the interaction between ZAP and ORF1p requires RNA and that ZAP overexpression in HeLa cells inhibits the retrotransposition of engineered human L1 and Alu elements, an engineered mouse L1, and an engineered zebrafish LINE-2 element. Consistently, siRNA-mediated depletion of endogenous ZAP in HeLa cells led to a ~2-fold increase in human L1 retrotransposition. Fluorescence microscopy in cultured human cells demonstrated that ZAP co-localizes with L1 RNA, ORF1p, and stress granule associated proteins in cytoplasmic foci. Finally, molecular genetic and biochemical analyses indicate that ZAP reduces the accumulation of full-length L1 RNA and the L1-encoded proteins, yielding mechanistic insight about how ZAP may inhibit L1 retrotransposition. Together, these data suggest that ZAP inhibits the retrotransposition of LINE and Alu elements. PMID:25951186
Feng, Jiang; Gang, Feng; Li, Xiao; Jin, Tang; Houbao, Huang; Yu, Cao; Guorong, Li
2013-08-01
To investigate whether plasma cell-free DNA (cfDNA) or its integrity could differentiate prostate cancer from benign prostate hyperplasia (BPH) in patients with serum prostate-specific antigen (PSA) ≥ 4 ng/ml. Ninety-six patients with prostate cancer and 112 patients with BPH were enrolled. cfDNA levels in plasma before prostate biopsy were quantified by real-time PCR amplification of the ALU gene (product size of 115 bp), and the quantitative ratio of ALU (247 bp) to ALU (115 bp) reflected the integrity of cfDNA. In patients with serum PSA ≥ 4 ng/ml, there were significant differences in plasma cfDNA and its integrity between the patients with prostate cancer (19.74 ± 4.43, 0.34 ± 0.05) and patients with BPH (7.36 ± 1.58, 0.19 ± 0.03; P < 0.001, P < 0.001). Prostate cancer could be differentiated with a sensitivity of 73.2% and a specificity of 72.7% by cfDNA (AUC = 0.864). The integrity of cfDNA had a sensitivity of 81.7% and a specificity of 78.8% for distinguishing prostate cancer from BPH (AUC = 0.910). cfDNA and its integrity could be applied to differentiate prostate cancer from BPH in patients with serum PSA ≥ 4 ng/ml.
Molecular architecture of classical cytological landmarks: Centromeres and telomeres
DOE Office of Scientific and Technical Information (OSTI.GOV)
Meyne, J.
1994-11-01
Both the human telomere repeat and the pericentromeric repeat sequence (GGAAT)n were isolated based on evolutionary conservation. Their isolation was based on the premise that chromosomal features as structurally and functionally important as telomeres and centromeres should be highly conserved. Both sequences were isolated by high stringency screening of a human repetitive DNA library with rodent repetitive DNA. The pHuR library (plasmid Human Repeat) used for this project was enriched for repetitive DNA by using a modification of the standard DNA library preparation method. Usually DNA for a library is cut with restriction enzymes, packaged, infected, and the library is screened. A problem with this approach is that many tandem repeats don't have any (or many) common restriction sites. Therefore, many of the repeat sequences will not be represented in the library because they are not restricted to a viable length for the vector used. To prepare the pHuR library, human DNA was mechanically sheared to a small size. These relatively short DNA fragments were denatured and then renatured to C₀t 50. Theoretically only repetitive DNA sequences should renature under C₀t 50 conditions. The single-stranded regions were digested using S1 nuclease, leaving the double-stranded, renatured repeat sequences.
Wei, Yunzhou; Chesne, Megan T.; Terns, Rebecca M.; Terns, Michael P.
2015-01-01
CRISPR-Cas systems are RNA-based immune systems that protect prokaryotes from invaders such as phages and plasmids. In adaptation, the initial phase of the immune response, short foreign DNA fragments are captured and integrated into host CRISPR loci to provide heritable defense against encountered foreign nucleic acids. Each CRISPR contains a ∼100–500 bp leader element that typically includes a transcription promoter, followed by an array of captured ∼35 bp sequences (spacers) sandwiched between copies of an identical ∼35 bp direct repeat sequence. New spacers are added immediately downstream of the leader. Here, we have analyzed adaptation to phage infection in Streptococcus thermophilus at the CRISPR1 locus to identify cis-acting elements essential for the process. We show that the leader and a single repeat of the CRISPR locus are sufficient for adaptation in this system. Moreover, we identified a leader sequence element capable of stimulating adaptation at a dormant repeat. We found that sequences within 10 bp of the site of integration, in both the leader and repeat of the CRISPR, are required for the process. Our results indicate that information at the CRISPR leader-repeat junction is critical for adaptation in this Type II-A system and likely other CRISPR-Cas systems. PMID:25589547
Waye, J S; Willard, H F
1986-09-01
The centromeric regions of all human chromosomes are characterized by distinct subsets of a diverse tandemly repeated DNA family, alpha satellite. On human chromosome 17, the predominant form of alpha satellite is a 2.7-kilobase-pair higher-order repeat unit consisting of 16 alphoid monomers. We present the complete nucleotide sequence of the 16-monomer repeat, which is present in 500 to 1,000 copies per chromosome 17, as well as that of a less abundant 15-monomer repeat, also from chromosome 17. These repeat units were approximately 98% identical in sequence, differing by the exclusion of precisely 1 monomer from the 15-monomer repeat. Homologous unequal crossing-over is suggested as a probable mechanism by which the different repeat lengths on chromosome 17 were generated, and the putative site of such a recombination event is identified. The monomer organization of the chromosome 17 higher-order repeat unit is based, in part, on tandemly repeated pentamers. A similar pentameric suborganization has been previously demonstrated for alpha satellite of the human X chromosome. Despite the organizational similarities, substantial sequence divergence distinguishes these subsets. Hybridization experiments indicate that the chromosome 17 and X subsets are more similar to each other than to the subsets found on several other human chromosomes. We suggest that the chromosome 17 and X alpha satellite subsets may be related components of a larger alphoid subfamily which have evolved from a common ancestral repeat into the contemporary chromosome-specific subsets.
Shaw, D R; Richter, H; Giorda, R; Ohmachi, T; Ennis, H L
1989-09-01
A Dictyostelium discoideum repetitive element composed of long repeats of the codon (AAC) is found in developmentally regulated transcripts. The concentration of (AAC) sequences is low in mRNA from dormant spores and growing cells and increases markedly during spore germination and multicellular development. The sequence hybridizes to many different sized Dictyostelium DNA restriction fragments, indicating that it is scattered throughout the genome. Four cDNA clones isolated contain (AAC) sequences in the deduced coding region. Interestingly, the (AAC)-rich sequences are present in all three reading frames in the deduced proteins, i.e., AAC (asparagine), ACA (threonine) and CAA (glutamine). Three of the clones contain only one of these in-frame so that the individual proteins carry either asparagine, threonine, or glutamine clusters, not mixtures. However, one clone is both glutamine- and asparagine-rich. The (AAC) portion of the transcripts is reiterated 300 times in the haploid genome, while the other portions of the cDNAs represent single copy genes, whose sequences show no similarity other than the (AAC) repeats. The repeated sequence is similar to the opa or M sequence found in Drosophila melanogaster Notch and homeobox genes and in fly developmentally regulated transcripts. The transcripts are present on polysomes, suggesting that they are translated. Although the function of these repeats is unknown, long amino acid repeats are a characteristic feature of extracellular proteins of lower eukaryotes.
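The reading-frame observation in this abstract follows directly from the standard genetic code, as the short sketch below illustrates. The function name `frame_residues` and the copy number are assumptions for illustration, and only the three codons named in the abstract are included in the lookup table.

```python
# Standard genetic code entries for the three codons named in the abstract.
CODONS = {"AAC": "Asn", "ACA": "Thr", "CAA": "Gln"}

def frame_residues(repeat_unit="AAC", copies=6):
    """Map each reading frame (0, 1, 2) to the residues encoded by the repeat."""
    seq = repeat_unit * copies
    result = {}
    for frame in range(3):
        # Read complete codons only, starting at the frame offset.
        codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
        result[frame] = sorted({CODONS[c] for c in codons})
    return result

print(frame_residues())
```

Each frame of an (AAC)n tract yields a homopolymeric run of a single amino acid, which is consistent with the asparagine, threonine, or glutamine clusters reported for the individual clones.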
Urvoas, Agathe; Guellouz, Asma; Valerio-Lepiniec, Marie; Graille, Marc; Durand, Dominique; Desravines, Danielle C; van Tilbeurgh, Herman; Desmadril, Michel; Minard, Philippe
2010-11-26
Repeat proteins have a modular organization and a regular architecture that make them attractive models for design and directed evolution experiments. HEAT repeat proteins, although very common, have not been used as a scaffold for artificial proteins, probably because they are made of long and irregular repeats. Here, we present and validate a consensus sequence for artificial HEAT repeat proteins. The sequence was defined from the structure-based sequence analysis of a thermostable HEAT-like repeat protein. Appropriate sequences were identified for the N- and C-caps. A library of genes coding for artificial proteins based on this sequence design, named αRep, was assembled using a new and versatile methodology based on circular amplification. Proteins picked randomly from this library are expressed as soluble proteins. The biophysical properties of proteins with different numbers of repeats and different combinations of side chains in hypervariable positions were characterized. Circular dichroism and differential scanning calorimetry experiments showed that all these proteins are folded cooperatively and are very stable (T(m) >70 °C). Stability of these proteins increases with the number of repeats. Detailed gel filtration and small-angle X-ray scattering studies showed that the purified proteins form either monomers or dimers. The X-ray structure of a stable dimeric variant was solved. The protein is folded with a highly regular topology and the repeat structure is organized, as expected, as pairs of alpha helices. In this protein variant, the dimerization interface results directly from the variable surface enriched in aromatic residues located in the randomized positions of the repeats. The dimer was crystallized both in an apo and in a PEG-bound form, revealing a very well defined binding crevice and some structural flexibility at the interface. 
This fortuitous binding site could later prove to be a useful binding site for other low molecular mass partners. Copyright © 2010 Elsevier Ltd. All rights reserved.
TRedD—A database for tandem repeats over the edit distance
Sokol, Dina; Atagun, Firat
2010-01-01
A ‘tandem repeat’ in DNA is a sequence of two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats are common in the genomes of both eukaryotic and prokaryotic organisms. They are significant markers for human identity testing, disease diagnosis, sequence homology and population studies. In this article, we describe a new database, TRedD, which contains the tandem repeats found in the human genome. The database is publicly available online, and the software for locating the repeats is also freely available. The definition of tandem repeats used by TRedD is a new and innovative definition based upon the concept of ‘evolutive tandem repeats’. In addition, we have developed a tool, called TandemGraph, to graphically depict the repeats occurring in a sequence. This tool can be coupled with any repeat finding software, and it should greatly facilitate analysis of results. Database URL: http://tandem.sci.brooklyn.cuny.edu/ PMID:20624712
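The notion of "approximate copies" under the edit distance can be made concrete with a small sketch. The abstract does not give the formal definition of evolutive tandem repeats, so the check below is an assumed simplification in which each copy must lie within a chosen number of edits of the copy before it. The function names and the threshold are illustrative and are not taken from the TRedD software.

```python
def edit_distance(a, b):
    """Levenshtein distance (insertions, deletions, substitutions) by dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # match or substitute
        prev = cur
    return prev[-1]

def is_approx_tandem(copies, max_edits):
    """True if there are at least two copies and each is within max_edits of its predecessor."""
    return len(copies) >= 2 and all(
        edit_distance(x, y) <= max_edits for x, y in zip(copies, copies[1:])
    )

# Hypothetical copies: each differs from the previous one by a single edit.
print(is_approx_tandem(["ACGT", "AGGT", "AGT"], max_edits=1))
```

Comparing each copy with its neighbour rather than with a fixed consensus lets the pattern drift gradually along the repeat, which is one plausible reading of "evolutive". Locating the copy boundaries in a genome is the harder problem and is not addressed here.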
Ben Rekaya, Mariem; Laroussi, Nadia; Messaoud, Olfa; Jones, Mariem; Jerbi, Manel; Naouali, Chokri; Bouyacoub, Yosra; Chargui, Mariem; Kefi, Rym; Fazaa, Becima; Boubaker, Mohamed Samir; Boussen, Hamouda; Mokni, Mourad; Abdelhak, Sonia; Zghal, Mohamed; Khaled, Aida; Yacoub-Youssef, Houda
2014-01-01
The Xeroderma pigmentosum Variant (XP-V) form is characterized by a late onset of skin symptoms. Our aim was the clinical and genetic investigation of Tunisian XP-V patients in order to develop a simple tool for early diagnosis. We investigated 16 suspected XP patients belonging to ten consanguineous families. Analysis of the POLH gene was performed by linkage analysis, long range PCR, and sequencing. Genetic analysis showed linkage to the POLH gene with a founder haplotype in all affected patients. Long range PCR of exon 9 to exon 11 showed a 3926 bp deletion compared to control individuals. Sequence analysis demonstrated that this deletion occurred between two Alu-Sq2 repetitive sequences in the same orientation, in introns 9 and 10, respectively. We suggest that this mutation POLH NG_009252.1: g.36847_40771del3925 is caused by an equal crossover event that occurred between two homologous chromosomes at meiosis. These results allowed us to develop a simple PCR-based test to screen suspected XP-V patients. In Tunisia, the prevalence of the XP-V group seems to be underestimated and clinical diagnosis is usually later. Cascade screening of this founder mutation by PCR in regions with high frequency of XP provides a rapid and cost-effective tool for early diagnosis of XP-V in Tunisia and North Africa. PMID:24877075
Diversity of Cronobacter spp. isolates from the vegetables in the middle-east coastline of China.
Chen, Wanyi; Yang, Jielin; You, Chunping; Liu, Zhenmin
2016-06-01
Cronobacter spp. have caused life-threatening neonatal infections, mainly resulting from consumption of contaminated powdered infant formula. A total of 102 vegetable samples from retail markets were evaluated for the presence of Cronobacter spp. Thirty-five presumptive Cronobacter isolates were isolated and identified using API 20E and 16S rDNA sequencing analyses. All isolates and type strains were characterized using enterobacterial repetitive intergenic consensus sequence PCR (ERIC-PCR), and cluster analysis of the resulting genetic profiles clearly showed that there were differences among isolates from different vegetables. A polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP) assay based on the amplification of the gyrB gene (1258 bp) was developed to differentiate among Cronobacter species. This new PCR-RFLP assay, which uses the Alu I and Hinf I endonuclease combination, has been confirmed as an accurate and rapid subtyping method to differentiate Cronobacter species. Sequence analysis of the gyrB gene proved suitable for the phylogenetic analysis of the Cronobacter strains; being based on SNPs, it resolves Cronobacter species much better than PCR-RFLP and ERIC-PCR. Our study further confirmed that vegetables are one of the most common habitats or sources of Cronobacter spp. contamination in the middle-east coastline of China.
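The PCR-RFLP step in this abstract can be illustrated with a minimal in silico digest. The recognition sites used are the standard ones for Alu I (AG^CT) and Hinf I (G^ANTC). The toy amplicon below is invented for illustration, since the gyrB sequences are not given in the abstract. The function name `fragment_lengths` is likewise an assumption.

```python
import re

# Recognition site (as a regex) and top-strand cut offset within the site.
ENZYMES = {
    "AluI": ("AGCT", 2),         # AG^CT, blunt cut
    "HinfI": ("GA[ACGT]TC", 1),  # G^ANTC
}

def fragment_lengths(seq, enzyme_names):
    """Top-strand fragment lengths from digesting a linear sequence."""
    seq = seq.upper()
    cuts = set()
    for name in enzyme_names:
        site, offset = ENZYMES[name]
        # The lookahead lets overlapping recognition sites all be found.
        for m in re.finditer("(?=%s)" % site, seq):
            cuts.add(m.start() + offset)
    bounds = [0] + sorted(cuts) + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]

# Hypothetical amplicon with one Alu I site and one Hinf I site.
toy = "A" * 10 + "AGCT" + "C" * 10 + "GATTC" + "T" * 5
print(fragment_lengths(toy, ["AluI", "HinfI"]))
```

Species that differ by a SNP inside a recognition site gain or lose a cut, and therefore give different fragment-length patterns on a gel. That difference is the basis on which an RFLP assay separates them.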
Feature co-localization landscape of the human genome
Ng, Siu-Kin; Hu, Taobo; Long, Xi; Chan, Cheuk-Hin; Tsang, Shui-Ying; Xue, Hong
2016-01-01
Although feature co-localizations could serve as useful guide-posts to genome architecture, a comprehensive and quantitative feature co-localization map of the human genome has been lacking. Herein we show that, in contrast to the conventional bipartite division of genomic sequences into genic and inter-genic regions, pairwise co-localizations of forty-two genomic features in the twenty-two autosomes based on 50-kb to 2,000-kb sequence windows indicate a tripartite zonal architecture comprising Genic zones enriched with gene-related features and Alu-elements; Proximal zones enriched with MIR- and L2-elements, transcription-factor-binding-sites (TFBSs), and conserved-indels (CIDs); and Distal zones enriched with L1-elements. Co-localizations between single-nucleotide-polymorphisms (SNPs) and copy-number-variations (CNVs) reveal a fraction of sequence windows displaying steeply enhanced levels of SNPs, CNVs and recombination rates that point to active adaptive evolution in such pathways as immune response, sensory perceptions, and cognition. The strongest positive co-localization observed between TFBSs and CIDs suggests a regulatory role of CIDs in cooperation with TFBSs. The positive co-localizations of cancer somatic CNVs (CNVT) with all Proximal zone and most Genic zone features, in contrast to the distinctly more restricted co-localizations exhibited by germline CNVs (CNVG), reveal disparate distributions of CNVTs and CNVGs indicative of dissimilarity in their underlying mechanisms. PMID:26854351
Preferential cleavage sites for Sau3A restriction endonuclease in human ribosomal DNA.
Kupriyanova, N S; Kirilenko, P M; Netchvolodov, K K; Ryskov, A P
2000-07-21
Previous studies of cloned ribosomal DNA (rDNA) variants isolated from the cosmid library of human chromosome 13 have revealed some disproportion in the representation of different rDNA regions (N. S. Kupriyanova, K. K. Netchvolodov, P. M. Kirilenko, B. I. Kapanadze, N. K. Yankovsky, and A. P. Ryskov, Mol. Biol. 30, 51-60, 1996). Here we show nonrandom cleavage of human rDNA with Sau3A or its isoschizomer MboI under mild hydrolysis conditions. The hypersensitive cleavage sites were found to be located in the ribosomal intergenic spacer (rIGS), especially in the regions of about 5-5.5 and 11 kb upstream of the rRNA transcription start point. This finding is based on sequence mapping of the rDNA insert ends in randomly selected cosmid clones of human chromosome 13 and on the digestion kinetics of cloned and noncloned human genomic rDNA with Sau3A and MboI. The results show that the methylation status and superhelical state of the rIGS have no effect on cleavage site sensitivity. It is interesting that all primary cleavage sites are adjacent to or within Alu or Psi cdc 27 retroposons of the rIGS, suggesting a possible role of neighboring sequences in nuclease accessibility. The results explain the unequal representation of rDNA sequences in the human genomic DNA library used for this study. Copyright 2000 Academic Press.
Combined pituitary hormone deficiency (CPHD) due to a complete PROP1 deletion.
Abrão, M G; Leite, M V; Carvalho, L R; Billerbeck, A E C; Nishi, M Y; Barbosa, A S; Martin, R M; Arnhold, I J P; Mendonca, B B
2006-09-01
PROP1 mutations are the most common cause of genetic combined pituitary hormone deficiency (CPHD). The aim of this study was to investigate the PROP1 gene in two siblings with CPHD. Pituitary function and imaging assessment and molecular analysis of PROP1. Two siblings, born to consanguineous parents, presented with GH deficiency associated with other pituitary hormone deficiencies (TSH, PRL and gonadotrophins). The male sibling also had an evolving cortisol deficiency. Pituitary size was evaluated by magnetic resonance imaging (MRI). PROP1 gene analysis was performed by polymerase chain reaction (PCR), automatic sequencing and Southern blotting. Amplification of sequence tag sites (STS) and the Q8N6H0 gene flanking PROP1 was performed to define the extent of the PROP1 deletion. MRI revealed a hypoplastic anterior pituitary in the girl at 14 years and pituitary enlargement in the boy at 18 years. The PROP1 gene failed to amplify in both siblings, whereas other genes were amplified. Southern blotting analysis revealed the PROP1 band in the controls and confirmed complete PROP1 deletion in both siblings. The extent of the deletion was 18.4 kb. The region flanking PROP1 contains several Alu core sequences that might have facilitated stem-loop-mediated excision of PROP1. We report here a complete deletion of PROP1 in two siblings with CPHD phenotype.
Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta
2012-11-07
Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. 
They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40 amino acid tandem repeat proteins and also from known cell wall proteins with repeat sequences. Several putative roles in plant physiology can be inferred from the characteristics found.
PMID:23134664
A candidate gene for choanal atresia in alpaca.
Reed, Kent M; Bauer, Miranda M; Mendoza, Kristelle M; Armién, Aníbal G
2010-03-01
Choanal atresia (CA) is a common nasal craniofacial malformation in New World domestic camelids (alpaca and llama). CA results from abnormal development of the nasal passages and is especially debilitating to newborn crias. CA in camelids shares many of the clinical manifestations of a similar condition in humans (CHARGE syndrome). Herein we report on the regulatory gene CHD7 of alpaca, whose homologue in humans is most frequently associated with CHARGE. Sequence of the CHD7 coding region was obtained from a non-affected cria. The complete coding region was 9003 bp, corresponding to a translated amino acid sequence of 3000 aa. Additional genomic sequences corresponding to a significant portion of the CHD7 gene were identified and assembled from the 2x alpaca whole genome sequence, providing confirmatory sequence for much of the CHD7 coding region. The alpaca CHD7 mRNA sequence was 97.9% similar to the human sequence, with the greatest sequence difference being an insertion in exon 38 that results in a polyalanine repeat (A12). Polymorphism in this repeat was tested for association with CA in alpaca by cloning and sequencing the repeat from both affected and non-affected individuals. Variation in length of the poly-A repeat was not associated with CA. Complete sequencing of the CHD7 gene will be necessary to determine whether other mutations in CHD7 are the cause of CA in camelids.
Naveilhan, P; Baudet, C; Jabbour, W; Wion, D
1994-09-01
A model that may explain the limited division potential of certain cells such as human fibroblasts in culture is presented. The central postulate of this theory is that there exists, prior to certain key exons that code for materials needed for cell division, a unique sequence of specific repeating segments of DNA. One copy of such repeating segments is deleted during each cell cycle in cells that are not protected from such deletion through methylation of their cytosine residues. According to this theory, the means through which such repeated sequences are removed, one per cycle, is through the sequential action of enzymes that act much as bacterial restriction enzymes do--namely to produce scissions in both strands of DNA in areas that correspond to the DNA base sequence recognition specificities of such enzymes. After the first scission early in a replicative cycle, that enzyme becomes inhibited, but the cleavage of the first site exposes the closest site in the repetitive element to the action of a second restriction enzyme after which that enzyme also becomes inhibited. Then repair occurs, regenerating the original first site. Through this sequential activation and inhibition of two different restriction enzymes, only one copy of the repeating sequence is deleted during each cell cycle. In effect, the repeating sequence operates as a precise counter of the numbers of cell doubling that have occurred since the cells involved differentiated during development.
Kuipers, A G J; Kamstra, S A; de Jeu, M J; Visser, R G F
2002-01-01
Highly repetitive DNA sequences were isolated from genomic DNA libraries of Alstroemeria psittacina and A. inodora. Among the repetitive sequences that were isolated, tandem repeats as well as dispersed repeats could be discerned. The tandem repeats belonged to a family of interlinked Sau3A subfragments with sizes varying from 68-127 bp, and constituted a larger HinfI repeat of approximately 400 bp. Southern hybridization showed a similar molecular organization of the tandem repeats in each of the Brazilian Alstroemeria species tested. None of the repeats hybridized with DNA from Chilean Alstroemeria species, which indicates that they are specific for the Brazilian species. In-situ localization studies revealed the tandem repeats to be localized in clusters on the chromosomes of A. inodora and A. psittacina: distal hybridization sites were found on chromosome arms 2PS, 6PL, 7PS, 7PL and 8PL, interstitial sites on chromosome arms 2PL, 3PL, 4PL and 5PL. The applicability of the tandem repeats for cytogenetic analysis of interspecific hybrids and their role in heterochromatin organization are discussed.
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.
Fungtammasan, Arkarachai; Ananda, Guruprasad; Hile, Suzanne E; Su, Marcia Shu-Wei; Sun, Chen; Harris, Robert; Medvedev, Paul; Eckert, Kristin; Makova, Kateryna D
2015-05-01
Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using flank-based mapping, a computational pipeline that can detect the full spectrum of STR alleles from short-read data, can adapt to emerging read-mapping algorithms, and can be applied to heterogeneous genetic samples (e.g., tumors, viruses, and genomes of organelles). We used STR-FM to study STR error rates and patterns in publicly available human and in-house generated ultradeep plasmid sequencing data sets. We discovered that STRs sequenced with a PCR-free protocol have up to ninefold fewer errors than those sequenced with a PCR-containing protocol. We constructed an error correction model for genotyping STRs that can distinguish heterozygous alleles containing STRs with consecutive repeat numbers. Applying our model and pipeline to Illumina sequencing data with 100-bp reads, we could confidently genotype several disease-related long trinucleotide STRs. Utilizing this pipeline, for the first time we determined the genome-wide STR germline mutation rate from a deeply sequenced human pedigree. Additionally, we built a tool that recommends minimal sequencing depth for accurate STR genotyping, depending on repeat length and sequencing read length. The required read depth increases with STR length and is lower for a PCR-free protocol. This suite of tools addresses the pressing challenges surrounding STR genotyping, and thus is of wide interest to researchers investigating disease-related STRs and STR evolution. © 2015 Fungtammasan et al.; Published by Cold Spring Harbor Laboratory Press.
USDA-ARS's Scientific Manuscript database
Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...
USDA-ARS's Scientific Manuscript database
Simple sequence repeats (SSR) markers were developed from a small insert genomic library for Bipolaris sorokiniana, a mitosporic fungal pathogen that causes spot blotch and root rot in switchgrass. About 59% of sequenced clones (n=384) harbored various SSR motifs. After eliminating the redundant seq...
Are the TTAGG and TTAGGG telomeric repeats phylogenetically conserved in aculeate Hymenoptera?
NASA Astrophysics Data System (ADS)
Menezes, Rodolpho S. T.; Bardella, Vanessa B.; Cabral-de-Mello, Diogo C.; Lucena, Daercio A. A.; Almeida, Eduardo A. B.
2017-10-01
Although the (TTAGG)n telomeric repeat is presumed to be the ancestral DNA motif of telomeres in insects, it has been repeatedly lost within some insect orders. Notably, parasitoid hymenopterans and the social wasp Metapolybia decorata (Gribodo) lack the (TTAGG)n sequence, but the motif has been detected in other representatives of Hymenoptera, such as different ant species and the honeybee. These findings raise the question of whether the insect telomeric repeat is phylogenetically predominant in Hymenoptera. Thus, we evaluated the occurrence of both the (TTAGG)n sequence and the vertebrate telomere sequence (TTAGGG)n using dot-blotting hybridization in 25 aculeate species of Hymenoptera. Our results revealed the absence of the (TTAGG)n sequence in all tested species, raising the number of hymenopteran families lacking this telomeric sequence to 13 out of the 15 families tested so far. The (TTAGGG)n sequence was not observed in any tested species. Based on our data and compiled information, we suggest that the (TTAGG)n sequence was putatively lost in the ancestor of Apocrita with at least two subsequent independent regains (in Formicidae and Apidae).
Macas, Jiří; Neumann, Pavel; Navrátilová, Alice
2007-01-01
Background Extraordinary size variation of higher plant nuclear genomes is in large part caused by differences in accumulation of repetitive DNA. This makes repetitive DNA of great interest for studying the molecular mechanisms shaping architecture and function of complex plant genomes. However, due to methodological constraints of conventional cloning and sequencing, a global description of repeat composition is available for only a very limited number of higher plants. In order to provide further data required for investigating evolutionary patterns of repeated DNA within and between species, we used a novel approach based on massive parallel sequencing which allowed a comprehensive repeat characterization in our model species, garden pea (Pisum sativum). Results Analysis of 33.3 Mb sequence data resulted in quantification and partial sequence reconstruction of major repeat families occurring in the pea genome with at least thousands of copies. Our results showed that the pea genome is dominated by LTR-retrotransposons, estimated at 140,000 copies/1C. Ty3/gypsy elements are less diverse and accumulated to higher copy numbers than Ty1/copia. This is in part due to a large population of Ogre-like retrotransposons which alone make up over 20% of the genome. In addition to numerous types of mobile elements, we have discovered a set of novel satellite repeats and two additional variants of telomeric sequences. Comparative genome analysis revealed that there are only a few repeat sequences conserved between pea and soybean genomes. On the other hand, all major families of pea mobile elements are well represented in M. truncatula. Conclusion We have demonstrated that even in a species with a relatively large genome like pea, where a single 454-sequencing run provided only 0.77% coverage, the generated sequences were sufficient to reconstruct and analyze major repeat families corresponding to a total of 35–48% of the genome. 
These data provide a starting point for further investigations of legume plant genomes based on their global comparative analysis and for the development of more sophisticated approaches for data mining. PMID:18031571
Molecular basis of length polymorphism in the human zeta-globin gene complex.
Goodbourn, S E; Higgs, D R; Clegg, J B; Weatherall, D J
1983-01-01
The length polymorphism between the human zeta-globin gene and its pseudogene is caused by an allele-specific variation in the copy number of a tandemly repeating 36-base-pair sequence. This sequence is related to a tandemly repeated 14-base-pair sequence in the 5' flanking region of the human insulin gene, which is known to cause length polymorphism, and to a repetitive sequence in intervening sequence (IVS) 1 of the pseudo-zeta-globin gene. Evidence is presented that the latter is also of variable length, probably because of differences in the copy number of the tandem repeat. The homology between the three length polymorphisms may be an indication of the presence of a more widespread group of related sequences in the human genome, which might be useful for generalized linkage studies. PMID:6308667
Hemalatha, G. R.; Rao, D. Satyanarayana; Guruprasad, L.
2007-01-01
We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure. PMID:17538688
Complete mitochondrial genome of the larch hawk moth, Sphinx morio (Lepidoptera: Sphingidae).
Kim, Min Jee; Choi, Sei-Woong; Kim, Iksoo
2013-12-01
The larch hawk moth, Sphinx morio, belongs to the lepidopteran family Sphingidae, which has long been studied as a family of model insects in diverse fields. In this study, we describe the complete mitochondrial genome (mitogenome) sequence of the species in terms of general genomic features and characteristic short repetitive sequences found in the A + T-rich region. The 15,299-bp-long genome consisted of a typical set of genes (13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes) and one major non-coding A + T-rich region, with the typical arrangement found in Lepidoptera. The 316-bp-long A + T-rich region located between srRNA and tRNA(Met) harbored the conserved sequence blocks that are typically found in lepidopteran insects. Additionally, the A + T-rich region of S. morio contained three characteristic repeat sequences that are rarely found in Lepidoptera: two identical 12-bp repeats, three identical 5-bp-long tandem repeats, and six nearly identical 5-6-bp-long repeat sequences.
Kumar, Pankaj; Chaitanya, Pasumarthy S; Nagarajaram, Hampapathalu A
2011-01-01
PSSRdb (Polymorphic Simple Sequence Repeats database) (http://www.cdfd.org.in/PSSRdb/) is a relational database of polymorphic simple sequence repeats (PSSRs) extracted from 85 different species of prokaryotes. Simple sequence repeats (SSRs) are tandem repeats of nucleotide motifs 1-6 bp in size and are highly polymorphic. SSR mutations in and around coding regions affect transcription and translation of genes. Such changes underpin the phase variation and antigenic variation seen in some bacteria. Although SSR-mediated phase variation and antigenic variation have been well studied in some bacteria, many other prokaryotic species have yet to be investigated for SSR-mediated adaptive and other evolutionary advantages. As part of our ongoing studies on SSR polymorphism in prokaryotes, we compared the genome sequences of various strains and isolates available for 85 different species of prokaryotes, extracted a number of SSRs showing length variations, and created a relational database called PSSRdb. This database gives useful information such as the location of PSSRs in genomes, length variation across genomes, the regions harboring PSSRs, etc. The information provided in this database is very useful for further research and analysis of SSRs in prokaryotes.
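The SSR definition used in the abstract above (tandem repeats of 1-6 bp motifs) can be illustrated with a minimal sketch. This is not the PSSRdb extraction pipeline, which the abstract does not describe; the function name `find_ssrs` and the `min_repeats` threshold are assumptions chosen only for illustration.

```python
import re

def find_ssrs(seq, min_repeats=3, max_motif=6):
    """Return sorted (start, motif, copies) tuples for perfect tandem repeats.

    Hypothetical helper: scans for motifs of 1..max_motif bp repeated at
    least min_repeats times in a row. Imperfect repeats are not detected.
    """
    seq = seq.upper()
    hits = []
    for k in range(1, max_motif + 1):
        # A k-bp motif followed by at least (min_repeats - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves built from a shorter unit
            # (e.g. "CACA"), since the shorter motif already reports them.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)

print(find_ssrs("GGCACACACATTT"))
```

Comparing the copy count reported for the same locus across strains would then flag a length-polymorphic SSR, which is the kind of variation the database catalogues.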
Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.
Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V
1985-01-01
The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this. PMID:3016521
Spectroscopic insights into quadruplexes of five-repeat telomere DNA sequences upon G-block damage.
Dvořáková, Zuzana; Vorlíčková, Michaela; Renčiuk, Daniel
2017-11-01
DNA lesions resulting from oxidative damage were shown to destabilize the human telomere four-repeat quadruplex and to alter its structure. Long telomere DNA, as a repetitive sequence, offers, however, other mechanisms of dealing with the lesion: extrusion of the damaged repeat into a loop or shifting the quadruplex position by one repeat. Using circular dichroism and UV absorption spectroscopy and polyacrylamide electrophoresis, we studied the consequences of lesions at different positions of model five-repeat human telomere DNA sequences on the structure and stability of their quadruplexes in sodium and in potassium. The repeats affected by a lesion are preferentially positioned as terminal overhangs of a core quadruplex structurally similar to the four-repeat one. Forcing the lesion into the inner repeats leads to the presence of a variety of more parallel folds in potassium. In sodium the designed models form a mixture of two dominant antiparallel quadruplexes whose population varies with the position of the affected repeat. The shapes of quadruplex CD spectra, namely the height of dominant peaks, significantly correlate with melting temperatures. A lesion in one guanine tract of a human telomere DNA sequence longer than four repeats may cause re-positioning of its quadruplex arrangement associated with a shift of the structure to less common quadruplex conformations. The type of the quadruplex depends on the loop position and external conditions. The telomere DNA quadruplexes are quite resistant to the effect of point mutations due to the repetitive nature of telomere DNA, although their structure and, consequently, function might be altered. Copyright © 2017. Published by Elsevier B.V.
Microsatellite analysis in the genome of Acanthaceae: An in silico approach
Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar
2015-01-01
Background: Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. Objective: The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. Materials and Methods: The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Results: Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. Conclusion: The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future. PMID:25709226
Van Kreijl, C F; Bos, J L
1977-01-01
The repeating nucleotide sequence of 68 base pairs in the mtDNA from an ethidium-induced cytoplasmic petite mutant of yeast has been determined. For sequence analysis, specifically primed and terminated RNA copies, obtained by in vitro transcription of the separated strands, were used. The sequence consists of 66 consecutive AT base pairs flanked by two GC pairs and comprises nearly all of the mutant mitochondrial genome. The sequence, moreover, also represents the first part of wild-type mtDNA sequenced so far. PMID:198740
A Lossy Compression Technique Enabling Duplication-Aware Sequence Alignment
Freschi, Valerio; Bogliolo, Alessandro
2012-01-01
In spite of the recognized importance of tandem duplications in genome evolution, commonly adopted sequence comparison algorithms do not take into account complex mutation events involving more than one residue at the time, since they are not compliant with the underlying assumption of statistical independence of adjacent residues. As a consequence, the presence of tandem repeats in sequences under comparison may impair the biological significance of the resulting alignment. Although solutions have been proposed, repeat-aware sequence alignment is still considered to be an open problem and new efficient and effective methods have been advocated. The present paper describes an alternative lossy compression scheme for genomic sequences which iteratively collapses repeats of increasing length. The resulting approximate representations do not contain tandem duplications, while retaining enough information for making their comparison even more significant than the edit distance between the original sequences. This allows us to exploit traditional alignment algorithms directly on the compressed sequences. Results confirm the validity of the proposed approach for the problem of duplication-aware sequence alignment. PMID:22518086
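The idea of iteratively collapsing repeats of increasing length can be sketched as follows. This is a naive illustration under stated assumptions, not the authors' compression scheme: it discards copy numbers entirely, whereas the abstract says the published method retains enough information to keep comparisons meaningful. The function name `collapse_tandem_repeats` and the `max_unit` parameter are hypothetical.

```python
import re

def collapse_tandem_repeats(seq, max_unit=4):
    """Collapse perfect tandem repeats, shortest unit first.

    Hypothetical sketch: for each unit length k = 1..max_unit, every run
    of two or more adjacent identical k-mers is replaced by one copy.
    """
    for k in range(1, max_unit + 1):
        pattern = re.compile(r"(.{%d})\1+" % k)
        # Collapsing can bring new identical units next to each other,
        # so repeat the substitution until the sequence stops changing.
        while True:
            collapsed = pattern.sub(r"\1", seq)
            if collapsed == seq:
                break
            seq = collapsed
    return seq

print(collapse_tandem_repeats("AAACGCGCGT"))
```

Two sequences that differ only in tandem copy number collapse to the same string, so a conventional aligner run on the collapsed forms no longer penalizes the duplication as a series of independent indels.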
A Fault-tolerant RISC Microprocessor for Spacecraft Applications
NASA Technical Reports Server (NTRS)
Timoc, Constantin; Benz, Harry
1990-01-01
Viewgraphs on a fault-tolerant RISC microprocessor for spacecraft applications are presented. Topics covered include: reduced instruction set computer; fault tolerant registers; fault tolerant ALU; and double rail CMOS logic.
Blaiotta, Giuseppe; Fusco, Vincenzina; Ercolini, Danilo; Aponte, Maria; Pepe, Olimpia; Villani, Francesco
2008-01-01
A phylogenetic tree showing diversities among 116 partial (499-bp) Lactobacillus hsp60 (groEL, encoding a 60-kDa heat shock protein) nucleotide sequences was obtained and compared to those previously described for 16S rRNA and tuf gene sequences. The topology of the tree produced in this study showed a Lactobacillus species distribution similar, but not identical, to those previously reported. However, according to the most recent systematic studies, a clear differentiation of 43 single-species clusters was detected/identified among the sequences analyzed. The slightly higher variability of the hsp60 nucleotide sequences than of the 16S rRNA sequences offers better opportunities to design or develop molecular assays allowing identification and differentiation of either distant or very closely related Lactobacillus species. Therefore, our results suggest that hsp60 can be considered an excellent molecular marker for inferring the taxonomy and phylogeny of members of the genus Lactobacillus and that the chosen primers can be used in a simple PCR procedure allowing the direct sequencing of the hsp60 fragments. Moreover, in this study we performed a computer-aided restriction endonuclease analysis of all 499-bp hsp60 partial sequences and we showed that the PCR-restriction fragment length polymorphism (RFLP) patterns obtainable by using both endonucleases AluI and TacI (in separate reactions) can allow identification and differentiation of all 43 Lactobacillus species considered, with the exception of the pair L. plantarum/L. pentosus. However, the latter species can be differentiated by further analysis with Sau3AI or MseI. The hsp60 PCR-RFLP approach was efficiently applied to identify and to differentiate a total of 110 wild Lactobacillus strains (including closely related species, such as L. casei and L. rhamnosus or L. plantarum and L. pentosus) isolated from cheese and dry-fermented sausages. PMID:17993558
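A computer-aided restriction analysis of the kind mentioned above can be sketched in a few lines. This is an illustrative sketch only, not the software used in the study: the `SITES` table and the `digest` function are hypothetical names, and only AluI (AG^CT) and Sau3AI (^GATC) are included because their recognition sites are well established. The example sequence is invented and is not an hsp60 fragment.

```python
# Recognition site and cut offset (from the start of the site, top strand).
SITES = {
    "AluI": ("AGCT", 2),    # blunt cut between G and C
    "Sau3AI": ("GATC", 0),  # cut immediately 5' of the G
}

def digest(seq, enzyme):
    """Return fragment lengths (5' to 3') after complete digestion of a
    linear sequence. Both sites are palindromic, so scanning one strand
    is sufficient."""
    site, offset = SITES[enzyme]
    seq = seq.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    # Drop zero-length fragments that arise when a cut falls at an end.
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]

example = "TTAGCTAAAGATCCC"  # made-up 15-bp sequence
print(digest(example, "AluI"), digest(example, "Sau3AI"))
```

Comparing the predicted fragment-length lists between species is what makes a PCR-RFLP pattern diagnostic: two species are distinguishable with a given enzyme when their lists differ by more than the gel can resolve.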
Yakubu, Abdulmojeed; Salako, Adebowale E; De Donato, Marcos; Peters, Sunday O; Takeet, Michael I; Wheto, Mathew; Okpeku, Moses; Imumorin, Ikhide G
2017-02-01
Host defense in vertebrates depends on many secreted regulatory proteins such as major histocompatibility complex (MHC) class II, which provide important regulatory and effector functions of T cells. Gene polymorphism in the second exon of the Capra-DRB gene in three major Nigerian goat breeds [West African Dwarf (WAD), Red Sokoto (RS), and Sahel (SH)] was analyzed by restriction fragment length polymorphism (RFLP). Four restriction enzymes, BsaHI, AluI, HaeIII, and SacII, were utilized. The association between the polymorphic sites and some heat tolerance traits was also investigated in a total of 70 WAD, 90 RS, and 50 SH goats. Fourteen different types of alleles identified in the Nigerian goats, four of which were found in the peptide coding region (A57G, Q89R, G104D, and T112I), indicate a high degree of polymorphism at the DRB locus in this species. An obvious excess (P < 0.01) of non-synonymous over synonymous substitutions (dN/dS) at this locus is a reflection of adaptive evolution and positive selection. The phylogenetic trees revealed largely species-wise clustering in the DRB gene. BsaHI, AluI, HaeIII, and SacII genotype frequencies were in Hardy-Weinberg equilibrium (P > 0.05), except AluI in RS goats and HaeIII in WAD goats (P < 0.05). The expected heterozygosity (H), which is a measure of gene diversity in the goat populations, ranged from 0.16 to 0.50. Genotypes AA (BsaHI), GG, GC and CC (AluI) and GG, GA, AA (HaeIII) appeared better in terms of heat tolerance. The ability of SH and RS goats to tolerate the hot and humid tropical environment of Nigeria seemed better than that of the WAD goats. Sex had a significant effect (P < 0.05) mainly on pulse rate and heat stress index, while there were varying interaction effects on heat tolerance. Variation at the DRB locus may prove to be important in possible selection and breeding for genetic resistance to heat stress in the tropics.
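The two population-genetic quantities reported above, expected heterozygosity and agreement with Hardy-Weinberg equilibrium, follow standard textbook formulas and can be sketched as below. The function names and the genotype counts in the example are assumptions for illustration; they are not data from the study.

```python
def expected_heterozygosity(allele_freqs):
    """H = 1 - sum(p_i^2); for two alleles the maximum is 0.5 at p = q."""
    return 1.0 - sum(p * p for p in allele_freqs)

def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square statistic (1 degree of freedom) comparing observed
    genotype counts at a biallelic locus with Hardy-Weinberg expectations."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)

# Invented counts: 30 AA, 40 AB, 30 BB.
print(expected_heterozygosity([0.5, 0.5]), hwe_chi_square(30, 40, 30))
```

A statistic above 3.84 corresponds to P < 0.05 at one degree of freedom, which is the threshold behind statements that a genotype distribution departs from equilibrium. The upper bound of 0.5 for a biallelic site is consistent with the reported range of 0.16 to 0.50.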
Tandemly repeated sequences in mtDNA control region of whitefish, Coregonus lavaretus.
Brzuzan, P
2000-06-01
Length variation of the mitochondrial DNA control region was observed with PCR amplification of a sample of 138 whitefish (Coregonus lavaretus). Nucleotide sequences of representative PCR products showed that the variation was due to the presence of an approximately 100-bp motif tandemly repeated two, three, or five times in the region between the conserved sequence block-3 (CSB-3) and the gene for phenylalanine tRNA. This is the first report on the tandem array composed of long repeat units in mitochondrial DNA of salmonids.
Heideman, Simone G; van Ede, Freek; Nobre, Anna C
2018-05-24
In daily life, temporal expectations may derive from incidental learning of recurring patterns of intervals. We investigated the incidental acquisition and utilisation of combined temporal-ordinal (spatial/effector) structure in complex visual-motor sequences using a modified version of a serial reaction time (SRT) task. In this task, not only the series of targets/responses, but also the series of intervals between subsequent targets was repeated across multiple presentations of the same sequence. Each participant completed three sessions. In the first session, only the repeating sequence was presented. During the second and third session, occasional probe blocks were presented, where a new (unlearned) spatial-temporal sequence was introduced. We first confirm that participants not only got faster over time, but that they were slower and less accurate during probe blocks, indicating that they incidentally learned the sequence structure. Having established a robust behavioural benefit induced by the repeating spatial-temporal sequence, we next addressed our central hypothesis that implicit temporal orienting (evoked by the learned temporal structure) would have the largest influence on performance for targets following short (as opposed to longer) intervals between temporally structured sequence elements, paralleling classical observations in tasks using explicit temporal cues. We found that indeed, reaction time differences between new and repeated sequences were largest for the short interval, compared to the medium and long intervals, and that this was the case, even when comparing late blocks (where the repeated sequence had been incidentally learned), to early blocks (where this sequence was still unfamiliar). 
We conclude that incidentally acquired temporal expectations that follow a sequential structure can have a robust facilitatory influence on visually-guided behavioural responses and that, like more explicit forms of temporal orienting, this effect is most pronounced for sequence elements that are expected at short inter-element intervals. Copyright © 2017 The Author(s). Published by Elsevier Ltd. All rights reserved.
Evolutionary conservation of sequence and secondary structures in CRISPR repeats
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kunin, Victor; Sorek, Rotem; Hugenholtz, Philip
Clustered Regularly Interspaced Palindromic Repeats (CRISPRs) are a novel class of direct repeats, separated by unique spacer sequences of similar length, that are present in approximately 40% of bacterial and all archaeal genomes analyzed to date. More than 40 gene families, called CRISPR-associated sequences (CAS), appear in conjunction with these repeats and are thought to be involved in the propagation and functioning of CRISPRs. It has been proposed that the CRISPR/CAS system samples, maintains a record of, and inactivates invasive DNA that the cell has encountered, and therefore constitutes a prokaryotic analog of an immune system. Here we analyze CRISPR repeats identified in 195 microbial genomes and show that they can be organized into multiple clusters based on sequence similarity. All individual repeats in any given cluster were inferred to form characteristic RNA secondary structure, ranging from non-existent to pronounced. Stable secondary structures included G:U base pairs and exhibited multiple compensatory base changes in the stem region, indicating evolutionary conservation and functional importance. We also show that the repeat-based classification corresponds to, and expands upon, a previously reported CAS gene-based classification including specific relationships between CRISPR and CAS subtypes.
Ba Abdullah, Mohammed M; Palermo, Richard D; Palser, Anne L; Grayson, Nicholas E; Kellam, Paul; Correia, Samantha; Szymula, Agnieszka; White, Robert E
2017-12-01
Epstein-Barr virus (EBV) is a ubiquitous pathogen of humans that can cause several types of lymphoma and carcinoma. Like other herpesviruses, EBV has diversified through both coevolution with its host and genetic exchange between virus strains. Sequence analysis of the EBV genome is unusually challenging because of the large number and lengths of repeat regions within the virus. Here we describe the sequence assembly and analysis of the large internal repeat 1 of EBV (IR1; also known as the BamW repeats) for more than 70 strains. The diversity of the latency protein EBV nuclear antigen leader protein (EBNA-LP) resides predominantly within the exons downstream of IR1. The integrity of the putative BWRF1 open reading frame (ORF) is retained in over 80% of strains, and deletions truncating IR1 always spare BWRF1. Conserved regions include the IR1 latency promoter (Wp) and one zone upstream of and two within BWRF1. IR1 is heterogeneous in 70% of strains, and this heterogeneity arises from sequence exchange between strains as well as from spontaneous mutation, with interstrain recombination being more common in tumor-derived viruses. This genetic exchange often incorporates regions of <1 kb, and allelic gene conversion changes the frequency of small regions within the repeat but not close to the flanks. These observations suggest that IR1 (and, by extension, EBV) diversifies through both recombination and breakpoint repair, while concerted evolution of IR1 is driven by gene conversion of small regions. Finally, the prototype EBV strain B95-8 contains four nonconsensus variants within a single IR1 repeat unit, including a stop codon in the EBNA-LP gene. Repairing IR1 improves EBNA-LP levels and the quality of transformation by the B95-8 bacterial artificial chromosome (BAC). IMPORTANCE Epstein-Barr virus (EBV) infects the majority of the world population but causes illness in only a small minority of people. 
Nevertheless, over 1% of cancers worldwide are attributable to EBV. Recent sequencing projects investigating virus diversity to see if different strains have different disease impacts have excluded regions of repeating sequence, as they are more technically challenging. Here we analyze the sequence of the largest repeat in EBV (IR1). We first characterized the variations in protein sequences encoded across IR1. In studying variations within the repeat of each strain, we identified a mutation in the main laboratory strain of EBV that impairs virus function, and we suggest that tumor-associated viruses may be more likely to contain DNA mixed from two strains. The patterns of this mixing suggest that sequences can spread between strains (and also within the repeat) by copying sequence from another strain (or repeat unit) to repair DNA damage. Copyright © 2017 Ba abdullah et al.
Pilotte, Nils; Papaiakovou, Marina; Grant, Jessica R; Bierwert, Lou Ann; Llewellyn, Stacey; McCarthy, James S; Williams, Steven A
2016-03-01
The soil transmitted helminths are a group of parasitic worms responsible for extensive morbidity in many of the world's most economically depressed locations. With growing emphasis on disease mapping and eradication, the availability of accurate and cost-effective diagnostic measures is of paramount importance to global control and elimination efforts. While real-time PCR-based molecular detection assays have shown great promise, to date, these assays have utilized sub-optimal targets. By performing next-generation sequencing-based repeat analyses, we have identified high copy-number, non-coding DNA sequences from a series of soil transmitted pathogens. We have used these repetitive DNA elements as targets in the development of novel, multi-parallel, PCR-based diagnostic assays. Utilizing next-generation sequencing and the Galaxy-based RepeatExplorer web server, we performed repeat DNA analysis on five species of soil transmitted helminths (Necator americanus, Ancylostoma duodenale, Trichuris trichiura, Ascaris lumbricoides, and Strongyloides stercoralis). Employing high copy-number, non-coding repeat DNA sequences as targets, novel real-time PCR assays were designed, and assays were tested against established molecular detection methods. Each assay provided consistent detection of genomic DNA at quantities of 2 fg or less, demonstrated species-specificity, and showed an improved limit of detection over the existing, proven PCR-based assay. The utilization of next-generation sequencing-based repeat DNA analysis methodologies for the identification of molecular diagnostic targets has the ability to improve assay species-specificity and limits of detection. By exploiting such high copy-number repeat sequences, the assays described here will facilitate soil transmitted helminth diagnostic efforts. We recommend similar analyses when designing PCR-based diagnostic tests for the detection of other eukaryotic pathogens.
Evolution Analysis of Simple Sequence Repeats in Plant Genome.
Qin, Zhen; Wang, Yanping; Wang, Qingmei; Li, Aixian; Hou, Fuyun; Zhang, Liming
2015-01-01
Simple sequence repeats (SSRs) are widespread in genome sequences and play many important roles in plants. To reveal the evolution of plant genomes, we investigated the evolutionary regularities of SSRs during the evolution of plant species and the plant kingdom by analyzing twelve sequenced plant genomes. First, in the twelve studied plant genomes, the main SSRs were those with repeat units of 1-3 nucleotides. Second, in mononucleotide SSRs, the A/T percentage gradually increased along with the evolution of plants (except for P. patens). With increasing SSR repeat number, the percentage of A/T in C. reinhardtii showed no significant change, while the percentage of A/T in terrestrial plant species gradually declined. Third, in dinucleotide SSRs, the percentage of AT/TA increased along with the evolution of the plant kingdom, and the repeat number increased in terrestrial plant species. This trend was more obvious in dicotyledons than in monocotyledons. The percentage of CG/GC showed the opposite pattern to that of AT/TA. Fourth, in trinucleotide SSRs, the percentages of combinations including two or three A/T rose along with the evolution of the plant kingdom; meanwhile, as SSR repeat number increased, different species favored different combinations as dominant SSRs. SSRs in C. reinhardtii, P. patens, Z. mays and A. thaliana showed specific patterns related to evolutionary position or to specific changes in genome sequences. The results showed that SSRs not only follow a general pattern in the evolution of the plant kingdom, but are also associated with the evolution of the specific genome sequence. The study of the evolutionary regularities of SSRs provides new insights for the analysis of plant genome evolution.
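As an illustration of what counting mono-, di- and trinucleotide SSRs involves, the sketch below scans a DNA string with a back-referencing regular expression. The function name, the minimum of five repeat units, and the example sequence are assumptions chosen for the example; the abstract does not state which tool or thresholds the authors used.

```python
import re

# Illustrative sketch only: thresholds and names are assumptions.


def find_ssrs(seq, max_unit=3, min_repeats=5):
    """Return (start, unit, repeat_count) for perfect SSRs whose repeat
    unit is 1..max_unit nucleotides long."""
    seq = seq.upper()
    hits = []
    for unit_len in range(1, max_unit + 1):
        # ([ACGT]{k}) captures one unit; \1{n,} requires n further copies.
        pattern = re.compile(
            r"([ACGT]{%d})\1{%d,}" % (unit_len, min_repeats - 1)
        )
        for match in pattern.finditer(seq):
            unit = match.group(1)
            # Skip units that are themselves a shorter motif repeated,
            # e.g. "AA" or "ATAT", so each tract is reported once.
            if any(
                unit == unit[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), unit, len(match.group(0)) // unit_len))
    return hits


if __name__ == "__main__":
    # Made-up sequence containing one tract of each unit length.
    example = "GG" + "A" * 6 + "GG" + "AT" * 5 + "TT" + "CAG" * 5 + "TT"
    for start, unit, count in find_ssrs(example):
        print(start, unit, count)
```

Tallying the `unit` values returned for a whole genome would give the kind of A/T versus C/G composition percentages discussed in the abstract.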
Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N
2003-09-01
Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.
LINEs, SINEs and other retroelements: do birds of a feather flock together?
Roy-Engel, Astrid M.
2012-01-01
Mobile elements account for almost half of the mass of the human genome. Only the retroelements from the non-LTR (long terminal repeat) retrotransposon family, which include the LINE-1 (L1) and its non-autonomous partners, are currently active and contributing to new insertions. Although these elements seem to share the same basic amplification mechanism, the activity and success of the different types of retroelements vary. For example, Alu-induced mutagenesis is responsible for the majority of the documented instances of human disease induced by insertion of retroelements. Using copy number in mammals as an indicator, some SINEs have been vastly more successful than other retroelements, such as the retropseudogenes and even L1, likely due to differences in post-insertion selection and ability to overcome cellular controls. SINE and LINE integration can be differentially influenced by cellular factors, indicating some differences in their amplification mechanisms. We focus on the known aspects of this group of retroelements and highlight their similarities and differences that may significantly influence their biological impact. PMID:22201808
Regulation of DNA methylation patterns by CK2-mediated phosphorylation of Dnmt3a.
Deplus, Rachel; Blanchon, Loïc; Rajavelu, Arumugam; Boukaba, Abdelhalim; Defrance, Matthieu; Luciani, Judith; Rothé, Françoise; Dedeurwaerder, Sarah; Denis, Hélène; Brinkman, Arie B; Simmer, Femke; Müller, Fabian; Bertin, Benjamin; Berdasco, Maria; Putmans, Pascale; Calonne, Emilie; Litchfield, David W; de Launoit, Yvan; Jurkowski, Tomasz P; Stunnenberg, Hendrik G; Bock, Christoph; Sotiriou, Christos; Fraga, Mario F; Esteller, Manel; Jeltsch, Albert; Fuks, François
2014-08-07
DNA methylation is a central epigenetic modification that is established by de novo DNA methyltransferases. The mechanisms underlying the generation of genomic methylation patterns are still poorly understood. Using mass spectrometry and a phosphospecific Dnmt3a antibody, we demonstrate that CK2 phosphorylates endogenous Dnmt3a at two key residues located near its PWWP domain, thereby downregulating the ability of Dnmt3a to methylate DNA. Genome-wide DNA methylation analysis shows that CK2 primarily modulates CpG methylation of several repeats, most notably of Alu SINEs. This modulation can be directly attributed to CK2-mediated phosphorylation of Dnmt3a. We also find that CK2-mediated phosphorylation is required for localization of Dnmt3a to heterochromatin. By revealing phosphorylation as a mode of regulation of de novo DNA methyltransferase function and by uncovering a mechanism for the regulation of methylation at repetitive elements, our results shed light on the origin of DNA methylation patterns. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Kuroda, Tsuyoshi; Tomimatsu, Erika; Grondin, Simon; Miyazaki, Makoto
2016-11-01
We investigated how perceived duration of empty time intervals would be modulated by the length of sounds marking those intervals. Three sounds were successively presented in Experiment 1. Each sound was short (S) or long (L), and the temporal position of the middle sound's onset was varied. The lengthening of each sound resulted in delayed perception of the onset; thus, the middle sound's onset had to be presented earlier in the SLS than in the LSL sequence so that participants perceived the three sounds as presented at equal interonset intervals. In Experiment 2, a short sound and a long sound were alternated repeatedly, and the relative duration of the SL interval to the LS interval was varied. This repeated sequence was perceived as consisting of equal interonset intervals when the onsets of all sounds were aligned at physically equal intervals. If the same onset delay as in the preceding experiment had occurred, participants should have perceived equality between the interonset intervals in the repeated sequence when the SL interval was physically shortened relative to the LS interval. The effects of sound length seemed to be canceled out when the presentation of intervals was repeated. Finally, the perceived duration of the interonset intervals in the repeated sequence was not influenced by whether the participant's native language was French or Japanese, or by how the repeated sequence was perceptually segmented into rhythmic groups.
Genetic and DNA sequence analysis of the kanamycin resistance transposon Tn903.
Grindley, N D; Joyce, C M
1980-01-01
The kanamycin resistance transposon Tn903 consists of a unique region of about 1000 base pairs bounded by a pair of 1050-base-pair inverted repeat sequences. Each repeat contains two Pvu II endonuclease cleavage sites separated by 520 base pairs. We have constructed derivatives of Tn903 in which this 520-base-pair fragment is deleted from one or both repeats. Those derivatives that lack both 520-base-pair fragments cannot transpose, whereas those that lack just one remain transposition proficient. One such transposable derivative, Tn903 delta 1, has been selected for further study. We have determined the sequence of the intact inverted repeat. The 18 base pairs at each end are identical and inverted relative to one another, a structure characteristic of insertion sequences. Additional experiments indicate that a single inverted repeat from Tn903 can, in fact, transpose; we propose that this element be called IS903. To correlate the DNA sequence with genetic activities, we have created mutations by inserting a 10-base-pair DNA fragment at several sites within the intact repeat of Tn903 delta 1, and we have examined the effect of such insertions on transposability. The results suggest that IS903 encodes a 307-amino-acid polypeptide (a "transposase") that is absolutely required for transposition of IS903 or Tn903. PMID:6261245
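The statement that the terminal 18 base pairs are "identical and inverted relative to one another" means that one end of the element, read on one strand, equals the reverse complement of the other end. A minimal sketch of that check follows; the function names and the sequence are made up for illustration and are not the IS903 sequence.

```python
# Illustrative sketch only: the test sequence is invented, not IS903.

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def terminal_inverted_repeat_length(seq):
    """Count how many bases at the 5' end match, position by position,
    the reverse complement of the 3' end (perfect matches only)."""
    seq = seq.upper()
    rc = reverse_complement(seq)
    n = 0
    limit = len(seq) // 2  # the two arms cannot overlap
    while n < limit and seq[n] == rc[n]:
        n += 1
    return n


if __name__ == "__main__":
    left_end = "GGCTTTGTTGAATAAATC"   # hypothetical 18-bp terminus
    element = left_end + "A" * 20 + reverse_complement(left_end)
    print(terminal_inverted_repeat_length(element))
```

Real terminal repeats are often imperfect, so a practical version would tolerate a small number of mismatches; this sketch stops at the first one.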
Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.
Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru
2015-01-01
The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.
Ni, Xiangyang; Westpheling, Janet
1997-01-01
The chi63 promoter directs glucose-sensitive, chitin-dependent transcription of a gene involved in the utilization of chitin as carbon source. Analysis of 5′ and 3′ deletions of the promoter region revealed that a 350-bp segment is sufficient for wild-type levels of expression and regulation. The analysis of single base changes throughout the promoter region, introduced by random and site-directed mutagenesis, identified several sequences to be important for activity and regulation. Single base changes at −10, −12, −32, −33, −35, and −37 upstream of the transcription start site resulted in loss of activity from the promoter, suggesting that bases in these positions are important for RNA polymerase interaction. The sequences centered around −10 (TATTCT) and −35 (TTGACC) in this promoter are, in fact, prototypical of eubacterial promoters. Overlapping the RNA polymerase binding site is a perfect 12-bp direct repeat sequence. Some base changes within this direct repeat resulted in constitutive expression, suggesting that this sequence is an operator for negative regulation. Other base changes resulted in loss of glucose repression while retaining the requirement for chitin induction, suggesting that this sequence is also involved in glucose repression. The fact that cis-acting mutations resulted in glucose resistance but not inducer independence rules out the possibility that glucose repression acts exclusively by inducer exclusion. The fact that mutations that affect glucose repression and chitin induction fall within the same direct repeat sequence module suggests that the direct repeat sequence facilitates both chitin induction and glucose repression. PMID:9371809
Zeng, Lu; Kortschak, R. Daniel; Raison, Joy M.; Bertozzi, Terry; Adelson, David L.
2018-01-01
Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package. PMID:29538441
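The single linkage clustering step described above can be sketched with a union-find structure: any two sequences joined by a pairwise alignment end up in the same family, directly or through a chain of intermediate hits. The function name and the interval identifiers below are hypothetical; this is not the CARP or krishna code, only an illustration of the clustering idea.

```python
# Illustrative sketch only: not the CARP implementation.


def single_linkage_families(aligned_pairs):
    """Group sequence identifiers into families, where each input pair
    (a, b) means that a and b produced a pairwise alignment."""
    parent = {}

    def find(x):
        # Locate the representative of x, halving the path as we go.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in aligned_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two families

    families = {}
    for member in list(parent):
        families.setdefault(find(member), set()).add(member)
    # Sort for a deterministic, readable result.
    return sorted((sorted(f) for f in families.values()), key=lambda f: f[0])


if __name__ == "__main__":
    # Hypothetical genomic intervals that aligned to one another.
    hits = [
        ("chr1:100-400", "chr2:50-350"),
        ("chr2:50-350", "chr5:900-1200"),
        ("chr3:10-90", "chr4:10-90"),
    ]
    for family in single_linkage_families(hits):
        print(family)
```

Keeping the original interval identifiers as cluster members, rather than only a consensus, mirrors the first advantage listed in the abstract: the genomic intervals remain available for downstream analysis.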
Oshima, Masao; Kikuchi, Rie; Imamura, Jun; Handa, Hirokazu
2010-01-01
CMS (cytoplasmic male sterile) rapeseed is produced by asymmetrical somatic cell fusion between the Brassica napus cv. Westar and the Raphanus sativus Kosena CMS line (Kosena radish). The CMS rapeseed contains a CMS gene, orf125, which is derived from Kosena radish. Our sequence analyses revealed that the orf125 region in CMS rapeseed originated from recombination between the orf125/orfB region and the nad1C/ccmFN1 region by way of a 63 bp repeat. A precise sequence comparison among the related sequences in CMS rapeseed, Kosena radish and normal rapeseed showed that the orf125 region in CMS rapeseed consisted of the Kosena orf125/orfB region and the rapeseed nad1C/ccmFN1 region, even though Kosena radish had both the orf125/orfB region and the nad1C/ccmFN1 region in its mitochondrial genome. We also identified three tandem repeat sequences in the regions surrounding orf125, including a 63 bp repeat, which were involved in several recombination events. Interestingly, differences in the recombination activity for each repeat sequence were observed, even though these sequences were located adjacent to each other in the mitochondrial genome. We report results indicating that recombination events within the mitochondrial genomes are regulated at the level of specific repeat sequences depending on the cellular environment.
Locke, John; Podemski, Lynn; Roy, Ken; Pilgrim, David; Hodgetts, Ross
1999-01-01
Chromosome 4 from Drosophila melanogaster has several unusual features that distinguish it from the other chromosomes. These include a diffuse appearance in salivary gland polytene chromosomes, an absence of recombination, and the variegated expression of P-element transgenes. As part of a larger project to understand these properties, we are assembling a physical map of this chromosome. Here we report the sequence of two cosmids representing ∼5% of the polytenized region. Both cosmid clones contain numerous repeated DNA sequences, as identified by cross hybridization with labeled genomic DNA, BLAST searches, and dot matrix analysis, which are positioned between and within the transcribed sequences. The repetitive sequences include three copies of the mobile element Hoppel, one copy of the mobile element HB, and 18 DINE repeats. DINE is a novel, short repeated sequence dispersed throughout both cosmid sequences. One cosmid includes the previously described cubitus interruptus (ci) gene and two new genes: a gene with a predicted amino acid sequence similar to ribosomal protein S3a, which is consistent with the Minute(4)101 locus thought to be in the region, and a novel member of the protein family that includes plexin and met–hepatocyte growth factor receptor. The other cosmid contains only the two short 5′-most exons from the zinc-finger-homolog-2 (zfh-2) gene. This is the first extensive sequence analysis of noncoding DNA from chromosome 4. The distribution of the various repeats suggests its organization is similar to the β-heterochromatic regions near the base of the major chromosome arms. Such a pattern may account for the diffuse banding of the polytene chromosome 4 and the variegation of many P-element transgenes on the chromosome. PMID:10022978
Cell type-specific termination of transcription by transposable element sequences.
Conley, Andrew B; Jordan, I King
2012-09-30
Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human genes across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family, and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. 
The extent of transcription termination by TEs seen here, along with the preference for sense-oriented TE insertions to provide TTS, is consistent with the observed antisense orientation bias of human TEs.
Goren, Moran G; Yosef, Ido; Auster, Oren; Qimron, Udi
2012-10-12
We analyzed sequences of newly inserted repeats in an Escherichia coli CRISPR (clustered regularly interspaced short palindromic repeats) array in vivo and showed that a base previously thought to belong to the repeat is actually derived from a protospacer. Based on further experimental results, we propose to use the term "duplicon" for a repeated sequence in a CRISPR array that serves as a template for a new duplicon. Our findings suggest the possibility of redrawing the borders between repeats, spacers, and protospacer adjacent motifs. Copyright © 2012 Elsevier Ltd. All rights reserved.
Diamant, Eran; Palti, Yniv; Gur-Arie, Riva; Cohen, Helit; Hallerman, Eric M; Kashi, Yechezkel
2004-04-01
Multilocus sequencing of housekeeping genes has been used previously for bacterial strain typing and for inferring evolutionary relationships among strains of Escherichia coli. In this study, we used shorter intergenic sequences that contained simple sequence repeats (SSRs) of repeating mononucleotide motifs (mononucleotide repeats [MNRs]) to infer the phylogeny of pathogenic and commensal E. coli strains. Seven noncoding loci (four MNRs and three non-SSRs) were sequenced in 27 strains, including enterohemorrhagic (six isolates of O157:H7), enteropathogenic, enterotoxigenic, B, and K-12 strains. The four MNRs were also sequenced in 20 representative strains of the E. coli reference (ECOR) collection. Sequence polymorphism was significantly higher at the MNR loci, including the flanking sequences, indicating a higher mutation rate in the sequences flanking the MNR tracts. The four MNR loci were amplifiable by PCR in the standard ECOR A, B1, and D groups, but only one (yaiN) in the B2 group was amplified, which is consistent with previous studies that suggested that B2 is the most ancient group. High sequence compatibility was found between the four MNR loci, indicating that they are in the same clonal frame. The phylogenetic trees that were constructed from the sequence data were in good agreement with those of previous studies that used multilocus enzyme electrophoresis. The results demonstrate that MNR loci are useful for inferring phylogenetic relationships and provide much higher sequence variation than housekeeping genes. Therefore, the use of MNR loci for multilocus sequence typing should prove efficient for clinical diagnostics, epidemiology, and evolutionary study of bacteria. PMID:15066845
Quinn, J S; Guglich, E; Seutin, G; Lau, R; Marsolais, J; Parna, L; Boag, P T; White, B N
1992-02-01
The first tandemly repeated sequence examined in a passerine bird, a 431-bp PstI fragment named pMAT1, has been cloned from the genome of the brown-headed cowbird (Molothrus ater). The sequence represents about 5-10% of the genome (about 4 × 10^5 copies) and yields prominent ethidium bromide stained bands when genomic DNA cut with a variety of restriction enzymes is electrophoresed in agarose gels. A particularly striking ladder of fragments is apparent when the DNA is cut with HinfI, indicative of a tandem arrangement of the monomer. The cloned PstI monomer has been sequenced, revealing no internal repeated structure. There are sequences that hybridize with pMAT1 found in related nine-primaried oscines but not in more distantly related oscines, suboscines, or nonpasserine species. Little sequence similarity to tandemly repeated PstI cut sequences from the merlin (Falco columbarius), sarus crane (Grus antigone), or Puerto Rican parrot (Amazona vittata) or to HinfI digested sequence from the Toulouse goose (Anser anser) was detected. The isolated sequence was used as a probe to examine DNA samples of eight members of the tribe Icterini. This examination revealed phylogenetically informative characters. The repeat contains cutting sites from a number of restriction enzymes, which, if sufficiently polymorphic, would provide new phylogenetic characters. Sequences like these, conserved within a species, but variable between closely related species, may be very useful for phylogenetic studies of closely related taxa.
Ramos, Antonio M.; Crooijmans, Richard P. M. A.; Affara, Nabeel A.; Amaral, Andreia J.; Archibald, Alan L.; Beever, Jonathan E.; Bendixen, Christian; Churcher, Carol; Clark, Richard; Dehais, Patrick; Hansen, Mark S.; Hedegaard, Jakob; Hu, Zhi-Liang; Kerstens, Hindrik H.; Law, Andy S.; Megens, Hendrik-Jan; Milan, Denis; Nonneman, Danny J.; Rohrer, Gary A.; Rothschild, Max F.; Smith, Tim P. L.; Schnabel, Robert D.; Van Tassell, Curt P.; Taylor, Jeremy F.; Wiedmann, Ralph T.; Schook, Lawrence B.; Groenen, Martien A. M.
2009-01-01
Background The dissection of complex traits of economic importance to the pig industry requires the availability of a significant number of genetic markers, such as single nucleotide polymorphisms (SNPs). This study was conducted to discover several hundreds of thousands of porcine SNPs using next generation sequencing technologies and use these SNPs, as well as others from different public sources, to design a high-density SNP genotyping assay. Methodology/Principal Findings A total of 19 reduced representation libraries derived from four swine breeds (Duroc, Landrace, Large White, Pietrain) and a Wild Boar population and three restriction enzymes (AluI, HaeIII and MspI) were sequenced using Illumina's Genome Analyzer (GA). The SNP discovery effort resulted in the de novo identification of over 372K SNPs. More than 549K SNPs were used to design the Illumina Porcine 60K+SNP iSelect Beadchip, now commercially available as the PorcineSNP60. A total of 64,232 SNPs were included on the Beadchip. Results from genotyping the 158 individuals used for sequencing showed a high overall SNP call rate (97.5%). Of the 62,621 loci that could be reliably scored, 58,994 were polymorphic yielding a SNP conversion success rate of 94%. The average minor allele frequency (MAF) for all scorable SNPs was 0.274. Conclusions/Significance Overall, the results of this study indicate the utility of using next generation sequencing technologies to identify large numbers of reliable SNPs. In addition, the validation of the PorcineSNP60 Beadchip demonstrated that the assay is an excellent tool that will likely be used in a variety of future studies in pigs. PMID:19654876
Comprehensive analysis of CpG islands in human chromosomes 21 and 22
NASA Astrophysics Data System (ADS)
Takai, Daiya; Jones, Peter A.
2002-03-01
CpG islands are useful markers for genes in organisms containing 5-methylcytosine in their genomes. In addition, CpG islands located in the promoter regions of genes can play important roles in gene silencing during processes such as X-chromosome inactivation, imprinting, and silencing of intragenomic parasites. The generally accepted definition of what constitutes a CpG island was proposed in 1987 by Gardiner-Garden and Frommer [Gardiner-Garden, M. & Frommer, M. (1987) J. Mol. Biol. 196, 261-282] as being a 200-bp stretch of DNA with a C+G content of 50% and an observed CpG/expected CpG in excess of 0.6. Any definition of a CpG island is somewhat arbitrary, and this one, which was derived before the sequencing of mammalian genomes, will include many sequences that are not necessarily associated with controlling regions of genes but rather are associated with intragenomic parasites. We have therefore used the complete genomic sequences of human chromosomes 21 and 22 to examine the properties of CpG islands in different sequence classes by using a search algorithm that we have developed. Regions of DNA of greater than 500 bp with a G+C equal to or greater than 55% and observed CpG/expected CpG of 0.65 were more likely to be associated with the 5' regions of genes and this definition excluded most Alu-repetitive elements. We also used genome sequences to show strong CpG suppression in the human genome and slight suppression in Drosophila melanogaster and Saccharomyces cerevisiae. This finding is compatible with the recent detection of 5-methylcytosine in Drosophila, and might suggest that S. cerevisiae has, or once had, CpG methylation.
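The two quantities used in both CpG island definitions above (G+C content and the observed/expected CpG ratio) can be computed directly from a sequence. The following is a minimal sketch, not the authors' search algorithm: the function names are hypothetical, the thresholds are the ones quoted in the abstract (more than 500 bp, G+C of at least 55%), and reading "observed CpG/expected CpG of 0.65" as a lower bound is an assumption.

```python
def cpg_stats(seq):
    """Return (G+C fraction, observed/expected CpG ratio) for a DNA string."""
    seq = seq.upper()
    n = len(seq)
    c = seq.count("C")
    g = seq.count("G")
    cpg = seq.count("CG")  # CpG dinucleotides on the given strand
    gc_fraction = (c + g) / n if n else 0.0
    # Gardiner-Garden & Frommer ratio: (#CpG * N) / (#C * #G)
    obs_exp = (cpg * n) / (c * g) if c and g else 0.0
    return gc_fraction, obs_exp


def meets_criteria(seq, min_len=500, min_gc=0.55, min_obs_exp=0.65):
    """Check a candidate region against the stricter thresholds quoted above.

    Assumption: the 0.65 ratio is treated as a minimum (>=).
    """
    gc_fraction, obs_exp = cpg_stats(seq)
    return len(seq) > min_len and gc_fraction >= min_gc and obs_exp >= min_obs_exp


# Toy sequences, purely illustrative
cg_rich = "CG" * 300   # 600 bp, every dinucleotide step is CpG or GpC
at_only = "AT" * 300   # 600 bp, no C or G at all
```

A real scan would slide a window along the chromosome and merge adjacent qualifying windows; that step is left out here because the abstract does not describe how the authors did it.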
Process development of beam-lead silicon-gate COS/MOS integrated circuits
NASA Technical Reports Server (NTRS)
Baptiste, B.; Boesenberg, W.
1974-01-01
Two processes for the fabrication of beam-leaded COS/MOS integrated circuits are described. The first process utilizes a composite gate dielectric of 800 Å of silicon dioxide and 450 Å of pyrolytically deposited Al2O3 as an impurity barrier. The second process utilizes polysilicon gate metallization over which a sealing layer of 1000 Å of pyrolytic Si3N4 is deposited. Three beam-lead integrated circuits have been implemented with the first process: (1) CD4000BL - three-input NOR gate; (2) CD4007BL - triple inverter; and (3) CD4013BL - dual D flip flop. An arithmetic and logic unit (ALU) integrated circuit was designed and implemented with the second process. The ALU chip allows addition with four-bit accuracy. Processing details, device design and device characterization, circuit performance and life data are presented.
Nucleoside reverse transcriptase inhibitors possess intrinsic anti-inflammatory activity
Fowler, Benjamin J.; Gelfand, Bradley D.; Kim, Younghee; Kerur, Nagaraj; Tarallo, Valeria; Hirano, Yoshio; Amarnath, Shoba; Fowler, Daniel H.; Radwan, Marta; Young, Mark T.; Pittman, Keir; Kubes, Paul; Agarwal, Hitesh K.; Parang, Keykavous A.; Hinton, David R.; Bastos-Carvalho, Ana; Li, Shengjian; Yasuma, Tetsuhiro; Mizutani, Takeshi; Yasuma, Reo; Wright, Charles; Ambati, Jayakrishna
2014-01-01
Nucleoside reverse transcriptase inhibitors (NRTIs) are mainstay therapeutics for HIV that block retrovirus replication. Alu (an endogenous retroelement that also requires reverse transcriptase for its life cycle)-derived RNAs activate P2X7 and the NLRP3 inflammasome to cause cell death of the retinal pigment epithelium (RPE) in geographic atrophy, a type of age-related macular degeneration. We found that NRTIs inhibit P2X7-mediated NLRP3 inflammasome activation independent of reverse transcriptase inhibition. Multiple approved and clinically relevant NRTIs prevented caspase-1 activation, the effector of the NLRP3 inflammasome, induced by Alu RNA. NRTIs were efficacious in mouse models of geographic atrophy, choroidal neovascularization, graft-versus-host disease (GVHD), and sterile liver inflammation. Our findings suggest that NRTIs are ripe for drug repurposing in P2X7-driven diseases. PMID:25414314
Cell-Free circulating DNA: a new biomarker for the acute coronary syndrome.
Cui, Ming; Fan, Mengkang; Jing, Rongrong; Wang, Huimin; Qin, Jingfeng; Sheng, Hongzhuan; Wang, Yueguo; Wu, Xinhua; Zhang, Lurong; Zhu, Jianhua; Ju, Shaoqing
2013-01-01
In recent studies, concentrations of cell-free circulating DNA (cf-DNA) have been correlated with clinical characteristics and prognosis in several diseases. The relationship between cf-DNA concentrations and the acute coronary syndrome (ACS) remains unknown. Moreover, no data are available for the detection of cf-DNA in ACS by a branched DNA (bDNA)-based Alu assay. The aim of the present study was to investigate cf-DNA concentrations in ACS and their relationship with clinical features. Plasma cf-DNA concentrations of 137 ACS patients at diagnosis, of 60 healthy individuals and of 13 patients with stable angina (SA) were determined using a bDNA-based Alu assay. ACS patients (median 2,285.0, interquartile range 916.4-4,857.3 ng/ml), especially ST-segment elevation myocardial infarction patients (median 5,745.4, interquartile range 4,013.5-8,643.9 ng/ml), showed a significant increase in plasma cf-DNA concentrations compared with controls (healthy controls: median 118.3, interquartile range 81.1-221.1 ng/ml; SA patients: median 202.3, interquartile range 112.7-256.1 ng/ml) using a bDNA-based Alu assay. Moreover, we found positive correlations between cf-DNA and Gensini scoring and GRACE (Global Registry of Acute Coronary Events) scoring in ACS. cf-DNA may be a valuable marker for diagnosing and predicting the severity of coronary artery lesions and risk stratification in ACS. Copyright © 2013 S. Karger AG, Basel.
Pichler, Irene; Mueller, Jakob C; Stefanov, Stefan A; De Grandi, Alessandro; Volpato, Claudia Beu; Pinggera, Gerd K; Mayr, Agnes; Ogriseg, Martin; Ploner, Franz; Meitinger, Thomas; Pramstaller, Peter P
2006-08-01
Most of the inhabitants of South Tyrol in the eastern Italian Alps can be considered isolated populations because of their physical separation by mountain barriers and their sociocultural heritage. We analyzed the genetic structure of South Tyrolean populations using three types of genetic markers: Y-chromosome, mitochondrial DNA (mtDNA), and autosomal Alu markers. Using random samples taken from the populations of Val Venosta, Val Pusteria, Val Isarco, Val Badia, and Val Gardena, we calculated genetic diversity within and among the populations. Microsatellite diversity and unique event polymorphism diversity (on the Y chromosome) were substantially lower in the Ladin-speaking population of Val Badia compared to the neighboring German-speaking populations. In contrast, the genetic diversity of mtDNA haplotypes was lowest for the upper Val Venosta and Val Pusteria. These data suggest a low effective population size, or little admixture, for the gene pool of the Ladin-speaking population from Val Badia. Interestingly, this is more pronounced for Ladin males than for Ladin females. For the pattern of genetic Alu variation, both Ladin samples (Val Gardena and Val Badia) are among the samples with the lowest diversity. An admixture analysis of one German-speaking valley (Val Venosta) indicates a relatively high genetic contribution of Ladin origin. The reduced genetic diversity and a high genetic differentiation in the Rhaetoroman- and German-speaking South Tyrolean populations may constitute an important basis for future medical genetic research and gene mapping studies in South Tyrol.
Luo, Yan; Liu, Qin; Lei, Xun; Wen, Yi; Yang, Ya-Lan; Zhang, Rui; Hu, Meng-Yao
2015-07-01
This study aims to estimate the association between ESR1 polymorphisms (PvuII and XbaI) and ESR2 polymorphisms (RsaI and AluI) with precocious puberty. Relevant studies published before March 2014 were retrieved by an electronic search of nine databases. Pooled odds ratios (ORs) with 95% confidence intervals (CIs) were calculated by meta-analysis. Four eligible case-control studies including 491 precocious puberty patients and 370 healthy controls were identified. Three studies reported ESR1 PvuII and XbaI polymorphism and one study reported ESR2 RsaI and AluI polymorphism. Increased precocious puberty risk was associated with PvuII polymorphism in the heterosis model (CT versus TT: OR 1.42, 95% CI: 1.05-1.91, p = 0.02). Risk of precocious puberty was associated with XbaI polymorphism in the dominant model (GG + GA versus AA: OR 1.48, 95% CI: 1.11-1.97, p = 0.007) and the heterosis model (GA versus AA: OR 1.68, 95% CI: 1.23-2.29, p = 0.001). This meta-analysis suggests that ESR1 XbaI and PvuII polymorphisms are associated with precocious puberty susceptibility, and the relationship between ESR2 RsaI and AluI polymorphism with precocious puberty remains to be further investigated. Well-designed studies with large sample sizes across different polymorphisms and ethnicities are urgently needed to provide and update reliable data for a comprehensive and definite conclusion.
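For readers unfamiliar with how an odds ratio and its 95% confidence interval are obtained from case-control genotype counts, a minimal sketch follows. It assumes the standard log-OR (Woolf) interval for a single study and inverse-variance fixed-effect pooling across studies; the abstract does not state which pooling model was used, and the counts below are invented for illustration only.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """OR and Woolf confidence interval from a 2x2 table.

    a: cases with the genotype,    b: cases without it,
    c: controls with the genotype, d: controls without it.
    """
    odds_ratio = (a * d) / (b * c)
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of log(OR)
    low = math.exp(math.log(odds_ratio) - z * se)
    high = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, low, high


def pooled_or_fixed(tables, z=1.96):
    """Inverse-variance fixed-effect pooling of log odds ratios (an assumed model)."""
    weighted_sum = 0.0
    weight_total = 0.0
    for a, b, c, d in tables:
        log_or = math.log((a * d) / (b * c))
        weight = 1.0 / (1 / a + 1 / b + 1 / c + 1 / d)  # 1 / variance
        weighted_sum += weight * log_or
        weight_total += weight
    pooled = weighted_sum / weight_total
    se = math.sqrt(1.0 / weight_total)
    return math.exp(pooled), math.exp(pooled - z * se), math.exp(pooled + z * se)


# Hypothetical counts, not taken from the studies in the meta-analysis
single = odds_ratio_ci(20, 10, 10, 20)
pooled = pooled_or_fixed([(20, 10, 10, 20), (20, 10, 10, 20)])
```

Pooling two identical tables leaves the point estimate unchanged and narrows the interval, which is the expected behaviour of inverse-variance weighting.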
“One code to find them all”: a perl tool to conveniently parse RepeatMasker output files
2014-01-01
Background Of the different bioinformatic methods used to recover transposable elements (TEs) in genome sequences, one of the most commonly used procedures is the homology-based method proposed by the RepeatMasker program. RepeatMasker generates several output files, including the .out file, which provides annotations for all detected repeats in a query sequence. However, a remaining challenge consists of identifying the different copies of TEs that correspond to the identified hits. This step is essential for any evolutionary/comparative analysis of the different copies within a family. Different possibilities can lead to multiple hits corresponding to a unique copy of an element, such as the presence of large deletions/insertions or undetermined bases, and distinct consensus corresponding to a single full-length sequence (like for long terminal repeat (LTR)-retrotransposons). These possibilities must be taken into account to determine the exact number of TE copies. Results We have developed a perl tool that parses the RepeatMasker .out file to better determine the number and positions of TE copies in the query sequence, in addition to computing quantitative information for the different families. To determine the accuracy of the program, we tested it on several RepeatMasker .out files corresponding to two organisms (Drosophila melanogaster and Homo sapiens) for which the TE content has already been largely described and which present great differences in genome size, TE content, and TE families. Conclusions Our tool provides access to detailed information concerning the TE content in a genome at the family level from the .out file of RepeatMasker. This information includes the exact position and orientation of each copy, its proportion in the query sequence, and its quality compared to the reference element. 
In addition, our tool allows a user to directly retrieve the sequence of each copy and obtain the same detailed information at the family level when a local library with incomplete TE class/subclass information was used with RepeatMasker. We hope that this tool will be helpful for people working on the distribution and evolution of TEs within genomes.
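As an illustration of the parsing problem described above, the sketch below reads RepeatMasker .out lines and counts copies and masked bases per class/family. It is written in Python rather than perl and is not the published tool: it merges hits only through RepeatMasker's own ID column, which is a simplification of the copy-reconstruction rules the abstract refers to. The sample lines are invented.

```python
from collections import defaultdict

# Invented example in the whitespace-separated .out layout (two header lines, blank line, hits)
SAMPLE = """\
   SW   perc perc perc  query     position in query    matching  repeat        position in repeat
score   div. del. ins.  sequence  begin end   (left)   repeat    class/family  begin end  (left)  ID

 1200   10.0  0.0  0.0  chr1      100   399   (9601) + AluY      SINE/Alu      1     300  (11)    1
  800   12.0  1.0  0.0  chr1      1000  1499  (8501) C L1PA3     LINE/L1       (100) 6000 5501    2
  650   15.0  0.0  2.0  chr1      1800  2099  (7901) C L1PA3     LINE/L1       (900) 5200 4901    2
"""


def parse_out(lines):
    """Parse hit lines; header and blank lines are skipped."""
    hits = []
    for line in lines:
        fields = line.split()
        if len(fields) < 15 or not fields[0].isdigit():
            continue
        hits.append({
            "query": fields[4],
            "start": int(fields[5]),
            "end": int(fields[6]),
            "strand": "-" if fields[8] == "C" else "+",  # "C" marks the complement strand
            "repeat": fields[9],
            "class_family": fields[10],
            "id": fields[14],
        })
    return hits


def copies_per_family(hits):
    """Return {class/family: (copy count, masked bases)}.

    Assumption: hits sharing a query and an ID are fragments of one copy.
    """
    ids = defaultdict(set)
    bases = defaultdict(int)
    for hit in hits:
        ids[hit["class_family"]].add((hit["query"], hit["id"]))
        bases[hit["class_family"]] += hit["end"] - hit["start"] + 1
    return {family: (len(ids[family]), bases[family]) for family in ids}


summary = copies_per_family(parse_out(SAMPLE.splitlines()))
```

In this toy input the two L1PA3 fragments share ID 2, so they are counted as a single LINE/L1 copy covering 800 bases.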
Effects of "D"-Amphetamine and Ethanol on Variable and Repetitive Key-Peck Sequences in Pigeons
ERIC Educational Resources Information Center
Ward, Ryan D.; Bailey, Ericka M.; Odum, Amy L.
2006-01-01
This experiment assessed the effects of "d"-Amphetamine and ethanol on reinforced variable and repetitive key-peck sequences in pigeons. Pigeons responded on two keys under a multiple schedule of Repeat and Vary components. In the Repeat component, completion of a target sequence of right, right, left, left resulted in food. In the Vary component,…
Alverson, Andrew J; Zhuo, Shi; Rice, Danny W; Sloan, Daniel B; Palmer, Jeffrey D
2011-01-20
The mitochondrial genomes of seed plants are exceptionally fluid in size, structure, and sequence content, with the accumulation and activity of repetitive sequences underlying much of this variation. We report the first fully sequenced mitochondrial genome of a legume, Vigna radiata (mung bean), and show that despite its unexceptional size (401,262 nt), the genome is unusually depauperate in repetitive DNA and "promiscuous" sequences from the chloroplast and nuclear genomes. Although Vigna lacks the large, recombinationally active repeats typical of most other seed plants, a PCR survey of its modest repertoire of short (38-297 nt) repeats nevertheless revealed evidence for recombination across all of them. A set of novel control assays showed, however, that these results could instead reflect, in part or entirely, artifacts of PCR-mediated recombination. Consequently, we recommend that other methods, especially high-depth genome sequencing, be used instead of PCR to infer patterns of plant mitochondrial recombination. The average-sized but repeat- and feature-poor mitochondrial genome of Vigna makes it ever more difficult to generalize about the factors shaping the size and sequence content of plant mitochondrial genomes.
Highly Informative Simple Sequence Repeat (SSR) Markers for Fingerprinting Hazelnut
USDA-ARS's Scientific Manuscript database
Simple sequence repeat (SSR) or microsatellite markers have many applications in breeding and genetic studies of plants, including fingerprinting of cultivars and investigations of genetic diversity, and therefore provide information for better management of germplasm collections. They are repeatab...
Distribution and sequence homogeneity of an abundant satellite DNA in the beetle, Tenebrio molitor.
Davis, C A; Wyatt, G R
1989-01-01
The mealworm beetle, Tenebrio molitor, contains an unusually abundant and homogeneous satellite DNA which constitutes up to 60% of its genome. The satellite DNA is shown to be present in all of the chromosomes by in situ hybridization. 18 dimers of the repeat unit were cloned and sequenced. The consensus sequence is 142 nt long and lacks any internal repeat structure. Monomers of the sequence are very similar, showing on average a 2% divergence from the calculated consensus. Variant nucleotides are scattered randomly throughout the sequence although some variants are more common than others. Neighboring repeat units are no more alike than randomly chosen ones. The results suggest that some mechanism, perhaps gene conversion, is acting to maintain the homogeneity of the satellite DNA despite its abundance and distribution on all of the chromosomes. PMID:2762148
Amino acid sequence analysis of the annexin super-gene family of proteins.
Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J
1991-06-15
The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. 
The structure confirms most of the predictions and shows the power of techniques for the determination of tertiary structural information from the amino acid sequences of an aligned protein family.
Discovery of Escherichia coli CRISPR sequences in an undergraduate laboratory.
Militello, Kevin T; Lazatin, Justine C
2017-05-01
Clustered regularly interspaced short palindromic repeats (CRISPRs) represent a novel type of adaptive immune system found in eubacteria and archaebacteria. CRISPRs have recently generated a lot of attention due to their unique ability to catalog foreign nucleic acids, their ability to destroy foreign nucleic acids in a mechanism that shares some similarity to RNA interference, and the ability to utilize reconstituted CRISPR systems for genome editing in numerous organisms. In order to introduce CRISPR biology into an undergraduate upper-level laboratory, a five-week set of exercises was designed to allow students to examine the CRISPR status of uncharacterized Escherichia coli strains and to allow the discovery of new repeats and spacers. Students started the project by isolating genomic DNA from E. coli and amplifying the iap CRISPR locus using the polymerase chain reaction (PCR). The PCR products were analyzed by Sanger DNA sequencing, and the sequences were examined for the presence of CRISPR repeat sequences. The regions between the repeats, the spacers, were extracted and analyzed with BLASTN searches. Overall, CRISPR loci were sequenced from several previously uncharacterized E. coli strains and one E. coli K-12 strain. Sanger DNA sequencing resulted in the discovery of 36 spacer sequences and their corresponding surrounding repeat sequences. Five of the spacers were homologous to foreign (non-E. coli) DNA. Assessment of the laboratory indicates that improvements were made in the ability of students to answer questions relating to the structure and function of CRISPRs. Future directions of the laboratory are presented and discussed. © 2016 by The International Union of Biochemistry and Molecular Biology, 45(3):262-269, 2017.
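The spacer-extraction step described above (taking the regions between consecutive repeats before a BLASTN search) can be sketched in a few lines. This is an illustrative sketch, not the procedure used in the course: it assumes the repeat sequence is already known and matches exactly, whereas real repeats can differ by a base or two, and the repeat and array below are short invented strings rather than E. coli sequence.

```python
def extract_spacers(array_seq, repeat):
    """Return the sequences lying between consecutive exact copies of `repeat`."""
    positions = []
    start = array_seq.find(repeat)
    while start != -1:
        positions.append(start)
        # Continue searching after the end of the repeat just found
        start = array_seq.find(repeat, start + len(repeat))

    spacers = []
    for left, right in zip(positions, positions[1:]):
        spacers.append(array_seq[left + len(repeat):right])
    return spacers


# Hypothetical toy array: leader + repeat + spacer + repeat + spacer + repeat + trailer
REPEAT = "GTTCC"
ARRAY = "AAA" + REPEAT + "ACGTACGT" + REPEAT + "TTGACA" + REPEAT + "GG"
spacers = extract_spacers(ARRAY, REPEAT)
```

Each recovered spacer would then be submitted as a BLASTN query; sequence flanking the first and last repeat is deliberately not reported as a spacer.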
Sequence investigation of 34 forensic autosomal STRs with massively parallel sequencing.
Zhang, Suhua; Niu, Yong; Bian, Yingnan; Dong, Rixia; Liu, Xiling; Bao, Yun; Jin, Chao; Zheng, Hancheng; Li, Chengtao
2018-05-01
STRs vary not only in the length of the repeat units and the number of repeats but also in the region with which they conform to an incremental repeat pattern. Massively parallel sequencing (MPS) offers new possibilities in the analysis of STRs since it can simultaneously sequence multiple targets in a single reaction and capture potential internal sequence variations. Here, we sequenced 34 STRs applied in the forensic community of China with a custom-designed panel. MPS performance was evaluated by sequencing reads analysis, concordance study and sensitivity testing. High coverage sequencing data were obtained to determine the constitute ratios and heterozygous balance. No actual inconsistent genotypes were observed between capillary electrophoresis (CE) and MPS, demonstrating the reliability of the panel and the MPS technology. With the sequencing data from the 200 investigated individuals, 346 and 418 alleles were obtained via CE and MPS technologies at the 34 STRs, indicating that MPS technology provides higher discrimination than CE detection. The whole study demonstrated that STR genotyping with the custom panel and MPS technology has the potential not only to reveal length and sequence variations but also to satisfy the demands of high throughput and high multiplexing with acceptable sensitivity.
Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor.
Kohany, Oleksiy; Gentles, Andrew J; Hankus, Lukasz; Jurka, Jerzy
2006-10-25
Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintenance of the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases. We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors, and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein coding regions in sequences; searching and including Pubmed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments. Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at http://www.girinst.org/repbase/submission.html (RepbaseSubmitter) and http://www.girinst.org/censor/index.php (Censor).
Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence
2017-01-01
During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana. We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays, although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. PMID:28223399
Repeating aftershocks of the great 2004 Sumatra and 2005 Nias earthquakes
NASA Astrophysics Data System (ADS)
Yu, Wen-che; Song, Teh-Ru Alex; Silver, Paul G.
2013-05-01
We investigate repeating aftershocks associated with the great 2004 Sumatra-Andaman (Mw 9.2) and 2005 Nias-Simeulue (Mw 8.6) earthquakes by cross-correlating waveforms recorded by the regional seismographic station PSI and teleseismic stations. We identify 10 and 18 correlated aftershock sequences associated with the great 2004 Sumatra and 2005 Nias earthquakes, respectively. The majority of the correlated aftershock sequences are located near the down-dip end of a large afterslip patch. We determine the precise relative locations of event pairs among these sequences and estimate the source rupture areas. The correlated event pairs identified are appropriately referred to as repeating aftershocks, in that the source rupture areas are comparable and significantly overlap within a sequence. We use the repeating aftershocks to estimate afterslip based on the slip-seismic moment scaling relationship and to infer the temporal decay rate of the recurrence interval. The estimated afterslip resembles that measured from the near-field geodetic data to the first order. The decay rate of repeating aftershocks as a function of lapse time t follows a power-law decay 1/t^p with the exponent p in the range 0.8-1.1. Both types of observations indicate that repeating aftershocks are governed by post-seismic afterslip.
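As a rough illustration of how a power-law exponent like the one quoted in this abstract can be estimated, the sketch below fits a straight line to log(rate) against log(time). It is not the authors' procedure: the function name `fit_power_law_exponent` and the synthetic `times` and `rates` values are assumptions made up for the example.

```python
import math

def fit_power_law_exponent(times, rates):
    """Estimate p for rate ~ 1/t**p by least squares on log-log values.

    Hypothetical helper for illustration; not taken from the study.
    """
    xs = [math.log(t) for t in times]
    ys = [math.log(r) for r in rates]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Ordinary least-squares slope of log(rate) against log(time).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    # A decay of 1/t**p has slope -p in log-log space.
    return -slope

# Synthetic, noise-free data generated with p = 0.9 (made-up values).
times = [1.0, 2.0, 5.0, 10.0, 20.0, 50.0, 100.0]
rates = [3.0 / t ** 0.9 for t in times]
p_hat = fit_power_law_exponent(times, rates)
print(round(p_hat, 3))
```

With real recurrence data the points would scatter around the line, so an uncertainty on p would also be needed; the sketch omits that.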
Plourde, Marie; Gingras, Hélène; Roy, Gaétan; Lapointe, Andréanne; Leprohon, Philippe; Papadopoulou, Barbara; Corbeil, Jacques; Ouellette, Marc
2014-01-01
Gene amplification of specific loci has been described in all kingdoms of life. In the protozoan parasite Leishmania, the product of amplification is usually part of extrachromosomal circular or linear amplicons that are formed at the level of direct or inverted repeated sequences. A bioinformatics screen revealed that repeated sequences are widely distributed in the Leishmania genome and the repeats are chromosome-specific, conserved among species, and generally present in low copy number. Using sensitive PCR assays, we provide evidence that the Leishmania genome is continuously being rearranged at the level of these repeated sequences, which serve as a functional platform for constitutive and stochastic amplification (and deletion) of genomic segments in the population. This process is adaptive as the copy number of advantageous extrachromosomal circular or linear elements increases upon selective pressure and is reversible when selection is removed. We also provide mechanistic insights on the formation of circular and linear amplicons through RAD51 recombinase-dependent and -independent mechanisms, respectively. The whole genome of Leishmania is thus stochastically rearranged at the level of repeated sequences, and the selection of parasite subpopulations with changes in the copy number of specific loci is used as a strategy to respond to a changing environment. PMID:24844805
Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao
2018-05-01
Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We analyzed 843,473 STR loci with repeat units of two to six base pairs and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci widely used in forensic DNA typing with capillary electrophoresis-based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. From these analyses, we also identified 218 STR loci with four-base-pair repeat units and 53 loci with five-base-pair repeat units that are suitable for both short-read sequencing and PCR-based technologies, and which would be candidates for actual forensic DNA typing in the Japanese population.
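To make the notion of an STR allele concrete, the following minimal sketch counts the longest uninterrupted run of a repeat unit in a sequence. It is only an illustration: the function name `longest_tandem_run` and the example read are hypothetical, and genotyping tools for short-read data handle flanking sequence, sequencing errors and interrupted repeats, which this does not.

```python
import re

def longest_tandem_run(sequence, unit):
    """Return the largest number of consecutive copies of `unit` in `sequence`.

    Hypothetical helper for illustration; returns 0 if the unit is absent.
    """
    pattern = re.compile("(?:%s)+" % re.escape(unit))
    best = 0
    for match in pattern.finditer(sequence):
        # Each match is a whole number of back-to-back copies of the unit.
        copies = len(match.group(0)) // len(unit)
        best = max(best, copies)
    return best

# Made-up read: four GATA copies, a two-base interruption, one more copy.
read = "ACGT" + "GATA" * 4 + "TT" + "GATA"
print(longest_tandem_run(read, "GATA"))
```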
Molecular analysis of mucopolysaccharidosis IVA: Common mutations and racial difference
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tomatsu, S.; Hori, T.; Nakashima, Y.
1994-09-01
Mucopolysaccharidosis IVA (MPS IVA) is an autosomal recessive disorder caused by a deficiency in N-acetylgalactosamine-6-sulfate sulfatase (GALNS). Studies on the molecular basis of MPS IVA have been facilitated following cloning of the full-length cDNA and genomic DNA. In this study we detected mutations from 20 Caucasian and 19 Japanese MPS IVA patients using the SSCP system and compared mutations of Caucasian origin with those of Japanese origin. The results showed the presence of 16 different mutations (3 small deletions, 2 nonsense and 11 missense mutations) in Caucasian patients and 15 (1 deletion, 1 large alteration and 13 missense mutations) in Japanese patients. Moreover, two common mutations existed: one is a double gene deletion characteristic of Japanese patients (6 alleles; 15%) and the other is a point mutation (I113F, A→T transition) characteristic of Caucasian patients (9 alleles; 22.5%). A clear genotype/phenotype relationship was observed, with 1342delCA, IVS1(-2), P151S, Q148X, R386C, I113F, Q473X, W220G, P151L, A291T, R90W, and P77R associated with a severe type and G96B N204K and V138A with a milder type. Only the R386 mutation was seen in both populations. Further, precise DNA analysis of the common double gene deletion was performed by defining the breakpoints; the results showed that one deletion was caused by homologous recombination between Alu repetitive sequences and the other by nonhomologous recombination at a short direct repeat. Haplotypes of the six alleles with the double deletion differed, indicating either different origins of this mutation or frequent recombination events before the mutational event. Thus the mutations in the GALNS gene are very heterogeneous, and the racial difference is characteristic.
Begum, Rabeya; Alam, Sheikh Shamimul; Menzel, Gerhard; Schmidt, Thomas
2009-01-01
Background and Aims Dendrobium species show tremendous morphological diversity and have broad geographical distribution. As repetitive sequence analysis is a useful tool to investigate the evolution of chromosomes and genomes, the aim of the present study was the characterization of repetitive sequences from Dendrobium moschatum for comparative molecular and cytogenetic studies in the related species Dendrobium aphyllum, Dendrobium aggregatum and representatives from other orchid genera. Methods In order to isolate highly repetitive sequences, a c0t-1 DNA plasmid library was established. Repeats were sequenced and used as probes for Southern hybridization. Sequence divergence was analysed using bioinformatic tools. Repetitive sequences were localized along orchid chromosomes by fluorescence in situ hybridization (FISH). Key Results Characterization of the c0t-1 library resulted in the detection of repetitive sequences including the (GA)n dinucleotide DmoO11, numerous Arabidopsis-like telomeric repeats and the highly amplified dispersed repeat DmoF14. The DmoF14 repeat is conserved in six Dendrobium species but diversified in representative species of three other orchid genera. FISH analyses showed the genome-wide distribution of DmoF14 in D. moschatum, D. aphyllum and D. aggregatum. Hybridization with the telomeric repeats demonstrated Arabidopsis-like telomeres at the chromosome ends of Dendrobium species. However, FISH using the telomeric probe revealed two pairs of chromosomes with strong intercalary signals in D. aphyllum. FISH showed the terminal position of 5S and 18S–5·8S–25S rRNA genes and a characteristic number of rDNA sites in the three Dendrobium species. Conclusions The repeated sequences isolated from D. moschatum c0t-1 DNA constitute major DNA families of the D. moschatum, D. aphyllum and D. aggregatum genomes with DmoF14 representing an ancient component of orchid genomes. 
Large intercalary telomere-like arrays suggest chromosomal rearrangements in D. aphyllum while the number and localization of rRNA genes as well as the species-specific distribution pattern of an abundant microsatellite reflect the genomic diversity of the three Dendrobium species. PMID:19635741
Lim, K Yoong; Kovarik, Ales; Matyasek, Roman; Chase, Mark W; Knapp, Sandra; McCarthy, Elizabeth; Clarkson, James J; Leitch, Andrew R
2006-12-01
Combining phylogenetic reconstructions of species relationships with comparative genomic approaches is a powerful way to decipher evolutionary events associated with genome divergence. Here, we reconstruct the history of karyotype and tandem repeat evolution in species of diploid Nicotiana section Alatae. By analysis of plastid DNA, we resolved two clades with high bootstrap support, one containing N. alata, N. langsdorffii, N. forgetiana and N. bonariensis (called the n = 9 group) and another containing N. plumbaginifolia and N. longiflora (called the n = 10 group). Despite little plastid DNA sequence divergence, we observed, via fluorescent in situ hybridization, substantial chromosomal repatterning, including altered chromosome numbers, structure and distribution of repeats. Effort was focussed on 35S and 5S nuclear ribosomal DNA (rDNA) and the HRS60 satellite family of tandem repeats comprising the elements HRS60, NP3R and NP4R. We compared divergence of these repeats in diploids and polyploids of Nicotiana. There are dramatic shifts in the distribution of the satellite repeats and complete replacement of intergenic spacers (IGSs) of 35S rDNA associated with divergence of the species in section Alatae. We suggest that sequence homogenization has replaced HRS60 family repeats at sub-telomeric regions, but that this process may not occur, or occurs more slowly, when the repeats are found at intercalary locations. Sequence homogenization acts more rapidly (at least two orders of magnitude) on 35S rDNA than 5S rDNA and sub-telomeric satellite sequences. This rapid rate of divergence is analogous to that found in polyploid species, and is therefore, in plants, not only associated with polyploidy.
Correlation between fibroin amino acid sequence and physical silk properties.
Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek
2003-09-12
The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet.
Evidence for Long-Timescale Patterns of Synaptic Inputs in CA1 of Awake Behaving Mice.
Kolb, Ilya; Talei Franzesi, Giovanni; Wang, Michael; Kodandaramaiah, Suhasa B; Forest, Craig R; Boyden, Edward S; Singer, Annabelle C
2018-02-14
Repeated sequences of neural activity are a pervasive feature of neural networks in vivo and in vitro. In the hippocampus, sequential firing of many neurons over periods of 100-300 ms reoccurs during behavior and during periods of quiescence. However, it is not known whether the hippocampus produces longer sequences of activity or whether such sequences are restricted to specific network states. Furthermore, whether long repeated patterns of activity are transmitted to single cells downstream is unclear. To answer these questions, we recorded intracellularly from hippocampal CA1 of awake, behaving male mice to examine both subthreshold activity and spiking output in single neurons. In eight of nine recordings, we discovered long (900 ms) reoccurring subthreshold fluctuations or "repeats." Repeats generally were high-amplitude, nonoscillatory events reoccurring with 10 ms precision. Using statistical controls, we determined that repeats occurred more often than would be expected from unstructured network activity (e.g., by chance). Most spikes occurred during a repeat, and when a repeat contained a spike, the spike reoccurred with precision on the order of ≤20 ms, showing that long repeated patterns of subthreshold activity are strongly connected to spike output. Unexpectedly, we found that repeats occurred independently of classic hippocampal network states like theta oscillations or sharp-wave ripples. Together, these results reveal surprisingly long patterns of repeated activity in the hippocampal network that occur nonstochastically, are transmitted to single downstream neurons, and strongly shape their output. This suggests that the timescale of information transmission in the hippocampal network is much longer than previously thought. SIGNIFICANCE STATEMENT We found long (≥900 ms), repeated, subthreshold patterns of activity in CA1 of awake, behaving mice. These repeated patterns ("repeats") occurred more often than expected by chance and with 10 ms precision. 
Most spikes occurred within repeats and reoccurred with a precision on the order of 20 ms. Surprisingly, there was no correlation between repeat occurrence and classical network states such as theta oscillations and sharp-wave ripples. These results provide strong evidence that long patterns of activity are repeated and transmitted to downstream neurons, suggesting that the hippocampus can generate longer sequences of repeated activity than previously thought.
CRF: detection of CRISPR arrays using random forest.
Wang, Kai; Liang, Chun
2017-01-01
CRISPRs (clustered regularly interspaced short palindromic repeats) are particular repeat sequences found in a wide range of bacterial and archaeal genomes. Several tools are available for detecting CRISPR arrays in the genomes of both domains. Here we developed a new web-based CRISPR detection tool named CRF (CRISPR Finder by Random Forest). Unlike other CRISPR detection tools, CRF uses a random forest classifier to filter out invalid CRISPR arrays from all putative candidates, which enhances detection accuracy. In particular, triplet elements that combine both sequence content and structure information were extracted from CRISPR repeats for classifier training. The classifier achieved high accuracy and sensitivity. Moreover, CRF offers a highly interactive web interface for robust data visualization that is not available among other CRISPR detection tools. After detection, the query sequence, CRISPR array architecture, and the sequences and secondary structures of CRISPR repeats and spacers can be visualized for examination and validation. CRF is freely available at http://bioinfolab.miamioh.edu/crf/home.php.
Expanded complexity of unstable repeat diseases
Polak, Urszula; McIvor, Elizabeth; Dent, Sharon Y.R.; Wells, Robert D.; Napierala, Marek
2015-01-01
Unstable Repeat Diseases (URDs) share a common mutational phenomenon of changes in the copy number of short, tandemly repeated DNA sequences. More than 20 human neurological diseases are caused by instability, predominantly expansion, of microsatellite sequences. Changes in the repeat size initiate a cascade of pathological processes, frequently characteristic of a unique disease or a small subgroup of the URDs. Understanding both the mechanism of repeat instability and the molecular consequences of the repeat expansions is critical to developing successful therapies for these diseases. Recent technological breakthroughs in whole genome, transcriptome and proteome analyses will almost certainly lead to new discoveries regarding the mechanisms of repeat instability and the pathogenesis of URDs, and will facilitate development of novel therapeutic approaches. The aim of this review is to give a general overview of unstable repeat diseases, highlight the complexities of these diseases, and feature the emerging discoveries in the field. PMID:23233240
Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.
Mayer, K; Schüller, C; Wambutt, R; Murphy, G; Volckaert, G; Pohl, T; Düsterhöft, A; Stiekema, W; Entian, K D; Terryn, N; Harris, B; Ansorge, W; Brandt, P; Grivell, L; Rieger, M; Weichselgartner, M; de Simone, V; Obermaier, B; Mache, R; Müller, M; Kreis, M; Delseny, M; Puigdomenech, P; Watson, M; Schmidtheini, T; Reichert, B; Portatelle, D; Perez-Alonso, M; Boutry, M; Bancroft, I; Vos, P; Hoheisel, J; Zimmermann, W; Wedler, H; Ridley, P; Langham, S A; McCullagh, B; Bilham, L; Robben, J; Van der Schueren, J; Grymonprez, B; Chuang, Y J; Vandenbussche, F; Braeken, M; Weltjens, I; Voet, M; Bastiaens, I; Aert, R; Defoor, E; Weitzenegger, T; Bothe, G; Ramsperger, U; Hilbert, H; Braun, M; Holzer, E; Brandt, A; Peters, S; van Staveren, M; Dirske, W; Mooijman, P; Klein Lankhorst, R; Rose, M; Hauf, J; Kötter, P; Berneiser, S; Hempel, S; Feldpausch, M; Lamberth, S; Van den Daele, H; De Keyser, A; Buysshaert, C; Gielen, J; Villarroel, R; De Clercq, R; Van Montagu, M; Rogers, J; Cronin, A; Quail, M; Bray-Allen, S; Clark, L; Doggett, J; Hall, S; Kay, M; Lennard, N; McLay, K; Mayes, R; Pettett, A; Rajandream, M A; Lyne, M; Benes, V; Rechmann, S; Borkova, D; Blöcker, H; Scharfe, M; Grimm, M; Löhnert, T H; Dose, S; de Haan, M; Maarse, A; Schäfer, M; Müller-Auer, S; Gabel, C; Fuchs, M; Fartmann, B; Granderath, K; Dauner, D; Herzl, A; Neumann, S; Argiriou, A; Vitale, D; Liguori, R; Piravandi, E; Massenet, O; Quigley, F; Clabauld, G; Mündlein, A; Felber, R; Schnabl, S; Hiller, R; Schmidt, W; Lecharny, A; Aubourg, S; Chefdor, F; Cooke, R; Berger, C; Montfort, A; Casacuberta, E; Gibbons, T; Weber, N; Vandenbol, M; Bargues, M; Terol, J; Torres, A; Perez-Perez, A; Purnelle, B; Bent, E; Johnson, S; Tacon, D; Jesse, T; Heijnen, L; Schwarz, S; Scholler, P; Heber, S; Francs, P; Bielke, C; Frishman, D; Haase, D; Lemcke, K; Mewes, H W; Stocker, S; Zaccaria, P; Bevan, M; Wilson, R K; de la Bastide, M; Habermann, K; Parnell, L; Dedhia, N; Gnoj, L; Schutz, K; Huang, E; Spiegel, L; Sehkon, M; 
Murray, J; Sheet, P; Cordes, M; Abu-Threideh, J; Stoneking, T; Kalicki, J; Graves, T; Harmon, G; Edwards, J; Latreille, P; Courtney, L; Cloud, J; Abbott, A; Scott, K; Johnson, D; Minx, P; Bentley, D; Fulton, B; Miller, N; Greco, T; Kemp, K; Kramer, J; Fulton, L; Mardis, E; Dante, M; Pepin, K; Hillier, L; Nelson, J; Spieth, J; Ryan, E; Andrews, S; Geisel, C; Layman, D; Du, H; Ali, J; Berghoff, A; Jones, K; Drone, K; Cotton, M; Joshu, C; Antonoiu, B; Zidanic, M; Strong, C; Sun, H; Lamar, B; Yordan, C; Ma, P; Zhong, J; Preston, R; Vil, D; Shekher, M; Matero, A; Shah, R; Swaby, I K; O'Shaughnessy, A; Rodriguez, M; Hoffmann, J; Till, S; Granat, S; Shohdy, N; Hasegawa, A; Hameed, A; Lodhi, M; Johnson, A; Chen, E; Marra, M; Martienssen, R; McCombie, W R
1999-12-16
The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.
Pavia, Paula X; Thomas, M Carmen; López, Manuel C; Puerta, Concepción J
2012-10-01
Repetitive sequences constitute an important proportion of the Trypanosoma cruzi genome; hence, they have been used as molecular markers and as amplification targets to identify the parasite presence via PCR. In this study, a molecular characterization of the SIRE repetitive element was performed in the six discrete typing units (DTUs) of T. cruzi. The results showed that this element, located in multiple chromosomes, was interspersed in the genome of all DTUs of the parasite. The presence of several motifs implicated in element insertion, duplication, and functionality suggests that SIRE could be an active element in the parasite genome. Of interest, there were SIRE-specific Alu I fragments that made it possible to discriminate DTU I from the other DTUs. Moreover, a UPGMA phenetic tree constructed from fragment-sharing Southern blot data showed that T. cruzi I isolates form a cluster separate from the T. cruzi II-VI isolates. When the relative number of SIRE copies was determined, a variation from 105 to 2,000 copies per haploid genome was observed among the different isolates, with no relationship to DTU. In all, these findings suggest that the SIRE sequence is a good target for parasite DNA amplification.
The isolation of cDNAs from OATL1 at Xp11.2 using a 480-kb YAC
DOE Office of Scientific and Technical Information (OSTI.GOV)
Geraghty, M.T.; Brody, L.C.; Martin, L.S.
1993-05-01
Using an ornithine-δ-aminotransferase (OAT) cDNA, the authors identified five YACs that cover two nonadjacent OAT-related loci in Xp11.2-p11.3, designated OATL1 (distal) and OATL2 (proximal). Because several retinal degenerative disorders map to this region, they used YAC2 (480 kb), which covers the most distal part of OATL1, as a probe to screen a retinal cDNA library. From 8 × 10^4 plaques screened, they isolated 13 clones. Two were OAT cDNAs. The remaining 11 were divided into eight groups by cross-hybridization. Groups 1-4 contain cDNAs that originate from single-copy X-linked genes in YAC2. Each has an open reading frame of >500 bp and detects one or more transcripts on a Northern blot. The gene for each was sublocalized and ordered in YAC2. The cDNAs in groups 5-8 contained two or more Alu sequences, had no open reading frames, and did not detect transcripts. The cDNAs from groups 1-4 provide expressed sequence tags and identify candidate genes for the genetic disorders that map to this region. 28 refs., 5 figs., 1 tab.
Characterization of species-specific repeated DNA sequences from B. nigra.
Gupta, V; Lakshmisita, G; Shaila, M S; Jagannathan, V; Lakshmikumaran, M S
1992-07-01
The construction and characterization of two genome-specific recombinant DNA clones from B. nigra are described. Southern analysis showed that the two clones belong to a dispersed repeat family. They differ from each other in their length, distribution and sequence, though the average GC content is nearly the same (45%). These B genome-specific repeats have been used to analyse the phylogenetic relationships between cultivated and wild species of the family Brassicaceae.
Maca-Meyer, N; Villar, J; Pérez-Méndez, L; Cabrera de León, A; Flores, C
2004-11-01
Classical, mitochondrial DNA (mtDNA) and Y chromosome markers have been used to examine the genetic admixture in present day inhabitants of the Canary Islands. In this study, we report the analysis of ten autosomal Alu insertion polymorphisms in 364 samples from the seven main islands of the Archipelago, and their comparison to continental samples. The detection of population-specific alleles from the Iberian Peninsula and Northwest Africa, as well as their affinities on the basis of genetic distances and principal component analysis, support a clear link between these populations. Coincident with previous results, the Canarian gene pool can be distinguished as being halfway between those of its putative parents, although with a major Iberian contribution (62-78%). Both the substantial Northwest African contribution (23-38%), and the minor sub-Saharan African input (3%), suggest that the genetic legacy from the aborigines and slaves still persists in the Canary Islanders.
Paszun, Sylwia K; Stanisz, Beata J; Gradowska, Agnieszka
2013-01-01
The presented study aimed to evaluate the influence of hydrochlorothiazide on cilazapril stability in a model mixture and a fixed-dose tablet formulation. The degradation of cilazapril in the presence of hydrochlorothiazide followed an autocatalytic reaction kinetic mechanism, described mathematically by the Prout-Tompkins equation. The coexistence of hydrochlorothiazide with cilazapril in the model mixture and in fixed-dose tablets without blister packaging accelerated cilazapril degradation compared with degradation of the cilazapril substance alone. The reaction induction time shortened, while the observed reaction rate constant increased. Increasing relative humidity and temperature had a negative impact on cilazapril stability. The determined semi-logarithmic relationships ln k = f(RH) and Arrhenius ln k = f(1/T) are linear and are predictive of cilazapril stability. The blister (OPA/Alu/PVC//Alu) packaging of the fixed-dose tablets provides absolute moisture protection and prevents the cilazapril-hydrochlorothiazide interaction.
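For orientation, the kinetic relationships named in this abstract are commonly written in the textbook forms below. This is a sketch rather than the authors' own notation: the symbols \alpha (fraction of cilazapril degraded at time t), C (integration constant), a and b (fitted slope and intercept), A (pre-exponential factor), E_a (activation energy) and R (gas constant) are assumptions introduced here.

```latex
% Prout-Tompkins (autocatalytic) model in linearized form
\[ \ln\frac{\alpha}{1-\alpha} = k\,t + C \]

% Semi-logarithmic dependence of the rate constant on relative humidity
\[ \ln k = a\,\mathrm{RH} + b \]

% Arrhenius relationship: \ln k is linear in 1/T
\[ \ln k = \ln A - \frac{E_a}{R\,T} \]
```

In each case the observed rate constant k is obtained from a straight-line fit, which is what makes the relationships usable for stability prediction.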
The Microprocessor controls the activity of mammalian retrotransposons
Heras, Sara R.; Macias, Sara; Plass, Mireya; Fernandez, Noemí; Cano, David; Eyras, Eduardo; Garcia-Perez, José L.; Cáceres, Javier F.
2013-01-01
More than half of the human genome is made of Transposable Elements. Their ongoing mobilization is a driving force in genetic diversity; however, little is known about how the host regulates their activity. Here, we show that the Microprocessor (Drosha-DGCR8), which is required for microRNA biogenesis, also recognizes and binds RNAs derived from human LINE-1 (Long INterspersed Element 1), Alu and SVA retrotransposons. Expression analyses demonstrate that cells lacking a functional Microprocessor accumulate LINE-1 mRNA and encoded proteins. Furthermore, we show that structured regions of the LINE-1 mRNA can be cleaved in vitro by Drosha. Additionally, we used a cell culture-based assay to show that the Microprocessor negatively regulates LINE-1 and Alu retrotransposition in vivo. Altogether, these data reveal a new role for the Microprocessor as a post-transcriptional repressor of mammalian retrotransposons acting as a defender of human genome integrity. PMID:23995758
The Microprocessor controls the activity of mammalian retrotransposons.
Heras, Sara R; Macias, Sara; Plass, Mireya; Fernandez, Noemí; Cano, David; Eyras, Eduardo; Garcia-Perez, José L; Cáceres, Javier F
2013-10-01
More than half of the human genome is made of transposable elements whose ongoing mobilization is a driving force in genetic diversity; however, little is known about how the host regulates their activity. Here, we show that the Microprocessor (Drosha-DGCR8), which is required for microRNA biogenesis, also recognizes and binds RNAs derived from human long interspersed element 1 (LINE-1), Alu and SVA retrotransposons. Expression analyses demonstrate that cells lacking a functional Microprocessor accumulate LINE-1 mRNA and encoded proteins. Furthermore, we show that structured regions of the LINE-1 mRNA can be cleaved in vitro by Drosha. Additionally, we used a cell culture-based assay to show that the Microprocessor negatively regulates LINE-1 and Alu retrotransposition in vivo. Altogether, these data reveal a new role for the Microprocessor as a post-transcriptional repressor of mammalian retrotransposons and a defender of human genome integrity.
Regulation of nucleolus assembly by non-coding RNA polymerase II transcripts
Caudron-Herger, Maïwen; Pankert, Teresa; Rippe, Karsten
2016-01-01
ABSTRACT The nucleolus is a nuclear subcompartment for tightly regulated rRNA production and ribosome subunit biogenesis. It also acts as a cellular stress sensor and can release enriched factors in response to cellular stimuli. Accordingly, the content and structure of the nucleolus change dynamically, which is particularly evident during cell cycle progression: the nucleolus completely disassembles during mitosis and reassembles in interphase. Although the mechanisms that drive nucleolar (re)organization have been the subject of a number of studies, they are only partly understood. Recently, we identified Alu element-containing RNA polymerase II transcripts (aluRNAs) as important for nucleolar structure and rRNA synthesis. Integrating these findings with studies on the liquid droplet-like nature of the nucleolus leads us to propose a model on how RNA polymerase II transcripts could regulate the assembly of the nucleolus in response to external stimuli and during cell cycle progression. PMID:27416361
Regulation of nucleolus assembly by non-coding RNA polymerase II transcripts.
Caudron-Herger, Maïwen; Pankert, Teresa; Rippe, Karsten
2016-05-03
The nucleolus is a nuclear subcompartment for tightly regulated rRNA production and ribosome subunit biogenesis. It also acts as a cellular stress sensor and can release enriched factors in response to cellular stimuli. Accordingly, the content and structure of the nucleolus change dynamically, which is particularly evident during cell cycle progression: the nucleolus completely disassembles during mitosis and reassembles in interphase. Although the mechanisms that drive nucleolar (re)organization have been the subject of a number of studies, they are only partly understood. Recently, we identified Alu element-containing RNA polymerase II transcripts (aluRNAs) as important for nucleolar structure and rRNA synthesis. Integrating these findings with studies on the liquid droplet-like nature of the nucleolus leads us to propose a model on how RNA polymerase II transcripts could regulate the assembly of the nucleolus in response to external stimuli and during cell cycle progression.
Zhao, Hong-Quan; Kasai, Seiya; Shiratori, Yuta; Hashizume, Tamotsu
2009-06-17
A two-bit arithmetic logic unit (ALU) was successfully fabricated on a GaAs-based regular nanowire network with hexagonal topology. This fundamental building block of central processing units can be implemented on a regular nanowire network structure with simple circuit architecture based on graphical representation of logic functions using a binary decision diagram and topology control of the graph. The four-instruction ALU was designed by integrating subgraphs representing each instruction, and the circuitry was implemented by transferring the logical graph structure to a GaAs-based nanowire network formed by electron beam lithography and wet chemical etching. A path switching function was implemented in nodes by Schottky wrap gate control of nanowires. The fabricated circuit integrating 32 node devices exhibits the correct output waveforms at room temperature allowing for threshold voltage variation.
Solov'ev, V V; Kel', A E; Kolchanov, N A
1989-01-01
The factors determining the presence of inverted and symmetrical repeats in genes coding for globular proteins have been analysed. The analysis of symmetrical repeats revealed an interesting property of the genetic code: pairs of symmetrical codons correspond to pairs of amino acids with mostly similar physicochemical parameters. This property may explain why symmetrical repeats and palindromes occur only in genes coding for beta-structural proteins and polypeptides, in which amino acids with similar physicochemical properties occupy symmetrical positions. A stochastic model of the evolution of polynucleotide sequences was used to analyse inverted repeats. The modelling demonstrated that a constraint on sequences (uneven codon usage frequencies) is by itself sufficient for nonrandom inverted repeats to arise in genes.
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-01-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163
Reneker, Jeff; Shyu, Chi-Ren; Zeng, Peiyu; Polacco, Joseph C.; Gassmann, Walter
2004-01-01
We have developed a web server for the life sciences community to use to search for short repeats of DNA sequence of length between 3 and 10 000 bases within multiple species. This search employs a unique and fast hash function approach. Our system also applies information retrieval algorithms to discover knowledge of cross-species conservation of repeat sequences. Furthermore, we have incorporated a part of the Gene Ontology database into our information retrieval algorithms to broaden the coverage of the search. Our web server and tutorial can be found at http://acmes.rnet.missouri.edu. PMID:15215469
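The abstract does not describe the server's hash function, so the following is only a minimal sketch of the general idea: index every substring of length k in a hash table (here a Python dictionary) and report those seen more than once. The function name `find_repeats` and its parameters are hypothetical and are not part of the web server.

```python
from collections import defaultdict
from typing import Dict, List


def find_repeats(sequence: str, k: int, min_count: int = 2) -> Dict[str, List[int]]:
    """Return each length-k substring seen at least min_count times,
    mapped to its 0-based start positions."""
    if k <= 0:
        raise ValueError("k must be positive")
    positions: Dict[str, List[int]] = defaultdict(list)
    seq = sequence.upper()
    # One pass over the sequence; dictionary lookups are hash-based.
    for start in range(len(seq) - k + 1):
        positions[seq[start:start + k]].append(start)
    return {kmer: pos for kmer, pos in positions.items() if len(pos) >= min_count}
```

For example, `find_repeats("ACGTACGA", 3)` reports `ACG` at positions 0 and 4. A cross-species search could, under the same assumptions, intersect the keys of the dictionaries built for each genome.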
Gupta, Rashmi; Mirdha, Bijay Ranjan; Guleria, Randeep; Kumar, Lalit; Luthra, Kalpana; Agarwal, Sanjay Kumar; Sreenivas, Vishnubhatla
2013-01-01
Pneumocystis jirovecii is an opportunistic pathogen that causes severe pneumonia in immunocompromised patients. To study the genetic diversity of P. jirovecii in India the upstream conserved sequence (UCS) region of Pneumocystis genome was amplified, sequenced and genotyped from a set of respiratory specimens obtained from 50 patients with a positive result for nested mitochondrial large subunit ribosomal RNA (mtLSU rRNA) PCR during the years 2005-2008. Of these 50 cases, 45 showed a positive PCR for UCS region. Variations in the tandem repeats in UCS region were characterized by sequencing all the positive cases. Of the 45 cases, one case showed five repeats, 11 cases showed four repeats, 29 cases showed three repeats and four cases showed two repeats. By running amplified DNA from all these cases on a high-resolution gel, mixed infection was observed in 12 cases (26.7%, 12/45). Forty three of 45 cases included in this study had previously been typed at mtLSU rRNA and internal transcribed spacer (ITS) region by our group. In the present study, the genotypes at those two regions were combined with UCS repeat patterns to construct allelic profiles of 43 cases. A total of 36 allelic profiles were observed in 43 isolates indicating high genetic variability. A statistically significant association was observed between mtLSU rRNA genotype 1, ITS type Ea and UCS repeat pattern 4. Copyright © 2012 Elsevier B.V. All rights reserved.
Evolution and selection of Rhg1, a copy-number variant nematode-resistance locus
Lee, Tong Geon; Kumar, Indrajit; Diers, Brian W; Hudson, Matthew E
2015-01-01
The soybean cyst nematode (SCN) resistance locus Rhg1 is a tandem repeat of a 31.2 kb unit of the soybean genome. Each 31.2-kb unit contains four genes. One allele of Rhg1, Rhg1-b, is responsible for protecting most US soybean production from SCN. Whole-genome sequencing was performed, and PCR assays were developed to investigate allelic variation in sequence and copy number of the Rhg1 locus across a population of soybean germplasm accessions. Four distinct sequences of the 31.2-kb repeat unit were identified, and some Rhg1 alleles carry up to three different types of repeat unit. The total number of copies of the repeat varies from 1 to 10 per haploid genome. Both copy number and sequence of the repeat correlate with the resistance phenotype, and the Rhg1 locus shows strong signatures of selection. Significant linkage disequilibrium in the genome outside the boundaries of the repeat allowed the Rhg1 genotype to be inferred using high-density single nucleotide polymorphism genotyping of 15 996 accessions. Over 860 germplasm accessions were found likely to possess Rhg1 alleles. The regions surrounding the repeat show indications of non-neutral evolution and high genetic variability in populations from different geographic locations, but without evidence of fixation of the resistant genotype. A compelling explanation of these results is that balancing selection is in operation at Rhg1. PMID:25735447
Varshney, Dhaval; Vavrova-Anderson, Jana; Oler, Andrew J.; Cowling, Victoria H.; Cairns, Bradley R.; White, Robert J.
2015-01-01
Short interspersed nuclear elements (SINEs), such as Alu, spread by retrotransposition, which requires their transcripts to be copied into DNA and then inserted into new chromosomal sites. This can lead to genetic damage through insertional mutagenesis and chromosomal rearrangements between non-allelic SINEs at distinct loci. SINE DNA is heavily methylated and this was thought to suppress its accessibility and transcription, thereby protecting against retrotransposition. Here we provide several lines of evidence that methylated SINE DNA is occupied by RNA polymerase III, including the use of high-throughput bisulphite sequencing of ChIP DNA. We find that loss of DNA methylation has little effect on accessibility of SINEs to transcription machinery or their expression in vivo. In contrast, a histone methyltransferase inhibitor selectively promotes SINE expression and occupancy by RNA polymerase III. The data suggest that methylation of histones rather than DNA plays a dominant role in suppressing SINE transcription. PMID:25798578
Molecular bases of the ABO blood groups of Indians from the Brazilian Amazon region.
Franco, R F; Simões, B P; Guerreiro, J F; Santos, S E; Zago, M A
1994-01-01
Phenotype studies of ABO blood groups in most Amerindian populations revealed the exclusive presence of group O. Since group O is the result of the absence of glycosyltransferase activity, its molecular bases may be heterogeneous. We carried out ABO blood group genotyping by analysis of DNA of 30 Indians from 2 Amazonian tribes (Yanomami and Arara), and compared the findings with other populations (Caucasians and Blacks). Two segments of the glycosyltransferase gene were amplified by PCR and digested with KpnI or AluI to detect deletion or base change at positions 258 and 700, respectively. For all subjects, the gene basis of blood group O is the deletion of a single nucleotide at position 258 of the glycosyltransferase A gene, similar to that observed in Caucasoids and Negroids. DNA sequencing of limited regions of the gene supports this conclusion. This finding does not exclude, however, that a heterogeneity of the O allele may be revealed by a more extensive analysis.
Schurr, T G; Ballinger, S W; Gan, Y Y; Hodge, J A; Merriwether, D A; Lawrence, D N; Knowler, W C; Weiss, K M; Wallace, D C
1990-01-01
The mitochondrial DNA (mtDNA) sequence variation of the South American Ticuna, the Central American Maya, and the North American Pima was analyzed by restriction-endonuclease digestion and oligonucleotide hybridization. The analysis revealed that Amerindian populations have high frequencies of mtDNAs containing the rare Asian RFLP HincII morph 6, a rare HaeIII site gain, and a unique AluI site gain. In addition, the Asian-specific deletion between the cytochrome c oxidase subunit II (COII) and tRNA(Lys) genes was also prevalent in both the Pima and the Maya. These data suggest that Amerindian mtDNAs derived from at least four primary maternal lineages, that new tribal-specific variants accumulated as these mtDNAs became distributed throughout the Americas, and that some genetic variation may have been lost when the progenitors of the Ticuna separated from the North and Central American populations. PMID:1968708
SSR allelic variation in almond (Prunus dulcis Mill.).
Xie, Hua; Sui, Yi; Chang, Feng-Qi; Xu, Yong; Ma, Rong-Cai
2006-01-01
Sixteen SSR markers including eight EST-SSR and eight genomic SSRs were used for genetic diversity analysis of 23 Chinese and 15 international almond cultivars. EST- and genomic SSR markers previously reported in species of Prunus, mainly peach, proved to be useful for almond genetic analysis. DNA sequences of 117 alleles of six of the 16 SSR loci were analysed to reveal sequence variation among the 38 almond accessions. For the four SSR loci with AG/CT repeats, no insertions or deletions were observed in the flanking regions of the 98 alleles sequenced. Allelic size variation of these loci resulted exclusively from differences in the structures of repeat motifs, which involved interruptions or occurrences of new motif repeats in addition to varying number of AG/CT repeats. Some alleles had a high number of uninterrupted repeat motifs, indicating that SSR mutational patterns differ among alleles at a given SSR locus within the almond species. Allelic homoplasy was observed in the SSR loci because of base substitutions, interruptions or compound repeat motifs. Substitutions in the repeat regions were found at two SSR loci, suggesting that point mutations operate on SSRs and hinder the further SSR expansion by introducing repeat interruptions to stabilize SSR loci. Furthermore, it was shown that some potential point mutations in the flanking regions are linked with new SSR repeat motif variation in almond and peach.
Schnare, Murray N.; Collings, James C.; Spencer, David F.; Gray, Michael W.
2000-01-01
In Crithidia fasciculata, the ribosomal RNA (rRNA) gene repeats range in size from ∼11 to 12 kb. This length heterogeneity is localized to a region of the intergenic spacer (IGS) that contains tandemly repeated copies of a 19mer sequence. The IGS also contains four copies of an ∼55 nt repeat that has an internal inverted repeat and is also present in the IGS of Leishmania species. We have mapped the C. fasciculata transcription initiation site as well as two other reverse transcriptase stop sites that may be analogous to the A0 and A′ pre-rRNA processing sites within the 5′ external transcribed spacer (ETS) of other eukaryotes. Features that could influence processing at these sites include two stretches of conserved primary sequence and three secondary structure elements present in the 5′ ETS. We also characterized the C. fasciculata U3 snoRNA, which has the potential for base-pairing with pre-rRNA sequences. Finally, we demonstrate that biosynthesis of large subunit rRNA in both C. fasciculata and Trypanosoma brucei involves 3′-terminal addition of three A residues that are not present in the corresponding DNA sequences. PMID:10982863
Tasaki, E; Hirayama, J; Tazumi, A; Hayashi, K; Hara, Y; Ueno, H; Moore, J E; Millar, B C; Matsuda, M
2012-02-01
A novel clustered regularly interspaced short palindromic repeats (CRISPR) locus [7,500 base pairs (bp) in length] was identified in the urease-positive thermophilic Campylobacter (UPTC) Japanese isolate CF89-12. The 7,500 bp locus consisted of the 5'-methylaminomethyl-2-thiouridylate methyltransferase gene, a putative (p) CRISPR-associated gene (p-Cas), putative open reading frames, Cas1 and Cas2, a leader sequence region (146 bp), 12 CRISPR consensus sequence repeats (each 36 bp) separated by non-repetitive unique spacer regions of similar length (26-31 bp), and the phosphatidyl glycerophosphatase A gene. When the CRISPR loci in UPTC CF89-12 and five C. jejuni isolates were compared with one another, all six isolates contained p-Cas, Cas1 and Cas2 within the loci. Four to 12 CRISPR consensus sequence repeats separated by non-repetitive unique spacer regions occurred in the six isolates, and the nucleotide sequences of those repeats showed approximately 92-100% similarity to each other. However, no sequence similarity was found among the unique spacer regions of these isolates. The putative σ(70) transcriptional promoter and hypothetical ρ-independent terminator structures for the CRISPRs and Cas were detected. No in vivo transcription of p-Cas, Cas1 and Cas2 was confirmed in the UPTC cells.
Wu, Ying; Liu, Fang; Yang, Dai-Gang; Li, Wei; Zhou, Xiao-Jian; Pei, Xiao-Yu; Liu, Yan-Gai; He, Kun-Lun; Zhang, Wen-Sheng; Ren, Zhong-Ying; Zhou, Ke-Hai; Ma, Xiong-Feng; Li, Zhong-Hu
2018-01-01
Cotton is one of the most economically important fiber crop plants worldwide. The genus Gossypium contains a single allotetraploid group (AD) and eight diploid genome groups (A–G and K). However, the evolution of repeat sequences in the chloroplast genomes and the phylogenetic relationships of Gossypium species are unclear. Thus, we determined the variations in the repeat sequences and the evolutionary relationships of 40 cotton chloroplast genomes, which represented the most diverse in the genus, including five newly sequenced diploid species, i.e., G. nandewarense (C1-n), G. armourianum (D2-1), G. lobatum (D7), G. trilobum (D8), and G. schwendimanii (D11), and an important semi-wild race of upland cotton, G. hirsutum race latifolium (AD1). The genome structure, gene order, and GC content of cotton species were similar to those of other higher plant plastid genomes. In total, 2860 long sequence repeats (>10 bp in length) were identified, where the F-genome species had the largest number of repeats (G. longicalyx F1: 108) and E-genome species had the lowest (G. stocksii E1: 53). Large-scale repeat sequences possibly enrich the genetic information and maintain genome stability in cotton species. We also identified 10 divergence hotspot regions, i.e., rpl33-rps18, psbZ-trnG (GCC), rps4-trnT (UGU), trnL (UAG)-rpl32, trnE (UUC)-trnT (GGU), atpE, ndhI, rps2, ycf1, and ndhF, which could be useful molecular genetic markers for future population genetics and phylogenetic studies. Site-specific selection analysis showed that some of the coding sites of 10 chloroplast genes (atpB, atpE, rps2, rps3, petB, petD, ccsA, cemA, ycf1, and rbcL) were under protein sequence evolution. Phylogenetic analysis based on the whole plastomes suggested that the Gossypium species grouped into six previously identified genetic clades. Interestingly, all 13 D-genome species clustered into a strong monophyletic clade. 
Unexpectedly, the cotton species with C, G, and K-genomes were admixed and nested in a large clade, which could have been due to their recent radiation, incomplete lineage sorting, and introgression hybridization among different cotton lineages. In conclusion, the results of this study provide new insights into the evolution of repeat sequences in chloroplast genomes and interspecific relationships in the genus Gossypium. PMID:29619041
Liu, Qian; Xu, Xue-Nian; Zhou, Yan; Cheng, Na; Dong, Yu-Ting; Zheng, Hua-Jun; Zhu, Yong-Qiang; Zhu, Yong-Qiang
2013-08-01
To find and clone new antigen genes from the lambda-ZAP cDNA expression library of adult Clonorchis sinensis, and determine the immunological characteristics of the recombinant proteins. The cDNA expression library of adult C. sinensis was screened by pooled sera of clonorchiasis patients. The sequences of the positive phage clones were compared with the sequences in EST database, and the full-length sequence of the gene (Cs22 gene) was obtained by RT-PCR. cDNA fragments containing 2 and 3 times tandem repeat sequences were generated by jumping PCR. The sequence encoding the mature peptide or the tandem repeat sequence was respectively cloned into the prokaryotic expression vector pET28a (+), and then transformed into E. coli Rosetta DE3 cells for expression. The recombinant proteins (rCs22-2r, rCs22-3r, rCs22M-2r, and rCs22M-3r) were purified by His-bind-resin (Ni-NTA) affinity chromatography. The immunogenicity of rCs22-2r and rCs22-3r was identified by ELISA. To evaluate the immunological diagnostic value of rCs22-2r and rCs22-3r, serum samples from 35 clonorchiasis patients, 31 healthy individuals, 15 schistosomiasis patients, 15 paragonimiasis westermani patients and 13 cysticercosis patients were examined by ELISA. To locate antigenic determinants, the pooled sera of clonorchiasis patients and healthy persons were analyzed for specific antibodies by ELISA with recombinant protein rCs22M-2r and rCs22M-3r containing the tandem repeat sequences. The full-length sequence of Cs22 antigen gene of C. sinensis was obtained. It contained 13 times tandem repeat sequences of EQQDGDEEGMGGDGGRGKEKGKVEGEDGAGEQKEQA. Bioinformatics analysis indicated that the protein (Cs22) belonged to GPI-anchored proteins family. The recombinant proteins rCs22-2r and rCs22-3r showed a certain level of immunogenicity. 
The positive rates by ELISA using purified PrCs22-2r or PrCs22-3r as the coating antigen were both 45.7% (16/35) for sera of clonorchiasis patients and 3.2% (1/31) for sera of healthy persons. There was no cross-reaction with sera of schistosomiasis or cysticercosis patients. Cross-reaction with sera of paragonimiasis westermani patients occurred in 1 of 15 cases. The recombinant proteins rCs22M-2r and rCs22M-3r, which contained only tandem repeats, were specifically recognized by pooled sera of clonorchiasis patients. The Cs22 antigen gene of Clonorchis sinensis was obtained, and the recombinant proteins have some diagnostic value. The antigenic determinant is located in the tandem repeat sequences.
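The ELISA counts reported in this abstract can be turned into sensitivity and specificity figures with a few lines of arithmetic. The sketch below does so under two assumptions that the abstract does not state explicitly: that "no cross reaction" means 0 positives among the 15 schistosomiasis and 13 cysticercosis sera, and that pooling all non-clonorchiasis sera into one control group is meaningful. The variable and function names are illustrative only.

```python
from typing import Dict, Tuple

# (positive sera, total sera) per group, taken from the counts in the abstract
CASES: Tuple[int, int] = (16, 35)  # clonorchiasis patients
CONTROLS: Dict[str, Tuple[int, int]] = {
    "healthy": (1, 31),
    "schistosomiasis": (0, 15),            # assumed 0: "no cross reaction"
    "paragonimiasis westermani": (1, 15),
    "cysticercosis": (0, 13),              # assumed 0: "no cross reaction"
}


def percent(numerator: int, denominator: int) -> float:
    """Percentage rounded to one decimal place."""
    return round(100.0 * numerator / denominator, 1)


sensitivity = percent(*CASES)

healthy_pos, healthy_n = CONTROLS["healthy"]
specificity_healthy = percent(healthy_n - healthy_pos, healthy_n)

false_pos = sum(pos for pos, _ in CONTROLS.values())
n_controls = sum(n for _, n in CONTROLS.values())
specificity_all = percent(n_controls - false_pos, n_controls)
```

The sensitivity reproduces the 45.7% positive rate given in the abstract; the two specificity values are derived here and are not figures reported by the authors.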
Gómez-Pérez, Luis; Alfonso-Sánchez, Miguel A; Sánchez, Dora; García-Obregón, Susana; Espinosa, Ibone; Martínez-Jarreta, Begoña; De Pancorbo, Marian M; Peña, José A
2011-01-01
The Amazon basin is inhabited by some of the most isolated human groups worldwide. Among them, the Waorani tribe is one of the most interesting Native American populations from the anthropological perspective. This study reports a genetic characterization of the Waorani based on autosomal genetic loci. We analyzed 12 polymorphic Alu insertions in 36 Waorani individuals from different communal longhouses settled in the Yasuní National Park. The most notable finding was the strikingly reduced genetic diversity detected in the Waorani, corroborated by the existence of four monomorphic loci (ACE, APO, FXIIIB, and HS4.65), and of other four Alu markers that were very close to the fixation for the presence (PV92 and D1) or the absence (A25 and HS4.32) of the insertion. Furthermore, results of the centroid analysis supported the notion of the Waorani being one of the Amerindian groups less impacted by gene flow processes. The prolonged isolation of the Waorani community, in conjunction with a historically low effective population size and high inbreeding levels, have resulted in the drastic reduction of their genetic diversity, because of the effects of severe genetic drift. Recurrent population bottlenecks most likely determined by certain deep-rooted sociocultural practices of the Waorani (characterized by violence, internal quarrels, and revenge killings until recent times) are likely responsible for this pattern of diversity. The findings of this study illustrate how sociocultural factors can shape the gene pool of human populations. Copyright © 2011 Wiley-Liss, Inc.
NASA Astrophysics Data System (ADS)
Böhme, Steffi; Stärk, Hans-Joachim; Meißner, Tobias; Springer, Armin; Reemtsma, Thorsten; Kühnel, Dana; Busch, Wibke
2014-09-01
In order to quantify and compare the uptake of aluminum oxide nanoparticles of three different sizes into two human cell lines (skin keratinocytes (HaCaT) and lung epithelial cells (A549)), three analytical methods were applied: digestion followed by nebulization inductively coupled plasma mass spectrometry (neb-ICP-MS), direct laser ablation ICP-MS (LA-ICP-MS), and flow cytometry. Light and electron microscopy revealed an accumulation and agglomeration of all particle types within the cell cytoplasm, whereas no particles were detected in the cell nuclei. The internalized Al2O3 particles exerted no toxicity in the two cell lines after 24 h of exposure. The smallest particles with a primary particle size (x_BET) of 14 nm (Alu1) showed the lowest sedimentation velocity within the cell culture media, but were calculated to have settled completely after 20 h. Alu2 (x_BET = 111 nm) and Alu3 (x_BET = 750 nm) were calculated to reach the cell surface after 7 h and 3 min, respectively. The internal concentrations determined with the different methods lay in a comparable range of 2-8 µg Al2O3/cm2 cell layer, indicating the suitability of all methods to quantify the nanoparticle uptake. Nevertheless, particle size limitations of analytical methods using optical devices were demonstrated for LA-ICP-MS and flow cytometry. Furthermore, the consideration and comparison of particle properties as parameters for particle internalization revealed the particle size and the exposure concentration as determining factors for particle uptake.
Akers, Stacey N; Moysich, Kirsten; Zhang, Wa; Collamat Lai, Golda; Miller, Austin; Lele, Shashikant; Odunsi, Kunle; Karpf, Adam R
2014-02-01
We determined whether DNA methylation of repetitive elements (RE) is altered in epithelial ovarian cancer (EOC) patient tumors and white blood cells (WBC), compared to normal tissue controls. Two different quantitative measures of RE methylation (LINE1 and Alu bisulfite pyrosequencing) were used in normal and tumor tissues from EOC cases and controls. Tissues analyzed included: i) EOC, ii) normal ovarian surface epithelia (OSE), iii) normal fallopian tube surface epithelia (FTE), iv) WBC from EOC patients, obtained before and after treatment, and v) WBC from demographically-matched controls. REs were significantly hypomethylated in EOC compared to OSE and FTE, and LINE1 and Alu methylation showed a significant direct association in these tissues. In contrast, WBC RE methylation was significantly higher in EOC cases compared to controls. RE methylation in patient-matched EOC tumors and pre-treatment WBC did not correlate. EOC shows robust RE hypomethylation compared to normal tissues from which the disease arises. In contrast, RE are generally hypermethylated in EOC patient WBC compared to controls. EOC tumor and WBC methylation did not correlate in matched patients, suggesting that RE methylation is independently controlled in tumor and normal tissues. Despite the significant differences observed over the population, the range of RE methylation in patient and control WBC overlapped, limiting their specific utility as an EOC biomarker. However, our data demonstrate that DNA methylation is deranged in normal tissues from EOC patients, supporting further investigation of WBC DNA methylation biomarkers suitable for EOC risk assessment. Copyright © 2013 Elsevier Inc. All rights reserved.
The Norrie disease gene maps to a 150 kb region on chromosome Xp11.3.
Sims, K B; Lebo, R V; Benson, G; Shalish, C; Schuback, D; Chen, Z Y; Bruns, G; Craig, I W; Golbus, M S; Breakefield, X O
1992-05-01
Norrie disease is a human X-linked recessive disorder of unknown etiology characterized by congenital blindness, sensory neural deafness and mental retardation. This disease gene was previously linked to the DXS7 (L1.28) locus and the MAO genes in band Xp11.3. We report here fine physical mapping of the obligate region containing the Norrie disease gene (NDP) defined by a recombination and by the smallest submicroscopic chromosomal deletion associated with Norrie disease identified to date. Analysis, using in addition two overlapping YAC clones from this region, allowed orientation of the MAOA and MAOB genes in a 5'-3'-3'-5' configuration. A recombination event between a (GT)n polymorphism in intron 2 of the MAOB gene and the NDP locus, in a family previously reported to have a recombination between DXS7 and NDP, delineates a flanking marker telomeric to this disease gene. An anonymous DNA probe, dc12, present in one of the YACs and in a patient with a submicroscopic deletion which includes MAOA and MAOB but not L1.28, serves as a flanking marker centromeric to the disease gene. An Alu-PCR fragment from the right arm of the MAO YAC (YMAO.AluR) is not deleted in this patient and also delineates the centromeric extent of the obligate disease region. The apparent order of these loci is telomere ... DXS7-MAOA-MAOB-NDP-dc12-YMAO.AluR ... centromere. Together these data define the obligate region containing the NDP gene to a chromosomal segment less than 150 kb.
Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence.
Maheshwari, Shamoni; Ishii, Takayoshi; Brown, C Titus; Houben, Andreas; Comai, Luca
2017-03-01
During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana. We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays, although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. © 2017 Maheshwari et al.; Published by Cold Spring Harbor Laboratory Press.
Structural analysis of two length variants of the rDNA intergenic spacer from Eruca sativa.
Lakshmikumaran, M; Negi, M S
1994-03-01
Restriction enzyme analysis of the rRNA genes of Eruca sativa indicated the presence of many length variants within a single plant and also between different cultivars which is unusual for most crucifers studied so far. Two length variants of the rDNA intergenic spacer (IGS) from a single individual E. sativa (cv. Itsa) plant were cloned and characterized. The complete nucleotide sequences of both the variants (3 kb and 4 kb) were determined. The intergenic spacer contains three families of tandemly repeated DNA sequences denoted as A, B and C. However, the long (4 kb) variant shows the presence of an additional repeat, denoted as D, which is a duplication of a 224 bp sequence just upstream of the putative transcription initiation site. Repeat units belonging to the three different families (A, B and C) were in the size range of 22 to 30 bp. Such short repeat elements are present in the IGS of most of the crucifers analysed so far. Sequence analysis of the variants (3 kb and 4 kb) revealed that the length heterogeneity of the spacer is located at three different regions and is due to the varying copy numbers of repeat units belonging to families A and B. Length variation of the spacer is also due to the presence of a large duplication (D repeats) in the 4 kb variant which is absent in the 3 kb variant. The putative transcription initiation site was identified by comparisons with the rDNA sequences from other plant species.
White, J H; Johnson, A L; Lowndes, N F; Johnston, L H
1991-01-01
By fusing the CDC9 structural gene to the PGK upstream sequences and the CDC9 upstream to lacZ, we showed that the cell cycle expression of CDC9 is largely due to transcriptional regulation. To investigate the role of six ATGATT upstream repeats in CDC9 regulation, synthetic copies of the sequence were attached to a heterologous gene. The repeats stimulated transcription strongly and additively, but, unlike conventional yeast UAS elements, only when present in one orientation. Transcription driven by the repeats declines in cells held at START of the cell cycle or in stationary phase, as occurs with CDC9. However, the repeats by themselves cannot impart cell cycle regulation to a heterologous gene. CDC9 may therefore be controlled by an activating system operating through the repeats that is sensitive to cellular proliferation and a separate mechanism that governs the periodic expression in the cell cycle. PMID:1901644
In Vitro Expansion of CAG, CAA, and Mixed CAG/CAA Repeats.
Figura, Grzegorz; Koscianska, Edyta; Krzyzosiak, Wlodzimierz J
2015-08-11
Polyglutamine diseases, including Huntington's disease and a number of spinocerebellar ataxias, are caused by expanded CAG repeats that are located in translated sequences of individual, functionally-unrelated genes. Only mutant proteins containing polyglutamine expansions have long been thought to be pathogenic, but recent evidence has implicated mutant transcripts containing long CAG repeats in pathogenic processes. The presence of two pathogenic factors prompted us to attempt to distinguish the effects triggered by mutant protein from those caused by mutant RNA in cellular models of polyglutamine diseases. We used the SLIP (Synthesis of Long Iterative Polynucleotide) method to generate plasmids expressing long CAG repeats (forming a hairpin structure), CAA-interrupted CAG repeats (forming multiple unstable hairpins) or pure CAA repeats (not forming any secondary structure). We successfully modified the original SLIP protocol to generate repeats of desired length starting from constructs containing short repeat tracts. We demonstrated that the SLIP method is a time- and cost-effective approach to manipulate the lengths of expanded repeat sequences.
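The reason the three constructs can separate protein effects from RNA effects is that CAG and CAA both encode glutamine in the standard genetic code, so the tracts differ at the RNA level while yielding the same polyglutamine stretch. The sketch below makes that concrete; the function name `translate_repeat_tract` and the six-codon example tracts are hypothetical and far shorter than the repeats generated by SLIP.

```python
# Both codons specify glutamine (Q) in the standard genetic code.
GLN_CODONS = {"CAG", "CAA"}


def translate_repeat_tract(dna: str) -> str:
    """Translate a tract made only of CAG/CAA codons into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("tract length must be a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    unexpected = [c for c in codons if c not in GLN_CODONS]
    if unexpected:
        raise ValueError(f"non-glutamine codons found: {unexpected}")
    return "Q" * len(codons)


# Illustrative tracts only (assumed lengths and interruption pattern)
pure_cag = "CAG" * 6                  # hairpin-forming repeat type
interrupted = "CAGCAACAGCAACAGCAA"    # CAA-interrupted CAG repeat type
pure_caa = "CAA" * 6                  # repeat type with no secondary structure
```

All three example tracts translate to the same six-residue glutamine stretch, which is why any difference in their cellular effects can be attributed to the transcript rather than the protein.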
Guizard, Sébastien; Piégu, Benoît; Arensburger, Peter; Guillou, Florian; Bigot, Yves
2016-08-19
The program RepeatMasker and the database Repbase-ISB are part of the most widely used strategy for annotating repeats in animal genomes. They have been used to show that avian genomes have a lower repeat content (8-12 %) than the sequenced genomes of many vertebrate species (30-55 %). However, the efficiency of such library-based strategies depends on the quality and completeness of the sequences in the database that is used. An alternative to these library-based methods is to identify repeats de novo. These alternative methods have existed for at least a decade and may be more powerful than the library-based methods. We have used an annotation strategy involving several complementary de novo tools to determine the repeat content of the model genome galGal4 (1.04 Gbp), including identifying simple sequence repeats (SSRs), tandem repeats and transposable elements (TEs). We annotated over one Gbp of the galGal4 genome and showed that it is composed of approximately 19 % SSRs and TEs. Furthermore, we estimate that the actual genome of the red jungle fowl contains about 31-35 % repeats. We find that library-based methods tend to overestimate TE diversity. These results have a major impact on the current understanding of repeat distributions throughout chromosomes in the red jungle fowl. Our results are a proof of concept of the reliability of using de novo tools to annotate repeats in large animal genomes. They have also revealed issues that will need to be resolved in order to develop gold-standard methodologies for annotating repeats in eukaryote genomes.
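The abstract does not name the de novo tools that were combined, so the following is only a toy sketch of what library-free detection means for the simplest repeat class, SSRs: a regular expression with a backreference finds perfect tandem copies of a short unit without consulting any repeat database. The function name `find_ssrs` and its thresholds are assumptions, and real tools also handle imperfect repeats, which this sketch does not.

```python
import re
from typing import List, Tuple


def find_ssrs(sequence: str, max_unit: int = 6, min_repeats: int = 4) -> List[Tuple[int, str, int]]:
    """Return (start, unit, copies) for perfect tandem repeats with units of 1..max_unit bp."""
    # Lazy unit match so the shortest repeating unit is preferred (AC, not ACAC);
    # the backreference then requires at least min_repeats - 1 further copies.
    pattern = re.compile(r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_repeats - 1))
    hits = []
    for match in pattern.finditer(sequence.upper()):
        unit = match.group(1)
        hits.append((match.start(), unit, len(match.group(0)) // len(unit)))
    return hits
```

For instance, `find_ssrs("TTGACACACACACGG")` reports a single (AC)5 tract starting at position 3.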
Rational design of alpha-helical tandem repeat proteins with closed architectures
Doyle, Lindsey; Hallinan, Jazmine; Bolduc, Jill; Parmeggiani, Fabio; Baker, David; Stoddard, Barry L.; Bradley, Philip
2015-01-01
Tandem repeat proteins, which are formed by repetition of modular units of protein sequence and structure, play important biological roles as macromolecular binding and scaffolding domains, enzymes, and building blocks for the assembly of fibrous materials1,2. The modular nature of repeat proteins enables the rapid construction and diversification of extended binding surfaces by duplication and recombination of simple building blocks3,4. The overall architecture of tandem repeat protein structures – which is dictated by the internal geometry and local packing of the repeat building blocks – is highly diverse, ranging from extended, super-helical folds that bind peptide, DNA, and RNA partners5–9, to closed and compact conformations with internal cavities suitable for small molecule binding and catalysis10. Here we report the development and validation of computational methods for de novo design of tandem repeat protein architectures driven purely by geometric criteria defining the inter-repeat geometry, without reference to the sequences and structures of existing repeat protein families. We have applied these methods to design a series of closed alpha-solenoid11 repeat structures (alpha-toroids) in which the inter-repeat packing geometry is constrained so as to juxtapose the N- and C-termini; several of these designed structures have been validated by X-ray crystallography. Unlike previous approaches to tandem repeat protein engineering12–20, our design procedure does not rely on template sequence or structural information taken from natural repeat proteins and hence can produce structures unlike those seen in nature. As an example, we have successfully designed and validated closed alpha-solenoid repeats with a left-handed helical architecture that – to our knowledge – is not yet present in the protein structure database21. PMID:26675735
Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai
2017-01-01
The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.
Wang, Q Z; Huang, M; Downie, S R; Chen, Z X
2016-05-23
Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
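The percentages quoted in the abstract above can be re-derived from the raw counts. The short sketch below does that and also shows one common way gene flow is estimated from GST, Nm = 0.5(1 − GST)/GST. The abstract does not say which estimator the authors used, so that formula and the helper names `percent` and `gene_flow_from_gst` are assumptions made for illustration only.

```python
def percent(part, whole):
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


def gene_flow_from_gst(gst):
    """Estimate gene flow (Nm) from GST.

    Assumption: uses Nm = 0.5 * (1 - GST) / GST, a widely used estimator;
    the abstract does not state which formula the authors applied.
    """
    return 0.5 * (1.0 - gst) / gst


if __name__ == "__main__":
    # Counts taken from the abstract.
    print(f"SSR-containing ESTs: {percent(133, 5548):.1f}%")
    print(f"Hexanucleotide motifs: {percent(56, 133):.1f}%")
    print(f"Trinucleotide motifs: {percent(50, 133):.1f}%")
    print(f"Polymorphic primer pairs: {percent(7, 24):.1f}%")
    # GST as reported (rounded); the result is close to the reported Nm.
    print(f"Nm from GST = 0.2500: {gene_flow_from_gst(0.2500):.4f}")
```

Under this assumed estimator, the rounded GST of 0.2500 gives a value close to the reported Nm of 1.4996; the small gap would be expected if the published GST was rounded.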
Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes
Huang, Yongjie; Mrázek, Jan
2014-01-01
Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877
Simple sequence repeat markers that identify Claviceps species and strains
USDA-ARS's Scientific Manuscript database
Claviceps purpurea is a pathogen that infects most members of the Pooideae subfamily and causes ergot, a floral disease in which the ovary is replaced with a sclerotium. This study was initiated to develop Simple Sequence Repeat (SSRs) markers for rapid identification of C. purpurea. SSRs were desi...
Biological sequence compression algorithms.
Matsumoto, T; Sadakane, K; Imai, H
2000-01-01
Today, more and more DNA sequences are becoming available. Information about DNA sequences is stored in molecular biology databases. The size and importance of these databases will continue to grow, so this information must be stored or communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. Standard compression algorithms such as gzip or compress cannot compress DNA sequences; they only expand them in size. On the other hand, CTW (the Context Tree Weighting method) can compress DNA sequences to less than two bits per symbol. These algorithms do not use special structures of biological sequences. Two characteristic structures of DNA sequences are known: one is palindromes, or reverse complements, and the other is approximate repeats. Several algorithms specific to DNA sequences that use these structures can compress them to less than two bits per symbol. In this paper, we improve CTW so that it can exploit these characteristic structures of DNA sequences. Before encoding the next symbol, the algorithm searches for an approximate repeat and a palindrome using hashing and dynamic programming. If there is a palindrome or an approximate repeat of sufficient length, our algorithm represents it by its length and distance. With this preprocessing, the new program achieves a slightly higher compression ratio than existing DNA-oriented compression algorithms. We also describe a new compression algorithm for protein sequences.
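The repeat and palindrome search described in the abstract above can be illustrated with a small sketch. This is not the authors' implementation: it handles exact matches only (the dynamic-programming step for approximate repeats is omitted), and the names `build_index` and `longest_match`, the seed length `k`, and the `min_len` threshold are all hypothetical choices. It hashes every k-mer, then at a given position looks for an earlier direct repeat or reverse-complement match and reports it as (kind, length, distance).

```python
from collections import defaultdict

_COMP = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(s):
    """Return the reverse complement of a DNA string over A, C, G, T."""
    return "".join(_COMP[c] for c in reversed(s))


def build_index(seq, k):
    """Hash table mapping each k-mer to its (ascending) start positions."""
    index = defaultdict(list)
    for s in range(len(seq) - k + 1):
        index[seq[s:s + k]].append(s)
    return index


def longest_match(seq, pos, index, k=4, min_len=6):
    """Longest exact repeat or reverse-complement match for seq[pos:].

    The matched source must start before pos. Returns a tuple
    (kind, length, distance) or None if nothing reaches min_len.
    """
    n = len(seq)
    seed = seq[pos:pos + k]
    if len(seed) < k:
        return None
    best = None
    # Direct repeats: seq[s + t] == seq[pos + t] for t = 0 .. length-1.
    for s in index.get(seed, ()):
        if s >= pos:
            break
        length = k
        while pos + length < n and seq[s + length] == seq[pos + length]:
            length += 1
        if length >= min_len and (best is None or length > best[1]):
            best = ("repeat", length, pos - s)
    # Reverse complements: complement(seq[pos + t]) == seq[j - t],
    # where j < pos is the position the source is read backwards from.
    for s in index.get(reverse_complement(seed), ()):
        j = s + k - 1
        if j >= pos:
            break
        length = k
        while (j - length >= 0 and pos + length < n
               and _COMP[seq[pos + length]] == seq[j - length]):
            length += 1
        if length >= min_len and (best is None or length > best[1]):
            best = ("palindrome", length, pos - j)
    return best


if __name__ == "__main__":
    seq = "AAACCCGT" + "ACGGGTTT"  # second half is the reverse complement
    print(longest_match(seq, 8, build_index(seq, 4)))
```

An encoder built on this idea would emit the (length, distance) pair instead of the individual symbols whenever a long enough match is found, and fall back to symbol-by-symbol coding otherwise.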
Energy-Efficient Wide Datapath Integer Arithmetic Logic Units Using Superconductor Logic
NASA Astrophysics Data System (ADS)
Ayala, Christopher Lawrence
Complementary Metal-Oxide-Semiconductor (CMOS) technology is currently the most widely used integrated circuit technology today. As CMOS approaches the physical limitations of scaling, it is unclear whether or not it can provide long-term support for niche areas such as high-performance computing and telecommunication infrastructure, particularly with the emergence of cloud computing. Alternatively, superconductor technologies based on Josephson junction (JJ) switching elements such as Rapid Single Flux Quantum (RSFQ) logic and especially its new variant, Energy-Efficient Rapid Single Flux Quantum (ERSFQ) logic have the capability to provide an ultra-high-speed, low power platform for digital systems. The objective of this research is to design and evaluate energy-efficient, high-speed 32-bit integer Arithmetic Logic Units (ALUs) implemented using RSFQ and ERSFQ logic as the first steps towards achieving practical Very-Large-Scale-Integration (VLSI) complexity in digital superconductor electronics. First, a tunable VHDL superconductor cell library is created to provide a mechanism to conduct design exploration and evaluation of superconductor digital circuits from the perspectives of functionality, complexity, performance, and energy-efficiency. Second, hybrid wave-pipelining techniques developed earlier for wide datapath RSFQ designs have been used for efficient arithmetic and logic circuit implementations. To develop the core foundation of the ALU, the ripple-carry adder and the Kogge-Stone parallel prefix carry look-ahead adder are studied as representative candidates on opposite ends of the design spectrum. By combining the high-performance features of the Kogge-Stone structure and the low complexity of the ripple-carry adder, a 32-bit asynchronous wave-pipelined hybrid sparse-tree ALU has been designed and evaluated using the VHDL cell library tuned to HYPRES' gate-level characteristics. 
The designs and techniques from this research have been implemented using RSFQ logic and prototype chips have been fabricated. As a joint work with HYPRES, a 20 GHz 8-bit Kogge-Stone ALU consisting of 7,950 JJs total has been fabricated using a 1.5 μm 4.5 kA/cm² process and fully demonstrated. An 8-bit sparse-tree ALU (8,832 JJs total) and a 16-bit sparse-tree adder (12,785 JJs total) have also been fabricated using a 1.0 μm 10 kA/cm² process and demonstrated under collaboration with Yokohama National University and Nagoya University (Japan).
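To make the two adder structures named in the abstract concrete, the following is a purely software-level sketch of a ripple-carry adder and a Kogge-Stone parallel-prefix adder operating on Python integers. It models only the carry logic, not the wave-pipelined superconductor circuits or the hybrid sparse-tree design of the thesis; the function names and the default 32-bit width are illustrative assumptions.

```python
def ripple_carry_add(a, b, width=32):
    """Add bit by bit, passing the carry serially from bit i to bit i+1."""
    carry = 0
    result = 0
    for i in range(width):
        x = (a >> i) & 1
        y = (b >> i) & 1
        result |= (x ^ y ^ carry) << i
        carry = (x & y) | (carry & (x ^ y))
    return result, carry


def kogge_stone_add(a, b, width=32):
    """Add using Kogge-Stone parallel-prefix carry computation.

    Generate (g) and propagate (p) signals are combined over spans of
    1, 2, 4, ... bits, so all carries are known after log2(width) stages.
    """
    mask = (1 << width) - 1
    a &= mask
    b &= mask
    g = a & b          # bit i generates a carry on its own
    p = a ^ b          # bit i propagates an incoming carry
    half_sum = p
    dist = 1
    while dist < width:
        # Combine each group with the group ending dist bits below it.
        g = g | (p & (g << dist))
        p = p & (p << dist)
        dist <<= 1
    # After the prefix stages, bit i of g is the carry out of bit i.
    carries = (g << 1) & mask
    total = half_sum ^ carries
    carry_out = (g >> (width - 1)) & 1
    return total, carry_out


if __name__ == "__main__":
    for x, y in [(0xFFFFFFFF, 1), (123456789, 987654321), (0x80000000, 0x80000000)]:
        assert kogge_stone_add(x, y) == ripple_carry_add(x, y)
        print(hex(x), "+", hex(y), "=", kogge_stone_add(x, y))
```

The ripple-carry loop needs one step per bit, while the prefix loop needs only five stages for 32 bits at the cost of more logic per stage, which is the complexity versus performance trade-off the abstract refers to.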
Kaale, Eliangiringa; Manyanga, Vicky; Chambuso, Mhina; Liana, Jafary; Rutta, Edmund; Embrey, Martha; Layloff, Thomas; Johnson, Keith
2016-01-01
The purpose of this study was to investigate the quality of a select group of medicines sold in accredited drug dispensing outlets (ADDOs) and pharmacies in different regions of Tanzania as part of an in-depth cross-sectional assessment of community access to medicines and community use of medicines. We collected 242 samples of amoxicillin trihydrate, artemether-lumefantrine (ALu), co-trimoxazole, ergometrine maleate, paracetamol, and quinine from selected ADDOs and pharmacies in Mbeya, Morogoro, Singida, and Tanga regions. The analysis included physical examination and testing with validated analytical techniques. Assays for eight of nine products were conducted using high-performance thin-layer chromatography (HPTLC). For ALu tablets, we used a two-tiered approach, where tier 1 was a semi-quantitative Global Pharma Health Fund-Minilab® method and tier 2 was high-performance liquid chromatography (HPLC) as described in The International Pharmacopoeia's monograph for artemether-lumefantrine. The physical examination of samples revealed no defects in the solid and oral liquid dosage forms, but unusual discoloration in an injectable solution, ergometrine maleate. For ALu, the results showed that of 38 samples, 31 (81.6%) passed tier 1 testing and 7 (18.4%) gave inconclusive drug content results. The inconclusive ALu samples were submitted for tier 2 testing and all met the quality standards. The pass rate using the HPTLC and TLC/HPLC assays was 93.8%; the failures were the ergometrine maleate samples purchased from both ADDOs and pharmacies. The disintegration testing of the solid dosage forms was conducted in accordance with US Pharmacopeia monographs. Only two samples of paracetamol, 1.2% of the solid dosage forms, failed to comply to standards. The study revealed a high overall rate of 92.6% of samples that met the quality standards. 
Although the overall failure rate was 7.4%, it is important to note that this was largely limited to one product and likely due to poor distribution and storage rather than poor manufacturing practices. Over 90% of the medicines sold in ADDOs and pharmacies met quality standards. Policy makers need to reconsider ergometrine maleate's place on the list of medicines that ADDOs are allowed to dispense, by either substituting a more temperature-stable therapeutically equivalent product or requiring those sites to have refrigerators, which is not a feasible option for rural Tanzania.
Manyanga, Vicky; Chambuso, Mhina; Liana, Jafary; Rutta, Edmund; Embrey, Martha; Layloff, Thomas; Johnson, Keith
2016-01-01
Introduction: The purpose of this study was to investigate the quality of a select group of medicines sold in accredited drug dispensing outlets (ADDOs) and pharmacies in different regions of Tanzania as part of an in-depth cross-sectional assessment of community access to medicines and community use of medicines. Methods: We collected 242 samples of amoxicillin trihydrate, artemether-lumefantrine (ALu), co-trimoxazole, ergometrine maleate, paracetamol, and quinine from selected ADDOs and pharmacies in Mbeya, Morogoro, Singida, and Tanga regions. The analysis included physical examination and testing with validated analytical techniques. Assays for eight of nine products were conducted using high-performance thin-layer chromatography (HPTLC). For ALu tablets, we used a two-tiered approach, where tier 1 was a semi-quantitative Global Pharma Health Fund-Minilab® method and tier 2 was high-performance liquid chromatography (HPLC) as described in The International Pharmacopoeia’s monograph for artemether-lumefantrine. Results and Discussion: The physical examination of samples revealed no defects in the solid and oral liquid dosage forms, but unusual discoloration in an injectable solution, ergometrine maleate. For ALu, the results showed that of 38 samples, 31 (81.6%) passed tier 1 testing and 7 (18.4%) gave inconclusive drug content results. The inconclusive ALu samples were submitted for tier 2 testing and all met the quality standards. The pass rate using the HPTLC and TLC/HPLC assays was 93.8%; the failures were the ergometrine maleate samples purchased from both ADDOs and pharmacies. The disintegration testing of the solid dosage forms was conducted in accordance with US Pharmacopeia monographs. Only two samples of paracetamol, 1.2% of the solid dosage forms, failed to comply with standards. The study revealed a high overall rate of 92.6% of samples that met the quality standards.
Although the overall failure rate was 7.4%, it is important to note that this was largely limited to one product and likely due to poor distribution and storage rather than poor manufacturing practices. Conclusions: Over 90% of the medicines sold in ADDOs and pharmacies met quality standards. Policy makers need to reconsider ergometrine maleate’s place on the list of medicines that ADDOs are allowed to dispense, by either substituting a more temperature-stable therapeutically equivalent product or requiring those sites to have refrigerators, which is not a feasible option for rural Tanzania. PMID:27846216
Short-Sequence DNA Repeats in Prokaryotic Genomes
van Belkum, Alex; Scherer, Stewart; van Alphen, Loek; Verbrugh, Henri
1998-01-01
Short-sequence DNA repeat (SSR) loci can be identified in all eukaryotic and many prokaryotic genomes. These loci harbor short or long stretches of repeated nucleotide sequence motifs. DNA sequence motifs in a single locus can be identical and/or heterogeneous. SSRs are encountered in many different branches of the prokaryote kingdom. They are found in genes encoding products as diverse as microbial surface components recognizing adhesive matrix molecules and specific bacterial virulence factors such as lipopolysaccharide-modifying enzymes or adhesins. SSRs enable genetic and consequently phenotypic flexibility. SSRs function at various levels of gene expression regulation. Variations in the number of repeat units per locus or changes in the nature of the individual repeat sequences may result from recombination processes or polymerase inadequacy such as slipped-strand mispairing (SSM), either alone or in combination with DNA repair deficiencies. These rather complex phenomena can occur with relative ease, with SSM approaching a frequency of 10⁻⁴ per bacterial cell division and allowing high-frequency genetic switching. Bacteria use this random strategy to adapt their genetic repertoire in response to selective environmental pressure. SSR-mediated variation has important implications for bacterial pathogenesis and evolutionary fitness. Molecular analysis of changes in SSRs allows epidemiological studies on the spread of pathogenic bacteria. The occurrence, evolution and function of SSRs, and the molecular methods used to analyze them are discussed in the context of responsiveness to environmental factors, bacterial pathogenicity, epidemiology, and the availability of full-genome sequences for increasing numbers of microorganisms, especially those that are medically relevant. PMID:9618442
Cho, Charles J; Jung, Jaeeun; Jiang, Lushang; Lee, Eun Ji; Kim, Dae-Soo; Kim, Byung Sik; Kim, Hee Sung; Jung, Hwoon-Yong; Song, Ho-June; Hwang, Sung Wook; Park, Yangsoon; Jung, Min Kyo; Pack, Chan Gi; Myung, Seung-Jae; Chang, Suhwan
2018-04-25
Adenosine deaminase acting on RNA 1 (ADAR1) is known to mediate deamination of adenosine-to-inosine through binding to double-stranded RNA, the phenomenon known as RNA editing. Currently, the function of ADAR1 in gastric cancer is unclear. This study was aimed at investigating RNA editing-dependent and editing-independent functions of ADAR1 in gastric cancer, especially focusing on its influence on editing of 3' untranslated regions (UTRs) and subsequent changes in expression of messenger RNAs (mRNAs) as well as microRNAs (miRNAs). RNA-sequencing and small RNA-sequencing were performed on AGS and MKN-45 cells with a stable ADAR1 knockdown. Changed frequencies of editing and mRNA and miRNA expression were then identified by bioinformatic analyses. Targets of RNA editing were further validated in patients' samples. In the Alu region of both gastric cell lines, editing was most commonly of the A-to-I type in 3'-UTR or intron. mRNA and protein levels of PHACTR4 increased in ADAR1 knockdown cells, because of the loss of seed sequences in 3'-UTR of PHACTR4 mRNA that are required for miRNA-196a-3p binding. Immunohistochemical analyses of tumor and paired normal samples from 16 gastric cancer patients showed that ADAR1 expression was higher in tumors than in normal tissues and inversely correlated with PHACTR4 staining. On the other hand, decreased miRNA-148a-3p expression in ADAR1 knockdown cells led to increased mRNA and protein expression of NFYA, demonstrating ADAR1's editing-independent function. ADAR1 regulates post-transcriptional gene expression in gastric cancer through both RNA editing-dependent and editing-independent mechanisms.
GATA simple sequence repeats function as enhancer blocker boundaries.
Kumar, Ram P; Krishnan, Jaya; Pratap Singh, Narendra; Singh, Lalji; Mishra, Rakesh K
2013-01-01
Simple sequence repeats (SSRs) account for ~3% of the human genome, but their functional significance remains unclear. One of the prominent SSRs, the GATA tetranucleotide repeat, has preferentially accumulated in complex organisms. GATA repeats are particularly enriched on the human Y chromosome, and their non-random distribution and exclusive association with genes expressed during early development indicate their role in coordinated gene regulation. Here we show that GATA repeats have enhancer blocker activity in Drosophila and human cells. This enhancer blocker activity is seen in the transgenic as well as the native context of the enhancers at various developmental stages. These findings ascribe functional significance to SSRs and offer an explanation as to why SSRs, especially GATA, may have accumulated in complex organisms.
CRISPRDetect: A flexible algorithm to define CRISPR arrays.
Biswas, Ambarish; Staals, Raymond H J; Morales, Sergio E; Fineran, Peter C; Brown, Chris M
2016-05-17
CRISPR (clustered regularly interspaced short palindromic repeats) RNAs provide the specificity for noncoding RNA-guided adaptive immune defence systems in prokaryotes. CRISPR arrays consist of repeat sequences separated by specific spacer sequences. CRISPR arrays have previously been identified in a large proportion of prokaryotic genomes. However, currently available detection algorithms do not utilise recently discovered features regarding CRISPR loci. We have developed a new approach to automatically detect, predict and interactively refine CRISPR arrays. It is available as a web program and command line from bioanalysis.otago.ac.nz/CRISPRDetect. CRISPRDetect discovers putative arrays, extends the array by detecting additional variant repeats, corrects the direction of arrays, refines the repeat/spacer boundaries, and annotates different types of sequence variations (e.g. insertion/deletion) in near-identical repeats. Due to these features, CRISPRDetect has significant advantages when compared to existing identification tools. As well as further support for small, medium and large repeats, CRISPRDetect identified a class of arrays with 'extra-large' repeats in bacteria (repeats 44-50 nt). The CRISPRDetect output is integrated with other analysis tools. Notably, the predicted spacers can be directly utilised by CRISPRTarget to predict targets. CRISPRDetect enables more accurate detection of arrays and spacers and its gff output is suitable for inclusion in genome annotation pipelines and visualisation. It has been used to analyse all complete bacterial and archaeal reference genomes.
Han, Yonghua; Wang, Guixiang; Liu, Zhao; Liu, Jinhua; Yue, Wei; Song, Rentao; Zhang, Xueyong; Jin, Weiwei
2010-02-01
Knowledge about the composition and structure of centromeres is critical for understanding how centromeres perform their functional roles. Here, we report the sequences of one centromere-associated bacterial artificial chromosome clone from a Coix lacryma-jobi library. Two Ty3/gypsy-class retrotransposons, centromeric retrotransposon of C. lacryma-jobi (CRC) and peri-centromeric retrotransposon of C. lacryma-jobi, and a (peri)centromere-specific tandem repeat with a unit length of 153 bp were identified. The CRC is highly homologous to centromere-specific retrotransposons reported in grass species. An 80-bp DNA region in the 153-bp satellite repeat was found to be conserved with centromeric satellite repeats from maize, rice, and pearl millet. Fluorescence in situ hybridization showed that the three repetitive sequences were located in (peri-)centromeric regions of both C. lacryma-jobi and Coix aquatica. However, the 153-bp satellite repeat was only detected on 20 out of the 30 chromosomes in C. aquatica. Immunostaining with an antibody against rice CENH3 indicates that the 153-bp satellite repeat and CRC might both be major components of functional centromeres, but not all the 153-bp satellite repeats or CRC sequences are associated with CENH3. The evolution of centromeric repeats of C. lacryma-jobi during polyploidization is discussed.
CRISPR Detection From Short Reads Using Partial Overlap Graphs.
Ben-Bassat, Ilan; Chor, Benny
2016-06-01
Clustered regularly interspaced short palindromic repeats (CRISPR) are structured regions in bacterial and archaeal genomes, which are part of an adaptive immune system against phages. CRISPRs are important for many microbial studies and are playing an essential role in current gene editing techniques. As such, they attract substantial research interest. The exponential growth in the amount of bacterial sequence data in recent years enables the exploration of CRISPR loci in more and more species. Most of the automated tools that detect CRISPR loci rely on fully assembled genomes. However, many assemblers do not handle repetitive regions successfully. The first tool to work directly on raw sequence data is Crass, which requires reads that are long enough to contain two copies of the same repeat. We present a method to identify CRISPR repeats from raw sequence data of short reads. The algorithm is based on an observation differentiating CRISPR repeats from other types of repeats, and it involves a series of partial constructions of the overlap graph. This enables us to avoid many of the difficulties that assemblers face, as we merely aim to identify the repeats that belong to CRISPR loci. A preliminary implementation of the algorithm shows good results and detects CRISPR repeats in cases where other existing tools fail to do so.
Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh
2014-01-01
Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1–6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of the genome. A total of 534 complete chloroplast genome sequences (as of 18 September 2014) of Viridiplantae are available at the NCBI organelle genome resource. This provides an opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb, which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information on all three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with the majority lying in non-coding regions. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) were the least abundant. Additionally, for each species, statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ PMID:25380781
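As an illustration of what mining perfect SSRs with motifs of 1 to 6 nucleotides can look like, here is a minimal regular-expression sketch. It is not the pipeline used to build ChloroSSRdb: the minimum repeat counts in `MIN_REPEATS` and the function name `find_perfect_ssrs` are assumptions chosen for the example, and imperfect and compound SSRs are not handled.

```python
import re

# Assumed minimum number of tandem copies per motif length (hypothetical
# thresholds; the abstract does not state the ones used for the database).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 4, 5: 3, 6: 3}


def find_perfect_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, end, motif, copies) tuples for perfect SSRs.

    Coordinates are 0-based and half-open. Motifs that are themselves
    tandem copies of a shorter unit (e.g. 'ATAT') are skipped so that a
    run is reported only under its shortest motif.
    """
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif followed by at least (min_rep - 1) further exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if any(motif == motif[:d] * (motif_len // d)
                   for d in range(1, motif_len) if motif_len % d == 0):
                continue
            copies = (m.end() - m.start()) // motif_len
            hits.append((m.start(), m.end(), motif, copies))
    return sorted(hits)


if __name__ == "__main__":
    example = "GGC" + "AT" * 7 + "CCT" + "AAG" * 5 + "T"
    for hit in find_perfect_ssrs(example):
        print(hit)
```

Each hit could then be stored as one row of a relational table (sequence identifier, start, end, motif, copy number), which is the kind of record a database such as the one described would hold.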
Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh
2014-01-01
Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1-6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of the genome. A total of 534 complete chloroplast genome sequences (as of 18 September 2014) of Viridiplantae are available at the NCBI organelle genome resource. This provides an opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb, which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information on all three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with the majority lying in non-coding regions. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) were the least abundant. Additionally, for each species, statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ © The Author(s) 2014. Published by Oxford University Press.
Zhu, H; Senalik, D; McCown, B H; Zeldin, E L; Speers, J; Hyman, J; Bassil, N; Hummer, K; Simon, P W; Zalapa, J E
2012-01-01
The American cranberry (Vaccinium macrocarpon Ait.) is a major commercial fruit crop in North America, but limited genetic resources have been developed for the species. Furthermore, the paucity of codominant DNA markers has hampered the advance of genetic research in cranberry and the Ericaceae family in general. Therefore, we used Roche 454 sequencing technology to perform low-coverage whole genome shotgun sequencing of the cranberry cultivar 'HyRed'. After de novo assembly, the obtained sequence covered 266.3 Mb of the estimated 540-590 Mb in the cranberry genome. A total of 107,244 SSR loci were detected with an overall density across the genome of 403 SSR/Mb. The AG repeat was the most frequent motif in cranberry, accounting for 35% of all SSRs, and together with AAG and AAAT accounted for 46% of all loci discovered. To validate the SSR loci, we designed 96 primer pairs using contig sequence data containing perfect SSRs, and studied the genetic diversity of 25 cranberry genotypes. We identified 48 polymorphic SSR loci with 2-15 alleles per locus for a total of 323 alleles in the 25 cranberry genotypes. Genetic clustering by principal coordinates and genetic structure analyses confirmed the heterogeneous nature of cranberries. The parentage composition of several hybrid cultivars was evident from the structure analyses. Whole genome shotgun 454 sequencing was a cost-effective and efficient way to identify numerous SSRs in the cranberry sequence for marker development.
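The summary figures in the abstract above follow from simple ratios of the reported counts. The sketch below recomputes them; the variable names are illustrative, and the derived quantities that the abstract does not itself report (genome fraction covered, share of polymorphic primer pairs, mean alleles per locus) are computed here only as a worked example.

```python
# Values reported in the abstract.
ASSEMBLED_MB = 266.3
GENOME_MB_LOW, GENOME_MB_HIGH = 540.0, 590.0
SSR_LOCI = 107_244
PRIMER_PAIRS = 96
POLYMORPHIC_LOCI = 48
TOTAL_ALLELES = 323

# SSR density over the assembled sequence (SSR loci per Mb).
ssr_density = SSR_LOCI / ASSEMBLED_MB

# Fraction of the estimated genome represented by the assembly.
coverage_high = ASSEMBLED_MB / GENOME_MB_LOW   # if the genome is 540 Mb
coverage_low = ASSEMBLED_MB / GENOME_MB_HIGH   # if the genome is 590 Mb

# Marker validation summary.
polymorphic_share = POLYMORPHIC_LOCI / PRIMER_PAIRS
mean_alleles = TOTAL_ALLELES / POLYMORPHIC_LOCI

if __name__ == "__main__":
    print(f"SSR density: {ssr_density:.1f} SSR/Mb")
    print(f"Genome fraction assembled: {coverage_low:.1%} to {coverage_high:.1%}")
    print(f"Polymorphic primer pairs: {polymorphic_share:.0%}")
    print(f"Mean alleles per polymorphic locus: {mean_alleles:.2f}")
```

The density works out to roughly 402.7 SSR/Mb, which rounds to the 403 SSR/Mb stated in the abstract.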
Target Site Recognition by a Diversity-Generating Retroelement
Guo, Huatao; Tse, Longping V.; Nieh, Angela W.; Czornyj, Elizabeth; Williams, Steven; Oukil, Sabrina; Liu, Vincent B.; Miller, Jeff F.
2011-01-01
Diversity-generating retroelements (DGRs) are in vivo sequence diversification machines that are widely distributed in bacterial, phage, and plasmid genomes. They function to introduce vast amounts of targeted diversity into protein-encoding DNA sequences via mutagenic homing. Adenine residues are converted to random nucleotides in a retrotransposition process from a donor template repeat (TR) to a recipient variable repeat (VR). Using the Bordetella bacteriophage BPP-1 element as a prototype, we have characterized requirements for DGR target site function. Although sequences upstream of VR are dispensable, a 24 bp sequence immediately downstream of VR, which contains short inverted repeats, is required for efficient retrohoming. The inverted repeats form a hairpin or cruciform structure and mutational analysis demonstrated that, while the structure of the stem is important, its sequence can vary. In contrast, the loop has a sequence-dependent function. Structure-specific nuclease digestion confirmed the existence of a DNA hairpin/cruciform, and marker coconversion assays demonstrated that it influences the efficiency, but not the site of cDNA integration. Comparisons with other phage DGRs suggested that similar structures are a conserved feature of target sequences. Using a kanamycin resistance determinant as a reporter, we found that transplantation of the IMH and hairpin/cruciform-forming region was sufficient to target the DGR diversification machinery to a heterologous gene. In addition to furthering our understanding of DGR retrohoming, our results suggest that DGRs may provide unique tools for directed protein evolution via in vivo DNA diversification. PMID:22194701
Tran, Trung D; Cao, Hieu X; Jovtchev, Gabriele; Neumann, Pavel; Novák, Petr; Fojtová, Miloslava; Vu, Giang T H; Macas, Jiří; Fajkus, Jiří; Schubert, Ingo; Fuchs, Joerg
2015-12-01
Linear chromosomes of eukaryotic organisms invariably possess centromeres and telomeres to ensure proper chromosome segregation during nuclear divisions and to protect the chromosome ends from deterioration and fusion, respectively. While centromeric sequences may differ between species, with arrays of tandemly repeated sequences and retrotransposons being the most abundant sequence types in plant centromeres, telomeric sequences are usually highly conserved among plants and other organisms. The genome size of the carnivorous genus Genlisea (Lentibulariaceae) is highly variable. Here we study evolutionary sequence plasticity of these chromosomal domains at an intrageneric level. We show that Genlisea nigrocaulis (1C = 86 Mbp; 2n = 40) and G. hispidula (1C = 1550 Mbp; 2n = 40) differ as to their DNA composition at centromeres and telomeres. G. nigrocaulis and its close relative G. pygmaea revealed mainly 161 bp tandem repeats, while G. hispidula and its close relative G. subglabra displayed a combination of four retroelements at centromeric positions. G. nigrocaulis and G. pygmaea chromosome ends are characterized by the Arabidopsis-type telomeric repeats (TTTAGGG); G. hispidula and G. subglabra instead revealed two intermingled sequence variants (TTCAGG and TTTCAGG). These differences in centromeric and, surprisingly, also in telomeric DNA sequences, uncovered between groups with on average a > 9-fold genome size difference, emphasize the fast genome evolution within this genus. Such intrageneric evolutionary alteration of telomeric repeats with cytosine in the guanine-rich strand, not yet known for plants, might impact the epigenetic telomere chromatin modification. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
Saski, Christopher; Lee, Seung-Bum; Fjellheim, Siri; Guda, Chittibabu; Jansen, Robert K.; Luo, Hong; Tomkins, Jeffrey; Rognli, Odd Arne; Clarke, Jihong Liu
2009-01-01
Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5′ end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Erhartoideae and Pooideae. Repeat analysis identified 19–37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16–21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C–U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Erhartoideae and Pooideae. PMID:17534593
Oggioni, M R; Claverys, J P
1999-10-01
A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.
Analysis of SINE and LINE repeat content of Y chromosomes in the platypus, Ornithorhynchus anatinus.
Kortschak, R Daniel; Tsend-Ayush, Enkhjargal; Grützner, Frank
2009-01-01
Monotremes feature an extraordinary sex-chromosome system that consists of five X and five Y chromosomes in males. These sex chromosomes share homology with bird sex chromosomes but no homology with the therian X. The genome of a female platypus was recently completed, providing unique insights into sequence and gene content of autosomes and X chromosomes, but no Y-specific sequence has so far been analysed. Here we report the isolation, sequencing and analysis of approximately 700 kb of sequence of the non-recombining regions of Y2, Y3 and Y5, which revealed differences in base composition and repeat content between autosomes and sex chromosomes, and within the sex chromosomes themselves. This provides the first insights into repeat content of Y chromosomes in platypus, which overall show similar patterns of repeat composition to Y chromosomes in other species. Interestingly, we also observed differences between the various Y chromosomes, and in combination with timing and activity patterns we provide an approach that can be used to examine the evolutionary history of the platypus sex-chromosome chain.
Efficient production of artificially designed gelatins with a Bacillus brevis system.
Kajino, T; Takahashi, H; Hirai, M; Yamada, Y
2000-01-01
Artificially designed gelatins comprising tandemly repeated 30-amino-acid peptide units derived from human alphaI collagen were successfully produced with a Bacillus brevis system. The DNA encoding the peptide unit was synthesized by taking into consideration the codon usage of the host cells, but no clones having a tandemly repeated gene were obtained through this strategy. Minirepeat genes could be selected in vivo from a mixture of every possible sequence encoding an artificial gelatin by randomly ligating the mixed sequence unit and transforming it into Escherichia coli. Larger repeat genes constructed by connecting minirepeat genes obtained by in vivo selection were also stable in the expression host cells. Gelatins derived from the eight-unit and six-unit repeat genes were extracellularly produced at the level of 0.5 g/liter and easily purified by ammonium sulfate fractionation and anion-exchange chromatography. The purified artificial gelatins had the predicted N-terminal sequences and amino acid compositions and a sol-gel property similar to that of the native gelatin. These results suggest that the selection of a repeat unit sequence stable in an expression host is a shortcut for the efficient production of repetitive proteins and that it can conveniently be achieved by the in vivo selection method. This study revealed the possible industrial application of artificially designed repetitive proteins.
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-06-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Characterization of genetic sequence variation of 58 STR loci in four major population groups.
Novroski, Nicole M M; King, Jonathan L; Churchill, Jennifer D; Seah, Lay Hong; Budowle, Bruce
2016-11-01
Massively parallel sequencing (MPS) can identify sequence variation within short tandem repeat (STR) alleles as well as their nominal allele lengths that traditionally have been obtained by capillary electrophoresis. Using the MiSeq FGx Forensic Genomics System (Illumina), STRait Razor, and in-house Excel workbooks, genetic variation was characterized within STR repeat and flanking regions of 27 autosomal, 7 X-chromosome and 24 Y-chromosome STR markers in 777 unrelated individuals from four population groups. Seven hundred and forty-six autosomal, 227 X-chromosome, and 324 Y-chromosome STR alleles were identified by sequence compared with 357 autosomal, 107 X-chromosome, and 189 Y-chromosome STR alleles that were identified by length. Within the observed sequence variation, 227 autosomal, 156 X-chromosome, and 112 Y-chromosome novel alleles were identified and described. One hundred and seventy-six autosomal, 123 X-chromosome, and 93 Y-chromosome sequence variants resided within STR repeat regions, and 86 autosomal, 39 X-chromosome, and 20 Y-chromosome variants were located in STR flanking regions. Three markers, D18S51, DXS10135, and DYS385a-b, had 1, 4, and 1 alleles, respectively, which contained both a novel repeat region variant and a flanking sequence variant in the same nucleotide sequence. There were 50 markers that demonstrated a relative increase in diversity with the variant sequence alleles compared with those of traditional nominal length alleles. These population data illustrate the genetic variation that exists in the commonly used STR markers in the selected population samples and provide allele frequencies for statistical calculations related to STR profiling with MPS data. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Pietras, D F; Bennett, K L; Siracusa, L D; Woodworth-Gutai, M; Chapman, V M; Gross, K W; Kane-Haas, C; Hastie, N D
1983-01-01
We report the construction of a small library of recombinant plasmids containing Mus musculus repetitive DNA inserts. The repetitive cloned fraction was derived from denatured genomic DNA by reassociation to a Cot value at which repetitive, but not unique, sequences have reannealed, followed by exhaustive S1 nuclease treatment to degrade single-stranded DNA. Initial characterizations of this library by colony filter hybridizations have led to the identification of a previously undetected M. musculus minor satellite as well as to clones containing M. musculus major satellite sequences. This new satellite is repeated 10-20 times less than the major satellite in the M. musculus genome. It has a repeat length of 130 nucleotides compared with the M. musculus major satellite with a repeat length of 234 nucleotides. Sequence analysis of the minor satellite has shown that it has a 29 base pair region with extensive homology to one of the major satellite repeating subunits. We also show by in situ hybridization that this minor satellite sequence is located at the centromeres and possibly the arms of at least half the M. musculus chromosomes. Sequences related to the minor satellite have been found in the DNA of a related Mus species, Mus spretus, and may represent the major satellite of that species. PMID:6314268
Cho, Kwang-Soo; Yun, Bong-Kyoung; Yoon, Young-Ho; Hong, Su-Young; Mekapogu, Manjulatha; Kim, Kyung-Hee; Yang, Tae-Jin
2015-01-01
We report the chloroplast (cp) genome sequence of tartary buckwheat (Fagopyrum tataricum) obtained by next-generation sequencing technology and compare it with the previously reported common buckwheat (F. esculentum ssp. ancestrale) cp genome. The cp genome of F. tataricum has a total sequence length of 159,272 bp, which is 327 bp shorter than the common buckwheat cp genome. The cp gene content, order, and orientation are similar to those of common buckwheat, but with some structural variation at tandem and palindromic repeat frequencies and junction areas. A total of seven InDels (around 100 bp) were found within the intergenic sequences and the ycf1 gene. The copy number of the 21-bp tandem repeat differed between F. tataricum (four repeats) and F. esculentum (one repeat), and the InDel of the ycf1 gene was 63 bp long. The coding sequences are highly conserved at both the nucleotide and amino acid levels, with about 98% homology, and four genes (rpoC2, ycf3, accD, and clpP) have high synonymous (Ks) values. PCR-based InDel markers were applied to diverse genetic resources of F. tataricum and F. esculentum, and the amplicon size was identical to that expected in silico. Therefore, these InDel markers are informative biomarkers to practically distinguish raw or processed buckwheat products derived from F. tataricum and F. esculentum. PMID:25966355
Two new miniature inverted-repeat transposable elements in the genome of the clam Donax trunculus.
Šatović, Eva; Plohl, Miroslav
2017-10-01
Repetitive sequences are important components of eukaryotic genomes that drive their evolution. Among them are different types of mobile elements that share the ability to spread throughout the genome and form interspersed repeats. To broaden the generally scarce knowledge on bivalves at the genome level, in the clam Donax trunculus we described two new non-autonomous DNA transposons, miniature inverted-repeat transposable elements (MITEs), named DTC M1 and DTC M2. Like other MITEs, they are characterized by their small size, their A + T richness, and the presence of terminal inverted repeats (TIRs). DTC M1 and DTC M2 are 261 and 286 bp long, respectively, and in addition to TIRs, both of them contain a long imperfect palindrome sequence in their central parts. These elements are present in complete and truncated versions within the genome of the clam D. trunculus. The two new MITEs share only structural similarity, but lack any nucleotide sequence similarity to each other. In a search for related elements in databases, blast search revealed within the Crassostrea gigas genome a larger element sharing sequence similarity only to DTC M1 in its TIR sequences. The lack of sequence similarity with any previously published mobile elements indicates that DTC M1 and DTC M2 elements may be unique to D. trunculus.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miller, Matthew T.; Higgin, Joshua J.; Hall, Traci M.Tanaka
2008-06-06
Pumilio/FBF (PUF) family proteins are found in eukaryotic organisms and regulate gene expression post-transcriptionally by binding to sequences in the 3' untranslated region of target transcripts. PUF proteins contain an RNA binding domain that typically comprises eight α-helical repeats, each of which recognizes one RNA base. Some PUF proteins, including yeast Puf4p, have altered RNA binding specificity and use their eight repeats to bind to RNA sequences with nine or ten bases. Here we report the crystal structures of Puf4p alone and in complex with a 9-nucleotide (nt) target RNA sequence, revealing that Puf4p accommodates an 'extra' nucleotide by modest adaptations allowing one base to be turned away from the RNA binding surface. Using structural information and sequence comparisons, we created a mutant Puf4p protein that preferentially binds to an 8-nt target RNA sequence over a 9-nt sequence and restores binding of each protein repeat to one RNA base.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bhattacharya, Monolekha; Das, Amit Kumar, E-mail: amitk@hijli.iitkgp.ernet.in
Highlights:
- The regulatory sequences recognized by TcrX have been identified.
- The regulatory region comprises inverted repeats separated by a 30 bp region.
- The mode of binding of TcrX to the regulatory sequence is unique.
- The in silico TcrX-DNA docked model binds one of the inverted repeats.
- Both phosphorylated and unphosphorylated TcrX bind the regulatory sequence in vitro.

Abstract: TcrY, a histidine kinase, and TcrX, a response regulator, constitute a two-component system in Mycobacterium tuberculosis. tcrX, which is expressed during iron scarcity, is instrumental in the survival of iron-dependent M. tuberculosis. However, the regulator of tcrX/Y has not been fully characterized. Crosslinking studies of TcrX reveal that it can form oligomers in vitro. Electrophoretic mobility shift assays (EMSAs) show that TcrX recognizes two regions in the promoter that are composed of inverted repeats separated by approximately 30 bp. The dimeric in silico model of TcrX predicts binding to one of these inverted repeat regions. Site-directed mutagenesis and radioactive phosphorylation indicate that D54 of TcrX is phosphorylated by H256 of TcrY. However, phosphorylated and unphosphorylated TcrX bind the regulatory sequence with equal efficiency, as shown by an EMSA using the D54A TcrX mutant.
Structural features of the rice chromosome 4 centromere.
Zhang, Yu; Huang, Yuchen; Zhang, Lei; Li, Ying; Lu, Tingting; Lu, Yiqi; Feng, Qi; Zhao, Qiang; Cheng, Zhukuan; Xue, Yongbiao; Wing, Rod A; Han, Bin
2004-01-01
A complete sequence of a chromosome centromere is necessary for fully understanding centromere function. We report the sequence structure of the first complete rice chromosome centromere, obtained by sequencing a large-insert bacterial artificial chromosome clone-based contig that covered the rice chromosome 4 centromere. Complete sequencing of the 124-kb rice chromosome 4 centromere revealed that it consisted of 18 tracts of 379 tandemly arrayed repeats known as CentO and a total of 19 centromeric retroelements (CRs), but no unique sequences were detected. Four tracts, composed of 65 CentO repeats, were located in the opposite orientation, and 18 CentO tracts were flanked by 19 retroelements. The CRs were classified into four types, and the type I retroelements appeared to be more specific to rice centromeres. The preferential insertion of the CRs among CentO repeats indicated that the centromere-specific retroelements may contribute to centromere expansion during evolution. The presence of three intact retrotransposons in the centromere suggests that they may be responsible for functional centromere initiation through a transcription-mediated mechanism.
Chromosome rearrangements via template switching between diverged repeated sequences
Anand, Ranjith P.; Tsaponina, Olga; Greenwell, Patricia W.; Lee, Cheng-Sheng; Du, Wei; Petes, Thomas D.
2014-01-01
Recent high-resolution genome analyses of cancer and other diseases have revealed the occurrence of microhomology-mediated chromosome rearrangements and copy number changes. Although some of these rearrangements appear to involve nonhomologous end-joining, many must have involved mechanisms requiring new DNA synthesis. Models such as microhomology-mediated break-induced replication (MM-BIR) have been invoked to explain these rearrangements. We examined BIR and template switching between highly diverged sequences in Saccharomyces cerevisiae, induced during repair of a site-specific double-strand break (DSB). Our data show that such template switches are robust mechanisms that give rise to complex rearrangements. Template switches between highly divergent sequences appear to be mechanistically distinct from the initial strand invasions that establish BIR. In particular, such jumps are less constrained by sequence divergence and exhibit a different pattern of microhomology junctions. BIR traversing repeated DNA sequences frequently results in complex translocations analogous to those seen in mammalian cells. These results suggest that template switching among repeated genes is a potent driver of genome instability and evolution. PMID:25367035
Simple sequence repeat marker loci discovery using SSR primer.
Robinson, Andrew J; Love, Christopher G; Batley, Jacqueline; Barker, Gary; Edwards, David
2004-06-12
Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. With the increase in the availability of DNA sequence information, an automated process to identify and design PCR primers for amplification of SSR loci would be a useful tool in plant breeding programs. We report an application that integrates SPUTNIK, an SSR repeat finder, with Primer3, a PCR primer design program, into one pipeline tool, SSR Primer. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. The results are parsed to Primer3 for locus-specific primer design. The script makes use of a Web-based interface, enabling remote use. This program has been written in PERL and is freely available for non-commercial users by request from the authors. The Web-based version may be accessed at http://hornbill.cspp.latrobe.edu.au/
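The first stage of such a pipeline, screening a sequence for SSRs, can be illustrated with a minimal Python sketch. This is not SPUTNIK's algorithm or the SSR Primer code: it is a plain regular-expression search for perfect tandem repeats, and the function name `find_perfect_ssrs` and its default thresholds are assumptions chosen only for illustration.

```python
import re

def find_perfect_ssrs(seq, min_unit=2, max_unit=6, min_repeats=4):
    """Return sorted (start, motif, repeat_count) tuples for perfect tandem repeats.

    Hypothetical helper for illustration; thresholds are arbitrary assumptions.
    """
    seq = seq.upper()
    hits = []
    for unit in range(min_unit, max_unit + 1):
        # A motif of length `unit` followed by at least (min_repeats - 1) exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit, min_repeats - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit (e.g. ATAT, AAA).
            if any(motif == motif[:k] * (unit // k)
                   for k in range(1, unit) if unit % k == 0):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit))
    return sorted(hits)

# Toy sequence: an (AG)6 repeat and an (AAG)5 repeat embedded in short flanks.
example = "TTGC" + "AG" * 6 + "CCTA" + "AAG" * 5 + "GT"
print(find_perfect_ssrs(example))
```

In a real pipeline the reported coordinates would then be handed to a primer design step so that primers are placed in the flanking sequence on either side of each repeat.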
Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining
2014-01-01
Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucleotide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551
Jiang, W; Gupta, D; Gallagher, D; Davis, S; Bhavanandan, V P
2000-04-01
We previously elucidated five distinct protein domains (I-V) for bovine submaxillary mucin, which is encoded by two genes, BSM1 and BSM2. Using Southern blot analysis, genomic cloning and sequencing of the BSM1 gene, we now show that the central domain (V) consists of approximately 55 tandem repeats of 329 amino acids and that domains III-V are encoded by a 58.4-kb exon, the largest exon known for all genes to date. The BSM1 gene was mapped by fluorescence in situ hybridization to the proximal half of chromosome 5 at bands q2.2-q2.3. The amino-acid sequences of six tandem repeats (two full and four partial) were found to have only 92-94% identity. We propose that the variability in the amino-acid sequences of the mucin tandem repeat is important for generating the combinatorial library of saccharides that are necessary for the protective function of mucins. The deduced peptide sequences of the central domain match those determined from the purified bovine submaxillary mucin and also show 68-94% identity to published peptide sequences of ovine submaxillary mucin. This indicates that the core protein of ovine submaxillary mucin is closely related to that of bovine submaxillary mucin and contains similar tandem repeats in the central domain. In contrast, the central domain of porcine submaxillary mucin is reported to consist of 81-amino-acid tandem repeats. However, both bovine submaxillary mucin and porcine submaxillary mucin contain similar N-terminal and C-terminal domains and the corresponding genes are in the conserved linkage regions of the respective genomes.
Identification of antisense long noncoding RNAs that function as SINEUPs in human cells.
Schein, Aleks; Zucchelli, Silvia; Kauppinen, Sakari; Gustincich, Stefano; Carninci, Piero
2016-09-20
Mammalian genomes encode numerous natural antisense long noncoding RNAs (lncRNAs) that regulate gene expression. Recently, an antisense lncRNA to mouse Ubiquitin carboxyl-terminal hydrolase L1 (Uchl1) was reported to increase UCHL1 protein synthesis, representing a new functional class of lncRNAs, designated as SINEUPs, for SINE element-containing translation UP-regulators. Here, we show that an antisense lncRNA to the human protein phosphatase 1 regulatory subunit 12A (PPP1R12A), named as R12A-AS1, which overlaps with the 5' UTR and first coding exon of the PPP1R12A mRNA, functions as a SINEUP, increasing PPP1R12A protein translation in human cells. The SINEUP activity depends on the aforementioned sense-antisense interaction and a free right Alu monomer repeat element at the 3' end of R12A-AS1. In addition, we identify another human antisense lncRNA with SINEUP activity. Our results demonstrate for the first time that human natural antisense lncRNAs can up-regulate protein translation, suggesting that endogenous SINEUPs may be widespread and present in many mammalian species.
Gene polymorphisms of fibrinolytic enzymes in coal workers' pneumoconiosis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chang, L.C.; Tseng, J.C.; Hua, C.C.
2006-03-15
The authors assessed three gene polymorphisms: the missense C/T polymorphism in exon 6 of the urokinase-plasminogen activator (PLAU) gene (PLAU P141L), the Alu-repeat in intron 8 of the tissue-type plasminogen activator (PLAT) gene (PLAT TPA25 Alu insertion), and 4G/5G in the promoter region of the serine proteinase inhibitor, clade E (SERPINE) or plasminogen activator inhibitor type 1 gene (SERPINE1 -675 4G/5G) in 153 healthy volunteers and 154 retired coal miners with coal workers' pneumoconiosis (CWP). The CWP subjects included 94 individuals with simple pneumoconiosis and 60 individuals with progressive massive fibrosis presenting with worse pulmonary function. The distributions of genotypes of these three genes did not differ between the control and CWP subjects or between subjects with simple pneumoconiosis and those with progressive massive fibrosis. However, by assessing duration of work and its interaction with genotypes by means of logistic regression, the authors found the missense C/T polymorphism in exon 6 of the PLAU gene to be an effect modifier of the association between work duration and the development of progressive massive fibrosis.
Repeat-aware modeling and correction of short read errors.
Yang, Xiao; Aluru, Srinivas; Dorman, Karin S
2011-02-15
High-throughput short read sequencing is revolutionizing genomics and systems biology research by enabling cost-effective deep coverage sequencing of genomes and transcriptomes. Error detection and correction are crucial to many short read sequencing applications including de novo genome sequencing, genome resequencing, and digital gene expression analysis. Short read error detection is typically carried out by counting the observed frequencies of kmers in reads and validating those with frequencies exceeding a threshold. In case of genomes with high repeat content, an erroneous kmer may be frequently observed if it has few nucleotide differences with valid kmers with multiple occurrences in the genome. Error detection and correction were mostly applied to genomes with low repeat content and this remains a challenging problem for genomes with high repeat content. We develop a statistical model and a computational method for error detection and correction in the presence of genomic repeats. We propose a method to infer genomic frequencies of kmers from their observed frequencies by analyzing the misread relationships among observed kmers. We also propose a method to estimate the threshold useful for validating kmers whose estimated genomic frequency exceeds the threshold. We demonstrate that superior error detection is achieved using these methods. Furthermore, we break away from the common assumption of uniformly distributed errors within a read, and provide a framework to model position-dependent error occurrence frequencies common to many short read platforms. Lastly, we achieve better error correction in genomes with high repeat content. The software is implemented in C++ and is freely available under GNU GPL3 license and Boost Software V1.0 license at "http://aluru-sun.ece.iastate.edu/doku.php?id=redeem".
We introduce a statistical framework to model sequencing errors in next-generation reads, which led to promising results in detecting and correcting errors for genomes with high repeat content.
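The conventional baseline that this abstract describes (count observed kmer frequencies, then validate kmers whose frequency exceeds a threshold) can be sketched as follows. This is only an illustration of that baseline, not the authors' repeat-aware model; the function names, the toy reads, and the threshold value are assumptions made for the example.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count how often each length-k substring occurs across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def split_by_threshold(counts, threshold):
    """Treat kmers seen more than `threshold` times as valid, the rest as suspect."""
    valid = {kmer for kmer, n in counts.items() if n > threshold}
    suspect = set(counts) - valid
    return valid, suspect


# Toy data (hypothetical): three identical reads plus one read with a T->A misread.
reads = ["ACGTAC", "ACGTAC", "ACGTAC", "ACGAAC"]
counts = count_kmers(reads, k=4)
valid, suspect = split_by_threshold(counts, threshold=1)
```

In a repeat-rich genome an erroneous kmer that lies one substitution away from a high-copy kmer can itself exceed a fixed threshold, which is the failure mode the abstract says motivates estimating genomic frequencies instead of using raw counts.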
Mangericao, Tatiana C; Peng, Zhanhao; Zhang, Xuegong
2016-01-11
CRISPR has become a hot topic as a powerful technique for genome editing in humans and other higher organisms. The original CRISPR-Cas (Clustered Regularly Interspaced Short Palindromic Repeats coupled with CRISPR-associated proteins) is an important adaptive defence system for prokaryotes that provides resistance against invading elements such as viruses and plasmids. A CRISPR cassette contains short nucleotide sequences called spacers. These unique regions retain a history of the interactions between prokaryotes and their invaders in individual strains and ecosystems. One important ecosystem in the human body is the human gut, a rich habitat populated by a great diversity of microorganisms. Gut microbiomes are important for human physiology and health. Metagenome sequencing has been widely applied for studying the gut microbiomes. Most efforts in metagenome studies have focused on profiling taxa compositions and gene catalogues and identifying their associations with human health. Less attention has been paid to the analysis of the ecosystems of microbiomes themselves, especially their CRISPR composition. We conducted a preliminary analysis of CRISPR sequences in a human gut metagenomic data set from Chinese individuals, comprising type-2 diabetes patients and healthy controls. Applying an available CRISPR-identification algorithm, PILER-CR, we identified 3169 CRISPR cassettes in the data, from which we constructed a set of 1302 unique repeat sequences and 36,709 spacers. A more extensive analysis was made for the CRISPR repeats: these repeats were submitted to a more comprehensive clustering and classification using the web server tool CRISPRmap. All repeats were compared with known CRISPRs in the database CRISPRdb. A total of 784 repeats had matches in the database, and the remaining 518 repeats from our set are potentially novel ones. The computational analysis of CRISPR composition based on contigs of metagenome sequencing data is feasible.
It provides an efficient approach for finding potential novel CRISPR arrays and for analysing the ecosystem and history of human microbiomes.
Siefen, H.T.; Campbell, J.M.
1959-02-01
A method is described for removing aluminum-uranium-silicon alloy bonded to metallic U comprising subjecting the Al-U-Si alloy to treatment with hot concentrated HNO3 to partially dissolve and embrittle the alloy and shot-blasting the embrittled alloy to loosen it from the U.
Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis
Zheng, Jin-shuang; Sun, Cheng-zhen; Zhang, Shu-ning; Hou, Xi-lin; Bonnema, Guusje
2016-01-01
A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention has been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes of B. rapa ssp. chinensis. PMID:27507974
Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada
2002-07-01
Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not present on any known plasmids but are conserved in Salmonella and other enterobacterial species.
Evolutional dynamics of 45S and 5S ribosomal DNA in ancient allohexaploid Atropa belladonna.
Volkov, Roman A; Panchuk, Irina I; Borisjuk, Nikolai V; Hosiawa-Baranska, Marta; Maluszynska, Jolanta; Hemleben, Vera
2017-01-23
Polyploid hybrids represent a rich natural resource to study molecular evolution of plant genes and genomes. Here, we applied a combination of karyological and molecular methods to investigate chromosomal structure, molecular organization and evolution of ribosomal DNA (rDNA) in nightshade, Atropa belladonna (fam. Solanaceae), one of the oldest known allohexaploids among flowering plants. Because of their abundance and specific molecular organization (evolutionarily conserved coding regions linked to variable intergenic spacers, IGS), 45S and 5S rDNA are widely used in plant taxonomic and evolutionary studies. Molecular cloning and nucleotide sequencing of A. belladonna 45S rDNA repeats revealed a general structure characteristic of other Solanaceae species, and a very high sequence similarity of two length variants, with the only difference in number of short IGS subrepeats. These results combined with the detection of three pairs of 45S rDNA loci on separate chromosomes, presumably inherited from both tetraploid and diploid ancestor species, exemplify intensive sequence homogenization that led to substitution/elimination of rDNA repeats of one parent. Chromosome silver-staining revealed that only four out of six 45S rDNA sites are frequently transcriptionally active, demonstrating nucleolar dominance. For 5S rDNA, three size variants of repeats were detected, with the major class represented by repeats containing all functional IGS elements required for transcription, the intermediate size repeats containing partially deleted IGS sequences, and the short 5S repeats containing severe defects both in the IGS and coding sequences. While shorter variants show an increased rate of base substitution, probably in their transition into pseudogenes, the functional 5S rDNA variants are nearly identical at the sequence level, pointing to their origin from a single parental species.
Localization of the 5S rDNA genes on two chromosome pairs further supports uniparental inheritance from the tetraploid progenitor. The obtained molecular, cytogenetic and phylogenetic data demonstrate complex evolutionary dynamics of rDNA loci in the allohexaploid species Atropa belladonna. The high level of sequence unification revealed in 45S and 5S rDNA loci of this ancient hybrid species has seemingly been achieved by different molecular mechanisms.
Identification of presumed ancestral DNA sequences of phaseolin in Phaseolus vulgaris.
Kami, J; Velásquez, V B; Debouck, D G; Gepts, P
1995-01-01
Common bean (Phaseolus vulgaris) consists of two major geographic gene pools, one distributed in Mexico, Central America, and Colombia and the other in the southern Andes (southern Peru, Bolivia, and Argentina). Amplification and sequencing of members of the multigene family coding for phaseolin, the major seed storage protein of the common bean, provide evidence for accumulation of tandem direct repeats in both introns and exons during evolution of the multigene family in this species. The presumed ancestral phaseolin sequences, without tandem repeats, were found in recently discovered but nearly extinct wild common bean populations of Ecuador and northern Peru that are intermediate between the two major gene pools of the species based on geographical and molecular arguments. Our results illustrate the usefulness of tandem direct repeats in establishing the polarity of DNA sequence divergence and therefore in proposing phylogenies. PMID:7862642
Tek, Ahmet L; Kashihara, Kazunari; Murata, Minoru; Nagaki, Kiyotaka
2011-11-01
The centromere plays an essential role in proper chromosome segregation during cell division and usually harbors long arrays of tandemly repeated satellite DNA sequences. Although this function is conserved among eukaryotes, the sequences of centromeric DNA repeats are variable. Most of our understanding of functional centromeres, which are defined by localization of a centromere-specific histone H3 (CENH3) protein, comes from model organisms. The components of the functional centromere in legumes are poorly known. The genus Astragalus is a member of the legumes and bears the largest number of species among angiosperms. Therefore, we studied the components of centromeres in Astragalus sinicus. We identified the CenH3 homolog of A. sinicus, AsCenH3, which is the most compact in size among higher eukaryotes. A CENH3-based assay revealed the functional centromeric DNA sequences from A. sinicus, called CentAs. The CentAs repeat is localized in A. sinicus centromeres, and comprises an AT-rich tandem repeat with a monomer size of 20 nucleotides.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ostrander, E.A.; Sprague, G.F. Jr.; Rine, J.
1993-04-01
A large block of simple sequence repeat (SSR) polymorphisms for the dog genome has been isolated and characterized. Screening of primary libraries by conventional hybridization methods as well as by screening of enriched marker-selected libraries led to the isolation of a large number of genomic clones that contained (CA)n repeats. The sequences of 101 clones showed that the size and complexity of (CA)n repeats in the dog genome were similar to those reported for these markers in the human genome. Detailed analysis of a representative subset of these markers revealed that most markers were moderately to highly polymorphic, with PIC values exceeding 0.70 for 33% of the markers tested. An association between higher PIC values and markers containing longer (CA)n repeats was observed in these studies, as previously noted for similar markers in the human genome. A list of primer sequences that tag each characterized marker is provided, and a comprehensive system of nomenclature for the dog genome is suggested. 28 refs., 4 figs., 2 tabs.
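For readers unfamiliar with the PIC (polymorphism information content) values quoted above, the sketch below computes PIC from allele frequencies using the commonly cited definition, PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2. The abstract does not say which formula the authors used, so this is an assumption; the function name and the example frequencies are hypothetical.

```python
def pic(freqs):
    """Polymorphism information content for one marker.

    `freqs` is a list of allele frequencies that should sum to 1.
    Assumes the standard definition; not taken from the abstract itself.
    """
    homozygosity = sum(p * p for p in freqs)
    # Correction term for matings between identical heterozygotes.
    cross = sum(
        2 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return 1 - homozygosity - cross


two_alleles = pic([0.5, 0.5])                 # biallelic marker, equal frequencies
four_alleles = pic([0.25, 0.25, 0.25, 0.25])  # four equally frequent alleles
```

Under this definition a marker needs at least four roughly equally frequent alleles to reach the 0.70 level mentioned in the abstract, which fits the reported link between longer (CA)n repeats, which tend to carry more alleles, and higher PIC values.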
van Gijlswijk, R P; Wiegant, J; Vervenne, R; Lasan, R; Tanke, H J; Raap, A K
1996-01-01
We present a sensitive and rapid fluorescence in situ hybridization (FISH) strategy for detecting chromosome-specific repeat sequences. It uses horseradish peroxidase (HRP)-labeled oligonucleotide sequences in combination with fluorescent tyramide-based detection. After in situ hybridization, the HRP conjugated to the oligonucleotide probe is used to deposit fluorescently labeled tyramide molecules at the site of hybridization. The method features full chemical synthesis of probes, strong FISH signals, and short processing periods, as well as multicolor capabilities.