repetitive element sequence-based: Topics by Science.gov

Sample records for repetitive element sequence-based

Palindromic repetitive DNA elements with coding potential in Methanocaldococcus jannaschii.

PubMed

Suyama, Mikita; Lathe, Warren C; Bork, Peer

2005-10-10

We have identified 141 novel palindromic repetitive elements in the genome of the euryarchaeon Methanocaldococcus jannaschii. The total length of these elements is 14.3 kb, which corresponds to 0.9% of the total genomic sequence and 6.3% of all extragenic regions. The elements can be divided into three groups (MJRE1-3) based on sequence similarity. The low sequence identity within each of the groups suggests a rather old origin of these elements in M. jannaschii. Three MJRE2 elements were located within protein-coding regions without disrupting the coding potential of the host genes, indicating that insertion of repeats might be a widespread mechanism to enhance sequence diversity in coding regions.
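As a rough illustration of what "palindromic" means for DNA (a sequence that reads the same as its own reverse complement), the sketch below counts the positions at which a sequence differs from its reverse complement. The function names are hypothetical and this is not the detection method used in the study; real elements such as MJRE1-3 may be imperfect palindromes, so a small non-zero mismatch count would be expected.

```python
# Minimal sketch, not the authors' method: function names are illustrative.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def palindrome_mismatches(seq):
    """Count positions where seq differs from its reverse complement.

    0 means a perfect DNA palindrome; small values suggest an imperfect one.
    """
    s = seq.upper()
    rc = reverse_complement(s)
    return sum(1 for a, b in zip(s, rc) if a != b)


if __name__ == "__main__":
    print(palindrome_mismatches("GAATTC"))  # EcoRI site, a perfect palindrome
    print(palindrome_mismatches("GAATTA"))  # one altered end base
```

Each altered base in a palindrome shows up twice in the count (once on each arm), which is worth remembering when choosing a tolerance.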
Repetitive sequence analysis and karyotyping reveals centromere-associated DNA sequences in radish (Raphanus sativus L.).

PubMed

He, Qunyan; Cai, Zexi; Hu, Tianhua; Liu, Huijun; Bao, Chonglai; Mao, Weihai; Jin, Weiwei

2015-04-18

Radish (Raphanus sativus L., 2n = 2x = 18) is a major root vegetable crop, especially in eastern Asia. Radish root contains various nutrients that play an important role in strengthening immunity. Repetitive elements are primary components of the genomic sequence and the most important factors in genome size variations in higher eukaryotes. To date, studies of repetitive elements in radish are still limited. To better understand the genome structure of radish, we undertook a study to evaluate the proportion of repetitive elements and their distribution in radish. We conducted genome-wide characterization of repetitive elements in radish with low-coverage genome sequencing followed by similarity-based cluster analysis. Results showed that about 31% of the genome was composed of repetitive sequences. Satellite repeats were the predominant elements of the genome. The distribution pattern of three satellite repeat sequences (CL1, CL25, and CL43) on radish chromosomes was characterized using fluorescence in situ hybridization (FISH). CL1 was predominantly located at the centromeric region of all chromosomes, CL25 was located at the subtelomeric region, and CL43 was a telomeric satellite. FISH signals of two satellite repeats, CL1 and CL25, together with 5S rDNA and 45S rDNA, provide useful cytogenetic markers to identify each individual somatic metaphase chromosome. The centromere-specific histone H3 (CENH3) has been used as a marker to identify centromere DNA sequences. One putative CENH3 (RsCENH3) was characterized and cloned from radish. Its deduced amino acid sequence shares high similarity with those of the CENH3s in Brassica species. An antibody against B. rapa CENH3 specifically stained radish centromeres. Immunostaining and chromatin immunoprecipitation (ChIP) tests with anti-BrCENH3 antibody demonstrated that both the centromere-specific retrotransposon (CR-Radish) and satellite repeat (CL1) are directly associated with RsCENH3 in radish.
Proportions of repetitive elements in radish were estimated, and satellite repeats were the predominant elements. Fine karyotyping analysis was established, which allows us to easily identify each individual somatic metaphase chromosome. Immunofluorescence- and ChIP-based assays demonstrated the functional significance of the satellite repeat and the centromere-specific retrotransposon at centromeres. Our study provides a valuable basis for future genomic studies in radish.
Characterization of the repetitive DNA elements in the genome of fish lymphocystis disease viruses.

PubMed

Schnitzler, P; Darai, G

1989-09-01

The complete DNA nucleotide sequence of the repetitive DNA elements in the genome of fish lymphocystis disease virus (FLDV) isolated from two different species (flounder and dab) was determined. The size of these repetitive DNA elements was found to be 1413 bp which corresponds to the DNA sequences of the 5' terminus of the EcoRI DNA fragment B (0.034 to 0.052 m.u.) and to the EcoRI DNA fragment M (0.718 to 0.736 m.u.) of the FLDV genome causing lymphocystis disease in flounder and plaice. The degree of DNA nucleotide homology between both regions was found to be 99%. The repetitive DNA element in the genome of FLDV isolated from other fish species (dab) was identified and is located within the EcoRI DNA fragment B and J of the viral genome. The DNA nucleotide sequence of one duplicate of this repetition (EcoRI DNA fragment J) was determined (1410 bp) and compared to the DNA nucleotide sequences of the repetitive DNA elements of the genome of FLDV isolated from flounder. It was found that the repetitive DNA elements of the genome of FLDV derived from two different fish species are highly conserved and possess a degree of DNA sequence homology of 94%. The DNA sequences of each strand of the individual repetitive element possess one open reading frame.
Prediction and phylogenetic analysis of mammalian short interspersed elements (SINEs).

PubMed

Rogozin, I B; Mayorov, V I; Lavrentieva, M V; Milanesi, L; Adkison, L R

2000-09-01

The presence of repetitive elements can create serious problems for sequence analysis, especially in the case of homology searches in nucleotide sequence databases. Repetitive elements should be treated carefully by using special programs and databases. In this paper, various aspects of SINE (short interspersed repetitive element) identification, analysis and evolution are discussed.
Characterization and distribution of repetitive elements in association with genes in the human genome.

PubMed

Liang, Kai-Chiang; Tseng, Joseph T; Tsai, Shaw-Jenq; Sun, H Sunny

2015-08-01

Repetitive elements constitute more than 50% of the human genome. Recent studies implied that the complexity of living organisms is not just a direct outcome of the number of coding sequences; the repetitive elements, which do not encode proteins, may also play a significant role. Though scattered studies showed that repetitive elements in the regulatory regions of a gene control gene expression, no systematic survey has been done to report the characterization and distribution of various types of these repetitive elements in the human genome. Sequences from 5' and 3' untranslated regions and from regions upstream and downstream of each gene were downloaded from the Ensembl database. The repetitive elements in the neighborhood of each gene were identified and classified using cross-matching implemented in RepeatMasker. The annotation and distribution of distinct classes of repetitive elements associated with individual genes were collected to characterize genes in association with different types of repetitive elements using a systems biology program. We identified a total of 1,068,400 repetitive elements which belong to 37-class families and 1235 subclasses that are associated with 33,761 genes and 57,365 transcripts. In addition, we found that tandem repeats are preferentially located proximal to the transcription start site (TSS) of genes and that these genes function mainly in developmental processes. On the other hand, interspersed repetitive elements tended to accumulate in regions distal to the TSS, and interspersed repeat-containing genes took part in catabolic/metabolic processes. Results from the distribution analysis were collected and used to construct a gene-based repetitive element database (GBRED; http://www.binfo.ncku.edu.tw/GBRED/index.html). A user-friendly web interface was designed to provide information on the repetitive elements associated with any particular gene(s).
This is the first study focusing on the gene-associated repetitive elements in the human genome. Our data showed distinct genes associated with different kinds of repetitive elements and implied that such combinations may shape the function of these genes. Aside from the conventional view of these elements in genome evolution, results from this study offer a systemic review to facilitate exploitation of these elements in genome function. Copyright © 2015 Elsevier Ltd. All rights reserved.
Repetitive Elements May Comprise Over Two-Thirds of the Human Genome

PubMed Central

de Koning, A. P. Jason; Gu, Wanjun; Castoe, Todd A.; Batzer, Mark A.; Pollock, David D.

2011-01-01

Transposable elements (TEs) are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo “clouds”). We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%–69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM), to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (∼25 bp). Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed “element-specific” P-clouds (ESPs) to identify novel Alu and MIR SINE elements, and using it we identified ∼100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed. PMID:22144907
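The idea of grouping high-abundance oligonucleotides that are close in sequence space can be sketched in a few lines. The code below is only a toy illustration and is not the P-clouds implementation: the greedy seeding rule, the Hamming-distance cutoff, and the parameter names `core_min`, `member_min`, and `max_dist` are all assumptions made for this example.

```python
# Toy sketch only; NOT the P-clouds algorithm. All names and thresholds are
# assumptions chosen for illustration.
from collections import Counter


def count_kmers(seq, k):
    """Count every overlapping k-mer in seq."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def build_clouds(counts, core_min, member_min, max_dist=1):
    """Greedily group abundant k-mers into 'clouds'.

    The most abundant unassigned k-mer with count >= core_min seeds a cloud;
    unassigned k-mers with count >= member_min that lie within max_dist
    substitutions of the seed are added to it.
    """
    assigned = set()
    clouds = []
    for kmer, n in counts.most_common():
        if n < core_min:
            break  # remaining k-mers are too rare to seed a cloud
        if kmer in assigned:
            continue
        cloud = {kmer}
        assigned.add(kmer)
        for other, m in counts.items():
            if (other not in assigned and m >= member_min
                    and hamming(kmer, other) <= max_dist):
                cloud.add(other)
                assigned.add(other)
        clouds.append(cloud)
    return clouds


if __name__ == "__main__":
    counts = Counter({"AAAA": 10, "AAAT": 4, "CCCC": 9, "GGGG": 1})
    print(build_clouds(counts, core_min=5, member_min=2))
```

The pairwise comparison here is quadratic in the number of distinct k-mers, so it is suitable only for small demonstrations, not for genome-scale data.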
A ribosomal orphon sequence from Xenopus laevis flanked by novel low copy number repetitive elements.

PubMed

Guimond, A; Moss, T

1999-02-01

We have used a differential cloning approach to isolate ribosomal/non-ribosomal frontier sequences from Xenopus laevis. A ribosomal intergenic spacer sequence (IGS) was cloned and shown not to be physically linked with the ribosomal locus. This ribosomal orphon contained the IGS sequences found immediately downstream of the 28S gene and included an array of enhancer repetitions and a non-functional spacer promoter. The orphon sequence was flanked by a member of the novel 'Frt' low copy repetitive element family. Three individual Frt repeats were sequenced and all members of this family were shown to lie clustered at two chromosomal sites, one of which contained the ribosomal orphon. One of the Frt elements contained an insertion of 297 bp that showed extensive homology to sequences within at least three other Xenopus genes. Each homology region was flanked by members of the T2 family of short interspersed repetitive elements (SINEs) and by its target insertion sequence, suggesting multiple translocation events. The data are discussed in terms of the evolution of the ribosomal gene locus.
Repetitive sequences and epigenetic modification: inseparable partners play important roles in the evolution of plant sex chromosomes.

PubMed

Li, Shu-Fen; Zhang, Guo-Jun; Yuan, Jin-Hong; Deng, Chuan-Liang; Gao, Wu-Jun

2016-05-01

The present review discusses the roles that repetitive sequences play in plant sex chromosome evolution, and highlights epigenetic modification as a potential mechanism by which repetitive sequences are involved in sex chromosome evolution. Sex determination in plants is mostly based on sex chromosomes. Classic theory proposes that sex chromosomes evolve from a specific pair of autosomes with the emergence of a sex-determining gene(s). Subsequently, the newly formed sex chromosomes stop recombination in a small region around the sex-determining locus, and over time, the non-recombining region expands to almost all parts of the sex chromosomes. Accumulation of repetitive sequences, mostly transposable elements and tandem repeats, is a conspicuous feature of the non-recombining region of the Y chromosome, even in primitive ones. Repetitive sequences may play multiple roles in sex chromosome evolution, such as triggering heterochromatization and causing recombination suppression, leading to structural and morphological differentiation of sex chromosomes, and promoting Y chromosome degeneration and X chromosome dosage compensation. In this article, we review the current status of this field, and based on preliminary evidence, we posit that repetitive sequences are involved in sex chromosome evolution probably via epigenetic modification, such as DNA and histone methylation, with small interfering RNAs as the mediator.
Repetitive element transcripts are elevated in the brain of C9orf72 ALS/FTLD patients.

PubMed

Prudencio, Mercedes; Gonzales, Patrick K; Cook, Casey N; Gendron, Tania F; Daughrity, Lillian M; Song, Yuping; Ebbert, Mark T W; van Blitterswijk, Marka; Zhang, Yong-Jie; Jansen-West, Karen; Baker, Matthew C; DeTure, Michael; Rademakers, Rosa; Boylan, Kevin B; Dickson, Dennis W; Petrucelli, Leonard; Link, Christopher D

2017-09-01

Significant transcriptome alterations are detected in the brain of patients with amyotrophic lateral sclerosis (ALS), including carriers of the C9orf72 repeat expansion and C9orf72-negative sporadic cases. Recently, the expression of repetitive element transcripts has been associated with toxicity and, while increased repetitive element expression has been observed in several neurodegenerative diseases, little is known about their contribution to ALS. To assess whether aberrant expression of repetitive element sequences is observed in ALS, we analysed RNA sequencing data from C9orf72-positive and sporadic ALS cases, as well as healthy controls. Transcripts from multiple classes and subclasses of repetitive elements (LINEs, endogenous retroviruses, DNA transposons, simple repeats, etc.) were significantly increased in the frontal cortex of C9orf72 ALS patients. A large collection of patient samples, representing both C9orf72 positive and negative ALS, ALS/FTLD, and FTLD cases, was used to validate the levels of several repetitive element transcripts. These analyses confirmed that repetitive element expression was significantly increased in C9orf72-positive compared to C9orf72-negative or control cases. While previous studies suggest an important link between TDP-43 and repetitive element biology, our data indicate that TDP-43 pathology alone is insufficient to account for the observed changes in repetitive elements in ALS/FTLD. Instead, we found that repetitive element expression positively correlated with RNA polymerase II activity in postmortem brain, and pharmacologic modulation of RNA polymerase II activity altered repetitive element expression in vitro. We conclude that increased RNA polymerase II activity in ALS/FTLD may lead to increased repetitive element transcript expression, a novel pathological feature of ALS/FTLD. © The Author 2017. Published by Oxford University Press.
ProGeRF: Proteome and Genome Repeat Finder Utilizing a Fast Parallel Hash Function

PubMed Central

Moraes, Walas Jhony Lopes; Rodrigues, Thiago de Souza; Bartholomeu, Daniella Castanheira

2015-01-01

Repetitive element sequences are adjacent, repeating patterns, also called motifs, and can be of different lengths; repetitions can involve their exact or approximate copies. They have been widely used as molecular markers in population biology. Given the sizes of sequenced genomes, various bioinformatics tools have been developed for the extraction of repetitive elements from DNA sequences. However, currently available tools do not provide options for identifying repetitive elements in the genome or proteome, displaying a user-friendly web interface, and performing exhaustive searches. ProGeRF is a web site for extracting repetitive regions from genome and proteome sequences. It was designed to be an efficient, fast, accurate, and above all user-friendly web tool that offers many ways to view and analyse the results. ProGeRF (Proteome and Genome Repeat Finder) is freely available as a stand-alone program, from which the users can download the source code, and as a web tool. It was developed using the hash table approach to extract perfect and imperfect repetitive regions in a (multi)FASTA file, while allowing a linear time complexity. PMID:25811026
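To make the hash-table idea concrete, here is a minimal sketch that indexes every k-mer position in a dictionary and then reports runs of adjacent identical k-mers (perfect tandem repeats). It is not ProGeRF's code: the function names and the `min_copies` parameter are hypothetical, imperfect repeats are not handled, and each rotation of a motif is reported separately.

```python
# Minimal sketch of a hash-table repeat search; not ProGeRF's implementation.
# Function names and parameters are illustrative assumptions.
from collections import defaultdict


def index_kmers(seq, k):
    """Map each k-mer to the list of positions where it starts."""
    table = defaultdict(list)
    for i in range(len(seq) - k + 1):
        table[seq[i:i + k]].append(i)
    return table


def perfect_tandem_repeats(seq, k, min_copies=2):
    """Return sorted (start, motif, copies) for runs of adjacent identical k-mers."""
    seq = seq.upper()
    table = index_kmers(seq, k)
    results = []
    for motif, positions in table.items():
        pos_set = set(positions)
        for p in positions:
            if p - k in pos_set:
                continue  # p continues an earlier run; only count from run starts
            copies = 1
            while p + copies * k in pos_set:
                copies += 1
            if copies >= min_copies:
                results.append((p, motif, copies))
    return sorted(results)


if __name__ == "__main__":
    for hit in perfect_tandem_repeats("TTACGACGACGTT", k=3):
        print(hit)
```

Because a run is only walked from its first copy, each indexed position is visited a bounded number of times, which is the property that lets a hash-based search scale roughly linearly with sequence length for a fixed k.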
Transcriptional "silencer" element in rat repetitive sequences associated with the rat insulin 1 gene locus.

PubMed Central

Laimins, L; Holmgren-König, M; Khoury, G

1986-01-01

The enhancer elements from either simian virus 40 or murine sarcoma virus activate the expression of a transfected rat insulin 1 (rI1) gene when placed within 2.0 kilobases or less of the rI1 gene cap site. Inclusion of 4.0 kilobases of upstream rI1 sequence, however, results in a substantial reduction in the enhancer-dependent insulin gene expression. These observations suggested that a negative transcriptional regulatory element was present between 2.0 and 4.0 kilobases of the rI1 sequence. To test this notion, we employed a heterologous enhancer-dependent transcription assay in which the simian virus 40 72-base-pair repeat is linked to a human beta-globin gene. Addition of the upstream rI1 element to this system decreased the level of enhancer-dependent beta-globin transcription by a factor of 5 to 15. This rI1 "silencer" element functions in a manner relatively independent of position and orientation and requires a cis-dependent relationship to the transcription unit on which it acts. Thus, the silencer sequence seems to have a number of the characteristics of enhancer elements, and we suggest that it may function by the converse of the enhancer mechanism. The rI1 silencer sequence was identified as a member of a long interspersed rat repetitive family. Thus, a potential role for certain repetitive sequences interspersed throughout the eukaryotic genome may be to regulate gene expression by retaining transcriptional activity within defined domains. PMID:3010279
Applications of the rep-PCR DNA fingerprinting technique to study microbial diversity, ecology and evolution.

PubMed

Ishii, Satoshi; Sadowsky, Michael J

2009-04-01

A large number of repetitive DNA sequences are found in multiple sites in the genomes of numerous bacteria, archaea and eukarya. While the functions of many of these repetitive sequence elements are unknown, they have proven to be useful as the basis of several powerful tools for use in molecular diagnostics, medical microbiology, epidemiological analyses and environmental microbiology. The repetitive sequence-based PCR or rep-PCR DNA fingerprint technique uses primers targeting several of these repetitive elements and PCR to generate unique DNA profiles or 'fingerprints' of individual microbial strains. Although this technique has been extensively used to examine diversity among a variety of prokaryotic microorganisms, rep-PCR DNA fingerprinting can also be applied to microbial ecology and microbial evolution studies since it has the power to distinguish microbes at the strain or isolate level. Recent advances in rep-PCR methodology have resulted in increased accuracy, reproducibility and throughput. In this minireview, we summarize recent improvements in rep-PCR DNA fingerprinting methodology, and discuss its applications to address fundamentally important questions in microbial ecology and evolution.
Transposon-like properties of the major, long repetitive sequence family in the genome of Physarum polycephalum

PubMed Central

Pearston, Douglas H.; Gordon, Mairi; Hardman, Norman

1985-01-01

A family of long, highly-repetitive sequences, referred to previously as 'HpaII-repeats', dominates the genome of the eukaryotic slime mould Physarum polycephalum. These sequences are found exclusively in scrambled clusters. They account for about one-half of the total complement of repetitive DNA in Physarum, and represent the major sequence component found in hypermethylated, 20-50 kb segments of Physarum genomic DNA that fail to be cleaved using the restriction endonuclease HpaII. The structure of this abundant repetitive element was investigated by analysing cloned segments derived from the hypermethylated genomic DNA compartment. We show that the 'HpaII-repeat' forms part of a larger repetitive DNA structure, ∼8.6 kb in length, with several structural features in common with recognised eukaryotic transposable genetic elements. Scrambled clusters of the sequence probably arise as a result of transposition-like events, during which the element preferentially recombines in either orientation with target sites located in other copies of the same repeated sequence. The target sites for transposition/recombination are not related in sequence but in all cases studied they are potentially capable of promoting the formation of small 'cruciforms' or 'Z-DNA' structures which might be recognised during the recombination process. PMID:16453652
Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor.

PubMed

Kohany, Oleksiy; Gentles, Andrew J; Hankus, Lukasz; Jurka, Jerzy

2006-10-25

Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintenance of the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases. We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a Java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors, and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein coding regions in sequences; searching and including PubMed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments. Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at http://www.girinst.org/repbase/submission.html (RepbaseSubmitter) and http://www.girinst.org/censor/index.php (Censor).
Repetitive sequences: the hidden diversity of heterochromatin in prochilodontid fish

PubMed Central

Terencio, Maria L.; Schneider, Carlos H.; Gross, Maria C.; do Carmo, Edson Junior; Nogaroto, Viviane; de Almeida, Mara Cristina; Artoni, Roberto Ferreira; Vicari, Marcelo R.; Feldberg, Eliana

2015-01-01

The structure and organization of repetitive elements in fish genomes are still relatively poorly understood, although most of these elements are believed to be located in heterochromatic regions. Repetitive elements are considered essential in evolutionary processes as hotspots for mutations and chromosomal rearrangements, among other functions – thus providing new genomic alternatives and regulatory sites for gene expression. The present study sought to characterize repetitive DNA sequences in the genomes of Semaprochilodus insignis (Jardine & Schomburgk, 1841) and Semaprochilodus taeniurus (Valenciennes, 1817) and identify regions of conserved syntenic blocks in this genome fraction of three species of Prochilodontidae (Semaprochilodus insignis, Semaprochilodus taeniurus, and Prochilodus lineatus (Valenciennes, 1836)) by cross-FISH using Cot-1 DNA (renaturation kinetics) probes. We found that the repetitive fractions of the genomes of Semaprochilodus insignis and Semaprochilodus taeniurus have significant amounts of conserved syntenic blocks in hybridization sites, but with low degrees of similarity between them and the genome of Prochilodus lineatus, especially in relation to B chromosomes. The cloning and sequencing of the repetitive genomic elements of Semaprochilodus insignis and Semaprochilodus taeniurus using Cot-1 DNA identified 48 fragments that displayed high similarity with repetitive sequences deposited in public DNA databases and classified as microsatellites, transposons, and retrotransposons. The repetitive fractions of the Semaprochilodus insignis and Semaprochilodus taeniurus genomes exhibited high degrees of conserved syntenic blocks in terms of both the structures and locations of hybridization sites, but a low degree of similarity with the syntenic blocks of the Prochilodus lineatus genome.
Future comparative analyses of other Prochilodontidae species will be needed to advance our understanding of the organization and evolution of the genomes in this group of fish. PMID:26752156
Location analysis for the estrogen receptor-α reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements

PubMed Central

Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.

2010-01-01

Location analysis for estrogen receptor-α (ERα)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERα-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10–20% nucleotide deviation from the canonical ERE sequence. We demonstrate that ∼50% of all ERα-bound loci do not have a discernable ERE and show that most ERα-bound EREs are not perfect consensus EREs. Approximately one-third of all ERα-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERα-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERα binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers. PMID:20047966
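A scan for ERE-like sites at graded stringency can be sketched as follows. The consensus used here, GGTCAnnnTGACC (two inverted 5-bp half-sites around a 3-bp spacer), is the commonly cited canonical ERE; everything else is an assumption for illustration and not the authors' pipeline. In particular, counting deviation over the 10 half-site bases (so 0 mismatches falls under "<10%" and 1 to 2 mismatches under "10–20%") is one plausible reading of the two stringencies, and the function names are hypothetical.

```python
# Minimal sketch, not the study's method. Assumption: deviation is counted
# over the 10 half-site bases of the consensus GGTCAnnnTGACC.
LEFT, RIGHT, SPACER_LEN = "GGTCA", "TGACC", 3
ENRICHED_SPACERS = {"CAG", "CTG"}  # the C(A/T)G spacers noted in the abstract


def scan_ere(seq, max_mismatches=2):
    """Return (start, site, mismatches, spacer) for each ERE-like window."""
    seq = seq.upper()
    width = len(LEFT) + SPACER_LEN + len(RIGHT)
    hits = []
    for i in range(len(seq) - width + 1):
        window = seq[i:i + width]
        left = window[:len(LEFT)]
        spacer = window[len(LEFT):len(LEFT) + SPACER_LEN]
        right = window[len(LEFT) + SPACER_LEN:]
        mismatches = sum(a != b for a, b in zip(left + right, LEFT + RIGHT))
        if mismatches <= max_mismatches:
            hits.append((i, window, mismatches, spacer))
    return hits


def stringency_bin(mismatches):
    """Bin a hit by fraction of the 10 half-site bases that deviate."""
    if mismatches == 0:
        return "<10%"
    if mismatches <= 2:
        return "10-20%"
    return ">20%"


if __name__ == "__main__":
    for start, site, mm, spacer in scan_ere("AAGGTCACAGTGACTTT"):
        print(start, site, stringency_bin(mm), spacer in ENRICHED_SPACERS)
```

This scans one strand only; since the perfect consensus is its own reverse complement, a fuller version would still need to handle imperfect sites on the opposite strand.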
Location analysis for the estrogen receptor-alpha reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements.

PubMed

Mason, Christopher E; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M; Kallen, Roland G; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B

2010-04-01

Location analysis for estrogen receptor-alpha (ERalpha)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERalpha-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10-20% nucleotide deviation from the canonical ERE sequence. We demonstrate that approximately 50% of all ERalpha-bound loci do not have a discernable ERE and show that most ERalpha-bound EREs are not perfect consensus EREs. Approximately one-third of all ERalpha-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERalpha-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERalpha binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers.
Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution

PubMed Central

Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Hubisz, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Zhang, Peili; Liu, Jing; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catharine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenée; Verduzco, Daniel; Clerc-Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.

2005-01-01

We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25–55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species—but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila. PMID:15632085
The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences

PubMed Central

2010-01-01

Background In today's age of genomic discovery, no attempt has been made to comprehensively sequence a gymnosperm genome. The largest genus in the coniferous family Pinaceae is Pinus, whose 110-120 species have extremely large genomes (c. 20-40 Gb, 2N = 24). The size and complexity of these genomes have prompted much speculation as to the feasibility of completing a conifer genome sequence. Conifer genomes are reputed to be highly repetitive, but there is little information available on the nature and identity of repetitive units in gymnosperms. The pines have extensive genetic resources, with approximately 329000 ESTs from eleven species and genetic maps in eight species, including a dense genetic map of the twelve linkage groups in Pinus taeda. Results We present here the Sanger sequence and annotation of ten P. taeda BAC clones and Genome Analyzer II whole genome shotgun (WGS) sequences representing 7.5% of the genome. Computational annotation of ten BACs predicts three putative protein-coding genes and at least fifteen likely pseudogenes in nearly one megabase of sequence. We found three conifer-specific LTR retroelements in the BACs, and tentatively identified at least 15 others based on evidence from the distantly related angiosperms. Alignment of WGS sequences to the BACs indicates that 80% of BAC sequences have similar copies (≥ 75% nucleotide identity) elsewhere in the genome, but only 23% have identical copies (99% identity). The three most common repetitive elements in the genome were identified and, when combined, represent less than 5% of the genome. Conclusions This study indicates that the majority of repeats in the P. taeda genome are 'novel' and will therefore require additional BAC or genomic sequencing for accurate characterization. The pine genome contains a very large number of diverged and probably defunct repetitive elements. This study also provides new evidence that sequencing a pine genome using a WGS approach is a feasible goal. 
PMID:20609256
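The identity thresholds quoted in the abstract above (at least 75% for "similar" copies, 99% for "identical" copies) can be illustrated with a minimal sketch. It assumes two sequences that are already aligned to equal length, with `-` marking gaps. The function names, the treatment of 99% as a lower bound, and the "diverged" label are illustrative assumptions, not the authors' pipeline.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns that carry the same base.

    Columns where both sequences have a gap are ignored; a gap opposite
    a base counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def classify_copy(identity: float) -> str:
    """Label a copy using the thresholds quoted in the abstract (assumed inclusive)."""
    if identity >= 99.0:
        return "identical"
    if identity >= 75.0:
        return "similar"
    return "diverged"  # hypothetical label for anything below 75%
```

For example, two 8-base toy sequences that differ at one position are 87.5% identical and would be labelled "similar" under these assumptions.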
Control of transcriptional pausing by biased thermal fluctuations on repetitive genomic sequences

PubMed Central

Imashimizu, Masahiko; Afek, Ariel; Takahashi, Hiroki; Lubkowska, Lucyna; Lukatsky, David B.

2016-01-01

In the process of transcription elongation, RNA polymerase (RNAP) pauses at highly nonrandom positions across genomic DNA, broadly regulating transcription; however, molecular mechanisms responsible for the recognition of such pausing positions remain poorly understood. Here, using a combination of statistical mechanical modeling and high-throughput sequencing and biochemical data, we evaluate the effect of thermal fluctuations on the regulation of RNAP pausing. We demonstrate that diffusive backtracking of RNAP, which is biased by repetitive DNA sequence elements, causes transcriptional pausing. This effect stems from the increased microscopic heterogeneity of an elongation complex, and thus is entropy-dominated. This report shows a linkage between repetitive sequence elements encoded in the genome and regulation of RNAP pausing driven by thermal fluctuations. PMID:27830653

The repetitive landscape of the chicken genome

PubMed Central

Wicker, Thomas; Robertson, Jon S.; Schulze, Stefan R.; Feltus, F. Alex; Magrini, Vincent; Morrison, Jason A.; Mardis, Elaine R.; Wilson, Richard K.; Peterson, Daniel G.; Paterson, Andrew H.; Ivarie, Robert

2005-01-01

Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7× coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available. PMID:15256510
A Deluge of Complex Repeats: The Solanum Genome

PubMed Central

Mehra, Mrigaya; Gangwar, Indu; Shankar, Ravi

2015-01-01

Repetitive elements have lately emerged as key components of the genome, performing a variety of roles. It has now become necessary to have an account of repeats for every genome to understand its dynamics and state. Recently, genomes of two major Solanaceae species, Solanum tuberosum and Solanum lycopersicum, were sequenced. These species are important crops having high commercial significance as well as value as model species. However, there is a considerable gap in information about repetitive elements and their possible roles in genome regulation for these species. The present study was aimed at detailed identification and characterization of complex repetitive elements in these genomes, along with a study of their possible functional associations and an assessment of possibly transcriptionally active repetitive elements. In this study, it was found that ~50–60% of the genomes of S. tuberosum and S. lycopersicum were composed of repetitive elements. It was also found that complex repetitive elements were associated with >95% of genes in both species. These two genomes are mostly composed of LTR retrotransposons. Two novel repeat families very similar to LTR/ERV1 and LINE/RTE-BovB have been reported for the first time. Active existence of complex repeats was estimated by measuring their transcriptional abundance using Next Generation Sequencing read data and Microarray platforms. A considerable number of regulatory components like transcription factor binding sites and miRNAs appear to be under the influence of these complex repetitive elements in these species, while several genes appeared to possess exonized repeats. PMID:26241045
The landscape of transposable elements in the finished genome of the fungal wheat pathogen Mycosphaerella graminicola

USDA-ARS's Scientific Manuscript database

Repetitive sequence analysis has become an integral part of genome sequencing projects in addition to gene identification and annotation. Identification of repeats is important not only because it improves gene prediction, but also because of the role that repetitive sequences play in determining th...
The annotation of repetitive elements in the genome of channel catfish (Ictalurus punctatus).

PubMed

Yuan, Zihao; Zhou, Tao; Bao, Lisui; Liu, Shikai; Shi, Huitong; Yang, Yujia; Gao, Dongya; Dunham, Rex; Waldbieser, Geoff; Liu, Zhanjiang

2018-01-01

Channel catfish (Ictalurus punctatus) is a highly adaptive species and has been used as a research model for comparative immunology, physiology, and toxicology among ectothermic vertebrates. It is also economically important for aquaculture. As such, its reference genome was generated and annotated with protein coding genes. However, the repetitive elements in the catfish genome are less well understood. In this study, over 417.8 megabases (Mb) of repetitive elements were identified and characterized in the channel catfish genome. Among them, the DNA/TcMar-Tc1 transposons are the most abundant type, making up ~20% of the total repetitive elements, followed by the microsatellites (14%). The prevalence of repetitive elements, especially the mobile elements, may have provided a driving force for the evolution of the catfish genome. A number of catfish-specific repetitive elements were identified, including the previously reported Xba elements, whose divergence rate was relatively low: slower than that in untranslated regions of genes but faster than that of protein-coding sequences, suggesting that these elements are under evolutionary constraint.
The annotation of repetitive elements in the genome of channel catfish (Ictalurus punctatus)

PubMed Central

Yuan, Zihao; Zhou, Tao; Bao, Lisui; Liu, Shikai; Shi, Huitong; Yang, Yujia; Gao, Dongya; Dunham, Rex; Waldbieser, Geoff

2018-01-01

Channel catfish (Ictalurus punctatus) is a highly adaptive species and has been used as a research model for comparative immunology, physiology, and toxicology among ectothermic vertebrates. It is also economically important for aquaculture. As such, its reference genome was generated and annotated with protein coding genes. However, the repetitive elements in the catfish genome are less well understood. In this study, over 417.8 megabases (Mb) of repetitive elements were identified and characterized in the channel catfish genome. Among them, the DNA/TcMar-Tc1 transposons are the most abundant type, making up ~20% of the total repetitive elements, followed by the microsatellites (14%). The prevalence of repetitive elements, especially the mobile elements, may have provided a driving force for the evolution of the catfish genome. A number of catfish-specific repetitive elements were identified, including the previously reported Xba elements, whose divergence rate was relatively low: slower than that in untranslated regions of genes but faster than that of protein-coding sequences, suggesting that these elements are under evolutionary constraint. PMID:29763462
[Learning and Repetitive Reproduction of Memorized Sequences by the Right and the Left Hand].

PubMed

Bobrova, E V; Lyakhovetskii, V A; Bogacheva, I N

2015-01-01

An important stage of learning a new skill is repetitive reproduction of one and the same sequence of movements, which plays a significant role in forming movement stereotypes. Two groups of right-handers repeatedly memorized (6-10 repetitions) sequences in which the experimenter moved their hand through 6 positions, first with the right hand (RH) and then with the left hand (LH), or vice versa. Random sequences previously unknown to the volunteers were reproduced in the 11 series. Modified sequences were tested in the 2nd and 3rd series, where the same elements' positions were presented in a different order. The processes of repetitive sequence reproduction were similar for RH and LH. However, the learning of the modified sequences differed: information about the elements' positions, irrespective of the reproduction order, was used only when the LH initiated task performance. This information was not used when the LH followed the RH or when the RH performed the task. Consequently, the type of information coding activated by the LH helped in learning the positions of sequence elements, while the type of information coding activated by the RH prevented learning. This is presumably connected with the predominant role of the right hemisphere in the processes of positional coding and motor learning.
Development and validation of an rDNA operon based primer walking strategy applicable to de novo bacterial genome finishing

PubMed Central

Eastman, Alexander W.; Yuan, Ze-Chun

2015-01-01

Advances in sequencing technology have drastically increased the depth and feasibility of bacterial genome sequencing. However, little information is available that details the specific techniques and procedures employed during genome sequencing despite the large numbers of published genomes. Shotgun approaches employed by second-generation sequencing platforms have necessitated the development of robust bioinformatics tools for in silico assembly, and complete assembly is limited by the presence of repetitive DNA sequences and multi-copy operons. Typically, re-sequencing with multiple platforms and laborious, targeted Sanger sequencing are employed to finish a draft bacterial genome. Here we describe a novel strategy based on the identification and targeted sequencing of repetitive rDNA operons to expedite bacterial genome assembly and finishing. Our strategy was validated by finishing the genome of Paenibacillus polymyxa strain CR1, a bacterium with potential in sustainable agriculture and bio-based processes. An analysis of the 38 contigs contained in the P. polymyxa strain CR1 draft genome revealed 12 repetitive rDNA operons with varied intragenic and flanking regions of variable length, all located at contig boundaries and within contig gaps. These highly similar but not identical rDNA operons were experimentally verified and sequenced simultaneously with multiple, specially designed primer sets. This approach also identified and corrected significant sequence rearrangement generated during the initial in silico assembly of sequencing reads. Our approach reduces the effort associated with blind primer walking for contig assembly, increasing both the speed and feasibility of genome finishing. Our study further reinforces the notion that repetitive DNA elements are major limiting factors for genome finishing. Moreover, we provide a step-by-step workflow for genome finishing, which may guide future bacterial genome finishing projects. PMID:25653642
A novel species-specific tandem repeat DNA family from Sinapis arvensis: detection of telomere-like sequences.

PubMed

Kapila, R; Das, S; Srivastava, P S; Lakshmikumaran, M

1996-08-01

DNA sequences representing a tandemly repeated DNA family of the Sinapis arvensis genome were cloned and characterized. The 700-bp tandem repeat family is represented by two clones, pSA35 and pSA52, which are 697 and 709 bp in length, respectively. Dot matrix analysis of the sequences indicates the presence of repeated elements within each monomeric unit. Sequence analysis of the repetitive region of clones pSA35 and pSA52 shows that there are several copies of a 7-bp repeat element organized in tandem. The consensus sequence of this repeat element is 5'-TTTAGGG-3'. These elements are highly mutated and the difference in length between the two clones is due to different copy numbers of these elements. The repetitive region of clone pSA35 has 26 copies of the element TTTAGGG, whereas clone pSA52 has 28 copies. The repetitive region in both clones is flanked on either side by inverted repeats that may be footprints of a transposition event. Sequence comparison indicates that the element TTTAGGG is identical to telomeric repeats present in Arabidopsis, maize, tomato, and other plants. However, Bal31 digestion kinetics indicates non-telomeric localization of the 700-bp tandem repeats. The clones represent a novel repeat family as (i) they contain telomere-like motifs as subrepeats within each unit; and (ii) they do not hybridize to related crucifers and are species-specific in nature.
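The copy counts reported above (26 and 28 copies of TTTAGGG) come from the authors' sequence analysis of the clones. As a minimal sketch of how tandem copies of a short motif can be counted, the helpers below use exact string matching on a made-up toy sequence. The function names are hypothetical, and because the abstract describes the elements as highly mutated, exact matching like this would likely undercount real data.

```python
import re


def longest_tandem_run(sequence: str, motif: str = "TTTAGGG") -> int:
    """Largest number of back-to-back exact copies of `motif` in `sequence`."""
    pattern = re.compile(f"(?:{re.escape(motif)})+")
    runs = [
        len(match.group(0)) // len(motif)
        for match in pattern.finditer(sequence.upper())
    ]
    return max(runs, default=0)


def total_copies(sequence: str, motif: str = "TTTAGGG") -> int:
    """Number of non-overlapping exact copies of `motif` anywhere in `sequence`."""
    return sequence.upper().count(motif)


# Toy sequence (not from the paper): a run of three copies, a spacer, one more copy.
toy = "ACGT" + "TTTAGGG" * 3 + "CC" + "TTTAGGG" + "A"
```

On the toy sequence, `longest_tandem_run(toy)` finds the run of three and `total_copies(toy)` counts all four copies.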
Efficacy of Pulsed-Field Gel Electrophoresis and Repetitive Element Sequence-Based PCR in Typing of Salmonella Isolates from Assam, India.

PubMed

Gogoi, Purnima; Borah, Probodh; Hussain, Iftikar; Das, Leena; Hazarika, Girin; Tamuly, Shantanu; Barkalita, Luit Moni

2018-05-01

A total of 12 Salmonella isolates belonging to different serovars, viz., Salmonella enterica serovar Enteritidis (n = 4), Salmonella enterica serovar Weltevreden (n = 4), Salmonella enterica serovar Newport (n = 1), Salmonella enterica serovar Litchfield (n = 1), and untypeable strains (n = 2), were isolated from 332 diarrheic fecal samples collected from animals, birds, and humans. Of the two molecular typing methods applied, viz., repetitive element sequence-based PCR (REP-PCR) and pulsed-field gel electrophoresis (PFGE), PFGE could clearly differentiate the strains belonging to different serovars as well as differentiate between strains of the same serovar with respect to their source of isolation, whereas REP-PCR could not differentiate between strains of the same serovar. Thus, it can be suggested that PFGE is more useful and appropriate for molecular typing of Salmonella isolates during epidemiological investigations than REP-PCR. Copyright © 2018 American Society for Microbiology.
On the roles of repetitive DNA elements in the context of a unified genomic-epigenetic system.

PubMed

von Sternberg, Richard

2002-12-01

Repetitive DNA sequences comprise a substantial portion of most eukaryotic and some prokaryotic chromosomes. Despite nearly forty years of research, the functions of various sequence families as a whole and their monomer units remain largely unknown. The inability to map specific functional roles onto many repetitive DNA elements (REs), coupled with the taxon-specificity of sequence families, has led many to speculate that these genomic components are "selfish" replicators generating genomic "junk." The purpose of this paper is to critically examine the selfishness, evolutionary effects, and functionality of REs. First, a brief overview of the range of ideas pertaining to RE function is presented. Second, the argument is presented that the selfish DNA "hypothesis" is actually a narrative scheme, that it serves to protect neo-Darwinian assumptions from criticism, and that this story is untestable and therefore not a hypothesis. Third, attempts to synthesize the selfish DNA concept with complex systems models of the genome and RE functionality are critiqued. Fourth, the supposed connection between RE-induced mutations and macroevolutionary events is stated to be at variance with empirical evidence and theoretical considerations. Hypotheses that base phylogenetic transitions on repetitive sequence changes thus remain speculative. Fifth and finally, the case is made for viewing REs as integrally functional components of chromosomes, genomes, and cells. It is argued throughout that a new conceptual framework is needed for understanding the roles of repetitive DNA in genomic/epigenetic systems, and that neo-Darwinian "narratives" have been the primary obstacle to elucidating the effects of these enigmatic components of chromosomes.
Chromosome Evolution in Connection with Repetitive Sequences and Epigenetics in Plants.

PubMed

Li, Shu-Fen; Su, Ting; Cheng, Guang-Qian; Wang, Bing-Xiao; Li, Xu; Deng, Chuan-Liang; Gao, Wu-Jun

2017-10-24

Chromosome evolution is a fundamental aspect of evolutionary biology. The evolution of chromosome size, structure and shape, number, and the change in DNA composition suggest the high plasticity of nuclear genomes at the chromosomal level. Repetitive DNA sequences, which represent a conspicuous fraction of every eukaryotic genome, particularly in plants, are found to be tightly linked with plant chromosome evolution. Different classes of repetitive sequences have distinct distribution patterns on the chromosomes. Mounting evidence shows that repetitive sequences may play multiple generative roles in shaping the chromosome karyotypes in plants. Furthermore, recent development in our understanding of the repetitive sequences and plant chromosome evolution has elucidated the involvement of a spectrum of epigenetic modification. In this review, we focused on the recent evidence relating to the distribution pattern of repetitive sequences in plant chromosomes and highlighted their potential relevance to chromosome evolution in plants. We also discussed the possible connections between evolution and epigenetic alterations in chromosome structure and repatterning, such as heterochromatin formation, centromere function, and epigenetic-associated transposable element inactivation.
Identification of both copy number variation-type and constant-type core elements in a large segmental duplication region of the mouse genome

PubMed Central

2013-01-01

Background Copy number variation (CNV), an important source of diversity in genomic structure, is frequently found in clusters called CNV regions (CNVRs). CNVRs are strongly associated with segmental duplications (SDs), but the composition of these complex repetitive structures remains unclear. Results We conducted self-comparative-plot analysis of all mouse chromosomes using the high-speed and large-scale-homology search algorithm SHEAP. For eight chromosomes, we identified various types of large SD as tartan-checked patterns within the self-comparative plots. A complex arrangement of diagonal split lines in the self-comparative-plots indicated the presence of large homologous repetitive sequences. We focused on one SD on chromosome 13 (SD13M), and developed SHEPHERD, a stepwise ab initio method, to extract longer repetitive elements and to characterize repetitive structures in this region. Analysis using SHEPHERD showed the existence of 60 core elements, which were expected to be the basic units that form SDs within the repetitive structure of SD13M. The demonstration that sequences homologous to the core elements (>70% homology) covered approximately 90% of the SD13M region indicated that our method can characterize the repetitive structure of SD13M effectively. Core elements were composed largely of fragmented repeats of a previously identified type, such as long interspersed nuclear elements (LINEs), together with partial genic regions. Comparative genome hybridization array analysis showed that whereas 42 core elements were components of CNVR that varied among mouse strains, 8 did not vary among strains (constant type), and the status of the others could not be determined. The CNV-type core elements contained significantly larger proportions of long terminal repeat (LTR) types of retrotransposon than the constant-type core elements, which had no CNV. 
The higher divergence rates observed in the CNV-type core elements than in the constant type indicate that the CNV-type core elements have a longer evolutionary history than constant-type core elements in SD13M. Conclusions Our methodology for the identification of repetitive core sequences simplifies characterization of the structures of large SDs and detailed analysis of CNV. The results of detailed structural and quantitative analyses in this study might help to elucidate the biological role of one of the SDs on chromosome 13. PMID:23834397
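The self-comparative plots described above were produced with the authors' SHEAP algorithm, which is not reproduced here. As a rough illustration of the general idea only, the sketch below builds a generic k-mer self dot plot: every pair of positions in one sequence that share an identical k-mer becomes an off-diagonal point, and runs of such points along a diagonal indicate a repeated segment. The function name, the value of `k`, and the toy sequence are assumptions for illustration.

```python
from collections import defaultdict


def self_dot_plot(sequence: str, k: int = 4) -> list:
    """Return sorted (i, j) pairs, i < j, where the k-mers at i and j are identical.

    This is a plain exact-match k-mer comparison of a sequence against
    itself, not the SHEAP or SHEPHERD methods named in the abstract.
    """
    positions = defaultdict(list)
    for start in range(len(sequence) - k + 1):
        positions[sequence[start:start + k]].append(start)

    hits = []
    for starts in positions.values():
        # Every pair of occurrences of the same k-mer is one off-diagonal point.
        for a in range(len(starts)):
            for b in range(a + 1, len(starts)):
                hits.append((starts[a], starts[b]))
    return sorted(hits)


# Toy sequence (hypothetical): "ACGT" occurs at positions 0 and 6.
toy_hits = self_dot_plot("ACGTTTACGT", k=4)
```

The pairwise loop is quadratic in the number of occurrences of each k-mer, so this form is only suitable for short illustrative sequences.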
Comparative genome analysis of 52 fish species suggests differential associations of repetitive elements with their living aquatic environments.

PubMed

Yuan, Zihao; Liu, Shikai; Zhou, Tao; Tian, Changxu; Bao, Lisui; Dunham, Rex; Liu, Zhanjiang

2018-02-13

Repetitive elements make up significant proportions of genomes. However, their roles in evolution remain largely unknown. To provide insights into the roles of repetitive elements in fish genomes, we conducted a comparative analysis of repetitive elements of 52 fish species in 22 orders in relation to their living aquatic environments. The proportions of repetitive elements in various genomes were found to be positively correlated with genome sizes, with a few exceptions. More importantly, there appeared to be specific enrichment between some repetitive element categories with species habitat. Specifically, class II transposons appear to be more abundant in freshwater bony fish than in marine bony fish when phylogenetic relationship is not considered. In contrast, marine bony fish harbor more tandem repeats than freshwater species. In addition, class I transposons appear to be more abundant in primitive species such as cartilaginous fish and lamprey than in bony fish. The enriched association of specific categories of repetitive elements with fish habitats suggests the importance of repetitive elements in genome evolution and their potential roles in fish adaptation to their living environments. However, due to the restriction of the limited sequenced species, further analysis needs to be done to alleviate the phylogenetic biases.
Repetitive DNA in the pea (Pisum sativum L.) genome: comprehensive characterization using 454 sequencing and comparison to soybean and Medicago truncatula

PubMed Central

Macas, Jiří; Neumann, Pavel; Navrátilová, Alice

2007-01-01

Background Extraordinary size variation of higher plant nuclear genomes is in large part caused by differences in accumulation of repetitive DNA. This makes repetitive DNA of great interest for studying the molecular mechanisms shaping architecture and function of complex plant genomes. However, due to methodological constraints of conventional cloning and sequencing, a global description of repeat composition is available for only a very limited number of higher plants. In order to provide further data required for investigating evolutionary patterns of repeated DNA within and between species, we used a novel approach based on massive parallel sequencing which allowed a comprehensive repeat characterization in our model species, garden pea (Pisum sativum). Results Analysis of 33.3 Mb sequence data resulted in quantification and partial sequence reconstruction of major repeat families occurring in the pea genome with at least thousands of copies. Our results showed that the pea genome is dominated by LTR-retrotransposons, estimated at 140,000 copies/1C. Ty3/gypsy elements are less diverse and accumulated to higher copy numbers than Ty1/copia. This is in part due to a large population of Ogre-like retrotransposons which alone make up over 20% of the genome. In addition to numerous types of mobile elements, we have discovered a set of novel satellite repeats and two additional variants of telomeric sequences. Comparative genome analysis revealed that there are only a few repeat sequences conserved between pea and soybean genomes. On the other hand, all major families of pea mobile elements are well represented in M. truncatula. Conclusion We have demonstrated that even in a species with a relatively large genome like pea, where a single 454-sequencing run provided only 0.77% coverage, the generated sequences were sufficient to reconstruct and analyze major repeat families corresponding to a total of 35–48% of the genome. 
These data provide a starting point for further investigations of legume plant genomes based on their global comparative analysis and for the development of more sophisticated approaches for data mining. PMID:18031571
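The figures in the abstract above can be cross-checked with simple arithmetic: if 33.3 Mb of sequence corresponds to 0.77% coverage, the implied genome size is 33.3 / 0.0077, roughly 4,300 Mb. The sketch below performs that calculation and converts the reported 35–48% repeat fraction into megabases. The function names are hypothetical, and the derived numbers are back-of-the-envelope estimates, not values stated by the authors.

```python
def implied_genome_size_mb(sequenced_mb: float, coverage_percent: float) -> float:
    """Genome size implied by a sequenced amount and its percent coverage."""
    return sequenced_mb / (coverage_percent / 100.0)


def fraction_to_mb(genome_mb: float, percent: float) -> float:
    """Convert a percentage of the genome into megabases."""
    return genome_mb * percent / 100.0


# Values taken from the abstract: 33.3 Mb of reads at 0.77% coverage.
genome_mb = implied_genome_size_mb(33.3, 0.77)

# Reported repeat fraction of 35-48%, expressed in Mb of the implied genome.
repeats_low_mb = fraction_to_mb(genome_mb, 35)
repeats_high_mb = fraction_to_mb(genome_mb, 48)
```

Under these assumptions the reconstructed repeat families would correspond to roughly 1.5 to 2.1 Gb of the pea genome.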
Chromosome Evolution in Connection with Repetitive Sequences and Epigenetics in Plants

PubMed Central

Li, Shu-Fen; Su, Ting; Cheng, Guang-Qian; Wang, Bing-Xiao; Li, Xu; Deng, Chuan-Liang; Gao, Wu-Jun

2017-01-01

Chromosome evolution is a fundamental aspect of evolutionary biology. The evolution of chromosome size, structure and shape, number, and the change in DNA composition suggest the high plasticity of nuclear genomes at the chromosomal level. Repetitive DNA sequences, which represent a conspicuous fraction of every eukaryotic genome, particularly in plants, are found to be tightly linked with plant chromosome evolution. Different classes of repetitive sequences have distinct distribution patterns on the chromosomes. Mounting evidence shows that repetitive sequences may play multiple generative roles in shaping the chromosome karyotypes in plants. Furthermore, recent development in our understanding of the repetitive sequences and plant chromosome evolution has elucidated the involvement of a spectrum of epigenetic modification. In this review, we focused on the recent evidence relating to the distribution pattern of repetitive sequences in plant chromosomes and highlighted their potential relevance to chromosome evolution in plants. We also discussed the possible connections between evolution and epigenetic alterations in chromosome structure and repatterning, such as heterochromatin formation, centromere function, and epigenetic-associated transposable element inactivation. PMID:29064432
The Repeat Sequences and Elevated Substitution Rates of the Chloroplast accD Gene in Cupressophytes

PubMed Central

Li, Jia; Su, Yingjuan; Wang, Ting

2018-01-01

The plastid accD gene encodes a subunit of the acetyl-CoA carboxylase (ACCase) enzyme. The accD gene is thought to have expanded in length in Cryptomeria japonica, Taiwania cryptomerioides, Cephalotaxus, Taxus chinensis, and Podocarpus lambertii, and the main reason for this phenomenon was the existence of tandemly repeated sequences. However, it is still unknown whether the accD gene length in other cupressophytes has expanded. Here, in order to investigate how widespread this phenomenon is, 18 accD sequences and their surrounding regions from cupressophytes were sequenced and analyzed. Together with 39 GenBank sequences, our taxon sampling covered all the extant gymnosperm orders. The repetitive elements and substitution rates of accD among 57 gymnosperm species were analyzed. The results show that: (1) the reading frame length of the accD gene in the 18 cupressophyte species has also expanded; (2) many repetitive elements were identified in the accD gene of cupressophyte lineages; (3) the synonymous and non-synonymous substitution rates of accD were accelerated in cupressophytes; and (4) accD was located at rearrangement endpoints. These results suggest that repetitive elements may mediate chloroplast genome rearrangement and accelerate the substitution rates. PMID:29731764
Organization and evolution of highly repeated satellite DNA sequences in plant chromosomes.

PubMed

Sharma, S; Raina, S N

2005-01-01

A major component of the plant nuclear genome is constituted by different classes of repetitive DNA sequences. The structural, functional and evolutionary aspects of the satellite repetitive DNA families, and their organization in the chromosomes, are reviewed. The tandem satellite DNA sequences exhibit characteristic chromosomal locations, usually at subtelomeric and centromeric regions. The repetitive DNA family(ies) may be widely distributed in a taxonomic family or a genus, or may be specific for a species, genome or even a chromosome. They may acquire large-scale variations in their sequence and copy number over an evolutionary time-scale. These features have formed the basis of extensive utilization of repetitive sequences for taxonomic and phylogenetic studies. Hybrid polyploids have especially proven to be excellent models for studying the evolution of repetitive DNA sequences. Recent studies explicitly show that some repetitive DNA families localized at the telomeres and centromeres have acquired important structural and functional significance. The repetitive elements are under different evolutionary constraints as compared to the genes. Satellite DNA families are thought to arise de novo as a consequence of molecular mechanisms such as unequal crossing over, rolling circle amplification, replication slippage and mutation that constitute "molecular drive". Copyright 2005 S. Karger AG, Basel.
A retrotransposable element from the mosquito Anopheles gambiae.

PubMed Central

Besansky, N J

1990-01-01

A family of middle repetitive elements from the African malaria vector Anopheles gambiae is described. Approximately 100 copies of the element, designated T1Ag, are dispersed in the genome. Full-length elements are 4.6 kilobase pairs in length, but truncation of the 5' end is common. Nucleotide sequences of one full-length, two 5'-truncated, and two 5' ends of T1Ag elements were determined and aligned to define a consensus sequence. Sequence analysis revealed two long, overlapping open reading frames followed by a polyadenylation signal, AATAAA, and a tail consisting of tandem repetitions of the motif TGAAA. No direct or inverted long terminal repeats (LTRs) were detected. The first open reading frame, 442 amino acids in length, includes a domain resembling that of nucleic acid-binding proteins. The second open reading frame, 975 amino acids long, resembles the reverse transcriptases of a category of retrotransposable elements without LTRs, variously termed class II retrotransposons, class III elements or non-LTR retrotransposons. Similarity at the sequence and structural levels places T1Ag in this category. PMID:1689457
Molecular and bioinformatic analysis of the FB-NOF transposable element.

PubMed

Badal, Martí; Portela, Anna; Xamena, Noel; Cabré, Oriol

2006-04-12

The Drosophila melanogaster transposable element FB-NOF is known to play a role in genome plasticity through the generation of all sorts of genomic rearrangements. Moreover, several insertional mutants due to FB mobilizations have been reported. Its structure and sequence, however, have been poorly studied, mainly as a consequence of the long, complex and repetitive sequence of FB inverted repeats. This repetitive region is composed of several 154 bp blocks, each with five almost identical repeats. In this paper, we report the sequencing process of 2 kb long FB inverted repeats of a complete FB-NOF element, with high precision and reliability. This achievement has been possible using a new map of the FB repetitive region, which identifies unambiguously each repeat with new features that can be used as landmarks. With this new vision of the element, a list of FB-NOF elements in the D. melanogaster genomic clones has been compiled, improving on previous works that used only bioinformatic algorithms. The availability of many FB and FB-NOF sequences allowed an analysis of the FB insertion sequences that showed no sequence specificity, but a preference for A/T rich sequences. The position of NOF within FB is also studied, revealing that it is always located after a second repeat in a random block. With the results of this analysis, we propose a model of transposition in which NOF jumps from FB to FB, using an unidentified transposase enzyme that should specifically recognize the second repeat end of the FB blocks.

Dfam: a database of repetitive DNA based on profile hidden Markov models.

PubMed

Wheeler, Travis J; Clements, Jody; Eddy, Sean R; Hubley, Robert; Jones, Thomas A; Jurka, Jerzy; Smit, Arian F A; Finn, Robert D

2013-01-01

We present a database of repetitive DNA elements, called Dfam (http://dfam.janelia.org). Many genomes contain a large fraction of repetitive DNA, much of which is made up of remnants of transposable elements (TEs). Accurate annotation of TEs enables research into their biology and can shed light on the evolutionary processes that shape genomes. Identification and masking of TEs can also greatly simplify many downstream genome annotation and sequence analysis tasks. The commonly used TE annotation tools RepeatMasker and Censor depend on sequence homology search tools such as cross_match and BLAST variants, as well as Repbase, a collection of known TE families each represented by a single consensus sequence. Dfam contains entries corresponding to all Repbase TE entries for which instances have been found in the human genome. Each Dfam entry is represented by a profile hidden Markov model, built from alignments generated using RepeatMasker and Repbase. When used in conjunction with the hidden Markov model search tool nhmmer, Dfam produces a 2.9% increase in coverage over consensus sequence search methods on a large human benchmark, while maintaining low false discovery rates, and coverage of the full human genome is 54.5%. The website provides a collection of tools and data views to support improved TE curation and annotation efforts. Dfam is also available for download in flat file format or in the form of MySQL table dumps.
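The contrast between a single consensus sequence and a profile built from many aligned copies can be illustrated with a toy example. The Python sketch below is not Dfam's or nhmmer's implementation: it assumes a gap-free alignment and uses a simple position-specific log-odds matrix (roughly, a profile HMM without insert or delete states). The alignment, the pseudocount, the uniform background, and all function names are hypothetical.

```python
import math
from collections import Counter

ALPHABET = "ACGT"

def consensus(alignment):
    """Most common base in each column of a gap-free alignment."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*alignment))

def build_profile(alignment, pseudocount=1.0, background=0.25):
    """Per-column log2-odds of each base against a uniform background."""
    profile = []
    for col in zip(*alignment):
        counts = Counter(col)
        total = len(col) + pseudocount * len(ALPHABET)
        profile.append({
            base: math.log2((counts[base] + pseudocount) / total / background)
            for base in ALPHABET
        })
    return profile

def profile_score(profile, seq):
    """Sum of column scores for a sequence of the same length as the profile."""
    return sum(column[base] for column, base in zip(profile, seq))

def identity(seq_a, seq_b):
    """Fraction of identical positions between two equal-length sequences."""
    return sum(a == b for a, b in zip(seq_a, seq_b)) / len(seq_a)

# Hypothetical aligned copies of one repeat family (column 4 is variable).
toy_alignment = ["ACGTAC", "ACGTAC", "ACGAAC", "ACGAAC", "TCGTAC"]
cons = consensus(toy_alignment)
profile = build_profile(toy_alignment)

# Both candidates differ from the consensus at exactly one position,
# but only the first uses a base actually observed in that column.
for candidate in ("ACGAAC", "ACCTAC"):
    print(candidate, identity(cons, candidate), profile_score(profile, candidate))
```

In this toy case the consensus cannot tell the two candidates apart, whereas the profile penalises a change in a conserved column more than one in a variable column.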
Evolution of the Largest Mammalian Genome.

PubMed

Evans, Ben J; Upham, Nathan S; Golding, Goeffrey B; Ojeda, Ricardo A; Ojeda, Agustina A

2017-06-01

The genome of the red vizcacha rat (Rodentia, Octodontidae, Tympanoctomys barrerae) is the largest of all mammals, and about double the size of their close relative, the mountain vizcacha rat Octomys mimax, even though the lineages that gave rise to these species diverged from each other only about 5 Ma. The mechanism for this rapid genome expansion is controversial, and hypothesized to be a consequence of whole genome duplication or accumulation of repetitive elements. To test these alternative but nonexclusive hypotheses, we gathered and evaluated evidence from whole transcriptome and whole genome sequences of T. barrerae and O. mimax. We recovered support for genome expansion due to accumulation of a diverse assemblage of repetitive elements, which represent about one half and one fifth of the genomes of T. barrerae and O. mimax, respectively, but we found no strong signal of whole genome duplication. In both species, repetitive sequences were rare in transcribed regions as compared with the rest of the genome, and mostly had no close match to annotated repetitive sequences from other rodents. These findings raise new questions about the genomic dynamics of these repetitive elements, their connection to widespread chromosomal fissions that occurred in the T. barrerae ancestor, and their fitness effects, including during the evolution of hypersaline dietary tolerance in T. barrerae. ©The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Chromosome arm-specific BAC end sequences permit comparative analysis of homoeologous chromosomes and genomes of polyploid wheat

PubMed Central

2012-01-01

Background Bread wheat, one of the world’s staple food crops, has the largest, highly repetitive and polyploid genome among the cereal crops. The wheat genome holds the key to crop genetic improvement against challenges such as climate change, environmental degradation, and water scarcity. To unravel the complex wheat genome, the International Wheat Genome Sequencing Consortium (IWGSC) is pursuing a chromosome- and chromosome arm-based approach to physical mapping and sequencing. Here we report on the use of a BAC library made from flow-sorted telosomic chromosome 3A short arm (t3AS) for marker development and analysis of sequence composition and comparative evolution of homoeologous genomes of hexaploid wheat. Results The end-sequencing of 9,984 random BACs from a chromosome arm 3AS-specific library (TaaCsp3AShA) generated 11,014,359 bp of high quality sequence from 17,591 BAC-ends with an average length of 626 bp. The sequence represents 3.2% of t3AS with an average DNA sequence read every 19 kb. Overall, 79% of the sequence consisted of repetitive elements, 1.38% as coding regions (estimated 2,850 genes) and another 19% of unknown origin. Comparative sequence analysis suggested that 70-77% of the genes present in both 3A and 3B were syntenic with model species. Among the transposable elements, gypsy/sabrina (12.4%) was the most abundant repeat and was significantly more frequent in 3A compared to homoeologous chromosome 3B. Twenty novel repetitive sequences were also identified using de novo repeat identification. BESs were screened to identify simple sequence repeats (SSR) and transposable element junctions. A total of 1,057 SSRs were identified with a density of one per 10.4 kb, and 7,928 junctions between transposable elements (TE) and other sequences were identified with a density of one per 1.39 kb. 
With the objective of enhancing the marker density of chromosome 3AS, oligonucleotide primers were successfully designed from 758 SSRs and 695 Insertion Site Based Polymorphisms (ISBPs). Of the 96 ISBP primer pairs tested, 28 (29%) were 3A-specific, compared to 17 (18%) of the 96 SSRs. Conclusion This work reports on the use of a wheat chromosome arm 3AS-specific BAC library for the targeted generation of sequence data from a particular region of the huge genome of wheat. A large quantity of sequences was generated from the A genome of hexaploid wheat for comparative genome analysis with homoeologous B and D genomes and other model grass genomes. Hundreds of molecular markers were developed from the 3AS arm-specific sequences; these and other sequences will be useful in gene discovery and physical mapping. PMID:22559868
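The summary figures reported in this abstract can be rechecked from the counts it quotes. The short Python sketch below redoes that arithmetic using only numbers given in the abstract; the variable names are ours.

```python
# Counts quoted in the abstract.
total_bp = 11_014_359      # high quality sequence generated
bac_ends = 17_591          # BAC-end reads
ssr_count = 1_057          # simple sequence repeats found
te_junctions = 7_928       # TE junctions found

mean_read_length_bp = total_bp / bac_ends             # reported as 626 bp
ssr_spacing_kb = total_bp / ssr_count / 1000          # reported as one per 10.4 kb
junction_spacing_kb = total_bp / te_junctions / 1000  # reported as one per 1.39 kb

# Share of tested primer pairs that were 3A-specific.
isbp_specific_pct = 100 * 28 / 96   # reported as 29%
ssr_specific_pct = 100 * 17 / 96    # reported as 18%

print(round(mean_read_length_bp), round(ssr_spacing_kb, 1), round(junction_spacing_kb, 2))
print(round(isbp_specific_pct), round(ssr_specific_pct))
```

The densities here are simply total sequenced bases divided by the number of features, which matches the figures as reported.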
Optimization of analytical parameters for inferring relationships among Escherichia coli isolates from repetitive-element PCR by maximizing correspondence with multilocus sequence typing data.

PubMed

Goldberg, Tony L; Gillespie, Thomas R; Singer, Randall S

2006-09-01

Repetitive-element PCR (rep-PCR) is a method for genotyping bacteria based on the selective amplification of repetitive genetic elements dispersed throughout bacterial chromosomes. The method has great potential for large-scale epidemiological studies because of its speed and simplicity; however, objective guidelines for inferring relationships among bacterial isolates from rep-PCR data are lacking. We used multilocus sequence typing (MLST) as a "gold standard" to optimize the analytical parameters for inferring relationships among Escherichia coli isolates from rep-PCR data. We chose 12 isolates from a large database to represent a wide range of pairwise genetic distances, based on the initial evaluation of their rep-PCR fingerprints. We conducted MLST with these same isolates and systematically varied the analytical parameters to maximize the correspondence between the relationships inferred from rep-PCR and those inferred from MLST. Methods that compared the shapes of densitometric profiles ("curve-based" methods) yielded consistently higher correspondence values between data types than did methods that calculated indices of similarity based on shared and different bands (maximum correspondences of 84.5% and 80.3%, respectively). Curve-based methods were also markedly more robust in accommodating variations in user-specified analytical parameter values than were "band-sharing coefficient" methods, and they enhanced the reproducibility of rep-PCR. Phylogenetic analyses of rep-PCR data yielded trees with high topological correspondence to trees based on MLST and high statistical support for major clades. These results indicate that rep-PCR yields accurate information for inferring relationships among E. coli isolates and that accuracy can be enhanced with the use of analytical methods that consider the shapes of densitometric profiles.
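The two families of similarity measures compared in this abstract can be sketched in a few lines of Python. The abstract does not name the specific coefficients used, so this sketch assumes the Dice coefficient as a representative band-sharing index and the Pearson correlation as a representative curve-based measure. The band positions and densitometric values are invented for illustration.

```python
import math

def dice_coefficient(bands_a, bands_b):
    """Band-sharing similarity: 2 * shared bands / total bands in both lanes."""
    a, b = set(bands_a), set(bands_b)
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))

def pearson_correlation(curve_a, curve_b):
    """Curve-based similarity: correlation of two densitometric profiles."""
    n = len(curve_a)
    if n == 0 or n != len(curve_b):
        raise ValueError("profiles must be non-empty and of equal length")
    mean_a = sum(curve_a) / n
    mean_b = sum(curve_b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(curve_a, curve_b))
    var_a = sum((x - mean_a) ** 2 for x in curve_a)
    var_b = sum((y - mean_b) ** 2 for y in curve_b)
    if var_a == 0 or var_b == 0:
        raise ValueError("a flat profile has no defined correlation")
    return cov / math.sqrt(var_a * var_b)

# Hypothetical fingerprints: binned band positions and intensity traces.
lane1_bands, lane2_bands = {1, 2, 3}, {2, 3, 4}
lane1_curve = [0.1, 0.9, 0.2, 0.7, 0.1]
lane2_curve = [0.2, 0.8, 0.3, 0.6, 0.1]

print(dice_coefficient(lane1_bands, lane2_bands))
print(pearson_correlation(lane1_curve, lane2_curve))
```

A band-based index depends on where the presence/absence threshold is set for each band, while the curve-based measure uses the whole intensity trace, which is one plausible reason for the robustness difference the authors report.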
The Organization of Repetitive DNA in the Genomes of Amazonian Lizard Species in the Family Teiidae.

PubMed

Carvalho, Natalia D M; Pinheiro, Vanessa S S; Carmo, Edson J; Goll, Leonardo G; Schneider, Carlos H; Gross, Maria C

2015-01-01

Repetitive DNA is the largest fraction of the eukaryote genome and comprises tandem and dispersed sequences. It presents variations in relation to its composition, number of copies, distribution, dynamics, and genome organization, and participates in the evolutionary diversification of different vertebrate species. Repetitive sequences are usually located in the heterochromatin of centromeric and telomeric regions of chromosomes, contributing to chromosomal structures. Therefore, the aim of this study was to physically map repetitive DNA sequences (5S rDNA, telomeric sequences, tropomyosin gene 1, and retroelements Rex1 and SINE) of mitotic chromosomes of Amazonian species of teiids (Ameiva ameiva, Cnemidophorus sp. 1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin) to understand their genome organization and karyotype evolution. The mapping of repetitive sequences revealed a distinct pattern in Cnemidophorus sp. 1, whereas the other species showed all sequences interspersed in the heterochromatic region. Physical mapping of the tropomyosin 1 gene was performed for the first time in lizards and showed that in addition to being functional, this gene has a structural function similar to the mapped repetitive elements as it is located preferentially in centromeric regions and termini of chromosomes. © 2016 S. Karger AG, Basel.
Genetic exchange between endogenous and exogenous LINE-1 repetitive elements in mouse cells.

PubMed Central

Belmaaza, A; Wallenburg, J C; Brouillette, S; Gusew, N; Chartrand, P

1990-01-01

The repetitive LINE (L1) elements of the mouse, which are present at about 10^5 copies per genome and share over 80% of sequence homology, were examined for their ability to undergo genetic exchange with exogenous L1 sequences. The exogenous L1 sequences, carried by a shuttle vector, consisted of an internal fragment from L1Md-A2, a previously described member of the L1 family of the mouse. Using an assay that does not require the reconstitution of a selectable marker we found that this vector, in either circular or linear form, acquired DNA sequences from endogenous L1 elements at a frequency of 10^-3 to 10^-4 per rescued vector. Physical analysis of the acquired L1 sequences revealed that distinct endogenous L1 elements acted as donors and that different subfamilies participated. These results demonstrate that L1 elements are readily capable of genetic exchange. Apart from gene conversion events, the acquisition of L1 sequences outside the region of homology suggested that a second mechanism was also involved in the genetic exchange. A model which accounts for this mechanism is presented and its potential implication on the rearrangement of L1 elements is discussed. PMID:1978749
A Repeat Look at Repeating Patterns

ERIC Educational Resources Information Center

Markworth, Kimberly A.

2016-01-01

A "repeating pattern" is a cyclical repetition of an identifiable core. Children in the primary grades usually begin pattern work with fairly simple patterns, such as AB, ABC, or ABB patterns. The unique letters represent unique elements, whereas the sequence of letters represents the core that is repeated. Based on color, shape,…
Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions.

PubMed

Nishizawa, M; Nishizawa, K

2000-10-01

The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions

PubMed Central

Nishizawa, Manami; Nishizawa, Kazuhisa

2000-01-01

The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed. PMID:11000273
Expanding probe repertoire and improving reproducibility in human genomic hybridization

PubMed Central

Dorman, Stephanie N.; Shirley, Ben C.; Knoll, Joan H. M.; Rogan, Peter K.

2013-01-01

Diagnostic DNA hybridization relies on probes composed of single copy (sc) genomic sequences. Sc sequences in probe design ensure high specificity and avoid cross-hybridization to other regions of the genome, which could lead to ambiguous results that are difficult to interpret. We examine how the distribution and composition of repetitive sequences in the genome affect sc probe performance. A divide and conquer algorithm was implemented to design sc probes. With this approach, sc probes can include divergent repetitive elements, which hybridize to unique genomic targets under higher stringency experimental conditions. Genome-wide custom probe sets were created for fluorescent in situ hybridization (FISH) and microarray genomic hybridization. The scFISH probes were developed for detection of copy number changes within small tumour suppressor genes and oncogenes. The microarrays demonstrated increased reproducibility by eliminating cross-hybridization to repetitive sequences adjacent to probe targets. The genome-wide microarrays exhibited lower median coefficients of variation (17.8%) for two HapMap family trios. The coefficients of variation of commercial probes within 300 nt of a repetitive element were 48.3% higher than the nearest custom probe. Furthermore, the custom microarray called a chromosome 15q11.2q13 deletion more consistently. This method for sc probe design increases probe coverage for FISH and lowers variability in genomic microarrays. PMID:23376933
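For readers unfamiliar with the reproducibility metric quoted above, the coefficient of variation is the standard deviation expressed as a percentage of the mean. The Python sketch below shows the calculation; it assumes the sample standard deviation (the abstract does not say which estimator was used), and the replicate intensities are invented.

```python
import statistics

def coefficient_of_variation(values):
    """Sample standard deviation as a percentage of the mean."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("coefficient of variation is undefined for a zero mean")
    return 100 * statistics.stdev(values) / mean

# Hypothetical replicate intensities for one probe across hybridizations.
replicate_intensities = [10.0, 12.0, 14.0]
print(coefficient_of_variation(replicate_intensities))
```

A lower value means the probe signal varies less between replicate hybridizations relative to its average intensity.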
Billions of basepairs of recently expanded, repetitive sequences are eliminated from the somatic genome during copepod development.

PubMed

Sun, Cheng; Wyngaard, Grace; Walton, D Brian; Wichman, Holly A; Mueller, Rachel Lockridge

2014-03-11

Chromatin diminution is the programmed deletion of DNA from presomatic cell or nuclear lineages during development, producing single organisms that contain two different nuclear genomes. Phylogenetically diverse taxa undergo chromatin diminution: some ciliates, nematodes, copepods, and vertebrates. In cyclopoid copepods, chromatin diminution occurs in taxa with massively expanded germline genomes; depending on species, germline genome sizes range from 15-75 Gb, 12-74 Gb of which are lost from pre-somatic cell lineages at germline-soma differentiation. This is more than an order of magnitude more sequence than is lost from other taxa. To date, the sequences excised from copepods have not been analyzed using large-scale genomic datasets, and the processes underlying germline genomic gigantism in this clade, as well as the functional significance of chromatin diminution, have remained unknown. Here, we used high-throughput genomic sequencing and qPCR to characterize the germline and somatic genomes of Mesocyclops edax, a freshwater cyclopoid copepod with a germline genome of ~15 Gb and a somatic genome of ~3 Gb. We show that most of the excised DNA consists of repetitive sequences that are either 1) verifiable transposable elements (TEs), or 2) non-simple repeats of likely TE origin. Repeat elements in both genomes are skewed towards younger (i.e. less divergent) elements. Excised DNA is a non-random sample of the germline repeat element landscape; younger elements, and high frequency DNA transposons and LINEs, are disproportionately eliminated from the somatic genome. Our results suggest that germline genome expansion in M. edax reflects explosive repeat element proliferation, and that billions of base pairs of such repeats are deleted from the somatic genome every generation. Thus, we hypothesize that chromatin diminution is a mechanism that controls repeat element load, and that this load can evolve to be divergent between tissue types within single organisms.
Billions of basepairs of recently expanded, repetitive sequences are eliminated from the somatic genome during copepod development

PubMed Central

2014-01-01

Background Chromatin diminution is the programmed deletion of DNA from presomatic cell or nuclear lineages during development, producing single organisms that contain two different nuclear genomes. Phylogenetically diverse taxa undergo chromatin diminution — some ciliates, nematodes, copepods, and vertebrates. In cyclopoid copepods, chromatin diminution occurs in taxa with massively expanded germline genomes; depending on species, germline genome sizes range from 15 – 75 Gb, 12–74 Gb of which are lost from pre-somatic cell lineages at germline – soma differentiation. This is more than an order of magnitude more sequence than is lost from other taxa. To date, the sequences excised from copepods have not been analyzed using large-scale genomic datasets, and the processes underlying germline genomic gigantism in this clade, as well as the functional significance of chromatin diminution, have remained unknown. Results Here, we used high-throughput genomic sequencing and qPCR to characterize the germline and somatic genomes of Mesocyclops edax, a freshwater cyclopoid copepod with a germline genome of ~15 Gb and a somatic genome of ~3 Gb. We show that most of the excised DNA consists of repetitive sequences that are either 1) verifiable transposable elements (TEs), or 2) non-simple repeats of likely TE origin. Repeat elements in both genomes are skewed towards younger (i.e. less divergent) elements. Excised DNA is a non-random sample of the germline repeat element landscape; younger elements, and high frequency DNA transposons and LINEs, are disproportionately eliminated from the somatic genome. Conclusions Our results suggest that germline genome expansion in M. edax reflects explosive repeat element proliferation, and that billions of base pairs of such repeats are deleted from the somatic genome every generation. 
Thus, we hypothesize that chromatin diminution is a mechanism that controls repeat element load, and that this load can evolve to be divergent between tissue types within single organisms. PMID:24618421
Heterochromatin and molecular characterization of DsmarMITE transposable element in the beetle Dichotomius schiffleri (Coleoptera: Scarabaeidae).

PubMed

Xavier, Crislaine; Cabral-de-Mello, Diogo Cavalcanti; de Moura, Rita Cássia

2014-12-01

Cytogenetic studies of the Neotropical beetle genus Dichotomius (Scarabaeinae, Coleoptera) have shown dynamism for centromeric constitutive heterochromatin sequences. In the present work we studied the chromosomes and isolated repetitive sequences of Dichotomius schiffleri aiming to contribute to the understanding of coleopteran genome/chromosomal organization. Dichotomius schiffleri presented a conserved karyotype and heterochromatin distribution in comparison to other species of the genus with 2n = 18, biarmed chromosomes, and pericentromeric C-positive blocks. Similarly to heterochromatin distributional patterns, the highly and moderately repetitive DNA fraction (C0t-1 DNA) was detected in pericentromeric areas, contrasting with the euchromatic mapping of an isolated TE (named DsmarMITE). After structural analyses, the DsmarMITE was classified as a non-autonomous element of the type miniature inverted-repeat transposable element (MITE) with terminal inverted repeats similar to Mariner elements of insects from different orders. The euchromatic distribution for DsmarMITE indicates that it does not play a part in the dynamics of constitutive heterochromatin sequences.
Complete Genome Sequence of Sporisorium scitamineum and Biotrophic Interaction Transcriptome with Sugarcane

PubMed Central

Benevenuto, Juliana; Peters, Leila P.; Carvalho, Giselle; Palhares, Alessandra; Quecine, Maria C.; Nunes, Filipe R. S.; Kmit, Maria C. P.; Wai, Alvan; Hausner, Georg; Aitken, Karen S.; Berkman, Paul J.; Fraser, James A.; Moolhuijzen, Paula M.; Coutinho, Luiz L.; Creste, Silvana; Vieira, Maria L. C.; Kitajima, João P.; Monteiro-Vitorello, Claudia B.

2015-01-01

Sporisorium scitamineum is a biotrophic fungus responsible for sugarcane smut, a disease spread worldwide. This study provides the complete sequence of individual chromosomes of S. scitamineum from telomere to telomere achieved by a combination of PacBio long reads and Illumina short reads sequence data, as well as a draft sequence of a second fungal strain. Comparative analysis to previously available sequences of another strain detected few polymorphisms among the three genomes. The novel complete sequence described herein allowed us to identify and annotate extended subtelomeric regions, repetitive elements and the mitochondrial DNA sequence. The genome comprises 19,979,571 bases, 6,677 genes encoding proteins, 111 tRNAs and 3 assembled copies of rDNA, out of our estimated number of copies as 130. Chromosomal reorganizations were detected when comparing to sequences of S. reilianum, the closest smut relative, potentially influenced by repeats of transposable elements. Repetitive elements may have also directed the linkage of the two mating-type loci. The fungal transcriptome profiling from in vitro and from interaction with sugarcane at two time points (early infection and whip emergence) revealed that 13.5% of the genes were differentially expressed in planta and particular to each developmental stage. Among them are plant cell wall degrading enzymes, proteases, lipases, chitin modification and lignin degradation enzymes, sugar transporters and transcriptional factors. The fungus also modulates transcription of genes related to surviving against reactive oxygen species and other toxic metabolites produced by the plant. Previously described effectors in smut/plant interactions were detected but some new candidates are proposed. Ten genomic islands harboring some of the candidate genes unique to S. scitamineum were expressed only in planta. RNAseq data were also used to support gene predictions. PMID:26065709
Identification and nucleotide sequence analysis of the repetitive DNA element in the genome of fish lymphocystis disease virus.

PubMed

Schnitzler, P; Delius, H; Scholz, J; Touray, M; Orth, E; Darai, G

1987-12-01

The genome of the fish lymphocystis disease virus (FLDV) was screened for the existence of repetitive DNA sequences using a defined and complete gene library of the viral genome (98 kbp) by DNA-DNA hybridization, heteroduplex analysis, and restriction fine mapping. A repetitive DNA sequence was detected at the coordinates 0.034 to 0.057 and 0.718 to 0.736 map units (m.u.) of the FLDV genome. The first region (0.034 to 0.057 m.u.) corresponds to the 5' terminus of the EcoRI FLDV DNA fragment B (0.034 to 0.165 m.u.) and the second region (0.718 to 0.736 m.u.) is identical to the EcoRI DNA fragment M of the viral genome. The DNA nucleotide sequence of the EcoRI FLDV DNA fragment M was determined. This analysis revealed the presence of many short direct and inverted repetitions, e.g., an 18-mer direct repetition (TTTAAAATTTAATTAA) that started at nucleotide positions 812 and 942 and a 14-mer inverted repeat (TTAAATTTAAATTT) at nucleotide positions 820 and 959. Only short open reading frames were detected within this region. The DNA repetitions are discussed as sequences that play a possible regulatory role for virus replication. Furthermore, hybridization experiments revealed that the repetitive DNA sequences are conserved in the genome of different strains of fish lymphocystis disease virus isolated from two species of Pleuronectidae (flounder and dab).
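Short direct and inverted repeats of the kind described in this abstract can be located with a simple k-mer index. The Python sketch below is a generic illustration, not the authors' method (the 1987 study predates such tooling). It assumes a direct repeat is the same k-mer occurring at two or more positions and an inverted repeat is a k-mer whose reverse complement also occurs in the sequence. The toy sequence is invented and is not FLDV sequence.

```python
from collections import defaultdict

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def kmer_positions(seq, k):
    """Map each k-mer to the 1-based positions where it starts."""
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i + 1)
    return positions

def direct_repeats(seq, k):
    """k-mers that start at two or more positions."""
    return {kmer: pos for kmer, pos in kmer_positions(seq, k).items() if len(pos) > 1}

def inverted_repeats(seq, k):
    """Pairs (k-mer, reverse complement) that both occur in the sequence."""
    positions = kmer_positions(seq, k)
    found = {}
    for kmer, pos in positions.items():
        rc = reverse_complement(kmer)
        # kmer < rc reports each pair once and skips self-complementary k-mers.
        if rc in positions and kmer < rc:
            found[(kmer, rc)] = (pos, positions[rc])
    return found

# Hypothetical 20-nt sequence containing one 4-mer direct repeat
# and one 8-mer inverted repeat.
toy = "ACGGTTTTACGGAAAACCGT"
print(direct_repeats(toy, 4))
print(inverted_repeats(toy, 8))
```

Exact k-mer matching is the simplest case; tolerating mismatches between repeat copies would need an alignment-based approach instead.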
[Short interspersed repetitive sequences (SINEs) and their use as a phylogenetic tool].

PubMed

Kramerov, D A; Vasetskiĭ, N S

2009-01-01

The data on one of the most common repetitive elements of eukaryotic genomes, short interspersed elements (SINEs), are reviewed. Their structure, origin, and functioning in the genome are discussed. The variation and abundance of these neutral genomic markers makes them a convenient and reliable tool for phylogenetic analysis. The main methods of such analysis are presented, and the potential and limitations of this approach are discussed using specific examples.
Transposon fingerprinting using low coverage whole genome shotgun sequencing in cacao (Theobroma cacao L.) and related species.

PubMed

Sveinsson, Saemundur; Gill, Navdeep; Kane, Nolan C; Cronk, Quentin

2013-07-24

Transposable elements (TEs) and other repetitive elements are a large and dynamically evolving part of eukaryotic genomes, especially in plants where they can account for a significant proportion of genome size. Their dynamic nature gives them the potential for use in identifying and characterizing crop germplasm. However, their repetitive nature makes them challenging to study using conventional methods of molecular biology. Next generation sequencing and new computational tools have greatly facilitated the investigation of TE variation within species and among closely related species. (i) We generated low-coverage Illumina whole genome shotgun sequencing reads for multiple individuals of cacao (Theobroma cacao) and related species. These reads were analysed using both an alignment/mapping approach and a de novo (graph based clustering) approach. (ii) A standard set of ultra-conserved orthologous sequences (UCOS) standardized TE data between samples and provided phylogenetic information on the relatedness of samples. (iii) The mapping approach proved highly effective within the reference species but underestimated TE abundance in interspecific comparisons relative to the de novo methods. (iv) Individual T. cacao accessions have unique patterns of TE abundance indicating that the TE composition of the genome is evolving actively within this species. (v) LTR/Gypsy elements are the most abundant, comprising c.10% of the genome. (vi) Within T. cacao the retroelement families show an order of magnitude greater sequence variability than the DNA transposon families. (vii) Theobroma grandiflorum has a similar TE composition to T. cacao, but the related genus Herrania is rather different, with LTRs making up a lower proportion of the genome, perhaps because of a massive presence (c. 20%) of distinctive low complexity satellite-like repeats in this genome. 
(i) Short read alignment/mapping to reference TE contigs provides a simple and effective method of investigating intraspecific differences in TE composition. It is not appropriate for comparing repetitive elements across the species boundaries, for which de novo methods are more appropriate. (ii) Individual T. cacao accessions have unique spectra of TE composition indicating active evolution of TE abundance within this species. TE patterns could potentially be used as a "fingerprint" to identify and characterize cacao accessions.
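The abstract says that read counts for ultra-conserved orthologous sequences (UCOS) were used to standardize TE data between samples, but it does not give the formula. The sketch below assumes the simplest possible form, dividing each TE family's mapped-read count by the sample's UCOS read count. The function name, family labels and read counts are all hypothetical and only illustrate the idea.

```python
def normalise_te_counts(te_reads, ucos_reads):
    """Express per-family TE read counts relative to a sample's UCOS read count.

    Assumption: standardization is a plain ratio; the study may have used
    a different scheme.
    """
    if ucos_reads <= 0:
        raise ValueError("ucos_reads must be positive")
    return {family: count / ucos_reads for family, count in te_reads.items()}


# Hypothetical mapped-read counts for one accession.
sample_a = normalise_te_counts(
    {"LTR/Gypsy": 12000, "LTR/Copia": 4000},
    ucos_reads=2000,
)
print(sample_a)
```

Under this assumption, ratios from samples sequenced to different depths become directly comparable, because both numerator and denominator scale with sequencing depth.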
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1987-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3575113
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1990-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2333227
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1988-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3368330

A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1989-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2654889
GREAM: A Web Server to Short-List Potentially Important Genomic Repeat Elements Based on Over-/Under-Representation in Specific Chromosomal Locations, Such as the Gene Neighborhoods, within or across 17 Mammalian Species

PubMed Central

Chandrashekar, Darshan Shimoga; Dey, Poulami; Acharya, Kshitish K.

2015-01-01

Background Genome-wide repeat sequences, such as LINEs, SINEs and LTRs share a considerable part of the mammalian nuclear genomes. These repeat elements seem to be important for multiple functions including the regulation of transcription initiation, alternative splicing and DNA methylation. But it is not possible to study all repeats and, hence, it would help to short-list before exploring their potential functional significance via experimental studies and/or detailed in silico analyses. Result We developed the ‘Genomic Repeat Element Analyzer for Mammals’ (GREAM) for analysis, screening and selection of potentially important mammalian genomic repeats. This web-server offers many novel utilities. For example, this is the only tool that can reveal a categorized list of specific types of transposons, retro-transposons and other genome-wide repetitive elements that are statistically over-/under-represented in regions around a set of genes, such as those expressed differentially in a disease condition. The output displays the position and frequency of identified elements within the specified regions. In addition, GREAM offers two other types of analyses of genomic repeat sequences: a) enrichment within chromosomal region(s) of interest, and b) comparative distribution across the neighborhood of orthologous genes. GREAM successfully short-listed a repeat element (MER20) known to contain functional motifs. In other case studies, we could use GREAM to short-list repetitive elements in the azoospermia factor a (AZFa) region of the human Y chromosome and those around the genes associated with rat liver injury. GREAM could also identify five over-represented repeats around some of the human and mouse transcription factor coding genes that had conserved expression patterns across the two species. 
Conclusion GREAM has been developed to provide an impetus to research on the role of repetitive sequences in mammalian genomes by offering easy selection of more interesting repeats in various contexts/regions. GREAM is freely available at http://resource.ibab.ac.in/GREAM/. PMID:26208093
Repetitive sequences in plant nuclear DNA: types, distribution, evolution and function.

PubMed

Mehrotra, Shweta; Goyal, Vinod

2014-08-01

Repetitive DNA sequences are a major component of eukaryotic genomes and may account for up to 90% of the genome size. They can be divided into minisatellite, microsatellite and satellite sequences. Satellite DNA sequences are considered to be a fast-evolving component of eukaryotic genomes, comprising tandemly-arrayed, highly-repetitive and highly-conserved monomer sequences. The monomer unit of satellite DNA is 150-400 base pairs (bp) in length. Repetitive sequences may be species- or genus-specific, and may be centromeric or subtelomeric in nature. They exhibit cohesive and concerted evolution caused by molecular drive, leading to high sequence homogeneity. Repetitive sequences accumulate variations in sequence and copy number during evolution; hence they are important tools for taxonomic and phylogenetic studies, and are known as "tuning knobs" in evolution. Therefore, knowledge of repetitive sequences assists our understanding of the organization, evolution and behavior of eukaryotic genomes. Repetitive sequences have cytoplasmic, cellular and developmental effects and play a role in chromosomal recombination. In the post-genomics era, with the introduction of next-generation sequencing technology, it is possible to evaluate complex genomes for analyzing repetitive sequences and deciphering the yet unknown functional potential of repetitive sequences. Copyright © 2014 The Authors. Production and hosting by Elsevier Ltd. All rights reserved.
Discrimination of probiotic Lactobacillus strains for poultry by repetitive sequenced-based PCR fingerprinting.

PubMed

Lee, Chin Mei; Sieo, Chin Chin; Cheah, Yoke-Kqueen; Abdullah, Norhani; Ho, Yin Wan

2012-02-01

Four repetitive element sequence-based polymerase chain reaction (rep-PCR) methods, namely repetitive extragenic palindromic PCR (REP-PCR), enterobacterial repetitive intergenic consensus PCR (ERIC-PCR), polytrinucleotide (GTG)₅-PCR and BOX-PCR, were evaluated for the molecular differentiation of 12 probiotic Lactobacillus strains previously isolated from the gastrointestinal tract of chickens and used as a multistrain probiotic. This study represents the first analysis of the comparative efficacy of these four rep-PCR methods and their combination (composite rep-PCR) in the molecular typing of Lactobacillus strains based on a discriminatory index (D). Species-specific and strain-specific profiles were observed from rep-PCR. From the numerical analysis of composite rep-PCR, BOX-PCR, (GTG)₅-PCR, REP-PCR and ERIC-PCR, D values of 0.9118, 0.9044, 0.8897, 0.8750 and 0.8529 respectively were obtained. Composite rep-PCR analysis was the most discriminative method, with eight Lactobacillus strains, namely L. brevis ATCC 14869(T), L. reuteri C 10, L. reuteri ATCC 23272(T), L. gallinarum ATCC 33199(T), L. salivarius ATCC 11741(T), L. salivarius I 24, L. panis JCM 11053(T) and L. panis C 17, being differentiated at the strain level. Composite rep-PCR analysis is potentially a useful fingerprinting method to discriminate probiotic Lactobacillus strains isolated from the gastrointestinal tract of chickens. Copyright © 2011 Society of Chemical Industry.
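The abstract ranks the typing methods by a discriminatory index (D) without stating how D was computed. A common choice in typing studies is the Hunter-Gaston formulation of Simpson's index of diversity, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where N is the number of strains and n_j the number of strains assigned to type j. The sketch below assumes that formulation; the function name and the example type labels are hypothetical and are not the study's data.

```python
from collections import Counter


def discriminatory_index(type_assignments):
    """Hunter-Gaston discriminatory index for a list of per-strain type labels.

    Assumption: this is the index the abstract refers to as D.
    """
    n = len(type_assignments)
    if n < 2:
        raise ValueError("need at least two strains")
    counts = Counter(type_assignments).values()
    # Probability that two strains drawn without replacement share a type.
    same_type = sum(c * (c - 1) for c in counts) / (n * (n - 1))
    return 1.0 - same_type


# Hypothetical fingerprint types for four strains; two share type "A".
d_example = discriminatory_index(["A", "A", "B", "C"])
print(round(d_example, 4))
```

A value of 1.0 means every strain received its own fingerprint type; lower values mean more strains were indistinguishable by that method.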
Molecular characterization of the short interspersed repetitive element SIRE in the six discrete typing units (DTUs) of Trypanosoma cruzi.

PubMed

Pavia, Paula X; Thomas, M Carmen; López, Manuel C; Puerta, Concepción J

2012-10-01

Repetitive sequences constitute an important proportion of the Trypanosoma cruzi genome; hence, they have been used as molecular markers and as amplification targets to detect the parasite via PCR. In this study, a molecular characterization of the SIRE repetitive element was performed in the six discrete typing units (DTUs) of T. cruzi. The results showed that this element, located on multiple chromosomes, was interspersed in the genome of all DTUs of the parasite. The presence of several motifs implicated in element insertion, duplication, and functionality suggests that SIRE could be an active element in the parasite genome. Of interest, there were SIRE-specific Alu I fragments that made it possible to discriminate DTU I from the other DTUs. Moreover, a UPGMA phenetic tree constructed from fragment-sharing Southern blot data showed that T. cruzi I isolates form a cluster separate from the T. cruzi II-VI isolates. When the relative number of SIRE copies was determined, a variation from 105 to 2,000 copies per haploid genome was observed among the different isolates, with no relationship to DTU. In all, these findings suggest that the SIRE sequence is a good target for parasite DNA amplification. Copyright © 2012 Elsevier Inc. All rights reserved.
Mutation in a primate-conserved retrotransposon reveals a noncoding RNA as a mediator of infantile encephalopathy

PubMed Central

Cartault, François; Munier, Patrick; Benko, Edgar; Desguerre, Isabelle; Hanein, Sylvain; Boddaert, Nathalie; Bandiera, Simonetta; Vellayoudom, Jeanine; Krejbich-Trotot, Pascale; Bintner, Marc; Hoarau, Jean-Jacques; Girard, Muriel; Génin, Emmanuelle; de Lonlay, Pascale; Fourmaintraux, Alain; Naville, Magali; Rodriguez, Diana; Feingold, Josué; Renouil, Michel; Munnich, Arnold; Westhof, Eric; Fähling, Michael; Lyonnet, Stanislas; Henrion-Caude, Alexandra

2012-01-01

The human genome is densely populated with transposons and transposon-like repetitive elements. Although the impact of these transposons and elements on human genome evolution is recognized, the significance of subtle variations in their sequence remains mostly unexplored. Here we report homozygosity mapping of an infantile neurodegenerative disease locus in a genetic isolate. Complete DNA sequencing of the 400-kb linkage locus revealed a point mutation in a primate-specific retrotransposon that was transcribed as part of a unique noncoding RNA, which was expressed in the brain. In vitro knockdown of this RNA increased neuronal apoptosis, consistent with the inappropriate dosage of this RNA in vivo and with the phenotype. Moreover, structural analysis of the sequence revealed a small RNA-like hairpin that was consistent with the putative gain of a functional site when mutated. We show here that a mutation in a unique transposable element-containing RNA is associated with lethal encephalopathy, and we suggest that RNAs that harbor evolutionarily recent repetitive elements may play important roles in human brain development. PMID:22411793
[Genome-scale sequence data processing and epigenetic analysis of DNA methylation].

PubMed

Wang, Ting-Zhang; Shan, Gao; Xu, Jian-Hong; Xue, Qing-Zhong

2013-06-01

BS-Seq is a recently developed approach for detecting cytosine DNA methylation (mC) and profiling DNA methylation on a genome scale; it is based on bisulfite conversion of genomic DNA combined with next-generation sequencing. The method can not only provide insight into differences in genome-scale DNA methylation among organisms, but also reveal the conservation of DNA methylation in all contexts and the nucleotide preference for different genomic regions, including genes, exons, and repetitive DNA sequences. It will be helpful for understanding the epigenetic impacts of cytosine DNA methylation on the regulation of gene expression and on maintaining the silence of repetitive sequences, such as transposable elements. In this paper, we introduce the preprocessing steps of DNA methylation data, by which cytosine (C) and guanine (G) in the reference sequence are converted to thymine (T) and adenine (A), respectively, and cytosine in reads is converted to thymine. We also comprehensively review the main content of DNA methylation analysis on the genomic scale: (1) cytosine methylation in different sequence contexts; (2) the distribution of genomic methylcytosine; (3) DNA methylation context and nucleotide preference; (4) DNA-protein interaction sites of DNA methylation; (5) the degree of cytosine methylation in the different structural elements of genes. DNA methylation analysis provides a powerful tool for epigenome studies in human and other species and for studying gene-environment interaction, and lays the theoretical basis for further development of disease diagnostics and therapeutics in human.
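The preprocessing step described in this abstract (C to T and G to A in the reference, C to T in reads) can be sketched in a few lines. This is a minimal illustration, assuming plain uppercase DNA strings and exact substring matching in place of a real aligner; the function names and toy sequences are hypothetical.

```python
def c_to_t(seq):
    """In silico bisulfite conversion: every C becomes T."""
    return seq.upper().replace("C", "T")


def g_to_a(seq):
    """Converted version of the reference for reads from the opposite strand: every G becomes A."""
    return seq.upper().replace("G", "A")


# Toy reference and read (hypothetical sequences).
reference = "ACGTCCGA"
ref_ct = c_to_t(reference)   # C-to-T converted reference
ref_ga = g_to_a(reference)   # G-to-A converted reference
read_ct = c_to_t("CGTC")     # read converted the same way as ref_ct

# Exact matching stands in for alignment in this sketch.
position = ref_ct.find(read_ct)
print(ref_ct, ref_ga, read_ct, position)
```

Converting both the reads and the reference removes the C/T mismatch that bisulfite treatment introduces, so placement is unbiased by methylation state; methylation is then called afterwards by comparing the original read with the original reference at each cytosine.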
Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics

PubMed Central

Ardui, Simon; Ameur, Adam; Vermeesch, Joris R; Hestand, Matthew S

2018-01-01

Short read massive parallel sequencing has emerged as a standard diagnostic tool in the medical setting. However, short read technologies have inherent limitations such as GC bias, difficulties mapping to repetitive elements, trouble discriminating paralogous sequences, and difficulties in phasing alleles. Long read single molecule sequencers resolve these obstacles. Moreover, they offer higher consensus accuracies and can detect epigenetic modifications from native DNA. The first commercially available long read single molecule platform was the RS system based on PacBio's single molecule real-time (SMRT) sequencing technology, which has since evolved into their RSII and Sequel systems. Here we capsulize how SMRT sequencing is revolutionizing constitutional, reproductive, cancer, microbial and viral genetic testing. PMID:29401301
The DL1 repeats in the genome of Diphyllobothrium latum.

PubMed

Usmanova, Nadezhda M; Kazakov, Vasiliy I

2010-07-01

Diphyllobothrium latum is a widespread intestinal parasite of great clinical relevance, but no sequences of its nuclear genome are available. In this paper, a repetitive element in the D. latum genome is described for the first time. The adult D. latum was obtained after expulsion from the intestine of a patient suffering from diphyllobothriasis. Genomic DNA was isolated from several proglottids of this individual. PstI restriction products of D. latum genomic DNA were sequenced. Polymerase chain reaction (PCR) amplification of these products using genomic DNA and selected primers was carried out. A cluster of a repetitive element, called DL1, was thereby discovered. To identify the beginning and end of the repeat precisely, a product of PCR amplification of D. latum genomic DNA with one specific primer was sequenced. The discussion presents several lines of evidence that the DL1 repeat is a member of the SINE family of retroposons.
Integrated in silico and biological validation of the blocking effect of Cot-1 DNA on Microarray-CGH.

PubMed

Kang, Seung-Hui; Park, Chan Hee; Jeung, Hei Cheul; Kim, Ki-Yeol; Rha, Sun Young; Chung, Hyun Cheol

2007-06-01

In array-CGH, various factors may act as variables influencing the result of experiments. Among them, Cot-1 DNA, which has been used as a repetitive sequence-blocking agent, may become an artifact-inducing factor in BAC array-CGH. To identify the effect of Cot-1 DNA on Microarray-CGH experiments, Cot-1 DNA was labeled directly and Microarray-CGH experiments were performed. The results confirmed that probes which hybridized more completely with Cot-1 DNA had a higher sequence similarity to the Alu element. Further, in the sex-mismatched Microarray-CGH experiments, the variation and intensity in the fluorescent signal were reduced in the high intensity probe group in which probes were better hybridized with Cot-1 DNA. Otherwise, those of the low intensity probe group showed no alterations regardless of Cot-1 DNA. These results confirmed by in silico methods that Cot-1 DNA could block repetitive sequences in gDNA and probes. In addition, it was confirmed biologically that the blocking effect of Cot-1 DNA could be presented via its repetitive sequences, especially Alu elements. Thus, in contrast to BAC-array CGH, the use of Cot-1 DNA is advantageous in controlling experimental variation in Microarray-CGH.
Genomic Repeat Abundances Contain Phylogenetic Signal

PubMed Central

Dodsworth, Steven; Chase, Mark W.; Kelly, Laura J.; Leitch, Ilia J.; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R.

2015-01-01

A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464
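To make the idea of repeat abundances as continuously varying characters concrete, the sketch below converts per-taxon read counts for repeat clusters into genome proportions and compares taxa with a Euclidean distance. The abstract does not state which tree-inference method was used, so the distance is only an illustrative stand-in, and the taxon names, cluster names and counts are hypothetical.

```python
import math


def to_proportions(counts):
    """Convert per-cluster read counts into proportions of all counted reads."""
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("counts must sum to a positive number")
    return {cluster: n / total for cluster, n in counts.items()}


def euclidean(a, b):
    """Euclidean distance between two abundance profiles (missing clusters count as 0)."""
    clusters = sorted(set(a) | set(b))
    return math.sqrt(sum((a.get(c, 0.0) - b.get(c, 0.0)) ** 2 for c in clusters))


# Hypothetical read counts per repeat cluster for three taxa.
taxa = {
    "taxon1": to_proportions({"CL1": 50, "CL2": 50}),
    "taxon2": to_proportions({"CL1": 500, "CL2": 500}),
    "taxon3": to_proportions({"CL1": 100, "CL2": 0}),
}
d12 = euclidean(taxa["taxon1"], taxa["taxon2"])
d13 = euclidean(taxa["taxon1"], taxa["taxon3"])
print(d12, d13)
```

Working with proportions rather than raw counts keeps the comparison independent of sequencing depth, which is why taxon1 and taxon2 come out identical here despite a tenfold difference in reads.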
Transposon fingerprinting using low coverage whole genome shotgun sequencing in Cacao (Theobroma cacao L.) and related species

PubMed Central

2013-01-01

Background Transposable elements (TEs) and other repetitive elements are a large and dynamically evolving part of eukaryotic genomes, especially in plants where they can account for a significant proportion of genome size. Their dynamic nature gives them the potential for use in identifying and characterizing crop germplasm. However, their repetitive nature makes them challenging to study using conventional methods of molecular biology. Next generation sequencing and new computational tools have greatly facilitated the investigation of TE variation within species and among closely related species. Results (i) We generated low-coverage Illumina whole genome shotgun sequencing reads for multiple individuals of cacao (Theobroma cacao) and related species. These reads were analysed using both an alignment/mapping approach and a de novo (graph based clustering) approach. (ii) A standard set of ultra-conserved orthologous sequences (UCOS) standardized TE data between samples and provided phylogenetic information on the relatedness of samples. (iii) The mapping approach proved highly effective within the reference species but underestimated TE abundance in interspecific comparisons relative to the de novo methods. (iv) Individual T. cacao accessions have unique patterns of TE abundance indicating that the TE composition of the genome is evolving actively within this species. (v) LTR/Gypsy elements are the most abundant, comprising c.10% of the genome. (vi) Within T. cacao the retroelement families show an order of magnitude greater sequence variability than the DNA transposon families. (vii) Theobroma grandiflorum has a similar TE composition to T. cacao, but the related genus Herrania is rather different, with LTRs making up a lower proportion of the genome, perhaps because of a massive presence (c. 20%) of distinctive low complexity satellite-like repeats in this genome. 
Conclusions (i) Short read alignment/mapping to reference TE contigs provides a simple and effective method of investigating intraspecific differences in TE composition. It is not appropriate for comparing repetitive elements across the species boundaries, for which de novo methods are more appropriate. (ii) Individual T. cacao accessions have unique spectra of TE composition indicating active evolution of TE abundance within this species. TE patterns could potentially be used as a “fingerprint” to identify and characterize cacao accessions. PMID:23883295
Mind the gap; seven reasons to close fragmented genome assemblies.

PubMed

Thomma, Bart P H J; Seidl, Michael F; Shi-Kunne, Xiaoqian; Cook, David E; Bolton, Melvin D; van Kan, Jan A L; Faino, Luigi

2016-05-01

Like other domains of life, research into the biology of filamentous microbes has greatly benefited from the advent of whole-genome sequencing. Next-generation sequencing (NGS) technologies have revolutionized sequencing, making genomic sciences accessible to many academic laboratories including those that study non-model organisms. Thus, hundreds of fungal genomes have been sequenced and are publicly available today, although these initiatives have typically yielded considerably fragmented genome assemblies that often lack large contiguous genomic regions. Many important genomic features are contained in intergenic DNA that is often missing in current genome assemblies, and recent studies underscore the significance of non-coding regions and repetitive elements for the life style, adaptability and evolution of many organisms. The study of particular types of genetic elements, such as telomeres, centromeres, repetitive elements, effectors, and clusters of co-regulated genes, but also of phenomena such as structural rearrangements, genome compartmentalization and epigenetics, greatly benefits from having a contiguous and high-quality, preferably even complete and gapless, genome assembly. Here we discuss a number of important reasons to produce gapless, finished, genome assemblies to help answer important biological questions. Copyright © 2015 Elsevier Inc. All rights reserved.
Structural analysis of the rDNA intergenic spacer of Brassica nigra: evolutionary divergence of the spacers of the three diploid Brassica species.

PubMed

Bhatia, S; Singh Negi, M; Lakshmikumaran, M

1996-11-01

EcoRI restriction of the B. nigra rDNA recombinants, isolated from a lambda genomic library, showed that the 3.9-kb fragment corresponded to the Intergenic Spacer (IGS), which was sequenced and found to be 3,928 bp in size. Sequence and dot-matrix analyses showed that the organization of the B. nigra rDNA IGS was typical of most rDNA spacers, consisting of a central repetitive region and flanking unique sequences on either side. The repetitive region was composed of two repeat families, RF 'A' and RF 'B'. The B. nigra RF 'A' consisted of a tandem array of three full-length copies of a 106-bp sequence element. RF 'B' was composed of 66 tandemly repeated elements. Each 'B' element was only 21 bp in size and this is the smallest repeat unit identified in plant rDNA to date. The putative transcription initiation site (TIS) was identified as nucleotide position 3,110. Based on the sequence analysis it was suggested that the present organization of the repeat families was generated by successive cycles of deletions and amplifications and was being maintained by homogenization processes such as gene conversion and crossing-over. A detailed comparison of the rDNA IGS sequences of the three diploid Brassica species, namely B. nigra, B. campestris, and B. oleracea, was carried out. First, comparisons revealed that B. campestris and B. oleracea were close to each other as the repeat families in both showed high sequence homology between each other. Second, the repeat elements in both the species were organized in an interspersed manner. Third, a 52-bp sequence, present just downstream of the repeats in B. campestris, was found to be identical to the B. oleracea repeats, thereby suggesting a common progenitor. On the other hand, in B. nigra no interspersion pattern of organization of repeats was observed. Further, the B. nigra RF 'A' was identified as distinct from the repeat families of B. campestris and B. oleracea. 
Based on this analysis, it was suggested that during speciation B. campestris and B. oleracea evolved in one lineage whereas B. nigra diverged into a separate lineage. The comparative analysis of the IGS helped in identifying not only conserved ancestral sequence motifs of possible functional significance such as promoters and enhancers, but also sequences which showed variation between the three diploid species and were therefore identified as species-specific sequences.
Harmonic Structure Predicts the Enjoyment of Uplifting Trance Music.

PubMed

Agres, Kat; Herremans, Dorien; Bigo, Louis; Conklin, Darrell

2016-01-01

An empirical investigation of how local harmonic structures (e.g., chord progressions) contribute to the experience and enjoyment of uplifting trance (UT) music is presented. The connection between rhythmic and percussive elements and resulting trance-like states has been highlighted by musicologists, but no research, to our knowledge, has explored whether repeated harmonic elements influence affective responses in listeners of trance music. Two alternative hypotheses are discussed, the first highlighting the direct relationship between repetition/complexity and enjoyment, and the second based on the theoretical inverted-U relationship described by the Wundt curve. We investigate the connection between harmonic structure and subjective enjoyment through interdisciplinary behavioral and computational methods: First we discuss an experiment in which listeners provided enjoyment ratings for computer-generated UT anthems with varying levels of harmonic repetition and complexity. The anthems were generated using a statistical model trained on a corpus of 100 uplifting trance anthems created for this purpose, and harmonic structure was constrained by imposing particular repetition structures (semiotic patterns defining the order of chords in the sequence) on a professional UT music production template. Second, the relationship between harmonic structure and enjoyment is further explored using two computational approaches, one based on average Information Content, and another that measures average tonal tension between chords. The results of the listening experiment indicate that harmonic repetition does in fact contribute to the enjoyment of uplifting trance music. More compelling evidence was found for the second hypothesis discussed above, however some maximally repetitive structures were also preferred. Both computational models provide evidence for a Wundt-type relationship between complexity and enjoyment. 
By systematically manipulating the structure of chord progressions, we have discovered specific harmonic contexts in which repetitive or complex structure contribute to the enjoyment of uplifting trance music.
Harmonic Structure Predicts the Enjoyment of Uplifting Trance Music

PubMed Central

Agres, Kat; Herremans, Dorien; Bigo, Louis; Conklin, Darrell

2017-01-01

An empirical investigation of how local harmonic structures (e.g., chord progressions) contribute to the experience and enjoyment of uplifting trance (UT) music is presented. The connection between rhythmic and percussive elements and resulting trance-like states has been highlighted by musicologists, but no research, to our knowledge, has explored whether repeated harmonic elements influence affective responses in listeners of trance music. Two alternative hypotheses are discussed, the first highlighting the direct relationship between repetition/complexity and enjoyment, and the second based on the theoretical inverted-U relationship described by the Wundt curve. We investigate the connection between harmonic structure and subjective enjoyment through interdisciplinary behavioral and computational methods: First we discuss an experiment in which listeners provided enjoyment ratings for computer-generated UT anthems with varying levels of harmonic repetition and complexity. The anthems were generated using a statistical model trained on a corpus of 100 uplifting trance anthems created for this purpose, and harmonic structure was constrained by imposing particular repetition structures (semiotic patterns defining the order of chords in the sequence) on a professional UT music production template. Second, the relationship between harmonic structure and enjoyment is further explored using two computational approaches, one based on average Information Content, and another that measures average tonal tension between chords. The results of the listening experiment indicate that harmonic repetition does in fact contribute to the enjoyment of uplifting trance music. More compelling evidence was found for the second hypothesis discussed above, however some maximally repetitive structures were also preferred. Both computational models provide evidence for a Wundt-type relationship between complexity and enjoyment. 
By systematically manipulating the structure of chord progressions, we have discovered specific harmonic contexts in which repetitive or complex structure contribute to the enjoyment of uplifting trance music. PMID:28119641
The diversity of sequence and chromosomal distribution of new transposable element-related segments in the rye genome revealed by FISH and lineage annotation

USDA-ARS's Scientific Manuscript database

The rye genome features a high percentage of repetitive elements, especially transposable elements (TEs). However, studies about the constitution and organization of TEs on rye chromosomes are limited. In this study, 97 unique TE segments were isolated and characterized; 50 TE segments showed varyin...
Identification of a non-LTR retrotransposon from the gypsy moth

Treesearch

K.J. Garner; J.M. Slavicek

1999-01-01

A family of highly repetitive elements, named LDT1, has been identified in the gypsy moth, Lymantria dispar. The complete element is 5.4 kb in length and lacks long-terminal repeats. The element contains two open reading frames with a significant amino acid sequence similarity to several non-LTR retrotransposons. The first open reading frame contains...
Impacts of visuomotor sequence learning methods on speed and accuracy: Starting over from the beginning or from the point of error.

PubMed

Tanaka, Kanji; Watanabe, Katsumi

2016-02-01

The present study examined whether sequence learning led to more accurate and shorter performance time if people who are learning a sequence start over from the beginning when they make an error (i.e., practice the whole sequence) or only from the point of error (i.e., practice a part of the sequence). We used a visuomotor sequence learning paradigm with a trial-and-error procedure. In Experiment 1, we found fewer errors, and shorter performance time for those who restarted their performance from the beginning of the sequence as compared to those who restarted from the point at which an error occurred, indicating better learning of spatial and motor representations of the sequence. This might be because the learned elements were repeated when the next performance started over from the beginning. In subsequent experiments, we increased the occasions for the repetitions of learned elements by modulating the number of fresh start points in the sequence after errors. The results showed that fewer fresh start points were likely to lead to fewer errors and shorter performance time, indicating that the repetitions of learned elements enabled participants to develop stronger spatial and motor representations of the sequence. Thus, a single or two fresh start points in the sequence (i.e., starting over only from the beginning or from the beginning or midpoint of the sequence after errors) is likely to lead to more accurate and faster performance. Copyright © 2016 Elsevier B.V. All rights reserved.
Transposable element junctions in marker development and genomic characterization of barley

USDA-ARS's Scientific Manuscript database

Barley is a model plant in genomic studies of Triticeae species. A complete barley genome sequence will facilitate not only barley breeding programs, but also those for related species. However, the large genome size and high repetitive sequence content complicate the barley genome assembly. The ma...

Repetitive DNA loci and their modulation by the non-canonical nucleic acid structures R-loops and G-quadruplexes

PubMed Central

Hall, Amanda C.; Ostrowski, Lauren A.; Mekhail, Karim

2017-01-01

Cells have evolved intricate mechanisms to maintain genome stability despite allowing mutational changes to drive evolutionary adaptation. Repetitive DNA sequences, which represent the bulk of most genomes, are a major threat to genome stability often driving chromosome rearrangements and disease. The major source of repetitive DNA sequences and thus the most vulnerable constituents of the genome are the ribosomal DNA (rDNA) repeats, telomeres, and transposable elements. Maintaining the stability of these loci is critical to overall cellular fitness and lifespan. Therefore, cells have evolved mechanisms to regulate rDNA copy number, telomere length and transposon activity, as well as DNA repair at these loci. In addition, non-canonical structure-forming DNA motifs can also modulate the function of these repetitive DNA loci by impacting their transcription, replication, and stability. Here, we discuss key mechanisms that maintain rDNA repeats, telomeres, and transposons in yeast and human before highlighting emerging roles for non-canonical DNA structures at these repetitive loci. PMID:28406751
Neural coding of syntactic structure in learned vocalizations in the songbird.

PubMed

Fujimoto, Hisataka; Hasegawa, Taku; Watanabe, Dai

2011-07-06

Although vocal signals including human languages are composed of a finite number of acoustic elements, complex and diverse vocal patterns can be created from combinations of these elements, linked together by syntactic rules. To enable such syntactic vocal behaviors, neural systems must extract the sequence patterns from auditory information and establish syntactic rules to generate motor commands for vocal organs. However, the neural basis of syntactic processing of learned vocal signals remains largely unknown. Here we report that the basal ganglia projecting premotor neurons (HVC(X) neurons) in Bengalese finches represent syntactic rules that generate variable song sequences. When vocalizing an alternative transition segment between song elements called syllables, sparse burst spikes of HVC(X) neurons code the identity of a specific syllable type or a specific transition direction among the alternative trajectories. When vocalizing a variable repetition sequence of the same syllable, HVC(X) neurons not only signal the initiation and termination of the repetition sequence but also indicate the progress and state-of-completeness of the repetition. These different types of syntactic information are frequently integrated within the activity of single HVC(X) neurons, suggesting that syntactic attributes of the individual neurons are not programmed as a basic cellular subtype in advance but acquired in the course of vocal learning and maturation. Furthermore, some auditory-vocal mirroring type HVC(X) neurons display transition selectivity in the auditory phase, much as they do in the vocal phase, suggesting that these songbirds may extract syntactic rules from auditory experience and apply them to form their own vocal behaviors.
Retrotransposon insertion targeting: a mechanism for homogenization of centromere sequences on nonhomologous chromosomes.

PubMed

Birchler, James A; Presting, Gernot G

2012-04-01

The centromeres of most eukaryotic organisms consist of highly repetitive arrays that are similar across nonhomologous chromosomes. These sequences evolve rapidly, thus posing a mystery as to how such arrays can be homogenized. Recent work in species in which centromere-enriched retrotransposons occur indicates that these elements preferentially insert into the centromeric regions. In two different Arabidopsis species, a related element was recognized in which the specificity for such targeting was altered. These observations provide a partial explanation for how homogenization of centromere DNA sequences occurs.
The unique genomic landscape surrounding the EPSPS gene in glyphosate resistant Amaranthus palmeri: a repetitive path to resistance.

PubMed

Molin, William T; Wright, Alice A; Lawton-Rauh, Amy; Saski, Christopher A

2017-01-17

The expanding number and global distributions of herbicide resistant weedy species threaten food, fuel, fiber and bioproduct sustainability and agroecosystem longevity. Amongst the most competitive weeds, Amaranthus palmeri S. Wats has rapidly evolved resistance to glyphosate primarily through massive amplification and insertion of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene across the genome. Increased EPSPS gene copy numbers result in higher titers of the EPSPS enzyme, the target of glyphosate, and confer resistance to glyphosate treatment. To understand the genomic unit and mechanism of EPSPS gene copy number proliferation, we developed and used a bacterial artificial chromosome (BAC) library from a highly resistant biotype to sequence the local genomic landscape flanking the EPSPS gene. By sequencing overlapping BACs, a 297 kb sequence was generated, hereafter referred to as the "EPSPS cassette." This region included several putative genes, dense clusters of tandem and inverted repeats, putative helitron and autonomous replication sequences, and regulatory elements. Whole genome shotgun sequencing (WGS) of two biotypes exhibiting high and no resistance to glyphosate was performed to compare genomic representation across the EPSPS cassette. Mapping of sequences for both biotypes to the reference EPSPS cassette revealed significant differences in upstream and downstream sequences relative to EPSPS with regard to both repetitive units and coding content between these biotypes. The differences in sequence may have resulted from a compounded-building mechanism such as repetitive transpositional events. The association of putative helitron sequences with the cassette suggests a possible amplification and distribution mechanism. Flow cytometry revealed that the EPSPS cassette added measurable genomic content.
The adoption of glyphosate resistant cropping systems in major crops such as corn, soybean, cotton and canola coupled with excessive use of glyphosate herbicide has led to evolved glyphosate resistance in several important weeds. In Amaranthus palmeri, the amplification of the EPSPS cassette, characterized by a complex array of repetitive elements and putative helitron sequences, suggests an adaptive structural genomic mechanism that drives amplification and distribution around the genome. The added genomic content not found in glyphosate sensitive plants may be driving evolution through genome expansion.
Male Germline Control of Transposable Elements

PubMed Central

Bao, Jianqiang; Yan, Wei

2012-01-01

Repetitive sequences, especially transposon-derived interspersed repetitive elements, account for a large fraction of the genome in most eukaryotes. Despite the repetitive nature, these transposable elements display quantitative and qualitative differences even among species of the same lineage. Although transposable elements contribute greatly as a driving force to the biological diversity during evolution, they can induce embryonic lethality and genetic disorders as a result of insertional mutagenesis and genomic rearrangement. Temporary relaxation of the epigenetic control of retrotransposons during early germline development opens a risky window that can allow retrotransposons to escape from host constraints and to propagate abundantly in the host genome. Because germline mutations caused by retrotransposon activation are heritable and thus can be deleterious to the offspring, an adaptive strategy has evolved in host cells, especially in the germline. In this review, we will attempt to summarize general defense mechanisms deployed by the eukaryotic genome, with an emphasis on pathways utilized by the male germline to confer retrotransposon silencing. PMID:22357546
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.

PubMed

VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C

2015-11-26

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
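The N50 contig length quoted in this abstract is a standard assembly-contiguity statistic: the length L such that contigs of length L or longer together cover at least half of the assembled bases. A minimal sketch of that calculation follows, assuming the common definition; the function name `n50` and the toy lengths are illustrative only and are not the Oropetium data.

```python
def n50(contig_lengths):
    """Return the N50 of an assembly.

    N50 is the contig length L such that contigs of length >= L
    together account for at least half of the total assembled length.
    """
    if not contig_lengths:
        raise ValueError("need at least one contig length")
    total = sum(contig_lengths)
    running = 0
    # Walk from the longest contig downwards until half the assembly is covered.
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical toy assembly (arbitrary units), not data from the paper.
toy_lengths = [2, 3, 4, 5, 6, 10]
print(n50(toy_lengths))  # 10 + 6 = 16 covers at least half of 30, so N50 is 6
```

A larger N50 at the same total length means the same bases are held in fewer, longer contigs.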
The effect of episodic retrieval on inhibition in task switching.

PubMed

Grange, James A; Kowalczyk, Agnieszka W; O'Loughlin, Rory

2017-08-01

Inhibition in task switching is inferred from n-2 repetition costs: the observation that ABA task switching sequences are responded to slower than CBA sequences. This is thought to reflect the persisting inhibition of Task A, which slows reactivation attempts. Mayr (2002) reported an experiment testing a critical noninhibitory account of this effect, namely episodic retrieval: If the trial parameters for Task A match across an ABA sequence, responses should be facilitated because of priming from episodic retrieval; a cost would occur if trial parameters mismatch. In a rule-switching paradigm, Mayr reported no significant difference in n-2 repetition cost when the trial parameters repeated or switched across an ABA sequence, in clear contrast to the episodic retrieval account. What remains unclear is whether successful episodic retrieval modulates the n-2 repetition cost. Across 3 experiments, including a close replication of Mayr, we find clear evidence of reduced n-2 task repetition costs when episodic retrieval is controlled. We find that the effect of episodic retrieval on the n-2 task repetition cost is increased when the cue-task relationship is made more abstract, suggesting the effect is because of interference in establishing the relevant attentional set. We also demonstrate that the episodic retrieval effect is not influenced by retrieval of low-level, perceptual, elements. Together, the data suggest the n-2 task repetition cost, typically attributable to an inhibitory mechanism, also reflects episodic retrieval effects. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
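As described above, the n-2 repetition cost is the slowing on ABA sequences relative to CBA sequences. One simple way to express it is as a difference of mean response times, sketched below. The data layout (pairs of sequence label and response time in milliseconds), the function name and the numbers are assumptions made for illustration; the study's actual analysis pipeline is not described in the abstract.

```python
from statistics import mean


def n2_repetition_cost(trials):
    """Mean response time on ABA trials minus mean response time on CBA trials.

    `trials` is an iterable of (sequence_type, rt_ms) pairs, where
    sequence_type is "ABA" or "CBA". A positive value means ABA
    sequences were answered more slowly.
    """
    aba = [rt for kind, rt in trials if kind == "ABA"]
    cba = [rt for kind, rt in trials if kind == "CBA"]
    if not aba or not cba:
        raise ValueError("need at least one ABA and one CBA trial")
    return mean(aba) - mean(cba)


# Hypothetical response times in milliseconds, invented for this example.
example_trials = [("ABA", 700), ("ABA", 740), ("CBA", 680), ("CBA", 700)]
print(n2_repetition_cost(example_trials))  # 720 - 690 = 30
```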
Zaba: a novel miniature transposable element present in genomes of legume plants.

PubMed

Macas, J; Neumann, P; Pozárková, D

2003-08-01

A novel family of miniature transposable elements, named Zaba, was identified in pea (Pisum sativum) and subsequently also in other legume species using computer analysis of their DNA sequences. Zaba elements are 141-190 bp long, generate 10-bp target site duplications, and their terminal inverted repeats make up most of the sequence. Zaba elements thus resemble class 3 foldback transposons. The elements are only moderately repetitive in pea (tens to hundreds of copies per haploid genome), but they are present in up to thousands of copies in the genomes of several Medicago and Vicia species. More detailed analysis of the elements from pea, including isolation of new sequences from a genomic library, revealed that a fraction of these elements are truncated, and that their last transposition probably did not occur recently. A search for Zaba sequences in EST databases showed that at least some elements are transcribed, most probably due to their association with genic regions.
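Terminal inverted repeats (TIRs) of the kind described for Zaba can be illustrated with a short sketch: the 5' end of an element reads the same as the reverse complement of its 3' end. The code below measures the length of a perfect TIR only. It is a simplified illustration, not the computer analysis used by the authors, and the function names and toy sequence are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def perfect_tir_length(element):
    """Length of the perfect terminal inverted repeat of `element`.

    Compares the element with its own reverse complement base by base
    from the 5' end; the run of matches is the TIR arm length. The
    result is capped at half the element length so the arms cannot overlap.
    """
    rc = reverse_complement(element)
    matches = 0
    for a, b in zip(element, rc):
        if a != b:
            break
        matches += 1
    return min(matches, len(element) // 2)


# Toy element: 5-bp arm "GGCAT", 3-bp spacer "TTT", inverted arm "ATGCC".
toy_element = "GGCAT" + "TTT" + "ATGCC"
print(perfect_tir_length(toy_element))  # 5
```

Real elements usually carry mismatches in their arms, so practical searches tolerate some divergence instead of requiring a perfect match.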
Distribution of genes and repetitive elements in the Diabrotica virgifera virgifera genome estimated using BAC sequencing

USDA-ARS's Scientific Manuscript database

Feeding damage caused by the corn rootworm, Diabrotica virgifera virgifera, is destructive to corn plants in North America and Europe where control remains challenging due to evolution of resistance traits that allow survival when exposed to chemical and transgenic toxins. Genome sequencing of an i...
The Dfam database of repetitive DNA families.

PubMed

Hubley, Robert; Finn, Robert D; Clements, Jody; Eddy, Sean R; Jones, Thomas A; Bao, Weidong; Smit, Arian F A; Wheeler, Travis J

2016-01-04

Repetitive DNA, especially that due to transposable elements (TEs), makes up a large fraction of many genomes. Dfam is an open access database of families of repetitive DNA elements, in which each family is represented by a multiple sequence alignment and a profile hidden Markov model (HMM). The initial release of Dfam, featured in the 2013 NAR Database Issue, contained 1143 families of repetitive elements found in humans, and was used to produce more than 100 Mb of additional annotation of TE-derived regions in the human genome, with improved speed. Here, we describe recent advances, most notably expansion to 4150 total families including a comprehensive set of known repeat families from four new organisms (mouse, zebrafish, fly and nematode). We describe improvements to coverage, and to our methods for identifying and reducing false annotation. We also describe updates to the website interface. The Dfam website has moved to http://dfam.org. Seed alignments, profile HMMs, hit lists and other underlying data are available for download. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
A team of heterochromatin factors collaborates with small RNA pathways to combat repetitive elements and germline stress

PubMed Central

McMurchy, Alicia N; Stempor, Przemyslaw; Gaarenstroom, Tessa; Wysolmerski, Brian; Dong, Yan; Aussianikava, Darya; Appert, Alex; Huang, Ni; Kolasinska-Zwierz, Paulina; Sapetschnig, Alexandra; Miska, Eric A; Ahringer, Julie

2017-01-01

Repetitive sequences derived from transposons make up a large fraction of eukaryotic genomes and must be silenced to protect genome integrity. Repetitive elements are often found in heterochromatin; however, the roles and interactions of heterochromatin proteins in repeat regulation are poorly understood. Here we show that a diverse set of C. elegans heterochromatin proteins act together with the piRNA and nuclear RNAi pathways to silence repetitive elements and prevent genotoxic stress in the germ line. Mutants in genes encoding HPL-2/HP1, LIN-13, LIN-61, LET-418/Mi-2, and H3K9me2 histone methyltransferase MET-2/SETDB1 also show functionally redundant sterility, increased germline apoptosis, DNA repair defects, and interactions with small RNA pathways. Remarkably, fertility of heterochromatin mutants could be partially restored by inhibiting cep-1/p53, endogenous meiotic double strand breaks, or the expression of MIRAGE1 DNA transposons. Functional redundancy among factors and pathways underlies the importance of safeguarding the genome through multiple means. DOI: http://dx.doi.org/10.7554/eLife.21666.001 PMID:28294943
What tangled web: barriers to rampant horizontal gene transfer.

PubMed

Kurland, Charles G

2005-07-01

Dawkins in his The Selfish Gene(1) quite aptly applies the term "selfish" to parasitic repetitive DNA sequences endemic to eukaryotic genomes, especially vertebrates. Doolittle and Sapienza(2) as well as Orgel and Crick(3) enlivened this notion of selfish DNA with the identification of such repetitive sequences as remnants of mobile elements such as transposons. In addition, Orgel and Crick(3) associated parasitic DNA with a potential to outgrow their host genomes by propagating both vertically via conventional genome replication as well as infectiously by horizontal gene transfer (HGT) to other genomes. Still later, Doolittle(4) speculated that unchecked HGT between unrelated genomes so complicates phylogeny that the conventional representation of a tree of life would have to be replaced by a thicket or a web of life.(4) In contrast, considerable data now show that reconstructions based on whole genome sequences are consistent with the conventional "tree of life".(5-10) Here, we identify natural barriers that protect modern genome populations from the inroads of rampant HGT. Copyright (c) 2005 Wiley Periodicals, Inc.
PlantFuncSSR: Integrating First and Next Generation Transcriptomics for Mining of SSR-Functional Domains Markers

PubMed Central

Sablok, Gaurav; Pérez-Pulido, Antonio J.; Do, Thac; Seong, Tan Y.; Casimiro-Soriguer, Carlos S.; La Porta, Nicola; Ralph, Peter J.; Squartini, Andrea; Muñoz-Merida, Antonio; Harikrishna, Jennifer A.

2016-01-01

Analysis of repetitive DNA sequence content and divergence among the repetitive functional classes is a well-accepted approach for estimation of inter- and intra-generic differences in plant genomes. Among these elements, microsatellites, or Simple Sequence Repeats (SSRs), have been widely demonstrated as powerful genetic markers for discriminating species and varieties. We present the PlantFuncSSRs platform, which covers more than 364 plant species and more than 2 million functional SSRs. The SSRs are provided with detailed annotations for easy functional browsing and with information on primer pairs and associated functional domains. PlantFuncSSRs can be leveraged to identify functional-based genic variability among the species of interest, which might be of particular interest in developing functional markers in plants. This comprehensive on-line portal unifies mining of SSRs from first and next generation sequencing datasets, corresponding primer pairs and associated in-depth functional annotation such as gene ontology annotation, gene interactions and its identification from reference protein databases. PlantFuncSSRs is freely accessible at: http://www.bioinfocabd.upo.es/plantssr. PMID:27446111
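The mining step mentioned above can be illustrated with a regular-expression scan for perfect SSRs, that is, a short motif repeated in tandem. This is a minimal sketch under assumed thresholds (motifs of 1 to 6 bp, at least 4 tandem copies); it is not the PlantFuncSSRs pipeline, whose mining tool and parameters are not stated in the abstract, and the function name and test sequence are hypothetical.

```python
import re


def find_ssrs(seq, min_repeats=4, max_motif=6):
    """Find perfect tandem repeats (SSRs) in a DNA sequence.

    Returns a list of (start, motif, repeat_count) tuples, where start
    is the 0-based position of the repeat tract. The motif group is
    lazy, so the shortest repeating unit is reported at each position.
    """
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        repeats = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, repeats))
    return hits


# Toy sequence containing one (AC)5 microsatellite, invented for this example.
print(find_ssrs("TTGACACACACACGGT"))  # [(3, 'AC', 5)]
```

The thresholds are arguments because minimum repeat counts are a matter of convention and often differ by motif length.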
Translocation and gross deletion breakpoints in human inherited disease and cancer II: Potential involvement of repetitive sequence elements in secondary structure formation between DNA ends.

PubMed

Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N

2003-09-01

Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.
Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.

PubMed

Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru

2015-01-01

The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.
Assembly of highly repetitive genomes using short reads: the genome of discrete typing unit III Trypanosoma cruzi strain 231.

PubMed

Baptista, Rodrigo P; Reis-Cunha, Joao Luis; DeBarry, Jeremy D; Chiari, Egler; Kissinger, Jessica C; Bartholomeu, Daniella C; Macedo, Andrea M

2018-02-14

Next-generation sequencing (NGS) methods are low-cost high-throughput technologies that produce thousands to millions of sequence reads. Despite the high number of raw sequence reads, their short length, relative to Sanger, PacBio or Nanopore reads, complicates the assembly of genomic repeats. Many genome tools are available, but the assembly of highly repetitive genome sequences using only NGS short reads remains challenging. Genome assembly of organisms responsible for important neglected diseases such as Trypanosoma cruzi, the aetiological agent of Chagas disease, is known to be challenging because of their repetitive nature. Only three of six recognized discrete typing units (DTUs) of the parasite have their draft genomes published and therefore genome evolution analyses in the taxon are limited. In this study, we developed a computational workflow to assemble highly repetitive genomes via a combination of de novo and reference-based assembly strategies to better overcome the intrinsic limitations of each, based on Illumina reads. The highly repetitive genome of the human-infecting parasite T. cruzi 231 strain was used as a test subject. The combined-assembly approach shown in this study benefits from the reference-based assembly ability to resolve highly repetitive sequences and from the de novo capacity to assemble genome-specific regions, improving the quality of the assembly. The acceptable confidence obtained by analyzing our results showed that our combined approach is an attractive option to assemble highly repetitive genomes with NGS short reads. Phylogenomic analysis including the 231 strain, the first representative of DTU III whose genome was sequenced, was also performed and provides new insights into T. cruzi genome evolution.
Motor programming when sequencing multiple elements of the same duration.

PubMed

Magnuson, Curt E; Robin, Donald A; Wright, David L

2008-11-01

The self-select paradigm of motor programming was adopted in 2 experiments to examine the processing demands of independent processes. One process (INT) is responsible for organizing the internal features of the individual elements in a movement (e.g., response duration). The 2nd process (SEQ) is responsible for placing the elements into the proper serial order before execution. Participants in Experiment 1 performed tasks involving 1 key press or sequences of 4 key presses of the same duration. Implementing INT and SEQ was more time consuming for key-pressing sequences than for single key-press tasks. Experiment 2 examined whether the INT costs resulting from the increase in sequence length observed in Experiment 1 resulted from independent planning of each sequence element or via a separate "multiplier" process that handled repetitions of elements of the same duration. Findings from Experiment 2, in which participants performed single key presses or double or triple key sequences of the same duration, suggested that INT is involved with the independent organization of each element contained in the sequence. Researchers offer an elaboration of the 2-process account of motor programming to incorporate the present findings and the findings from other recent sequence-learning research.
An alternative nested-PCR assay for the detection of Toxoplasma gondii strains based on GRA7 gene sequences.

PubMed

Costa, Maria Eduarda S M; Oliveira, Claudio Bruno S; Andrade, Joelma Maria de A; Medeiros, Thatiany A; Neto, Valter F Andrade; Lanza, Daniel C F

2016-07-01

Toxoplasma gondii is a widespread parasite able to infect virtually any nucleated cell of warm-blooded hosts. In some cases, T. gondii detection using already developed PCR primers can be inefficient in routine laboratory tests, especially to detect atypical strains. Here we report a new nested-PCR protocol able to detect virtually all T. gondii isolates. Analyzing 685 sequences available in GenBank, we determined that GRA7 is one of the most conserved genes of the T. gondii genome. Based on an alignment of 85 GRA7 sequences, new primer sets that anneal in the highly conserved regions of this gene were designed. The new GRA7 nested-PCR assay provides sensitivity and specificity equal to or greater than the gold standard PCR assays for T. gondii detection, which amplify the B1 sequence or the repetitive 529-bp element. Copyright © 2016 Elsevier B.V. All rights reserved.
Proteomics technique opens new frontiers in mobilome research.

PubMed

Davidson, Andrew D; Matthews, David A; Maringer, Kevin

2017-01-01

A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the "mobilome," which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the "domestication" of transposon proteins for cellular functions. Although 'omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called "proteomics informed by transcriptomics" (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease.
Identification of Genetic Elements Associated with EPSPS Gene Amplification

PubMed Central

Gaines, Todd A.; Wright, Alice A.; Molin, William T.; Lorentz, Lothar; Riggins, Chance W.; Tranel, Patrick J.; Beffa, Roland; Westra, Philip; Powles, Stephen B.

2013-01-01

Weed populations can have high genetic plasticity and rapid responses to environmental selection pressures. For example, 100-fold amplification of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene evolved in the weed species Amaranthus palmeri to confer resistance to glyphosate, the world’s most important herbicide. However, the gene amplification mechanism is unknown. We sequenced the EPSPS gene and genomic regions flanking EPSPS loci in A. palmeri, and searched for mobile genetic elements or repetitive sequences. The EPSPS gene was 10,229 bp, containing 8 exons and 7 introns. The gene amplification likely proceeded through a DNA-mediated mechanism, as introns exist in the amplified gene copies and the entire amplified sequence is at least 30 kb in length. Our data support the presence of two EPSPS loci in susceptible (S) A. palmeri, and that only one of these was amplified in glyphosate-resistant (R) A. palmeri. The EPSPS gene amplification event likely occurred recently, as no sequence polymorphisms were found within introns of amplified EPSPS copies from R individuals. Sequences with homology to miniature inverted-repeat transposable elements (MITEs) were identified next to EPSPS gene copies only in R individuals. Additionally, a putative Activator (Ac) transposase and a repetitive sequence region were associated with amplified EPSPS genes. The mechanism controlling this DNA-mediated amplification remains unknown. Further investigation is necessary to determine if the gene amplification may have proceeded via DNA transposon-mediated replication, and/or unequal recombination between different genomic regions resulting in replication of the EPSPS gene. PMID:23762434

Structurally Complex Organization of Repetitive DNAs in the Genome of Cobia (Rachycentron canadum).

PubMed

Costa, Gideão W W F; Cioffi, Marcelo de B; Bertollo, Luiz A C; Molina, Wagner F

2015-06-01

Repetitive DNAs comprise the largest fraction of the eukaryotic genome. They include microsatellites or simple sequence repeats (SSRs), which play an important role in the chromosome differentiation among fishes. Rachycentron canadum is the only representative of the family Rachycentridae. This species has been the focus of several multidisciplinary studies in view of its important potential for marine fish farming. In the present study, distinct classes of repetitive DNAs, with emphasis on SSRs, were mapped in the chromosomes of this species to improve the knowledge of its genome organization. Microsatellites exhibited a diversified distribution, both dispersed in euchromatin and clustered in the heterochromatin. The multilocus location of SSRs strengthened the heterochromatin heterogeneity in this species, as suggested by some previous studies. The colocalization of SSRs with retrotransposons and transposons pointed to a close evolutionary relationship between these repetitive sequences. A number of heterochromatic regions revealed a more complex organization than previously supposed, harboring a diversity of repetitive elements. In this sense, there was also evidence of colocalization of active genetic regions and different classes of repetitive DNAs in a common heterochromatic region, which offers a potential opportunity for further research regarding the interaction of these distinct fractions in fish genomes.
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

DOE Office of Scientific and Technical Information (OSTI.GOV)

VanBuren, Robert; Bryant, Doug; Edger, Patrick P.

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
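The abstract above summarizes assembly contiguity with an N50 length. As a minimal sketch of how that statistic is usually computed (the function name and the toy contig lengths are illustrative assumptions, not data from the Oropetium assembly), N50 is the length of the contig at which the running total of contig lengths, sorted from longest to shortest, first reaches half of the total assembly size:

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are sorted from longest to shortest; the N50 is the length of
    the contig at which the cumulative sum first reaches half of the total.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("no contigs given")


# Hypothetical contig lengths (in Mb), chosen only to illustrate the rule.
toy_contigs = [8, 5, 4, 3, 2, 2, 1]
print(n50(toy_contigs))
```

For the toy list the total is 25, so the running sum passes 12.5 at the second-longest contig and the function returns its length.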
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

DOE PAGES

VanBuren, Robert; Bryant, Doug; Edger, Patrick P.; ...

2015-11-11

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
The short interspersed repetitive element of Trypanosoma cruzi, SIRE, is part of VIPER, an unusual retroelement related to long terminal repeat retrotransposons

PubMed Central

Vázquez, Martín; Ben-Dov, Claudia; Lorenzi, Hernan; Moore, Troy; Schijman, Alejandro; Levin, Mariano J.

2000-01-01

The short interspersed repetitive element (SIRE) of Trypanosoma cruzi was first detected when comparing the sequences of loci that encode the TcP2β genes. It is present in about 1,500–3,000 copies per genome, depending on the strain, and it is distributed in all chromosomes. An initial analysis of SIRE sequences from 21 genomic fragments allowed us to derive a consensus nucleotide sequence and structure for the element, consisting of three regions (I, II, and III) each harboring distinctive features. Analysis of 158 transcribed SIREs demonstrates that the consensus is highly conserved. The sequences of 51 cDNAs show that SIRE is included in the 3′ end of several mRNAs, always transcribed from the sense strand, contributing the polyadenylation site in 63% of the cases. This study led to the characterization of VIPER (vestigial interposed retroelement), a 2,326-bp-long unusual retroelement. VIPER's 5′ end is formed by the first 182 bp of SIRE, whereas its 3′ end is formed by the last 220 bp of the element. Both SIRE moieties are connected by a 1,924-bp-long fragment that carries a unique ORF encoding a complete reverse transcriptase-RNase H gene whose 15 C-terminal amino acids derive from codons specified by SIRE's region II. The amino acid sequence of VIPER's reverse transcriptase-RNase H shares significant homology to that of long terminal repeat retrotransposons. The fact that SIRE and VIPER sequences are found only in the T. cruzi genome may be of relevance for studies concerning the evolution and the genome flexibility of this protozoan parasite. PMID:10688909
Interplay between DNA methylation, histone modification and chromatin remodeling in stem cells and during development.

PubMed

Ikegami, Kohta; Ohgane, Jun; Tanaka, Satoshi; Yagi, Shintaro; Shiota, Kunio

2009-01-01

Genes constitute only a small proportion of the mammalian genome, the majority of which is composed of non-genic repetitive elements including interspersed repeats and satellites. A unique feature of the mammalian genome is that there are numerous tissue-dependent, differentially methylated regions (T-DMRs) in the non-repetitive sequences, which include genes and their regulatory elements. The epigenetic status of T-DMRs varies from that of repetitive elements and constitutes the DNA methylation profile genome-wide. Since the DNA methylation profile is specific to each cell and tissue type, much like a fingerprint, it can be used as a means of identification. The formation of DNA methylation profiles is the basis for cell differentiation and development in mammals. The epigenetic status of each T-DMR is regulated by the interplay between DNA methyltransferases, histone modification enzymes, histone subtypes, non-histone nuclear proteins and non-coding RNAs. In this review, we will discuss how these epigenetic factors cooperate to establish cell- and tissue-specific DNA methylation profiles.
RNA-Seq Analysis to Measure the Expression of SINE Retroelements.

PubMed

Román, Ángel Carlos; Morales-Hernández, Antonio; Fernández-Salguero, Pedro M

2016-01-01

The intrinsic features of retroelements, like their repetitive nature and disseminated presence in their host genomes, demand the use of advanced methodologies for their bioinformatic and functional study. The short length of SINE (short interspersed elements) retrotransposons makes such analyses even more complex. Next-generation sequencing (NGS) technologies are currently one of the most widely used tools to characterize the whole repertoire of gene expression in a specific tissue. In this chapter, we will review the molecular and computational methods needed to perform NGS analyses on SINE elements. We will also describe new methods of potential interest for researchers studying repetitive elements. We intend to outline the general ideas behind the computational analyses of NGS data obtained from SINE elements, and to stimulate other scientists to expand our current knowledge on SINE biology using RNA-seq and other NGS tools.
Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts.

PubMed

Trofimova, Irina; Krasikova, Alla

2016-12-01

Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with the biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeat transcription in amphibian and avian model organisms are fragmentary despite their genomes being thoroughly characterized. This review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis, when chromosomes acquire the special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in the interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes, combined with novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, and dynamic observation in life-like conditions, provide amazing opportunities for investigating the mechanisms of satellite DNA transcription.
Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts

PubMed Central

Krasikova, Alla

2016-01-01

ABSTRACT Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with the biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeat transcription in amphibian and avian model organisms are fragmentary despite their genomes being thoroughly characterized. This review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis, when chromosomes acquire the special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in the interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes, combined with novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, and dynamic observation in life-like conditions, provide amazing opportunities for investigating the mechanisms of satellite DNA transcription. PMID:27763817
Primary analysis of repeat elements of the Asian seabass (Lates calcarifer) transcriptome and genome

PubMed Central

Kuznetsova, Inna S.; Thevasagayam, Natascha M.; Sridatta, Prakki S. R.; Komissarov, Aleksey S.; Saju, Jolly M.; Ngoh, Si Y.; Jiang, Junhui; Shen, Xueyan; Orbán, László

2014-01-01

As part of our Asian seabass genome project, we are generating an inventory of repeat elements in the genome and transcriptome. The karyotype showed a diploid number of 2n = 24 chromosomes with a variable number of B-chromosomes. The transcriptome and genome of Asian seabass were searched for repetitive elements with experimental and bioinformatics tools. Six different types of repeats constituting 8–14% of the genome were characterized. Repetitive elements were clustered in the pericentromeric heterochromatin of all chromosomes, but some of them were preferentially accumulated in pretelomeric and pericentromeric regions of several chromosome pairs and have chromosome-specific arrangements. From the dispersed class of fish-specific non-LTR retrotransposon elements, Rex1 and MAUI-like repeats were analyzed. They were widespread in both the genome and transcriptome and accumulated in the pericentromeric and peritelomeric areas of all chromosomes. Every analyzed repeat was represented in the Asian seabass transcriptome, and some showed differential expression between the gonads. The other group of repeats analyzed belongs to the rRNA multigene family. The FISH signal for 5S rDNA was located on a single pair of chromosomes, whereas that for 18S rDNA was found on two pairs. A BAC-derived contig containing rDNA was sequenced and assembled into a scaffold containing incomplete fragments of 18S rDNA. Their assembly and chromosomal position revealed that this part of the Asian seabass genome is extremely rich in repeats containing evolutionarily conserved and novel sequences. In summary, transcriptome assemblies and cDNA data are suitable for the identification of repetitive DNA from unknown genomes and for comparative investigation of conserved elements between teleosts and other vertebrates. PMID:25120555
An ancient trans-kingdom horizontal transfer of Penelope-like retroelements from arthropods to conifers

Treesearch

Xuan Lin; Nurul Faridi; Claudio Casola

2016-01-01

Comparative genomics analyses empowered by the wealth of sequenced genomes have revealed numerous instances of horizontal DNA transfers between distantly related species. In eukaryotes, repetitive DNA sequences known as transposable elements (TEs) are especially prone to move across species boundaries. Such horizontal transposon transfers, or HTTs, are relatively ...
Proteomics technique opens new frontiers in mobilome research

PubMed Central

Davidson, Andrew D.; Matthews, David A.

2017-01-01

ABSTRACT A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the “mobilome,” which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the “domestication” of transposon proteins for cellular functions. Although ‘omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called “proteomics informed by transcriptomics” (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease. PMID:28932623
Analysis of Two Cosmid Clones from Chromosome 4 of Drosophila melanogaster Reveals Two New Genes Amid an Unusual Arrangement of Repeated Sequences

PubMed Central

Locke, John; Podemski, Lynn; Roy, Ken; Pilgrim, David; Hodgetts, Ross

1999-01-01

Chromosome 4 from Drosophila melanogaster has several unusual features that distinguish it from the other chromosomes. These include a diffuse appearance in salivary gland polytene chromosomes, an absence of recombination, and the variegated expression of P-element transgenes. As part of a larger project to understand these properties, we are assembling a physical map of this chromosome. Here we report the sequence of two cosmids representing ∼5% of the polytenized region. Both cosmid clones contain numerous repeated DNA sequences, as identified by cross hybridization with labeled genomic DNA, BLAST searches, and dot matrix analysis, which are positioned between and within the transcribed sequences. The repetitive sequences include three copies of the mobile element Hoppel, one copy of the mobile element HB, and 18 DINE repeats. DINE is a novel, short repeated sequence dispersed throughout both cosmid sequences. One cosmid includes the previously described cubitus interruptus (ci) gene and two new genes: a gene with a predicted amino acid sequence similar to ribosomal protein S3a, which is consistent with the Minute(4)101 locus thought to be in the region, and a novel member of the protein family that includes plexin and met–hepatocyte growth factor receptor. The other cosmid contains only the two short 5′-most exons from the zinc-finger-homolog-2 (zfh-2) gene. This is the first extensive sequence analysis of noncoding DNA from chromosome 4. The distribution of the various repeats suggests its organization is similar to the β-heterochromatic regions near the base of the major chromosome arms. Such a pattern may account for the diffuse banding of the polytene chromosome 4 and the variegation of many P-element transgenes on the chromosome. PMID:10022978
Next-generation sequencing detects repetitive elements expansion in giant genomes of annual killifish genus Austrolebias (Cyprinodontiformes, Rivulidae).

PubMed

García, G; Ríos, N; Gutiérrez, V

2015-06-01

Among Neotropical fish fauna, the South American killifish genus Austrolebias (Cyprinodontiformes: Rivulidae) constitutes an excellent model to study the genomic evolutionary processes underlying speciation events. Recently, unusually large genome sizes have been described in 16 species of this genus, with an average DNA content of about 5.95 ± 0.45 pg per diploid cell (mean C-value of about 2.98 pg). In the present paper we explore the possible origin of this unparalleled genomic increase by means of comparative analysis of the repetitive components using NGS (454-Roche) technology in the smallest and largest Rivulidae genomes. Here, we provide the first annotated composition of Rivulidae repeated sequences and their relative repetitive fraction in both genomes. Remarkably, the genomic proportion of moderately repetitive DNA in the Austrolebias charrua genome (45%) is approximately twice that of the closely related rivuline taxon Cynopoecilus melanotaenia (25%). The present work provides evidence about the impact of repeat families that could have proliferated differently among sublineages within the Rivulidae fish group, explaining the great genome size differences accompanying the differentiation and speciation events in this family.
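The abstract above quotes DNA content per diploid cell alongside a C-value. A small worked sketch of the arithmetic linking the two, under stated assumptions: the C-value is the haploid content, so it is half of the diploid figure, and the conversion of 1 pg of DNA to roughly 978 Mb is a widely used approximation that is not taken from this abstract. The function names are illustrative.

```python
# Widely used approximation (an assumption here, not stated in the abstract):
# 1 picogram of double-stranded DNA corresponds to about 978 megabases.
MB_PER_PG = 978.0


def diploid_pg_to_c_value(diploid_pg):
    """Haploid DNA content (C-value, in pg) from diploid content (2C, in pg)."""
    return diploid_pg / 2


def pg_to_mb(picograms):
    """Approximate genome size in megabases for a DNA mass in picograms."""
    return picograms * MB_PER_PG


c_value = diploid_pg_to_c_value(5.95)  # diploid content quoted in the abstract
print(f"C-value: {c_value:.3f} pg")
print(f"Approximate haploid genome size: {pg_to_mb(c_value):.0f} Mb")
```

Halving 5.95 pg gives 2.975 pg, which matches the rounded mean C-value of about 2.98 pg reported in the abstract.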
DNA Fingerprinting of Lactobacillus crispatus Strain CTV-05 by Repetitive Element Sequence-Based PCR Analysis in a Pilot Study of Vaginal Colonization

PubMed Central

Antonio, May A. D.; Hillier, Sharon L.

2003-01-01

Lactobacillus crispatus is one of the predominant hydrogen peroxide (H2O2)-producing species found in the vagina and is under development as a probiotic for the treatment of bacterial vaginosis. In this study, we assessed whether DNA fingerprinting by repetitive element sequence-based PCR (rep-PCR) can be used to distinguish the capsule strain of L. crispatus (CTV-05) from other endogenous strains as well as other species of vaginal lactobacilli. Vaginal and rectal lactobacilli were identified to the species level by using whole-chromosome probe DNA hybridization. The DNAs from L. crispatus, L. jensenii, L. gasseri, and an as-yet-unnamed H2O2-negative Lactobacillus species designated 1086V were subjected to rep-PCR. The results of gel electrophoresis and ethidium bromide staining of the DNA fingerprints obtained were compared. L. crispatus CTV-05 had a unique DNA fingerprint compared to all other lactobacilli. DNA fingerprints for 27 production lots of L. crispatus sampled from 1994 through 2001 were identical to that of the original strain isolated in 1993, suggesting strain stability. In a pilot study of nine women, this DNA fingerprinting method distinguished CTV-05 from other endogenous vaginal lactobacilli prior to and after vaginal capsule use. rep-PCR DNA fingerprinting is useful for strain typing and for evaluating longitudinal loss or acquisition of vaginal lactobacilli used as probiotics. PMID:12734221
In Vivo Control of CpG and Non-CpG DNA Methylation by DNA Methyltransferases

PubMed Central

Arand, Julia; Spieler, David; Karius, Tommy; Branco, Miguel R.; Meilinger, Daniela; Meissner, Alexander; Jenuwein, Thomas; Xu, Guoliang; Leonhardt, Heinrich; Wolf, Verena; Walter, Jörn

2012-01-01

The enzymatic control of the setting and maintenance of symmetric and non-symmetric DNA methylation patterns in a particular genome context is not well understood. Here, we describe a comprehensive analysis of DNA methylation patterns generated by high resolution sequencing of hairpin-bisulfite amplicons of selected single copy genes and repetitive elements (LINE1, B1, IAP-LTR-retrotransposons, and major satellites). The analysis unambiguously identifies a substantial amount of regional incomplete methylation maintenance, i.e. hemimethylated CpG positions, with variant degrees among cell types. Moreover, non-CpG cytosine methylation is confined to ESCs and exclusively catalysed by Dnmt3a and Dnmt3b. This sequence position–, cell type–, and region-dependent non-CpG methylation is strongly linked to neighboring CpG methylation and requires the presence of Dnmt3L. The generation of a comprehensive data set of 146,000 CpG dyads was used to apply and develop parameter estimated hidden Markov models (HMM) to calculate the relative contribution of DNA methyltransferases (Dnmts) for de novo and maintenance DNA methylation. The comparative modelling included wild-type ESCs and mutant ESCs deficient for Dnmt1, Dnmt3a, Dnmt3b, or Dnmt3a/3b, respectively. The HMM analysis identifies a considerable de novo methylation activity for Dnmt1 at certain repetitive elements and single copy sequences. Dnmt3a and Dnmt3b contribute de novo function. However, both enzymes are also essential to maintain symmetrical CpG methylation at distinct repetitive and single copy sequences in ESCs. PMID:22761581
Evolution of the Australian lungfish (Neoceratodus forsteri) genome: a major role for CR1 and L2 LINE elements.

PubMed

Metcalfe, Cushla J; Filée, Jonathan; Germon, Isabelle; Joss, Jean; Casane, Didier

2012-11-01

Haploid genomes greater than 25,000 Mb are rare; among animals, only the lungfish and some of the salamanders and crustaceans are known to have genomes this large. There are very few data on the structure of genomes of this size. It is known, however, that for animal genomes up to 3,000 Mb, there is in general a good correlation between genome size and the percent of the genome composed of repetitive sequence and that this repetitive component is highly dynamic. In this study, we sampled the Australian lungfish genome using three mini-genomic libraries and found that with very little sequence, the results converged on an estimate of 40% of the genome being composed of recognizable transposable elements (TEs), chiefly from the CR1 and L2 long interspersed nuclear element clades. We further characterized the CR1 and L2 elements in the lungfish genome and show that although most CR1 elements probably represent recent amplifications, the L2 elements are more diverse and are more likely the result of a series of amplifications. We suggest that our sampling method has probably underestimated the recognizable TE content. However, on the basis of the most likely sources of error, we suggest that this very large genome is not largely composed of recently amplified, undetected TEs but may instead include a large component of older degenerate TEs. Based on these estimates, and on Thomson's (Thomson K. 1972. An attempt to reconstruct evolutionary changes in the cellular DNA content of lungfish. J Exp Zool. 180:363-372) inference that in the lineage leading to the extant Australian lungfish, there was a massive increase in genome size between 350 and 200 mya, after which the size of the genome changed little, we speculate that the very large Australian lungfish genome may be the result of a massive amplification of TEs followed by a long period with a very low rate of sequence removal and some ongoing TE activity.
The role of stimulus-specific adaptation in songbird syntax generation

NASA Astrophysics Data System (ADS)

Wittenbach, Jason D.

Sequential behaviors are an important part of the behavioral repertoire of many animals and understanding how neural circuits encode and generate such sequences is a long-standing question in neuroscience. The Bengalese finch is a useful model system for studying variable action sequences. The songs of these birds consist of well-defined vocal elements (syllables) that are strung together to form sequences. The ordering of the syllables within the sequence is variable but not random - it shows complex statistical patterns (syntax). While often thought to be first-order, the syntax of the Bengalese finch song shows a distinct form of history dependence where the probability of repeating a syllable decreases as a function of the number of repetitions that have already occurred. Current models of the Bengalese finch song control circuitry offer no explanation for this repetition adaptation. The Bengalese finch also uses real-time auditory feedback to control the song syntax. Considering these facts, we hypothesize that repetition adaptation in the Bengalese finch syntax may be caused by stimulus-specific adaptation - a wide-spread phenomenon where neural responses to a specific stimulus become weaker with repeated presentations of the same stimulus. We begin by proposing a computational model for the song-control circuit where an auditory feedback signal that undergoes stimulus-specific adaptation helps drive repeated syllables. We show that this model does indeed capture the repetition adaptation observed in Bengalese finch syntax; along the way, we derive a new probabilistic model for repetition adaptation. Key predictions of our model are analyzed in light of experiments performed by collaborators. Next we extend the model in order to predict how the syntax will change as a function of brain temperature. 
These predictions are compared to experimental results from collaborators where portions of the Bengalese finch song circuit are cooled in awake and behaving birds. Finally we show that repetition adaptation persists even in a simplified dynamical system model when a parameter controlling the repeat probability changes slowly over repetitions.
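The abstract above describes repetition adaptation: the probability of repeating a syllable falls as the number of repetitions already produced grows. The sketch below illustrates that idea only. It is not the probabilistic model derived in the thesis; the geometric decay rule and the parameter names `p0` and `decay` are assumptions made purely for illustration. Setting `decay = 1` recovers a first-order (history-independent) repeat probability for comparison.

```python
def repeat_count_distribution(p0, decay, max_n):
    """Probability that a syllable is sung exactly n times, for n = 1..max_n.

    Hypothetical adaptation rule: after the n-th rendition, the probability
    of repeating once more is p0 * decay ** (n - 1).
    """
    distribution = []
    reach_n = 1.0  # probability that the run reaches the n-th rendition
    for n in range(1, max_n + 1):
        p_repeat = p0 * decay ** (n - 1)
        distribution.append(reach_n * (1.0 - p_repeat))  # stop after n
        reach_n *= p_repeat                              # or continue
    return distribution


first_order = repeat_count_distribution(p0=0.8, decay=1.0, max_n=6)
adapting = repeat_count_distribution(p0=0.8, decay=0.5, max_n=6)
for n, (a, b) in enumerate(zip(first_order, adapting), start=1):
    print(f"n={n}: first-order {a:.3f}, adapting {b:.3f}")
```

With a constant repeat probability the run lengths follow a monotonically decaying geometric distribution, whereas the adapting version concentrates probability on short runs and makes long runs much rarer.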
Chromodomains direct integration of retrotransposons to heterochromatin

PubMed Central

Gao, Xiang; Hou, Yi; Ebina, Hirotaka; Levin, Henry L.; Voytas, Daniel F.

2008-01-01

The enrichment of mobile genetic elements in heterochromatin may be due, in part, to targeted integration. The chromoviruses are Ty3/gypsy retrotransposons with chromodomains at their integrase C termini. Chromodomains are logical determinants for targeting to heterochromatin, because the chromodomain of heterochromatin protein 1 (HP1) typically recognizes histone H3 K9 methylation, an epigenetic mark characteristic of heterochromatin. We describe three groups of chromoviruses based on amino acid sequence relationships of their integrase C termini. Genome sequence analysis indicates that representative chromoviruses from each group are enriched in gene-poor regions of the genome relative to other retrotransposons, and when fused to fluorescent marker proteins, the chromodomains target proteins to specific subnuclear foci coincident with heterochromatin. The chromodomain of the fungal element, MAGGY, interacts with histone H3 dimethyl- and trimethyl-K9, and when the MAGGY chromodomain is fused to integrase of the Schizosaccharomyces pombe Tf1 retrotransposon, new Tf1 insertions are directed to sites of H3 K9 methylation. Repetitive sequences such as transposable elements trigger the RNAi pathway resulting in their epigenetic modification. Our results suggest a dynamic interplay between retrotransposons and heterochromatin, wherein mobile elements recognize heterochromatin at the time of integration and then perpetuate the heterochromatic mark by triggering epigenetic modification. PMID:18256242
Molecular typing of Lactobacillus brevis isolates from Korean food using repetitive element-polymerase chain reaction.

PubMed

Kaur, Jasmine; Sharma, Anshul; Lee, Sulhee; Park, Young-Seo

2018-06-01

Lactobacillus brevis is part of a large family of lactic acid bacteria that are present in cheese, sauerkraut, sourdough, silage, cow manure, feces, and the intestinal tract of humans and rats. It is used in food fermentation and so is considered a "generally recognized as safe" organism. L. brevis strains are extensively used as probiotics; hence, there is a need to identify and characterize these strains. For identification and discrimination of bacterial species at the subspecific level, the repetitive element-polymerase chain reaction method is a reliable genomic fingerprinting tool. The objective of the present study was to characterize 13 strains of L. brevis isolated from various fermented foods using repetitive element-polymerase chain reaction. Repetitive element-polymerase chain reaction was performed using three primer sets, REP, Enterobacterial Repetitive Intergenic Consensus (ERIC), and (GTG)5, which produced different fingerprinting patterns that made it possible to distinguish between the closely related strains. The fingerprinting patterns contained bands ranging from 150 to 5000 bp with REP, 200 to 7500 bp with ERIC, and 250 to 2000 bp with (GTG)5 primers. Jaccard's dissimilarity matrices, based on the repetitive element-polymerase chain reaction fingerprinting data, were used to obtain dendrograms by the unweighted neighbor-joining method. Repetitive element-polymerase chain reaction proved to be a rapid and easy method that can produce reliable results for L. brevis strains.
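The abstract above builds dendrograms from Jaccard dissimilarities between fingerprints. As a minimal sketch of that first step (the strain names and band sizes are hypothetical, and real analyses bin band positions with a tolerance before comparison), each fingerprint can be treated as a set of scored bands, with the dissimilarity equal to one minus the ratio of shared bands to all bands seen in either lane:

```python
def jaccard_dissimilarity(bands_a, bands_b):
    """1 - |A ∩ B| / |A ∪ B| for two sets of scored band positions."""
    a, b = set(bands_a), set(bands_b)
    union = a | b
    if not union:
        return 0.0  # two empty profiles are treated as identical
    return 1.0 - len(a & b) / len(union)


def dissimilarity_matrix(profiles):
    """Return sorted strain names and the pairwise dissimilarity matrix."""
    names = sorted(profiles)
    matrix = [
        [jaccard_dissimilarity(profiles[row], profiles[col]) for col in names]
        for row in names
    ]
    return names, matrix


# Hypothetical band sizes in bp, for illustration only.
profiles = {
    "strain_A": {150, 400, 900, 2000},
    "strain_B": {150, 400, 1200, 2000},
    "strain_C": {250, 900, 5000},
}
names, matrix = dissimilarity_matrix(profiles)
for name, row in zip(names, matrix):
    print(name, [f"{value:.2f}" for value in row])
```

The resulting symmetric matrix is the kind of input a neighbor-joining routine would take to build the dendrogram; that clustering step is not shown here.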
Diversities in Virulence, Antifungal Activity, Pigmentation and DNA Fingerprint among Strains of Burkholderia glumae

PubMed Central

Karki, Hari S.; Shrestha, Bishnu K.; Han, Jae Woo; Groth, Donald E.; Barphagha, Inderjit K.; Rush, Milton C.; Melanson, Rebecca A.; Kim, Beom Seok; Ham, Jong Hyun

2012-01-01

Burkholderia glumae is the primary causal agent of bacterial panicle blight of rice. In this study, 11 naturally avirulent and nine virulent strains of B. glumae native to the southern United States were characterized in terms of virulence in rice and onion, toxoflavin production, antifungal activity, pigmentation and genomic structure. Virulence of B. glumae strains on rice panicles was highly correlated to virulence on onion bulb scales, suggesting that onion bulb can be a convenient alternative host system to efficiently determine the virulence of B. glumae strains. Production of toxoflavin, the phytotoxin that functions as a major virulence factor, was closely associated with the virulence phenotypes of B. glumae strains in rice. Some strains of B. glumae showed various levels of antifungal activity against Rhizoctonia solani, the causal agent of sheath blight, and pigmentation phenotypes on casamino acid-peptone-glucose (CPG) agar plates regardless of their virulence traits. Purple and yellow-green pigments were partially purified from a pigmenting strain of B. glumae, 411gr-6, and the purple pigment fraction showed a strong antifungal activity against Colletotrichum orbiculare. Genetic variations were detected among the B. glumae strains from DNA fingerprinting analyses by repetitive element sequence-based PCR (rep-PCR) for BOX-A1R-based repetitive extragenic palindromic (BOX) or enterobacterial repetitive intergenic consensus (ERIC) sequences of bacteria; and close genetic relatedness among virulent but pigment-deficient strains was revealed by clustering analyses of DNA fingerprints from BOX- and ERIC-PCR. PMID:23028972

Two new miniature inverted-repeat transposable elements in the genome of the clam Donax trunculus.

PubMed

Šatović, Eva; Plohl, Miroslav

2017-10-01

Repetitive sequences are important components of eukaryotic genomes that drive their evolution. Among them are different types of mobile elements that share the ability to spread throughout the genome and form interspersed repeats. To broaden the generally scarce knowledge on bivalves at the genome level, in the clam Donax trunculus we described two new non-autonomous DNA transposons, miniature inverted-repeat transposable elements (MITEs), named DTC M1 and DTC M2. Like other MITEs, they are characterized by their small size, their A + T richness, and the presence of terminal inverted repeats (TIRs). DTC M1 and DTC M2 are 261 and 286 bp long, respectively, and in addition to TIRs, both of them contain a long imperfect palindrome sequence in their central parts. These elements are present in complete and truncated versions within the genome of the clam D. trunculus. The two new MITEs share only structural similarity, but lack any nucleotide sequence similarity to each other. In a search for related elements in databases, blast search revealed within the Crassostrea gigas genome a larger element sharing sequence similarity only to DTC M1 in its TIR sequences. The lack of sequence similarity with any previously published mobile elements indicates that DTC M1 and DTC M2 elements may be unique to D. trunculus.
PstI repeat: a family of short interspersed nucleotide element (SINE)-like sequences in the genomes of cattle, goat, and buffalo.

PubMed

Sheikh, Faruk G; Mukhopadhyay, Sudit S; Gupta, Prabhakar

2002-02-01

The PstI family of elements are short, highly repetitive DNA sequences interspersed throughout the genome of the Bovidae. We have cloned and sequenced some members of the PstI family from cattle, goat, and buffalo. These elements are approximately 500 bp, have a copy number of 2 × 10^5 to 4 × 10^5, and comprise about 4% of the haploid genome. Studies of nucleotide sequence homology indicate that the buffalo and goat PstI repeats (type II) are similar types of short interspersed nucleotide element (SINE) sequences, but the cattle PstI repeat (type I) is considerably more divergent. Additionally, the goat PstI sequence showed significant sequence homology with bovine serine tRNA, and is therefore likely derived from serine tRNA. Interestingly, Southern hybridization suggests that both types of SINEs (I and II) are present in all the species of Bovidae. Dendrogram analysis indicates that cattle PstI SINE is similar to bovine Alu-like SINEs. Goat and buffalo SINEs formed a separate cluster, suggesting that these two types of SINEs evolved separately in the genome of the Bovidae.
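The genome fraction quoted in the abstract above follows from element length times copy number divided by haploid genome size. The sketch below works that arithmetic through; the haploid genome size of 3 × 10^9 bp is an assumed round figure for illustration only and is not stated in the abstract.

```python
def repeat_fraction(element_bp, copy_number, genome_bp):
    """Fraction of the genome occupied by a repeat family."""
    return element_bp * copy_number / genome_bp


# ASSUMPTION: round illustrative haploid genome size, not taken from the abstract.
ASSUMED_GENOME_BP = 3.0e9

# Element length and copy-number range as reported in the abstract.
low = repeat_fraction(500, 2e5, ASSUMED_GENOME_BP)
high = repeat_fraction(500, 4e5, ASSUMED_GENOME_BP)

print(f"{low:.1%} to {high:.1%} of the assumed genome")
```

Under this assumed genome size the copy-number range brackets the reported figure of about 4%.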
Stimulus-Category and Response-Repetition Effects in Task Switching: An Evaluation of Four Explanations

ERIC Educational Resources Information Center

Druey, Michel D.

2014-01-01

In many task-switch studies, task sequence and response sequence interact: Response repetitions produce benefits when the task repeats but produce costs when the task switches. Four different theoretical frameworks have been proposed to explain these effects: a reconfiguration-based account, association-learning models, an episodic-retrieval…
Identification and chromosome mapping of repetitive elements in the Astyanax scabripinnis (Teleostei: Characidae) species complex.

PubMed

Barbosa, Patrícia; de Oliveira, Luiz Antonio; Pucci, Marcela Baer; Santos, Mateus Henrique; Moreira-Filho, Orlando; Vicari, Marcelo Ricardo; Nogaroto, Viviane; de Almeida, Mara Cristina; Artoni, Roberto Ferreira

2015-02-01

Most of the eukaryotic genome is composed of repeated sequences or multiple copies of DNA, which were considered "junk DNA" and may be associated with heterochromatin. In this study, three populations of Astyanax aff. scabripinnis from Brazilian rivers of Guaratinguetá and Pindamonhangaba (São Paulo) and a population from Maringá (Paraná) were analyzed concerning the localization of the nucleolar organizer regions (Ag-NORs), the As51 satellite DNA, the 18S ribosomal DNA (rDNA), and the 5S rDNA. Repeated sequences were also isolated and identified by the Cot-1 method, which indicated similarity (90%) with the LINE UnaL2 retrotransposon. The fluorescence in situ hybridization (FISH) showed the retrotransposon dispersed and more concentrated markers in centromeric and telomeric chromosomal regions. These sequences were co-localized and interspaced with 18S and 5S rDNA and As51, as confirmed by fiber-FISH assay. The B chromosome found in these populations showed conspicuous hybridization with the LINE probe, which is also co-located in As51 sequences. The NORs were active at unique sites of a homologous pair in the three populations. There was no evidence in our analyses that transposable elements and repetitive DNA influenced the transcriptional regulation of ribosomal genes.
Structure and Function of Centromeric and Pericentromeric Heterochromatin in Arabidopsis thaliana.

PubMed

Simon, Lauriane; Voisin, Maxime; Tatout, Christophe; Probst, Aline V

2015-01-01

The centromere is a specific chromosomal region where the kinetochore assembles to ensure the faithful segregation of sister chromatids during mitosis and meiosis. Centromeres are defined by a local enrichment of the specific histone variant CenH3 mostly at repetitive satellite sequences. A larger pericentromeric region containing repetitive sequences and transposable elements surrounds the centromere that adopts a particular chromatin state characterized by specific histone variants and post-translational modifications and forms a transcriptionally repressive chromosomal environment. In the model organism Arabidopsis thaliana centromeric and pericentromeric domains form conspicuous heterochromatin clusters called chromocenters in interphase. Here we discuss, using Arabidopsis as example, recent insight into mechanisms involved in maintenance and establishment of centromeric and pericentromeric chromatin signatures as well as in chromocenter formation.
Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

PubMed

Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

2010-07-16

Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. 
A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.
Genomic vulnerability to LINE-1 hypomethylation is a potential determinant of the clinicogenetic features of multiple myeloma

PubMed Central

2012-01-01

Background The aim of this study was to clarify the role of global hypomethylation of repetitive elements in determining the genetic and clinical features of multiple myeloma (MM). Methods We assessed global methylation levels using four repetitive elements (long interspersed nuclear element-1 (LINE-1), Alu Ya5, Alu Yb8, and Satellite-α) in clinical samples comprising 74 MM samples and 11 benign control samples (7 cases of monoclonal gammopathy of undetermined significance (MGUS) and 4 samples of normal plasma cells (NPC)). We also evaluated copy-number alterations using array-based comparative genomic hybridization, and performed methyl-CpG binding domain sequencing (MBD-seq). Results Global levels of the repetitive-element methylation declined with the degree of malignancy of plasma cells (NPC>MGUS>MM), and there was a significant inverse correlation between the degree of genomic loss and the LINE-1 methylation levels. We identified 80 genomic loci as common breakpoints (CBPs) around commonly lost regions, which were significantly associated with increased LINE-1 densities. MBD-seq analysis revealed that average DNA-methylation levels at the CBP loci and relative methylation levels in regions with higher LINE-1 densities also declined during the development of MM. We confirmed that levels of methylation of the 5' untranslated region of respective LINE-1 loci correlated strongly with global LINE-1 methylation levels. Finally, there was a significant association between LINE-1 hypomethylation and poorer overall survival (hazard ratio 2.8, P = 0.015). Conclusion Global hypomethylation of LINE-1 is associated with the progression of and poorer prognosis for MM, possibly due to frequent copy-number loss. PMID:23259664
Superior ab initio identification, annotation and characterisation of TEs and segmental duplications from genome assemblies.

PubMed

Zeng, Lu; Kortschak, R Daniel; Raison, Joy M; Bertozzi, Terry; Adelson, David L

2018-01-01

Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package.
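The clustering step described in the abstract above groups sequences into families by single linkage: any two sequences joined by a chain of pairwise alignments end up in the same family. The sketch below illustrates that idea with a union-find structure; it is not the CARP or krishna implementation, and the function name, integer sequence ids, and example pairs are hypothetical.

```python
def single_linkage_families(n_sequences, aligned_pairs):
    """Group sequence ids 0..n-1 into families from pairwise alignment hits.

    Two sequences share a family if a chain of aligned pairs connects them.
    """
    parent = list(range(n_sequences))

    def find(i):
        # Follow parent links to the root, halving the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for a, b in aligned_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    families = {}
    for i in range(n_sequences):
        families.setdefault(find(i), []).append(i)
    return sorted(families.values())


# Hypothetical alignment hits between six sequences.
pairs = [(0, 1), (1, 2), (3, 4)]
families = single_linkage_families(6, pairs)
print(families)
```

Sequence 5 has no hits and stays a singleton, while 0 and 2 are joined only through 1, which is the defining behaviour of single linkage.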
Superior ab initio identification, annotation and characterisation of TEs and segmental duplications from genome assemblies

PubMed Central

Zeng, Lu; Kortschak, R. Daniel; Raison, Joy M.

2018-01-01

Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package. PMID:29538441
Transposable elements in fish chromosomes: a study in the marine cobia species.

PubMed

Costa, G W W F; Cioffi, M B; Bertollo, L A C; Molina, W F

2013-01-01

Rachycentron canadum, a unique representative of the Rachycentridae family, has been the subject of considerable biotechnological interest due to its potential use in marine fish farming. This species has undergone extensive research concerning the location of genes and multigene families on its chromosomes. Although most of the genome of some organisms is composed of repeated DNA sequences, aspects of the origin and dispersion of these elements are still largely unknown. The physical mapping of repetitive sequences on the chromosomes of R. canadum proved to be relevant for evolutionary and applied purposes. Therefore, here, we present the mapping by fluorescence in situ hybridization of the transposable element (TE) Tol2, the non-LTR retrotransposons Rex1 and Rex3, together with the 18S and 5S rRNA genes in the chromosome of this species. The Tol2 TE, belonging to the family of hAT transposons, is homogeneously distributed in the euchromatic regions of the chromosomes but with huge colocalization with the 18S rDNA sites. The hybridization signals for Rex1 and Rex3 revealed a semi-arbitrary distribution pattern, presenting differentiated dispersion in euchromatic and heterochromatic regions. Rex1 elements are associated preferentially in heterochromatic regions, while Rex3 shows a scarce distribution in the euchromatic regions of the chromosomes. The colocalization of TEs with 18S and 5S rDNA revealed complex chromosomal regions of repetitive sequences. In addition, the nonpreferential distribution of Rex1 and Rex3 in all heterochromatic regions, as well as the preferential distribution of the Tol2 transposon associated with 18S rDNA sequences, reveals a distinct pattern of organization of TEs in the genome of this species. A heterogeneous chromosomal colonization of TEs may confer different evolutionary rates to the heterochromatic regions of this species.
Factor structure of paediatric timed motor examination and its relationship with IQ

PubMed Central

MARTIN, REBECCA; TIGERA, CASSIE; DENCKLA, MARTHA B; MAHONE, E MARK

2012-01-01

AIM Brain systems supporting higher cognitive and motor control develop in a parallel manner, dependent on functional integrity and maturation of related regions, suggesting neighbouring neural circuitry. Concurrent examination of motor and cognitive control can provide a window into neurological development. However, identification of performance-based measures that do not correlate with IQ has been a challenge. METHOD Timed motor performance from the Physical and Neurological Examination of Subtle Signs and IQ were analysed in 136 children aged 6 to 16 (mean age 10y 2.6mo, SD 2y 6.4mo; 98 female, 38 male) attending an outpatient neuropsychology clinic and 136 right-handed comparison individuals aged 6 to 16 (mean age 10y 3.1mo, SD 2y 6.1mo; 98 female, 38 male). Timed activities – three repetitive movements (toe tapping, hand patting, finger tapping) and three sequenced movements (heel–toe tap, hand pronate/supinate, finger sequencing) each performed on the right and left – were included in exploratory factor analyses. RESULTS Among comparison individuals, factor analysis yielded two factors – repetitive and sequenced movements – with the sequenced factor significantly predictive of Verbal IQ (VIQ) (ΔR²=0.018, p=0.019), but not the repetitive factor (ΔR²=0.004, p=0.39). Factor analysis within the clinical group yielded two similar factors (repetitive and sequenced), both significantly predictive of VIQ (ΔR²=0.028, p=0.015; ΔR²=0.046, p=0.002 respectively). INTERPRETATION Among typical children, repetitive timed tasks may be independent of IQ; however, sequenced tasks share more variance, implying shared neural substrates. Among neurologically vulnerable populations, however, both sequenced and repetitive movements covary with IQ, suggesting that repetitive speed is more indicative of underlying neurological integrity. PMID:20412260
Molecular Strain Typing of Mycobacterium tuberculosis: a Review of Frequently Used Methods

PubMed Central

2016-01-01

Tuberculosis, caused by the bacterium Mycobacterium tuberculosis, remains one of the most serious global health problems. Molecular typing of M. tuberculosis has been used for various epidemiologic purposes as well as for clinical management. Currently, many techniques are available to type M. tuberculosis. Choosing the most appropriate technique in accordance with the existing laboratory conditions and the specific features of the geographic region is important. Insertion sequence IS6110-based restriction fragment length polymorphism (RFLP) analysis is considered the gold standard for the molecular epidemiologic investigations of tuberculosis. However, other polymerase chain reaction-based methods such as spacer oligonucleotide typing (spoligotyping), which detects 43 spacer sequence-interspersing direct repeats (DRs) in the genomic DR region; mycobacterial interspersed repetitive units–variable number tandem repeats, (MIRU-VNTR), which determines the number and size of tandem repetitive DNA sequences; repetitive-sequence-based PCR (rep-PCR), which provides high-throughput genotypic fingerprinting of multiple Mycobacterium species; and the recently developed genome-based whole genome sequencing methods demonstrate similar discriminatory power and greater convenience. This review focuses on techniques frequently used for the molecular typing of M. tuberculosis and discusses their general aspects and applications. PMID:27709842
Molecular Strain Typing of Mycobacterium tuberculosis: a Review of Frequently Used Methods.

PubMed

Ei, Phyu Win; Aung, Wah Wah; Lee, Jong Seok; Choi, Go Eun; Chang, Chulhun L

2016-11-01

Tuberculosis, caused by the bacterium Mycobacterium tuberculosis, remains one of the most serious global health problems. Molecular typing of M. tuberculosis has been used for various epidemiologic purposes as well as for clinical management. Currently, many techniques are available to type M. tuberculosis. Choosing the most appropriate technique in accordance with the existing laboratory conditions and the specific features of the geographic region is important. Insertion sequence IS6110-based restriction fragment length polymorphism (RFLP) analysis is considered the gold standard for the molecular epidemiologic investigations of tuberculosis. However, other polymerase chain reaction-based methods such as spacer oligonucleotide typing (spoligotyping), which detects 43 spacer sequence-interspersing direct repeats (DRs) in the genomic DR region; mycobacterial interspersed repetitive units-variable number tandem repeats, (MIRU-VNTR), which determines the number and size of tandem repetitive DNA sequences; repetitive-sequence-based PCR (rep-PCR), which provides high-throughput genotypic fingerprinting of multiple Mycobacterium species; and the recently developed genome-based whole genome sequencing methods demonstrate similar discriminatory power and greater convenience. This review focuses on techniques frequently used for the molecular typing of M. tuberculosis and discusses their general aspects and applications.
Comparative analysis on the structural features of the 5' flanking region of κ-casein genes from six different species

PubMed Central

Gerencsér, Ákos; Barta, Endre; Boa, Simon; Kastanis, Petros; Bösze, Zsuzsanna; Whitelaw, C Bruce A

2002-01-01

κ-casein plays an essential role in the formation, stabilisation and aggregation of milk micelles. Control of κ-casein expression reflects this essential role, although an understanding of the mechanisms involved lags behind that of the other milk protein genes. We determined the 5'-flanking sequences for the murine, rabbit and human κ-casein genes and compared them to the published ruminant sequences. The most conserved region was not the proximal promoter region but an approximately 400 bp long region centred 800 bp upstream of the TATA box. This region contained two highly conserved MGF/STAT5 sites with common spacing relative to each other. In this region, six conserved short stretches of similarity were also found which did not correspond to known transcription factor consensus sites. In contrast to the ruminant and human 5' regulatory sequences, the rabbit and murine 5'-flanking regions did not harbour any kind of repetitive elements. We generated a phylogenetic tree of the six species based on multiple alignment of the κ-casein sequences. This study identified conserved candidate transcriptional regulatory elements within the κ-casein gene promoter. PMID:11929628
Atrx promotes heterochromatin formation at retrotransposons

PubMed Central

Sadic, Dennis; Schmidt, Katharina; Groh, Sophia; Kondofersky, Ivan; Ellwart, Joachim; Fuchs, Christiane; Theis, Fabian J; Schotta, Gunnar

2015-01-01

More than 50% of mammalian genomes consist of retrotransposon sequences. Silencing of retrotransposons by heterochromatin is essential to ensure genomic stability and transcriptional integrity. Here, we identified a short sequence element in intracisternal A particle (IAP) retrotransposons that is sufficient to trigger heterochromatin formation. We used this sequence in a genome-wide shRNA screen and identified the chromatin remodeler Atrx as a novel regulator of IAP silencing. Atrx binds to IAP elements and is necessary for efficient heterochromatin formation. In addition, Atrx facilitates a robust and largely inaccessible heterochromatin structure as Atrx knockout cells display increased chromatin accessibility at retrotransposons and non-repetitive heterochromatic loci. In summary, we demonstrate a direct role of Atrx in the establishment and robust maintenance of heterochromatin. PMID:26012739
Molecular architecture of classical cytological landmarks: Centromeres and telomeres

DOE Office of Scientific and Technical Information (OSTI.GOV)

Meyne, J.

1994-11-01

Both the human telomere repeat and the pericentromeric repeat sequence (GGAAT)n were isolated based on evolutionary conservation. Their isolation was based on the premise that chromosomal features as structurally and functionally important as telomeres and centromeres should be highly conserved. Both sequences were isolated by high stringency screening of a human repetitive DNA library with rodent repetitive DNA. The pHuR library (plasmid Human Repeat) used for this project was enriched for repetitive DNA by using a modification of the standard DNA library preparation method. Usually DNA for a library is cut with restriction enzymes, packaged, infected, and the library is screened. A problem with this approach is that many tandem repeats don't have any (or many) common restriction sites. Therefore, many of the repeat sequences will not be represented in the library because they are not restricted to a viable length for the vector used. To prepare the pHuR library, human DNA was mechanically sheared to a small size. These relatively short DNA fragments were denatured and then renatured to Cot 50. Theoretically only repetitive DNA sequences should renature under Cot 50 conditions. The single-stranded regions were digested using S1 nuclease, leaving the double-stranded, renatured repeat sequences.
A new family of satellite DNA sequences as a major component of centromeric heterochromatin in owls (Strigiformes).

PubMed

Yamada, Kazuhiko; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2004-03-01

We isolated a new family of satellite DNA sequences from HaeIII- and EcoRI-digested genomic DNA of the Blakiston's fish owl (Ketupa blakistoni). The repetitive sequences were organized in tandem arrays of the 174 bp element, and localized to the centromeric regions of all macrochromosomes, including the Z and W chromosomes, and microchromosomes. This hybridization pattern was consistent with the distribution of C-band-positive centromeric heterochromatin, and the satellite DNA sequences occupied 10% of the total genome as a major component of centromeric heterochromatin. The sequences were homogenized between macro- and microchromosomes in this species, and therefore intraspecific divergence of the nucleotide sequences was low. The 174 bp element cross-hybridized to the genomic DNA of six other Strigidae species, but not to that of the Tytonidae, suggesting that the satellite DNA sequences are conserved in the same family but fairly divergent between the different families in the Strigiformes. Secondly, the centromeric satellite DNAs were cloned from eight Strigidae species, and the nucleotide sequences of 41 monomer fragments were compared within and between species. Molecular phylogenetic relationships of the nucleotide sequences were highly correlated with both the taxonomy based on morphological traits and the phylogenetic tree constructed by DNA-DNA hybridization. These results suggest that the satellite DNA sequence has evolved by concerted evolution in the Strigidae and that it is a good taxonomic and phylogenetic marker to examine genetic diversity between Strigiformes species.
Genomic Organization of Repetitive DNA in Woodpeckers (Aves, Piciformes): Implications for Karyotype and ZW Sex Chromosome Differentiation

PubMed Central

Kretschmer, Rafael; Bertocchi, Natasha Avila; Degrandi, Tiago Marafiga; de Oliveira, Edivaldo Herculano Corrêa; Cioffi, Marcelo de Bello; Garnero, Analía del Valle; Gunski, Ricardo José

2017-01-01

Birds are characterized by a low proportion of repetitive DNA in their genome when compared to other vertebrates. Among birds, species belonging to the order Piciformes, such as woodpeckers, show a relatively higher amount of these sequences. The aim of this study was to analyze the distribution of different classes of repetitive DNA (including microsatellites, telomere sequences and 18S rDNA) in the karyotype of three Picidae species (Aves, Piciformes), Colaptes melanochloros (2n = 84), Colaptes campestris (2n = 84) and Melanerpes candidus (2n = 64), by means of fluorescence in situ hybridization. Clusters of 18S rDNA were found in one microchromosome pair in each of the three species, coinciding with a region of (CGG)10 sequence accumulation. Interstitial telomeric sequences were found in some macrochromosome pairs, indicating possible regions of fusions, which can be related to variation of diploid number in the family. Only one of the 11 different microsatellite sequences used did not produce any signals. Both species of genus Colaptes showed a similar distribution of microsatellite sequences, with some difference when compared to M. candidus. Microsatellites were found preferentially in the centromeric and telomeric regions of micro and macrochromosomes. However, some sequences produced patterns of interstitial bands in the Z chromosome, which corresponds to the largest element of the karyotype in all three species. This was not observed in the W chromosome of Colaptes melanochloros, which is heterochromatic in most of its length, but was not hybridized by any of the sequences used. These results highlight the importance of microsatellite sequences in differentiation of sex chromosomes, and the accumulation of these sequences is probably responsible for the enlargement of the Z chromosome. PMID:28081238
Genome wide survey, discovery and evolution of repetitive elements in three Entamoeba species

PubMed Central

Lorenzi, Hernan; Thiagarajan, Mathangi; Haas, Brian; Wortman, Jennifer; Hall, Neil; Caler, Elisabet

2008-01-01

Background Identification and mapping of repetitive elements is a key step for accurate gene prediction and overall structural annotation of genomes. During the assembly and annotation of three highly repetitive amoeba genomes, Entamoeba histolytica, Entamoeba dispar, and Entamoeba invadens, we performed comparative sequence analysis to identify and map all class I and class II transposable elements in their sequences. Results Here, we report the identification of two novel Entamoeba-specific repeats: ERE1 and ERE2; ERE1 is spread across the three genomes and associated with different repeats in a species-specific manner, while ERE2 is unique to E. histolytica. We also report the identification of two novel subfamilies of LINE and SINE retrotransposons in E. dispar and provide evidence for how the different LINE and SINE subfamilies evolved in these species. Additionally, we found a putative transposase-coding gene in E. histolytica and E. dispar related to the mariner transposon Hydargos from E. invadens. The distribution of transposable elements in these genomes is markedly skewed with a tendency of forming clusters. More than 70% of the three genomes have a repeat density below their corresponding average value indicating that transposable elements are not evenly distributed. We show that repeats and repeat-clusters are found at syntenic break points between E. histolytica and E. dispar and hence, could work as recombination hot spots promoting genome rearrangements. Conclusion The mapping of all transposable elements found in these parasites shows that repeat coverage is up to three times higher than previously reported. LINE, ERE1 and mariner elements were present in the common ancestor to the three Entamoeba species while ERE2 was likely acquired by E. histolytica after its separation from E. dispar. We demonstrate that E. histolytica and E. dispar share their entire repertoire of LINE and SINE retrotransposons and that Eh_SINE3/Ed_SINE1 originated as a chimeric SINE from Eh/Ed_SINE2 and Eh_SINE1/Ed_SINE3. Our work shows that transposable elements are organized in clusters, frequently found at syntenic break points providing insights into their contribution to chromosome instability and therefore, to genomic variation and speciation in these parasites. PMID:19077187
Gene-enriched draft genome of the cattle tick Rhipicephalus microplus: assembly by the hybrid Pacific Biosciences/Illumina approach enabled analysis of the highly repetitive genome.

PubMed

Barrero, Roberto A; Guerrero, Felix D; Black, Michael; McCooke, John; Chapman, Brett; Schilkey, Faye; Pérez de León, Adalberto A; Miller, Robert J; Bruns, Sara; Dobry, Jason; Mikhaylenko, Galina; Stormo, Keith; Bell, Callum; Tao, Quanzhou; Bogden, Robert; Moolhuijzen, Paula M; Hunter, Adam; Bellgard, Matthew I

2017-08-01

The genome of the cattle tick Rhipicephalus microplus, an ectoparasite with global distribution, is estimated to be 7.1Gbp in length and consists of approximately 70% repetitive DNA. We report the draft assembly of a tick genome that utilized a hybrid sequencing and assembly approach to capture the repetitive fractions of the genome. Our hybrid approach produced an assembly consisting of 2.0Gbp represented in 195,170 scaffolds with an N50 of 60,284bp. The Rmi v2.0 assembly is 51.46% repetitive with a large fraction of unclassified repeats, short interspersed elements, long interspersed elements and long terminal repeats. We identified 38,827 putative R. microplus gene loci, of which 24,758 were protein coding genes (≥100 amino acids). OrthoMCL comparative analysis against 11 selected species including insects and vertebrates identified 10,835 and 3,423 protein coding gene loci that are unique to R. microplus or common to both R. microplus and Ixodes scapularis ticks, respectively. We identified 191 microRNA loci, of which 168 have similarity to known miRNAs and 23 represent novel miRNA families. We identified the genomic loci of several highly divergent R. microplus esterases with sequence similarity to acetylcholinesterase. Additionally, we report the finding of a novel cytochrome P450 CYP41 homolog that shows protein folding structures similar to those of CYP41 proteins known to be involved in acaricide resistance. Copyright © 2017 Australian Society for Parasitology. Published by Elsevier Ltd. All rights reserved.
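
For readers unfamiliar with the N50 statistic quoted in the abstract above, the following is a minimal sketch of how it is commonly computed: scaffolds are sorted from longest to shortest, and N50 is the length of the scaffold at which the running total first reaches half of the assembly size. The function name `n50` and the toy scaffold lengths are illustrative assumptions; they are not taken from the Rmi v2.0 assembly or from the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    N50 is the length L such that scaffolds of length >= L together
    contain at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)  # longest scaffold first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


# Hypothetical toy assembly (not real data): total length is 300.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))
```

With these toy values the running total reaches half of the assembly (150) at the second-longest scaffold, so the sketch reports 70.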

Rat prostatic steroid binding protein: DNA sequence and transcript maps of the two C3 genes.

PubMed Central

Hurst, H C; Parker, M G

1983-01-01

In the rat there are two non-allelic genes C3(1) and C3(2) for the C3 polypeptide of prostatic steroid binding protein. We have cloned and sequenced both genes and show that only C3(1) is responsible for the production of authentic C3. Although there is a marked difference in their transcriptional activity, the two genes share extensive DNA sequence homology, there being only one base difference from nucleotide -235 to within the first intron. Transcript mapping has shown that there are two distinct C3 transcripts which share a unique 3' terminus but have 5' termini 38 bases apart each preceded by a 'TATA' box homology. Interestingly, an identical repetitive element is present just upstream of both genes. Both families of transcripts, which are produced in a ratio of 18:1, are coordinately regulated by testosterone. PMID:6685625
On the role of the SMA in the discrete sequence production task: a TMS study. Transcranial Magnetic Stimulation.

PubMed

Verwey, Willem B; Lammens, Robin; van Honk, Jack

2002-01-01

Participants practiced two discrete six-key sequences for a total of 420 trials. The 1 x 6 sequence had a unique order of key presses while the 2 x 3 sequence involved repetition of a three-key segment. Both sequences showed a long interkey interval halfway through the sequence, indicating hierarchical sequence control in that not only the 2 x 3 but also the 1 x 6 sequence was executed as two successive motor chunks. In addition, the second part of both sequences was executed faster than the first part. This supports the earlier notion of a motor processor executing the elements of familiar motor chunks and a cognitive processor triggering either these motor chunks or individual sequence elements. Low-frequency, off-line transcranial magnetic stimulation (TMS) of the supplementary motor area (SMA) counteracted normal improvement with practice of key presses at all sequence positions. Together, these results are in line with the notion that with moderate practice, the SMA executes short sequence fragments that are concatenated by other brain structures.
Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies.

PubMed Central

Wincker, P; Jubier-Maurin, V; Roizès, G

1987-01-01

Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies. PMID:3684566
Cocaine dynamically regulates heterochromatin and repetitive element unsilencing in nucleus accumbens.

PubMed

Maze, Ian; Feng, Jian; Wilkinson, Matthew B; Sun, HaoSheng; Shen, Li; Nestler, Eric J

2011-02-15

Repeated cocaine exposure induces persistent alterations in genome-wide transcriptional regulatory networks, chromatin remodeling activity and, ultimately, gene expression profiles in the brain's reward circuitry. Virtually all previous investigations have centered on drug-mediated effects occurring throughout active euchromatic regions of the genome, with very little known concerning the impact of cocaine exposure on the regulation and maintenance of heterochromatin in adult brain. Here, we report that cocaine dramatically and dynamically alters heterochromatic histone H3 lysine 9 trimethylation (H3K9me3) in the nucleus accumbens (NAc), a key brain reward region. Furthermore, we demonstrate that repeated cocaine exposure causes persistent decreases in heterochromatization in this brain region, suggesting a potential role for heterochromatic regulation in the long-term actions of cocaine. To identify precise genomic loci affected by these alterations, chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-Seq) was performed on NAc. ChIP-Seq analyses confirmed the existence of the H3K9me3 mark mainly within intergenic regions of the genome and identified specific patterns of cocaine-induced H3K9me3 regulation at repetitive genomic sequences. Cocaine-mediated decreases in H3K9me3 enrichment at specific genomic repeats [e.g., long interspersed nuclear element (LINE)-1 repeats] were further confirmed by the increased expression of LINE-1 retrotransposon-associated repetitive elements in NAc. Such increases likely reflect global patterns of genomic destabilization in this brain region after repeated cocaine administration and open the door for future investigations into the epigenetic and genetic basis of drug addiction.
Retroposition of the AFC family of SINEs (short interspersed repetitive elements) before and during the adaptive radiation of cichlid fishes in Lake Malawi and related inferences about phylogeny.

PubMed

Takahashi, K; Nishida, M; Yuma, M; Okada, N

2001-01-01

Lake Malawi is home to more than 450 species of endemic cichlids, which provide a spectacular example of adaptive radiation. To clarify the phylogenetic relationships among these fish, we examined the presence and absence of SINEs (short interspersed repetitive elements) at orthologous loci. We identified six loci at which a SINE sequence had apparently been specifically inserted by retroposition in the common ancestor of all the investigated species of endemic cichlids in Lake Malawi. At another locus, unique sharing of a SINE sequence was evident among all the investigated species of endemic non-Mbuna cichlids with the exception of Rhamphochromis sp. The relationships were in good agreement with those deduced in previous studies with various different markers, demonstrating that the SINE method is useful for the elucidation of phylogenetic relationships among cichlids in Lake Malawi. We also characterized a locus that exhibited transspecies polymorphism with respect to the presence or absence of the SINE sequence among non-Mbuna species. This result suggests that incomplete lineage sorting and/or interspecific hybridization might have occurred or be occurring among the species in this group, which might potentially cause misinterpretation of phylogenetic data, in particular when a single-locus marker, such as a sequence in the mitochondrial DNA, is used for analysis.
Molecular Cloning and Analysis of a DNA Repetitive Element from the Mouse Genome

ERIC Educational Resources Information Center

Geisinger, Adriana; Cossio, Gabriela; Wettstein, Rodolfo

2006-01-01

We report the development of a 3-week laboratory activity for an undergraduate molecular biology course. This activity introduces students to the practice of basic molecular techniques such as restriction enzyme digestion, agarose gel electrophoresis, cloning, plasmid DNA purification, Southern blotting, and sequencing. Students learn how to carry…
Optimization and comparative analysis of plant organellar DNA enrichment methods suitable for next generation sequencing

USDA-ARS's Scientific Manuscript database

Plant organellar genomes contain large repetitive elements that may undergo pairing or recombination to form complex structures and/or sub-genomic fragments. Organellar genomes also exist in admixtures within a given cell or tissue type (heteroplasmy) and abundance of sub-types may change through de...
Genomic Organization of Repetitive DNA Elements and Its Implications for the Chromosomal Evolution of Channid Fishes (Actinopterygii, Perciformes).

PubMed

Cioffi, Marcelo de Bello; Bertollo, Luiz Antonio Carlos; Villa, Mateo Andres; de Oliveira, Ezequiel Aguiar; Tanomtong, Alongklod; Yano, Cassia Fernanda; Supiwong, Weerayuth; Chaveerach, Arunrat

2015-01-01

Channid fishes, commonly referred to as "snakeheads", are currently very important in Asian fishery and aquaculture due to the substantial decline in natural populations because of overexploitation. A large degree of chromosomal variation has been found in this family, mainly through the use of conventional cytogenetic investigations. In this study, we analyzed the karyotype structure and the distribution of 7 repetitive DNA sequences in several Channa species from different Thailand river basins. The aim of this study was to investigate the chromosomal differentiation among species and populations to improve upon the knowledge of its biodiversity and evolutionary history. Rearrangements, such as pericentric inversions, fusions and polyploidization, appear to be important events during the karyotypic evolution of this genus, resulting in the chromosomal diversity observed among the distinct species and even among populations of the same species. In addition, such variability is also increased by the genomic dynamism of repetitive elements, particularly by the differential distribution and accumulation of rDNA sequences on chromosomes. This marked diversity is likely linked to the lifestyle of the snakehead fishes and their population fragmentation, as already identified for other fish species. The karyotypic features highlight the biodiversity of the channid fishes and justify a taxonomic revision of the genus Channa, as well as of the Channidae family as a whole, as some nominal species may actually constitute species complexes.
Evaluation of an automated repetitive sequence-based PCR system for subtyping Enterobacter sakazakii.

PubMed

Healy, B; Mullane, N; Collin, V; Mailler, S; Iversen, C; Chatellier, S; Storrs, M; Fanning, S

2008-07-01

Enterobacter sakazakii is regarded as a ubiquitous organism that can be isolated from a wide range of foods and environments. Infection in at-risk infants has been epidemiologically linked to the consumption of contaminated powdered infant formula. Preventing the dissemination of this pathogen in a powdered infant formula manufacturing facility is an important step in ensuring consumer confidence in a given brand together with the protection of the health status of a vulnerable population. In this study we report the application of a repetitive sequence-based PCR typing method to subtype a previously well-characterized collection of E. sakazakii isolates of diverse origin. While both methods successfully discriminated between the collection of isolates, repetitive sequence-based PCR identified 65 types, whereas pulsed-field gel electrophoresis identified 110 types showing ≥95% similarity. The method was quick and easy to perform, and our data demonstrated the utility and value of this approach to monitor in-process contamination, which could potentially contribute to a reduction in the transmission of E. sakazakii.
New families of site-specific repetitive DNA sequences that comprise constitutive heterochromatin of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia).

PubMed

Yamada, Kazuhiko; Kamimura, Eikichi; Kondo, Mariko; Tsuchiya, Kimiyuki; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2006-02-01

We molecularly cloned new families of site-specific repetitive DNA sequences from BglII- and EcoRI-digested genomic DNA of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia) and characterized them by chromosome in situ hybridization and filter hybridization. They were classified into six different types of repetitive DNA sequence families according to chromosomal distribution and genome organization. The hybridization patterns of the sequences were consistent with the distribution of C-positive bands and/or Hoechst-stained heterochromatin. The centromeric major satellite DNA and sex chromosome-specific and telomeric region-specific repetitive sequences were conserved in the same genus (Mesocricetus) but divergent in different genera. The chromosome-2-specific sequence was conserved in two genera, Mesocricetus and Cricetulus, and a low copy number of repetitive sequences on the heterochromatic chromosome arms were conserved in the subfamily Cricetinae but not in the subfamily Calomyscinae. By contrast, the other type of repetitive sequences on the heterochromatic chromosome arms, which had sequence similarities to a LINE sequence of rodents, was conserved through the three subfamilies, Cricetinae, Calomyscinae and Murinae. The nucleotide divergence of the repetitive sequences of heterochromatin was well correlated with the phylogenetic relationships of the Cricetinae species, and each sequence has been independently amplified and diverged in the same genome.
ELECTRONIC PULSE SCALING CIRCUITS

DOEpatents

Cooke-Yarborough, E.H.

1958-11-18

Electronic pulse scaling circuits of the kind comprising a series of bi-stable elements connected in sequence, usually in the form of a ring so as to be cyclically repetitive at the highest scaling factor, are described. The scaling circuit comprises a ring system of bi-stable elements each arranged on turn-off to cause a succeeding element of the ring to be turned-on, and one being arranged on turn-off to cause a further element of the ring to be turned-on. In addition, separate means are provided for applying a turn-off pulse to all the elements simultaneously, and for resetting the elements to a starting condition at the end of each cycle.
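
The ring behaviour described in the abstract above can be illustrated with a small simulation. This is only a simplified sketch under stated assumptions: exactly one element is on at a time, each input pulse turns the active element off, and that turn-off switches on its successor. It does not model the additional link by which one element's turn-off also switches on a further element, nor any circuit-level detail of the patent. The function name `ring_counter` and its parameters are hypothetical.

```python
def ring_counter(n_elements, n_pulses):
    """Simulate a ring of bi-stable elements with exactly one element on.

    Returns the index of the element that is on after the reset and
    after each input pulse (a list of n_pulses + 1 indices).
    """
    state = [False] * n_elements
    state[0] = True  # reset: element 0 is the assumed starting condition
    history = [0]
    for _ in range(n_pulses):
        on = state.index(True)
        successor = (on + 1) % n_elements  # ring wrap-around makes the count cyclic
        state[on] = False         # the pulse turns the active element off ...
        state[successor] = True   # ... and its turn-off switches on the next one
        history.append(successor)
    return history


# A ring of 5 elements returns to its start after 5 pulses (a scale-of-five count).
print(ring_counter(5, 7))
```

In this toy model the scaling factor equals the number of elements in the ring: the active position wraps back to the starting element once per `n_elements` pulses.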
The Foldback-like element Galileo belongs to the P superfamily of DNA transposons and is widespread within the Drosophila genus.

PubMed

Marzo, Mar; Puig, Marta; Ruiz, Alfredo

2008-02-26

Galileo is the only transposable element (TE) known to have generated natural chromosomal inversions in the genus Drosophila. It was discovered in Drosophila buzzatii and classified as a Foldback-like element because of its long, internally repetitive, terminal inverted repeats (TIRs) and lack of coding capacity. Here, we characterized a seemingly complete copy of Galileo from the D. buzzatii genome. It is 5,406 bp long, possesses 1,229-bp TIRs, and encodes a 912-aa transposase similar to those of the Drosophila melanogaster 1360 (Hoppel) and P elements. We also searched the recently available genome sequences of 12 Drosophila species for elements similar to Dbuz\Galileo by using bioinformatic tools. Galileo was found in six species (ananassae, willistoni, pseudoobscura, persimilis, virilis, and mojavensis) from the two main lineages within the Drosophila genus. Our observations place Galileo within the P superfamily of cut-and-paste transposons and extend considerably its phylogenetic distribution. The interspecific distribution of Galileo indicates an ancient presence in the genus, but the phylogenetic tree built with the transposase amino acid sequences contrasts significantly with that of the species, indicating lineage sorting and/or horizontal transfer events. Our results also suggest that Foldback-like elements such as Galileo may evolve from DNA-based transposon ancestors by loss of the transposase gene and disproportionate elongation of TIRs.
The Foldback-like element Galileo belongs to the P superfamily of DNA transposons and is widespread within the Drosophila genus

PubMed Central

Marzo, Mar; Puig, Marta; Ruiz, Alfredo

2008-01-01

Galileo is the only transposable element (TE) known to have generated natural chromosomal inversions in the genus Drosophila. It was discovered in Drosophila buzzatii and classified as a Foldback-like element because of its long, internally repetitive, terminal inverted repeats (TIRs) and lack of coding capacity. Here, we characterized a seemingly complete copy of Galileo from the D. buzzatii genome. It is 5,406 bp long, possesses 1,229-bp TIRs, and encodes a 912-aa transposase similar to those of the Drosophila melanogaster 1360 (Hoppel) and P elements. We also searched the recently available genome sequences of 12 Drosophila species for elements similar to Dbuz\Galileo by using bioinformatic tools. Galileo was found in six species (ananassae, willistoni, pseudoobscura, persimilis, virilis, and mojavensis) from the two main lineages within the Drosophila genus. Our observations place Galileo within the P superfamily of cut-and-paste transposons and extend considerably its phylogenetic distribution. The interspecific distribution of Galileo indicates an ancient presence in the genus, but the phylogenetic tree built with the transposase amino acid sequences contrasts significantly with that of the species, indicating lineage sorting and/or horizontal transfer events. Our results also suggest that Foldback-like elements such as Galileo may evolve from DNA-based transposon ancestors by loss of the transposase gene and disproportionate elongation of TIRs. PMID:18287066
Molecular structure and chromosome distribution of three repetitive DNA families in Anemone hortensis L. (Ranunculaceae).

PubMed

Mlinarec, Jelena; Chester, Mike; Siljak-Yakovlev, Sonja; Papes, Drazena; Leitch, Andrew R; Besendorfer, Visnja

2009-01-01

The structure, abundance and location of repetitive DNA sequences on chromosomes can characterize the nature of higher plant genomes. Here we report on three new repeat DNA families isolated from Anemone hortensis L.; (i) AhTR1, a family of satellite DNA (stDNA) composed of a 554-561 bp long EcoRV monomer; (ii) AhTR2, a stDNA family composed of a 743 bp long HindIII monomer and; (iii) AhDR, a repeat family composed of a 945 bp long HindIII fragment that exhibits some sequence similarity to Ty3/gypsy-like retroelements. Fluorescence in-situ hybridization (FISH) to metaphase chromosomes of A. hortensis (2n = 16) revealed that both AhTR1 and AhTR2 sequences co-localized with DAPI-positive AT-rich heterochromatic regions. AhTR1 sequences occur at intercalary DAPI bands while AhTR2 sequences occur at 8-10 terminally located heterochromatic blocks. In contrast AhDR sequences are dispersed over all chromosomes as expected of a Ty3/gypsy-like element. AhTR2 and AhTR1 repeat families include polyA- and polyT-tracks, AT/TA-motifs and a pentanucleotide sequence (CAAAA) that may have consequences for chromatin packing and sequence homogeneity. AhTR2 repeats also contain TTTAGGG motifs and degenerate variants. We suggest that they arose by interspersion of telomeric repeats with subtelomeric repeats, before hybrid unit(s) amplified through the heterochromatic domain. The three repetitive DNA families together occupy approximately 10% of the A. hortensis genome. Comparative analyses of eight Anemone species revealed that the divergence of the A. hortensis genome was accompanied by considerable modification and/or amplification of repeats.
Modeling repetitive, non‐globular proteins

PubMed Central

Basu, Koli; Campbell, Robert L.; Guo, Shuaiqi; Sun, Tianjun

2016-01-01

While ab initio modeling of protein structures is not routine, certain types of proteins are more straightforward to model than others. Proteins with short repetitive sequences typically exhibit repetitive structures. These repetitive sequences can be more amenable to modeling if some information is known about the predominant secondary structure or other key features of the protein sequence. We have successfully built models of a number of repetitive structures with novel folds using knowledge of the consensus sequence within the sequence repeat and an understanding of the likely secondary structures that these may adopt. Our methods for achieving this success are reviewed here. PMID:26914323
DDM1 represses noncoding RNA expression and RNA-directed DNA methylation in heterochromatin.

PubMed

Tan, Feng; Lu, Yue; Jiang, Wei; Zhao, Yu; Wu, Tian; Zhang, Ruoyu; Zhou, Dao-Xiu

2018-05-24

Cytosine methylation of DNA, which occurs at CG, CHG, and CHH (H=A, C, or T) sequences in plants, is a hallmark for epigenetic repression of repetitive sequences. The chromatin remodeling factor DECREASE IN DNA METHYLATION1 (DDM1) is essential for DNA methylation, especially at CG and CHG sequences. However, its potential role in RNA-directed DNA methylation (RdDM) and in chromatin function is not completely understood in rice (Oryza sativa). In this work, we used high-throughput approaches to study the function of rice DDM1 (OsDDM1) in RdDM and the expression of non-coding RNA (ncRNA). We show that loss of function of OsDDM1 results in ectopic CHH methylation of transposable elements and repeats. The ectopic CHH methylation was dependent on rice DOMAINS REARRANGED METHYLTRANSFERASE2 (OsDRM2), a DNA methyltransferase involved in RdDM. Mutations in OsDDM1 lead to decreases of histone H3K9me2 and increases in the levels of heterochromatic small RNA (sRNA) and long noncoding RNA (lncRNA). In particular, OsDDM1 was found to be essential to repress transcription of the two repetitive sequences, Centromeric Retrotransposons of Rice1 (CRR1) and the dominant centromeric CentO repeats. These results suggest that OsDDM1 antagonizes RdDM at heterochromatin and represses tissue-specific expression of ncRNA from repetitive sequences in the rice genome. © 2018 American Society of Plant Biologists. All rights reserved.
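
The three sequence contexts named in the abstract above (CG, CHG, and CHH, with H = A, C, or T) can be told apart from the two bases that follow a cytosine. The sketch below is an illustrative helper only, not part of the authors' analysis: the function name `cytosine_context` is hypothetical, it assumes an unambiguous A/C/G/T sequence read 5' to 3', and it inspects the given strand only.

```python
def cytosine_context(seq, i):
    """Classify the cytosine at index i of a 5'->3' sequence.

    Returns "CG", "CHG" or "CHH", or None when too few downstream
    bases are available to decide. Assumes only A/C/G/T characters.
    """
    seq = seq.upper()
    if seq[i] != "C":
        raise ValueError("position is not a cytosine")
    downstream = seq[i + 1 : i + 3]  # up to two bases after the cytosine
    if downstream.startswith("G"):
        return "CG"                  # the next base alone decides the CG context
    if len(downstream) < 2:
        return None                  # cannot distinguish CHG from CHH
    return "CHG" if downstream[1] == "G" else "CHH"


# Hypothetical example sequences, one per context.
for s in ("CGA", "CAG", "CTT"):
    print(s, cytosine_context(s, 0))
```

A genome-wide tally would apply the same test to every cytosine on both strands; that bookkeeping is omitted here to keep the example short.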
Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kass, D.H.; Batzer, M.A.; Deininger, P.L.

The Alu repetitive family of short interspersed elements (SINEs) in primates can be subdivided into distinct subfamilies by specific diagnostic nucleotide changes. The older subfamilies are generally very abundant, while the younger subfamilies have fewer copies. Some of the youngest Alu elements are absent in the orthologous loci of nonhuman primates, indicative of recent retroposition events, the primary mode of SINE evolution. PCR analysis of one young Alu subfamily (Sb2) member found in the low-density lipoprotein receptor gene apparently revealed the presence of this element in the green monkey, orangutan, gorilla, and chimpanzee genomes, as well as the human genome. However, sequence analysis of these genomes revealed that a highly mutated, older, primate-specific Alu element was present at this position in the nonhuman primates. Comparison of the flanking DNA sequences upstream of this Alu insertion corresponded to evolution expected for standard primate phylogeny, but comparison of the Alu repeat sequences revealed that the human element departed from this phylogeny. The change in the human sequence apparently occurred by a gene conversion event only within the Alu element itself, converting it from one of the oldest to one of the youngest Alu subfamilies. Although gene conversions of Alu elements are clearly very rare, this finding shows that such events can occur and contribute to specific cases of SINE subfamily evolution.
Process of labeling specific chromosomes using recombinant repetitive DNA

DOEpatents

Moyzis, R.K.; Meyne, J.

1988-02-12

Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
Recognition of the CDEI motif GTCACATG by mouse nuclear proteins and interference with the early development of the mouse embryo.

PubMed Central

Blangy, A; Léopold, P; Vidal, F; Rassoulzadegan, M; Cuzin, F

1991-01-01

We have reported previously (1) two unexpected consequences of the microinjection into fertilized mouse eggs of a recombinant plasmid designated p12B1, carrying a 343 bp insert of non-repetitive mouse DNA. Injected at very low concentrations, this plasmid could be established as an extrachromosomal genetic element. When injected in greater concentration, an early arrest of embryonic development resulted. In the present work, we have studied this toxic effect in more detail by microinjecting short synthetic oligonucleotides with sequences from the mouse insert. Lethality was associated with the nucleotide sequence GTCACATG, identical with the CDEI element of yeast centromeres. Development of injected embryos was arrested between the one-cell and the early morula stages, with abnormal structures and DNA contents. Electrophoretic mobility shift and DNAse foot-printing assays demonstrated the binding of mouse nuclear protein(s) to the CDEI-like box. Base changes within the CDEI sequence prevented both the toxic effects in embryos and the formation of protein complex in vitro, suggesting that protein binding at such sites in chromosomal DNA plays an important role in early development. PMID:1766880
Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

PubMed Central

2010-01-01

Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079

An Adapting Auditory-motor Feedback Loop Can Contribute to Generating Vocal Repetition

PubMed Central

Brainard, Michael S.; Jin, Dezhe Z.

2015-01-01

Consecutive repetition of actions is common in behavioral sequences. Although integration of sensory feedback with internal motor programs is important for sequence generation, if and how feedback contributes to repetitive actions is poorly understood. Here we study how auditory feedback contributes to generating repetitive syllable sequences in songbirds. We propose that auditory signals provide positive feedback to ongoing motor commands, but this influence decays as feedback weakens from response adaptation during syllable repetitions. Computational models show that this mechanism explains repeat distributions observed in Bengalese finch song. We experimentally confirmed two predictions of this mechanism in Bengalese finches: removal of auditory feedback by deafening reduces syllable repetitions; and neural responses to auditory playback of repeated syllable sequences gradually adapt in sensory-motor nucleus HVC. Together, our results implicate a positive auditory-feedback loop with adaptation in generating repetitive vocalizations, and suggest sensory adaptation is important for feedback control of motor sequences. PMID:26448054
A pulsed magnetic stress applied to Drosophila melanogaster flies

NASA Astrophysics Data System (ADS)

Delle Side, D.; Bozzetti, M. P.; Friscini, A.; Giuffreda, E.; Nassisi, V.; Specchia, V.; Velardi, L.

2014-04-01

We report the development of a system to feed pulsed magnetic stress to biological samples. The device is based on an RLC circuit that transforms the energy stored in a high voltage capacitor into a magnetic field inside a coil. The field has been characterized and we found that charging the capacitor with 24 kV results in a peak field of 0.4 T. In order to test its effect, we applied such a stress to the Drosophila melanogaster model and we examined its bio-effects. We analysed, in the germ cells, the effects on the control of specific DNA repetitive sequences that are activated after different environmental stresses. The deregulation of these sequences causes genomic instability and chromosome breaks leading to sterility. The magnetic field treatment did not produce effects on repetitive sequences in the germ cells of Drosophila. Hence, this field does not produce deleterious effects linked to derepression of repetitive sequences.
Tandem Repeat Proteins Inspired By Squid Ring Teeth

NASA Astrophysics Data System (ADS)

Pena-Francesch, Abdon

Proteins are large biomolecules consisting of long chains of amino acids that hierarchically assemble into complex structures, and provide a variety of building blocks for biological materials. The repetition of structural building blocks is a natural evolutionary strategy for increasing the complexity and stability of protein structures. However, the relationship between amino acid sequence, structure, and material properties of protein systems remains unclear due to the lack of control over the protein sequence and the intricacies of the assembly process. In order to investigate the repetition of protein building blocks, a recently discovered protein from squids is examined as an ideal protein system. Squid ring teeth are predatory appendages located inside the suction cups that provide a strong grasp of prey, and are solely composed of a group of proteins with tandem repetition of building blocks. The objective of this thesis is to understand, for the first time, the relationship between sequence, structure, and properties in repetitive protein materials inspired by squid ring teeth. Specifically, this work focuses on squid-inspired structural proteins with tandem repeat units in their sequence (i.e., repetition of alternating building blocks) that are physically cross-linked via beta-sheet structures. The research work presented here tests the hypothesis that, in these systems, increasing the number of building blocks in the polypeptide chain decreases the protein network defects and improves the material properties. Hence, the sequence, nanostructure, and properties (thermal, mechanical, and conducting) of tandem repeat squid-inspired protein materials are examined. Spectroscopic structural analysis, advanced materials characterization, and entropic elasticity theory are combined to elucidate the structure and material properties of these repetitive proteins. 
This approach is applied not only to native squid proteins but also to squid-inspired synthetic polypeptides that allow for a fine control of the sequence and network morphology. The results provided in this work establish a clear dependence between the repetitive building blocks, the network morphology, and the properties of squid-inspired repetitive protein materials. Increasing the number of tandem repeat units in SRT-inspired proteins led to more effective protein networks with superior properties. Through increasing tandem repetition and optimization of network morphology, highly efficient protein materials capable of withstanding deformations up to 400% of their original length, with MPa-GPa modulus, high energy absorption (50 MJ m⁻³), peak proton conductivity of 3.7 mS cm⁻¹ (at pH 7, highest reported to date for biological materials), and peak thermal conductivity of 1.4 W m⁻¹ K⁻¹ (which exceeds that of most polymer materials) were developed. These findings introduce new design rules in the engineering of proteins based on tandem repetition and morphology control, and provide a novel framework for tailoring and optimizing the properties of protein-based materials.
Transposon Insertion Finder (TIF): a novel program for detection of de novo transpositions of transposable elements.

PubMed

Nakagome, Mariko; Solovieva, Elena; Takahashi, Akira; Yasue, Hiroshi; Hirochika, Hirohiko; Miyao, Akio

2014-03-14

Detecting transposition events of transposable elements (TEs) in a genome using short reads from next-generation sequencing (NGS) has been difficult, because the nucleotide sequence of the TE itself is repetitive, which makes it hard for NGS alignment programs to locate its insertions. We have developed a program with a new algorithm to detect transpositions from NGS data. In the process of tool development, we used NGS data of derivative lines (ttm2 and ttm5) of japonica rice cv. Nipponbare, regenerated through cell culture. The new program, called Transposon Insertion Finder (TIF), was applied to detect the de novo transpositions of Tos17 in the regenerated lines. TIF searched 300 million reads of a line within 20 min, identifying 4 and 12 de novo transpositions in the ttm2 and ttm5 lines, respectively. All of the transpositions were confirmed by PCR/electrophoresis and sequencing. Using the program, we also detected new transposon insertions of P-element from NGS data of Drosophila melanogaster. TIF can find transpositions of any element, provided that target site duplications (TSDs) are generated by its transposition.
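The reliance on target site duplications can be illustrated with a small sketch. This is not the TIF program itself and makes no claim about its internals: it only shows one plausible way to pair reads that cross the two TE-genome junctions by requiring that they share the same TSD. The function names, the TE end sequences, the TSD length, and the toy reads are all hypothetical.

```python
from collections import defaultdict

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_insertions(reads, te_head, te_tail, tsd_len):
    """Pair junction reads that share a target site duplication (TSD).

    A read over the 5' junction looks like  flank + TSD + te_head...
    A read over the 3' junction looks like  ...te_tail + TSD + flank
    An insertion is reported only when the same TSD is seen on both sides.
    """
    upstream = defaultdict(set)    # TSD -> genomic sequence left of the TE
    downstream = defaultdict(set)  # TSD -> genomic sequence right of the TE
    for read in reads:
        # Reads may come from either strand, so check both orientations.
        for seq in (read, revcomp(read)):
            i = seq.find(te_head)
            if i >= tsd_len:
                upstream[seq[i - tsd_len:i]].add(seq[:i])
            j = seq.find(te_tail)
            if j != -1:
                start = j + len(te_tail)
                if len(seq) - start >= tsd_len:
                    downstream[seq[start:start + tsd_len]].add(seq[start:])
    return {
        tsd: (sorted(upstream[tsd]), sorted(downstream[tsd]))
        for tsd in upstream
        if tsd in downstream
    }


# Toy data: hypothetical TE ends and a hypothetical 5-bp TSD ("GATTC").
TE_HEAD = "GGATCCTTAG"
TE_TAIL = "CTAAGTCGAC"
toy_reads = [
    "ACGTTGCAGATTC" + TE_HEAD,   # crosses the 5' junction
    TE_TAIL + "GATTCTTGACCAA",   # crosses the 3' junction
    "ACGTACGTACGTACGTACGT",      # unrelated read
]
for tsd, (left, right) in find_insertions(toy_reads, TE_HEAD, TE_TAIL, 5).items():
    print(tsd, left, right)
```

Working directly on read text rather than on alignments sidesteps the ambiguity of mapping the repetitive TE body, at the cost of requiring exact matches to the chosen end sequences in this simplified form.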
Typing Clostridium difficile strains based on tandem repeat sequences

PubMed Central

2009-01-01

Background Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124
Genome-wide mapping of autonomous promoter activity in human cells

PubMed Central

van Arensbergen, Joris; FitzPatrick, Vincent D.; de Haas, Marcel; Pagie, Ludo; Sluimer, Jasper; Bussemaker, Harmen J.; van Steensel, Bas

2017-01-01

Previous methods to systematically characterize sequence-intrinsic activity of promoters have been limited by relatively low throughput and the length of sequences that could be tested. Here we present Survey of Regulatory Elements (SuRE), a method to assay more than 10⁸ DNA fragments, each 0.2–2 kb in size, for their ability to drive transcription autonomously. In SuRE, a plasmid library is constructed of random genomic fragments upstream of a 20-bp barcode and decoded by paired-end sequencing. This library is then transfected into cells and transcribed barcodes are quantified in the RNA by high throughput sequencing. When applied to the human genome, we achieved a 55-fold genome coverage, allowing us to map autonomous promoter activity genome-wide. By computational modeling we delineated subregions within promoters that are relevant for their activity. For instance, we show that antisense promoter transcription is generally dependent on the sense core promoter sequences, and that most enhancers and several families of repetitive elements act as autonomous transcription initiation sites. PMID:28024146
Two cis elements collaborate to spatially repress transcription from a sea urchin promoter

NASA Technical Reports Server (NTRS)

Frudakis, T. N.; Wilt, F.

1995-01-01

The expression pattern of many territory-specific genes in metazoan embryos is maintained by an active process of negative spatial regulation. However, the mechanism of this strategy of gene regulation is not well understood in any system. Here we show that reporter constructs containing regulatory sequence for the SM30-alpha gene of Strongylocentrotus purpuratus are expressed in a pattern congruent with that of the endogenous SM30 gene(s), largely as a result of active transcriptional repression in cell lineages in which the gene is not normally expressed. Chloramphenicol acetyl transferase assays of deletion constructs from the 2600-bp upstream region showed that repressive elements were present in the region from -1628 to -300. In situ hybridization analysis showed that the spatial fidelity of expression was severely compromised when the region from -1628 to -300 was deleted. Two highly repetitive sequence motifs, (G/A/C)CCCCT and (T/C)(T/A/C)CTTTT(T/A/C), are present in the -1628 to -300 region. Representatives of these elements were analyzed by gel mobility shift experiments and were found to interact specifically with protein in crude nuclear extracts. When oligonucleotides containing either sequence element were co-injected with a correctly regulated reporter as potential competitors, the reporter was expressed in inappropriate cells. When composite oligonucleotides, containing both sequence elements, were fused to a misregulated reporter, the expression of the reporter in inappropriate cells was suppressed. Comparison of composite oligonucleotides with oligonucleotides containing single constituent elements shows that both sequence elements are required for effective spatial regulation. Thus, both individual elements are required, but only a composite element containing both elements is sufficient to function as a tissue-specific repressive element.
Striking structural dynamism and nucleotide sequence variation of the transposon Galileo in the genome of Drosophila mojavensis

PubMed Central

2013-01-01

Background Galileo is a transposable element responsible for the generation of three chromosomal inversions in natural populations of Drosophila buzzatii. Although the most characteristic feature of Galileo is the long internally-repetitive terminal inverted repeats (TIRs), which resemble the Drosophila Foldback element, its transposase-coding sequence has led to its classification as a member of the P-element superfamily (Class II, subclass 1, TIR order). Furthermore, Galileo has a wide distribution in the genus Drosophila, since it has been found in 6 of the 12 Drosophila sequenced genomes. Among these species, D. mojavensis, the one closest to D. buzzatii, presented the highest diversity in sequence and structure of Galileo elements. Results In the present work, we carried out a thorough search and annotation of all the Galileo copies present in the D. mojavensis sequenced genome. In our set of 170 Galileo copies we have detected 5 Galileo subfamilies (C, D, E, F, and X) with different structures ranging from nearly complete, to only 2 TIR or solo TIR copies. Finally, we have explored the structural and length variation of the Galileo copies that point out the relatively frequent rearrangements within and between Galileo elements. Different mechanisms responsible for these rearrangements are discussed. Conclusions Although Galileo is a transposable element with an ancient history in the D. mojavensis genome, our data indicate a recent transpositional activity. Furthermore, the dynamism in sequence and structure, mainly affecting the TIRs, suggests an active exchange of sequences among the copies. This exchange could lead to new subfamilies of the transposon, which could be crucial for the long-term survival of the element in the genome. PMID:23374229
Striking structural dynamism and nucleotide sequence variation of the transposon Galileo in the genome of Drosophila mojavensis.

PubMed

Marzo, Mar; Bello, Xabier; Puig, Marta; Maside, Xulio; Ruiz, Alfredo

2013-02-04

Galileo is a transposable element responsible for the generation of three chromosomal inversions in natural populations of Drosophila buzzatii. Although the most characteristic feature of Galileo is the long internally-repetitive terminal inverted repeats (TIRs), which resemble the Drosophila Foldback element, its transposase-coding sequence has led to its classification as a member of the P-element superfamily (Class II, subclass 1, TIR order). Furthermore, Galileo has a wide distribution in the genus Drosophila, since it has been found in 6 of the 12 Drosophila sequenced genomes. Among these species, D. mojavensis, the one closest to D. buzzatii, presented the highest diversity in sequence and structure of Galileo elements. In the present work, we carried out a thorough search and annotation of all the Galileo copies present in the D. mojavensis sequenced genome. In our set of 170 Galileo copies we have detected 5 Galileo subfamilies (C, D, E, F, and X) with different structures ranging from nearly complete, to only 2 TIR or solo TIR copies. Finally, we have explored the structural and length variation of the Galileo copies that point out the relatively frequent rearrangements within and between Galileo elements. Different mechanisms responsible for these rearrangements are discussed. Although Galileo is a transposable element with an ancient history in the D. mojavensis genome, our data indicate a recent transpositional activity. Furthermore, the dynamism in sequence and structure, mainly affecting the TIRs, suggests an active exchange of sequences among the copies. This exchange could lead to new subfamilies of the transposon, which could be crucial for the long-term survival of the element in the genome.
The Release 6 reference sequence of the Drosophila melanogaster genome

DOE PAGES

Hoskins, Roger A.; Carlson, Joseph W.; Wan, Kenneth H.; ...

2015-01-14

Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. In conclusion, further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.
The Release 6 reference sequence of the Drosophila melanogaster genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hoskins, Roger A.; Carlson, Joseph W.; Wan, Kenneth H.

Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. In conclusion, further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.
The Evolution of Mobile DNAs: When Will Transposons Create Phylogenies That Look As If There Is a Master Gene?

PubMed Central

Brookfield, John F. Y.; Johnson, Louise J.

2006-01-01

Some families of mammalian interspersed repetitive DNA, such as the Alu SINE sequence, appear to have evolved by the serial replacement of one active sequence with another, consistent with there being a single source of transposition: the “master gene.” Alternative models, in which multiple source sequences are simultaneously active, have been called “transposon models.” Transposon models differ in the proportion of elements that are active and in whether inactivation occurs at the moment of transposition or later. Here we examine the predictions of various types of transposon model regarding the patterns of sequence variation expected at an equilibrium between transposition, inactivation, and deletion. Under the master gene model, all bifurcations in the true tree of elements occur in a single lineage. We show that this property will also hold approximately for transposon models in which most elements are inactive and where at least some of the inactivation events occur after transposition. Such tree shapes are therefore not conclusive evidence for a single source of transposition. PMID:16790583
High copy number of highly similar mariner-like transposons in planarian (Platyhelminthe): evidence for a trans-phyla horizontal transfer.

PubMed

Garcia-Fernàndez, J; Bayascas-Ramírez, J R; Marfany, G; Muñoz-Mármol, A M; Casali, A; Baguñà, J; Saló, E

1995-05-01

Several DNA sequences similar to the mariner element were isolated and characterized in the platyhelminthe Dugesia (Girardia) tigrina. They were 1,288 bp long, flanked by two 32 bp-inverted repeats, and contained a single 339 amino acid open-reading frame (ORF) encoding the transposase. The number of copies of this element is approximately 8,000 per haploid genome, constituting a member of the middle-repetitive DNA of Dugesia tigrina. Sequence analysis of several elements showed a high percentage of conservation between the different copies. Most of them presented an intact ORF and the standard signals of actively expressed genes, which suggests that some of them are or have recently been functional transposons. The high degree of similarity shared with other mariner elements from some arthropods, together with the fact that this element is undetectable in other planarian species, strongly suggests a case of horizontal transfer between these two distant phyla.
Control method of Three-phase Four-leg converter based on repetitive control

NASA Astrophysics Data System (ADS)

Hui, Wang

2018-03-01

This research takes the magnetic levitation force of a wind power generation system as its object. To address the power quality problems caused by unbalanced loads in the power supply system, we combined the characteristics of the magnetic levitation wind power generation system with the repetitive control principle and proposed an independent control strategy for a three-phase four-leg converter. Based on the symmetric component method, a second-order generalized integrator was used to generate the positive- and negative-sequence signals, and decoupling control was carried out in the synchronous rotating reference frame, in which the positive- and negative-sequence voltages are regulated by PI double closed loops, and a PI regulator with repetitive control was introduced to eliminate the static error associated with the fundamental-frequency fluctuation characteristic of the zero-sequence component. Simulation results based on Matlab/Simulink show that the proposed control scheme can effectively suppress the disturbance caused by unbalanced loads and maintain the load voltage balance. The scheme is easy to implement and markedly improves the quality of the independent power supply system.
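The symmetric component method mentioned above splits an unbalanced three-phase set into zero-, positive-, and negative-sequence parts. The sketch below shows only the textbook phasor-domain (Fortescue) transform, as a reminder of what those three components are. It does not reproduce the second-order generalized integrator, which extracts the sequences from time-domain signals, and it is not the paper's Matlab/Simulink model. The function name and example phasors are illustrative assumptions.

```python
import cmath
import math

# Rotation operator a = 1 at an angle of 120 degrees.
A = cmath.exp(2j * math.pi / 3)


def symmetrical_components(va: complex, vb: complex, vc: complex):
    """Return (zero, positive, negative) sequence phasors, referred to phase a."""
    zero = (va + vb + vc) / 3
    positive = (va + A * vb + A**2 * vc) / 3
    negative = (va + A**2 * vb + A * vc) / 3
    return zero, positive, negative


# Balanced a-b-c set: phase b lags phase a by 120 degrees, phase c leads by 120.
balanced = (1 + 0j, A**2, A)
# Unbalanced case: only phase a carries voltage.
single_phase = (1 + 0j, 0j, 0j)

for name, phasors in (("balanced", balanced), ("single-phase", single_phase)):
    v0, v1, v2 = symmetrical_components(*phasors)
    print(f"{name}: |V0|={abs(v0):.3f} |V1|={abs(v1):.3f} |V2|={abs(v2):.3f}")
```

A balanced set contains only the positive-sequence component, whereas the single-phase case splits equally into all three. A nonzero zero-sequence component of this kind is what the fourth leg of the converter gives a path for.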
Gene space and transcriptome assemblies of leafy spurge (Euphorbia esula) identify promoter sequences, repetitive elements, high-quality markers, and a full-length chloroplast genome

USDA-ARS's Scientific Manuscript database

Leafy spurge is an invasive perennial weed infesting range and recreational lands of North America. Previous research and omics projects with leafy spurge have helped develop it as a model for studying numerous aspects of perennial plant development and response to abiotic stress. However, the lack ...
Deep Investigation of Arabidopsis thaliana Junk DNA Reveals a Continuum between Repetitive Elements and Genomic Dark Matter

PubMed Central

Maumus, Florian; Quesneville, Hadi

2014-01-01

Eukaryotic genomes contain highly variable amounts of DNA with no apparent function. This so-called junk DNA is composed of two components: repeated and repeat-derived sequences (together referred to as the repeatome), and non-annotated sequences also known as genomic dark matter. Because of their high duplication rates as compared to other genomic features, transposable elements are predominant contributors to the repeatome, and the products of their decay are thought to be a major source of genomic dark matter. Determining the origin and composition of junk DNA is thus important for understanding genome evolution as well as host biology. In this study, we used a combination of tools to show that the repeatome of the small and shrinking A. thaliana genome is significantly larger than previously thought. Furthermore, we present the concepts and results from a series of innovative approaches suggesting that a significant amount of the A. thaliana dark matter is of repetitive origin. As a tentative standard for the community, we propose a deep compendium annotation of the A. thaliana repeatome that may help to further address genome evolution as well as transcriptional and epigenetic regulation in this model plant. PMID:24709859
A multigenerational family study of oral and hand motor sequencing ability provides evidence for a familial speech sound disorder subtype

PubMed Central

Peter, Beate; Raskind, Wendy H.

2011-01-01

Purpose To evaluate phenotypic expressions of speech sound disorder (SSD) in multigenerational families with evidence of familial forms of SSD. Method Members of five multigenerational families (N = 36) produced rapid sequences of monosyllables and disyllables and tapped computer keys with repetitive and alternating movements. Results Measures of repetitive and alternating motor speed were correlated within and between the two motor systems. Repetitive and alternating motor speeds increased in children and decreased in adults as a function of age. In two families with children who had severe speech deficits consistent with disrupted praxis, slowed alternating, but not repetitive, oral movements characterized most of the affected children and adults with a history of SSD, and slowed alternating hand movements were seen in some of the biologically related participants as well. Conclusion Results are consistent with a familial motor-based SSD subtype with incomplete penetrance, motivating new clinical questions about motor-based intervention not only in the oral but also the limb system. PMID:21909176
Chromosome-based survey sequencing reveals the genome organization of wild wheat progenitor Triticum dicoccoides.

PubMed

Akpinar, Bala Ani; Biyiklioglu, Sezgi; Alptekin, Burcu; Havránková, Miroslava; Vrána, Jan; Doležel, Jaroslav; Distelfeld, Assaf; Hernandez, Pilar; Budak, Hikmet

2018-05-04

Wild emmer wheat (Triticum turgidum ssp. dicoccoides) is the progenitor of wheat. We performed chromosome-based survey sequencing of the 14 chromosomes, examining repetitive sequences, protein-coding genes, miRNA/target pairs and tRNA genes, as well as syntenic relationships with related grasses. We found considerable differences in the content and distribution of repetitive sequences between the A and B subgenomes. The gene contents of individual chromosomes varied widely, not necessarily correlating with chromosome size. We catalogued candidate agronomically important loci, along with new alleles and flanking sequences that can be used to design exome sequencing. Syntenic relationships and virtual gene orders revealed several small-scale evolutionary rearrangements, in addition to providing evidence for the 4AL-5AL-7BS translocation in wild emmer wheat. Chromosome-based sequence assemblies contained five novel miRNA families, among 59 families putatively encoded in the entire genome which provide insight into the domestication of wheat and an overview of the genome content and organization. This article is protected by copyright. All rights reserved.
Chromatin and RNAi factors protect the C. elegans germline against repetitive sequences

PubMed Central

Robert, Valérie J.P.; Sijen, Titia; van Wolfswinkel, Josien; Plasterk, Ronald H.A.

2005-01-01

Protection of genomes against invasion by repetitive sequences, such as transposons, viruses, and repetitive transgenes, involves strong and selective silencing of these sequences. During silencing of repetitive transgenes, a trans effect (“cosuppression”) occurs that results in silencing of cognate endogenous genes. Here we report RNA interference (RNAi) screens performed to catalog genes required for cosuppression in the Caenorhabditis elegans germline. We find factors with a putative role in chromatin remodeling and factors involved in RNAi. Together with molecular data also presented in this study, these results suggest that in C. elegans repetitive sequences trigger transcriptional gene silencing using RNAi and chromatin factors. PMID:15774721
A whole-genome, radiation hybrid map of wheat

USDA-ARS's Scientific Manuscript database

Generating a reference sequence of bread wheat (Triticum aestivum L.) is a challenging task because of its large, highly repetitive and allopolyploid genome. Ordering of BAC- and NGS-based contigs in ongoing wheat genome-sequencing projects primarily uses recombination and comparative genomics-base...

Construction of a small Mus musculus repetitive DNA library: identification of a new satellite sequence in Mus musculus.

PubMed Central

Pietras, D F; Bennett, K L; Siracusa, L D; Woodworth-Gutai, M; Chapman, V M; Gross, K W; Kane-Haas, C; Hastie, N D

1983-01-01

We report the construction of a small library of recombinant plasmids containing Mus musculus repetitive DNA inserts. The repetitive cloned fraction was derived from denatured genomic DNA by reassociation to a Cot value at which repetitive, but not unique, sequences have reannealed, followed by exhaustive S1 nuclease treatment to degrade single-stranded DNA. Initial characterizations of this library by colony filter hybridizations have led to the identification of a previously undetected M. musculus minor satellite as well as to clones containing M. musculus major satellite sequences. This new satellite is repeated 10-20 times less than the major satellite in the M. musculus genome. It has a repeat length of 130 nucleotides compared with the M. musculus major satellite with a repeat length of 234 nucleotides. Sequence analysis of the minor satellite has shown that it has a 29 base pair region with extensive homology to one of the major satellite repeating subunits. We also show by in situ hybridization that this minor satellite sequence is located at the centromeres and possibly the arms of at least half the M. musculus chromosomes. Sequences related to the minor satellite have been found in the DNA of a related Mus species, Mus spretus, and may represent the major satellite of that species. PMID:6314268
Sequencing of the sea lamprey (Petromyzon marinus) genome provides insights into vertebrate evolution

PubMed Central

Smith, Jeramiah J; Kuraku, Shigehiro; Holt, Carson; Sauka-Spengler, Tatjana; Jiang, Ning; Campbell, Michael S; Yandell, Mark D; Manousaki, Tereza; Meyer, Axel; Bloom, Ona E; Morgan, Jennifer R; Buxbaum, Joseph D; Sachidanandam, Ravi; Sims, Carrie; Garruss, Alexander S; Cook, Malcolm; Krumlauf, Robb; Wiedemann, Leanne M; Sower, Stacia A; Decatur, Wayne A; Hall, Jeffrey A; Amemiya, Chris T; Saha, Nil R; Buckley, Katherine M; Rast, Jonathan P; Das, Sabyasachi; Hirano, Masayuki; McCurley, Nathanael; Guo, Peng; Rohner, Nicolas; Tabin, Clifford J; Piccinelli, Paul; Elgar, Greg; Ruffier, Magali; Aken, Bronwen L; Searle, Stephen MJ; Muffato, Matthieu; Pignatelli, Miguel; Herrero, Javier; Jones, Matthew; Brown, C Titus; Chung-Davidson, Yu-Wen; Nanlohy, Kaben G; Libants, Scot V; Yeh, Chu-Yin; McCauley, David W; Langeland, James A; Pancer, Zeev; Fritzsch, Bernd; de Jong, Pieter J; Zhu, Baoli; Fulton, Lucinda L; Theising, Brenda; Flicek, Paul; Bronner, Marianne E; Warren, Wesley C; Clifton, Sandra W; Wilson, Richard K; Li, Weiming

2013-01-01

Lampreys are representatives of an ancient vertebrate lineage that diverged from our own ~500 million years ago. By virtue of this deeply shared ancestry, the sea lamprey (P. marinus) genome is uniquely poised to provide insight into the ancestry of vertebrate genomes and the underlying principles of vertebrate biology. Here, we present the first lamprey whole-genome sequence and assembly. We note challenges faced owing to its high content of repetitive elements and GC bases, as well as the absence of broad-scale sequence information from closely related species. Analyses of the assembly indicate that two whole-genome duplications likely occurred before the divergence of ancestral lamprey and gnathostome lineages. Moreover, the results help define key evolutionary events within vertebrate lineages, including the origin of myelin-associated proteins and the development of appendages. The lamprey genome provides an important resource for reconstructing vertebrate origins and the evolutionary events that have shaped the genomes of extant organisms. PMID:23435085
Chromosomal Mapping of Repetitive DNAs in Myiopsitta monachus and Amazona aestiva (Psittaciformes, Psittacidae) with Emphasis on the Sex Chromosomes.

PubMed

de Oliveira Furo, Ivanete; Kretschmer, Rafael; Dos Santos, Michelly S; de Lima Carvalho, Carlos A; Gunski, Ricardo J; O'Brien, Patrícia C M; Ferguson-Smith, Malcolm A; Cioffi, Marcelo B; de Oliveira, Edivaldo H C

2017-01-01

Here, for the first time, we describe the karyotype of Myiopsitta monachus (Psittacidae, Arini). We found 2n = 48, corresponding to the lowest diploid number observed in Neotropical Psittaciformes so far, with an uncommonly large W chromosome homomorphic to the Z. In order to better understand the evolution of the sex chromosomes in this species, we applied several molecular cytogenetic approaches, including C-banding, FISH mapping of repetitive DNAs (several microsatellite repeats), and whole-chromosome painting on metaphases of M. monachus. For comparison, another species belonging to the same tribe but with a smaller W chromosome (A. aestiva) was also analyzed. The results show that the constitutive heterochromatin has a very diverse distribution pattern in these species revealing heterochromatic blocks in the centromeric region of all chromosomes and in most of the length of the W chromosome in A. aestiva, while in M. monachus they were found in interstitial and telomeric regions. Concerning the microsatellites, only the sequence (CG)n produced signals on the W chromosome of A. aestiva, in the distal region of both arms. However, in M. monachus, (CAA)n, (CAG)n, and (CG)n probes were accumulated on the W chromosome, and, in addition, the sequence (CAG)n also hybridized to heterochromatic regions in macrochromosomes, as well as in microchromosomes. Based on these results, we suggest that the increase in length of the W chromosome in M. monachus is due to the amplification of repetitive elements, which highlights their significant role in the evolutionary process of sex chromosome differentiation. © 2017 S. Karger AG, Basel.
Eukaryotic gene regulation by targeted chromatin re-modeling at dispersed, middle-repetitive sequence elements.

PubMed

Hodgetts, Ross

2004-12-01

RNA interference might have evolved to minimize the deleterious impact of transposable elements and viruses on eukaryotic genomes, because mutations in genes within the RNAi pathway cause mobilization of transposons in nematodes and flies. Although the first examples of RNAi involved post-transcriptional gene silencing, recently the pathway has been shown to act at the transcriptional level. It does so by establishing a chromatin configuration on the target DNA that has many of the hallmarks of heterochromatin, thus preventing its transcription. Members of dispersed, repeated sequence families appear to have been utilized by the RNAi machinery to regulate nearby genes in yeast. The unusual genomic distribution of three repeated element families in the chicken, fruit-fly and nematode genomes prompts speculation that some of these repeats have been co-opted to control gene expression, either locally or over extended chromosomal domains.
Improved PCR-Based Detection of Soil Transmitted Helminth Infections Using a Next-Generation Sequencing Approach to Assay Design.

PubMed

Pilotte, Nils; Papaiakovou, Marina; Grant, Jessica R; Bierwert, Lou Ann; Llewellyn, Stacey; McCarthy, James S; Williams, Steven A

2016-03-01

The soil transmitted helminths are a group of parasitic worms responsible for extensive morbidity in many of the world's most economically depressed locations. With growing emphasis on disease mapping and eradication, the availability of accurate and cost-effective diagnostic measures is of paramount importance to global control and elimination efforts. While real-time PCR-based molecular detection assays have shown great promise, to date, these assays have utilized sub-optimal targets. By performing next-generation sequencing-based repeat analyses, we have identified high copy-number, non-coding DNA sequences from a series of soil transmitted pathogens. We have used these repetitive DNA elements as targets in the development of novel, multi-parallel, PCR-based diagnostic assays. Utilizing next-generation sequencing and the Galaxy-based RepeatExplorer web server, we performed repeat DNA analysis on five species of soil transmitted helminths (Necator americanus, Ancylostoma duodenale, Trichuris trichiura, Ascaris lumbricoides, and Strongyloides stercoralis). Employing high copy-number, non-coding repeat DNA sequences as targets, novel real-time PCR assays were designed, and assays were tested against established molecular detection methods. Each assay provided consistent detection of genomic DNA at quantities of 2 fg or less, demonstrated species-specificity, and showed an improved limit of detection over the existing, proven PCR-based assay. The utilization of next-generation sequencing-based repeat DNA analysis methodologies for the identification of molecular diagnostic targets has the ability to improve assay species-specificity and limits of detection. By exploiting such high copy-number repeat sequences, the assays described here will facilitate soil transmitted helminth diagnostic efforts. We recommend similar analyses when designing PCR-based diagnostic tests for the detection of other eukaryotic pathogens.
DNABIT Compress - Genome compression algorithm.

PubMed

Rajarajeswari, Pothuraju; Apparao, Allam

2011-01-22

Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, "DNABIT Compress", for DNA sequences, based on a novel scheme that assigns binary bits to smaller segments of DNA bases in order to compress both repetitive and non-repetitive DNA sequences. Our proposed algorithm achieves the best compression ratio for DNA sequences of larger genomes. The significantly better compression results show that the "DNABIT Compress" algorithm is the best among the other compression algorithms. While achieving the best compression ratios for DNA sequences (genomes), our new DNABIT Compress algorithm significantly improves on the running time of all previous DNA compression programs. Assigning binary bits (a unique bit code) to exact-repeat and reverse-repeat fragments of a DNA sequence is also a concept introduced in this algorithm for the first time in DNA compression. The proposed algorithm achieves a compression ratio as low as 1.58 bits/base, whereas the best existing methods could not achieve a ratio below 1.72 bits/base.
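For context on the bits-per-base figures quoted in this record, the sketch below shows the plain fixed-width baseline: each of the four bases gets a 2-bit code, so the cost is 2 bits per base whether or not the sequence is repetitive. This is not the DNABIT Compress algorithm, whose repeat-specific bit codes are not described in the abstract; the names `CODES`, `pack`, `unpack` and `bits_per_base` are hypothetical and chosen only for illustration.

```python
# Plain 2-bit packing of DNA bases (illustrative baseline only;
# this is NOT the DNABIT Compress scheme from the abstract).
CODES = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
BASES = "ACGT"


def pack(seq: str) -> bytes:
    """Pack an A/C/G/T string into bytes, four bases per byte."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODES[base]
        # Left-align a short final chunk so decoding reads from the top bits.
        byte <<= 2 * (4 - len(chunk))
        out.append(byte)
    return bytes(out)


def unpack(data: bytes, length: int) -> str:
    """Recover the first `length` bases from packed bytes."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASES[(byte >> shift) & 0b11])
    return "".join(bases[:length])


def bits_per_base(seq: str) -> float:
    """Storage cost of the packed form, in bits per base."""
    return 8 * len(pack(seq)) / len(seq)
```

Any scheme reporting fewer than 2 bits per base, as the abstract does, must therefore be exploiting redundancy such as repeats rather than only shortening the per-base code.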
A Short Interspersed Nuclear Element (SINE)-Based Real-Time PCR Approach to Detect and Quantify Porcine Component in Meat Products.

PubMed

Zhang, Chi; Fang, Xin; Qiu, Haopu; Li, Ning

2015-01-01

Real-time PCR amplification of mitochondrial genes cannot be used for DNA quantification, and amplification of single-copy DNA does not provide ideal sensitivity. Moreover, cross-reactions among similar species are commonly observed in published methods that amplify repetitive sequences, which has hindered their further application. The purpose of this study was to establish a short interspersed nuclear element (SINE)-based real-time PCR approach with high specificity for species detection that could also be used in DNA quantification. After extensive screening of candidate Sus scrofa SINEs, one optimal combination of primers and probe was selected, which showed no cross-reaction with other common meat species. The limit of detection (LOD) of the method was 44 fg DNA/reaction. Further, quantification tests showed that this approach was practical for DNA estimation without tissue variance. Thus, this study provides a new tool for qualitative detection of porcine components, which could be promising in the quality control (QC) of meat products.
TEcandidates: Prediction of genomic origin of expressed Transposable Elements using RNA-seq data.

PubMed

Valdebenito-Maturana, Braulio; Riadi, Gonzalo

2018-06-01

In recent years, Transposable Elements (TEs) have been related to gene regulation. However, estimating the origin of expression of TEs through RNA-seq is complicated by multimapping reads coming from their repetitive sequences. Current approaches that address multimapping reads are focused on expression quantification and not on finding the origin of expression. Addressing the genomic origin of expressed TEs could further aid in understanding the role that TEs might have in the cell. We have developed a new pipeline called TEcandidates, based on de novo transcriptome assembly, to assess the instances of TEs being expressed, along with their location, to include in downstream differential expression (DE) analysis. TEcandidates takes as input the RNA-seq data, the genome sequence and the TE annotation file, and returns a list of coordinates of candidate TEs being expressed, the TEs that have been removed, and the genome sequence with removed TEs masked. This masked genome is suited to include TEs in downstream expression analysis, as the ambiguity of reads coming from TEs is significantly reduced in the mapping step of the analysis. The script which runs the pipeline can be downloaded at http://www.mobilomics.org/tecandidates/downloads or http://github.com/TEcandidates/TEcandidates. Contact: griadi@utalca.cl. Supplementary data are available at Bioinformatics online.
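The masking step mentioned in this record can be illustrated with a minimal sketch: given a sequence and a list of coordinates, replace the covered bases with `N` so that reads no longer align there. This is not the TEcandidates code; the function name `mask_intervals`, the 0-based half-open coordinate convention, and the choice of `N` as the mask character are all assumptions made for illustration.

```python
def mask_intervals(sequence: str, intervals, mask_char: str = "N") -> str:
    """Return `sequence` with every [start, end) interval replaced by mask_char.

    Coordinates are assumed to be 0-based and half-open (BED-style);
    a real annotation file may use a different convention.
    """
    masked = list(sequence)
    for start, end in intervals:
        # Clamp to the sequence bounds so stray coordinates cannot raise.
        start = max(start, 0)
        end = min(end, len(masked))
        for i in range(start, end):
            masked[i] = mask_char
    return "".join(masked)
```

For example, `mask_intervals("ACGTACGTAC", [(2, 5)])` hides positions 2 to 4 and leaves the rest of the sequence available for read mapping.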
A Bioinformatics Approach for Detecting Repetitive Nested Motifs using Pattern Matching.

PubMed

Romero, José R; Carballido, Jessica A; Garbus, Ingrid; Echenique, Viviana C; Ponzoni, Ignacio

2016-01-01

The identification of nested motifs in genomic sequences is a complex computational problem. The detection of these patterns is important to allow the discovery of transposable element (TE) insertions, incomplete reverse transcripts, deletions, and/or mutations. In this study, a de novo strategy for detecting patterns that represent nested motifs was designed based on exhaustive searches for pairs of motifs and combinatorial pattern analysis. These patterns can be grouped into three categories: motifs within other motifs, motifs flanked by other motifs, and motifs of large size. The methodology used in this study, applied to genomic sequences from the plant species Aegilops tauschii and Oryza sativa, revealed that it is possible to identify putative nested TEs by detecting these three types of patterns. The results were validated through BLAST alignments, which revealed the efficacy and usefulness of the new method, which is called Mamushka.
AlignerBoost: A Generalized Software Toolkit for Boosting Next-Gen Sequencing Mapping Accuracy Using a Bayesian-Based Mapping Quality Framework.

PubMed

Zheng, Qi; Grice, Elizabeth A

2016-10-01

Accurate mapping of next-generation sequencing (NGS) reads to reference genomes is crucial for almost all NGS applications and downstream analyses. Various repetitive elements in human and other higher eukaryotic genomes contribute in large part to ambiguously (non-uniquely) mapped reads. Most available NGS aligners attempt to address this by either removing all non-uniquely mapping reads, or reporting one random or "best" hit based on simple heuristics. Accurate estimation of the mapping quality of NGS reads is therefore critical albeit completely lacking at present. Here we developed a generalized software toolkit "AlignerBoost", which utilizes a Bayesian-based framework to accurately estimate mapping quality of ambiguously mapped NGS reads. We tested AlignerBoost with both simulated and real DNA-seq and RNA-seq datasets at various thresholds. In most cases, but especially for reads falling within repetitive regions, AlignerBoost dramatically increases the mapping precision of modern NGS aligners without significantly compromising the sensitivity even without mapping quality filters. When using higher mapping quality cutoffs, AlignerBoost achieves a much lower false mapping rate while exhibiting comparable or higher sensitivity compared to the aligner default modes, therefore significantly boosting the detection power of NGS aligners even using extreme thresholds. AlignerBoost is also SNP-aware, and higher quality alignments can be achieved if provided with known SNPs. AlignerBoost's algorithm is computationally efficient, and can process one million alignments within 30 seconds on a typical desktop computer. AlignerBoost is implemented as a uniform Java application and is freely available at https://github.com/Grice-Lab/AlignerBoost.
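The idea of a Bayesian mapping quality for ambiguously mapped reads can be sketched generically: score each candidate location by a likelihood, normalize to a posterior, and report the Phred-scaled probability that the chosen location is wrong. The sketch below is not AlignerBoost's model, which the abstract does not specify. It assumes a uniform prior over candidates, a mismatch-only likelihood with a fixed per-base error rate, and hypothetical function names.

```python
import math


def posteriors(mismatches, read_len, error_rate=0.01):
    """Posterior probability of each candidate hit under a uniform prior.

    Likelihood of a hit with m mismatches: error_rate**m * (1-error_rate)**(read_len-m).
    Computed in log space to avoid underflow for long reads.
    """
    logs = [
        m * math.log(error_rate) + (read_len - m) * math.log(1.0 - error_rate)
        for m in mismatches
    ]
    top = max(logs)
    weights = [math.exp(x - top) for x in logs]
    total = sum(weights)
    return [w / total for w in weights]


def phred_mapq(posterior, cap=60):
    """Phred-scaled mapping quality: -10 * log10(P(mapping is wrong)), capped."""
    err = 1.0 - posterior
    if err <= 0.0:
        return cap
    return min(cap, round(-10.0 * math.log10(err)))
```

Under these assumptions, a read with two equally good hits gets a posterior of 0.5 for each and a mapping quality of 3, while a read whose second-best hit has two extra mismatches keeps a high quality for the best hit.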
Transposon facilitated DNA sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Berg, D.E.; Berg, C.M.; Huang, H.V.

1990-01-01

The purpose of this research is to investigate and develop methods that exploit the power of bacterial transposable elements for large-scale DNA sequencing. Our premise is that the use of transposons to put primer binding sites randomly in target DNAs should provide access to all portions of large DNA fragments, without the inefficiencies of methods involving random subcloning and attendant repetitive sequencing, or of sequential synthesis of many oligonucleotide primers that are used to match systematically along a DNA molecule. Two unrelated bacterial transposons, Tn5 and γδ, are being used because they have both proven useful for molecular analyses, and because they differ sufficiently in mechanism and specificity of transposition to merit parallel development.
rep-PCR-Mediated Genomic Fingerprinting: A Rapid and Effective Method to Identify Clavibacter michiganensis.

PubMed

Louws, F J; Bell, J; Medina-Mora, C M; Smart, C D; Opgenorth, D; Ishimaru, C A; Hausbeck, M K; de Bruijn, F J; Fulbright, D W

1998-08-01

The genomic DNA fingerprinting technique known as repetitive-sequence-based polymerase chain reaction (rep-PCR) was evaluated as a tool to differentiate subspecies of Clavibacter michiganensis, with special emphasis on C. michiganensis subsp. michiganensis, the pathogen responsible for bacterial canker of tomato. DNA primers (REP, ERIC, and BOX), corresponding to conserved repetitive element motifs in the genomes of diverse bacterial species, were used to generate genomic fingerprints of C. michiganensis subsp. michiganensis, C. michiganensis subsp. sepedonicus, C. michiganensis subsp. nebraskensis, C. michiganensis subsp. tessellarius, and C. michiganensis subsp. insidiosum. The rep-PCR-generated patterns of DNA fragments observed after agarose gel electrophoresis support the current division of C. michiganensis into five subspecies. In addition, the rep-PCR fingerprints identified at least four types (A, B, C, and D) within C. michiganensis subsp. michiganensis based on limited DNA polymorphisms; the ability to differentiate individual strains may be of potential use in studies on the epidemiology and host-pathogen interactions of this organism. In addition, we have recovered from diseased tomato plants a relatively large number of naturally occurring avirulent C. michiganensis subsp. michiganensis strains with rep-PCR fingerprints identical to those of virulent C. michiganensis subsp. michiganensis strains.
Chromosome specific repetitive DNA sequences

DOEpatents

Moyzis, Robert K.; Meyne, Julianne

1991-01-01

A method is provided for determining specific nucleotide sequences useful in forming a probe which can identify specific chromosomes, preferably through in situ hybridization within the cell itself. In one embodiment, chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members. This invention is the result of a contract with the Department of Energy (Contract No. W-7405-ENG-36).
Functional impact of the human mobilome.

PubMed

Babatz, Timothy D; Burns, Kathleen H

2013-06-01

The human genome is replete with interspersed repetitive sequences derived from the propagation of mobile DNA elements. Three families of human retrotransposons remain active today: LINE1, Alu, and SVA elements. Since 1988, de novo insertions at previously recognized disease loci have been shown to generate highly penetrant alleles in Mendelian disorders. Only recently has the extent of germline-transmitted retrotransposon insertion polymorphism (RIP) in human populations been fully realized. Also exciting are recent studies of somatic retrotransposition in human tissues and reports of tumor-specific insertions, suggesting roles in tissue heterogeneity and tumorigenesis. Here we discuss mobile elements in human disease with an emphasis on exciting developments from the last several years. Copyright © 2013 Elsevier Ltd. All rights reserved.
Comparison of Methods of Detection of Exceptional Sequences in Prokaryotic Genomes.

PubMed

Rusinov, I S; Ershova, A S; Karyagina, A S; Spirin, S A; Alexeevski, A V

2018-02-01

Many proteins need to recognize specific DNA sequences in order to function. The number of recognition sites and their distribution along the DNA might be of biological importance. For example, the number of restriction sites is often reduced in prokaryotic and phage genomes to decrease the probability of DNA cleavage by restriction endonucleases. We call a sequence an exceptional one if its frequency in a genome significantly differs from one predicted by some mathematical model. An exceptional sequence could be either under- or over-represented, depending on its frequency in comparison with the predicted one. Exceptional sequences could be considered biologically meaningful, for example, as targets of DNA-binding proteins or as parts of abundant repetitive elements. Several methods are used to predict the frequency of a short sequence in a genome based on the actual frequencies of certain of its subsequences. The most popular are methods based on Markov chain models. However, no rigorous comparison of the methods has previously been performed. We compared three methods for the prediction of short sequence frequencies: the maximum-order Markov chain model-based method, the method that uses the geometric mean of extended Markovian estimates, and the method that utilizes frequencies of all subsequences including discontiguous ones. We applied them to restriction sites in complete genomes of 2500 prokaryotic species and demonstrated that the results depend greatly on the method used: lists of 5% of the most under-represented sites differed by up to 50%. The method designed by Burge and coauthors in 1992, which utilizes all subsequences of the sequence, showed a higher precision than the other two methods both on prokaryotic genomes and randomly generated sequences after computational imitation of selective pressure. We propose this method as the first choice for detection of exceptional sequences in prokaryotic genomes.
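As a concrete illustration of the first of the three approaches, the sketch below uses the maximum-order Markov estimate as it is commonly formulated: the expected count of a word w of length k is N(w[1..k-1]) × N(w[2..k]) / N(w[2..k-1]), where N counts overlapping occurrences in the genome. Whether the paper uses exactly this form, and how it handles strands or significance testing, is not stated in the abstract, so this is an assumption; the function names are hypothetical. The other two methods are not sketched here.

```python
def count_overlapping(genome: str, word: str) -> int:
    """Count occurrences of `word` in `genome`, allowing overlaps."""
    count = 0
    start = genome.find(word)
    while start != -1:
        count += 1
        start = genome.find(word, start + 1)
    return count


def expected_count_max_order(genome: str, word: str) -> float:
    """Maximum-order Markov estimate of the expected count of `word`.

    Uses counts of the (k-1)-mer prefix, the (k-1)-mer suffix, and the
    (k-2)-mer core shared by both. Requires len(word) >= 3.
    """
    if len(word) < 3:
        raise ValueError("word must have at least 3 bases")
    prefix = count_overlapping(genome, word[:-1])
    suffix = count_overlapping(genome, word[1:])
    core = count_overlapping(genome, word[1:-1])
    if core == 0:
        return 0.0
    return prefix * suffix / core


def observed_to_expected(genome: str, word: str) -> float:
    """Ratio below 1 suggests under-representation, above 1 over-representation."""
    expected = expected_count_max_order(genome, word)
    if expected == 0.0:
        return float("nan")
    return count_overlapping(genome, word) / expected
```

A restriction site whose observed-to-expected ratio falls well below 1 would be a candidate under-represented (exceptional) sequence under this model; a full analysis would also need a significance criterion, which is omitted here.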
Unique nucleotide sequence-guided assembly of repetitive DNA parts for synthetic biology applications

DOE Office of Scientific and Technical Information (OSTI.GOV)

Torella, JP; Lienert, F; Boehm, CR

2014-08-07

Recombination-based DNA construction methods, such as Gibson assembly, have made it possible to easily and simultaneously assemble multiple DNA parts, and they hold promise for the development and optimization of metabolic pathways and functional genetic circuits. Over time, however, these pathways and circuits have become more complex, and the increasing need for standardization and insulation of genetic parts has resulted in sequence redundancies (for example, repeated terminator and insulator sequences) that complicate recombination-based assembly. We and others have recently developed DNA assembly methods, which we refer to collectively as unique nucleotide sequence (UNS)-guided assembly, in which individual DNA parts are flanked with UNSs to facilitate the ordered, recombination-based assembly of repetitive sequences. Here we present a detailed protocol for UNS-guided assembly that enables researchers to convert multiple DNA parts into sequenced, correctly assembled constructs, or into high-quality combinatorial libraries in only 2-3 d. If the DNA parts must be generated from scratch, an additional 2-5 d are necessary. This protocol requires no specialized equipment and can easily be implemented by a student with experience in basic cloning techniques.
Unique nucleotide sequence (UNS)-guided assembly of repetitive DNA parts for synthetic biology applications

PubMed Central

Torella, Joseph P.; Lienert, Florian; Boehm, Christian R.; Chen, Jan-Hung; Way, Jeffrey C.; Silver, Pamela A.

2016-01-01

Recombination-based DNA construction methods, such as Gibson assembly, have made it possible to easily and simultaneously assemble multiple DNA parts and hold promise for the development and optimization of metabolic pathways and functional genetic circuits. Over time, however, these pathways and circuits have become more complex, and the increasing need for standardization and insulation of genetic parts has resulted in sequence redundancies — for example repeated terminator and insulator sequences — that complicate recombination-based assembly. We and others have recently developed DNA assembly methods that we refer to collectively as unique nucleotide sequence (UNS)-guided assembly, in which individual DNA parts are flanked with UNSs to facilitate the ordered, recombination-based assembly of repetitive sequences. Here we present a detailed protocol for UNS-guided assembly that enables researchers to convert multiple DNA parts into sequenced, correctly-assembled constructs, or into high-quality combinatorial libraries in only 2–3 days. If the DNA parts must be generated from scratch, an additional 2–5 days are necessary. This protocol requires no specialized equipment and can easily be implemented by a student with experience in basic cloning techniques. PMID:25101822
Sequence-Independent Cloning and Post-Translational Modification of Repetitive Protein Polymers through Sortase and Sfp-Mediated Enzymatic Ligation.

PubMed

Ott, Wolfgang; Nicolaus, Thomas; Gaub, Hermann E; Nash, Michael A

2016-04-11

Repetitive protein-based polymers are important for many applications in biotechnology and biomaterials development. Here we describe the sequential additive ligation of highly repetitive DNA sequences, their assembly into genes encoding protein-polymers with precisely tunable lengths and compositions, and their end-specific post-translational modification with organic dyes and fluorescent protein domains. Our new Golden Gate-based cloning approach relies on incorporation of only type IIS BsaI restriction enzyme recognition sites using PCR, which allowed us to install ybbR-peptide tags, Sortase c-tags, and cysteine residues onto either end of the repetitive gene polymers without leaving residual cloning scars. The assembled genes were expressed in Escherichia coli and purified using inverse transition cycling (ITC). Characterization by cloud point spectrophotometry, and denaturing polyacrylamide gel electrophoresis with fluorescence detection confirmed successful phosphopantetheinyl transferase (Sfp)-mediated post-translational N-terminal labeling of the protein-polymers with a coenzyme A-647 dye (CoA-647) and simultaneous sortase-mediated C-terminal labeling with a GFP domain containing an N-terminal GG-motif in a one-pot reaction. In a further demonstration, we installed an N-terminal cysteine residue into an elastin-like polypeptide (ELP) that was subsequently conjugated to a single chain poly(ethylene glycol)-maleimide (PEG-maleimide) synthetic polymer, noticeably shifting the ELP cloud point. The ability to straightforwardly assemble repetitive DNA sequences encoding ELPs of precisely tunable length and to post-translationally modify them specifically at the N- and C- termini provides a versatile platform for the design and production of multifunctional smart protein-polymeric materials.
The Regulatory Properties of Autonomous Subtelomeric P Elements Are Sensitive to a Suppressor of Variegation in Drosophila Melanogaster

PubMed Central

Ronsseray, S.; Lehmann, M.; Nouaud, D.; Anxolabehere, D.

1996-01-01

Genetic recombination was used in Drosophila melanogaster to isolate P elements, inserted at the telomeres of X chromosomes (cytological site 1A) from natural populations, in a genetic background devoid of other P elements. We show that complete maternally inherited P repression in the germline (P cytotype) can be elicited by only two autonomous P elements at 1A and that a single element at this site has partial regulatory properties. The analysis of the surrounding chromosomal regions of the P elements at 1A shows that in all cases these elements are flanked by Telomeric Associated Sequences, tandemly repetitive noncoding sequences that have properties of heterochromatin. In addition, we show that the regulatory properties of P elements at 1A can be inhibited by some of the mutant alleles of the Su(var)205 gene and by a deficiency of this gene. However, the regulatory properties of reference P strains (Harwich and Texas 007) are not impaired by Su(var)205 mutations. Su(var)205 encodes Heterochromatin Protein 1 (HP1). These results suggest that the HP1 dosage effect on the P element properties is site-dependent and could involve the structure of the chromatin. PMID:8844154
DNABIT Compress – Genome compression algorithm

PubMed Central

Rajarajeswari, Pothuraju; Apparao, Allam

2011-01-01

Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, “DNABIT Compress”, for DNA sequences, based on a novel scheme that assigns binary bits to smaller segments of DNA bases in order to compress both repetitive and non-repetitive DNA sequences. Our proposed algorithm achieves the best compression ratio for DNA sequences of larger genomes. The significantly better compression results show that the “DNABIT Compress” algorithm is the best among the other compression algorithms. While achieving the best compression ratios for DNA sequences (genomes), our new DNABIT Compress algorithm significantly improves on the running time of all previous DNA compression programs. Assigning binary bits (a unique bit code) to exact-repeat and reverse-repeat fragments of a DNA sequence is also a concept introduced in this algorithm for the first time in DNA compression. The proposed algorithm achieves a compression ratio as low as 1.58 bits/base, whereas the best existing methods could not achieve a ratio below 1.72 bits/base. PMID:21383923

A major insertion accounts for a significant proportion of mutations underlying human lipoprotein lipase deficiency

DOE Office of Scientific and Technical Information (OSTI.GOV)

Langlois, S.; Kastelein, J.J.; Hayden, M.R.

1989-02-01

Lipoprotein lipase (LPL) is an important enzyme involved in triacylglycerol metabolism. Primary LPL deficiency is a genetic disorder that is usually manifested by a severe elevation in triacylglycerol levels. The authors have used a recently isolated LPL cDNA clone to study 15 probands from 11 families with this inherited disorder. Surprisingly, 7 of the probands from 4 families, of different ancestries, had a similar insertion in their LPL gene. In contrast to other human genetic disorders, where insertions are rare causes of mutation, this insertion accounts for a significant proportion of the alleles causing LPL deficiency. Detailed restriction mapping of the insertion revealed that it was unlikely to be a duplication of neighboring DNA and that it was not similar to the consensus sequence of human L1 repetitive elements. This suggests that there must be other mechanisms of insertional mutagenesis in human genetic disease besides transposition of mobile L1 repetitive elements.
Characteristics of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) rRNA genes of Apis mellifera (Insecta: Hymenoptera): structure, organization, and retrotransposable elements

PubMed Central

Gillespie, J J; Johnston, J S; Cannone, J J; Gutell, R R

2006-01-01

As an accompanying manuscript to the release of the honey bee genome, we report the entire sequence of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) ribosomal RNA (rRNA)-encoding gene sequences (rDNA) and related internally and externally transcribed spacer regions of Apis mellifera (Insecta: Hymenoptera: Apocrita). Additionally, we predict secondary structures for the mature rRNA molecules based on comparative sequence analyses with other arthropod taxa and reference to recently published crystal structures of the ribosome. In general, the structures of honey bee rRNAs are in agreement with previously predicted rRNA models from other arthropods in core regions of the rRNA, with little additional expansion in non-conserved regions. Our multiple sequence alignments are made available on several public databases and provide a preliminary establishment of a global structural model of all rRNAs from the insects. Additionally, we provide conserved stretches of sequences flanking the rDNA cistrons that comprise the externally transcribed spacer regions (ETS) and part of the intergenic spacer region (IGS), including several repetitive motifs. Finally, we report the occurrence of retrotransposition in the nuclear large subunit rDNA, as R2 elements are present in the usual insertion points found in other arthropods. Interestingly, functional R1 elements usually present in the genomes of insects were not detected in the honey bee rRNA genes. The reverse transcriptase products of the R2 elements are deduced from their putative open reading frames and structurally aligned with those from another hymenopteran insect, the jewel wasp Nasonia (Pteromalidae). Stretches of conserved amino acids shared between Apis and Nasonia are illustrated and serve as potential sites for primer design, as target amplicons within these R2 elements may serve as novel phylogenetic markers for Hymenoptera. 
Given the impending completion of the sequencing of the Nasonia genome, we expect our report eventually to shed light on the evolution of the hymenopteran genome within higher insects, particularly regarding the relative maintenance of conserved rDNA genes, related variable spacer regions and retrotransposable elements. PMID:17069639
The repeating nucleotide sequence in the repetitive mitochondrial DNA from a "low-density" petite mutant of yeast.

PubMed Central

Van Kreijl, C F; Bos, J L

1977-01-01

The repeating nucleotide sequence of 68 base pairs in the mtDNA from an ethidium-induced cytoplasmic petite mutant of yeast has been determined. For sequence analysis, specifically primed and terminated RNA copies, obtained by in vitro transcription of the separated strands, were used. The sequence consists of 66 consecutive AT base pairs flanked by two GC pairs and comprises nearly all of the mutant mitochondrial genome. The sequence, moreover, also represents the first part of wild-type mtDNA sequenced so far. PMID:198740
A test matrix sequencer for research test facility automation

NASA Technical Reports Server (NTRS)

Mccartney, Timothy P.; Emery, Edward F.

1990-01-01

The hardware and software configuration of a Test Matrix Sequencer, a general purpose test matrix profiler that was developed for research test facility automation at the NASA Lewis Research Center, is described. The system provides set points to controllers and contact closures to data systems during the course of a test. The Test Matrix Sequencer consists of a microprocessor controlled system which is operated from a personal computer. The software program, which is the main element of the overall system is interactive and menu driven with pop-up windows and help screens. Analog and digital input/output channels can be controlled from a personal computer using the software program. The Test Matrix Sequencer provides more efficient use of aeronautics test facilities by automating repetitive tasks that were once done manually.
Variation in a surface-exposed region of the Mycoplasma pneumoniae P40 protein as a consequence of homologous DNA recombination between RepMP5 elements.

PubMed

Spuesens, Emiel B M; van de Kreeke, Nick; Estevão, Silvia; Hoogenboezem, Theo; Sluijter, Marcel; Hartwig, Nico G; van Rossum, Annemarie M C; Vink, Cornelis

2011-02-01

Mycoplasma pneumoniae is a human pathogen that causes a range of respiratory tract infections. The first step in infection is adherence of the bacteria to the respiratory epithelium. This step is mediated by a specialized organelle, which contains several proteins (cytadhesins) that have an important function in adherence. Two of these cytadhesins, P40 and P90, represent the proteolytic products from a single 130 kDa protein precursor, which is encoded by the MPN142 gene. Interestingly, MPN142 contains a repetitive DNA element, termed RepMP5, of which homologues are found at seven other loci within the M. pneumoniae genome. It has been hypothesized that these RepMP5 elements, which are similar but not identical in sequence, recombine with their counterpart within MPN142 and thereby provide a source of sequence variation for this gene. As this variation may give rise to amino acid changes within P40 and P90, the recombination between RepMP5 elements may constitute the basis of antigenic variation and, possibly, immune evasion by M. pneumoniae. To investigate the sequence variation of MPN142 in relation to inter-RepMP5 recombination, we determined the sequences of all RepMP5 elements in a collection of 25 strains. The results indicate that: (i) inter-RepMP5 recombination events have occurred in seven of the strains, and (ii) putative RepMP5 recombination events involving MPN142 have induced amino acid changes in a surface-exposed part of the P40 protein in two of the strains. We conclude that recombination between RepMP5 elements is a common phenomenon that may lead to sequence variation of MPN142-encoded proteins.
Genomic Organization Under Different Environmental Conditions: Hoplosternum Littorale as a Model

PubMed Central

Schneider, Carlos Henrique; Feldberg, Eliana; Baccaro, Fabricio Beggiato; Carvalho, Natália Dayane Moura; Gross, Maria Claudia

2016-01-01

The Amazon has abundant rivers, streams, and floodplains in both polluted and nonpolluted environments, which show great adaptability. Thus, the goal of this study was to map repetitive DNA sequences in both mitotic chromosomes and erythrocyte micronuclei of tamoatás from polluted and nonpolluted environments and to assess the possible genotoxic effects of these environments. Individuals were collected in Manaus, Amazonas (AM), and submitted to classical and molecular cytogenetic techniques, as well as to a blood micronucleus test. A diploid number of 60 chromosomes is present in all individuals, with 18S ribosomal DNA sites present in one chromosome pair and no interstitial telomeric sites on chromosomes. The micronucleus test showed no significant differences in pairwise comparisons between environments or collection sites, but the Rex3 retroelement was dispersed on the chromosomes of individuals from unpolluted environments and compartmentalized in individuals from polluted environments. Divergent numbers of 5S rDNA sites are present in individuals from unpolluted and polluted environments. The mapping of repetitive sequences revealed that micronuclei have different compositions both intra- and interindividually, which suggests that different regions are lost in the formation of micronuclei and that no single fragile region undergoes breaks, although repetitive DNA elements are involved in this process. PMID:26981695
Comparative Sequence and X-Inactivation Analyses of a Domain of Escape in Human Xp11.2 and the Conserved Segment in Mouse

PubMed Central

Tsuchiya, Karen D.; Greally, John M.; Yi, Yajun; Noel, Kevin P.; Truong, Jean-Pierre; Disteche, Christine M.

2004-01-01

We have performed X-inactivation and sequence analyses on 350 kb of sequence from human Xp11.2, a region shown previously to contain a cluster of genes that escape X inactivation, and we compared this region with the region of conserved synteny in mouse. We identified several new transcripts from this region in human and in mouse, which defined the full extent of the domain escaping X inactivation in both species. In human, escape from X inactivation involves an uninterrupted 235-kb domain of multiple genes. Despite highly conserved gene content and order between the two species, Smcx is the only mouse gene from the conserved segment that escapes inactivation. As repetitive sequences are believed to facilitate spreading of X inactivation along the chromosome, we compared the repetitive sequence composition of this region between the two species. We found that long terminal repeats (LTRs) were decreased in the human domain of escape, but not in the majority of the conserved mouse region adjacent to Smcx in which genes were subject to X inactivation, suggesting that these repeats might be excluded from escape domains to prevent spreading of silencing. Our findings indicate that genomic context, as well as gene-specific regulatory elements, interact to determine expression of a gene from the inactive X-chromosome. PMID:15197169
Combinatorial codon scrambling enables scalable gene synthesis and amplification of repetitive proteins

NASA Astrophysics Data System (ADS)

Tang, Nicholas C.; Chilkoti, Ashutosh

2016-04-01

Most genes are synthesized using seamless assembly methods that rely on the polymerase chain reaction (PCR). However, PCR of genes encoding repetitive proteins either fails or generates nonspecific products. Motivated by the need to efficiently generate new protein polymers through high-throughput gene synthesis, here we report a codon-scrambling algorithm that enables the PCR-based gene synthesis of repetitive proteins by exploiting the codon redundancy of amino acids and finding the least-repetitive synonymous gene sequence. We also show that the codon-scrambling problem is analogous to the well-known travelling salesman problem, and obtain an exact solution to it by using De Bruijn graphs and a modern mixed integer linear programme solver. As experimental proof of the utility of this approach, we use it to optimize the synthetic genes for 19 repetitive proteins, and show that the gene fragments are amenable to PCR-based gene assembly and recombinant expression.
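The idea in the abstract above, choosing among synonymous codons so that the DNA encoding a repetitive protein is itself as non-repetitive as possible, can be illustrated with a small sketch. This is not the authors' method: the abstract describes an exact solution via De Bruijn graphs and a mixed integer linear programme solver, whereas the sketch below uses plain random search. The names `CODONS`, `repeat_score`, `scramble`, and `translate`, the k-mer length of 8, and the partial codon table are all illustrative assumptions.

```python
import random

# Partial synonymous-codon table from the standard genetic code.
# Only the residues used in the demo are included (assumption for brevity).
CODONS = {
    "G": ["GGT", "GGC", "GGA", "GGG"],
    "V": ["GTT", "GTC", "GTA", "GTG"],
    "P": ["CCT", "CCC", "CCA", "CCG"],
    "A": ["GCT", "GCC", "GCA", "GCG"],
}
# Reverse lookup used to check that a DNA string still encodes the protein.
AMINO_ACID = {codon: aa for aa, codons in CODONS.items() for codon in codons}


def repeat_score(dna, k=8):
    """Count k-mer occurrences that repeat an earlier k-mer (lower is better)."""
    seen = set()
    duplicates = 0
    for i in range(len(dna) - k + 1):
        kmer = dna[i:i + k]
        if kmer in seen:
            duplicates += 1
        seen.add(kmer)
    return duplicates


def translate(dna):
    """Translate DNA back to protein using the partial table above."""
    return "".join(AMINO_ACID[dna[i:i + 3]] for i in range(0, len(dna), 3))


def scramble(protein, k=8, tries=2000, seed=0):
    """Random search for a synonymous encoding with few repeated k-mers.

    Starts from the naive encoding (first codon for every residue), so the
    returned score is never worse than the naive one.
    """
    rng = random.Random(seed)
    best = "".join(CODONS[aa][0] for aa in protein)
    best_score = repeat_score(best, k)
    for _ in range(tries):
        candidate = "".join(rng.choice(CODONS[aa]) for aa in protein)
        score = repeat_score(candidate, k)
        if score < best_score:
            best, best_score = candidate, score
    return best, best_score


if __name__ == "__main__":
    protein = "VPGVG" * 4  # a short repetitive peptide chosen for the demo
    dna, score = scramble(protein)
    print(dna, score)
```

Random search gives no optimality guarantee; it only shows why codon redundancy leaves room to reduce repetition at the DNA level while keeping the protein sequence unchanged.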
Complete Genomic Structure of the Bloom-forming Toxic Cyanobacterium Microcystis aeruginosa NIES-843

PubMed Central

Kaneko, Takakazu; Nakajima, Nobuyoshi; Okamoto, Shinobu; Suzuki, Iwane; Tanabe, Yuuhiko; Tamaoki, Masanori; Nakamura, Yasukazu; Kasai, Fumie; Watanabe, Akiko; Kawashima, Kumiko; Kishida, Yoshie; Ono, Akiko; Shimizu, Yoshimi; Takahashi, Chika; Minami, Chiharu; Fujishiro, Tsunakazu; Kohara, Mitsuyo; Katoh, Midori; Nakazaki, Naomi; Nakayama, Shinobu; Yamada, Manabu; Tabata, Satoshi; Watanabe, Makoto M.

2007-01-01

The nucleotide sequence of the complete genome of a cyanobacterium, Microcystis aeruginosa NIES-843, was determined. The genome of M. aeruginosa is a single, circular chromosome of 5 842 795 base pairs (bp) in length, with an average GC content of 42.3%. The chromosome comprises 6312 putative protein-encoding genes, two sets of rRNA genes, 42 tRNA genes representing 41 tRNA species, and genes for tmRNA, the B subunit of RNase P, SRP RNA, and 6Sa RNA. Forty-five percent of the putative protein-encoding sequences showed sequence similarity to genes of known function, 32% were similar to hypothetical genes, and the remaining 23% had no apparent similarity to reported genes. A total of 688 kb of the genome, equivalent to 11.8% of the entire genome, were composed of both insertion sequences and miniature inverted-repeat transposable elements. This is indicative of a plasticity of the M. aeruginosa genome, through a mechanism that involves homologous recombination mediated by repetitive DNA elements. In addition to known gene clusters related to the synthesis of microcystin and cyanopeptolin, novel gene clusters that may be involved in the synthesis and modification of toxic small polypeptides were identified. Compared with other cyanobacteria, a relatively small number of genes for two component systems and a large number of genes for restriction-modification systems were notable characteristics of the M. aeruginosa genome. PMID:18192279
Isolation of Vibrio alginolyticus and Vibrio splendidus from captive-bred seahorses with disease symptoms.

PubMed

Balcázar, José L; Gallo-Bueno, Alfonso; Planas, Miquel; Pintado, José

2010-02-01

Vibrio species isolated from diseased seahorses were characterized by PCR amplification of repetitive bacterial DNA elements (rep-PCR) and identified by 16S ribosomal RNA gene sequence analysis. The results demonstrated that Vibrio alginolyticus and Vibrio splendidus were predominant in the lesions of these seahorses. To our knowledge, this is the first time that these bacterial species have been associated with disease symptoms in captive-bred seahorses.
Influence of Geographical Origin and Flour Type on Diversity of Lactic Acid Bacteria in Traditional Belgian Sourdoughs

PubMed Central

Scheirlinck, Ilse; Van der Meulen, Roel; Van Schoor, Ann; Vancanneyt, Marc; De Vuyst, Luc; Vandamme, Peter; Huys, Geert

2007-01-01

A culture-based approach was used to investigate the diversity of lactic acid bacteria (LAB) in Belgian traditional sourdoughs and to assess the influence of flour type, bakery environment, geographical origin, and technological characteristics on the taxonomic composition of these LAB communities. For this purpose, a total of 714 LAB from 21 sourdoughs sampled at 11 artisan bakeries throughout Belgium were subjected to a polyphasic identification approach. The microbial composition of the traditional sourdoughs was characterized by bacteriological culture in combination with genotypic identification methods, including repetitive element sequence-based PCR fingerprinting and phenylalanyl-tRNA synthase (pheS) gene sequence analysis. LAB from Belgian sourdoughs belonged to the genera Lactobacillus, Pediococcus, Leuconostoc, Weissella, and Enterococcus, with the heterofermentative species Lactobacillus paralimentarius, Lactobacillus sanfranciscensis, Lactobacillus plantarum, and Lactobacillus pontis as the most frequently isolated taxa. Statistical analysis of the identification data indicated that the microbial composition of the sourdoughs is mainly affected by the bakery environment rather than the flour type (wheat, rye, spelt, or a mixture of these) used. In conclusion, the polyphasic approach, based on rapid genotypic screening and high-resolution, sequence-dependent identification, proved to be a powerful tool for studying the LAB diversity in traditional fermented foods such as sourdough. PMID:17675431
Amplification of a specific repetitive DNA sequence for Trypanosoma rangeli identification and its potential application in epidemiological investigations.

PubMed

Vargas, N; Souto, R P; Carranza, J C; Vallejo, G A; Zingales, B

2000-11-01

Trypanosoma rangeli can infect humans as well as the same domestic and wild animals and triatomine vectors infected by Trypanosoma cruzi in Central and South America. This overlapping distribution complicates the epidemiology of American trypanosomiasis due to the cross-reactivity between T. rangeli and T. cruzi antigens and the presence of conserved DNA sequences in these parasites. We have isolated a T. rangeli-specific DNA repetitive element which is represented in approximately 10^3 copies per parasite genome and is distributed in several chromosomal bands. The 542-bp nucleotide sequence of this element, named P542, was determined and a PCR assay was standardized for its amplification. The sensitivity of the assay is high, allowing the detection of one tenth of the DNA content of a single parasite. The presence of the P542 element was confirmed in 11 T. rangeli isolates from mammalian hosts and insect vectors originating from several countries in Latin America. Negative amplification was observed with different T. cruzi strains and other trypanosomatids. The potential field application of the P542 PCR assay was investigated in simulated samples containing T. rangeli and/or T. cruzi and intestinal tract and feces of Rhodnius prolixus. Epidemiological studies were conducted in DNA preparations obtained from the digestive tracts of 12 Rhodnius colombiensis insects collected in a sylvatic area in Colombia. Positive amplification of the P542 element was obtained in 9/12 insects. We have also compared in the same samples the diagnostic performance of two PCR assays for the amplification of the variable domain of minicircle kinetoplast DNA (kDNA) and of the large subunit (LSU) of the ribosomal RNA gene of T. cruzi and T. rangeli. Data indicate that the kDNA PCR assay does not allow diagnosis of mixed infections in most insects. On the other hand, the PCR assay of the LSU RNA gene showed lower sensitivity in the detection of T. rangeli than the PCR assay of the P542 element. It is predicted that the use of sensitive detection techniques will indicate that the actual distribution of T. rangeli in America is wider than presumed. Copyright 2000 Academic Press.
Discovery of rare, diagnostic AluYb8/9 elements in diverse human populations.

PubMed

Feusier, Julie; Witherspoon, David J; Scott Watkins, W; Goubert, Clément; Sasani, Thomas A; Jorde, Lynn B

2017-01-01

Polymorphic human Alu elements are excellent tools for assessing population structure, and new retrotransposition events can contribute to disease. Next-generation sequencing has greatly increased the potential to discover Alu elements in human populations, and various sequencing and bioinformatics methods have been designed to tackle the problem of detecting these highly repetitive elements. However, current techniques for Alu discovery may miss rare, polymorphic Alu elements. Combining multiple discovery approaches may provide a better profile of the polymorphic Alu mobilome. Alu Yb8/9 elements have been a focus of our recent studies as they are young subfamilies (~2.3 million years old) that contribute ~30% of recent polymorphic Alu retrotransposition events. Here, we update our ME-Scan methods for detecting Alu elements and apply these methods to discover new insertions in a large set of individuals with diverse ancestral backgrounds. We identified 5,288 putative Alu insertion events, including several hundred novel Alu Yb8/9 elements from 213 individuals from 18 diverse human populations. Hundreds of these loci were specific to continental populations, and 23 non-reference population-specific loci were validated by PCR. We provide high-quality sequence information for 68 rare Alu Yb8/9 elements, of which 11 have hallmarks of an active source element. Our subfamily distribution of rare Alu Yb8/9 elements is consistent with previous datasets, and may be representative of rare loci. We also find that while ME-Scan and low-coverage, whole-genome sequencing (WGS) detect different Alu elements in 41 1000 Genomes individuals, the two methods yield similar population structure results. Current in-silico methods for Alu discovery may miss rare, polymorphic Alu elements. Therefore, using multiple techniques can provide a more accurate profile of Alu elements in individuals and populations. 
We improved our false-negative rate as an indicator of sample quality for future ME-Scan experiments. In conclusion, we demonstrate that ME-Scan is a good supplement for next-generation sequencing methods and is well-suited for population-level analyses.
Towards a physical map of the fertility genes on the heterochromatic Y chromosome of Drosophila hydei: families of repetitive sequences transcribed on the lampbrush loops Nooses and Threads are organized in extended clusters of several hundred kilobases.

PubMed

Trapitz, P; Glätzer, K H; Bünemann, H

1992-11-01

The understanding of structure and function of the so-called fertility genes of Drosophila is very limited due to their unusual size--several megabases--and their location on the heterochromatic Y chromosome. Since mapping of these genes has mainly been done by classical cytogenetic analyses using a small number of cytologically visible lampbrush loops as the sole markers for particular fertility genes, the resolution of the genetic map of the Y chromosome is restricted to 3-5 Mb. Here we demonstrate that a substantially finer subdivision of the megabase-sized fertility genes in the subtelomeric regions of the Y chromosome of Drosophila hydei can be achieved by a combination of digestion with restriction enzymes having 6 bp recognition sequences, and pulsed field gel electrophoresis. The physical subdivision is based upon large conserved fragments of repetitive DNA in the size range from 50 up to 1600 kb and refers to the long-range organization of several families of repetitive DNA involved in Y chromosomal transcription processes in primary spermatocytes. We conclude from our results that at least five different families of repetitive DNA specifically transcribed on the lampbrush loops nooses and threads are organized as extended clusters of several hundred kb, essentially free of interspersed non-repetitive sequences.
Single haplotype assembly of the human genome from a hydatidiform mole.

PubMed

Steinberg, Karyn Meltz; Schneider, Valerie A; Graves-Lindsay, Tina A; Fulton, Robert S; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C; Church, Deanna M; Eichler, Evan E; Wilson, Richard K

2014-12-01

A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content shows this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicates misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. © 2014 Steinberg et al.; Published by Cold Spring Harbor Laboratory Press.
Single haplotype assembly of the human genome from a hydatidiform mole

PubMed Central

Steinberg, Karyn Meltz; Schneider, Valerie A.; Graves-Lindsay, Tina A.; Fulton, Robert S.; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A.; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C.; Church, Deanna M.; Eichler, Evan E.; Wilson, Richard K.

2014-01-01

A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content shows this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicates misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. PMID:25373144
Alternation blindness in the representation of binary sequences.

PubMed

Yu, Ru Qi; Osherson, Daniel; Zhao, Jiaying

2018-03-01

Binary information is prevalent in the environment and contains 2 distinct outcomes. Binary sequences consist of a mixture of alternation and repetition. Understanding how people perceive such sequences would contribute to a general theory of information processing. In this study, we examined how people process alternation and repetition in binary sequences. Across 4 paradigms involving estimation, working memory, change detection, and visual search, we found that the number of alternations is underestimated compared with repetitions (Experiment 1). Moreover, recall for binary sequences deteriorates as the sequence alternates more (Experiment 2). Changes in bits are also harder to detect as the sequence alternates more (Experiment 3). Finally, visual targets superimposed on bits of a binary sequence take longer to process as alternation increases (Experiment 4). Overall, our results indicate that compared with repetition, alternation in a binary sequence is less salient in the sense of requiring more attention for successful encoding. The current study thus reveals the cognitive constraints in the representation of alternation and provides a new explanation for the overalternation bias in randomness perception. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
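To make the two quantities in the abstract above concrete, the following minimal sketch counts alternations (adjacent bits that differ) and repetitions (adjacent bits that match) in a binary sequence. The function name `count_transitions` and the string representation of the sequence are assumptions for illustration; the abstract does not describe how the study's stimuli were scored.

```python
def count_transitions(bits):
    """Return (alternations, repetitions) over adjacent pairs of a sequence.

    An alternation is a pair of neighbouring symbols that differ;
    a repetition is a pair of neighbouring symbols that are equal.
    """
    alternations = 0
    repetitions = 0
    for previous, current in zip(bits, bits[1:]):
        if previous != current:
            alternations += 1
        else:
            repetitions += 1
    return alternations, repetitions


if __name__ == "__main__":
    for sequence in ("0101", "0000", "00110"):
        print(sequence, count_transitions(sequence))
```

A sequence of length n has n - 1 adjacent pairs, so the two counts always sum to n - 1.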
Entropic fluctuations in DNA sequences

NASA Astrophysics Data System (ADS)

Thanos, Dimitrios; Li, Wentian; Provata, Astero

2018-03-01

The Local Shannon Entropy (LSE) in blocks is used as a complexity measure to study the information fluctuations along DNA sequences. The LSE of a DNA block maps the local base arrangement information to a single numerical value. It is shown that despite this reduction of information, LSE allows meaningful information to be extracted for the detection of repetitive sequences in whole chromosomes and is useful in finding evolutionary differences between organisms. More specifically, large regions of tandem repeats, such as centromeres, can be detected based on their low LSE fluctuations along the chromosome. Furthermore, an empirical investigation of the appropriate block sizes is provided and the relationship of LSE properties with the structure of the underlying repetitive units is revealed by using both computational and mathematical methods. Sequence similarity between the genomic DNA of closely related species also leads to similar LSE values at the orthologous regions. As an application, the LSE covariance function is used to measure the evolutionary distance between several primate genomes.
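A block-wise entropy profile of the kind described in the abstract above can be sketched as follows. The abstract does not give the exact LSE definition, so this sketch assumes the standard Shannon entropy of the k-mer frequency distribution inside each non-overlapping block; the function names, the default `k=1`, and the block size are illustrative choices, not the authors' settings.

```python
import math
from collections import Counter


def block_entropy(block, k=1):
    """Shannon entropy, in bits, of the k-mer distribution within one block."""
    kmers = [block[i:i + k] for i in range(len(block) - k + 1)]
    total = len(kmers)
    counts = Counter(kmers)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def local_shannon_entropy(seq, block_size, k=1):
    """Entropy of each consecutive, non-overlapping block of the sequence.

    A trailing fragment shorter than block_size is ignored.
    """
    return [
        block_entropy(seq[start:start + block_size], k)
        for start in range(0, len(seq) - block_size + 1, block_size)
    ]


if __name__ == "__main__":
    # A homopolymer block, a mixed block, and a short tandem repeat block.
    sequence = "AAAA" + "ACGT" + "ATAT"
    print(local_shannon_entropy(sequence, block_size=4))
```

In this toy input the homopolymer block has zero entropy, the block containing all four bases has the maximum single-base entropy of 2 bits, and the AT tandem repeat sits in between. Along a real chromosome, a long tandem-repeat region would give nearly constant block values, which matches the low fluctuation the abstract associates with centromeres.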
Interchromosomal recombination in Zea mays.

PubMed Central

Hu, W; Timmermans, M C; Messing, J

1998-01-01

A new allele of the 27-kD zein locus in maize has been generated by interchromosomal recombination between chromosomes of two different inbred lines. A continuous patch of at least 11,817 bp of inbred W64A, containing the previously characterized Ra allele of the 27-kD zein gene, has been inserted into the genome of A188 by a single crossover. While both junction sequences are conserved, sequences of the two homologs between these junctions differ considerably. W64A contains the 7313-bp-long retrotransposon, Zeon-1. A188 contains a second copy of the 27-kD zein gene and a 2-kb repetitive element. Therefore, recombination results in a 7.3-kb insertion and a 14-kb deletion compared to the original S+A188 allele. If nonpairing sequences are looped out, 206 single base changes, frequently clustered, are present. The structure of this allele may explain how a recently discovered example of somatic recombination occurred in an A188/W64A hybrid. This would indicate that despite these sequence differences, pairing between these alleles could occur early during plant development. Therefore, such a somatically derived chimeric chromosome can also be heritable and give rise to new alleles. PMID:9799274

The American cranberry mitochondrial genome reveals the presence of selenocysteine (tRNA-Sec and SECIS) insertion machinery in land plants.

PubMed

Fajardo, Diego; Schlautman, Brandon; Steffan, Shawn; Polashock, James; Vorsa, Nicholi; Zalapa, Juan

2014-02-25

This is the first de novo assembly and annotation of a complete mitochondrial genome in the Ericales order from the American cranberry (Vaccinium macrocarpon Ait.). Moreover, only four complete Asterid mitochondrial genomes have been made publicly available. The cranberry mitochondrial genome was assembled and reconstructed from whole genome 454 Roche GS-FLX and Illumina shotgun sequences. Compared with other Asterids, the reconstruction of the genome revealed an average size mitochondrion (459,678 nt) with relatively little repetitive sequences and DNA of plastid origin. The complete mitochondrial genome of cranberry was annotated obtaining a total of 34 genes classified based on their putative function, plus three ribosomal RNAs, and 17 transfer RNAs. Maternal organellar cranberry inheritance was inferred by analyzing gene variation in the cranberry mitochondria and plastid genomes. The annotation of cranberry mitochondrial genome revealed the presence of two copies of tRNA-Sec and a selenocysteine insertion sequence (SECIS) element which were lost in plants during evolution. This is the first report of a land plant possessing selenocysteine insertion machinery at the sequence level. Published by Elsevier B.V.
A Surrogate Approach to Study the Evolution of Noncoding DNA Elements That Organize Eukaryotic Genomes

PubMed Central

Vermaak, Danielle; Bayes, Joshua J.

2009-01-01

Comparative genomics provides a facile way to address issues of evolutionary constraint acting on different elements of the genome. However, several important DNA elements have not reaped the benefits of this new approach. Some have proved intractable to current-day sequencing technology. These include centromeric and heterochromatic DNA, which are essential for chromosome segregation as well as gene regulation, but the highly repetitive nature of the DNA sequences in these regions makes them difficult to assemble into longer contigs. Other sequences, like dosage compensation X chromosomal sites, origins of DNA replication, or heterochromatic sequences that encode piwi-associated RNAs, have proved difficult to study because they do not have recognizable DNA features that allow them to be described functionally or computationally. We have employed an alternate approach to the direct study of these DNA elements. By using proteins that specifically bind these noncoding DNAs as surrogates, we can indirectly assay the evolutionary constraints acting on these important DNA elements. We review the impact that such “surrogate strategies” have had on our understanding of the evolutionary constraints shaping centromeres, origins of DNA replication, and dosage compensation X chromosomal sites. These have begun to reveal that in contrast to the view that such structural DNA elements are either highly constrained (under purifying selection) or free to drift (under neutral evolution), some of them may instead be shaped by adaptive evolution and genetic conflicts (these are not mutually exclusive). These insights also help to explain why the same elements (e.g., centromeres and replication origins), which are so complex in some eukaryotic genomes, can be simple and well defined in others where similar conflicts do not exist. PMID:19635763
Methylation patterns of repetitive DNA sequences in germ cells of Mus musculus.

PubMed

Sanford, J; Forrester, L; Chapman, V; Chandley, A; Hastie, N

1984-03-26

The major and the minor satellite sequences of Mus musculus were undermethylated in both sperm and oocyte DNAs relative to the amount of undermethylation observed in adult somatic tissue DNA. This hypomethylation was specific for satellite sequences in sperm DNA. Dispersed repetitive and low copy sequences show a high degree of methylation in sperm DNA; however, a dispersed repetitive sequence was undermethylated in oocyte DNA. This finding suggests a difference in the amount of total genomic DNA methylation between sperm and oocyte DNA. The methylation levels of the minor satellite sequences did not change during spermiogenesis, and were not associated with the onset of meiosis or a specific stage in sperm development.
Microarray Analysis of LTR Retrotransposon Silencing Identifies Hdac1 as a Regulator of Retrotransposon Expression in Mouse Embryonic Stem Cells

PubMed Central

Madej, Monika J.; Taggart, Mary; Gautier, Philippe; Garcia-Perez, Jose Luis; Meehan, Richard R.; Adams, Ian R.

2012-01-01

Retrotransposons are highly prevalent in mammalian genomes due to their ability to amplify in pluripotent cells or developing germ cells. Host mechanisms that silence retrotransposons in germ cells and pluripotent cells are important for limiting the accumulation of the repetitive elements in the genome during evolution. However, although silencing of selected individual retrotransposons can be relatively well-studied, many mammalian retrotransposons are seldom analysed and their silencing in germ cells, pluripotent cells or somatic cells remains poorly understood. Here we show, and experimentally verify, that cryptic repetitive element probes present in Illumina and Affymetrix gene expression microarray platforms can accurately and sensitively monitor repetitive element expression data. This computational approach to genome-wide retrotransposon expression has allowed us to identify the histone deacetylase Hdac1 as a component of the retrotransposon silencing machinery in mouse embryonic stem cells, and to determine the retrotransposon targets of Hdac1 in these cells. We also identify retrotransposons that are targets of other retrotransposon silencing mechanisms such as DNA methylation, Eset-mediated histone modification, and Ring1B/Eed-containing polycomb repressive complexes in mouse embryonic stem cells. Furthermore, our computational analysis of retrotransposon silencing suggests that multiple silencing mechanisms are independently targeted to retrotransposons in embryonic stem cells, that different genomic copies of the same retrotransposon can be differentially sensitive to these silencing mechanisms, and helps define retrotransposon sequence elements that are targeted by silencing machineries. Thus repeat annotation of gene expression microarray data suggests that a complex interplay between silencing mechanisms represses retrotransposon loci in germ cells and embryonic stem cells. PMID:22570599
AlignerBoost: A Generalized Software Toolkit for Boosting Next-Gen Sequencing Mapping Accuracy Using a Bayesian-Based Mapping Quality Framework

PubMed Central

Zheng, Qi; Grice, Elizabeth A.

2016-01-01

Accurate mapping of next-generation sequencing (NGS) reads to reference genomes is crucial for almost all NGS applications and downstream analyses. Various repetitive elements in human and other higher eukaryotic genomes contribute in large part to ambiguously (non-uniquely) mapped reads. Most available NGS aligners attempt to address this by either removing all non-uniquely mapping reads, or reporting one random or "best" hit based on simple heuristics. Accurate estimation of the mapping quality of NGS reads is therefore critical albeit completely lacking at present. Here we developed a generalized software toolkit "AlignerBoost", which utilizes a Bayesian-based framework to accurately estimate mapping quality of ambiguously mapped NGS reads. We tested AlignerBoost with both simulated and real DNA-seq and RNA-seq datasets at various thresholds. In most cases, but especially for reads falling within repetitive regions, AlignerBoost dramatically increases the mapping precision of modern NGS aligners without significantly compromising the sensitivity even without mapping quality filters. When using higher mapping quality cutoffs, AlignerBoost achieves a much lower false mapping rate while exhibiting comparable or higher sensitivity compared to the aligner default modes, therefore significantly boosting the detection power of NGS aligners even using extreme thresholds. AlignerBoost is also SNP-aware, and higher quality alignments can be achieved if provided with known SNPs. AlignerBoost’s algorithm is computationally efficient, and can process one million alignments within 30 seconds on a typical desktop computer. AlignerBoost is implemented as a uniform Java application and is freely available at https://github.com/Grice-Lab/AlignerBoost. PMID:27706155
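The idea of a Bayesian mapping quality for ambiguously mapped reads can be illustrated with a minimal sketch. This is not AlignerBoost's actual algorithm or API: the function names, the uniform prior over candidate loci, the mismatch-only error model, and the cap of 60 are all assumptions made for illustration. Each candidate hit gets a likelihood from its mismatch count, the posterior of the best hit is its share of the total likelihood, and the mapping quality is the Phred-scaled probability that the best hit is wrong.

```python
import math

def hit_likelihood(mismatches, read_len, error_rate=0.01):
    """Likelihood of the read given one candidate locus (hypothetical error model:
    independent substitutions, each of the 3 wrong bases equally likely)."""
    matches = read_len - mismatches
    return (error_rate / 3) ** mismatches * (1 - error_rate) ** matches

def mapping_quality(mismatch_counts, read_len, error_rate=0.01, cap=60):
    """Return (index of best hit, its posterior, Phred-scaled mapping quality).

    Assumes a uniform prior over the candidate loci, so the posterior is the
    best hit's likelihood divided by the sum over all candidates.
    """
    likelihoods = [hit_likelihood(m, read_len, error_rate) for m in mismatch_counts]
    best = max(range(len(likelihoods)), key=likelihoods.__getitem__)
    posterior = likelihoods[best] / sum(likelihoods)
    p_wrong = 1.0 - posterior
    # A probability of zero would give infinite quality, so cap the score.
    mapq = cap if p_wrong <= 0 else min(cap, round(-10 * math.log10(p_wrong)))
    return best, posterior, mapq

if __name__ == "__main__":
    # Two equally good hits in a repeat: the read cannot be placed confidently.
    print(mapping_quality([1, 1], read_len=50))
    # One perfect hit and one poor hit: the best hit is almost certainly right.
    print(mapping_quality([0, 5], read_len=50))
```

Under these assumptions a read with two identical hits receives a low score instead of being discarded or assigned at random, which is the behaviour the abstract contrasts with simple heuristics.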
Development of synthetic selfish elements based on modular nucleases in Drosophila melanogaster

PubMed Central

Simoni, Alekos; Siniscalchi, Carla; Chan, Yuk-Sang; Huen, David S.; Russell, Steven; Windbichler, Nikolai; Crisanti, Andrea

2014-01-01

Selfish genes are DNA elements that increase their rate of genetic transmission at the expense of other genes in the genome and can therefore quickly spread within a population. It has been suggested that selfish elements could be exploited to modify the genome of entire populations for medical and ecological applications. Here we report that transcription activator-like effector nuclease (TALEN) and zinc finger nuclease (ZFN) can be engineered into site-specific synthetic selfish elements (SSEs) and demonstrate their transmission of up to 70% in the Drosophila germline. We show here that SSEs can spread via DNA break-induced homologous recombination, a process known as ‘homing’ similar to that observed for homing endonuclease genes (HEGs), despite their fundamentally different modes of DNA binding and cleavage. We observed that TALEN and ZFN have a reduced capability of secondary homing compared to HEG as their repetitive structure had a negative effect on their genetic stability. The modular architecture of ZFNs and TALENs allows for the rapid design of novel SSEs against specific genomic sequences making them potentially suitable for the genetic engineering of wild-type populations of animals and plants, in applications such as gene replacement or population suppression of pest species. PMID:24803674
Centromeric enrichment of LINE-1 retrotransposons and its significance for the chromosome evolution of Phyllostomid bats.

PubMed

de Sotero-Caio, Cibele Gomes; Cabral-de-Mello, Diogo Cavalcanti; Calixto, Merilane da Silva; Valente, Guilherme Targino; Martins, Cesar; Loreto, Vilma; de Souza, Maria José; Santos, Neide

2017-10-01

Despite their ubiquitous incidence, little is known about the chromosomal distribution of long interspersed elements (LINEs) in mammalian genomes. Phyllostomid bats, characterized by lineages with distinct trends of chromosomal evolution coupled with remarkable ecological and taxonomic diversity, represent good models to understand how these repetitive sequences contribute to the evolution of genome architecture and its link to lineage diversification. To test the hypothesis that LINE-1 sequences were important modifiers of bat genome architecture, we characterized the distribution of LINE-1-derived sequences in the genomes of 13 phyllostomid species within a phylogenetic framework. We found massive accumulation of LINE-1 elements in the centromeres of most species: a rare phenomenon in mammalian genomes. We hypothesize that expansion of these elements occurred early in the radiation of phyllostomids and recurred episodically. LINE-1 expansions in centromeric heterochromatin probably spurred chromosomal change before the radiation of phyllostomids into the extant 11 subfamilies and contributed to the high degree of karyotypic variation observed among different lineages. Understanding centromere architecture in a variety of taxa promises to explain how lineage-specific changes in centromere structure can contribute to karyotypic diversity while not disrupting functional constraints for proper cell division.
LINE-1 retrotransposons: from 'parasite' sequences to functional elements.

PubMed

Paço, Ana; Adega, Filomena; Chaves, Raquel

2015-02-01

Long interspersed nuclear elements-1 (LINE-1) are the most abundant and active retrotransposons in the mammalian genomes. Traditionally, the occurrence of LINE-1 sequences in the genome of mammals has been explained by the selfish DNA hypothesis. Nevertheless, recently, it has also been argued that these sequences could play important roles in these genomes, as in the regulation of gene expression, genome modelling and X-chromosome inactivation. The non-random chromosomal distribution is a striking feature of these retroelements that somehow reflects its functionality. In the present study, we have isolated and analysed a fraction of the open reading frame 2 (ORF2) LINE-1 sequence from three rodent species, Cricetus cricetus, Peromyscus eremicus and Praomys tullbergi. Physical mapping of the isolated sequences revealed an interspersed longitudinal AT pattern of distribution along all the chromosomes of the complement in the three genomes. A detailed analysis shows that these sequences are preferentially located in the euchromatic regions, although some signals could be detected in the heterochromatin. In addition, a coincidence between the location of imprinted gene regions (as Xist and Tsix gene regions) and the LINE-1 retroelements was also observed. According to these results, we propose an involvement of LINE-1 sequences in different genomic events as gene imprinting, X-chromosome inactivation and evolution of repetitive sequences located at the heterochromatic regions (e.g. satellite DNA sequences) of the rodents' genomes analysed.
A genome-specific repetitive DNA sequence from Oryza eichingeri: characterization, localization, and introgression to O. sativa.

PubMed

Yan, H. H.; Liu, G. Q.; Cheng, Z. K.; Li, X. B.; Liu, G. Z.; Min, S. K.; Zhu, L.H.

2002-02-01

In the course of transferring the brown planthopper resistance from a diploid, CC-genome wild rice species, Oryza eichingeri (IRGC acc. 105159 and 105163), to the cultivated rice variety 02428, we have isolated many alien addition and introgression lines. The O. eichingeri chromatin in some of these lines has previously been identified using genomic in situ hybridization and molecular-marker analysis. Here we cloned a tandemly repetitive DNA sequence from O. eichingeri IRGC acc. 105163, and detected it in 25 introgression lines. This repetitive DNA sequence showed high specificity to the rice CC genome, but was absent from all the four tetraploid species with BBCC or CCDD genomes. The monomer in this repetitive DNA sequence is 325-366 bp long, with a copy number of about 5,000 per 1C of the O. eichingeri genome, showing 88% homology to a repetitive DNA sequence isolated from Oryza officinalis (2n = 2x = 24, CC). Fluorescent in situ hybridization revealed 11 signals distributed over eight O. eichingeri chromosomes, mostly in terminal or subterminal regions.
Evolutionary Dynamics of the W Chromosome in Caenophidian Snakes

PubMed Central

Augstenová, Barbora; Rovatsos, Michail

2017-01-01

The caenophidian (also referred to as “advanced”) snakes are traditionally viewed as a group of reptiles with limited karyotypic variation and stable ZZ/ZW sex chromosomes. The W chromosomes of the caenophidian snakes are heterochromatic, and pioneering studies demonstrated that they are rich in repetitive elements. However, a comparative study of the evolutionary dynamics of the repetitive content of the W chromosome across the whole lineage is missing. Using molecular-cytogenetic techniques, we explored the distribution of four repetitive motifs (microsatellites GATA, GACA, AG and telomeric-like sequences), which are frequently accumulated in differentiated sex chromosomes in vertebrates, in the genomes of 13 species of the caenophidian snakes covering a wide phylogenetic spectrum of the lineage. The results demonstrate a striking variability in the morphology and the repetitive content of the W chromosomes even between closely related species, which is in contrast to the homology and long-term stability of the gene content of the caenophidian Z chromosome. We found that the tested microsatellite motifs are accumulated on the degenerated, heterochromatic W chromosomes in all tested species of the caenophidian snakes with the exception of the Javan file snake, which represents a basal clade. On the other hand, the accumulation of telomeric-like sequences on the caenophidian W chromosome is evolutionarily much less stable. Moreover, we demonstrated that large accumulations of telomeric-like motifs on the W chromosome contribute to sexual differences in the number of copies of the telomeric and telomeric-like repeats estimated by quantitative PCR, which might be confusing and incorrectly interpreted as sexual differences in telomere length. PMID:29283388
Tongue Motion Averaging from Contour Sequences

ERIC Educational Resources Information Center

Li, Min; Kambhamettu, Chandra; Stone, Maureen

2005-01-01

In this paper, a method to get the best representation of a speech motion from several repetitions is presented. Each repetition is a representation of the same speech captured at different times by sequence of ultrasound images and is composed of a set of 2D spatio-temporal contours. These 2D contours in different repetitions are time aligned…
Tactile Ranschburg effects: facilitation and inhibitory repetition effects analogous to verbal memory.

PubMed

Roe, Daisy; Miles, Christopher; Johnson, Andrew J

2017-07-01

The present paper examines the effect of within-sequence item repetitions in tactile order memory. Employing an immediate serial recall procedure, participants reconstructed a six-item sequence tapped upon their fingers by moving those fingers in the order of original stimulation. In Experiment 1a, within-sequence repetition of an item separated by two intervening items resulted in a significant reduction in recall accuracy for that repeated item (i.e., the Ranschburg effect). In Experiment 1b, within-sequence repetition of an adjacent item resulted in significant recall facilitation for that repeated item. These effects mirror those reported for verbal stimuli (e.g., Henson, 1998a. Item repetition in short-term memory: Ranschburg repeated. Journal of Experimental Psychology: Learning, Memory, and Cognition, 24(5), 1162-1181. doi:10.1037/0278-7393.24.5.1162). These data are the first to demonstrate the Ranschburg effect with non-verbal stimuli and suggest further cross-modal similarities in order memory.
An Optimal Seed Based Compression Algorithm for DNA Sequences

PubMed Central

Gopalakrishnan, Gopakumar; Karunakaran, Muralikrishnan

2016-01-01

This paper proposes a seed-based lossless compression algorithm for DNA sequences that uses a substitution method similar to the Lempel-Ziv compression scheme. The proposed method exploits the repetition structures that are inherent in DNA sequences by creating an offline dictionary which contains all such repeats along with the details of mismatches. By ensuring that only promising mismatches are allowed, the method achieves a compression ratio that is on par with or better than existing lossless DNA sequence compression algorithms. PMID:27555868
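The core substitution idea, storing a repeat as a pointer to an earlier occurrence plus a list of mismatches, can be sketched as follows. This is not the authors' algorithm: it omits seed search, dictionary construction, and the rule for deciding which mismatches are "promising". The names `encode_repeat` and `decode_repeat` and the token layout `(start, length, mismatches)` are hypothetical and serve only to show that such a token is lossless.

```python
def encode_repeat(source, start, target):
    """Encode `target` as an approximate copy of source[start:start+len(target)].

    Returns a token (start, length, mismatches), where mismatches is a list of
    (offset, base) substitutions needed to turn the copied window into target.
    """
    window = source[start:start + len(target)]
    if len(window) != len(target):
        raise ValueError("reference window is shorter than the target")
    mismatches = [(i, t) for i, (w, t) in enumerate(zip(window, target)) if w != t]
    return (start, len(target), mismatches)

def decode_repeat(source, token):
    """Rebuild the original segment from the source text and a token."""
    start, length, mismatches = token
    bases = list(source[start:start + length])
    for offset, base in mismatches:
        bases[offset] = base  # apply each recorded substitution
    return "".join(bases)

if __name__ == "__main__":
    source = "ACGTACGTTTGACC"
    target = "ACGAACGT"  # an inexact repeat of source[0:8]
    token = encode_repeat(source, 0, target)
    print(token)
    print(decode_repeat(source, token) == target)
```

A token is only worth emitting when it is cheaper to store than the raw bases, which is presumably why the number of allowed mismatches has to be restricted.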
The Peculiar Landscape of Repetitive Sequences in the Olive (Olea europaea L.) Genome

PubMed Central

Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

2014-01-01

Analyzing genome structure in different species provides insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as an oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with a mean length greater than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome. PMID:24671744
The peculiar landscape of repetitive sequences in the olive (Olea europaea L.) genome.

PubMed

Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

2014-04-01

Analyzing genome structure in different species provides insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as an oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with a mean length greater than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome.
Hemispheric Asymmetries in Repetition Enhancement and Suppression Effects in the Newborn Brain

PubMed Central

Bouchon, Camillia; Nazzi, Thierry; Gervain, Judit

2015-01-01

Background: The repeated presentation of stimuli typically attenuates neural responses (repetition suppression) or, less commonly, increases them (repetition enhancement) when stimuli are highly complex, degraded or presented under noisy conditions. In adult functional neuroimaging research, these repetition effects are considered as neural correlates of habituation. The development and respective functional significance of these effects in infancy remain largely unknown. Objective: This study investigates repetition effects in newborns using functional near-infrared spectroscopy, and specifically the role of stimulus complexity in evoking a repetition enhancement vs. a repetition suppression response, following up on Gervain et al. (2008). In that study, abstract rule-learning was found at birth in cortical areas specific to speech processing, as evidenced by a left-lateralized repetition enhancement of the hemodynamic response to highly variable speech sequences conforming to a repetition-based ABB artificial grammar, but not to a random ABC grammar. Methods: Here, the same paradigm was used to investigate how simpler stimuli (12 different sequences per condition as opposed to 140), and simpler presentation conditions (blocked rather than interleaved) would influence repetition effects at birth. Results: Results revealed that the two grammars elicited different dynamics in the two hemispheres. In left fronto-temporal areas, we reproduce the early perceptual discrimination of the two grammars, with ABB giving rise to a greater response at the beginning of the experiment than ABC. In addition, the ABC grammar evoked a repetition enhancement effect over time, whereas a stable response was found for the ABB grammar. Right fronto-temporal areas showed neither initial discrimination, nor change over time to either pattern. Conclusion: Taken together with Gervain et al. (2008), this is the first evidence that manipulating methodological factors influences the presence or absence of neural repetition enhancement effects in newborns and stimulus variability appears a particularly important factor. Further, this temporal modulation is restricted to the left hemisphere, confirming its specialization for learning linguistic regularities from birth. PMID:26485434
Hemispheric Asymmetries in Repetition Enhancement and Suppression Effects in the Newborn Brain.

PubMed

Bouchon, Camillia; Nazzi, Thierry; Gervain, Judit

2015-01-01

The repeated presentation of stimuli typically attenuates neural responses (repetition suppression) or, less commonly, increases them (repetition enhancement) when stimuli are highly complex, degraded or presented under noisy conditions. In adult functional neuroimaging research, these repetition effects are considered as neural correlates of habituation. The development and respective functional significance of these effects in infancy remain largely unknown. This study investigates repetition effects in newborns using functional near-infrared spectroscopy, and specifically the role of stimulus complexity in evoking a repetition enhancement vs. a repetition suppression response, following up on Gervain et al. (2008). In that study, abstract rule-learning was found at birth in cortical areas specific to speech processing, as evidenced by a left-lateralized repetition enhancement of the hemodynamic response to highly variable speech sequences conforming to a repetition-based ABB artificial grammar, but not to a random ABC grammar. Here, the same paradigm was used to investigate how simpler stimuli (12 different sequences per condition as opposed to 140), and simpler presentation conditions (blocked rather than interleaved) would influence repetition effects at birth. Results revealed that the two grammars elicited different dynamics in the two hemispheres. In left fronto-temporal areas, we reproduce the early perceptual discrimination of the two grammars, with ABB giving rise to a greater response at the beginning of the experiment than ABC. In addition, the ABC grammar evoked a repetition enhancement effect over time, whereas a stable response was found for the ABB grammar. Right fronto-temporal areas showed neither initial discrimination, nor change over time to either pattern. Taken together with Gervain et al. (2008), this is the first evidence that manipulating methodological factors influences the presence or absence of neural repetition enhancement effects in newborns and stimulus variability appears a particularly important factor. Further, this temporal modulation is restricted to the left hemisphere, confirming its specialization for learning linguistic regularities from birth.
In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

PubMed

Macas, Jiří; Novák, Petr; Pellicer, Jaume; Čížková, Jana; Koblížková, Andrea; Neumann, Pavel; Fuková, Iva; Doležel, Jaroslav; Kelly, Laura J; Leitch, Ilia J

2015-01-01

The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
The genome sequence of taurine cattle: a window to ruminant biology and evolution.

PubMed

Elsik, Christine G; Tellam, Ross L; Worley, Kim C; Gibbs, Richard A; Muzny, Donna M; Weinstock, George M; Adelson, David L; Eichler, Evan E; Elnitski, Laura; Guigó, Roderic; Hamernik, Debora L; Kappes, Steve M; Lewin, Harris A; Lynn, David J; Nicholas, Frank W; Reymond, Alexandre; Rijnkels, Monique; Skow, Loren C; Zdobnov, Evgeny M; Schook, Lawrence; Womack, James; Alioto, Tyler; Antonarakis, Stylianos E; Astashyn, Alex; Chapple, Charles E; Chen, Hsiu-Chuan; Chrast, Jacqueline; Câmara, Francisco; Ermolaeva, Olga; Henrichsen, Charlotte N; Hlavina, Wratko; Kapustin, Yuri; Kiryutin, Boris; Kitts, Paul; Kokocinski, Felix; Landrum, Melissa; Maglott, Donna; Pruitt, Kim; Sapojnikov, Victor; Searle, Stephen M; Solovyev, Victor; Souvorov, Alexandre; Ucla, Catherine; Wyss, Carine; Anzola, Juan M; Gerlach, Daniel; Elhaik, Eran; Graur, Dan; Reese, Justin T; Edgar, Robert C; McEwan, John C; Payne, Gemma M; Raison, Joy M; Junier, Thomas; Kriventseva, Evgenia V; Eyras, Eduardo; Plass, Mireya; Donthu, Ravikiran; Larkin, Denis M; Reecy, James; Yang, Mary Q; Chen, Lin; Cheng, Ze; Chitko-McKown, Carol G; Liu, George E; Matukumalli, Lakshmi K; Song, Jiuzhou; Zhu, Bin; Bradley, Daniel G; Brinkman, Fiona S L; Lau, Lilian P L; Whiteside, Matthew D; Walker, Angela; Wheeler, Thomas T; Casey, Theresa; German, J Bruce; Lemay, Danielle G; Maqbool, Nauman J; Molenaar, Adrian J; Seo, Seongwon; Stothard, Paul; Baldwin, Cynthia L; Baxter, Rebecca; Brinkmeyer-Langford, Candice L; Brown, Wendy C; Childers, Christopher P; Connelley, Timothy; Ellis, Shirley A; Fritz, Krista; Glass, Elizabeth J; Herzig, Carolyn T A; Iivanainen, Antti; Lahmers, Kevin K; Bennett, Anna K; Dickens, C Michael; Gilbert, James G R; Hagen, Darren E; Salih, Hanni; Aerts, Jan; Caetano, Alexandre R; Dalrymple, Brian; Garcia, Jose Fernando; Gill, Clare A; Hiendleder, Stefan G; Memili, Erdogan; Spurlock, Diane; Williams, John L; Alexander, Lee; Brownstein, Michael J; Guan, Leluo; Holt, Robert A; Jones, Steven J M; Marra, Marco A; 
Moore, Richard; Moore, Stephen S; Roberts, Andy; Taniguchi, Masaaki; Waterman, Richard C; Chacko, Joseph; Chandrabose, Mimi M; Cree, Andy; Dao, Marvin Diep; Dinh, Huyen H; Gabisi, Ramatu Ayiesha; Hines, Sandra; Hume, Jennifer; Jhangiani, Shalini N; Joshi, Vandita; Kovar, Christie L; Lewis, Lora R; Liu, Yih-Shin; Lopez, John; Morgan, Margaret B; Nguyen, Ngoc Bich; Okwuonu, Geoffrey O; Ruiz, San Juana; Santibanez, Jireh; Wright, Rita A; Buhay, Christian; Ding, Yan; Dugan-Rocha, Shannon; Herdandez, Judith; Holder, Michael; Sabo, Aniko; Egan, Amy; Goodell, Jason; Wilczek-Boney, Katarzyna; Fowler, Gerald R; Hitchens, Matthew Edward; Lozado, Ryan J; Moen, Charles; Steffen, David; Warren, James T; Zhang, Jingkun; Chiu, Readman; Schein, Jacqueline E; Durbin, K James; Havlak, Paul; Jiang, Huaiyang; Liu, Yue; Qin, Xiang; Ren, Yanru; Shen, Yufeng; Song, Henry; Bell, Stephanie Nicole; Davis, Clay; Johnson, Angela Jolivet; Lee, Sandra; Nazareth, Lynne V; Patel, Bella Mayurkumar; Pu, Ling-Ling; Vattathil, Selina; Williams, Rex Lee; Curry, Stacey; Hamilton, Cerissa; Sodergren, Erica; Wheeler, David A; Barris, Wes; Bennett, Gary L; Eggen, André; Green, Ronnie D; Harhay, Gregory P; Hobbs, Matthew; Jann, Oliver; Keele, John W; Kent, Matthew P; Lien, Sigbjørn; McKay, Stephanie D; McWilliam, Sean; Ratnakumar, Abhirami; Schnabel, Robert D; Smith, Timothy; Snelling, Warren M; Sonstegard, Tad S; Stone, Roger T; Sugimoto, Yoshikazu; Takasuga, Akiko; Taylor, Jeremy F; Van Tassell, Curtis P; Macneil, Michael D; Abatepaulo, Antonio R R; Abbey, Colette A; Ahola, Virpi; Almeida, Iassudara G; Amadio, Ariel F; Anatriello, Elen; Bahadue, Suria M; Biase, Fernando H; Boldt, Clayton R; Carroll, Jeffery A; Carvalho, Wanessa A; Cervelatti, Eliane P; Chacko, Elsa; Chapin, Jennifer E; Cheng, Ye; Choi, Jungwoo; Colley, Adam J; de Campos, Tatiana A; De Donato, Marcos; Santos, Isabel K F de Miranda; de Oliveira, Carlo J F; Deobald, Heather; Devinoy, Eve; Donohue, Kaitlin E; Dovc, Peter; Eberlein, Annett; 
Fitzsimmons, Carolyn J; Franzin, Alessandra M; Garcia, Gustavo R; Genini, Sem; Gladney, Cody J; Grant, Jason R; Greaser, Marion L; Green, Jonathan A; Hadsell, Darryl L; Hakimov, Hatam A; Halgren, Rob; Harrow, Jennifer L; Hart, Elizabeth A; Hastings, Nicola; Hernandez, Marta; Hu, Zhi-Liang; Ingham, Aaron; Iso-Touru, Terhi; Jamis, Catherine; Jensen, Kirsty; Kapetis, Dimos; Kerr, Tovah; Khalil, Sari S; Khatib, Hasan; Kolbehdari, Davood; Kumar, Charu G; Kumar, Dinesh; Leach, Richard; Lee, Justin C-M; Li, Changxi; Logan, Krystin M; Malinverni, Roberto; Marques, Elisa; Martin, William F; Martins, Natalia F; Maruyama, Sandra R; Mazza, Raffaele; McLean, Kim L; Medrano, Juan F; Moreno, Barbara T; Moré, Daniela D; Muntean, Carl T; Nandakumar, Hari P; Nogueira, Marcelo F G; Olsaker, Ingrid; Pant, Sameer D; Panzitta, Francesca; Pastor, Rosemeire C P; Poli, Mario A; Poslusny, Nathan; Rachagani, Satyanarayana; Ranganathan, Shoba; Razpet, Andrej; Riggs, Penny K; Rincon, Gonzalo; Rodriguez-Osorio, Nelida; Rodriguez-Zas, Sandra L; Romero, Natasha E; Rosenwald, Anne; Sando, Lillian; Schmutz, Sheila M; Shen, Libing; Sherman, Laura; Southey, Bruce R; Lutzow, Ylva Strandberg; Sweedler, Jonathan V; Tammen, Imke; Telugu, Bhanu Prakash V L; Urbanski, Jennifer M; Utsunomiya, Yuri T; Verschoor, Chris P; Waardenberg, Ashley J; Wang, Zhiquan; Ward, Robert; Weikard, Rosemarie; Welsh, Thomas H; White, Stephen N; Wilming, Laurens G; Wunderlich, Kris R; Yang, Jianqi; Zhao, Feng-Qi

2009-04-24

To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.
Repetition as the essence of life on this earth: music and genes.

PubMed

Ohno, S

1987-01-01

In prebiotic nucleic acid replication, templates appear to have been in short supply. A single round of tandem duplication of existing oligomers assured progressive extension of templates to the length adequate for encoding of polypeptide chains. Thus, the first set of coding sequences had to be repeats of base oligomers encoding polypeptide chains of various periodicities. On one hand, the readiness of these periodical polypeptide chains to assume alpha-helical and/or beta-sheet secondary structures contributed to the extremely rapid initial functional diversification of these polypeptide chains. It would be recalled that most, if not all, of the sugar-metabolizing enzymes had already achieved the inviolable functional competence before the division of prokaryotes from eukaryotes. On the other hand, a certain (dipeptidic?) of the peptidic periodicities was apparently chosen as the timekeeping unit by the biological clock. Musical compositions too apparently evolved originally as a timekeeping device. Accordingly, repetitiousness is evident in all musical compositions. Evolution of musical compositions from the early Baroque to the late Romantic parallels that of coding sequences from rather exact repeats of base oligomers to more complex modern coding sequences in which repetitious elements are less conspicuous and more varied. Inasmuch as the earth is governed by the hierarchy of periodicities (days, months and years), such reliance on periodicities is rather expected.

Passively mode-locked Raman fiber laser with 100 GHz repetition rate

NASA Astrophysics Data System (ADS)

Schröder, Jochen; Coen, Stéphane; Vanholsbeeck, Frédérique; Sylvestre, Thibaut

2006-12-01

We experimentally demonstrate the operation of a passively mode-locked Raman fiber ring laser with an ultrahigh repetition rate of 100 GHz and up to 430 mW of average output power. This laser constitutes a simple wavelength-versatile pulsed optical source. Stable mode locking is based on dissipative four-wave mixing with a single fiber Bragg grating acting as the mode-locking element.
PCR Methods for Rapid Identification and Characterization of Actinobacillus seminis Strains

PubMed Central

Appuhamy, S.; Coote, J. G.; Low, J. C.; Parton, R.

1998-01-01

Twenty-four isolates of Actinobacillus seminis were typed by PCR ribotyping, repetitive extragenic palindromic element (REP)-based PCR, and enterobacterial repetitive intergenic consensus (ERIC)-based PCR. Five types were distinguished by REP-PCR, and nine types were distinguished by ERIC-PCR. PCR ribotyping produced the simplest pattern and could be useful for identification of A. seminis and for its differentiation from related species. REP- and ERIC-PCR could be used for strain differentiation in epidemiological studies of A. seminis. PMID:9508320
Characterization of 25 full-length S-RNase alleles, including flanking regions, from a pool of resequenced apple cultivars.

PubMed

De Franceschi, Paolo; Bianco, Luca; Cestaro, Alessandro; Dondini, Luca; Velasco, Riccardo

2018-06-01

Data obtained from Illumina resequencing of 63 apple cultivars were used to obtain full-length S-RNase sequences using a strategy based on both alignment and de novo assembly of reads. The reproductive biology of apple is regulated by the S-RNase-based gametophytic self-incompatibility system, which is genetically controlled by the single, multi-genic and multi-allelic S locus. Resequencing of apple cultivars provided a huge amount of genetic data, which can be aligned to the reference genome in order to characterize variation at a genome-wide level. However, this approach is not immediately adaptable to the S-locus, due to some peculiar features such as the high degree of polymorphism, lack of colinearity between haplotypes and extensive presence of repetitive elements. In this study we describe a dedicated procedure aimed at characterizing S-RNase alleles from resequenced cultivars. The S-genotype of 63 apple accessions is reported; the full-length coding sequence was determined for the 25 S-RNase alleles present in the 63 resequenced cultivars; these included 10 previously incomplete sequences (S5, S6a, S6b, S8, S11, S23, S39, S46, S50 and S58). Moreover, sequence divergence clearly suggests that alleles S6a and S6b, proposed to be neutral variants of the same allele, should instead be considered different specificities. The promoter sequences have also been analyzed, highlighting regions of homology conserved among all the alleles.
Fingerprinting of Cyanobacteria Based on PCR with Primers Derived from Short and Long Tandemly Repeated Repetitive Sequences

PubMed Central

Rasmussen, Ulla; Svenning, Mette M.

1998-01-01

The presence of repeated DNA (short tandemly repeated repetitive [STRR] and long tandemly repeated repetitive [LTRR]) sequences in the genome of cyanobacteria was used to generate a fingerprint method for symbiotic and free-living isolates. Primers corresponding to the STRR and LTRR sequences were used in the PCR, resulting in a method which generates specific fingerprints for individual isolates. The method was useful both with purified DNA and with intact cyanobacterial filaments or cells as templates for the PCR. Twenty-three Nostoc isolates from a total of 35 were symbiotic isolates from the angiosperm Gunnera species, including isolates from the same Gunnera species as well as from different species. The results show a genetic similarity among isolates from different Gunnera species as well as a genetic heterogeneity among isolates from the same Gunnera species. Isolates which have been postulated to be closely related or identical revealed similar results by the PCR method, indicating that the technique is useful for clustering of even closely related strains. The method was applied to nonheterocystous cyanobacteria from which a fingerprint pattern was obtained. PMID:16349487
Modeling repetitive motions using structured light.

PubMed

Xu, Yi; Aliaga, Daniel G

2010-01-01

Obtaining models of dynamic 3D objects is an important part of content generation for computer graphics. Numerous methods have been extended from static scenarios to model dynamic scenes. If the states or poses of the dynamic object repeat often during a sequence (but not necessarily periodically), we call such a motion repetitive. There are many objects, such as toys, machines, and humans, undergoing repetitive motions. Our key observation is that when a motion state repeats, we can sample the scene under the same motion state again but using a different set of parameters, thus providing more information about each motion state. This enables robust acquisition of dense 3D information, which is otherwise difficult, for objects with repetitive motions using only simple hardware. After the motion sequence, we group temporally disjoint observations of the same motion state together and produce a smooth space-time reconstruction of the scene. Effectively, the dynamic scene modeling problem is converted to a series of static scene reconstructions, which are easier to tackle. The varying sampling parameters can be, for example, structured-light patterns, illumination directions, and viewpoints, resulting in different modeling techniques. Based on this observation, we present an image-based motion-state framework and demonstrate our paradigm using either a synchronized or an unsynchronized structured-light acquisition method.
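The grouping step described in this abstract, collecting temporally disjoint observations of the same motion state, can be illustrated with a small sketch. This is not the authors' framework: the scalar `state` descriptor, the tolerance `tol`, and the function name `group_by_motion_state` are all hypothetical stand-ins chosen for illustration, and the greedy matching is only one simple way to form the groups.

```python
def group_by_motion_state(observations, tol):
    """Greedily group observations whose (hypothetical) scalar state
    descriptors differ by at most `tol` from a group's first member."""
    groups = []  # each group: {"state": representative, "frames": [(index, pattern), ...]}
    for frame_idx, state, pattern in observations:
        for group in groups:
            if abs(group["state"] - state) <= tol:
                group["frames"].append((frame_idx, pattern))
                break
        else:
            # No existing group is close enough: start a new motion state.
            groups.append({"state": state, "frames": [(frame_idx, pattern)]})
    return groups


# Two motion states, each revisited later under a different light pattern.
observations = [
    (0, 0.00, "pattern0"),
    (1, 1.00, "pattern0"),
    (2, 0.05, "pattern1"),
    (3, 0.98, "pattern1"),
]
groups = group_by_motion_state(observations, tol=0.1)
for group in groups:
    print(group["state"], group["frames"])
```

Each resulting group holds one motion state sampled under several patterns, which is what lets the dynamic problem be treated as a set of static reconstructions.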
RNA polymerase V-dependent small RNAs in Arabidopsis originate from small, intergenic loci including most SINE repeats.

PubMed

Lee, Tzuu-fen; Gurazada, Sai Guna Ranjan; Zhai, Jixian; Li, Shengben; Simon, Stacey A; Matzke, Marjori A; Chen, Xuemei; Meyers, Blake C

2012-07-01

In plants, heterochromatin is maintained by a small RNA-based gene silencing mechanism known as RNA-directed DNA methylation (RdDM). RdDM requires the non-redundant functions of two plant-specific DNA-dependent RNA polymerases (RNAP), RNAP IV and RNAP V. RNAP IV plays a major role in siRNA biogenesis, while RNAP V may recruit DNA methylation machinery to target endogenous loci for silencing. Although small RNA-generating regions that are dependent on both RNAP IV and RNAP V have been identified previously, the genomic loci targeted by RNAP V for siRNA accumulation and silencing have not been described extensively. To characterize the RNAP V-dependent, heterochromatic siRNA-generating regions in the Arabidopsis genome, we deeply sequenced the small RNA populations of wild-type and RNAP V null mutant (nrpe1) plants. Our results showed that RNAP V-dependent siRNA-generating loci are associated predominantly with short repetitive sequences in intergenic regions. Suppression of small RNA production from short repetitive sequences was also prominent in RdDM mutants including dms4, drd1, dms3 and rdm1, reflecting the known association of these RdDM effectors with RNAP V. The genomic regions targeted by RNAP V were small, with an estimated average length of 238 bp. Our results suggest that RNAP V affects siRNA production from genomic loci with features dissimilar to known RNAP IV-dependent loci. RNAP V, along with RNAP IV and DRM1/2, may target and silence a set of small, intergenic transposable elements located in dispersed genomic regions. Silencing at these loci may be actively reinforced by RdDM.
Exploration of the Drosophila buzzatii transposable element content suggests underestimation of repeats in Drosophila genomes.

PubMed

Rius, Nuria; Guillén, Yolanda; Delprat, Alejandra; Kapusta, Aurélie; Feschotte, Cédric; Ruiz, Alfredo

2016-05-10

Many new Drosophila genomes have been sequenced in recent years using new-generation sequencing platforms and assembly methods. Transposable elements (TEs), being repetitive sequences, are often misassembled, especially in the genomes sequenced with short reads. Consequently, the mobile fraction of many of the new genomes has not been analyzed in detail or compared with that of other genomes sequenced with different methods, which could shed light on genome and TE evolution. Here we compare the TE content of three genomes: D. buzzatii st-1, j-19, and D. mojavensis. We have sequenced a new D. buzzatii genome (j-19) that complements the D. buzzatii reference genome (st-1) already published, and compared their TE contents with that of D. mojavensis. We found an underestimation of TE sequences in Drosophila genus NGS-genomes when compared to Sanger-genomes. To be able to compare genomes sequenced with different technologies, we developed a coverage-based method and applied it to the D. buzzatii st-1 and j-19 genomes. Between 10.85 and 11.16 % of the D. buzzatii st-1 genome is made up of TEs, between 7 and 7.5 % of the D. buzzatii j-19 genome, while TEs represent 15.35 % of the D. mojavensis genome. Helitrons are the most abundant order in the three genomes. TEs in D. buzzatii are less abundant than in D. mojavensis, as expected according to the genome size and TE content positive correlation. However, TEs alone do not explain the genome size difference. TEs accumulate in the dot chromosomes and proximal regions of D. buzzatii and D. mojavensis chromosomes. We also report a significantly higher TE density in D. buzzatii and D. mojavensis X chromosomes, which is not expected under the current models. Our easy-to-use correction method allowed us to identify recently active families in D. buzzatii st-1 belonging to the LTR-retrotransposon superfamily Gypsy.
Interchromosomal Duplications on the Bactrocera oleae Y Chromosome Imply a Distinct Evolutionary Origin of the Sex Chromosomes Compared to Drosophila

PubMed Central

Gabrieli, Paolo; Gomulski, Ludvik M.; Bonomi, Angelica; Siciliano, Paolo; Scolari, Francesca; Franz, Gerald; Jessup, Andrew; Malacrida, Anna R.; Gasperi, Giuliano

2011-01-01

Background: Diptera have an extraordinary variety of sex determination mechanisms, and Drosophila melanogaster is the paradigm for this group. However, the Drosophila sex determination pathway is only partially conserved and the family Tephritidae affords an interesting example. The tephritid Y chromosome is postulated to be necessary to determine male development. Characterization of Y sequences, apart from elucidating the nature of the male determining factor, is also important to understand the evolutionary history of sex chromosomes within the Tephritidae. We studied the Y sequences from the olive fly, Bactrocera oleae. Its Y chromosome is minute and highly heterochromatic, and displays high heteromorphism with the X chromosome. Methodology/Principal Findings: A combined Representational Difference Analysis (RDA) and fluorescence in-situ hybridization (FISH) approach was used to investigate the Y chromosome to derive information on its sequence content. The Y chromosome is strewn with repetitive DNA sequences, the majority of which are also interspersed in the pericentromeric regions of the autosomes. The Y chromosome appears to have accumulated small and large repetitive interchromosomal duplications. The large interchromosomal duplications harbour an importin-4-like gene fragment. Apart from these importin-4-like sequences, the other Y repetitive sequences are not shared with the X chromosome, suggesting molecular differentiation of these two chromosomes. Moreover, as the identified Y sequences were not detected on the Y chromosomes of closely related tephritids, we can infer divergence in the repetitive nature of their sequence contents. Conclusions/Significance: The identification of Y-linked sequences may tell us much about the repetitive nature, the origin and the evolution of Y chromosomes. We hypothesize how these repetitive sequences accumulated and were maintained on the Y chromosome during its evolutionary history. Our data reinforce the idea that the sex chromosomes of the Tephritidae may have distinct evolutionary origins with respect to those of the Drosophilidae and other Dipteran families. PMID:21408187
Computation of repetitions and regularities of biologically weighted sequences.

PubMed

Christodoulakis, M; Iliopoulos, C; Mouchard, L; Perdikuri, K; Tsakalidis, A; Tsichlas, K

2006-01-01

Biological weighted sequences are used extensively in molecular biology as profiles for protein families, in the representation of binding sites and often for the representation of sequences produced by a shotgun sequencing strategy. In this paper, we address three fundamental problems in the area of biologically weighted sequences: (i) computation of repetitions, (ii) pattern matching, and (iii) computation of regularities. Our algorithms can be used as basic building blocks for more sophisticated algorithms applied on weighted sequences.
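To make the notion of a repetition in a weighted sequence concrete, here is a minimal brute-force sketch. It is not the algorithm from the paper: it assumes a weighted sequence is a list of per-position symbol probabilities, that positions are independent, and that a tandem repeat (a unit immediately followed by itself) counts when its probability reaches a chosen threshold. The names `occurrence_probability`, `tandem_repeats` and `min_prob` are illustrative.

```python
from itertools import product


def occurrence_probability(wseq, start, word):
    """Probability that `word` occurs at `start`, assuming independent positions."""
    if start + len(word) > len(wseq):
        return 0.0
    prob = 1.0
    for offset, symbol in enumerate(word):
        prob *= wseq[start + offset].get(symbol, 0.0)
    return prob


def tandem_repeats(wseq, unit_len, min_prob):
    """Brute-force search for squares (unit + unit) with probability >= min_prob."""
    alphabet = sorted({symbol for position in wseq for symbol in position})
    hits = []
    for start in range(len(wseq) - 2 * unit_len + 1):
        for unit in product(alphabet, repeat=unit_len):
            unit_str = "".join(unit)
            prob = occurrence_probability(wseq, start, unit_str * 2)
            if prob >= min_prob:
                hits.append((start, unit_str, prob))
    return hits


# Position 1 is ambiguous (C or G with equal weight); all others are certain.
wseq = [{"A": 1.0}, {"C": 0.5, "G": 0.5}, {"A": 1.0}, {"C": 1.0}]
repeats = tandem_repeats(wseq, unit_len=2, min_prob=0.25)
print(repeats)  # [(0, 'AC', 0.5)]
```

The exhaustive enumeration is exponential in the unit length, so it only serves to show the definition; efficient algorithms such as those the abstract refers to avoid enumerating every candidate unit.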
Genome analysis of the foxtail millet pathogen Sclerospora graminicola reveals the complex effector repertoire of graminicolous downy mildews.

PubMed

Kobayashi, Michie; Hiraka, Yukie; Abe, Akira; Yaegashi, Hiroki; Natsume, Satoshi; Kikuchi, Hideko; Takagi, Hiroki; Saitoh, Hiromasa; Win, Joe; Kamoun, Sophien; Terauchi, Ryohei

2017-11-22

Downy mildew, caused by the oomycete pathogen Sclerospora graminicola, is an economically important disease of Gramineae crops including foxtail millet (Setaria italica). Plants infected with S. graminicola are generally stunted and often undergo a transformation of flower organs into leaves (phyllody or witches' broom), resulting in serious yield loss. To establish the molecular basis of downy mildew disease in foxtail millet, we carried out whole-genome sequencing and an RNA-seq analysis of S. graminicola. Sequence reads were generated from S. graminicola using an Illumina sequencing platform and assembled de novo into a draft genome sequence comprising approximately 360 Mbp. Of this sequence, 73% comprised repetitive elements, and a total of 16,736 genes were predicted from the RNA-seq data. The predicted genes included those encoding effector-like proteins with high sequence similarity to those previously identified in other oomycete pathogens. Genes encoding jacalin-like lectin-domain-containing secreted proteins were enriched in S. graminicola compared to other oomycetes. Of a total of 1220 genes encoding putative secreted proteins, 91 significantly changed their expression levels during the infection of plant tissues compared to the sporangia and zoospore stages of the S. graminicola lifecycle. We established the draft genome sequence of a downy mildew pathogen that infects Gramineae plants. Based on this sequence and our transcriptome analysis, we generated a catalog of in planta-induced candidate effector genes, providing a solid foundation from which to identify the effectors causing phyllody.
Intrastrain heterogeneity of the mgpB gene in Mycoplasma genitalium is extensive in vitro and in vivo and suggests that variation is generated via recombination with repetitive chromosomal sequences.

PubMed

Iverson-Cabral, Stefanie L; Astete, Sabina G; Cohen, Craig R; Rocha, Eduardo P C; Totten, Patricia A

2006-07-01

Mycoplasma genitalium is associated with reproductive tract disease in women and may persist in the lower genital tract for months, potentially increasing the risk of upper tract infection and transmission to uninfected partners. Despite its exceptionally small genome (580 kb), approximately 4% is composed of repeated elements known as MgPar sequences (MgPa repeats) based on their homology to the mgpB gene that encodes the immunodominant MgPa adhesin protein. The presence of these MgPar sequences, as well as mgpB variability between M. genitalium strains, suggests that mgpB and MgPar sequences recombine to produce variant MgPa proteins. To examine the extent and generation of diversity within single strains of the organism, we examined mgpB variation within M. genitalium strain G-37 and observed sequence heterogeneity that could be explained by recombination between the mgpB expression site and putative donor MgPar sequences. Similarly, we analyzed mgpB sequences from cervical specimens from a persistently infected woman (21 months) and identified 17 different mgpB variants within a single infecting M. genitalium strain, confirming that mgpB heterogeneity occurs over the course of a natural infection. These observations support the hypothesis that recombination occurs between the mgpB gene and MgPar sequences and that the resulting antigenically distinct MgPa variants may contribute to immune evasion and persistence of infection.
The structure and evolution of angiosperm nuclear genomes.

PubMed

Bennetzen, J L

1998-04-01

Despite several decades of investigation, the organization of angiosperm genomes remained largely unknown until very recently. Data describing the sequence composition of large segments of genomes, covering hundreds of kilobases of contiguous sequence, have only become available in the past two years. Recent results indicate commonalities in the characteristics of many plant genomes, including in the structure of chromosomal components like telomeres and centromeres, and in the order and content of genes. Major differences between angiosperms have been associated mainly with repetitive DNAs, both gene families and mobile elements. Intriguing new studies have begun to characterize the dynamic three-dimensional structures of chromosomes and chromatin, and the relationship between genome structure and co-ordinated gene function.
Localization and characterization of X chromosome inversion breakpoints separating Drosophila mojavensis and Drosophila arizonae.

PubMed

Cirulli, Elizabeth T; Noor, Mohamed A F

2007-01-01

Ectopic exchange between transposable elements or other repetitive sequences along a chromosome can produce chromosomal inversions. As a result, genome sequence studies typically find sequence similarity between corresponding inversion breakpoint regions. Here, we identify and investigate the breakpoint regions of the X chromosome inversion distinguishing Drosophila mojavensis and Drosophila arizonae. We localize one inversion breakpoint to 13.7 kb and localize the other to a 1-Mb interval. Using this localization and assuming microsynteny between Drosophila melanogaster and D. arizonae, we pinpoint likely positions of the inversion breakpoints to windows of less than 3000 bp. These breakpoints define the size of the inversion to approximately 11 Mb. However, in contrast to many other studies, we fail to find significant sequence similarity between the 2 breakpoint regions. The localization of these inversion breakpoints will facilitate future genetic and molecular evolutionary studies in this species group, an emerging model system for ecological genetics.
Origin of B chromosomes in Characidium alipioi (Characiformes, Crenuchidae) and its relationship with supernumerary chromosomes in other Characidium species.

PubMed

Serrano, Érica Alves; Utsunomia, Ricardo; Scudeller, Patrícia Sobrinho; Oliveira, Claudio; Foresti, Fausto

2017-01-01

B chromosomes are apparently dispensable components found in the genomes of many species that are mainly composed of repetitive DNA sequences. Among the numerous questions concerning B chromosomes, the origin of these elements has been widely studied. To date, supernumerary chromosomes have been identified in approximately 60 species of fish, including species of the genus Characidium Reinhardt, 1867 in which these elements appear to have independently originated. In this study, we used molecular cytogenetic techniques to investigate the origin of B chromosomes in a population of Characidium alipioi Travassos, 1955 and determine their relationship with the extra chromosomes of other species of the genus. The results showed that the B chromosomes of Characidium alipioi had an intraspecific origin and apparently originated independently of the B chromosomes of Characidium gomesi Travassos, 1956, Characidium pterostictum Gomes, 1947 and Characidium oiticicai Travassos, 1967, since they do not share specific DNA sequences or possible ancestral chromosomes and belong to different phylogenetic clades. The shared sequences between the supernumerary chromosomes and the autosomal sm pair indicate the origin of these chromosomes.
Rhipicephalus (Boophilus) microplus strain Deutsch, whole genome shotgun sequencing project first submission of genome sequence

USDA-ARS's Scientific Manuscript database

The size and repetitive nature of the Rhipicephalus microplus genome makes obtaining a full genome sequence difficult. Cot filtration/selection techniques were used to reduce the repetitive fraction of the tick genome and enrich for the fraction of DNA with gene-containing regions. The Cot-selected ...
Developmental Abilities to Form Chunks in Immediate Memory and Its Non-Relationship to Span Development

PubMed Central

Mathy, Fabien; Fartoukh, Michael; Gauvrit, Nicolas; Guida, Alessandro

2016-01-01

Both adults and children –by the time they are 2–3 years old– have a general ability to recode information to increase memory efficiency. This paper aims to evaluate the ability of untrained children aged 6–10 years old to deploy such a recoding process in immediate memory. A large sample of 374 children were given a task of immediate serial report based on SIMON®, a classic memory game made of four colored buttons (red, green, yellow, blue) requiring players to reproduce a sequence of colors within which repetitions eventually occur. It was hypothesized that a primitive ability across all ages (since theoretically already available in toddlers) to detect redundancies allows the span to increase whenever information can be recoded on the fly. The chunkable condition prompted the formation of chunks based on the perceived structure of color repetition within to-be-recalled sequences of colors. Our results show a similar linear improvement of memory span with age for both chunkable and non-chunkable conditions. The amount of information retained in immediate memory systematically increased for the groupable sequences across all age groups, independently of the average age-group span that was measured on sequences that contained fewer repetitions. This result shows that chunking gives young children the same benefit as older children. We discuss the role of recoding in the expansion of capacity in immediate memory and the potential role of data compression in the formation of chunks in long-term memory. PMID:26941675
Organisation of the plant genome in chromosomes.

PubMed

Heslop-Harrison, J S Pat; Schwarzacher, Trude

2011-04-01

The plant genome is organized into chromosomes that provide the structure for the genetic linkage groups and allow faithful replication, transcription and transmission of the hereditary information. Genome sizes in plants are remarkably diverse, with a 2350-fold range from 63 to 149,000 Mb, divided into n=2 to n= approximately 600 chromosomes. Despite this huge range, structural features of chromosomes like centromeres, telomeres and chromatin packaging are well-conserved. The smallest genomes consist of mostly coding and regulatory DNA sequences present in low copy, along with highly repeated rDNA (rRNA genes and intergenic spacers), centromeric and telomeric repetitive DNA and some transposable elements. The larger genomes have similar numbers of genes, with abundant tandemly repeated sequence motifs, and transposable elements alone represent more than half the DNA present. Chromosomes evolve by fission, fusion, duplication and insertion events, allowing evolution of chromosome size and chromosome number. A combination of sequence analysis, genetic mapping and molecular cytogenetic methods with comparative analysis, all only becoming widely available in the 21st century, is elucidating the exact nature of the chromosome evolution events at all timescales, from the base of the plant kingdom, to intraspecific or hybridization events associated with recent plant breeding. As well as being of fundamental interest, understanding and exploiting evolutionary mechanisms in plant genomes is likely to be a key to crop development for food production. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.
A PCR technique based on the Hip1 interspersed repetitive sequence distinguishes cyanobacterial species and strains.

PubMed

Smith, J K; Parry, J D; Day, J G; Smith, R J

1998-10-01

The use of primers based on the Hip1 sequence as a typing technique for cyanobacteria has been investigated. The discovery of short repetitive sequence structures in bacterial DNA during the last decade has led to the development of PCR-based methods for typing, i.e., distinguishing and identifying, bacterial species and strains. An octameric palindromic sequence known as Hip1 has been shown to be present in the chromosomal DNA of many species of cyanobacteria as a highly repetitious interspersed sequence. PCR primers were constructed that extended the Hip1 sequence at the 3' end by two bases. Five of the 16 possible extended primers were tested. Each of the five primers produced a different set of products when used to prime PCR from cyanobacterial genomic DNA. Each primer produced a distinct set of products for each of the 15 cyanobacterial species tested. The ability of Hip1-based PCR to resolve taxonomic differences was assessed by analysis of independent isolates of Anabaena flos-aquae and Nostoc ellipsosporum obtained from the CCAP (Culture Collection of Algae and Protozoa, IFE, Cumbria, UK). A PCR-based RFLP analysis of products amplified from the 23S-16S rDNA intergenic region was used to characterize the isolates and to compare with the Hip1 typing data. The RFLP and Hip1 typing yielded similar results and both techniques were able to distinguish different strains. On the basis of these results it is suggested that the Hip1 PCR technique may assist in distinguishing cyanobacterial species and strains.
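The primer design described here, extending the octamer by two bases at the 3' end, gives 4 × 4 = 16 possible primers. The sketch below enumerates them. The abstract does not spell out the Hip1 sequence; the value `GCGATCGC` used here is the commonly cited octamer and should be treated as an assumption, as should the helper names.

```python
from itertools import product

HIP1 = "GCGATCGC"  # assumed octamer; the sequence is not stated in the abstract
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def extended_primers(core=HIP1, extra=2):
    """Append every combination of `extra` bases to the 3' end of `core`."""
    return [core + "".join(tail) for tail in product("ACGT", repeat=extra)]


primers = extended_primers()
print(reverse_complement(HIP1) == HIP1)  # True: the assumed core is palindromic
print(len(primers))                      # 16
print(primers[:3])                       # ['GCGATCGCAA', 'GCGATCGCAC', 'GCGATCGCAG']
```

Because the core is its own reverse complement, only the two added bases make each primer strand-specific, which is consistent with different extensions yielding different product sets.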
APE1 incision activity at abasic sites in tandem repeat sequences.

PubMed

Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M

2014-05-29

Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.

The devil is in the details: Transposable element analysis of the Tasmanian devil genome

PubMed Central

Nilsson, Maria A.

2016-01-01

The third marsupial genome was sequenced from the Tasmanian devil (Sarcophilus harrisii), a species that is currently being driven to extinction by a rare transmissible cancer. The transposable element (TE) landscape of the Tasmanian devil genome revealed that the main driver of retrotransposition, the Long INterspersed Element 1 (LINE1), seems to have become inactivated during the past 12 million years. Strangely, the Short INterspersed Elements (SINEs), which normally hijack the LINE1 retrotransposition system, became inactive prior to LINE1, at around 30 million years ago. The SINE inactivation was verified in vitro in several species. Here I discuss that the apparent LINE1 inactivation might be caused by a genome assembly artifact. The repetitive fraction of any genome is highly complex to assemble and the observed problems are not unique to the Tasmanian devil genome. PMID:27066301
Clustered regularly interspaced short palindromic repeats (CRISPRs) analysis of members of the Mycobacterium tuberculosis complex.

PubMed

Botelho, Ana; Canto, Ana; Leão, Célia; Cunha, Mónica V

2015-01-01

Typical CRISPR (clustered, regularly interspaced, short palindromic repeat) regions consist of short direct repeats (DRs) interspersed with similarly sized non-repetitive spacers, derived from transmissible genetic elements and acquired when the cell is challenged with foreign DNA. The analysis of the structure, in number and nature, of CRISPR spacers is a valuable tool for molecular typing since these loci are polymorphic among strains, giving rise to characteristic signatures. The existence of CRISPR structures in the genome of the members of the Mycobacterium tuberculosis complex (MTBC) enabled the development of a genotyping method based on the analysis of the presence or absence of 43 oligonucleotide spacers separated by conserved DRs. This method, called spoligotyping, consists of PCR amplification of the DR chromosomal region and recognition, after hybridization, of the spacers that are present. In this workflow, the PCR products are applied to a membrane containing synthetic oligonucleotides that are complementary to the spacer sequences. Lack of hybridization of the PCR products to a specific oligonucleotide indicates absence of the corresponding spacer sequence in the examined strain. Spoligotyping has gained wide recognition as a robust identification and typing tool for members of the MTBC, enabling multiple epidemiological studies on human and animal tuberculosis.
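The presence/absence readout described above can be sketched as a small encoding routine. This is a minimal illustration, not the authors' software: the function name and the example strain are hypothetical, and the 15-digit octal form (14 triplets plus one trailing bit) is assumed here as the commonly used shorthand for the 43-spacer pattern, which the abstract itself does not mention.

```python
def spoligotype_codes(present_spacers, n_spacers=43):
    """Return (binary, octal) codes for a set of 1-based spacer numbers.

    Hypothetical helper: '1' marks a spacer that hybridized, '0' one that did not.
    """
    binary = "".join(
        "1" if i in present_spacers else "0" for i in range(1, n_spacers + 1)
    )
    # 43 bits = 14 triplets (each read as one octal digit) + 1 trailing bit.
    triplets = [binary[i:i + 3] for i in range(0, n_spacers - 1, 3)]
    octal = "".join(str(int(t, 2)) for t in triplets) + binary[-1]
    return binary, octal


# Illustrative (made-up) strain lacking spacers 3, 9, 16 and 39-43.
absent = {3, 9, 16, 39, 40, 41, 42, 43}
present = set(range(1, 44)) - absent
binary_code, octal_code = spoligotype_codes(present)
print(binary_code)
print(octal_code)  # 676773777777600
```

Two strains with the same set of hybridizing spacers receive the same code, which is what makes the pattern usable as a typing signature.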
A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana.

PubMed

Nowell, Reuben W; Elsworth, Ben; Oostra, Vicencio; Zwaan, Bas J; Wheat, Christopher W; Saastamoinen, Marjo; Saccheri, Ilik J; Van't Hof, Arjen E; Wasik, Bethany R; Connahs, Heidi; Aslam, Muhammad L; Kumar, Sujai; Challis, Richard J; Monteiro, Antónia; Brakefield, Paul M; Blaxter, Mark

2017-07-01

The mycalesine butterfly Bicyclus anynana, the "Squinting bush brown," is a model organism in the study of lepidopteran ecology, development, and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species. Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology; 128 Gb of raw Illumina data was filtered to 124 Gb and assembled to a final size of 475 Mb (∼×260 assembly coverage). Contigs were scaffolded using mate-pair, transcriptome, and PacBio data into 10 800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements and encodes a total of 22 642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes. We report a high-quality draft genome sequence for Bicyclus anynana. The genome assembly and annotated gene models are available at LepBase (http://ensembl.lepbase.org/index.html). © The Authors 2017. Published by Oxford University Press.
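The assembly metrics quoted above (N50 and fold coverage) can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: the function names and the toy scaffold lengths are assumptions, and only the 124 Gb and 475 Mb figures come from the abstract.

```python
def n50(lengths):
    """Length L such that sequences of length >= L hold at least half of all bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no sequences given")


def fold_coverage(total_bases, assembly_size):
    """Average number of times each assembled base is covered by the data."""
    return total_bases / assembly_size


# Hypothetical scaffold lengths (arbitrary units), not data from the paper.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # 70: the two longest scaffolds hold 150 of 300 units

# Figures from the abstract: 124 Gb of filtered reads, 475 Mb assembly.
print(round(fold_coverage(124e9, 475e6)))  # 261, consistent with the reported ~260x
```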
A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana

PubMed Central

Elsworth, Ben; Oostra, Vicencio; Zwaan, Bas J.; Wheat, Christopher W.; Saastamoinen, Marjo; Saccheri, Ilik J.; van’t Hof, Arjen E.; Wasik, Bethany R.; Connahs, Heidi; Aslam, Muhammad L.; Kumar, Sujai; Challis, Richard J.; Monteiro, Antónia; Brakefield, Paul M.

2017-01-01

The mycalesine butterfly Bicyclus anynana, the “Squinting bush brown,” is a model organism in the study of lepidopteran ecology, development, and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species. Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology; 128 Gb of raw Illumina data was filtered to 124 Gb and assembled to a final size of 475 Mb (∼×260 assembly coverage). Contigs were scaffolded using mate-pair, transcriptome, and PacBio data into 10 800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements and encodes a total of 22 642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes. We report a high-quality draft genome sequence for Bicyclus anynana. The genome assembly and annotated gene models are available at LepBase (http://ensembl.lepbase.org/index.html). PMID:28486658
The repeat organizer, a specialized insulator element within the intergenic spacer of the Xenopus rRNA genes.

PubMed Central

Robinett, C C; O'Connor, A; Dunaway, M

1997-01-01

We have identified a novel activity for the region of the intergenic spacer of the Xenopus laevis rRNA genes that contains the 35- and 100-bp repeats. We devised a new assay for this region by constructing DNA plasmids containing a tandem repeat of rRNA reporter genes that were separated by the 35- and 100-bp repeat region and a rRNA gene enhancer. When the 35- and 100-bp repeat region is present in its normal position and orientation at the 3' end of the rRNA reporter genes, the enhancer activates the adjacent downstream promoter but not the upstream rRNA promoter on the same plasmid. Because this element can restrict the range of an enhancer's activity in the context of tandem genes, we have named it the repeat organizer (RO). The ability to restrict enhancer action is a feature of insulator elements, but unlike previously described insulator elements the RO does not block enhancer action in a simple enhancer-blocking assay. Instead, the activity of the RO requires that it be in its normal position and orientation with respect to the other sequence elements of the rRNA genes. The enhancer-binding transcription factor xUBF also binds to the repetitive sequences of the RO in vitro, but these sequences do not activate transcription in vivo. We propose that the RO is a specialized insulator element that organizes the tandem array of rRNA genes into single-gene expression units by promoting activation of a promoter by its proximal enhancers. PMID:9111359
The interaction between vocabulary size and phonotactic probability effects on children's production accuracy and fluency in nonword repetition.

PubMed

Edwards, Jan; Beckman, Mary E; Munson, Benjamin

2004-04-01

Adults' performance on a variety of tasks suggests that phonological processing of nonwords is grounded in generalizations about sublexical patterns over all known words. A small body of research suggests that children's phonological acquisition is similarly based on generalizations over the lexicon. To test this account, production accuracy and fluency were examined in nonword repetitions by 104 children and 22 adults. Stimuli were 22 pairs of nonwords, in which one nonword contained a low-frequency or unattested two-phoneme sequence and the other contained a high-frequency sequence. For a subset of these nonword pairs, segment durations were measured. The same sound was produced with a longer duration (less fluently) when it appeared in a low-frequency sequence, as compared to a high-frequency sequence. Low-frequency sequences were also repeated with lower accuracy than high-frequency sequences. Moreover, children with smaller vocabularies showed a larger influence of frequency on accuracy than children with larger vocabularies. Taken together, these results provide support for a model of phonological acquisition in which knowledge of sublexical units emerges from generalizations made over lexical items.
New tool to assemble repetitive regions using next-generation sequencing data

NASA Astrophysics Data System (ADS)

Kuśmirek, Wiktor; Nowak, Robert M.; Neumann, Łukasz

2017-08-01

Next-generation sequencing techniques produce a large amount of sequencing data. Some parts of the genome are composed of repetitive DNA sequences, which are very problematic for existing genome assemblers. We propose a modification of the algorithm for DNA assembly, which uses the relative frequency of reads to properly reconstruct repetitive sequences. The new approach was implemented and tested; as a demonstration of the capability of our software, we present some results for model organisms. The implementation uses a three-layer software architecture in which the presentation layer, data processing layer, and data storage layer are kept separate. Source code, as well as a demo application with a web interface and additional data, is available at the project web page: http://dnaasm.sourceforge.net.
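The general idea of using relative frequency as a hint of repeat copy number can be illustrated with a toy k-mer counter. This is not the dnaasm algorithm, whose details the abstract does not give; the function names, the use of k-mers rather than whole reads, and the choice of the median count as the single-copy baseline are all assumptions made for this sketch.

```python
from collections import Counter
from statistics import median


def kmer_counts(reads, k):
    """Count every k-mer over all reads (forward strand only, for simplicity)."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def estimate_copy_numbers(counts):
    """Estimate copies as count / median count.

    Assumes most k-mers are single-copy, so the median approximates
    the sequencing depth of unique sequence.
    """
    baseline = median(counts.values())
    return {kmer: max(1, round(n / baseline)) for kmer, n in counts.items()}


# Hypothetical error-free reads; "CGTA" is sampled three times.
toy_reads = ["ACGTA", "ACGTA", "CGTAC", "TTGCA"]
copies = estimate_copy_numbers(kmer_counts(toy_reads, k=4))
print(copies["CGTA"])  # 3
print(copies["TTGC"])  # 1
```

An assembler could use such estimates to decide how many times a repeated segment should be traversed, although real data would also require handling sequencing errors and uneven coverage.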
Genome-wide analysis of LTR-retrotransposon diversity and its impact on the evolution of the genus Helianthus (L.).

PubMed

Mascagni, Flavia; Giordani, Tommaso; Ceccarelli, Marilena; Cavallini, Andrea; Natali, Lucia

2017-08-18

Genome divergence by mobile element activity and recombination is a continuous process that plays a key role in the evolution of species. Nevertheless, knowledge of retrotransposon-related variability among species belonging to the same genus is still limited. Considering the importance of the genus Helianthus, a model system for studying the ecological genetics of speciation and adaptation, we performed a comparative analysis of the repetitive genome fraction across ten species and one subspecies of sunflower, focusing on long terminal repeat retrotransposons at superfamily, lineage and sublineage levels. After determining the relative genome size of each species, genomic DNA was isolated and subjected to Illumina sequencing. Then, different assembling and clustering approaches allowed exploration of the repetitive component of all genomes. On average, repetitive DNA in Helianthus species represented more than 75% of the genome, being composed mostly of long terminal repeat retrotransposons. Also, the Gypsy superfamily prevailed over the Copia superfamily and, among lineages, Chromovirus was by far the most represented. Although nearly all the same sublineages are present in all species, we found considerable variability in the abundance of diverse retrotransposon lineages and sublineages, especially between annual and perennial species. This large variability suggests that different events of amplification or loss related to these elements occurred following species separation and may have been involved in species differentiation. Our data allowed us to infer the extent of interspecific repetitive DNA variation related to LTR-RE abundance, to investigate the relationship between changes of LTR-RE abundance and the evolution of the genus, and to determine the degree of coevolution of different LTR-RE lineages or sublineages between and within species. Moreover, the data suggested that LTR-RE abundance in a species was affected by the annual or perennial habit of that species.
Development of synthetic selfish elements based on modular nucleases in Drosophila melanogaster.

PubMed

Simoni, Alekos; Siniscalchi, Carla; Chan, Yuk-Sang; Huen, David S; Russell, Steven; Windbichler, Nikolai; Crisanti, Andrea

2014-06-01

Selfish genes are DNA elements that increase their rate of genetic transmission at the expense of other genes in the genome and can therefore quickly spread within a population. It has been suggested that selfish elements could be exploited to modify the genome of entire populations for medical and ecological applications. Here we report that transcription activator-like effector nuclease (TALEN) and zinc finger nuclease (ZFN) can be engineered into site-specific synthetic selfish elements (SSEs) and demonstrate their transmission of up to 70% in the Drosophila germline. We show here that SSEs can spread via DNA break-induced homologous recombination, a process known as 'homing' similar to that observed for homing endonuclease genes (HEGs), despite their fundamentally different modes of DNA binding and cleavage. We observed that TALEN and ZFN have a reduced capability of secondary homing compared to HEG as their repetitive structure had a negative effect on their genetic stability. The modular architecture of ZFNs and TALENs allows for the rapid design of novel SSEs against specific genomic sequences making them potentially suitable for the genetic engineering of wild-type populations of animals and plants, in applications such as gene replacement or population suppression of pest species. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
The Genome Sequence of Taurine Cattle: A window to ruminant biology and evolution

PubMed Central

Elsik, Christine G.; Tellam, Ross L.; Worley, Kim C.

2010-01-01

To understand the biology and evolution of ruminants, the cattle genome was sequenced to ∼7× coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1,217 are absent or undetected in non-eutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides an enabling resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production. PMID:19390049
A rule-based, dose-finding design for use in stroke rehabilitation research: methodological development.

PubMed

Colucci, E; Clark, A; Lang, C E; Pomeroy, V M

2017-12-01

Dose-optimisation studies as precursors to clinical trials are rare in stroke rehabilitation. To develop a rule-based, dose-finding design for stroke rehabilitation research. 3+3 rule-based, dose-finding study. Dose escalation/de-escalation was undertaken according to preset rules and a mathematical sequence (modified Fibonacci sequence). The target starting daily dose was 50 repetitions for the first cohort. Adherence was recorded by an electronic counter. At the end of the 2-week training period, the adherence record indicated dose tolerability (adherence to target dose) and the outcome measure indicated dose benefit (10% increase in motor function). The preset increment/decrease and checking rules were then applied to set the dose for the subsequent cohort. The process was repeated until preset stopping rules were met. Participants had a mean age of 68 (range 48 to 81) years, and were a mean of 70 (range 9 to 289) months post stroke with moderate upper limb paresis. A custom-built model of exercise-based training to enhance ability to open the paretic hand. Repetitions per minute of extension/flexion of paretic digits against resistance. Usability of the preset rules and whether the maximally tolerated dose was identifiable. Five cohorts of three participants were involved. Discernibly different doses were set for each subsequent cohort (i.e. 50, 100, 167, 251 and 209 repetitions/day). The maximally tolerated dose for the model training task was 209 repetitions/day. This dose-finding design is a feasible method for use in stroke rehabilitation research. Copyright © 2017 Chartered Society of Physiotherapy. All rights reserved.
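The dose sequence reported above (50, 100, 167, 251 and 209 repetitions/day) can be reproduced with a short sketch. The abstract does not state the exact preset rules, so the following are assumptions: the escalation increments are the commonly cited modified Fibonacci percentages (100%, 67%, 50%, 40%, 33%), and the de-escalation step is taken as the midpoint between the last tolerated dose and the dose that was not tolerated. The function names are hypothetical.

```python
# Assumed modified Fibonacci increments, in percent of the current dose.
MODIFIED_FIBONACCI_PCT = [100, 67, 50, 40, 33]


def escalate(dose, step):
    """Next higher dose, rounded half up using integer arithmetic."""
    pct = MODIFIED_FIBONACCI_PCT[min(step, len(MODIFIED_FIBONACCI_PCT) - 1)]
    return (dose * (100 + pct) + 50) // 100


def de_escalate(last_tolerated, not_tolerated):
    """Assumed rule: step back to the midpoint of the two bracketing doses."""
    return (last_tolerated + not_tolerated) // 2


doses = [50]  # starting daily dose from the abstract
for step in range(3):
    doses.append(escalate(doses[-1], step))
doses.append(de_escalate(doses[-2], doses[-1]))
print(doses)  # [50, 100, 167, 251, 209]
```

In a 3+3 design each dose is given to a cohort of three participants, and the decision to escalate, repeat or de-escalate depends on how many of them tolerate and benefit from that dose; the sketch only covers the dose arithmetic, not those cohort-level decisions.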
Analysis of the giant genomes of Fritillaria (Liliaceae) indicates that a lack of DNA removal characterizes extreme expansions in genome size.

PubMed

Kelly, Laura J; Renny-Byfield, Simon; Pellicer, Jaume; Macas, Jiří; Novák, Petr; Neumann, Pavel; Lysak, Martin A; Day, Peter D; Berger, Madeleine; Fay, Michael F; Nichols, Richard A; Leitch, Andrew R; Leitch, Ilia J

2015-10-01

Plants exhibit an extraordinary range of genome sizes, varying by > 2000-fold between the smallest and largest recorded values. In the absence of polyploidy, changes in the amount of repetitive DNA (transposable elements and tandem repeats) are primarily responsible for genome size differences between species. However, there is ongoing debate regarding the relative importance of amplification of repetitive DNA versus its deletion in governing genome size. Using data from 454 sequencing, we analysed the most repetitive fraction of some of the largest known genomes for diploid plant species, from members of Fritillaria. We revealed that genomic expansion has not resulted from the recent massive amplification of just a handful of repeat families, as shown in species with smaller genomes. Instead, the bulk of these immense genomes is composed of highly heterogeneous, relatively low-abundance repeat-derived DNA, supporting a scenario where amplified repeats continually accumulate due to infrequent DNA removal. Our results indicate that a lack of deletion and low turnover of repetitive DNA are major contributors to the evolution of extremely large genomes and show that their size cannot simply be accounted for by the activity of a small number of high-abundance repeat families. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Sequence of Spider Aciniform and Piriform Silks

DTIC Science & Technology

2001-09-19

7/98nd subtan-6/01 4. TITLE AND SUBTITLE Sequence of Spider Aciniform and Piriform Silks 5. FUNDING NUMBERS DAAD19-01-1-0569 6... aciniform glands from Argiope trifasciata were used to construct a cDNA library. The library was probed with various DNA probes based on known spider silk ... sequence in a number of other spider silks. The 5' end of the clone still appears to be repetitive sequence and thus it is unlikely to be a full-length
Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

DOEpatents

Weier, H.U.G.; Gray, J.W.

1995-06-27

A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.
Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

DOEpatents

Weier, Heinz-Ulrich G.; Gray, Joe W.

1995-01-01

A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.
Targeted Capture Sequencing in Whitebark Pine Reveals Range-Wide Demographic and Adaptive Patterns Despite Challenges of a Large, Repetitive Genome.

PubMed

Syring, John V; Tennessen, Jacob A; Jennings, Tara N; Wegrzyn, Jill; Scelfo-Dalbey, Camille; Cronn, Richard

2016-01-01

Whitebark pine (Pinus albicaulis) inhabits an expansive range in western North America, and it is a keystone species of subalpine environments. Whitebark is susceptible to multiple threats - climate change, white pine blister rust, mountain pine beetle, and fire exclusion - and it is suffering significant mortality range-wide, prompting the tree to be listed as 'globally endangered' by the International Union for Conservation of Nature and 'endangered' by the Canadian government. Conservation collections (in situ and ex situ) are being initiated to preserve the genetic legacy of the species. Reliable, transferrable, and highly variable genetic markers are essential for quantifying the genetic profiles of seed collections relative to natural stands, and ensuring the completeness of conservation collections. We evaluated the use of hybridization-based target capture to enrich specific genomic regions from the 27 GB genome of whitebark pine, and to evaluate genetic variation across loci, trees, and geography. Probes were designed to capture 7,849 distinct genes, and screening was performed on 48 trees. Despite the inclusion of repetitive elements in the probe pool, the resulting dataset provided information on 4,452 genes and 32% of targeted positions (528,873 bp), and we were able to identify 12,390 segregating sites from 47 trees. Variations reveal strong geographic trends in heterozygosity and allelic richness, with trees from the southern Cascade and Sierra Range showing the greatest distinctiveness and differentiation. Our results show that even under non-optimal conditions (low enrichment efficiency; inclusion of repetitive elements in baits), targeted enrichment produces high quality, codominant genotypes from large genomes. The resulting data can be readily integrated into management and gene conservation activities for whitebark pine, and have the potential to be applied to other members of 5-needle pine group (Pinus subsect. Quinquefolia) due to their limited genetic divergence.
Isolation of centromeric-tandem repetitive DNA sequences by chromatin affinity purification using a HaloTag7-fused centromere-specific histone H3 in tobacco.

PubMed

Nagaki, Kiyotaka; Shibata, Fukashi; Kanatani, Asaka; Kashihara, Kazunari; Murata, Minoru

2012-04-01

The centromere is a multi-functional complex comprising centromeric DNA and a number of proteins. To isolate unidentified centromeric DNA sequences, centromere-specific histone H3 variants (CENH3) and chromatin immunoprecipitation (ChIP) have been utilized in some plant species. However, anti-CENH3 antibody for ChIP must be raised in each species because of its species specificity. Production of the antibodies is time-consuming and costly, and it is not easy to produce ChIP-grade antibodies. In this study, we applied a HaloTag7-based chromatin affinity purification system to isolate centromeric DNA sequences in tobacco. This system required no specific antibody, and made it possible to apply a highly stringent wash to remove contaminating DNA. As a result, we succeeded in isolating five tandem repetitive DNA sequences in addition to the centromeric retrotransposons that were previously identified by ChIP. Three of the tandem repeats were centromere-specific sequences located on different chromosomes. These results confirm the validity of the HaloTag7-based chromatin affinity purification system as an alternative method to ChIP for isolating unknown centromeric DNA sequences. The discovery of more than two chromosome-specific centromeric DNA sequences indicates the mosaic structure of tobacco centromeres. © Springer-Verlag 2011
Effects of "D"-Amphetamine and Ethanol on Variable and Repetitive Key-Peck Sequences in Pigeons

ERIC Educational Resources Information Center

Ward, Ryan D.; Bailey, Ericka M.; Odum, Amy L.

2006-01-01

This experiment assessed the effects of "d"-Amphetamine and ethanol on reinforced variable and repetitive key-peck sequences in pigeons. Pigeons responded on two keys under a multiple schedule of Repeat and Vary components. In the Repeat component, completion of a target sequence of right, right, left, left resulted in food. In the Vary component,…
DArT Markers Effectively Target Gene Space in the Rye Genome

PubMed Central

Gawroński, Piotr; Pawełkowicz, Magdalena; Tofil, Katarzyna; Uszyński, Grzegorz; Sharifova, Saida; Ahluwalia, Shivaksh; Tyrka, Mirosław; Wędzony, Maria; Kilian, Andrzej; Bolibok-Brągoszewska, Hanna

2016-01-01

Large genome size and complexity considerably hamper genomics research in the relevant species. Rye (Secale cereale L.) has one of the largest genomes among cereal crops, and repetitive sequences account for over 90% of its length. Diversity Arrays Technology is a high-throughput genotyping method in which a preferential sampling of gene-rich regions is achieved through the use of methylation-sensitive restriction enzymes. We obtained sequences of 6,177 rye DArT markers and, following a redundancy analysis, assembled them into 3,737 non-redundant sequences, which were then used in homology searches against five Pooideae sequence sets. In total, 515 DArT sequences could be incorporated into publicly available rye genome zippers, providing a starting point for the integration of DArT- and transcript-based genomics resources in rye. Using the Blast2Go pipeline, we attributed putative gene functions to 1101 (29.4%) of the non-redundant DArT marker sequences, including 132 sequences with putative disease resistance-related functions, which were found to be preferentially located in the 4RL and 6RL chromosomes. Comparative analysis based on the DArT sequences revealed obvious inconsistencies between two recently published high density consensus maps of rye. Furthermore, we demonstrated that DArT marker sequences can be a source of SSR polymorphisms. The data obtained demonstrate that DArT markers effectively target gene space in the large, complex, and repetitive rye genome. Through the annotation of putative gene functions and the alignment of DArT sequences relative to reference genomes, we obtained information that will complement the results of studies in which DArT genotyping was deployed, by simplifying the gene ontology and microcolinearity-based identification of candidate genes. PMID:27833625
DArT Markers Effectively Target Gene Space in the Rye Genome.

PubMed

Gawroński, Piotr; Pawełkowicz, Magdalena; Tofil, Katarzyna; Uszyński, Grzegorz; Sharifova, Saida; Ahluwalia, Shivaksh; Tyrka, Mirosław; Wędzony, Maria; Kilian, Andrzej; Bolibok-Brągoszewska, Hanna

2016-01-01

Large genome size and complexity considerably hamper genomics research in the relevant species. Rye (Secale cereale L.) has one of the largest genomes among cereal crops, and repetitive sequences account for over 90% of its length. Diversity Arrays Technology is a high-throughput genotyping method in which a preferential sampling of gene-rich regions is achieved through the use of methylation-sensitive restriction enzymes. We obtained sequences of 6,177 rye DArT markers and, following a redundancy analysis, assembled them into 3,737 non-redundant sequences, which were then used in homology searches against five Pooideae sequence sets. In total, 515 DArT sequences could be incorporated into publicly available rye genome zippers, providing a starting point for the integration of DArT- and transcript-based genomics resources in rye. Using the Blast2Go pipeline, we attributed putative gene functions to 1101 (29.4%) of the non-redundant DArT marker sequences, including 132 sequences with putative disease resistance-related functions, which were found to be preferentially located in the 4RL and 6RL chromosomes. Comparative analysis based on the DArT sequences revealed obvious inconsistencies between two recently published high density consensus maps of rye. Furthermore, we demonstrated that DArT marker sequences can be a source of SSR polymorphisms. The data obtained demonstrate that DArT markers effectively target gene space in the large, complex, and repetitive rye genome. Through the annotation of putative gene functions and the alignment of DArT sequences relative to reference genomes, we obtained information that will complement the results of studies in which DArT genotyping was deployed, by simplifying the gene ontology and microcolinearity-based identification of candidate genes.

Inhibition in motor imagery: a novel action mode switching paradigm.

PubMed

Rieger, Martina; Dahm, Stephan F; Koch, Iring

2017-04-01

Motor imagery requires that actual movements are prevented (i.e., inhibited) from execution. To investigate at what level inhibition takes place in motor imagery, we developed a novel action mode switching paradigm. Participants imagined (indicating only start and end) and executed movements from start buttons to target buttons, and we analyzed trial sequence effects. Trial sequences depended on current action mode (imagination or execution), previous action mode (pure blocks/same mode, mixed blocks/same mode, or mixed blocks/other mode), and movement sequence (action repetition, hand repetition, or hand alternation). Results provided evidence for global inhibition (indicated by switch benefits in execution-imagination (E-I)-sequences in comparison to I-I-sequences), effector-specific inhibition (indicated by hand repetition costs after an imagination trial), and target inhibition (indicated by target repetition benefits in I-I-sequences). No evidence for subthreshold motor activation or action-specific inhibition (inhibition of the movement of an effector to a specific target) was obtained. Two (global inhibition and effector-specific inhibition) of the three observed mechanisms are active inhibition mechanisms. In conclusion, motor imagery is not simply a weaker form of execution, which often is implied in views focusing on similarities between imagination and execution.
Design and preparation of beta-sheet forming repetitive and block-copolymerized polypeptides.

PubMed

Higashiya, Seiichiro; Topilina, Natalya I; Ngo, Silvana C; Zagorevskii, Dmitri; Welch, John T

2007-05-01

The design and rapid construction of libraries of genes coding beta-sheet forming repetitive and block-copolymerized polypeptides bearing various C- and N-terminal sequences are described. The design was based on the assembly of DNA cassettes coding for the (GA)3GX amino acid sequence where the (GAGAGA) sequences would constitute the beta-strand units of a larger beta-sheet assembly. The edges of this beta-sheet would be functionalized by the turn-inducing amino acids (GX). The polypeptides were expressed in Escherichia coli using conventional vectors and were purified by Ni-nitrilotriacetic acid (NTA) chromatography. The correlation of polymer structure with molecular weight was investigated by gel electrophoresis and mass spectrometry. The monomer sequences and post-translational chemical modifications were found to influence the mobility of the polypeptides over the full range of polypeptide molecular weights while the electrophoretic mobility of lower molecular weight polypeptides was more susceptible to C- and N-termini polypeptide modifications.
Cross-species bacterial artificial chromosome (BAC) library screening via overgo-based hybridization and BAC-contig mapping of a yield enhancement quantitative trait locus (QTL) yld1.1 in the Malaysian wild rice Oryza rufipogon.

PubMed

Song, Beng-Kah; Nadarajah, Kalaivani; Romanov, Michael N; Ratnam, Wickneswari

2005-01-01

The construction of BAC-contig physical maps is an important step towards a partial or ultimate genome sequence analysis. Here, we describe our initial efforts to apply an overgo approach to screen a BAC library of the Malaysian wild rice species, Oryza rufipogon. Overgo design is based on repetitive element masking and sequence uniqueness, and uses short probes (approximately 40 bp), making this method highly efficient and specific. Pairs of 24-bp oligos that contain an 8-bp overlap were developed from the publicly available genomic sequences of the cultivated rice, O. sativa, to generate 20 overgo probes for a 1-Mb region that encompasses a yield enhancement QTL yld1.1 in O. rufipogon. The advantages of a high similarity in melting temperature, hybridization kinetics and specific activities of overgos further enabled a pooling strategy for library screening by filter hybridization. Two pools of ten overgos each were hybridized to high-density filters representing the O. rufipogon genomic BAC library. These screening tests succeeded in providing 69 PCR-verified positive hits from a total of 23,040 BAC clones of the entire O. rufipogon library. A minimal tiling path of clones was generated to contribute to a fully covered BAC-contig map of the targeted 1-Mb region. The developed protocol for overgo design based on O. sativa sequences as a comparative genomic framework, and the pooled overgo hybridization screening technique are suitable means for high-resolution physical mapping and the identification of BAC candidates for sequencing.
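The overgo geometry described in the record above (two 24-bp oligos sharing an 8-bp overlap, giving a probe of about 40 bp) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the example target sequence are hypothetical and do not come from the study, and the repeat-masking and uniqueness checks mentioned in the abstract are omitted.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    complement = str.maketrans("ACGT", "TGCA")
    return seq.translate(complement)[::-1]


def design_overgo(target: str, oligo_len: int = 24, overlap: int = 8):
    """Split a target into two oligos that overlap at their 3' ends.

    The forward oligo is the first `oligo_len` bases of the target; the
    reverse oligo is the reverse complement of the last `oligo_len` bases.
    With 24-bp oligos and an 8-bp overlap the target must be 40 bp long.
    """
    probe_len = 2 * oligo_len - overlap
    if len(target) != probe_len:
        raise ValueError(f"target must be exactly {probe_len} bp")
    forward = target[:oligo_len]
    reverse = reverse_complement(target[probe_len - oligo_len:])
    return forward, reverse


# Hypothetical 40-bp target, used only to exercise the function.
target = "ACGTTGCAAGCTTAGGCTAACGTTAGCCATGGATCCGTAA"
forward, reverse = design_overgo(target)
print(len(forward), len(reverse))  # 24 24
# The last 8 bases of the forward oligo pair with the 3' end of the reverse oligo.
print(forward[-8:] == reverse_complement(reverse)[:8])  # True
```

After annealing, the single-stranded overhangs on each side of the 8-bp duplex would be filled in to produce the full-length labeled probe; that step is not modeled here.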
Meeting Report: The Role of the Mobilome in Cancer

PubMed Central

Ardeljan, Daniel; Taylor, Martin S.; Burns, Kathleen H.; Boeke, Jef D.; Espey, Michael Graham; Woodhouse, Elisa C.; Howcroft, T. Kevin

2016-01-01

Approximately half of the human genome consists of repetitive sequence attributed to the activities of mobile DNAs, including DNA transposons, RNA transposons, and endogenous retroviruses. Of these, only Long INterspersed Elements (LINE-1 or L1) and sequences copied by LINE-1 remain mobile in our species today. Although cells restrict L1 activity by both transcriptional and post-transcriptional mechanisms, L1 de-repression occurs in developmental and pathologic contexts, including many types of cancers. However, we have limited knowledge of the extent and consequences of L1 expression in premalignancies and cancer. Participants in this NIH strategic workshop considered key questions to enhance our understanding of mechanisms and roles the mobilome may play in cancer biology. PMID:27527733
Draft genome sequence of ramie, Boehmeria nivea (L.) Gaudich.

PubMed

Luan, Ming-Bao; Jian, Jian-Bo; Chen, Ping; Chen, Jun-Hui; Chen, Jian-Hua; Gao, Qiang; Gao, Gang; Zhou, Ju-Hong; Chen, Kun-Mei; Guang, Xuan-Min; Chen, Ji-Kang; Zhang, Qian-Qian; Wang, Xiao-Fei; Fang, Long; Sun, Zhi-Min; Bai, Ming-Zhou; Fang, Xiao-Dong; Zhao, Shan-Cen; Xiong, He-Ping; Yu, Chun-Ming; Zhu, Ai-Guo

2018-05-01

Ramie, Boehmeria nivea (L.) Gaudich, family Urticaceae, is a plant native to eastern Asia, and one of the world's oldest fibre crops. It is also used as animal feed and for the phytoremediation of heavy metal-contaminated farmlands. Thus, the genome sequence of ramie was determined to explore the molecular basis of its fibre quality, protein content and phytoremediation. For further understanding ramie genome, different paired-end and mate-pair libraries were combined to generate 134.31 Gb of raw DNA sequences using the Illumina whole-genome shotgun sequencing approach. The highly heterozygous B. nivea genome was assembled using the Platanus Genome Assembler, which is an effective tool for the assembly of highly heterozygous genome sequences. The final length of the draft genome of this species was approximately 341.9 Mb (contig N50 = 22.62 kb, scaffold N50 = 1,126.36 kb). Based on ramie genome annotations, 30,237 protein-coding genes were predicted, and the repetitive element content was 46.3%. The completeness of the final assembly was evaluated by benchmarking universal single-copy orthologous genes (BUSCO); 90.5% of the 1,440 expected embryophytic genes were identified as complete, and 4.9% were identified as fragmented. Phylogenetic analysis based on single-copy gene families and one-to-one orthologous genes placed ramie with mulberry and cannabis, within the clade of urticalean rosids. Genome information of ramie will be a valuable resource for the conservation of endangered Boehmeria species and for future studies on the biogeography and characteristic evolution of members of Urticaceae. © 2018 John Wiley & Sons Ltd.
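The contig and scaffold N50 values quoted in the record above are a standard assembly statistic. The sketch below assumes the common definition: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the assembly. The function name and the toy lengths are illustrative and are not taken from the ramie assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("at least one length is required")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop at the first sequence that brings the running total
        # to at least half of the total assembly length.
        if running * 2 >= total:
            return length


# Toy example: total = 54, half = 27, and 10 + 9 + 8 = 27.
print(n50([2, 3, 4, 5, 6, 7, 8, 9, 10]))  # 8
```

A larger N50 means that more of the assembly sits in long sequences, which is why the scaffold N50 (1,126.36 kb) is much higher than the contig N50 (22.62 kb) once contigs are linked.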
Discrimination of Scedosporium prolificans against Pseudallescheria boydii and Scedosporium apiospermum by semiautomated repetitive sequence-based PCR.

PubMed

Steinmann, J; Schmidt, D; Buer, J; Rath, P-M

2011-07-01

The laboratory identification of Pseudallescheria and Scedosporium isolates at the species level is important for clinical and epidemiological purposes. This study used semiautomated repetitive sequence-based polymerase chain reaction (rep-PCR) to identify Pseudallescheria/Scedosporium. Reference strains of Pseudallescheria boydii (n = 12), Scedosporium prolificans (n = 8), Scedosporium apiospermum (n = 9), and clinical/environmental isolates (P. boydii, 7; S. prolificans, 7; S. apiospermum, 7) were analyzed by rep-PCR. All clinical isolates were identified by morphological and phenotypic characteristics and by sequence analysis. Species identification of reference strains was based on the results of available databases. Rep-PCR studies were also conducted with various molds to differentiate Pseudallescheria/Scedosporium spp. from other commonly encountered filamentous fungi. All tested Pseudallescheria/Scedosporium isolates were distinguishable from the other filamentous fungi. All Scedosporium prolificans strains clustered within the cutoff of 85%, and species identification by rep-PCR showed an agreement of 100% with sequence analysis. However, several isolates of P. boydii and S. apiospermum did not cluster within the 85% cutoff with the same species by rep-PCR. Although the identification of P. boydii and S. apiospermum was not correct, the semiautomated rep-PCR system is a promising tool for the identification of S. prolificans isolates.
Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome

PubMed Central

2009-01-01

Background: Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. Results: We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. Conclusion: We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes. PMID:19656416
Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome.

PubMed

Hamberger, Björn; Hall, Dawn; Yuen, Mack; Oddy, Claire; Hamberger, Britta; Keeling, Christopher I; Ritland, Carol; Ritland, Kermit; Bohlmann, Jörg

2009-08-06

Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes.
MRI image plane nonuniformity in evaluation of ferrous sulphate dosimeter gel (FeGel) by means of T1-relaxation time.

PubMed

Magnusson, P; Bäck, S A; Olsson, L E

1999-11-01

MR image nonuniformity can vary significantly with the spin-echo pulse sequence repetition time. When MR images with different nonuniformity shapes are used in a T1-calculation the resulting T1-image becomes nonuniform. As shown in this work the uniformity TR-dependence of the spin-echo pulse sequence is a critical property for T1 measurements in general and for ferrous sulfate dosimeter gel (FeGel) applications in particular. The purpose was to study the characteristics of the MR image plane nonuniformity in FeGel evaluation. This included studies of the possibility of decreasing nonuniformities by selecting uniformity optimized repetition times, studies of the transmitted and received RF-fields and studies of the effectiveness of the correction methods background subtraction and quotient correction. A pronounced MR image nonuniformity variation with repetition and T1 relaxation time was observed, and was found to originate from nonuniform RF-transmission in combination with the inherent differences in T1 relaxation for different repetition times. Neither the T1 calculation itself, the uniformity optimized repetition times, nor any of the correction methods studied could sufficiently correct the nonuniformities observed in the T1 images. The nonuniformities were found to vary considerably less with inversion time for the inversion-recovery pulse sequence, than with repetition time for the spin-echo pulse sequence, resulting in considerably lower T1 image nonuniformity levels.
Genome structure of bdelloid rotifers: shaped by asexuality or desiccation?

PubMed

Gladyshev, Eugene A; Arkhipova, Irina R

2010-01-01

Bdelloid rotifers are microscopic invertebrate animals best known for their ancient asexuality and the ability to survive desiccation at any life stage. Both factors are expected to have a profound influence on their genome structure. Recent molecular studies demonstrated that, although the gene-rich regions of bdelloid genomes are organized as colinear pairs of closely related sequences and depleted in repetitive DNA, subtelomeric regions harbor diverse transposable elements and horizontally acquired genes of foreign origin. Although asexuality is expected to result in depletion of deleterious transposons, only desiccation appears to have the power to produce all the uncovered genomic peculiarities. Repair of desiccation-induced DNA damage would require the presence of a homologous template, maintaining colinear pairs in gene-rich regions and selecting against insertion of repetitive DNA that might cause chromosomal rearrangements. Desiccation may also induce a transient state of competence in recovering animals, allowing them to acquire environmental DNA. Even if bdelloids engage in rare or obscure forms of sexual reproduction, all these features could still be present. The relative contribution of asexuality and desiccation to genome organization may be clarified by analyzing whole-genome sequences and comparing foreign gene and transposon content in species which lost the ability to survive desiccation.
Chromosomal distribution of microsatellite repeats in Amazon cichlids genome (Pisces, Cichlidae)

PubMed Central

Schneider, Carlos Henrique; Gross, Maria Claudia; Terencio, Maria Leandra; de Tavares, Édika Sabrina Girão Mitozo; Martins, Cesar; Feldberg, Eliana

2015-01-01

Fish of the family Cichlidae are recognized as an excellent model for evolutionary studies because of their morphological and behavioral adaptations to a wide diversity of explored ecological niches. In addition, the family has a dynamic genome with variable structure, composition and karyotype organization. Microsatellites represent the most dynamic genomic component and a better understanding of their organization may help clarify the role of repetitive DNA elements in the mechanisms of chromosomal evolution. Thus, in this study, microsatellite sequences were mapped in the chromosomes of Cichla monoculus Agassiz, 1831, Pterophyllum scalare Schultze, 1823, and Symphysodon discus Heckel, 1840. Four microsatellites demonstrated positive results in the genome of Cichla monoculus and Symphysodon discus, and five demonstrated positive results in the genome of Pterophyllum scalare. In most cases, the microsatellite was dispersed in the chromosome with conspicuous markings in the centromeric or telomeric regions, which suggests that these sequences contribute to chromosome structure and may have played a role in the evolution of this fish family. The comparative genome mapping data presented here provide novel information on the structure and organization of the repetitive DNA region of the cichlid genome and contribute to a better understanding of this fish family’s genome. PMID:26753076
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing.

PubMed

Hribová, Eva; Neumann, Pavel; Matsumoto, Takashi; Roux, Nicolas; Macas, Jirí; Dolezel, Jaroslav

2010-09-16

Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection.
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing

PubMed Central

2010-01-01

Background: Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. Results: In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. Conclusion: A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection. PMID:20846365
A theory that may explain the Hayflick limit--a means to delete one copy of a repeating sequence during each cell cycle in certain human cells such as fibroblasts.

PubMed

Naveilhan, P; Baudet, C; Jabbour, W; Wion, D

1994-09-01

A model that may explain the limited division potential of certain cells such as human fibroblasts in culture is presented. The central postulate of this theory is that there exists, prior to certain key exons that code for materials needed for cell division, a unique sequence of specific repeating segments of DNA. One copy of such repeating segments is deleted during each cell cycle in cells that are not protected from such deletion through methylation of their cytosine residues. According to this theory, the means through which such repeated sequences are removed, one per cycle, is through the sequential action of enzymes that act much as bacterial restriction enzymes do--namely to produce scissions in both strands of DNA in areas that correspond to the DNA base sequence recognition specificities of such enzymes. After the first scission early in a replicative cycle, that enzyme becomes inhibited, but the cleavage of the first site exposes the closest site in the repetitive element to the action of a second restriction enzyme after which that enzyme also becomes inhibited. Then repair occurs, regenerating the original first site. Through this sequential activation and inhibition of two different restriction enzymes, only one copy of the repeating sequence is deleted during each cell cycle. In effect, the repeating sequence operates as a precise counter of the numbers of cell doubling that have occurred since the cells involved differentiated during development.
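The counting mechanism proposed in the record above can be pictured as a simple decrementing counter. The Python sketch below is a toy illustration only. The function name, the starting copy number, and the rule that division stops once no repeat copies remain are assumptions made for the example; they are not values or claims taken from the paper.

```python
def simulate_repeat_counter(initial_copies, max_cycles, methylated=False):
    """Toy model: one repeat copy is deleted per cell cycle unless protected.

    Returns (completed_divisions, remaining_copies).
    Assumption for illustration: a cell with zero copies no longer divides.
    """
    copies = initial_copies
    completed = 0
    for _ in range(max_cycles):
        if copies == 0:
            break  # counter exhausted, so the toy model halts division
        if not methylated:
            copies -= 1  # unprotected repeats lose one copy per cycle
        completed += 1
    return completed, copies


# Unprotected cells stop after as many divisions as they had repeat copies.
print(simulate_repeat_counter(50, 100))                   # (50, 0)
# Methylation-protected cells keep their copies and keep dividing.
print(simulate_repeat_counter(50, 100, methylated=True))  # (100, 50)
```

In this picture the number of repeat copies present when a cell differentiates sets its remaining division potential, which matches the abstract's description of the repeats as a counter of cell doublings.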
Inverse PCR-based method for isolating novel SINEs from genome.

PubMed

Han, Yawei; Chen, Liping; Guan, Lihong; He, Shunping

2014-04-01

Short interspersed elements (SINEs) are moderately repetitive DNA sequences in eukaryotic genomes. Although eukaryotic genomes contain numerous SINE copies, isolating and identifying them by previously reported methods is difficult and laborious. In this study, inverse PCR was successfully applied to isolate SINEs from the genome of Opsariichthys bidens, an East Asian cyprinid. A group of SINEs derived from the tRNA(Ala) molecule was identified and named Opsar after Opsariichthys. Opsar exhibited the characteristics of SINEs: a tRNA(Ala)-derived region at the 5' end, a tRNA-unrelated region, and an AT-rich region at the 3' end. The tRNA-derived region of Opsar shared 76% sequence similarity with the tRNA(Ala) gene. This result indicated that Opsar could derive from an inactive copy or pseudogene of tRNA(Ala). The reliability of the method was tested by obtaining C-SINE, Ct-SINE, and M-SINEs from the Ctenopharyngodon idellus, Megalobrama amblycephala, and Cyprinus carpio genomes. This method is simpler than those previously reported because it omits many steps, such as preparation of probes, construction of genomic libraries, and hybridization.
Genomic Structure of an Economically Important Cyanobacterium, Arthrospira (Spirulina) platensis NIES-39

PubMed Central

Fujisawa, Takatomo; Narikawa, Rei; Okamoto, Shinobu; Ehira, Shigeki; Yoshimura, Hidehisa; Suzuki, Iwane; Masuda, Tatsuru; Mochimaru, Mari; Takaichi, Shinichi; Awai, Koichiro; Sekine, Mitsuo; Horikawa, Hiroshi; Yashiro, Isao; Omata, Seiha; Takarada, Hiromi; Katano, Yoko; Kosugi, Hiroki; Tanikawa, Satoshi; Ohmori, Kazuko; Sato, Naoki; Ikeuchi, Masahiko; Fujita, Nobuyuki; Ohmori, Masayuki

2010-01-01

A filamentous non-N2-fixing cyanobacterium, Arthrospira (Spirulina) platensis, is an important organism for industrial applications and as a food supply. Almost the complete genome of A. platensis NIES-39 was determined in this study. The genome structure of A. platensis is estimated to be a single, circular chromosome of 6.8 Mb, based on optical mapping. Annotation of this 6.7 Mb sequence yielded 6630 protein-coding genes as well as two sets of rRNA genes and 40 tRNA genes. Of the protein-coding genes, 78% are similar to those of other organisms; the remaining 22% are currently unknown. A total of 612 kb of the genome comprises group II introns, insertion sequences and some repetitive elements. Group I introns are located in a protein-coding region. Abundant restriction-modification systems were determined. Unique features in the gene composition were noted, particularly in a large number of genes for adenylate cyclase and haemolysin-like Ca2+-binding proteins and in chemotaxis proteins. Filament-specific genes were highlighted by comparative genomic analysis. PMID:20203057
Identification of multiple binding sites for the THAP domain of the Galileo transposase in the long terminal inverted-repeats

PubMed Central

Marzo, Mar; Liu, Danxu; Ruiz, Alfredo; Chalmers, Ronald

2013-01-01

Galileo is a DNA transposon responsible for the generation of several chromosomal inversions in Drosophila. In contrast to other members of the P-element superfamily, it has unusually long terminal inverted-repeats (TIRs) that resemble those of Foldback elements. To investigate the function of the long TIRs we derived consensus and ancestral sequences for the Galileo transposase in three species of Drosophilids. Following gene synthesis, we expressed and purified their constituent THAP domains and tested their binding activity towards the respective Galileo TIRs. DNase I footprinting located the most proximal DNA binding site about 70 bp from the transposon end. Using this sequence we identified further binding sites in the tandem repeats that are found within the long TIRs. This suggests that the synaptic complex between Galileo ends may be a complicated structure containing higher-order multimers of the transposase. We also attempted to reconstitute Galileo transposition in Drosophila embryos but no events were detected. Thus, although the limited numbers of Galileo copies in each genome were sufficient to provide functional consensus sequences for the THAP domains, they do not specify a fully active transposase. Since the THAP recognition sequence is short, and will occur many times in a large genome, it seems likely that the multiple binding sites within the long, internally repetitive, TIRs of Galileo and other Foldback-like elements may provide the transposase with its binding specificity. PMID:23648487
Identification of multiple binding sites for the THAP domain of the Galileo transposase in the long terminal inverted-repeats.

PubMed

Marzo, Mar; Liu, Danxu; Ruiz, Alfredo; Chalmers, Ronald

2013-08-01

Galileo is a DNA transposon responsible for the generation of several chromosomal inversions in Drosophila. In contrast to other members of the P-element superfamily, it has unusually long terminal inverted-repeats (TIRs) that resemble those of Foldback elements. To investigate the function of the long TIRs we derived consensus and ancestral sequences for the Galileo transposase in three species of Drosophilids. Following gene synthesis, we expressed and purified their constituent THAP domains and tested their binding activity towards the respective Galileo TIRs. DNase I footprinting located the most proximal DNA binding site about 70 bp from the transposon end. Using this sequence we identified further binding sites in the tandem repeats that are found within the long TIRs. This suggests that the synaptic complex between Galileo ends may be a complicated structure containing higher-order multimers of the transposase. We also attempted to reconstitute Galileo transposition in Drosophila embryos but no events were detected. Thus, although the limited numbers of Galileo copies in each genome were sufficient to provide functional consensus sequences for the THAP domains, they do not specify a fully active transposase. Since the THAP recognition sequence is short, and will occur many times in a large genome, it seems likely that the multiple binding sites within the long, internally repetitive, TIRs of Galileo and other Foldback-like elements may provide the transposase with its binding specificity. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.
Comparative molecular cytogenetics of major repetitive sequence families of three Dendrobium species (Orchidaceae) from Bangladesh

PubMed Central

Begum, Rabeya; Alam, Sheikh Shamimul; Menzel, Gerhard; Schmidt, Thomas

2009-01-01

Background and Aims: Dendrobium species show tremendous morphological diversity and have broad geographical distribution. As repetitive sequence analysis is a useful tool to investigate the evolution of chromosomes and genomes, the aim of the present study was the characterization of repetitive sequences from Dendrobium moschatum for comparative molecular and cytogenetic studies in the related species Dendrobium aphyllum, Dendrobium aggregatum and representatives from other orchid genera. Methods: In order to isolate highly repetitive sequences, a c0t-1 DNA plasmid library was established. Repeats were sequenced and used as probes for Southern hybridization. Sequence divergence was analysed using bioinformatic tools. Repetitive sequences were localized along orchid chromosomes by fluorescence in situ hybridization (FISH). Key Results: Characterization of the c0t-1 library resulted in the detection of repetitive sequences including the (GA)n dinucleotide DmoO11, numerous Arabidopsis-like telomeric repeats and the highly amplified dispersed repeat DmoF14. The DmoF14 repeat is conserved in six Dendrobium species but diversified in representative species of three other orchid genera. FISH analyses showed the genome-wide distribution of DmoF14 in D. moschatum, D. aphyllum and D. aggregatum. Hybridization with the telomeric repeats demonstrated Arabidopsis-like telomeres at the chromosome ends of Dendrobium species. However, FISH using the telomeric probe revealed two pairs of chromosomes with strong intercalary signals in D. aphyllum. FISH showed the terminal position of 5S and 18S–5·8S–25S rRNA genes and a characteristic number of rDNA sites in the three Dendrobium species. Conclusions: The repeated sequences isolated from D. moschatum c0t-1 DNA constitute major DNA families of the D. moschatum, D. aphyllum and D. aggregatum genomes with DmoF14 representing an ancient component of orchid genomes. Large intercalary telomere-like arrays suggest chromosomal rearrangements in D. aphyllum while the number and localization of rRNA genes as well as the species-specific distribution pattern of an abundant microsatellite reflect the genomic diversity of the three Dendrobium species. PMID:19635741
The Genome Biology of Effector Gene Evolution in Filamentous Plant Pathogens.

PubMed

Sánchez-Vallet, Andrea; Fouché, Simone; Fudal, Isabelle; Hartmann, Fanny E; Soyer, Jessica L; Tellier, Aurélien; Croll, Daniel

2018-05-16

Filamentous pathogens, including fungi and oomycetes, pose major threats to global food security. Crop pathogens cause damage by secreting effectors that manipulate the host to the pathogen's advantage. Genes encoding such effectors are among the most rapidly evolving genes in pathogen genomes. Here, we review how the major characteristics of the emergence, function, and regulation of effector genes are tightly linked to the genomic compartments where these genes are located in pathogen genomes. The presence of repetitive elements in these compartments is associated with elevated rates of point mutations and sequence rearrangements with a major impact on effector diversification. The expression of many effectors converges on an epigenetic control mediated by the presence of repetitive elements. Population genomics analyses showed that rapidly evolving pathogens show high rates of turnover at effector loci and display a mosaic in effector presence-absence polymorphism among strains. We conclude that effective pathogen containment strategies require a thorough understanding of the effector genome biology and the pathogen's potential for rapid adaptation. Expected final online publication date for the Annual Review of Phytopathology Volume 56 is August 25, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

TEtools facilitates big data expression analysis of transposable elements and reveals an antagonism between their activity and that of piRNA genes

PubMed Central

Lerat, Emmanuelle; Fablet, Marie; Modolo, Laurent; Lopez-Maestre, Hélène

2017-01-01

Abstract Over recent decades, substantial efforts have been made to understand the interactions between host genomes and transposable elements (TEs). The impact of TEs on the regulation of host genes is well known, with TEs acting as platforms of regulatory sequences. Nevertheless, due to their repetitive nature, it is considerably difficult to integrate TE analysis into genome-wide studies. Here, we developed a specific tool for the analysis of TE expression: TEtools. This tool takes into account the TE sequence diversity of the genome, can be applied to unannotated or unassembled genomes, and is freely available under the GPL3 (https://github.com/l-modolo/TEtools). TEtools performs the mapping of RNA-seq data obtained from classical mRNAs or small RNAs onto a list of TE sequences and performs differential expression analyses with statistical relevance. Using this tool, we analyzed TE expression from five Drosophila wild-type strains. Our data show for the first time that the activity of TEs is strictly linked to the activity of the genes implicated in the piwi-interacting RNA biogenesis and therefore fits an arms race scenario between TE sequences and host control genes. PMID:28204592
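The counting step described in this abstract (reads mapped onto a list of TE sequences, then summarized before differential expression testing) can be illustrated with a minimal sketch. This is not the TEtools code or its interface; the function name, the copy-to-family table, and the read hits below are hypothetical and serve only to show how per-copy hits might be pooled by TE family.

```python
from collections import Counter

def count_by_family(read_hits, copy_to_family):
    """Pool per-read hits on individual TE copies into per-family counts.

    read_hits: iterable of TE copy identifiers, one per mapped read (hypothetical input).
    copy_to_family: dict mapping each TE copy identifier to its family name.
    Reads whose hit is not in the table are ignored.
    """
    counts = Counter()
    for copy_id in read_hits:
        family = copy_to_family.get(copy_id)
        if family is not None:
            counts[family] += 1
    return counts

# Illustrative data only: three TE copies belonging to two families.
copy_to_family = {"copy_001": "roo", "copy_002": "roo", "copy_003": "412"}
read_hits = ["copy_001", "copy_002", "copy_003", "copy_002", "unmapped"]

family_counts = count_by_family(read_hits, copy_to_family)
for family, n in sorted(family_counts.items()):
    print(family, n)
```

Pooling by family is one way to sidestep the ambiguity of assigning a read to a single copy of a repeated sequence; a table of such counts per sample would then be the input to a differential expression test.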
Molecular Characterization of the Pericentric Inversion That Causes Differences Between Chimpanzee Chromosome 19 and Human Chromosome 17

PubMed Central

Kehrer-Sawatzki, Hildegard; Schreiner, Bettina; Tänzer, Simone; Platzer, Matthias; Müller, Stefan; Hameister, Horst

2002-01-01

A comparison of the human genome with that of the chimpanzee is an attractive approach to attempts to understand the specificity of a certain phenotype's development. The two karyotypes differ by one chromosome fusion, nine pericentric inversions, and various additions of heterochromatin to chromosomal telomeres. Only the fusion, which gave rise to human chromosome 2, has been characterized at the sequence level. During the present study, we investigated the pericentric inversion by which chimpanzee chromosome 19 differs from human chromosome 17. Fluorescence in situ hybridization was used to identify breakpoint-spanning bacterial artificial chromosomes (BACs) and plasmid artificial chromosomes (PACs). By sequencing the junction fragments, we localized breakpoints in intergenic regions rich in repetitive elements. Our findings suggest that repeat-mediated nonhomologous recombination has facilitated inversion formation. No addition or deletion of any sequence element was detected at the breakpoints or in the surrounding sequences. Next to the break, at a distance of 10.2–39.1 kb, the following genes were found: NGFR and NXPH3 (on human chromosome 17q21.3) and GUC2D and ALOX15B (on human chromosome 17p13). The inversion affects neither the genomic structure nor the gene-activity state with regard to replication timing of these genes. PMID:12094327
An Enhancer Near ISL1 and an Ultraconserved Exon of PCBP2 are Derived from a Retroposon

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bejerano, Gill; Lowe, Craig; Ahituv, Nadav

2005-11-27

Hundreds of highly conserved distal cis-regulatory elements have been characterized to date in vertebrate genomes. Many thousands more are predicted based on comparative genomics. Yet, in stark contrast to the genes they regulate, virtually none of these regions can be traced using sequence similarity in invertebrates, leaving their evolutionary origin obscure. Here we show that a class of conserved, primarily non-coding regions in tetrapods originated from a novel short interspersed repetitive element (SINE) retroposon family that was active in Sarcopterygii (lobe-finned fishes and terrestrial vertebrates) in the Silurian at least 410 Mya, and, remarkably, appears to be recently active in the "living fossil" Indonesian coelacanth, Latimeria menadoensis. We show that one copy is a distal enhancer, located 500 kb from the neuro-developmental gene ISL1. Several others represent new, possibly regulatory, alternatively spliced exons in the middle of pre-existing Sarcopterygian genes. One of these is the >200 bp ultraconserved region, 100 percent identical in mammals, and 80 percent identical to the coelacanth SINE, that contains a 31 aa alternatively spliced exon of the mRNA processing gene PCBP2. These add to a growing list of examples in which relics of transposable elements have acquired a function that serves their host, a process termed "exaptation", and provide an origin for at least some of the highly conserved vertebrate-specific genomic sequences recently discovered using comparative genomics.
High frequency of silver resistance genes in invasive isolates of Enterobacter and Klebsiella species.

PubMed

Sütterlin, S; Dahlö, M; Tellgren-Roth, C; Schaal, W; Melhus, Å

2017-07-01

Silver-based products have been marketed as an alternative to antibiotics, and their consumption has increased. Bacteria may, however, develop resistance to silver. The aim was to study the presence of genes encoding silver resistance (silE, silP, silS) over time in three clinically important Enterobacteriaceae genera. Using polymerase chain reaction (PCR), 752 bloodstream isolates from the years 1990-2010 were investigated. Age, gender, and ward of patients were registered, and the susceptibility to antibiotics and silver nitrate was tested. Clonality and single nucleotide polymorphism were assessed with repetitive element sequence-based PCR, multi-locus sequence typing, and whole-genome sequencing. Genes encoding silver resistance were detected most frequently in Enterobacter spp. (48%), followed by Klebsiella spp. (41%) and Escherichia coli (4%). Phenotypical resistance to silver nitrate was found in Enterobacter (13%) and Klebsiella (3%) isolates. The lowest carriage rate of sil genes was observed in blood isolates from the neonatology ward (24%), and the highest in blood isolates from the oncology/haematology wards (66%). Presence of sil genes was observed in international high-risk clones. Sequences of the sil and pco clusters indicated that a single mutational event in the silS gene could have caused the phenotypic resistance. Despite a restricted consumption of silver-based products in Swedish health care, silver resistance genes are widely represented in clinical isolates of Enterobacter and Klebsiella species. To avoid further selection and spread of silver-resistant bacteria with a high potential for healthcare-associated infections, the use of silver-based products needs to be controlled and the silver resistance monitored. Copyright © 2017 The Healthcare Infection Society. Published by Elsevier Ltd. All rights reserved.
Transposable element evolution in Heliconius suggests genome diversity within Lepidoptera

PubMed Central

2013-01-01

Background Transposable elements (TEs) have the potential to impact genome structure, function and evolution in profound ways. In order to understand the contribution of transposable elements (TEs) to Heliconius melpomene, we queried the H. melpomene draft sequence to identify repetitive sequences. Results We determined that TEs comprise ~25% of the genome. The predominant class of TEs (~12% of the genome) was the non-long terminal repeat (non-LTR) retrotransposons, including a novel SINE family. However, this was only slightly higher than content derived from DNA transposons, which are diverse, with several families having mobilized in the recent past. Compared to the only other well-studied lepidopteran genome, Bombyx mori, H. melpomene exhibits a higher DNA transposon content and a distinct repertoire of retrotransposons. We also found that H. melpomene exhibits a high rate of TE turnover with few older elements accumulating in the genome. Conclusions Our analysis represents the first complete, de novo characterization of TE content in a butterfly genome and suggests that, while TEs are able to invade and multiply, TEs have an overall deleterious effect and/or that maintaining a small genome is advantageous. Our results also hint that analysis of additional lepidopteran genomes will reveal substantial TE diversity within the group. PMID:24088337
Satellite DNA and Transposable Elements in Seabuckthorn (Hippophae rhamnoides), a Dioecious Plant with Small Y and Large X Chromosomes

PubMed Central

Puterova, Janka; Razumova, Olga; Martinek, Tomas; Alexandrov, Oleg; Divashuk, Mikhail; Kubat, Zdenek; Hobza, Roman; Karlov, Gennady

2017-01-01

Seabuckthorn (Hippophae rhamnoides) is a dioecious shrub commonly used in the pharmaceutical, cosmetic, and environmental industry as a source of oil, minerals and vitamins. In this study, we analyzed the transposable elements and satellites in its genome. We carried out Illumina DNA sequencing and reconstructed the main repetitive DNA sequences. For data analysis, we developed a new bioinformatics approach for advanced satellite DNA analysis and showed that about 25% of the genome consists of satellite DNA and about 24% is formed of transposable elements, dominated by Ty3/Gypsy and Ty1/Copia LTR retrotransposons. FISH mapping revealed X chromosome-accumulated, Y chromosome-specific or both sex chromosomes-accumulated satellites but most satellites were found on autosomes. Transposable elements were located mostly in the subtelomeres of all chromosomes. The 5S rDNA and 45S rDNA were localized on one autosomal locus each. Although we demonstrated the small size of the Y chromosome of the seabuckthorn and accumulated satellite DNA there, we were unable to estimate the age and extent of the Y chromosome degeneration. Analysis of dioecious relatives such as Shepherdia would shed more light on the evolution of these sex chromosomes. PMID:28057732
The human myelin oligodendrocyte glycoprotein (MOG) gene: Complete nucleotide sequence and structural characterization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Paule Roth, M.; Malfroy, L.; Offer, C.

1995-07-20

Human myelin oligodendrocyte glycoprotein (MOG), a myelin component of the central nervous system, is a candidate target antigen for autoimmune-mediated demyelination. We have isolated and sequenced part of a cosmid clone that contains the entire human MOG gene. The primary nuclear transcript, extending from the putative start of transcription to the site of poly(A) addition, is 15,561 nucleotides in length. The human MOG gene contains 8 exons, separated by 7 introns; canonical intron/exon boundary sites are observed at each junction. The introns vary in size from 242 to 6484 bp and contain numerous repetitive DNA elements, including 14 Alu sequences within 3 introns. Another Alu element is located in the 3'-untranslated region of the gene. Alu sequences were classified with respect to subfamily assignment. Seven hundred sixty-three nucleotides 5' of the transcription start and 1214 nucleotides 3' of the poly(A) addition sites were also sequenced. The 5'-flanking region revealed the presence of several consensus sequences that could be relevant in the transcription of the MOG gene, in particular binding sites in common with other myelin gene promoters. Two polymorphic intragenic dinucleotide (CA)n and tetranucleotide (TAAA)n repeats were identified and may provide genetic marker tools for association and linkage studies. 50 refs., 3 figs., 3 tabs.
Structure and transcriptional impact of divergent repetitive elements inserted within Phanerochaete chrysosporium strain RP-78 genes

Treesearch

Luis F. Larrondo; Paulo Canessa; Rafael Vicuna; Philip Stewart; Amber Vanden Wymelenberg; Dan Cullen

2007-01-01

We describe the structure, organization, and transcriptional impact of repetitive elements within the lignin-degrading basidiomycete, Phanerochaete chrysosporium. Searches of the P. chrysosporium genome revealed five copies of pce1, a 1,750-nt non-autonomous, class II element. Alleles encoding a putative glucosyltransferase and a cytochrome P450 harbor pce insertions...
High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development.

PubMed

Daccord, Nicolas; Celton, Jean-Marc; Linsmith, Gareth; Becker, Claude; Choisne, Nathalie; Schijlen, Elio; van de Geest, Henri; Bianco, Luca; Micheletti, Diego; Velasco, Riccardo; Di Pierro, Erica Adele; Gouzy, Jérôme; Rees, D Jasper G; Guérif, Philippe; Muranty, Hélène; Durel, Charles-Eric; Laurens, François; Lespinasse, Yves; Gaillard, Sylvain; Aubourg, Sébastien; Quesneville, Hadi; Weigel, Detlef; van de Weg, Eric; Troggio, Michela; Bucher, Etienne

2017-07-01

Using the latest sequencing and optical mapping technologies, we have produced a high-quality de novo assembly of the apple (Malus domestica Borkh.) genome. Repeat sequences, which represented over half of the assembly, provided an unprecedented opportunity to investigate the uncharacterized regions of a tree genome; we identified a new hyper-repetitive retrotransposon sequence that was over-represented in heterochromatic regions and estimated that a major burst of different transposable elements (TEs) occurred 21 million years ago. Notably, the timing of this TE burst coincided with the uplift of the Tian Shan mountains, which is thought to be the center of the location where the apple originated, suggesting that TEs and associated processes may have contributed to the diversification of the apple ancestor and possibly to its divergence from pear. Finally, genome-wide DNA methylation data suggest that epigenetic marks may contribute to agronomically relevant aspects, such as apple fruit development.
Properties of some monkey DNA sequences obtained by a procedure that enriches for DNA replication origins.

PubMed

Zannis-Hadjopoulos, M; Kaufmann, G; Wang, S S; Lechner, R L; Karawya, E; Hesse, J; Martin, R G

1985-07-01

Twelve clones of monkey DNA obtained by a procedure that enriches 10^3- to 10^4-fold for nascent sequences activated early in S phase (G. Kaufmann, M. Zannis-Hadjopoulos, and R. G. Martin, Mol. Cell. Biol. 5:721-727, 1985) have been examined. Only 2 of the 12 ors sequences (origin-enriched sequences) are unique (ors1 and ors8). Three contain the highly reiterated Alu family (ors3, ors9, and ors11). One contains the highly reiterated alpha-satellite family (ors12), but none contain the Kpn family. Those remaining contain middle repetitive sequences. Two examples of the same middle repetitive sequence were found (ors2 and ors6). Three of the middle repetitive sequences (the ors2-ors6 pair, ors5, and ors10) are moderately dispersed; one (ors4) is highly dispersed. The last, ors7, has been mapped to the bona fide replication origin of the D loop of mitochondrial DNA. Of the nine ors sequences tested, half possess snapback (intrachain reannealing) properties.
RAD tag sequencing as a source of SNP markers in Cynara cardunculus L

PubMed Central

2012-01-01

Background The globe artichoke (Cynara cardunculus L. var. scolymus) genome is relatively poorly explored, especially compared to those of the other major Asteraceae crops sunflower and lettuce. No SNP markers are in the public domain. We have combined the recently developed restriction-site associated DNA (RAD) approach with the Illumina DNA sequencing platform to effect the rapid and mass discovery of SNP markers for C. cardunculus. Results RAD tags were sequenced from the genomic DNA of three C. cardunculus mapping population parents, generating 9.7 million reads, corresponding to ~1 Gbp of sequence. An assembly based on paired ends produced ~6.0 Mbp of genomic sequence, separated into ~19,000 contigs (mean length 312 bp), of which ~21% were fragments of putative coding sequence. The shared sequences allowed for the discovery of ~34,000 SNPs and nearly 800 indels, equivalent to a SNP frequency of 5.6 per 1,000 nt, and an indel frequency of 0.2 per 1,000 nt. A sample of heterozygous SNP loci was mapped by CAPS assays and this exercise provided validation of our mining criteria. The repetitive fraction of the genome had a high representation of retrotransposon sequence, followed by simple repeats, AT-low complexity regions and mobile DNA elements. The genomic k-mer distribution and CpG rate of C. cardunculus, compared with data derived from three whole genome-sequenced dicot species, provided further evidence of the random representation of the C. cardunculus genome generated by RAD sampling. Conclusion The RAD tag sequencing approach is a cost-effective and rapid method to develop SNP markers in a highly heterozygous species. Our approach allowed us to generate a large and robust SNP dataset through the adoption of optimized filtering criteria. PMID:22214349
A Complete and Accurate Ab Initio Repeat Finding Algorithm.

PubMed

Lian, Shuaibin; Chen, Xinwu; Wang, Peng; Zhang, Xiaoli; Dai, Xianhua

2016-03-01

It has become clear that repetitive sequences have played multiple roles in eukaryotic genome evolution, including increasing genetic diversity through mutation, changing gene expression and facilitating the generation of novel genes. However, ab initio identification of repetitive elements can be difficult. Several classical ab initio repeat-finding tools have already been presented and compared, but their completeness and accuracy in detecting repeats are rather poor. To this end, we proposed a new ab initio repeat finding tool, named HashRepeatFinder, which is based on hash index and word counting. Furthermore, we assessed the performance of HashRepeatFinder against two other well-known tools, RepeatScout and Repeatfinder, on the human genome assembly hg19. The results indicated the following three conclusions: (1) The completeness of HashRepeatFinder is the best among the three compared tools in almost all chromosomes, especially in chr9 (8 times that of RepeatScout, 10 times that of Repeatfinder); (2) in terms of detecting large repeats, HashRepeatFinder also performed best in all chromosomes, especially in chr3 (24 times that of RepeatScout and 250 times that of Repeatfinder) and chr19 (12 times that of RepeatScout and 60 times that of Repeatfinder); (3) in terms of accuracy, HashRepeatFinder can merge the abundant repeats with high accuracy.
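The general idea of a hash index combined with word counting can be sketched as follows. This is an illustration of the generic technique only, not the HashRepeatFinder implementation; the function names, the word length k, and the toy sequence are assumptions made for the example.

```python
from collections import defaultdict

def index_kmers(sequence, k):
    """Build a hash index mapping every word of length k to its start positions."""
    positions = defaultdict(list)
    for i in range(len(sequence) - k + 1):
        positions[sequence[i:i + k]].append(i)
    return positions

def repeated_kmers(sequence, k, min_count=2):
    """Keep only the words that occur at least min_count times."""
    return {
        kmer: pos
        for kmer, pos in index_kmers(sequence, k).items()
        if len(pos) >= min_count
    }

# Toy sequence (hypothetical): the 8-mer ACGTTGCA occurs twice in tandem.
toy = "ACGTTGCAACGTTGCATTT"
repeats = repeated_kmers(toy, k=8)
print(repeats)
```

A real repeat finder would additionally have to extend and merge neighbouring seed words, tolerate mismatches, and handle the reverse strand; none of that is attempted here.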
Molecular Design of Performance Proteins With Repetitive Sequences

NASA Astrophysics Data System (ADS)

Vendrely, Charlotte; Ackerschott, Christian; Römer, Lin; Scheibel, Thomas

Most performance proteins responsible for the mechanical stability of cells and organisms contain highly repetitive sequences. Mimicking such performance proteins is of high interest for the design of nanostructured biomaterials. In this article, flagelliform silk is introduced as an example to describe a general principle for designing genes of repetitive performance proteins for recombinant expression in Escherichia coli. In the first step, repeating amino acid sequence motifs are reverse-translated into DNA cassettes, which can in a second step be seamlessly ligated, yielding a designed gene. Recombinant expression thereof leads to proteins mimicking the natural ones. The recombinant proteins can be assembled into nanostructured materials in a controlled manner, allowing their use in several applications.
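The two design steps named in this abstract (turning an amino acid motif into a DNA cassette, then joining cassettes into a gene) can be sketched in a few lines. The motif, the one-codon-per-residue table, and the function names below are hypothetical choices made for illustration; they are not taken from the article, and a real design would also weigh codon usage, restriction sites for ligation, and the stability of repetitive DNA.

```python
# One codon per amino acid, chosen here only for illustration (assumption).
CODONS = {"G": "GGT", "P": "CCG", "A": "GCG", "S": "AGC", "Y": "TAT"}

def reverse_translate(motif):
    """Step 1: convert an amino acid motif into a DNA cassette."""
    return "".join(CODONS[aa] for aa in motif)

def build_gene(motif, n_repeats):
    """Step 2: join n_repeats copies of the cassette head to tail."""
    return reverse_translate(motif) * n_repeats

# Hypothetical five-residue repeat motif.
cassette = reverse_translate("GPGGA")
gene = build_gene("GPGGA", 3)
print(cassette)
print(len(gene))
```

Simple string concatenation stands in for seamless ligation here, which is why the sketch leaves no extra codons between repeats.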
Mobile DNA in cancer. Extensive transduction of nonrepetitive DNA mediated by L1 retrotransposition in cancer genomes.

PubMed

Tubio, Jose M C; Li, Yilong; Ju, Young Seok; Martincorena, Inigo; Cooke, Susanna L; Tojo, Marta; Gundem, Gunes; Pipinikas, Christodoulos P; Zamora, Jorge; Raine, Keiran; Menzies, Andrew; Roman-Garcia, Pablo; Fullam, Anthony; Gerstung, Moritz; Shlien, Adam; Tarpey, Patrick S; Papaemmanuil, Elli; Knappskog, Stian; Van Loo, Peter; Ramakrishna, Manasa; Davies, Helen R; Marshall, John; Wedge, David C; Teague, Jon W; Butler, Adam P; Nik-Zainal, Serena; Alexandrov, Ludmil; Behjati, Sam; Yates, Lucy R; Bolli, Niccolo; Mudie, Laura; Hardy, Claire; Martin, Sancha; McLaren, Stuart; O'Meara, Sarah; Anderson, Elizabeth; Maddison, Mark; Gamble, Stephen; Foster, Christopher; Warren, Anne Y; Whitaker, Hayley; Brewer, Daniel; Eeles, Rosalind; Cooper, Colin; Neal, David; Lynch, Andy G; Visakorpi, Tapio; Isaacs, William B; Veer, Laura Van't; Caldas, Carlos; Desmedt, Christine; Sotiriou, Christos; Aparicio, Sam; Foekens, John A; Eyfjörd, Jórunn Erla; Lakhani, Sunil R; Thomas, Gilles; Myklebost, Ola; Span, Paul N; Børresen-Dale, Anne-Lise; Richardson, Andrea L; Van de Vijver, Marc; Vincent-Salomon, Anne; Van den Eynden, Gert G; Flanagan, Adrienne M; Futreal, P Andrew; Janes, Sam M; Bova, G Steven; Stratton, Michael R; McDermott, Ultan; Campbell, Peter J

2014-08-01

Long interspersed nuclear element-1 (L1) retrotransposons are mobile repetitive elements that are abundant in the human genome. L1 elements propagate through RNA intermediates. In the germ line, neighboring, nonrepetitive sequences are occasionally mobilized by the L1 machinery, a process called 3' transduction. Because 3' transductions are potentially mutagenic, we explored the extent to which they occur somatically during tumorigenesis. Studying cancer genomes from 244 patients, we found that tumors from 53% of the patients had somatic retrotranspositions, of which 24% were 3' transductions. Fingerprinting of donor L1s revealed that a handful of source L1 elements in a tumor can spawn from tens to hundreds of 3' transductions, which can themselves seed further retrotranspositions. The activity of individual L1 elements fluctuated during tumor evolution and correlated with L1 promoter hypomethylation. The 3' transductions disseminated genes, exons, and regulatory elements to new locations, most often to heterochromatic regions of the genome. Copyright © 2014, American Association for the Advancement of Science.
Genome sequencing of the sweetpotato whitefly Bemisia tabaci MED/Q.

PubMed

Xie, Wen; Chen, Chunhai; Yang, Zezhong; Guo, Litao; Yang, Xin; Wang, Dan; Chen, Ming; Huang, Jinqun; Wen, Yanan; Zeng, Yang; Liu, Yating; Xia, Jixing; Tian, Lixia; Cui, Hongying; Wu, Qingjun; Wang, Shaoli; Xu, Baoyun; Li, Xianchun; Tan, Xinqiu; Ghanim, Murad; Qiu, Baoli; Pan, Huipeng; Chu, Dong; Delatte, Helene; Maruthi, M N; Ge, Feng; Zhou, Xueping; Wang, Xiaowei; Wan, Fanghao; Du, Yuzhou; Luo, Chen; Yan, Fengming; Preisser, Evan L; Jiao, Xiaoguo; Coates, Brad S; Zhao, Jinyang; Gao, Qiang; Xia, Jinquan; Yin, Ye; Liu, Yong; Brown, Judith K; Zhou, Xuguo Joe; Zhang, Youjun

2017-05-01

The sweetpotato whitefly Bemisia tabaci is a highly destructive agricultural and ornamental crop pest. It damages host plants through both phloem feeding and vectoring plant pathogens. Introductions of B. tabaci are difficult to quarantine and eradicate because of its high reproductive rates, broad host plant range, and insecticide resistance. A total of 791 Gb of raw DNA sequence from whole genome shotgun sequencing, and 13 BAC pooling libraries were generated by Illumina sequencing using different combinations of mate-pair and pair-end libraries. Assembly gave a final genome with a scaffold N50 of 437 kb, and a total length of 658 Mb. Annotation of repetitive elements and coding regions resulted in 265.0 Mb TEs (40.3%) and 20 786 protein-coding genes with putative gene family expansions, respectively. Phylogenetic analysis based on orthologs across 14 arthropod taxa suggested that MED/Q is clustered into a hemipteran clade containing A. pisum and is a sister lineage to a clade containing both R. prolixus and N. lugens. Genome completeness, as estimated using the CEGMA and Benchmarking Universal Single-Copy Orthologs pipelines, reached 96% and 79%. These MED/Q genomic resources lay a foundation for future 'pan-genomic' comparisons of invasive vs. noninvasive, invasive vs. invasive, and native vs. exotic Bemisia, which, in return, will open up new avenues of investigation into whitefly biology, evolution, and management. © The Author 2017. Published by Oxford University Press.
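The scaffold N50 quoted in this abstract is a standard assembly statistic: the length of the scaffold at which the cumulative length, summed from the longest scaffold downwards, first reaches half of the total assembly length. The sketch below computes it for a made-up list of scaffold lengths (the real scaffold lengths are not given in the abstract) and also re-derives the reported TE percentage from the two figures that are given.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # First scaffold at which the running sum covers half the assembly.
        if running * 2 >= total:
            return length
    raise ValueError("no lengths supplied")

# Hypothetical scaffold lengths, for illustration only.
scaffolds = [500, 400, 300, 200, 100]
print(n50(scaffolds))

# Figures taken from the abstract: 265.0 Mb of TEs in a 658 Mb assembly.
te_percent = 265.0 / 658 * 100
print(round(te_percent, 1))
```

The second calculation agrees with the 40.3% stated in the abstract.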
Differential repetitive DNA composition in the centromeric region of chromosomes of Amazonian lizard species in the family Teiidae

PubMed Central

Carvalho, Natalia D. M.; Carmo, Edson; Neves, Rogerio O.; Schneider, Carlos Henrique; Gross, Maria Claudia

2016-01-01

Abstract Differences in heterochromatin distribution patterns and its composition were observed in Amazonian teiid species. Studies have shown repetitive DNA harbors heterochromatic blocks which are located in centromeric and telomeric regions in Ameiva ameiva (Linnaeus, 1758), Kentropyx calcarata (Spix, 1825), Kentropyx pelviceps (Cope, 1868), and Tupinambis teguixin (Linnaeus, 1758). In Cnemidophorus sp.1, repetitive DNA has multiple signals along all chromosomes. The aim of this study was to characterize moderately and highly repetitive DNA sequences by Cot1-DNA from Ameiva ameiva and Cnemidophorus sp.1 genomes through cloning and DNA sequencing, as well as mapping them chromosomally to better understand its organization and genome dynamics. The results of sequencing of DNA libraries obtained by Cot1-DNA showed that different microsatellites, transposons, retrotransposons, and some gene families also comprise the fraction of repetitive DNA in the teiid species. FISH using Cot1-DNA probes isolated from both Ameiva ameiva and Cnemidophorus sp.1 showed these sequences mainly located in heterochromatic centromeric, and telomeric regions in Ameiva ameiva, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin chromosomes, indicating they play structural and functional roles in the genome of these species. In Cnemidophorus sp.1, Cot1-DNA probe isolated from Ameiva ameiva had multiple interstitial signals on chromosomes, whereas mapping of Cot1-DNA isolated from the Ameiva ameiva and Cnemidophorus sp.1 highlighted centromeric regions of some chromosomes. Thus, the data obtained showed that many repetitive DNA classes are part of the genome of Ameiva ameiva, Cnemidophorus sp.1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin, and these sequences are shared among the analyzed teiid species, but they were not always allocated at the same chromosome position. PMID:27551343
Burkholderia sp. induces functional nodules on the South African invasive legume Dipogon lignosus (Phaseoleae) in New Zealand soils.

PubMed

Liu, Wendy Y Y; Ridgway, Hayley J; James, Trevor K; James, Euan K; Chen, Wen-Ming; Sprent, Janet I; Young, J Peter W; Andrews, Mitchell

2014-10-01

The South African invasive legume Dipogon lignosus (Phaseoleae) produces nodules with both determinate and indeterminate characteristics in New Zealand (NZ) soils. Ten bacterial isolates produced functional nodules on D. lignosus. The 16S ribosomal RNA (rRNA) gene sequences identified one isolate as Bradyrhizobium sp., one isolate as Rhizobium sp. and eight isolates as Burkholderia sp. The Bradyrhizobium sp. and Rhizobium sp. 16S rRNA sequences were identical to those of strains previously isolated from crop plants and may have originated from inocula used on crops. Both 16S rRNA and DNA recombinase A (recA) gene sequences placed the eight Burkholderia isolates separate from previously described Burkholderia rhizobial species. However, the isolates showed a very close relationship to Burkholderia rhizobial strains isolated from South African plants with respect to their nitrogenase iron protein (nifH), N-acyltransferase nodulation protein A (nodA) and N-acetylglucosaminyl transferase nodulation protein C (nodC) gene sequences. Gene sequences and enterobacterial repetitive intergenic consensus (ERIC) PCR and repetitive element palindromic PCR (rep-PCR) banding patterns indicated that the eight Burkholderia isolates separated into five clones of one strain and three of another. One strain was tested and shown to produce functional nodules on a range of South African plants previously reported to be nodulated by Burkholderia tuberum STM678(T) which was isolated from the Cape Region. Thus, evidence is strong that the Burkholderia strains isolated here originated in South Africa and were somehow transported with the plants from their native habitat to NZ. It is possible that the strains are of a new species capable of nodulating legumes.
The Cotton Centromere Contains a Ty3-gypsy-like LTR Retroelement

PubMed Central

Luo, Song; Mach, Jennifer; Abramson, Bradley; Ramirez, Rolando; Schurr, Robert; Barone, Pierluigi; Copenhaver, Gregory; Folkerts, Otto

2012-01-01

The centromere is a repeat-rich structure essential for chromosome segregation; with the long-term aim of understanding centromere structure and function, we set out to identify cotton centromere sequences. To isolate centromere-associated sequences from cotton (Gossypium hirsutum), we surveyed tandem and dispersed repetitive DNA in the genus. Centromere-associated elements in other plants include tandem repeats and, in some cases, centromere-specific retroelements. Examination of cotton genomic survey sequences for tandem repeats yielded sequences that did not localize to the centromere. However, among the repetitive sequences we also identified a gypsy-like LTR retrotransposon (Centromere Retroelement Gossypium, CRG) that localizes to the centromere region of all chromosomes in domestic upland cotton, Gossypium hirsutum, the major commercially grown cotton. The location of the functional centromere was confirmed by immunostaining with antiserum to the centromere-specific histone CENH3, which co-localizes with CRG hybridization on metaphase mitotic chromosomes. G. hirsutum is an allotetraploid composed of A and D genomes and CRG is also present in the centromere regions of other AD cotton species. Furthermore, FISH and genomic dot blot hybridization revealed that CRG is found in D-genome diploid cotton species, but not in A-genome diploid species, indicating that this retroelement may have invaded the A-genome centromeres during allopolyploid formation and amplified during evolutionary history. CRG is also found in other diploid Gossypium species, including B and E2 genome species, but not in the C, E1, F, and G genome species tested. Isolation of this centromere-specific retrotransposon from Gossypium provides a probe for further understanding of centromere structure, and a tool for future engineering of centromere mini-chromosomes in this important crop species. PMID:22536361

Speech Sequence Skill Learning in Adults Who Stutter

ERIC Educational Resources Information Center

Bauerly, Kim R.; De Nil, Luc F.

2011-01-01

The present study compared the ability of 12 people who stutter (PWS) and 12 people who do not stutter (PNS) to consolidate a novel sequential speech task. Participants practiced 100 repetitions of a single, monosyllabic, nonsense word sequence during an initial practice session and returned 24 h later to perform an additional 50 repetitions.…
Is perfectionism associated with academic burnout through repetitive negative thinking?

PubMed

Garratt-Reed, David; Howell, Joel; Hayes, Lana; Boyes, Mark

2018-01-01

Academic burnout is prevalent among university students, although understanding of what predicts burnout is limited. This study aimed to test the direct and indirect relationship between two dimensions of perfectionism (Perfectionistic Concerns and Perfectionistic Strivings) and the three elements of Academic Burnout (Exhaustion, Inadequacy, and Cynicism) through Repetitive Negative Thinking. In a cross-sectional survey, undergraduate students (n = 126, M age = 23.64, 79% female) completed well-validated measures of Perfectionism, Repetitive Negative Thinking, and Academic Burnout. Perfectionistic Concerns was directly associated with all elements of burnout, as well as indirectly associated with Exhaustion and Cynicism via Repetitive Negative Thinking. Perfectionistic Strivings was directly associated with less Inadequacy and Cynicism; however, there were no indirect associations between Perfectionistic Strivings and Academic Burnout operating through Repetitive Negative Thinking. Repetitive Negative Thinking was also directly related to more burnout Exhaustion and Inadequacy, but not Cynicism. It is concluded that future research should investigate whether interventions targeting Perfectionistic Concerns and Repetitive Negative Thinking can reduce Academic Burnout in university students.
Is perfectionism associated with academic burnout through repetitive negative thinking?

PubMed Central

Garratt-Reed, David; Hayes, Lana; Boyes, Mark

2018-01-01

Academic burnout is prevalent among university students, although understanding of what predicts burnout is limited. This study aimed to test the direct and indirect relationship between two dimensions of perfectionism (Perfectionistic Concerns and Perfectionistic Strivings) and the three elements of Academic Burnout (Exhaustion, Inadequacy, and Cynicism) through Repetitive Negative Thinking. In a cross-sectional survey, undergraduate students (n = 126, M age = 23.64, 79% female) completed well-validated measures of Perfectionism, Repetitive Negative Thinking, and Academic Burnout. Perfectionistic Concerns was directly associated with all elements of burnout, as well as indirectly associated with Exhaustion and Cynicism via Repetitive Negative Thinking. Perfectionistic Strivings was directly associated with less Inadequacy and Cynicism; however, there were no indirect associations between Perfectionistic Strivings and Academic Burnout operating through Repetitive Negative Thinking. Repetitive Negative Thinking was also directly related to more burnout Exhaustion and Inadequacy, but not Cynicism. It is concluded that future research should investigate whether interventions targeting Perfectionistic Concerns and Repetitive Negative Thinking can reduce Academic Burnout in university students. PMID:29938132
Pivotal Impacts of Retrotransposon Based Invasive RNAs on Evolution.

PubMed

Habibi, Laleh; Salmani, Hamzeh

2017-01-01

RNAs have long been described as the mediators of gene expression; they play a vital role in the structure and function of cellular complexes. Although the role of RNAs in prokaryotes is mainly confined to these basic functions, the effects of these molecules in regulating gene expression and enzymatic activities have been discovered in eukaryotes. Recently, high-resolution analysis of the DNA obtained from different organisms has revealed a fundamental impact of RNAs in shaping genomes, heterochromatin formation, and gene creation. Deep sequencing of the human genome revealed that about half of our DNA is comprised of repetitive sequences (remnants of transposable element movements) expanded mostly through RNA-mediated processes. ORF2 encoded by L1 retrotransposons is a cellular reverse transcriptase which is mainly responsible for RNA invasion of various transposable elements (L1s, Alus, and SVAs) and cellular mRNAs into the genomic DNA. In addition to increased retroelement copy number, genomic expansion in association with centromere, telomere, and heterochromatin formation, as well as pseudogene creation, are evolutionary consequences of this RNA-based activity. Threatening DNA integrity by disrupting genes and forming excessive double-strand breaks is another effect of this invasion. Therefore, repressive mechanisms have evolved to control the activities of these invasive intracellular RNAs. All these mechanisms now have essential roles in complex cellular functions. Therefore, it can be concluded that without direct action of RNA networks in shaping the genome and in the development of different cellular mechanisms, the evolution of higher eukaryotes would not be possible.
Pivotal Impacts of Retrotransposon Based Invasive RNAs on Evolution

PubMed Central

Habibi, Laleh; Salmani, Hamzeh

2017-01-01

RNAs have long been described as the mediators of gene expression; they play a vital role in the structure and function of cellular complexes. Although the role of RNAs in prokaryotes is mainly confined to these basic functions, the effects of these molecules in regulating gene expression and enzymatic activities have been discovered in eukaryotes. Recently, high-resolution analysis of the DNA obtained from different organisms has revealed a fundamental impact of RNAs in shaping genomes, heterochromatin formation, and gene creation. Deep sequencing of the human genome revealed that about half of our DNA is comprised of repetitive sequences (remnants of transposable element movements) expanded mostly through RNA-mediated processes. ORF2 encoded by L1 retrotransposons is a cellular reverse transcriptase which is mainly responsible for RNA invasion of various transposable elements (L1s, Alus, and SVAs) and cellular mRNAs into the genomic DNA. In addition to increased retroelement copy number, genomic expansion in association with centromere, telomere, and heterochromatin formation, as well as pseudogene creation, are evolutionary consequences of this RNA-based activity. Threatening DNA integrity by disrupting genes and forming excessive double-strand breaks is another effect of this invasion. Therefore, repressive mechanisms have evolved to control the activities of these invasive intracellular RNAs. All these mechanisms now have essential roles in complex cellular functions. Therefore, it can be concluded that without direct action of RNA networks in shaping the genome and in the development of different cellular mechanisms, the evolution of higher eukaryotes would not be possible. PMID:29067016
Meeting Report: The Role of the Mobilome in Cancer.

PubMed

Ardeljan, Daniel; Taylor, Martin S; Burns, Kathleen H; Boeke, Jef D; Espey, Michael Graham; Woodhouse, Elisa C; Howcroft, Thomas Kevin

2016-08-01

Approximately half of the human genome consists of repetitive sequence attributed to the activities of mobile DNAs, including DNA transposons, RNA transposons, and endogenous retroviruses. Of these, only long interspersed elements (LINE-1 or L1) and sequences copied by LINE-1 remain mobile in our species today. Although cells restrict L1 activity by both transcriptional and posttranscriptional mechanisms, L1 derepression occurs in developmental and pathologic contexts, including many types of cancers. However, we have limited knowledge of the extent and consequences of L1 expression in premalignancies and cancer. Participants in this NIH strategic workshop considered key questions to enhance our understanding of mechanisms and roles the mobilome may play in cancer biology. Cancer Res; 76(15); 4316-9. ©2016 AACR.
History of CRISPR-Cas from Encounter with a Mysterious Repeated Sequence to Genome Editing Technology.

PubMed

Ishino, Yoshizumi; Krupovic, Mart; Forterre, Patrick

2018-04-01

Clustered regularly interspaced short palindromic repeat (CRISPR)-Cas systems are well-known acquired immunity systems that are widespread in archaea and bacteria. The RNA-guided nucleases from CRISPR-Cas systems are currently regarded as the most reliable tools for genome editing and engineering. The first hint of their existence came in 1987, when an unusual repetitive DNA sequence, which subsequently was defined as a CRISPR, was discovered in the Escherichia coli genome during an analysis of genes involved in phosphate metabolism. Similar sequence patterns were then reported in a range of other bacteria as well as in halophilic archaea, suggesting an important role for such evolutionarily conserved clusters of repeated sequences. A critical step toward functional characterization of the CRISPR-Cas systems was the recognition of a link between CRISPRs and the associated Cas proteins, which were initially hypothesized to be involved in DNA repair in hyperthermophilic archaea. Comparative genomics, structural biology, and advanced biochemistry could then work hand in hand, not only culminating in the explosion of genome editing tools based on CRISPR-Cas9 and other class II CRISPR-Cas systems but also providing insights into the origin and evolution of this system from mobile genetic elements denoted casposons. To celebrate the 30th anniversary of the discovery of CRISPR, this minireview briefly discusses the fascinating history of CRISPR-Cas systems, from the original observation of an enigmatic sequence in E. coli to genome editing in humans. Copyright © 2018 American Society for Microbiology.
Massive GGAAs in genomic repetitive sequences serve as a nuclear reservoir of NF-κB.

PubMed

Wu, Jian; Wang, Qiao; Dai, Wei; Wang, Wei; Yue, Ming; Wang, Jinke

2018-04-13

Nuclear factor κB (NF-κB) is a DNA-binding transcription factor. Characterizing its genomic binding sites is crucial for understanding its gene regulatory function and mechanism in cells. This study characterized the binding sites of NF-κB RelA/p65 in tumor necrosis factor-α (TNFα)-stimulated HeLa cells by precise chromatin immunoprecipitation sequencing (ChIP-seq). The results revealed that NF-κB binds nontraditional motifs (nt-motifs) containing a conserved GGAA quadruplet. Moreover, nt-motifs are mainly distributed in peaks near centromeres that contain a larger number of repetitive elements such as satellites, simple repeats and short interspersed nuclear elements (SINEs). This intracellular binding pattern was then confirmed by in vitro detection, indicating that NF-κB dimers can bind the nontraditional κB (nt-κB) sites with low affinity. However, this binding hardly activates transcription. This study thus deduced that NF-κB binding to nt-motifs may serve functions other than the gene regulation associated with NF-κB binding to traditional motifs (t-motifs). To test this deduction, ChIP-seq data from many other cell lines were then analyzed. The results indicate that NF-κB binding to nt-motifs is also widely present in other cells. The ChIP-seq data analysis also revealed that nt-motifs are more widely distributed in peaks with low fold enrichment. Importantly, it was also found that NF-κB binding to nt-motifs is mainly present in resting cells, whereas NF-κB binding to t-motifs is mainly present in stimulated cells. Astonishingly, no known function was enriched in the gene annotation of nt-motif peaks. Based on these results, this study proposed that the nt-κB sites that are extensively distributed in large numbers of repeat elements function as a nuclear reservoir of NF-κB. The nuclear NF-κB proteins stored at nt-κB sites in resting cells may be recruited to the t-κB sites to regulate target genes upon stimulation. Copyright © 2018 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. Published by Elsevier Ltd. All rights reserved.
Repetitive element signature-based visualization, distance computation, and classification of 1766 microbial genomes.

PubMed

Lee, Kang-Hoon; Shin, Kyung-Seop; Lim, Debora; Kim, Woo-Chan; Chung, Byung Chang; Han, Gyu-Bum; Roh, Jeongkyu; Cho, Dong-Ho; Cho, Kiho

2015-07-01

The genomes of living organisms are populated with pleomorphic repetitive elements (REs) of varying densities. Our hypothesis that genomic RE landscapes are species/strain/individual-specific was implemented into the Genome Signature Imaging system to visualize and compute the RE-based signatures of any genome. Following the occurrence profiling of 5-nucleotide REs/words, the information from top-50 frequency words was transformed into a genome-specific signature and visualized as Genome Signature Images (GSIs), using a CMYK scheme. An algorithm for computing distances among GSIs was formulated using the GSIs' variables (word identity, frequency, and frequency order). The utility of the GSI-distance computation system was demonstrated with control genomes. GSI-based computation of genome-relatedness among 1766 microbes (117 archaea and 1649 bacteria) identified their clustering patterns; although the majority paralleled the established classification, some did not. The Genome Signature Imaging system, with its visualization and distance computation functions, enables genome-scale evolutionary studies involving numerous genomes with varying sizes. Copyright © 2015 Elsevier Inc. All rights reserved.
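The occurrence-profiling step described in this abstract (counting 5-nucleotide words and keeping the most frequent ones) can be illustrated with a rough sketch. This is not the published Genome Signature Imaging implementation: the function name `top_words`, the single-strand overlapping count, the skipping of words with ambiguous bases, and the alphabetical tie-breaking are all assumptions made for illustration.

```python
from collections import Counter


def top_words(sequence, k=5, n=50):
    """Return the n most frequent overlapping k-nucleotide words as (word, count) pairs.

    Hypothetical helper: words containing characters other than A, C, G, T
    are skipped, and only the given strand is counted.
    """
    seq = sequence.upper()
    valid = set("ACGT")
    counts = Counter()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if set(word) <= valid:
            counts[word] += 1
    # Sort by descending count, then alphabetically, so ties are deterministic.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))[:n]


if __name__ == "__main__":
    # Toy sequence only; a real genome would be read from a FASTA file.
    for word, count in top_words("ACGTACGTACGT", k=5, n=2):
        print(word, count)
```

The ranked list of (word, count) pairs carries the three variables the abstract names (word identity, frequency, and frequency order); how those are turned into an image or a distance is not specified in the abstract and is left out here.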
Advances in Understanding Stimulus Responsive Phase Behavior of Intrinsically Disordered Protein Polymers.

PubMed

Ruff, Kiersten M; Roberts, Stefan; Chilkoti, Ashutosh; Pappu, Rohit V

2018-06-24

Proteins and synthetic polymers can undergo phase transitions in response to changes to intensive solution parameters such as temperature, proton chemical potentials (pH), and hydrostatic pressure. For proteins and protein-based polymers, the information required for stimulus responsive phase transitions is encoded in their amino acid sequence. Here, we review some of the key physical principles that govern the phase transitions of archetypal intrinsically disordered protein polymers (IDPPs). These are disordered proteins with highly repetitive amino acid sequences. Advances in recombinant technologies have enabled the design and synthesis of protein sequences of a variety of sequence complexities and lengths. We summarize insights that have been gleaned from the design and characterization of IDPPs that undergo thermo-responsive phase transitions and build on these insights to present a general framework for IDPPs with pH and pressure responsive phase behavior. In doing so, we connect the stimulus responsive phase behavior of IDPPs with repetitive sequences to the coil-to-globule transitions that these sequences undergo at the single chain level in response to changes in stimuli. The proposed framework and ongoing studies of stimulus responsive phase behavior of designed IDPPs have direct implications in bioengineering, where designing sequences with bespoke material properties broadens the spectrum of applications, and in biology and medicine for understanding the sequence-specific driving forces for the formation of protein-based membraneless organelles as well as biological matrices that act as scaffolds for cells and mediators of cell-to-cell communication. Copyright © 2018. Published by Elsevier Ltd.
RepeatsDB-lite: a web server for unit annotation of tandem repeat proteins.

PubMed

Hirsh, Layla; Paladin, Lisanna; Piovesan, Damiano; Tosatto, Silvio C E

2018-05-09

RepeatsDB-lite (http://protein.bio.unipd.it/repeatsdb-lite) is a web server for the prediction of repetitive structural elements and units in tandem repeat (TR) proteins. TRs are a widespread but poorly annotated class of non-globular proteins carrying heterogeneous functions. RepeatsDB-lite extends the prediction to all TR types and strongly improves the performance both in terms of computational time and accuracy over previous methods, with precision above 95% for solenoid structures. The algorithm exploits an improved TR unit library derived from the RepeatsDB database to perform an iterative structural search and assignment. The web interface provides tools for analyzing the evolutionary relationships between units and for manually refining the prediction by changing unit positions and protein classification. An all-against-all structure-based sequence similarity matrix is calculated and visualized in real-time for every user edit. Reviewed predictions can be submitted to RepeatsDB for review and inclusion.
The Tolypocladium inflatum CPA element encodes a RecQ helicase-like gene.

PubMed

Kempken, Frank

2008-12-01

Previously, a repetitive CPA element was discovered in the genome of the filamentous fungus Tolypocladium inflatum; however, no further characterization was technically possible at that time. In this study, PCR amplification was used to detect a 4 kb conserved portion of the CPA element that appeared to be present in most, if not all, genomic CPA elements. The amplicons included a large open reading frame that was most similar to a RecQ helicase-like gene from Metarhizium anisopliae. The repetitive nature of the CPA element suggests that it is related to the eukaryotic Helitron class of transposable elements.
DNA methylation of retrotransposons, DNA transposons and genes in sugar beet (Beta vulgaris L.).

PubMed

Zakrzewski, Falk; Schmidt, Martin; Van Lijsebettens, Mieke; Schmidt, Thomas

2017-06-01

The methylation of cytosines shapes the epigenetic landscape of plant genomes, coordinates transgenerational epigenetic inheritance, represses the activity of transposable elements (TEs), affects gene expression and, hence, can influence the phenotype. Sugar beet (Beta vulgaris ssp. vulgaris), an important crop that accounts for 30% of worldwide sugar needs, has a relatively small genome size (758 Mbp) consisting of approximately 485 Mbp repetitive DNA (64%), in particular satellite DNA, retrotransposons and DNA transposons. Genome-wide cytosine methylation in the sugar beet genome was studied in leaves and leaf-derived callus with a focus on repetitive sequences, including retrotransposons and DNA transposons, the major groups of repetitive DNA sequences, and compared with gene methylation. Genes showed a specific methylation pattern for CG, CHG (H = A, C, or T) and CHH sites, whereas the TE pattern differed, depending on the TE class (class 1, retrotransposons and class 2, DNA transposons). Along genes and TEs, CG and CHG methylation was higher than that of adjacent genomic regions. In contrast to the relatively low CHH methylation in retrotransposons and genes, the level of CHH methylation in DNA transposons was strongly increased, pointing to a functional role of asymmetric methylation in DNA transposon silencing. Comparison of genome-wide DNA methylation between sugar beet leaves and callus revealed a differential methylation upon tissue culture. Potential epialleles were hypomethylated (lower methylation) at CG and CHG sites in retrotransposons and genes and hypermethylated (higher methylation) at CHH sites in DNA transposons of callus when compared with leaves. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
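The three sequence contexts named in this abstract (CG, CHG and CHH, where H is A, C, or T) are defined by the two bases following a cytosine. A minimal sketch of that classification is given below; the function names `cytosine_context` and `count_contexts` are hypothetical, only the forward strand is examined, and no methylation calling is attempted.

```python
from collections import Counter


def cytosine_context(seq, i):
    """Classify the cytosine at index i of a forward-strand sequence.

    Returns "CG", "CHG", or "CHH" (H = A, C, or T), or None if seq[i] is not
    a cytosine, the context runs off the end of the sequence, or the context
    contains an ambiguous base.
    """
    if seq[i].upper() != "C":
        return None
    window = seq[i + 1:i + 3].upper()
    if window[:1] == "G":
        return "CG"
    if len(window) < 2 or any(base not in "ACGT" for base in window):
        return None
    # At this point window[0] is A, C, or T, i.e. an H position.
    return "CHG" if window[1] == "G" else "CHH"


def count_contexts(seq):
    """Count cytosines per context on the forward strand (illustrative only)."""
    counts = Counter()
    for i in range(len(seq)):
        context = cytosine_context(seq, i)
        if context is not None:
            counts[context] += 1
    return counts
```

For the toy sequence `CGCAGCTTC`, `count_contexts` finds one cytosine in each of the three contexts; the final C has no downstream bases and is left unclassified.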
Retrotransposons Are the Major Contributors to the Expansion of the Drosophila ananassae Muller F Element

PubMed Central

Shaffer, Christopher D.; Chen, Elizabeth J.; Quisenberry, Thomas J.; Ko, Kevin; Braverman, John M.; Giarla, Thomas C.; Mortimer, Nathan T.; Reed, Laura K.; Smith, Sheryl T.; Robic, Srebrenka; McCartha, Shannon R.; Perry, Danielle R.; Prescod, Lindsay M.; Sheppard, Zenyth A.; Saville, Ken J.; McClish, Allison; Morlock, Emily A.; Sochor, Victoria R.; Stanton, Brittney; Veysey-White, Isaac C.; Revie, Dennis; Jimenez, Luis A.; Palomino, Jennifer J.; Patao, Melissa D.; Patao, Shane M.; Himelblau, Edward T.; Campbell, Jaclyn D.; Hertz, Alexandra L.; McEvilly, Maddison F.; Wagner, Allison R.; Youngblom, James; Bedi, Baljit; Bettincourt, Jeffery; Duso, Erin; Her, Maiye; Hilton, William; House, Samantha; Karimi, Masud; Kumimoto, Kevin; Lee, Rebekah; Lopez, Darryl; Odisho, George; Prasad, Ricky; Robbins, Holly Lyn; Sandhu, Tanveer; Selfridge, Tracy; Tsukashima, Kara; Yosif, Hani; Kokan, Nighat P.; Britt, Latia; Zoellner, Alycia; Spana, Eric P.; Chlebina, Ben T.; Chong, Insun; Friedman, Harrison; Mammo, Danny A.; Ng, Chun L.; Nikam, Vinayak S.; Schwartz, Nicholas U.; Xu, Thomas Q.; Burg, Martin G.; Batten, Spencer M.; Corbeill, Lindsay M.; Enoch, Erica; Ensign, Jesse J.; Franks, Mary E.; Haiker, Breanna; Ingles, Judith A.; Kirkland, Lyndsay D.; Lorenz-Guertin, Joshua M.; Matthews, Jordan; Mittig, Cody M.; Monsma, Nicholaus; Olson, Katherine J.; Perez-Aragon, Guillermo; Ramic, Alen; Ramirez, Jordan R.; Scheiber, Christopher; Schneider, Patrick A.; Schultz, Devon E.; Simon, Matthew; Spencer, Eric; Wernette, Adam C.; Wykle, Maxine E.; Zavala-Arellano, Elizabeth; McDonald, Mitchell J.; Ostby, Kristine; Wendland, Peter; DiAngelo, Justin R.; Ceasrine, Alexis M.; Cox, Amanda H.; Docherty, James E.B.; Gingras, Robert M.; Grieb, Stephanie M.; Pavia, Michael J.; Personius, Casey L.; Polak, Grzegorz L.; Beach, Dale L.; Cerritos, Heaven L.; Horansky, Edward A.; Sharif, Karim A.; Moran, Ryan; Parrish, Susan; Bickford, Kirsten; Bland, Jennifer; Broussard, Juliana; Campbell, Kerry; Deibel, Katelynn E.; Forka, Richard; Lemke, Monika C.; Nelson, Marlee B.; O'Keeffe, Catherine; Ramey, S. Mariel; Schmidt, Luke; Villegas, Paola; Jones, Christopher J.; Christ, Stephanie L.; Mamari, Sami; Rinaldi, Adam S.; Stity, Ghazal; Hark, Amy T.; Scheuerman, Mark; Silver Key, S. Catherine; McRae, Briana D.; Haberman, Adam S.; Asinof, Sam; Carrington, Harriette; Drumm, Kelly; Embry, Terrance; McGuire, Richard; Miller-Foreman, Drew; Rosen, Stella; Safa, Nadia; Schultz, Darrin; Segal, Matt; Shevin, Yakov; Svoronos, Petros; Vuong, Tam; Skuse, Gary; Paetkau, Don W.; Bridgman, Rachael K.; Brown, Charlotte M.; Carroll, Alicia R.; Gifford, Francesca M.; Gillespie, Julie Beth; Herman, Susan E.; Holtcamp, Krystal L.; Host, Misha A.; Hussey, Gabrielle; Kramer, Danielle M.; Lawrence, Joan Q.; Martin, Madeline M.; Niemiec, Ellen N.; O'Reilly, Ashleigh P.; Pahl, Olivia A.; Quintana, Guadalupe; Rettie, Elizabeth A.S.; Richardson, Torie L.; Rodriguez, Arianne E.; Rodriguez, Mona O.; Schiraldi, Laura; Smith, Joanna J.; Sugrue, Kelsey F.; Suriano, Lindsey J.; Takach, Kaitlyn E.; Vasquez, Arielle M.; Velez, Ximena; Villafuerte, Elizabeth J.; Vives, Laura T.; Zellmer, Victoria R.; Hauke, Jeanette; Hauser, Charles R.; Barker, Karolyn; Cannon, Laurie; Parsamian, Perouza; Parsons, Samantha; Wichman, Zachariah; Bazinet, Christopher W.; Johnson, Diana E.; Bangura, Abubakarr; Black, Jordan A.; Chevee, Victoria; Einsteen, Sarah A.; Hilton, Sarah K.; Kollmer, Max; Nadendla, Rahul; Stamm, Joyce; Fafara-Thompson, Antoinette E.; Gygi, Amber M.; Ogawa, Emmy E.; Van Camp, Matt; Kocsisova, Zuzana; Leatherman, Judith L.; Modahl, Cassie M.; Rubin, Michael R.; Apiz-Saab, Susana S.; Arias-Mejias, Suzette M.; Carrion-Ortiz, Carlos F.; Claudio-Vazquez, Patricia N.; Espada-Green, Debbie M.; Feliciano-Camacho, Marium; Gonzalez-Bonilla, Karina M.; Taboas-Arroyo, Mariela; Vargas-Franco, Dorianmarie; Montañez-Gonzalez, Raquel; Perez-Otero, Joseph; Rivera-Burgos, Myrielis; Rivera-Rosario, Francisco J.; Eisler, Heather L.; Alexander, Jackie; Begley, Samatha K.; Gabbard, Deana; Allen, Robert J.; Aung, Wint Yan; Barshop, William D.; Boozalis, Amanda; Chu, Vanessa P.; Davis, Jeremy S.; Duggal, Ryan N.; Franklin, Robert; Gavinski, Katherine; Gebreyesus, Heran; Gong, Henry Z.; Greenstein, Rachel A.; Guo, Averill D.; Hanson, Casey; Homa, Kaitlin E.; Hsu, Simon C.; Huang, Yi; Huo, Lucy; Jacobs, Sarah; Jia, Sasha; Jung, Kyle L.; Wai-Chee Kong, Sarah; Kroll, Matthew R.; Lee, Brandon M.; Lee, Paul F.; Levine, Kevin M.; Li, Amy S.; Liu, Chengyu; Liu, Max Mian; Lousararian, Adam P.; Lowery, Peter B.; Mallya, Allyson P.; Marcus, Joseph E.; Ng, Patrick C.; Nguyen, Hien P.; Patel, Ruchik; Precht, Hashini; Rastogi, Suchita; Sarezky, Jonathan M.; Schefkind, Adam; Schultz, Michael B.; Shen, Delia; Skorupa, Tara; Spies, Nicholas C.; Stancu, Gabriel; Vivian Tsang, Hiu Man; Turski, Alice L.; Venkat, Rohit; Waldman, Leah E.; Wang, Kaidi; Wang, Tracy; Wei, Jeffrey W.; Wu, Dennis Y.; Xiong, David D.; Yu, Jack; Zhou, Karen; McNeil, Gerard P.; Fernandez, Robert W.; Menzies, Patrick Gomez; Gu, Tingting; Buhler, Jeremy; Mardis, Elaine R.; Elgin, Sarah C.R.

2017-01-01

The discordance between genome size and the complexity of eukaryotes can partly be attributed to differences in repeat density. The Muller F element (∼5.2 Mb) is the smallest chromosome in Drosophila melanogaster, but it is substantially larger (>18.7 Mb) in D. ananassae. To identify the major contributors to the expansion of the F element and to assess their impact, we improved the genome sequence and annotated the genes in a 1.4-Mb region of the D. ananassae F element, and a 1.7-Mb region from the D element for comparison. We find that transposons (particularly LTR and LINE retrotransposons) are major contributors to this expansion (78.6%), while Wolbachia sequences integrated into the D. ananassae genome are minor contributors (0.02%). Both D. melanogaster and D. ananassae F-element genes exhibit distinct characteristics compared to D-element genes (e.g., larger coding spans, larger introns, more coding exons, and lower codon bias), but these differences are exaggerated in D. ananassae. Compared to D. melanogaster, the codon bias observed in D. ananassae F-element genes can primarily be attributed to mutational biases instead of selection. The 5′ ends of F-element genes in both species are enriched in dimethylation of lysine 4 on histone 3 (H3K4me2), while the coding spans are enriched in H3K9me2. Despite differences in repeat density and gene characteristics, D. ananassae F-element genes show a similar range of expression levels compared to genes in euchromatic domains. This study improves our understanding of how transposons can affect genome size and how genes can function within highly repetitive domains. PMID:28667019
Extensive length variation in the ribosomal DNA intergenic spacer of yellow perch (Perca flavescens).

PubMed

Kakou, Bidénam; Angers, Bernard; Glémet, Hélène

2016-03-01

The intergenic spacer (IGS) is located between ribosomal RNA (rRNA) gene copies. Within the IGS, regulatory elements for rRNA gene transcription are found, as well as a varying number of other repetitive elements that are at the root of IGS length heterogeneity. This heterogeneity has been shown to have a functional significance through its effect on growth rate. Here, we present the structural organization of yellow perch (Perca flavescens) IGS based on its entire sequence, as well as the IGS length variation within a natural population. Yellow perch IGS structure has four discrete regions containing tandem repeat elements. For three of these regions, no specific length class was detected as allele size was seemingly normally distributed. However, for one repeat region, PCR amplification uncovered the presence of two distinctive IGS variants representing a length difference of 1116 bp. This repeat region was also devoid of any CpG sites despite a high GC content. Balancing selection may be holding the alleles in the population and would account for the high diversity of length variants observed for adjacent regions. Our study is an important precursor for further work aiming to assess the role of IGS length variation in influencing growth rate in fish.
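The observation that a region can be GC-rich yet contain no CpG sites can be checked on any sequence with a few lines of code. The sketch below is only an illustration under stated assumptions: the function name `gc_and_cpg` is hypothetical, only the given strand is scanned, and the observed/expected ratio uses the common definition (CpG count × length) / (C count × G count), which is not taken from the abstract.

```python
def gc_and_cpg(seq):
    """Return (GC fraction, CpG count, observed/expected CpG ratio) for a DNA string.

    Hypothetical helper for illustration. The expected CpG count is
    C count * G count / length; the ratio is 0.0 when that is zero.
    """
    seq = seq.upper()
    length = len(seq)
    c_count = seq.count("C")
    g_count = seq.count("G")
    # Occurrences of "CG" cannot overlap, so str.count gives the exact number.
    cpg_count = seq.count("CG")
    gc_fraction = (c_count + g_count) / length if length else 0.0
    expected = c_count * g_count / length if length else 0.0
    ratio = cpg_count / expected if expected else 0.0
    return gc_fraction, cpg_count, ratio
```

For example, the toy string `GGGGCCCC` has a GC fraction of 1.0 but no CpG dinucleotide, whereas `GGCCGGCC` has the same base composition and one CpG.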
Distinct retroelement classes define evolutionary breakpoints demarcating sites of evolutionary novelty

PubMed Central

Longo, Mark S; Carone, Dawn M; Green, Eric D; O'Neill, Michael J; O'Neill, Rachel J

2009-01-01

Background: Large-scale genome rearrangements brought about by chromosome breaks underlie numerous inherited diseases, initiate or promote many cancers and are also associated with karyotype diversification during species evolution. Recent research has shown that these breakpoints are nonrandomly distributed throughout the mammalian genome and many, termed "evolutionary breakpoints" (EB), are specific genomic locations that are "reused" during karyotypic evolution. When the phylogenetic trajectory of orthologous chromosome segments is considered, many of these EB are coincident with ancient centromere activity as well as new centromere formation. While EB have been characterized as repeat-rich regions, it has not been determined whether specific sequences have been retained during evolution that would indicate previous centromere activity or a propensity for new centromere formation. Likewise, the conservation of specific sequence motifs or classes at EBs among divergent mammalian taxa has not been determined. Results: To define conserved sequence features of EBs associated with centromere evolution, we performed comparative sequence analysis of more than 4.8 Mb within the tammar wallaby, Macropus eugenii, derived from centromeric regions (CEN), euchromatic regions (EU), and an evolutionary breakpoint (EB) that has undergone convergent breakpoint reuse and past centromere activity in marsupials. We found a dramatic enrichment for long interspersed nucleotide elements (LINE1s) and endogenous retroviruses (ERVs) and a depletion of short interspersed nucleotide elements (SINEs) shared between CEN and EBs. We analyzed the orthologous human EB (14q32.33), known to be associated with translocations in many cancers including multiple myelomas and plasma cell leukemias, and found a conserved distribution of similar repetitive elements. Conclusion: Our data indicate that EBs tracked within the class Mammalia harbor sequence features retained since the divergence of marsupials and eutherians that may have predisposed these genomic regions to large-scale chromosomal instability. PMID:19630942
Gibbs motif sampling: detection of bacterial outer membrane protein repeats.

PubMed Central

Neuwald, A. F.; Liu, J. S.; Lawrence, C. E.

1995-01-01

The detection and alignment of locally conserved regions (motifs) in multiple sequences can provide insight into protein structure, function, and evolution. A new Gibbs sampling algorithm is described that detects motif-encoding regions in sequences and optimally partitions them into distinct motif models; this is illustrated using a set of immunoglobulin fold proteins. When applied to sequences sharing a single motif, the sampler can be used to classify motif regions into related submodels, as is illustrated using helix-turn-helix DNA-binding proteins. Other statistically based procedures are described for searching a database for sequences matching motifs found by the sampler. When applied to a set of 32 very distantly related bacterial integral outer membrane proteins, the sampler revealed that they share a subtle, repetitive motif. Although BLAST (Altschul SF et al., 1990, J Mol Biol 215:403-410) fails to detect significant pairwise similarity between any of the sequences, the repeats present in these outer membrane proteins, taken as a whole, are highly significant (based on a generally applicable statistical test for motifs described here). Analysis of bacterial porins with known trimeric beta-barrel structure and related proteins reveals a similar repetitive motif corresponding to alternating membrane-spanning beta-strands. These beta-strands occur on the membrane interface (as opposed to the trimeric interface) of the beta-barrel. The broad conservation and structural location of these repeats suggests that they play important functional roles. PMID:8520488
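The idea of sampling motif positions can be illustrated with a minimal sketch. This is a basic Gibbs site sampler (one motif occurrence per sequence, uniform background assumed), not the multi-motif, submodel-partitioning algorithm the abstract describes. The function name `gibbs_site_sampler`, its parameters, and the pseudocount value are all hypothetical choices made for illustration.

```python
import random


def gibbs_site_sampler(seqs, w, iters=200, seed=0, alphabet="ACGT", pseudo=1.0):
    """Sample one motif start per sequence (illustrative sketch only).

    Assumes every sequence uses only characters from `alphabet`, is at
    least `w` long, and that the background distribution is uniform.
    """
    rng = random.Random(seed)
    # Start from random motif positions, one per sequence.
    pos = [rng.randrange(len(s) - w + 1) for s in seqs]
    for _ in range(iters):
        for i, s in enumerate(seqs):
            # Build a column-wise count profile from all *other* sequences.
            counts = [{a: pseudo for a in alphabet} for _ in range(w)]
            for j, t in enumerate(seqs):
                if j == i:
                    continue
                for k in range(w):
                    counts[k][t[pos[j] + k]] += 1
            total = (len(seqs) - 1) + pseudo * len(alphabet)
            # Score every candidate window of the held-out sequence.
            weights = []
            for start in range(len(s) - w + 1):
                p = 1.0
                for k in range(w):
                    p *= counts[k][s[start + k]] / total
                weights.append(p)
            # Resample the position in proportion to its profile score.
            pos[i] = rng.choices(range(len(weights)), weights=weights)[0]
    return pos
```

Because the procedure is stochastic, the returned positions depend on the seed and on the number of iterations; in practice one would run several restarts and compare the resulting alignments.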
Initial sequence and comparative analysis of the cat genome

PubMed Central

Pontius, Joan U.; Mullikin, James C.; Smith, Douglas R.; Lindblad-Toh, Kerstin; Gnerre, Sante; Clamp, Michele; Chang, Jean; Stephens, Robert; Neelam, Beena; Volfovsky, Natalia; Schäffer, Alejandro A.; Agarwala, Richa; Narfström, Kristina; Murphy, William J.; Giger, Urs; Roca, Alfred L.; Antunes, Agostinho; Menotti-Raymond, Marilyn; Yuhki, Naoya; Pecon-Slattery, Jill; Johnson, Warren E.; Bourque, Guillaume; Tesler, Glenn; O’Brien, Stephen J.

2007-01-01

The genome sequence (1.9-fold coverage) of an inbred Abyssinian domestic cat was assembled, mapped, and annotated with a comparative approach that involved cross-reference to annotated genome assemblies of six mammals (human, chimpanzee, mouse, rat, dog, and cow). The results resolved chromosomal positions for 663,480 contigs, 20,285 putative feline gene orthologs, and 133,499 conserved sequence blocks (CSBs). Additional annotated features include repetitive elements, endogenous retroviral sequences, nuclear mitochondrial (numt) sequences, micro-RNAs, and evolutionary breakpoints that suggest historic balancing of translocation and inversion incidences in distinct mammalian lineages. Large numbers of single nucleotide polymorphisms (SNPs), deletion insertion polymorphisms (DIPs), and short tandem repeats (STRs), suitable for linkage or association studies were characterized in the context of long stretches of chromosome homozygosity. In spite of the light coverage capturing ∼65% of euchromatin sequence from the cat genome, these comparative insights shed new light on the tempo and mode of gene/genome evolution in mammals, promise several research applications for the cat, and also illustrate that a comparative approach using more deeply covered mammals provides an informative, preliminary annotation of a light (1.9-fold) coverage mammal genome sequence. PMID:17975172
V-SINEs: A New Superfamily of Vertebrate SINEs That Are Widespread in Vertebrate Genomes and Retain a Strongly Conserved Segment within Each Repetitive Unit

PubMed Central

Ogiwara, Ikuo; Miya, Masaki; Ohshima, Kazuhiko; Okada, Norihiro

2002-01-01

We have identified a new superfamily of vertebrate short interspersed repetitive elements (SINEs), designated V-SINEs, that are widespread in fishes and frogs. Each V-SINE includes a central conserved domain preceded by a 5′-end tRNA-related region and followed by a potentially recombinogenic (TG)n tract, with a 3′ tail derived from the 3′ untranslated region (UTR) of the corresponding partner long interspersed repetitive element (LINE) that encodes a functional reverse transcriptase. The central domain is strongly conserved and is even found in SINEs in the lamprey genome, suggesting that V-SINEs might be ∼550 Myr old or older in view of the timing of divergence of the lamprey lineage from the bony fish lineage. The central conserved domain might have been subject to some form of positive selection. Although the contemporary 3′ tails of V-SINEs differ from one another, it is possible that the original 3′ tail might have been replaced, via recombination, by the 3′ tails of more active partner LINEs, thereby retaining retropositional activity and the ability to survive for long periods on the evolutionary time scale. It seems plausible that V-SINEs may have some function(s) that have been maintained by the coevolution of SINEs and LINEs during the evolution of vertebrates. [The sequences reported in this paper have been deposited in the DDBJ/GenBank database under accession nos. AB072981–AB073004. Supplemental figures are available online at http://www.genome.org.] PMID:11827951
A peripheral component interconnect express-based scalable and highly integrated pulsed spectrometer for solution state dynamic nuclear polarization.

PubMed

He, Yugui; Feng, Jiwen; Zhang, Zhi; Wang, Chao; Wang, Dong; Chen, Fang; Liu, Maili; Liu, Chaoyang

2015-08-01

High sensitivity, high data rates, fast pulses, and accurate synchronization all represent challenges for modern nuclear magnetic resonance spectrometers, which make any expansion or adaptation of these devices to new techniques and experiments difficult. Here, we present a Peripheral Component Interconnect Express (PCIe)-based highly integrated distributed digital architecture pulsed spectrometer that is implemented with electron and nucleus double resonances and is scalable specifically for broad dynamic nuclear polarization (DNP) enhancement applications, including DNP-magnetic resonance spectroscopy/imaging (DNP-MRS/MRI). The distributed modularized architecture can implement more transceiver channels flexibly to meet a variety of MRS/MRI instrumentation needs. The proposed PCIe bus with high data rates can significantly improve data transmission efficiency and communication reliability and allow precise control of pulse sequences. An external high speed double data rate memory chip is used to store acquired data and pulse sequence elements, which greatly accelerates the execution of the pulse sequence, reduces the TR (time of repetition) interval, and improves the accuracy of TR in imaging sequences. Using clock phase-shift technology, we can produce digital pulses accurately with high timing resolution of 1 ns and narrow widths of 4 ns to control the microwave pulses required by pulsed DNP and ensure overall system synchronization. The proposed spectrometer is proved to be both feasible and reliable by observation of a maximum signal enhancement factor of approximately -170 for ¹H, and a high quality water image was successfully obtained by DNP-enhanced spin-echo ¹H MRI at 0.35 T.

A peripheral component interconnect express-based scalable and highly integrated pulsed spectrometer for solution state dynamic nuclear polarization

DOE Office of Scientific and Technical Information (OSTI.GOV)

He, Yugui; Liu, Chaoyang, E-mail: chyliu@wipm.ac.cn; State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, Wuhan Institute of Physics and Mathematics, Chinese Academy of Sciences, Wuhan 430071

2015-08-15

High sensitivity, high data rates, fast pulses, and accurate synchronization all represent challenges for modern nuclear magnetic resonance spectrometers, which make any expansion or adaptation of these devices to new techniques and experiments difficult. Here, we present a Peripheral Component Interconnect Express (PCIe)-based highly integrated distributed digital architecture pulsed spectrometer that is implemented with electron and nucleus double resonances and is scalable specifically for broad dynamic nuclear polarization (DNP) enhancement applications, including DNP-magnetic resonance spectroscopy/imaging (DNP-MRS/MRI). The distributed modularized architecture can implement more transceiver channels flexibly to meet a variety of MRS/MRI instrumentation needs. The proposed PCIe bus with high data rates can significantly improve data transmission efficiency and communication reliability and allow precise control of pulse sequences. An external high speed double data rate memory chip is used to store acquired data and pulse sequence elements, which greatly accelerates the execution of the pulse sequence, reduces the TR (time of repetition) interval, and improves the accuracy of TR in imaging sequences. Using clock phase-shift technology, we can produce digital pulses accurately with high timing resolution of 1 ns and narrow widths of 4 ns to control the microwave pulses required by pulsed DNP and ensure overall system synchronization. The proposed spectrometer is proved to be both feasible and reliable by observation of a maximum signal enhancement factor of approximately −170 for ¹H, and a high quality water image was successfully obtained by DNP-enhanced spin-echo ¹H MRI at 0.35 T.
From an animal model to human patients: An example of a translational study on obsessive compulsive disorder (OCD).

PubMed

Eilam, David

2017-05-01

The application of similar analyses enables a direct projection from translational research in animals to human studies. Following is an example of how the methodology of a specific animal model of obsessive-compulsive disorder (OCD) was applied to study human patients. Specifically, the quinpirole rat model for OCD was based on analyzing the trajectories of travel among different locales, and scoring the set of acts performed at each locale. Applying this analytic approach in human patients unveiled various aspects of OCD, such as the repetition and addition of acts, incompleteness, and the link between behavior and specific locations. It is also illustrated how the same analytical approach could be applicable to studying other mental disorders. Finally, it is suggested that the development of OCD could be explained by the four-phase sequence of Repetition, Addition, Condensation, and Elimination, as outlined in the study of ontogeny and phylogeny and applied to normal development of behavior. In OCD, this sequence is curtailed, resulting in the abundant repetition and addition of acts. Copyright © 2017 Elsevier Ltd. All rights reserved.
Families of transposable elements, population structure and the origin of species.

PubMed

Jurka, Jerzy; Bao, Weidong; Kojima, Kenji K

2011-09-19

Eukaryotic genomes harbor diverse families of repetitive DNA derived from transposable elements (TEs) that are able to replicate and insert into genomic DNA. The biological role of TEs remains unclear, although they have profound mutagenic impact on eukaryotic genomes and the origin of repetitive families often correlates with speciation events. We present a new hypothesis to explain the observed correlations based on classical concepts of population genetics. The main thesis presented in this paper is that the TE-derived repetitive families originate primarily by genetic drift in small populations derived mostly by subdivisions of large populations into subpopulations. We outline the potential impact of the emerging repetitive families on genetic diversification of different subpopulations, and discuss implications of such diversification for the origin of new species. Several testable predictions of the hypothesis are examined. First, we focus on the prediction that the number of diverse families of TEs fixed in a representative genome of a particular species positively correlates with the cumulative number of subpopulations (demes) in the historical metapopulation from which the species has emerged. Furthermore, we present evidence indicating that human AluYa5 and AluYb8 families might have originated in separate proto-human subpopulations. We also revisit prior evidence linking the origin of repetitive families to mammalian phylogeny and present additional evidence linking repetitive families to speciation based on mammalian taxonomy. Finally, we discuss evidence that mammalian orders represented by the largest numbers of species may be subject to relatively recent population subdivisions and speciation events. The hypothesis implies that subdivision of a population into small subpopulations is the major step in the origin of new families of TEs as well as of new species. The origin of new subpopulations is likely to be driven by the availability of new biological niches, consistent with the hypothesis of punctuated equilibria. The hypothesis also has implications for the ongoing debate on the role of genetic drift in genome evolution.
The Development of Long-Term Lexical Representations through Hebb Repetition Learning

ERIC Educational Resources Information Center

Szmalec, Arnaud; Page, Mike P. A.; Duyck, Wouter

2012-01-01

This study clarifies the involvement of short- and long-term memory in novel word-form learning, using the Hebb repetition paradigm. In Experiment 1, participants recalled sequences of visually presented syllables (e.g., "la"-"va"-"bu"-"sa"-"fa"-"ra"-"re"-"si"-"di"), with one particular (Hebb) sequence repeated on every third trial. Crucially,…
Genome sequence and analysis of a stress-tolerant, wild-derived strain of Saccharomyces cerevisiae used in biofuels research

DOE Office of Scientific and Technical Information (OSTI.GOV)

McIlwain, Sean J.; Peris, Davis; Sardi, Maria

The genome sequences of more than 100 strains of the yeast Saccharomyces cerevisiae have been published. Unfortunately, most of these genome assemblies contain dozens to hundreds of gaps at repetitive sequences, including transposable elements, tRNAs, and subtelomeric regions, which is where novel genes generally reside. Relatively few strains have been chosen for genome sequencing based on their biofuel production potential, leaving an additional knowledge gap. Here, we describe the nearly complete genome sequence of GLBRCY22-3 (Y22-3), a strain of S. cerevisiae derived from the stress-tolerant wild strain NRRL YB-210 and subsequently engineered for xylose metabolism. After benchmarking several genome assembly approaches, we developed a pipeline to integrate Pacific Biosciences (PacBio) and Illumina sequencing data and achieved one of the highest quality genome assemblies for any S. cerevisiae strain. Specifically, the contig N50 is 693 kbp, and the sequences of most chromosomes, the mitochondrial genome, and the 2-micron plasmid are complete. Our annotation predicts 92 genes that are not present in the reference genome of the laboratory strain S288c, over 70% of which were expressed. We predicted functions for 43 of these genes, 28 of which were previously uncharacterized and unnamed. Remarkably, many of these genes are predicted to be involved in stress tolerance and carbon metabolism and are shared with a Brazilian bioethanol production strain, even though the strains differ dramatically at most genetic loci. Lastly, the Y22-3 genome sequence provides an exceptionally high-quality resource for basic and applied research in bioenergy and genetics.
Genome sequence and analysis of a stress-tolerant, wild-derived strain of Saccharomyces cerevisiae used in biofuels research

DOE PAGES

McIlwain, Sean J.; Peris, Davis; Sardi, Maria; ...

2016-04-20

The genome sequences of more than 100 strains of the yeast Saccharomyces cerevisiae have been published. Unfortunately, most of these genome assemblies contain dozens to hundreds of gaps at repetitive sequences, including transposable elements, tRNAs, and subtelomeric regions, which is where novel genes generally reside. Relatively few strains have been chosen for genome sequencing based on their biofuel production potential, leaving an additional knowledge gap. Here, we describe the nearly complete genome sequence of GLBRCY22-3 (Y22-3), a strain of S. cerevisiae derived from the stress-tolerant wild strain NRRL YB-210 and subsequently engineered for xylose metabolism. After benchmarking several genome assembly approaches, we developed a pipeline to integrate Pacific Biosciences (PacBio) and Illumina sequencing data and achieved one of the highest quality genome assemblies for any S. cerevisiae strain. Specifically, the contig N50 is 693 kbp, and the sequences of most chromosomes, the mitochondrial genome, and the 2-micron plasmid are complete. Our annotation predicts 92 genes that are not present in the reference genome of the laboratory strain S288c, over 70% of which were expressed. We predicted functions for 43 of these genes, 28 of which were previously uncharacterized and unnamed. Remarkably, many of these genes are predicted to be involved in stress tolerance and carbon metabolism and are shared with a Brazilian bioethanol production strain, even though the strains differ dramatically at most genetic loci. Lastly, the Y22-3 genome sequence provides an exceptionally high-quality resource for basic and applied research in bioenergy and genetics.
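For readers unfamiliar with the contig N50 statistic quoted above, the following is a small sketch of how it is commonly computed: sort contigs from longest to shortest and report the length of the contig at which the running total first reaches half of the assembly size. The function name `n50` and the example lengths are illustrative assumptions, not data from the Y22-3 assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L together
    cover at least half of the total assembled bases.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare doubled running sum to avoid fractional halves.
        if 2 * running >= total:
            return length
    raise ValueError("n50() requires at least one contig length")


# Hypothetical contig lengths (in kbp), for illustration only.
example = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(example))
```

A higher N50 generally indicates a more contiguous assembly, but it says nothing about correctness, so it is usually reported alongside other quality measures.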
Abr1, a Transposon-Like Element in the Genome of the Cultivated Mushroom Agaricus bisporus (Lange) Imbach

PubMed Central

Sonnenberg, Anton S. M.; Baars, Johan J. P.; Mikosch, Thomas S. P.; Schaap, Peter J.; Van Griensven, Leo J. L. D.

1999-01-01

A 300-bp repetitive element was found in the genome of the white button mushroom, Agaricus bisporus, and designated Abr1. It is present in ∼15 copies per haploid genome in the commercial strain Horst U1. Analysis of seven copies showed 89 to 97% sequence identity. The repeat has features typical of class II transposons (i.e., terminal inverted repeats, subterminal repeats, and a target site duplication of 7 bp). The latter shows a consensus sequence. When used as probe on Southern blots, Abr1 identifies relatively little variation within traditional and present-day commercial strains, indicating that most strains are identical or have a common origin. In contrast to these cultivars, high variation is found among field-collected strains. Furthermore, a remarkable difference in copy numbers of Abr1 was found between A. bisporus isolates with a secondarily homothallic life cycle and those with a heterothallic life cycle. Abr1 is a type II transposon not previously reported in basidiomycetes and appears to be useful for the identification of strains within the species A. bisporus. PMID:10427018
Improved maize reference genome with single-molecule technologies.

PubMed

Jiao, Yinping; Peluso, Paul; Shi, Jinghua; Liang, Tiffany; Stitzer, Michelle C; Wang, Bo; Campbell, Michael S; Stein, Joshua C; Wei, Xuehong; Chin, Chen-Shan; Guill, Katherine; Regulski, Michael; Kumari, Sunita; Olson, Andrew; Gent, Jonathan; Schneider, Kevin L; Wolfgruber, Thomas K; May, Michael R; Springer, Nathan M; Antoniou, Eric; McCombie, W Richard; Presting, Gernot G; McMullen, Michael; Ross-Ibarra, Jeffrey; Dawe, R Kelly; Hastie, Alex; Rank, David R; Ware, Doreen

2017-06-22

Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.
Comparative Analysis of the Shared Sex-Determination Region (SDR) among Salmonid Fishes.

PubMed

Faber-Hammond, Joshua J; Phillips, Ruth B; Brown, Kim H

2015-06-25

Salmonids present an excellent model for studying evolution of young sex-chromosomes. Within the genus, Oncorhynchus, at least six independent sex-chromosome pairs have evolved, many unique to individual species. This variation results from the movement of the sex-determining gene, sdY, throughout the salmonid genome. While sdY is known to define sexual differentiation in salmonids, the mechanism of its movement throughout the genome has remained elusive due to high frequencies of repetitive elements, rDNA sequences, and transposons surrounding the sex-determining regions (SDR). Despite these difficulties, bacterial artificial chromosome (BAC) library clones from both rainbow trout and Atlantic salmon containing the sdY region have been reported. Here, we report the sequences for these BACs as well as the extended sequence for the known SDR in Chinook gained through genome walking methods. Comparative analysis allowed us to study the overlapping SDRs from three unique salmonid Y chromosomes to define the specific content, size, and variation present between the species. We found approximately 4.1 kb of orthologous sequence common to all three species, which contains the genetic content necessary for masculinization. The regions contain transposable elements that may be responsible for the translocations of the SDR throughout salmonid genomes and we examine potential mechanistic roles of each one. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
A compact control system to achieve stable voltage and low jitter trigger for repetitive intense electron-beam accelerator based on resonant charging

NASA Astrophysics Data System (ADS)

Qiu, Yongfeng; Liu, Jinliang; Yang, Jianhua; Cheng, Xinbing; Yang, Xiao

2017-08-01

A compact control system based on Delphi and a field-programmable gate array (FPGA) is developed for a repetitive intense electron-beam accelerator (IEBA) whose output power is 10 GW and pulse duration is 160 ns. The system uses both hardware and software solutions. It comprises a host computer, a communication module and a main control unit. A device-independent application programming interface, written in Delphi, is installed on the host computer. The stability theory of the voltage in repetitive mode is analyzed, and a detailed overview of the hardware and software configuration is presented. High-voltage experiments showed that the control system fulfilled the requirements of remote operation and data acquisition. The control system, based on a time-sequence control method, keeps the voltage of the primary capacitor constant in every shot, which ensured the stable and reliable operation of the electron-beam accelerator in repetitive mode during the experiment. Compared with the former control system based on LabVIEW and a PIC microcontroller developed in our laboratory, the present one is more compact and has higher precision in the time dimension. It is particularly useful for automatic control of the IEBA in high-power microwave effects research experiments where pulse-to-pulse reproducibility is required.
Development of Thinopyrum ponticum-specific molecular markers and FISH probes based on SLAF-seq technology.

PubMed

Liu, Liqin; Luo, Qiaoling; Teng, Wan; Li, Bin; Li, Hongwei; Li, Yiwen; Li, Zhensheng; Zheng, Qi

2018-05-01

Based on SLAF-seq, 67 Thinopyrum ponticum-specific markers and eight Th. ponticum-specific FISH probes were developed, and these markers and probes could be used for detection of alien chromatin in a wheat background. Decaploid Thinopyrum ponticum (2n = 10x = 70) is a valuable gene reservoir for wheat improvement. Identification of Th. ponticum introgression would facilitate its transfer into diverse wheat genetic backgrounds and its practical utilization in wheat improvement. Based on specific-locus-amplified fragment sequencing (SLAF-seq) technology, 67 new Th. ponticum-specific molecular markers and eight Th. ponticum-specific fluorescence in situ hybridization (FISH) probes have been developed from a tiny wheat-Th. ponticum translocation line. These newly developed molecular markers allowed the detection of Th. ponticum DNA in a variety of materials specifically and steadily at high throughput. According to the hybridization signal pattern, the eight Th. ponticum-specific probes could be divided into two groups. The first group, comprising five dispersed repetitive sequence probes, could identify Th. ponticum chromatin more sensitively and accurately than genomic in situ hybridization (GISH), whereas the second group, comprising three tandem repetitive sequence probes, enabled the discrimination of Th. ponticum chromosomes together with another clone, pAs1, in the wheat-Th. ponticum partial amphiploid Xiaoyan 68.
ATRX tolerates activity-dependent histone H3 methyl/phos switching to maintain repetitive element silencing in neurons

PubMed Central

Noh, Kyung-Min; Zhao, Dan; Xiang, Bin; Wenderski, Wendy; Lewis, Peter W.; Shen, Li; Li, Haitao; Allis, C. David

2015-01-01

ATRX (the alpha thalassemia/mental retardation syndrome X-linked protein) is a member of the switch2/sucrose nonfermentable2 (SWI2/SNF2) family of chromatin-remodeling proteins and primarily functions at heterochromatic loci via its recognition of “repressive” histone modifications [e.g., histone H3 lysine 9 tri-methylation (H3K9me3)]. Despite significant roles for ATRX during normal neural development, as well as its relationship to human disease, ATRX function in the central nervous system is not well understood. Here, we describe ATRX’s ability to recognize an activity-dependent combinatorial histone modification, histone H3 lysine 9 tri-methylation/serine 10 phosphorylation (H3K9me3S10ph), in postmitotic neurons. In neurons, this “methyl/phos” switch occurs exclusively after periods of stimulation and is highly enriched at heterochromatic repeats associated with centromeres. Using a multifaceted approach, we reveal that H3K9me3S10ph-bound Atrx represses noncoding transcription of centromeric minor satellite sequences during instances of heightened activity. Our results indicate an essential interaction between ATRX and a previously uncharacterized histone modification in the central nervous system and suggest a potential role for abnormal repetitive element transcription in pathological states manifested by ATRX dysfunction. PMID:25538301
ATRX tolerates activity-dependent histone H3 methyl/phos switching to maintain repetitive element silencing in neurons.

PubMed

Noh, Kyung-Min; Maze, Ian; Zhao, Dan; Xiang, Bin; Wenderski, Wendy; Lewis, Peter W; Shen, Li; Li, Haitao; Allis, C David

2015-06-02

ATRX (the alpha thalassemia/mental retardation syndrome X-linked protein) is a member of the switch2/sucrose nonfermentable2 (SWI2/SNF2) family of chromatin-remodeling proteins and primarily functions at heterochromatic loci via its recognition of "repressive" histone modifications [e.g., histone H3 lysine 9 tri-methylation (H3K9me3)]. Despite significant roles for ATRX during normal neural development, as well as its relationship to human disease, ATRX function in the central nervous system is not well understood. Here, we describe ATRX's ability to recognize an activity-dependent combinatorial histone modification, histone H3 lysine 9 tri-methylation/serine 10 phosphorylation (H3K9me3S10ph), in postmitotic neurons. In neurons, this "methyl/phos" switch occurs exclusively after periods of stimulation and is highly enriched at heterochromatic repeats associated with centromeres. Using a multifaceted approach, we reveal that H3K9me3S10ph-bound Atrx represses noncoding transcription of centromeric minor satellite sequences during instances of heightened activity. Our results indicate an essential interaction between ATRX and a previously uncharacterized histone modification in the central nervous system and suggest a potential role for abnormal repetitive element transcription in pathological states manifested by ATRX dysfunction.
Knowing what to respond in the future does not cancel the influence of past events.

PubMed

Tubau, Elisabet; López-Moliner, Joan

2009-05-29

Everyday tasks seldom involve isolated actions but sequences of them. We can see whether previous actions influence the current one by exploring the response time to controlled sequences of stimuli. Specifically, depending on the response-stimulus temporal interval (RSI), different mechanisms have been proposed to explain sequential effects in two-choice serial response tasks. Whereas an automatic facilitation mechanism is thought to produce a benefit for response repetitions at short RSIs, subjective expectancies are considered to replace the automatic facilitation at longer RSIs, producing a cost-benefit pattern: repetitions are faster after other repetitions but they are slower after alternations. However, there is no direct evidence showing the impact of subjective expectancies on sequential effects. By using a fixed sequence, the results of the reported experiment showed that the repetition effect was enhanced in participants who acquired complete knowledge of the order. Nevertheless, a similar cost-benefit pattern was observed in all participants and in all learning blocks. Therefore, results of the experiment suggest that sequential effects, including the cost-benefit pattern, are the consequence of automatic mechanisms which operate independently of (and simultaneously with) explicit knowledge of the sequence or other subjective expectancies.
Molecular basis of length polymorphism in the human zeta-globin gene complex.

PubMed Central

Goodbourn, S E; Higgs, D R; Clegg, J B; Weatherall, D J

1983-01-01

The length polymorphism between the human zeta-globin gene and its pseudogene is caused by an allele-specific variation in the copy number of a tandemly repeating 36-base-pair sequence. This sequence is related to a tandemly repeated 14-base-pair sequence in the 5' flanking region of the human insulin gene, which is known to cause length polymorphism, and to a repetitive sequence in intervening sequence (IVS) 1 of the pseudo-zeta-globin gene. Evidence is presented that the latter is also of variable length, probably because of differences in the copy number of the tandem repeat. The homology between the three length polymorphisms may be an indication of the presence of a more widespread group of related sequences in the human genome, which might be useful for generalized linkage studies. PMID:6308667
Satellite DNA and Transposable Elements in Seabuckthorn (Hippophae rhamnoides), a Dioecious Plant with Small Y and Large X Chromosomes.

PubMed

Puterova, Janka; Razumova, Olga; Martinek, Tomas; Alexandrov, Oleg; Divashuk, Mikhail; Kubat, Zdenek; Hobza, Roman; Karlov, Gennady; Kejnovsky, Eduard

2017-01-01

Seabuckthorn (Hippophae rhamnoides) is a dioecious shrub commonly used in the pharmaceutical, cosmetic, and environmental industry as a source of oil, minerals and vitamins. In this study, we analyzed the transposable elements and satellites in its genome. We carried out Illumina DNA sequencing and reconstructed the main repetitive DNA sequences. For data analysis, we developed a new bioinformatics approach for advanced satellite DNA analysis and showed that about 25% of the genome consists of satellite DNA and about 24% is formed of transposable elements, dominated by Ty3/Gypsy and Ty1/Copia LTR retrotransposons. FISH mapping revealed X chromosome-accumulated, Y chromosome-specific or both sex chromosomes-accumulated satellites but most satellites were found on autosomes. Transposable elements were located mostly in the subtelomeres of all chromosomes. The 5S rDNA and 45S rDNA were localized on one autosomal locus each. Although we demonstrated the small size of the Y chromosome of the seabuckthorn and accumulated satellite DNA there, we were unable to estimate the age and extent of the Y chromosome degeneration. Analysis of dioecious relatives such as Shepherdia would shed more light on the evolution of these sex chromosomes. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
ABMapper: a suffix array-based tool for multi-location searching and splice-junction mapping.

PubMed

Lou, Shao-Ke; Ni, Bing; Lo, Leung-Yau; Tsui, Stephen Kwok-Wing; Chan, Ting-Fung; Leung, Kwong-Sak

2011-02-01

Sequencing reads generated by RNA-sequencing (RNA-seq) must first be mapped back to the genome through alignment before they can be further analyzed. Current fast and memory-saving short-read mappers could give us a quick view of the transcriptome. However, they are neither designed for reads that span across splice junctions nor for repetitive reads, which can be mapped to multiple locations in the genome (multi-reads). Here, we describe a new software package: ABMapper, which is specifically designed for exploring all putative locations of reads that are mapped to splice junctions or repetitive in nature. The software is freely available at: http://abmapper.sourceforge.net/. The software is written in C++ and PERL. It runs on all major platforms and operating systems including Windows, Mac OS X and LINUX.
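The idea of reporting every location of a repetitive read with a suffix array can be illustrated with a toy example. This is a minimal sketch only, not ABMapper's actual implementation: the function names `build_suffix_array` and `find_all`, the naive construction, and the toy genome string are all assumptions made for illustration.

```python
from bisect import bisect_left, bisect_right


def build_suffix_array(text):
    """Return the start positions of all suffixes of `text` in sorted order.

    Naive construction (sorting full suffixes); fine for a toy genome only.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_all(text, suffix_array, read):
    """Return every position where `read` occurs exactly in `text`.

    Suffixes sharing the prefix `read` form one contiguous block in the
    suffix array, so two binary searches delimit all hits at once.
    """
    k = len(read)
    # Truncated suffixes stay in sorted order because the suffixes are sorted.
    prefixes = [text[i:i + k] for i in suffix_array]
    lo = bisect_left(prefixes, read)
    hi = bisect_right(prefixes, read)
    return sorted(suffix_array[lo:hi])


# Hypothetical toy genome in which "ACGT" is a repeat (a "multi-read" target).
genome = "ACGTACGTTTACGT"
sa = build_suffix_array(genome)
multi_hits = find_all(genome, sa, "ACGT")
unique_hits = find_all(genome, sa, "TTT")
no_hits = find_all(genome, sa, "GGG")
```

A production mapper would avoid materializing the `prefixes` list and would handle mismatches and splice junctions; the sketch only shows why a suffix array makes it natural to report all locations of a read rather than a single best one.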
Global Sea-level Changes Revealed in the Sediments of the Canterbury Basin, New Zealand: IODP Expedition 317

NASA Astrophysics Data System (ADS)

McHugh, C. M.; Fulthorpe, C.; Blum, P.; Rios, J.; Chow, Y.; Mishkin, K.

2012-12-01

Continental margins are composed of thick sedimentary sections that preserve the record of local processes modulated by global sea-level (eustatic) changes and climate. Understanding this regional variability permits us to extract the eustatic record. Integrated Ocean Drilling Program Expedition 317 drilled four sites in the offshore Canterbury Basin, eastern South Island of New Zealand, in water depths of 85 m to 320 m. One of the objectives of the expedition was to understand the influence of eustasy on continental margins sedimentation and to test the concepts of sequence stratigraphy. A high-resolution multiproxy approach that involves geochemical elemental analyses, lithostratigraphy and biostratigraphy is applied to understand the margin's sedimentation for the past ~5 million years. Multichannel seismic data (EW00-01 survey) provide a seismic sequence stratigraphic framework against which to interpret the multiproxy data. The mid- to late Pleistocene sedimentation is characterized by variable lithologies and changing facies. However, elemental compositions and facies follow predictable patterns within seismic sequences. Oxygen isotope measurements for the latest Pleistocene indicate that 100 ky Milankovich astronomical forcing controlled this variability. In contrast, Pliocene and early Pleistocene sediments are composed of repetitive siliciclastic and carbonate mud lithologies with less facies variability. Results of our analyses suggest that repetitive alternations of green and gray mud were deposited during warmer and cooler periods, respectively. Oxygen isotopes suggest that this cyclicity may reflect 40 ky Milankovich forcing. Ocean Drilling Program Legs 150 and 174A drilled on the New Jersey continental margin with similar objectives to those of Expedition 317. 
Results from this northern and southern hemisphere drilling reveal that eustasy, controlled by Milankovich forcing, strongly influences margin sedimentation and the formation of basin-wide unconformities. However, the correlation between eustasy and seismic sequence formation is not always one to one. High sedimentation rates in the Pleistocene offshore Canterbury Basin record a one-to-one correlation between glacioeustasy and seismic sequences, and in some sequences possibly a higher order frequency. But this is not the case for offshore New Jersey, where accumulation rates were lower and only the uppermost seismic sequences represent 100 ky cycles. Furthermore, Pliocene sedimentation in the Canterbury Basin was also controlled by eustasy, but does not show a one-to-one correlation between Milankovich cycles and seismic stratigraphy. Northern and southern hemisphere comparisons provide a powerful tool to better understand controls on regional sedimentation and extract a global signal.
QuickMap: a public tool for large-scale gene therapy vector insertion site mapping and analysis.

PubMed

Appelt, J-U; Giordano, F A; Ecker, M; Roeder, I; Grund, N; Hotz-Wagenblatt, A; Opelz, G; Zeller, W J; Allgayer, H; Fruehauf, S; Laufs, S

2009-07-01

Several events of insertional mutagenesis in pre-clinical and clinical gene therapy studies have created intense interest in assessing the genomic insertion profiles of gene therapy vectors. For the construction of such profiles, vector-flanking sequences detected by inverse PCR, linear amplification-mediated-PCR or ligation-mediated-PCR need to be mapped to the host cell's genome and compared to a reference set. Although remarkable progress has been achieved in mapping gene therapy vector insertion sites, public reference sets are lacking, as are the possibilities to quickly detect non-random patterns in experimental data. We developed a tool termed QuickMap, which uniformly maps and analyzes human and murine vector-flanking sequences within seconds (available at www.gtsg.org). Besides information about hits in chromosomes and fragile sites, QuickMap automatically determines insertion frequencies in +/- 250 kb adjacency to genes, cancer genes, pseudogenes, transcription factor and (post-transcriptional) miRNA binding sites, CpG islands and repetitive elements (short interspersed nuclear elements (SINE), long interspersed nuclear elements (LINE), Type II elements and LTR elements). Additionally, all experimental frequencies are compared with the data obtained from a reference set, containing 1 000 000 random integrations ('random set'). Thus, for the first time a tool allowing high-throughput profiling of gene therapy vector insertion sites is available. It provides a basis for large-scale insertion site analyses, which is now urgently needed to discover novel gene therapy vectors with 'safe' insertion profiles.
RNomics: an experimental approach that identifies 201 candidates for novel, small, non-messenger RNAs in mouse

PubMed Central

Hüttenhofer, Alexander; Kiefmann, Martin; Meier-Ewert, Sebastian; O’Brien, John; Lehrach, Hans; Bachellerie, Jean-Pierre; Brosius, Jürgen

2001-01-01

In mouse brain cDNA libraries generated from small RNA molecules we have identified a total of 201 different expressed RNA sequences potentially encoding novel small non-messenger RNA species (snmRNAs). Based on sequence and structural motifs, 113 of these RNAs can be assigned to the C/D box or H/ACA box subclass of small nucleolar RNAs (snoRNAs), known as guide RNAs for rRNA. While 30 RNAs represent mouse homologues of previously identified human C/D or H/ACA snoRNAs, 83 correspond to entirely novel snoRNAs. Among these, for the first time, we identified four C/D box snoRNAs and four H/ACA box snoRNAs predicted to direct modifications within U2, U4 or U6 small nuclear RNAs (snRNAs). Furthermore, 25 snoRNAs from either class lacked antisense elements for rRNAs or snRNAs. Therefore, additional snoRNA targets have to be considered. Surprisingly, six C/D box snoRNAs and one H/ACA box snoRNA were expressed exclusively in brain. Of the 88 RNAs not belonging to either snoRNA subclass, at least 26 are probably derived from truncated heterogeneous nuclear RNAs (hnRNAs) or mRNAs. Short interspersed repetitive elements (SINEs) are located on five RNA sequences and may represent rare examples of transcribed SINEs. The remaining RNA species could not as yet be assigned either to any snmRNA class or to a part of a larger hnRNA/mRNA. It is likely that at least some of the latter will represent novel, unclassified snmRNAs. PMID:11387227

Sequence Composition and Gene Content of the Short Arm of Rye (Secale cereale) Chromosome 1

PubMed Central

Fluch, Silvia; Kopecky, Dieter; Burg, Kornel; Šimková, Hana; Taudien, Stefan; Petzold, Andreas; Kubaláková, Marie; Platzer, Matthias; Berenyi, Maria; Krainer, Siegfried; Doležel, Jaroslav; Lelley, Tamas

2012-01-01

Background: The purpose of the study is to elucidate the sequence composition of the short arm of rye chromosome 1 (Secale cereale) with special focus on its gene content, because this portion of the rye genome is an integrated part of several hundred bread wheat varieties worldwide. Methodology/Principal Findings: Multiple Displacement Amplification of 1RS DNA, obtained from flow-sorted 1RS chromosomes using a 1RS ditelosomic wheat-rye addition line, and subsequent Roche 454FLX sequencing of this DNA yielded 195,313,589 bp of sequence information. This quantity of sequence information resulted in 0.43× sequence coverage of the 1RS chromosome arm, permitting the identification of genes with an estimated probability of 95%. A detailed analysis revealed that more than 5% of the 1RS sequence consisted of gene space, identifying at least 3,121 gene loci representing 1,882 different gene functions. Repetitive elements comprised about 72% of the 1RS sequence, Gypsy/Sabrina (13.3%) being the most abundant. More than four thousand simple sequence repeat (SSR) sites, mostly located in gene-related sequence reads, were identified for possible marker development. The existence of chloroplast insertions in 1RS has been verified by identifying chimeric chloroplast-genomic sequence reads. Synteny analysis of 1RS to the full genomes of Oryza sativa and Brachypodium distachyon revealed that about half of the genes of 1RS correspond to the distal end of the short arm of rice chromosome 5 and the proximal region of the long arm of Brachypodium distachyon chromosome 2. Comparison of the gene content of 1RS to the 1HS barley chromosome arm revealed high conservation of genes related to chromosome 5 of rice. Conclusions: The present study revealed the gene content and potential gene functions on this chromosome arm and demonstrated numerous sequence elements like SSRs and gene-related sequences, which can be utilised for future research as well as in breeding of wheat and rye. PMID:22328922
A draft of the genome and four transcriptomes of a medicinal and pesticidal angiosperm Azadirachta indica

PubMed Central

2012-01-01

Background: The Azadirachta indica (neem) tree is a source of a wide range of natural products, including the potent biopesticide azadirachtin. In spite of its widespread applications in agriculture and medicine, the molecular aspects of the biosynthesis of neem terpenoids remain largely unexplored. The current report describes the draft genome and four transcriptomes of A. indica and attempts to contextualise the sequence information in terms of its molecular phylogeny, transcript expression and terpenoid biosynthesis pathways. A. indica is the first member of the family Meliaceae to be sequenced using a next-generation sequencing approach. Results: The genome and transcriptomes of A. indica were sequenced using multiple sequencing platforms and libraries. The A. indica genome is AT-rich, bears few repetitive DNA elements and comprises about 20,000 genes. The molecular phylogenetic analyses grouped A. indica together with Citrus sinensis from the Rutaceae family, validating its conventional taxonomic classification. Comparative transcript expression analysis showed either exclusive or enhanced expression of known genes involved in neem terpenoid biosynthesis pathways compared to other sequenced angiosperms. Genome and transcriptome analyses in A. indica led to the identification of repeat elements, nucleotide composition and expression profiles of genes in various organs. Conclusions: This study on the A. indica genome and transcriptomes will provide a model for characterization of metabolic pathways involved in synthesis of bioactive compounds and for comparative evolutionary studies among various Meliaceae family members, and will help annotate their genomes. A better understanding of molecular pathways involved in azadirachtin synthesis in A. indica will pave the way for bulk production of environment-friendly biopesticides. PMID:22958331
Long-read whole genome sequencing and comparative analysis of six strains of the human pathogen Orientia tsutsugamushi.

PubMed

Batty, Elizabeth M; Chaemchuen, Suwittra; Blacksell, Stuart; Richards, Allen L; Paris, Daniel; Bowden, Rory; Chan, Caroline; Lachumanan, Ramkumar; Day, Nicholas; Donnelly, Peter; Chen, Swaine; Salje, Jeanne

2018-06-01

Orientia tsutsugamushi is a clinically important but neglected obligate intracellular bacterial pathogen of the Rickettsiaceae family that causes the potentially life-threatening human disease scrub typhus. In contrast to the genome reduction seen in many obligate intracellular bacteria, early genetic studies of Orientia have revealed one of the most repetitive bacterial genomes sequenced to date. The dramatic expansion of mobile elements has hampered efforts to generate complete genome sequences using short read sequencing methodologies, and consequently there have been few studies of the comparative genomics of this neglected species. We report new high-quality genomes of O. tsutsugamushi, generated using PacBio single molecule long read sequencing, for six strains: Karp, Kato, Gilliam, TA686, UT76 and UT176. In comparative genomics analyses of these strains together with existing reference genomes from Ikeda and Boryong strains, we identify a relatively small core genome of 657 genes, grouped into core gene islands and separated by repeat regions, and use the core genes to infer the first whole-genome phylogeny of Orientia. Complete assemblies of multiple Orientia genomes verify initial suggestions that these are remarkable organisms. They have larger genomes compared with most other Rickettsiaceae, with widespread amplification of repeat elements and massive chromosomal rearrangements between strains. At the gene level, Orientia has a relatively small set of universally conserved genes, similar to other obligate intracellular bacteria, and the relative expansion in genome size can be accounted for by gene duplication and repeat amplification. Our study demonstrates the utility of long read sequencing to investigate complex bacterial genomes and characterise genomic variation.
Molecular characterization of a phloem-specific gene encoding the filament protein, phloem protein 1 (PP1), from Cucurbita maxima.

PubMed

Clark, A M; Jacobsen, K R; Bostwick, D E; Dannenhoffer, J M; Skaggs, M I; Thompson, G A

1997-07-01

Sieve elements in the phloem of most angiosperms contain proteinaceous filaments and aggregates called P-protein. In the genus Cucurbita, these filaments are composed of two major proteins: PP1, the phloem filament protein, and PP2, the phloem lectin. The gene encoding the phloem filament protein in pumpkin (Cucurbita maxima Duch.) has been isolated and characterized. Nucleotide sequence analysis of the reconstructed gene gPP1 revealed a continuous 2430 bp protein coding sequence, with no introns, encoding an 809 amino acid polypeptide. The deduced polypeptide had characteristics of PP1 and contained a 15 amino acid sequence determined by N-terminal peptide sequence analysis of PP1. The sequence of PP1 was highly repetitive with four 200 amino acid sequence domains containing structural motifs in common with cysteine proteinase inhibitors. Expression of the PP1 gene was detected in roots, hypocotyls, cotyledons, stems, and leaves of pumpkin plants. PP1 and its mRNA accumulated in pumpkin hypocotyls during the period of rapid hypocotyl elongation, after which mRNA levels declined while protein levels remained elevated. PP1 was immunolocalized in slime plugs and P-protein bodies in sieve elements of the phloem. Occasionally, PP1 was detected in companion cells. PP1 mRNA was localized by in situ hybridization in companion cells at early stages of vascular differentiation. The developmental accumulation and localization of PP1 and its mRNA paralleled those of the phloem lectin, further suggesting an interaction between these phloem-specific proteins.
Construction of a nurse shark (Ginglymostoma cirratum) bacterial artificial chromosome (BAC) library and a preliminary genome survey.

PubMed

Luo, Meizhong; Kim, Hyeran; Kudrna, Dave; Sisneros, Nicholas B; Lee, So-Jeong; Mueller, Christopher; Collura, Kristi; Zuccolo, Andrea; Buckingham, E Bryan; Grim, Suzanne M; Yanagiya, Kazuyo; Inoko, Hidetoshi; Shiina, Takashi; Flajnik, Martin F; Wing, Rod A; Ohta, Yuko

2006-05-03

Sharks are members of the taxonomic class Chondrichthyes, the oldest living jawed vertebrates. Genomic studies of this group, in comparison to representative species in other vertebrate taxa, will allow us to theorize about the fundamental genetic, developmental, and functional characteristics in the common ancestor of all jawed vertebrates. In order to obtain mapping and sequencing data for comparative genomics, we constructed a bacterial artificial chromosome (BAC) library for the nurse shark, Ginglymostoma cirratum. The BAC library consists of 313,344 clones with an average insert size of 144 kb, covering ~4.5 × 10^10 bp and thus providing an 11-fold coverage of the haploid genome. BAC end sequence analyses revealed, in addition to LINEs and SINEs commonly found in other animal and plant genomes, two new groups of nurse shark-specific repetitive elements, NSRE1 and NSRE2, that seem to be major components of the nurse shark genome. Screening the library with single-copy or multi-copy gene probes showed 6-28 primary positive clones per probe of which 50-90% were true positives, demonstrating that the BAC library is representative of the different regions of the nurse shark genome. Furthermore, some BAC clones contained multiple genes, making physical mapping feasible. We have constructed a deep-coverage, high-quality, large insert, and publicly available BAC library for a cartilaginous fish. It will be very useful to the scientific community interested in shark genomic structure, comparative genomics, and functional studies. We found two new groups of repetitive elements specific to the nurse shark genome, which may contribute to the architecture and evolution of the nurse shark genome.
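The library-coverage figures quoted in this abstract can be checked with a few lines of arithmetic. The sketch below uses only the numbers stated in the abstract (clone count, average insert size, 11-fold coverage); the implied haploid genome size is a derived value and is not stated in the abstract itself, and the variable names are illustrative.

```python
# Figures taken from the abstract.
clones = 313_344
avg_insert_bp = 144_000        # 144 kb average insert size
fold_coverage = 11             # stated coverage of the haploid genome

# Total DNA held in the library: clones x average insert size.
library_bp = clones * avg_insert_bp

# Derived (not stated in the abstract): the haploid genome size that is
# consistent with an 11-fold library of this size.
implied_genome_bp = library_bp / fold_coverage

print(f"library size: {library_bp:.2e} bp")
print(f"implied haploid genome: {implied_genome_bp:.2e} bp")
```

The product comes to about 4.5 × 10^10 bp, matching the stated library size, and dividing by the stated coverage implies a haploid genome on the order of 4 × 10^9 bp.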
An Ancient Transkingdom Horizontal Transfer of Penelope-Like Retroelements from Arthropods to Conifers

PubMed Central

Lin, Xuan; Faridi, Nurul; Casola, Claudio

2016-01-01

Comparative genomics analyses empowered by the wealth of sequenced genomes have revealed numerous instances of horizontal DNA transfers between distantly related species. In eukaryotes, repetitive DNA sequences known as transposable elements (TEs) are especially prone to move across species boundaries. Such horizontal transposon transfers, or HTTs, are relatively common within major eukaryotic kingdoms, including animals, plants, and fungi, while rarely occurring across these kingdoms. Here, we describe the first case of HTT from animals to plants, involving TEs known as Penelope-like elements, or PLEs, a group of retrotransposons closely related to eukaryotic telomerases. Using a combination of in situ hybridization on chromosomes, polymerase chain reaction experiments, and computational analyses, we show that the predominant PLE lineage, EN(+)PLEs, is highly diversified in loblolly pine and other conifers, but appears to be absent in other gymnosperms. Phylogenetic analyses of both protein and DNA sequences reveal that conifer EN(+)PLEs, or Dryads, form a monophyletic group clustering within a clade of primarily arthropod elements. Additionally, no EN(+)PLEs were detected in 1,928 genome assemblies from 1,029 nonmetazoan and nonconifer genomes from 14 major eukaryotic lineages. These findings indicate that Dryads emerged following an ancient horizontal transfer of EN(+)PLEs from arthropods to a common ancestor of conifers approximately 340 Ma. This represents one of the oldest known interspecific transmissions of TEs, and the most conspicuous case of DNA transfer between animals and plants. PMID:27190138
Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

PubMed Central

Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.

2009-01-01

The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303
Molecular Analysis of Mycobacterium avium Isolates by Using Pulsed-Field Gel Electrophoresis and PCR

PubMed Central

Pestel-Caron, Martine; Graff, Gabriel; Berthelot, Gilles; Pons, Jean-Louis; Lemeland, Jean-François

1999-01-01

Genetic relationships among 46 isolates of Mycobacterium avium recovered from 37 patients in a 2,500-bed hospital from 1993 to 1998 were assessed by pulsed-field gel electrophoresis (PFGE) and PCR amplification of genomic sequences located between the repetitive elements IS1245 and IS1311. Each technique enabled the identification of 27 to 32 different patterns among the 46 isolates, confirming that the genetic heterogeneity of M. avium strains is high in a given community. Furthermore, this retrospective analysis of sporadic isolates allowed us (i) to suggest the existence of two remanent strains in our region, (ii) to raise the question of the possibility of nosocomial acquisition of M. avium strains, and (iii) to document laboratory contamination. The methods applied in the present study were found to be useful for the typing of M. avium isolates. In general, both methods yielded similar results for both related and unrelated isolates. However, the isolates in five of the six PCR clusters were distributed among two to three PFGE patterns, suggesting that this PCR-based method may have limitations for the analysis of strains with low insertion sequence copy numbers or for resolution of extended epidemiologic relationships. PMID:10405383
GHz laser-free time-resolved transmission electron microscopy: A stroboscopic high-duty-cycle method

DOE PAGES

Qiu, Jiaqi; Zhu, Yimei; Ha, Gwanghui; ...

2015-11-10

In this study, a device and a method for producing ultrashort electron pulses with GHz repetition rates via pulsing an input direct current (dc) electron beam are provided. The device and the method are based on an electromagnetic-mechanical pulser (EMMP) that consists of a series of transverse deflecting cavities and magnetic quadrupoles. The EMMP modulates and chops the incoming dc electron beam and converts it into pico- and sub-pico-second electron pulse sequences (pulse trains) at >1 GHz repetition rates, as well as controllably manipulates the resulting pulses. Ultimately, it leads to negligible electron pulse phase-space degradation compared to the incoming dc beam parameters. The temporal pulse length and repetition rate for the EMMP can be continuously tunable over wide ranges.
A model for genesis of transcription systems.

PubMed

Burton, Zachary F; Opron, Kristopher; Wei, Guowei; Geiger, James H

2016-01-01

Repeating sequences generated from RNA gene fusions/ligations dominate ancient life, indicating central importance of building structural complexity in evolving biological systems. A simple and coherent story of life on earth is told from tracking repeating motifs that generate α/β proteins, 2-double-Ψ-β-barrel (DPBB) type RNA polymerases (RNAPs), general transcription factors (GTFs), and promoters. A general rule that emerges is that biological complexity that arises through generation of repeats is often bounded by solubility and closure (i.e., to form a pseudo-dimer or a barrel). Because the first DNA genomes were replicated by DNA template-dependent RNA synthesis followed by RNA template-dependent DNA synthesis via reverse transcriptase, the first DNA replication origins were initially 2-DPBB type RNAP promoters. A simplifying model for evolution of promoters/replication origins via repetition of core promoter elements is proposed. The model can explain why Pribnow boxes in bacterial transcription (i.e., (-12)TATAATG(-6)) so closely resemble TATA boxes (i.e., (-31)TATAAAAG(-24)) in archaeal/eukaryotic transcription. The evolution of anchor DNA sequences in bacterial (i.e., (-35)TTGACA(-30)) and archaeal (BRE(up); BRE for TFB recognition element) promoters is potentially explained. The evolution of BRE(down) elements of archaeal promoters is potentially explained.
DoOP: Databases of Orthologous Promoters, collections of clusters of orthologous upstream sequences from chordates and plants

PubMed Central

Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B.; Tóth, Gábor; Ortutay, Csaba P.; Patthy, László

2005-01-01

DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21 061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically. PMID:15608291
DoOP: Databases of Orthologous Promoters, collections of clusters of orthologous upstream sequences from chordates and plants.

PubMed

Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B; Tóth, Gábor; Ortutay, Csaba P; Patthy, László

2005-01-01

DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21,061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically.
Illumina Synthetic Long Read Sequencing Allows Recovery of Missing Sequences even in the “Finished” C. elegans Genome

PubMed Central

Li, Runsheng; Hsieh, Chia-Ling; Young, Amanda; Zhang, Zhihong; Ren, Xiaoliang; Zhao, Zhongying

2015-01-01

Most next-generation sequencing platforms permit acquisition of high-throughput DNA sequences, but the relatively short read length limits their use in genome assembly or finishing. Illumina has recently released a technology called Synthetic Long-Read Sequencing that can produce reads of unusual length, i.e., predominantly around 10 Kb. However, a systematic assessment of their use in genome finishing and assembly is still lacking. We evaluate the promise and deficiencies of the long reads in these respects using an isogenic C. elegans genome with no gaps. First, the reads are highly accurate and capable of recovering most types of repetitive sequences. However, the presence of tandem repetitive sequences prevents pre-assembly of long reads in the relevant genomic region. Second, the reads are able to reliably detect missing but not extra sequences in the C. elegans genome. Third, the reads of smaller size are more capable of recovering repetitive sequences than those of bigger size. Fourth, at least 40 Kbp of missing genomic sequences are recovered in the C. elegans genome using the long reads. Finally, an N50 contig size of at least 86 Kbp can be achieved with 24× reads but with substantial mis-assembly errors, highlighting a need for novel assembly algorithms for the long reads. PMID:26039588
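For readers unfamiliar with the N50 contig size mentioned in this abstract, the statistic can be computed as follows. This is a minimal sketch using the common definition of N50 (the length of the contig at which the cumulative length of contigs, sorted longest first, reaches half of the total assembly length); the function name `n50` and the contig lengths are hypothetical and are not data from the study.

```python
def n50(contig_lengths):
    """Return the N50 of a list of contig lengths.

    Contigs are sorted longest first; N50 is the length of the contig at
    which the running total first reaches half of the assembly length.
    """
    if not contig_lengths:
        raise ValueError("at least one contig length is required")
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length


# Hypothetical contig lengths (in Kbp), for illustration only.
toy_contigs = [100, 80, 60, 40, 20]
toy_n50 = n50(toy_contigs)
```

Because N50 rewards long contigs regardless of their correctness, a high value can coexist with mis-assemblies, which is the caveat the abstract raises.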
Inhibition in task switching: The reliability of the n - 2 repetition cost.

PubMed

Kowalczyk, Agnieszka W; Grange, James A

2017-12-01

The n - 2 repetition cost seen in task switching is the effect of slower response times when performing a recently completed task (e.g. an ABA sequence) compared to performing a task that was not recently completed (e.g. a CBA sequence). This cost is thought to reflect cognitive inhibition of task representations and as such, the n - 2 repetition cost has begun to be used as an assessment of individual differences in inhibitory control; however, the reliability of this measure has not been investigated in a systematic manner. The current study addressed this important issue. Seventy-two participants performed three task switching paradigms; participants were also assessed on rumination traits and processing speed, two measures of individual differences potentially modulating the n - 2 repetition cost. We found significant n - 2 repetition costs for each paradigm. However, split-half reliability tests revealed that this cost was not reliable at the individual-difference level. Neither rumination tendencies nor processing speed predicted this cost. We conclude that the n - 2 repetition cost is not reliable as a measure of individual differences in inhibitory control.
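The two quantities discussed above can be sketched as follows: the n - 2 repetition cost as the mean ABA response time minus the mean CBA response time, and a split-half reliability estimate with the Spearman-Brown correction. The odd/even trial split, the function names, and the data layout are assumptions for illustration; the abstract does not state how the halves were formed.

```python
from statistics import mean


def n2_cost(aba_rts, cba_rts):
    """n - 2 repetition cost: mean RT on ABA trials minus mean RT on CBA trials."""
    return mean(aba_rts) - mean(cba_rts)


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman_brown(r):
    """Correct a half-test correlation to full-test length."""
    return 2 * r / (1 + r)


def split_half_reliability(participants):
    """participants: list of (aba_rts, cba_rts) pairs, one pair per person.

    Assumed split: even-indexed trials versus odd-indexed trials.
    """
    first = [n2_cost(aba[0::2], cba[0::2]) for aba, cba in participants]
    second = [n2_cost(aba[1::2], cba[1::2]) for aba, cba in participants]
    return spearman_brown(pearson(first, second))
```

A group-level cost can be clearly above zero while this reliability stays low, which is the pattern the abstract reports.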
The Effect of N-3 on N-2 Repetition Costs in Task Switching

ERIC Educational Resources Information Center

Schuch, Stefanie; Grange, James A.

2015-01-01

N-2 task repetition cost is a response time and error cost returning to a task recently performed after one intervening trial (i.e., an ABA task sequence) compared with returning to a task not recently performed (i.e., a CBA task sequence). This cost is considered a robust measure of inhibitory control during task switching. The present article…
Evidence of Anticipatory Eye Movements in the Spatial Hebb Repetition Effect: Insights for Modeling Sequence Learning

ERIC Educational Resources Information Center

Tremblay, Sebastien; Saint-Aubin, Jean

2009-01-01

In the present study, the authors offer a window onto the mechanisms that drive the Hebb repetition effect through the analysis of eye movement and recall performance. In a spatial serial recall task in which sequences of dots are to be remembered in order, when one particular series is repeated every 4 trials, memory performance markedly improves…
BAUM: improving genome assembly by adaptive unique mapping and local overlap-layout-consensus approach.

PubMed

Wang, Anqi; Wang, Zhanyu; Li, Zheng; Li, Lei M

2018-06-15

It is highly desirable to assemble genomes of high continuity and consistency at low cost. The current bottleneck of draft genome continuity using the second generation sequencing (SGS) reads is primarily caused by uncertainty among repetitive sequences. Even though the single-molecule real-time sequencing technology is very promising to overcome the uncertainty issue, its relatively high cost and error rate add burden on budget or computation. Many long-read assemblers take the overlap-layout-consensus (OLC) paradigm, which is less sensitive to sequencing errors, heterozygosity and variability of coverage. However, current assemblers of SGS data do not sufficiently take advantage of the OLC approach. Aiming at minimizing uncertainty, the proposed method, BAUM, breaks the whole genome into regions by adaptive unique mapping; then the local OLC is used to assemble each region in parallel. BAUM can (i) perform reference-assisted assembly based on the genome of a close species or (ii) improve the results of existing assemblies that are obtained based on short or long sequencing reads. The tests on two eukaryote genomes, a wild rice Oryza longistaminata and a parrot Melopsittacus undulatus, show that BAUM achieved substantial improvement on genome size and continuity. Besides, BAUM reconstructed a considerable amount of repetitive regions that failed to be assembled by existing short read assemblers. We also propose statistical approaches to control the uncertainty in different steps of BAUM. http://www.zhanyuwang.xin/wordpress/index.php/2017/07/21/baum. Supplementary data are available at Bioinformatics online.
GC-rich coding sequences reduce transposon-like, small RNA-mediated transgene silencing.

PubMed

Sidorenko, Lyudmila V; Lee, Tzuu-Fen; Woosley, Aaron; Moskal, William A; Bevan, Scott A; Merlo, P Ann Owens; Walsh, Terence A; Wang, Xiujuan; Weaver, Staci; Glancy, Todd P; Wang, PoHao; Yang, Xiaozeng; Sriram, Shreedharan; Meyers, Blake C

2017-11-01

The molecular basis of transgene susceptibility to silencing is poorly characterized in plants; thus, we evaluated several transgene design parameters as means to reduce heritable transgene silencing. Analyses of Arabidopsis plants with transgenes encoding a microalgal polyunsaturated fatty acid (PUFA) synthase revealed that small RNA (sRNA)-mediated silencing, combined with the use of repetitive regulatory elements, led to aggressive transposon-like silencing of canola-biased PUFA synthase transgenes. Diversifying regulatory sequences and using native microalgal coding sequences (CDSs) with higher GC content improved transgene expression and resulted in a remarkable trans-generational stability via reduced accumulation of sRNAs and DNA methylation. Further experiments in maize with transgenes individually expressing three crystal (Cry) proteins from Bacillus thuringiensis (Bt) tested the impact of CDS recoding using different codon bias tables. Transgenes with higher GC content exhibited increased transcript and protein accumulation. These results demonstrate that the sequence composition of transgene CDSs can directly impact silencing, providing design strategies for increasing transgene expression levels and reducing risks of heritable loss of transgene expression.
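Since the comparison above hinges on the GC content of coding sequences, a minimal sketch of that calculation may help. The function name and the choice to ignore ambiguous bases such as N are assumptions for this example, not details from the study.

```python
def gc_content(cds):
    """Return the fraction of G and C among the unambiguous bases of a sequence."""
    counted = [base for base in cds.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T bases")
    return sum(base in "GC" for base in counted) / len(counted)
```

Applied to two recoded versions of the same CDS, the function gives the kind of figure on which the transgene designs in the abstract are compared.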
Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana.

PubMed

Lin, X; Kaul, S; Rounsley, S; Shea, T P; Benito, M I; Town, C D; Fujii, C Y; Mason, T; Bowman, C L; Barnstead, M; Feldblyum, T V; Buell, C R; Ketchum, K A; Lee, J; Ronning, C M; Koo, H L; Moffat, K S; Cronin, L A; Shen, M; Pai, G; Van Aken, S; Umayam, L; Tallon, L J; Gill, J E; Adams, M D; Carrera, A J; Creasy, T H; Goodman, H M; Somerville, C R; Copenhaver, G P; Preuss, D; Nierman, W C; White, O; Eisen, J A; Salzberg, S L; Fraser, C M; Venter, J C

1999-12-16

Arabidopsis thaliana (Arabidopsis) is unique among plant model organisms in having a small genome (130-140 Mb), excellent physical and genetic maps, and little repetitive DNA. Here we report the sequence of chromosome 2 from the Columbia ecotype in two gap-free assemblies (contigs) of 3.6 and 16 megabases (Mb). The latter represents the longest published stretch of uninterrupted DNA sequence assembled from any organism to date. Chromosome 2 represents 15% of the genome and encodes 4,037 genes, 49% of which have no predicted function. Roughly 250 tandem gene duplications were found in addition to large-scale duplications of about 0.5 and 4.5 Mb between chromosomes 2 and 1 and between chromosomes 2 and 4, respectively. Sequencing of nearly 2 Mb within the genetically defined centromere revealed a low density of recognizable genes, and a high density and diverse range of vestigial and presumably inactive mobile elements. More unexpected is what appears to be a recent insertion of a continuous stretch of 75% of the mitochondrial genome into chromosome 2.
[Isolation and phylogenetic analysis of one actinomycete strain YIM 90022 exhibiting anticancer activity].

PubMed

Chen, Yi-Guang; Li, Wen-Jun; Cui, Xiao-Long; Jiang, Cheng-Lin; Xu, Li-Hua

2006-10-01

One facultative alkaliphilic actinomycete strain YIM 90022 was isolated from hypersaline alkaline soil in Qinghai province, China. An almost-complete 16S rRNA gene sequence (1500 bp) for strain YIM 90022 was obtained. Phylogenetic analysis based on 16S rRNA gene sequences showed that strain YIM 90022 was closely related to four members of the genus Nocardiopsis with 16S rRNA gene sequence similarity values of 98.8% (N. exhalans DSM 44407T), 98.5% (N. prasina DSM 43845T), 98.4% (N. metallicus DSM 44598T) and 97.8% (N. listeri DSM 40297T), but represented a distinct phylogenetic lineage. Repetitive element sequence-based PCR (rep-PCR) genomic fingerprinting was evaluated on strain YIM 90022 and its closest relatives to investigate their genetic relatedness. The analysis of the rep-PCR genomic fingerprints showed that strain YIM 90022 was distinguishable from its closest relatives. The polyphasic taxonomic data presented in this study, including its morphology, physiological and biochemical characteristics, chemotaxonomy, 16S rRNA gene sequence-based phylogenetic analysis and rep-PCR genomic fingerprinting, supported the view that strain YIM 90022 represented a potential new species of the genus Nocardiopsis. The fermentation broth of strain YIM 90022 strongly inhibited growth of cell lines of gastric cancer, lung cancer, mammary cancer, melanoma cancer, renal cancer and uterus cancer. Strain YIM 90022 grew well on most tested media, producing exuberant vegetative hyphae and aerial hyphae. The vegetative hyphae are long and fragmented. Light yellow to deep brown diffusible pigments were produced on ISP 2, ISP 3 and ISP 6. Growth of the strain occurred in the pH range 6.0-12.0, with an optimum at pH 8.5. The NaCl tolerance range was 0-15% (W/V). Cell walls contain meso-diaminopimelic acid and have no diagnostic sugars. Polar lipids are phosphatidylcholine, phosphatidylglycerol, diphosphatidylglycerol and phosphatidylmethylethanolamine. Major menaquinones are MK-10 (H4, H6). The DNA G + C content is 71.5 mol %.
Reduced representation approaches to interrogate genome diversity in large repetitive plant genomes.

PubMed

Hirsch, Cory D; Evans, Joseph; Buell, C Robin; Hirsch, Candice N

2014-07-01

Technology and software improvements in the last decade now provide methodologies to access the genome sequence of not only a single accession, but also multiple accessions of plant species. This provides a means to interrogate species diversity at the genome level. Ample diversity among accessions in a collection of species can be found, including single-nucleotide polymorphisms, insertions and deletions, copy number variation and presence/absence variation. For species with small, non-repetitive rich genomes, re-sequencing of query accessions is robust, highly informative, and economically feasible. However, for species with moderate to large sized repetitive-rich genomes, technical and economic barriers prevent en masse genome re-sequencing of accessions. Multiple approaches to access a focused subset of loci in species with larger genomes have been developed, including reduced representation sequencing, exome capture and transcriptome sequencing. Collectively, these approaches have enabled interrogation of diversity on a genome scale for large plant genomes, including crop species important to worldwide food security. © The Author 2014. Published by Oxford University Press. All rights reserved.
Human CST Facilitates Genome-wide RAD51 Recruitment to GC-Rich Repetitive Sequences in Response to Replication Stress.

PubMed

Chastain, Megan; Zhou, Qing; Shiva, Olga; Fadri-Moskwik, Maria; Whitmore, Leanne; Jia, Pingping; Dai, Xueyu; Huang, Chenhui; Ye, Ping; Chai, Weihang

2016-08-02

The telomeric CTC1/STN1/TEN1 (CST) complex has been implicated in promoting replication recovery under replication stress at genomic regions, yet its precise role is unclear. Here, we report that STN1 is enriched at GC-rich repetitive sequences genome-wide in response to hydroxyurea (HU)-induced replication stress. STN1 deficiency exacerbates the fragility of these sequences under replication stress, resulting in chromosome fragmentation. We find that upon fork stalling, CST proteins form distinct nuclear foci that colocalize with RAD51. Furthermore, replication stress induces physical association of CST with RAD51 in an ATR-dependent manner. Strikingly, CST deficiency diminishes HU-induced RAD51 foci formation and reduces RAD51 recruitment to telomeres and non-telomeric GC-rich fragile sequences. Collectively, our findings establish that CST promotes RAD51 recruitment to GC-rich repetitive sequences in response to replication stress to facilitate replication restart, thereby providing insights into the mechanism underlying genome stability maintenance. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
BioNano genome mapping of individual chromosomes supports physical mapping and sequence assembly in complex plant genomes.

PubMed

Staňková, Helena; Hastie, Alex R; Chan, Saki; Vrána, Jan; Tulpová, Zuzana; Kubaláková, Marie; Visendi, Paul; Hayashi, Satomi; Luo, Mingcheng; Batley, Jacqueline; Edwards, David; Doležel, Jaroslav; Šimková, Hana

2016-07-01

The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC-by-BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high-resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high-resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome-scale analysis of repetitive sequences and revealed a ~800-kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone-by-clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC-contig physical map and validate sequence assembly on a chromosome-arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome-by-chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Stress induced gene expression drives transient DNA methylation changes at adjacent repetitive elements.

PubMed

Secco, David; Wang, Chuang; Shou, Huixia; Schultz, Matthew D; Chiarenza, Serge; Nussaume, Laurent; Ecker, Joseph R; Whelan, James; Lister, Ryan

2015-07-21

Cytosine DNA methylation (mC) is a genome modification that can regulate the expression of coding and non-coding genetic elements. However, little is known about the involvement of mC in response to environmental cues. Using whole genome bisulfite sequencing to assess the spatio-temporal dynamics of mC in rice grown under phosphate starvation and recovery conditions, we identified widespread phosphate starvation-induced changes in mC, preferentially localized in transposable elements (TEs) close to highly induced genes. These changes in mC occurred after changes in nearby gene transcription, were mostly DCL3a-independent, and could partially be propagated through mitosis; however, no evidence of meiotic transmission was observed. Similar analyses performed in Arabidopsis revealed a very limited effect of phosphate starvation on mC, suggesting a species-specific mechanism. Overall, this suggests that TEs in proximity to environmentally induced genes are silenced via hypermethylation, and establishes the temporal hierarchy of transcriptional and epigenomic changes in response to stress.
Activation of RNA polymerase III transcription of human Alu repetitive elements by adenovirus type 5: Requirement for the E1b 58-Kilodalton protein and the products of E4 open reading frames 3 and 6

DOE Office of Scientific and Technical Information (OSTI.GOV)

Panning, B.; Smiley, J.R.

1993-06-01

Alu elements are the single most abundant class of dispersed repeated sequences in the human genome, comprising 5-10% of the mass of human DNA. This report demonstrates that Ad5 infection strongly stimulates Pol III transcription of human Alu elements in HeLa and 293 cells. In contrast to the cases of Ad5-induced Pol III transcriptional activation, this process requires the E1b 58-kDa protein and the products of E4 open reading frames (ORFs) 3 and 6 in addition to the E1a 289-residue product. These findings suggest novel regulatory properties of the Ad5 E1b and E4 proteins and raise the possibility that analogous cellular trans-acting factors serve to modulate Alu expression in vivo.
Unique Features of the Loblolly Pine (Pinus taeda L.) Megagenome Revealed Through Sequence Annotation

PubMed Central

Wegrzyn, Jill L.; Liechty, John D.; Stevens, Kristian A.; Wu, Le-Shin; Loopstra, Carol A.; Vasquez-Gross, Hans A.; Dougherty, William M.; Lin, Brian Y.; Zieve, Jacob J.; Martínez-García, Pedro J.; Holt, Carson; Yandell, Mark; Zimin, Aleksey V.; Yorke, James A.; Crepeau, Marc W.; Puiu, Daniela; Salzberg, Steven L.; de Jong, Pieter J.; Mockaitis, Keithanne; Main, Doreen; Langley, Charles H.; Neale, David B.

2014-01-01

The largest genus in the conifer family Pinaceae is Pinus, with over 100 species. The size and complexity of their genomes (∼20–40 Gb, 2n = 24) have delayed the arrival of a well-annotated reference sequence. In this study, we present the annotation of the first whole-genome shotgun assembly of loblolly pine (Pinus taeda L.), which comprises 20.1 Gb of sequence. The MAKER-P annotation pipeline combined evidence-based alignments and ab initio predictions to generate 50,172 gene models, of which 15,653 are classified as high confidence. Clustering these gene models with 13 other plant species resulted in 20,646 gene families, of which 1554 are predicted to be unique to conifers. Among the conifer gene families, 159 are composed exclusively of loblolly pine members. The gene models for loblolly pine have the highest median and mean intron lengths of 24 fully sequenced plant genomes. Conifer genomes are full of repetitive DNA, with the most significant contributions from long-terminal-repeat retrotransposons. In depth analysis of the tandem and interspersed repetitive content yielded a combined estimate of 82%. PMID:24653211
Thought Speed, Mood, and the Experience of Mental Motion.

PubMed

Pronin, Emily; Jacobs, Elana

2008-11-01

This article presents a theoretical account relating thought speed to mood and psychological experience. Thought sequences that occur at a fast speed generally induce more positive affect than do those that occur slowly. Thought speed constitutes one aspect of mental motion. Another aspect involves thought variability, or the degree to which thoughts in a sequence either vary widely from or revolve closely around a theme. Thought sequences possessing more motion (occurring fast and varying widely) generally produce more positive affect than do sequences possessing little motion (occurring slowly and repetitively). When speed and variability oppose each other, such that one is low and the other is high, predictable psychological states also emerge. For example, whereas slow, repetitive thinking can prompt dejection, fast, repetitive thinking can prompt anxiety. This distinction is related to the fact that fast thinking involves greater actual and felt energy than slow thinking does. Effects of mental motion occur independent of the specific content of thought. Their consequences for mood and energy hold psychotherapeutic relevance. © 2008 Association for Psychological Science.
Comparative analysis of complete orthologous centromeres from two subspecies of rice reveals rapid variation of centromere organization and structure.

PubMed

Wu, Jianzhong; Fujisawa, Masaki; Tian, Zhixi; Yamagata, Harumi; Kamiya, Kozue; Shibata, Michie; Hosokawa, Satomi; Ito, Yukiyo; Hamada, Masao; Katagiri, Satoshi; Kurita, Kanako; Yamamoto, Mayu; Kikuta, Ari; Machita, Kayo; Karasawa, Wataru; Kanamori, Hiroyuki; Namiki, Nobukazu; Mizuno, Hiroshi; Ma, Jianxin; Sasaki, Takuji; Matsumoto, Takashi

2009-12-01

Centromeres are sites for assembly of the chromosomal structures that mediate faithful segregation at mitosis and meiosis. This function is conserved across species, but the DNA components that are involved in kinetochore formation differ greatly, even between closely related species. To shed light on the nature, evolutionary timing and evolutionary dynamics of rice centromeres, we decoded a 2.25-Mb DNA sequence covering the centromeric region of chromosome 8 of an indica rice variety, 'Kasalath' (Kas-Cen8). Analysis of repetitive sequences in Kas-Cen8 led to the identification of 222 long terminal repeat (LTR)-retrotransposon elements and 584 CentO satellite monomers, which account for 59.2% of the region. A comparison of the Kas-Cen8 sequence with that of japonica rice 'Nipponbare' (Nip-Cen8) revealed that about 66.8% of the Kas-Cen8 sequence was collinear with that of Nip-Cen8. Although the 27 putative genes are conserved between the two subspecies, only 55.4% of the total LTR-retrotransposon elements in 'Kasalath' had orthologs in 'Nipponbare', thus reflecting recent proliferation of a considerable number of LTR-retrotransposons since the divergence of two rice subspecies of indica and japonica within Oryza sativa. Comparative analysis of the subfamilies, time of insertion, and organization patterns of inserted LTR-retrotransposons between the two Cen8 regions revealed variations between 'Kasalath' and 'Nipponbare' in the preferential accumulation of CRR elements, and the expansion of CentO satellite repeats within the core domain of Cen8. Together, the results provide insights into the recent proliferation of LTR-retrotransposons, and the rapid expansion of CentO satellite repeats, underlying the dynamic variation and plasticity of plant centromeres.
Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite gene clusters.

PubMed

Dallery, Jean-Félix; Lapalu, Nicolas; Zampounis, Antonios; Pigné, Sandrine; Luyten, Isabelle; Amselem, Joëlle; Wittenberg, Alexander H J; Zhou, Shiguo; de Queiroz, Marisa V; Robin, Guillaume P; Auger, Annie; Hainaut, Matthieu; Henrissat, Bernard; Kim, Ki-Tae; Lee, Yong-Hwan; Lespinet, Olivier; Schwartz, David C; Thon, Michael R; O'Connell, Richard J

2017-08-29

The ascomycete fungus Colletotrichum higginsianum causes anthracnose disease of brassica crops and the model plant Arabidopsis thaliana. Previous versions of the genome sequence were highly fragmented, causing errors in the prediction of protein-coding genes and preventing the analysis of repetitive sequences and genome architecture. Here, we re-sequenced the genome using single-molecule real-time (SMRT) sequencing technology and, in combination with optical map data, this provided a gapless assembly of all twelve chromosomes except for the ribosomal DNA repeat cluster on chromosome 7. The more accurate gene annotation made possible by this new assembly revealed a large repertoire of secondary metabolism (SM) key genes (89) and putative biosynthetic pathways (77 SM gene clusters). The two mini-chromosomes differed from the ten core chromosomes in being repeat- and AT-rich and gene-poor but were significantly enriched with genes encoding putative secreted effector proteins. Transposable elements (TEs) were found to occupy 7% of the genome by length. Certain TE families showed a statistically significant association with effector genes and SM cluster genes and were transcriptionally active at particular stages of fungal development. All 24 subtelomeres were found to contain one of three highly-conserved repeat elements which, by providing sites for homologous recombination, were probably instrumental in four segmental duplications. The gapless genome of C. higginsianum provides access to repeat-rich regions that were previously poorly assembled, notably the mini-chromosomes and subtelomeres, and allowed prediction of the complete SM gene repertoire. It also provides insights into the potential role of TEs in gene and genome evolution and host adaptation in this asexual pathogen.
A Rickettsia Genome Overrun by Mobile Genetic Elements Provides Insight into the Acquisition of Genes Characteristic of an Obligate Intracellular Lifestyle

PubMed Central

Joardar, Vinita; Williams, Kelly P.; Driscoll, Timothy; Hostetler, Jessica B.; Nordberg, Eric; Shukla, Maulik; Walenz, Brian; Hill, Catherine A.; Nene, Vishvanath M.; Azad, Abdu F.; Sobral, Bruno W.; Caler, Elisabet

2012-01-01

We present the draft genome for the Rickettsia endosymbiont of Ixodes scapularis (REIS), a symbiont of the deer tick vector of Lyme disease in North America. Among Rickettsia species (Alphaproteobacteria: Rickettsiales), REIS has the largest genome sequenced to date (>2 Mb) and contains 2,309 genes across the chromosome and four plasmids (pREIS1 to pREIS4). The most remarkable finding within the REIS genome is the extraordinary proliferation of mobile genetic elements (MGEs), which contributes to a limited synteny with other Rickettsia genomes. In particular, an integrative conjugative element named RAGE (for Rickettsiales amplified genetic element), previously identified in scrub typhus rickettsiae (Orientia tsutsugamushi) genomes, is present on both the REIS chromosome and plasmids. Unlike the pseudogene-laden RAGEs of O. tsutsugamushi, REIS encodes nine conserved RAGEs that include F-like type IV secretion systems similar to that of the tra genes encoded in the Rickettsia bellii and R. massiliae genomes. An unparalleled abundance of encoded transposases (>650) relative to genome size, together with the RAGEs and other MGEs, comprise ∼35% of the total genome, making REIS one of the most plastic and repetitive bacterial genomes sequenced to date. We present evidence that conserved rickettsial genes associated with an intracellular lifestyle were acquired via MGEs, especially the RAGE, through a continuum of genomic invasions. Robust phylogeny estimation suggests REIS is ancestral to the virulent spotted fever group of rickettsiae. As REIS is not known to invade vertebrate cells and has no known pathogenic effects on I. scapularis, its genome sequence provides insight on the origin of mechanisms of rickettsial pathogenicity. PMID:22056929
The map-based genome sequence of Spirodela polyrhiza aligned with its chromosomes, a reference for karyotype evolution.

PubMed

Cao, Hieu Xuan; Vu, Giang Thi Ha; Wang, Wenqin; Appenroth, Klaus J; Messing, Joachim; Schubert, Ingo

2016-01-01

Duckweeds are aquatic monocotyledonous plants of potential economic interest with fast vegetative propagation, comprising 37 species with variable genome sizes (0.158-1.88 Gbp). The genomic sequence of Spirodela polyrhiza, the smallest and the most ancient duckweed genome, needs to be aligned to its chromosomes as a reference and prerequisite to study the genome and karyotype evolution of other duckweed species. We selected physically mapped bacterial artificial chromosomes (BACs) containing Spirodela DNA inserts with little or no repetitive elements as probes for multicolor fluorescence in situ hybridization (mcFISH), using an optimized BAC pooling strategy, to validate its physical map and correlate it with its chromosome complement. By consecutive mcFISH analyses, we assigned the originally assembled 32 pseudomolecules (supercontigs) of the genomic sequences to the 20 chromosomes of S. polyrhiza. A Spirodela cytogenetic map containing 96 BAC markers with an average distance of 0.89 Mbp was constructed. Using a cocktail of 41 BACs in three colors, all chromosome pairs could be individualized simultaneously. Seven ancestral blocks emerged from duplicated chromosome segments of 19 Spirodela chromosomes. The chromosomally integrated genome of S. polyrhiza and the established prerequisites for comparative chromosome painting enable future studies on the chromosome homoeology and karyotype evolution of duckweed species. © 2015 IPK Gatersleben. New Phytologist © 2015 New Phytologist Trust.
Suppressor of sable [Su(s)] and Wdr82 down-regulate RNA from heat-shock-inducible repetitive elements by a mechanism that involves transcription termination

PubMed Central

Brewer-Jensen, Paul; Wilson, Carrie B.; Abernethy, John; Mollison, Lonna; Card, Samantha

2016-01-01

Although RNA polymerase II (Pol II) productively transcribes very long genes in vivo, transcription through extragenic sequences often terminates in the promoter-proximal region and the nascent RNA is degraded. Mechanisms that induce early termination and RNA degradation are not well understood in multicellular organisms. Here, we present evidence that the suppressor of sable [su(s)] regulatory pathway of Drosophila melanogaster plays a role in this process. We previously showed that Su(s) promotes exosome-mediated degradation of transcripts from endogenous repeated elements at an Hsp70 locus (Hsp70-αβ elements). In this report, we identify Wdr82 as a component of this process and show that it works with Su(s) to inhibit Pol II elongation through Hsp70-αβ elements. Furthermore, we show that the unstable transcripts produced during this process are polyadenylated at heterogeneous sites that lack canonical polyadenylation signals. We define two distinct regions that mediate this regulation. These results indicate that the Su(s) pathway promotes RNA degradation and transcription termination through a novel mechanism. PMID:26577379
Effects of questions' repetition and variation on the efficiency of the guilty knowledge test: a reexamination.

PubMed

Ben-Shakhar, Gershon; Elaad, Eitan

2002-10-01

The effect of question repetition and variation on the efficiency of the Guilty Knowledge Test (GKT), based on electrodermal and respiration measures, was examined in a between-subjects experiment with 3 conditions. Each participant was presented with a sequence of 12 biographical questions. In Condition 1, a single question was repeated 12 times; in Condition 4, each of 4 different questions was repeated 3 times; and in Condition 12, 12 different questions were used. A monotonic relationship between the number of different questions used and detection efficiency was observed only with the electrodermal measure (the areas under the receiver operating characteristic curves, obtained with this measure in Conditions 1, 4, and 12 were .68, .81, and .99, respectively). These results demonstrate that a GKT based on multiple questions is superior to the use of many repetitions of a single or a few questions, and it can reach an almost perfect detection efficiency.
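The detection-efficiency figures above are areas under ROC curves. One standard way to compute such an area, sketched here, is as the probability that a randomly chosen guilty participant's score exceeds a randomly chosen innocent participant's score, with ties counted as one half (the Mann-Whitney formulation). The function name and the score lists are hypothetical, and the study may have computed its areas differently.

```python
def roc_auc(guilty_scores, innocent_scores):
    """Area under the ROC curve via pairwise comparison of the two groups."""
    if not guilty_scores or not innocent_scores:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for g in guilty_scores:
        for i in innocent_scores:
            if g > i:
                wins += 1.0   # guilty score correctly ranked higher
            elif g == i:
                wins += 0.5   # a tie counts as half
    return wins / (len(guilty_scores) * len(innocent_scores))
```

An area of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation, which puts the reported .68, .81, and .99 in context.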
Genomic organization of the 260 kb surrounding the waxy locus in a Japonica rice

PubMed

Nagano; Wu; Kawasaki; Kishima; Sano

1999-12-01

The present study was carried out to characterize the molecular organization in the vicinity of the waxy locus in rice. To determine the structural organization of the region surrounding waxy, contiguous clones covering a total of 260 kb were constructed using a bacterial artificial chromosome (BAC) library from the Shimokita variety of Japonica rice. This map also contains 200 overlapping subclones, which allowed construction of a fine physical map with a total of 64 HindIII sites. During the course of constructing the map, we noticed the presence of some repeated regions which might be related to transposable elements. We divided the 260-kb region into 60 segments (average size of 5.7 kb) to use as probes to determine their genomic organization. Hybridization patterns obtained by probing with these segments were classified into four types: class 1, a single or a few bands without a smeared background; class 2, a single or a few bands with a smeared background; class 3, multiple discrete bands without a smeared background; and class 4, only a smeared background. These classes constituted 6.5%, 20.9%, 3.7%, and 68.9% of the 260-kb region, respectively. The distribution of each class revealed that repetitive sequences are a major component in this region, as expected, and that unique sequence regions were mostly no longer than 6 kb due to interruption by repetitive sequences. We discuss how the map constructed here might be a powerful tool for characterization and comparison of the genome structures and the genes around the waxy locus in the Oryza species.
High-repetition-rate interferometric Rayleigh scattering for flow-velocity measurements

NASA Astrophysics Data System (ADS)

Estevadeordal, Jordi; Jiang, Naibo; Cutler, Andrew D.; Felver, Josef J.; Slipchenko, Mikhail N.; Danehy, Paul M.; Gord, James R.; Roy, Sukesh

2018-03-01

High-repetition-rate interferometric-Rayleigh-scattering (IRS) velocimetry is demonstrated for non-intrusive, high-speed flow-velocity measurements. High temporal resolution is obtained with a quasi-continuous burst-mode laser that is capable of operating at 10-100 kHz, providing 10-ms bursts with pulse widths of 5-1000 ns and pulse energy > 100 mJ at 532 nm. Coupled with a high-speed camera system, the IRS method is based on imaging the flow field through an etalon with 8-GHz free spectral range and capturing the Doppler shift of the Rayleigh-scattered light from the flow at multiple points having constructive interference. The seed-laser linewidth permits a laser linewidth of < 150 MHz at 532 nm. The technique is demonstrated in a high-speed jet, and high-repetition-rate image sequences are shown.
Constructing storyboards based on hierarchical clustering analysis

NASA Astrophysics Data System (ADS)

Hasebe, Satoshi; Sami, Mustafa M.; Muramatsu, Shogo; Kikuchi, Hisakazu

2005-07-01

There is a growing need for quick previews of video content, both to improve the accessibility of video archives and to reduce network traffic. In this paper, a storyboard that contains a user-specified number of keyframes is produced from a given video sequence. It is based on hierarchical cluster analysis of feature vectors that are derived from wavelet coefficients of video frames. Consistent use of the extracted feature vectors is the key to avoiding repeated, computationally intensive parsing of the same video sequence. Experimental results suggest that this strategy yields a significant reduction in computational time.
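The idea of choosing a user-specified number of keyframes by hierarchical clustering can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' method: the abstract does not name a linkage criterion, so centroid linkage is assumed here, the feature vectors are made-up placeholders rather than wavelet coefficients, and `select_keyframes` is a hypothetical name.

```python
def centroid(features, members):
    """Mean feature vector of the frames whose indices are in `members`."""
    dim = len(features[0])
    return [sum(features[i][d] for i in members) / len(members) for d in range(dim)]


def sq_dist(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def select_keyframes(features, k):
    """Agglomerative clustering (centroid linkage, assumed) down to k clusters.

    Returns one representative frame index per cluster: the member closest
    to the cluster centroid. The feature vectors are computed once and reused.
    """
    if not 1 <= k <= len(features):
        raise ValueError("k must be between 1 and the number of frames")
    clusters = [[i] for i in range(len(features))]
    while len(clusters) > k:
        best = None
        for a in range(len(clusters)):
            ca = centroid(features, clusters[a])
            for b in range(a + 1, len(clusters)):
                d = sq_dist(ca, centroid(features, clusters[b]))
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]  # merge the closest pair
        del clusters[b]
    keyframes = []
    for members in clusters:
        c = centroid(features, members)
        keyframes.append(min(members, key=lambda i: sq_dist(features[i], c)))
    return sorted(keyframes)


# Placeholder 2-D features for six frames (three visually distinct "shots").
frames = [[0, 0], [1, 0], [2, 0], [50, 50], [51, 50], [90, 0]]
board = select_keyframes(frames, 3)
```

Because the same feature list is passed in for any value of `k`, a different storyboard size can be requested without re-parsing the video, which is the saving the abstract points to.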
Existence of host-related DNA sequences in the schistosome genome.

PubMed

Iwamura, Y; Irie, Y; Kominami, R; Nara, T; Yasuraoka, K

1991-06-01

DNA sequences homologous to the mouse intracisternal A particle and endogenous type C retrovirus were detected in the DNAs of Schistosoma japonicum adults and S. mansoni eggs. Furthermore, other kinds of repetitive sequences in the host genome such as mouse type 1 Alu sequence (B1), mouse type 2 Alu sequence (B2) and mo-2 sequence, a mouse mini-satellite, were also detected in the DNAs from adults and eggs of S. japonicum and eggs of S. mansoni. Almost all of the sequences described above were absent in the DNAs of S. mansoni adults. The DNA fingerprints of schistosomes, using the mo-2 sequence, were indistinguishable from each other and resembled those of their murine hosts. Moreover, the mo-2 sequence was hypermethylated in the DNAs of schistosomes and its amount varied among them. These facts indicate that host-related sequences are actually present in schistosomes and that the mo-2 repetitive sequence probably exists extrachromosomally.
RNA editing of non-coding RNA and its role in gene regulation.

PubMed

Daniel, Chammiran; Lagergren, Jens; Öhman, Marie

2015-10-01

It has long been known that repetitive elements, particularly Alu sequences in human, are edited by the adenosine deaminases acting on RNA (ADAR) family. The functional interpretation of these events has been even more difficult than that of editing events in coding sequences, but today there is an emerging understanding of their downstream effects. A surprisingly large fraction of the human transcriptome contains inverted Alu repeats, often forming long double stranded structures in RNA transcripts, typically occurring in introns and UTRs of protein coding genes. Alu repeats are also common in other primates, and similar inverted repeats can frequently be found in non-primates, although the latter are less prone to duplex formation. In human, as many as 700,000 Alu elements have been identified as substrates for RNA editing, of which many are edited at several sites. In fact, recent advancements in transcriptome sequencing techniques and bioinformatics have revealed that the human editome comprises at least a hundred million adenosine to inosine (A-to-I) editing sites in Alu sequences. Although substantial additional efforts are required in order to map the editome, already present knowledge provides an excellent starting point for studying cis-regulation of editing. In this review, we will focus on editing of long stem loop structures in the human transcriptome and how it can affect gene expression. Copyright © 2015 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
miR-ID: A novel, circularization-based platform for detection of microRNAs

PubMed Central

Kumar, Pavan; Johnston, Brian H.; Kazakov, Sergei A.

2011-01-01

MicroRNAs (miRNAs) are important regulators of gene expression and have great potential as biomarkers, prognostic indicators, and therapeutic targets. Determining the expression patterns of these molecules is essential for elucidating their biogenesis, regulation, relation to disease, and response to therapy. Although PCR-based assays are commonly used for expression profiling of miRNAs, the small size, sequence heterogeneity, and (in some cases) end modifications of miRNAs constrain the performance of existing PCR methods. Here we introduce miR-ID, a novel method that avoids these constraints while providing superior sensitivity and sequence specificity at a lower cost. It also has the unique ability to differentiate unmodified small RNAs from those carrying 2′-OMe groups at their 3′-ends while detecting both forms. miR-ID is comprised of the following steps: (1) circularization of the miRNA by a ligase; (2) reverse transcription of the circularized miRNA (RTC), producing tandem repeats of a DNA sequence complementary to the miRNA; and (3) qPCR amplification of segments of this multimeric cDNA using 5′-overlapping primers and a nonspecific dye such as SYBR Green. No chemically modified probes (e.g., TaqMan) or primers (e.g., LNA) are required. The circular RNA and multimeric cDNA templates provide unmatched flexibility in the positioning of primers, which may include straddling the boundaries between these repetitive miRNA sequences. miR-ID is based on new findings that are themselves of general interest, including reverse transcription of small RNA circles and the use of 5′-overlapping primers for detection of repetitive sequences by qPCR. PMID:21169480
BAC sequencing using pooled methods.

PubMed

Saski, Christopher A; Feltus, F Alex; Parida, Laxmi; Haiminen, Niina

2015-01-01

Shotgun sequencing and assembly of a large, complex genome can be expensive, and accurately reconstructing the true genome sequence can be challenging. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are the main factors that plague de novo genome sequencing projects, which typically result in highly fragmented assemblies from which it is difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce overall sequencing costs and efforts that target a genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.

A Micro-Computer Model for Army Air Defense Training.

DTIC Science & Technology

1985-03-01

generator. The period is 32763 numbers generated before a repetitive sequence is encountered on the development system. Chi-Squared tests for frequency... Tests Periodicity. The period is 32763 numbers generated before a repetitive sequence is encountered on the development system. This was... positions in the test array. This was done with several different random number seeds. In each case 32763 random numbers were generated before a
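The kind of periodicity check described in this excerpt, counting how many numbers a generator emits before its sequence repeats, can be sketched with Brent's cycle-detection algorithm. The report does not say which generator or test procedure was used, so the linear congruential generator below and its parameters (`a`, `c`, `m`) are purely illustrative assumptions, as are the function names.

```python
def make_lcg(a, c, m):
    """Return the step function x -> (a * x + c) mod m of a hypothetical LCG."""
    def step(state):
        return (a * state + c) % m
    return step


def cycle_period(step, seed):
    """Length of the cycle the sequence seed, step(seed), ... eventually enters.

    Brent's algorithm: uses constant memory, so no table of past values
    is needed even for long periods.
    """
    power = lam = 1
    tortoise = seed
    hare = step(seed)
    while tortoise != hare:
        if power == lam:      # start a new power-of-two search window
            tortoise = hare
            power *= 2
            lam = 0
        hare = step(hare)
        lam += 1
    return lam


# Assumed parameters: modulus 2**15, odd increment, multiplier congruent to 1 mod 4,
# which satisfies the Hull-Dobell conditions for a full-period generator.
generator = make_lcg(a=1229, c=1, m=2 ** 15)
period = cycle_period(generator, seed=12345)
```

Running the check from several seeds, as the excerpt describes, confirms whether the period depends on the starting value.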
The draft genome of sweet orange (Citrus sinensis).

PubMed

Xu, Qiang; Chen, Ling-Ling; Ruan, Xiaoan; Chen, Dijun; Zhu, Andan; Chen, Chunli; Bertrand, Denis; Jiao, Wen-Biao; Hao, Bao-Hai; Lyon, Matthew P; Chen, Jiongjiong; Gao, Song; Xing, Feng; Lan, Hong; Chang, Ji-Wei; Ge, Xianhong; Lei, Yang; Hu, Qun; Miao, Yin; Wang, Lun; Xiao, Shixin; Biswas, Manosh Kumar; Zeng, Wenfang; Guo, Fei; Cao, Hongbo; Yang, Xiaoming; Xu, Xi-Wen; Cheng, Yun-Jiang; Xu, Juan; Liu, Ji-Hong; Luo, Oscar Junhong; Tang, Zhonghui; Guo, Wen-Wu; Kuang, Hanhui; Zhang, Hong-Yu; Roose, Mikeal L; Nagarajan, Niranjan; Deng, Xiu-Xin; Ruan, Yijun

2013-01-01

Oranges are an important nutritional source for human health and have immense economic value. Here we present a comprehensive analysis of the draft genome of sweet orange (Citrus sinensis). The assembled sequence covers 87.3% of the estimated orange genome, which is relatively compact, as 20% is composed of repetitive elements. We predicted 29,445 protein-coding genes, half of which are in the heterozygous state. With additional sequencing of two more citrus species and comparative analyses of seven citrus genomes, we present evidence to suggest that sweet orange originated from a backcross hybrid between pummelo and mandarin. Focused analysis on genes involved in vitamin C metabolism showed that GalUR, encoding the rate-limiting enzyme of the galacturonate pathway, is significantly upregulated in orange fruit, and the recent expansion of this gene family may provide a genomic basis. This draft genome represents a valuable resource for understanding and improving many important citrus traits in the future.
Whats, hows and whys of programmed DNA elimination in Tetrahymena

PubMed Central

Noto, Tomoko

2017-01-01

Programmed genome rearrangements in ciliates provide fascinating examples of flexible epigenetic genome regulations and important insights into the interaction between transposable elements (TEs) and host genomes. DNA elimination in Tetrahymena thermophila removes approximately 12 000 internal eliminated sequences (IESs), which correspond to one-third of the genome, when the somatic macronucleus (MAC) differentiates from the germline micronucleus (MIC). More than half of the IESs, many of which show high similarity to TEs, are targeted for elimination in cis by the small RNA-mediated genome comparison of the MIC to the MAC. Other IESs are targeted for elimination in trans by the same small RNAs through repetitive sequences. Furthermore, the small RNA–heterochromatin feedback loop ensures robust DNA elimination. Here, we review an updated picture of the DNA elimination mechanism, discuss the physiological and evolutionary roles of DNA elimination, and outline the key questions that remain unanswered. PMID:29021213
Newly discovered young CORE-SINEs in marsupial genomes.

PubMed

Munemasa, Maruo; Nikaido, Masato; Nishihara, Hidenori; Donnellan, Stephen; Austin, Christopher C; Okada, Norihiro

2008-01-15

Although recent mammalian genome projects have uncovered a large part of genomic component of various groups, several repetitive sequences still remain to be characterized and classified for particular groups. The short interspersed repetitive elements (SINEs) distributed among marsupial genomes are one example. We have identified and characterized two new SINEs from marsupial genomes that belong to the CORE-SINE family, characterized by a highly conserved "CORE" domain. PCR and genomic dot blot analyses revealed that the distribution of each SINE shows distinct patterns among the marsupial genomes, implying different timing of their retroposition during the evolution of marsupials. The members of Mar3 (Marsupialia 3) SINE are distributed throughout the genomes of all marsupials, whereas the Mac1 (Macropodoidea 1) SINE is distributed specifically in the genomes of kangaroos. Sequence alignment of the Mar3 SINEs revealed that they can be further divided into four subgroups, each of which has diagnostic nucleotides. The insertion patterns of each SINE at particular genomic loci, together with the distribution patterns of each SINE, suggest that the Mar3 SINEs have intensively amplified after the radiation of diprotodontians, whereas the Mac1 SINE has amplified only slightly after the divergence of hypsiprimnodons from other macropods. By compiling the information of CORE-SINEs characterized to date, we propose a comprehensive picture of how SINE evolution occurred in the genomes of marsupials.
Altered imprinted gene expression and methylation patterns in mid-gestation aborted cloned porcine fetuses and placentas.

PubMed

Zhang, Xiaoyang; Wang, Dongxu; Han, Yang; Duan, Feifei; Lv, Qinyan; Li, Zhanjun

2014-11-01

To determine the expression patterns of imprinted genes and their methylation status in aborted cloned porcine fetuses and placentas. RNA and DNA were prepared from fetuses and placentas that were produced by SCNT and controls from artificial insemination. The expression of 18 imprinted genes was determined by quantitative real-time PCR (q-PCR). Bisulfite sequencing PCR (BSP) was conducted to determine the methylation status of PRE-1 short interspersed repetitive element (SINE), satellite DNA and H19 differentially methylated region 3 (DMR3). The weight, imprinted gene expression and genome-wide DNA methylation patterns were compared between the mid-gestation aborted and normal control samples. The results showed hypermethylation of PRE-1 and satellite sequences, the aberrant expression of imprinted genes, and the hypomethylation of H19 DMR3 occurred in mid-gestation aborted fetuses and placentas. Cloned pigs generated by somatic cell nuclear transfer (SCNT) showed a greater ratio of early abortion during mid-gestation than did normal controls because of the incomplete epigenetic reprogramming of the donor cells. Altered expression of imprinted genes and the hypermethylation profile of the repetitive regions (PRE-1 and satellite DNA) may be associated with defective development and early abortion of cloned pigs, emphasizing the importance of epigenetics during pregnancy and implications thereof for patient-specific embryonic stem cells for human therapeutic cloning and improvement of human assisted reproduction.
Evidence for structural stacking and repetition in the greenstones of the Kalgoorlie district, western Australia

NASA Technical Reports Server (NTRS)

Martyn, J. E.

1986-01-01

Most previous stratigraphic interpretations of the southern part of the Norseman-Wiluna Greenstone Belt have proposed polycyclic sequences. These invoked two and sometimes three successive suites of mafic and/or ultramafic volcanics and intrusives separated by felsic volcanics and immature clastic sediments; however, no distinctive lithological differences were reported between successive mafic-ultramafic sequences. When interpretations of the area further to the north are integrated, a total of four separate major mafic-ultramafic suites emerges for a large part of the Norseman-Wiluna Belt. Although the author does not intend to imply that all polycyclic stratigraphies are wrong in principle, such a situation seems suspiciously over-complex and stimulates the need to look critically at the individual areas where the stratigraphies have been erected. For the Kalgoorlie area in the south, some of the schemes have already provoked scepticism and a simpler model consisting of one cycle subject to structural repetition has been evolved by workers in the Geological Survey of Western Australia for part of this area. The latter drew attention to the carbon copy similarity between the elements of some polycyclic stratigraphies. Much more regionally extensive integrated structural and stratigraphic data is still required to evaluate the relationship between structure and stratigraphy more fully, an objective substantially limited by poor outcrop and deep weathering, but with due effort, far from unattainable.
Evidence for structural stacking and repetition in the greenstones of the Kalgoorlie district, western Australia

NASA Astrophysics Data System (ADS)

Martyn, J. E.

Most previous stratigraphic interpretations of the southern part of the Norseman-Wiluna Greenstone Belt have proposed polycyclic sequences. These invoked two and sometimes three successive suites of mafic and/or ultramafic volcanics and intrusives separated by felsic volcanics and immature clastic sediments; however, no distinctive lithological differences were reported between successive mafic-ultramafic sequences. When interpretations of the area further to the north are integrated, a total of four separate major mafic-ultramafic suites emerges for a large part of the Norseman-Wiluna Belt. Although the author does not intend to imply that all polycyclic stratigraphies are wrong in principle, such a situation seems suspiciously over-complex and stimulates the need to look critically at the individual areas where the stratigraphies have been erected. For the Kalgoorlie area in the south, some of the schemes have already provoked scepticism and a simpler model consisting of one cycle subject to structural repetition has been evolved by workers in the Geological Survey of Western Australia for part of this area. The latter drew attention to the carbon copy similarity between the elements of some polycyclic stratigraphies. Much more regionally extensive integrated structural and stratigraphic data is still required to evaluate the relationship between structure and stratigraphy more fully, an objective substantially limited by poor outcrop and deep weathering, but with due effort, far from unattainable.
CACTA-superfamily transposable element is inserted in MYB transcription factor gene of soybean line producing variegated seeds.

PubMed

Yan, Fan; Di, Shaokang; Takahashi, Ryoji

2015-08-01

The R gene of soybean, presumably encoding a MYB transcription factor, controls seed coat color. The gene consists of multiple alleles, R (black), r-m (black spots and (or) concentric streaks on brown seed), and r (brown seed). This study was conducted to determine the structure of the MYB transcription factor gene in a near-isogenic line (NIL) having r-m allele. PCR amplification of a fragment of the candidate gene Glyma.09G235100 generated a fragment of about 1 kb in the soybean cultivar Clark, whereas a fragment of about 14 kb in addition to fragments of 1 and 1.4 kb were produced in L72-2040, a Clark 63 NIL with the r-m allele. Clark 63 is a NIL of Clark with the rxp and Rps1 alleles. A DNA fragment of 13 060 bp was inserted in the intron of Glyma.09G235100 in L72-2040. The fragment had the CACTA motif at both ends, imperfect terminal inverted repeats (TIR), inverse repetition of short sequence motifs close to the 5' and 3' ends, and a duplication of three nucleotides at the site of integration, indicating that it belongs to a CACTA-superfamily transposable element. We designated the element as Tgm11. Overall nucleotide sequence, motifs of TIR, and subterminal repeats were similar to those of Tgm1 and Tgs1, suggesting that these elements comprise a family.
Centromere Locations in Brassica A and C Genomes Revealed Through Half-Tetrad Analysis

PubMed Central

Mason, Annaliese S.; Rousseau-Gueutin, Mathieu; Morice, Jérôme; Bayer, Philipp E.; Besharat, Naghmeh; Cousin, Anouska; Pradhan, Aneeta; Parkin, Isobel A. P.; Chèvre, Anne-Marie; Batley, Jacqueline; Nelson, Matthew N.

2016-01-01

Locating centromeres on genome sequences can be challenging. The high density of repetitive elements in these regions makes sequence assembly problematic, especially when using short-read sequencing technologies. It can also be difficult to distinguish between active and recently extinct centromeres through sequence analysis. An effective solution is to identify genetically active centromeres (functional in meiosis) by half-tetrad analysis. This genetic approach involves detecting heterozygosity along chromosomes in segregating populations derived from gametes (half-tetrads). Unreduced gametes produced by first division restitution mechanisms comprise complete sets of nonsister chromatids. Along these chromatids, heterozygosity is maximal at the centromeres, and homologous recombination events result in homozygosity toward the telomeres. We genotyped populations of half-tetrad-derived individuals (from Brassica interspecific hybrids) using a high-density array of physically anchored SNP markers (Illumina Brassica 60K Infinium array). Mapping the distribution of heterozygosity in these half-tetrad individuals allowed the genetic mapping of all 19 centromeres of the Brassica A and C genomes to the reference Brassica napus genome. Gene and transposable element density across the B. napus genome were also assessed and corresponded well to previously reported genetic map positions. Known centromere-specific sequences were located in the reference genome, but mostly matched unanchored sequences, suggesting that the core centromeric regions may not yet be assembled into the pseudochromosomes of the reference genome. The increasing availability of genetic markers physically anchored to reference genomes greatly simplifies the genetic and physical mapping of centromeres using half-tetrad analysis. We discuss possible applications of this approach, including in species where half-tetrads are currently difficult to isolate. PMID:26614742
Centromere Locations in Brassica A and C Genomes Revealed Through Half-Tetrad Analysis.

PubMed

Mason, Annaliese S; Rousseau-Gueutin, Mathieu; Morice, Jérôme; Bayer, Philipp E; Besharat, Naghmeh; Cousin, Anouska; Pradhan, Aneeta; Parkin, Isobel A P; Chèvre, Anne-Marie; Batley, Jacqueline; Nelson, Matthew N

2016-02-01

Locating centromeres on genome sequences can be challenging. The high density of repetitive elements in these regions makes sequence assembly problematic, especially when using short-read sequencing technologies. It can also be difficult to distinguish between active and recently extinct centromeres through sequence analysis. An effective solution is to identify genetically active centromeres (functional in meiosis) by half-tetrad analysis. This genetic approach involves detecting heterozygosity along chromosomes in segregating populations derived from gametes (half-tetrads). Unreduced gametes produced by first division restitution mechanisms comprise complete sets of nonsister chromatids. Along these chromatids, heterozygosity is maximal at the centromeres, and homologous recombination events result in homozygosity toward the telomeres. We genotyped populations of half-tetrad-derived individuals (from Brassica interspecific hybrids) using a high-density array of physically anchored SNP markers (Illumina Brassica 60K Infinium array). Mapping the distribution of heterozygosity in these half-tetrad individuals allowed the genetic mapping of all 19 centromeres of the Brassica A and C genomes to the reference Brassica napus genome. Gene and transposable element density across the B. napus genome were also assessed and corresponded well to previously reported genetic map positions. Known centromere-specific sequences were located in the reference genome, but mostly matched unanchored sequences, suggesting that the core centromeric regions may not yet be assembled into the pseudochromosomes of the reference genome. The increasing availability of genetic markers physically anchored to reference genomes greatly simplifies the genetic and physical mapping of centromeres using half-tetrad analysis. We discuss possible applications of this approach, including in species where half-tetrads are currently difficult to isolate. Copyright © 2016 by the Genetics Society of America.
Excitation and relaxation of metastable atomic states in an active medium of a repetitively pulsed copper vapour laser

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bokhan, P A; Zakrevskii, D E; Lavrukhin, M A

2016-02-28

The influence of a pre-pulse population of copper atom metastable states and their sub-population at a current pulse edge on the copper vapour laser pulse energy is studied under optimal temperature conditions. Experiments have been performed with active elements of a commercial laser having an internal diameter of a discharge channel of 14 and 20 mm. It is found that at a pulse repetition frequency of 12 – 14 kHz, corresponding to a maximal output power, the reduction of the energy due to a residual population of metastable states is by an order of magnitude less than due to their sub-population at a current pulse edge. The modelling based on the experimental results obtained has shown that in the case of an active element with an internal diameter of 14 mm, a decrease in the pulse leading edge from ∼25 ns to 0.6 ns does not reduce the laser pulse energy up to the repetition frequency of ∼50 kHz at an average output power of 70 W m⁻¹ and efficiency of ∼11%. (lasers)
The Prevalence and Molecular Epidemiology of Multidrug-Resistant Enterobacteriaceae Colonization in a Pediatric Intensive Care Unit.

PubMed

Suwantarat, Nuntra; Logan, Latania K; Carroll, Karen C; Bonomo, Robert A; Simner, Patricia J; Rudin, Susan D; Milstone, Aaron M; Tekle, Tsigereda; Ross, Tracy; Tamma, Pranita D

2016-05-01

To determine the prevalence and acquisition of extended-spectrum β-lactamases (ESBLs), plasmid-mediated AmpCs (pAmpCs), and carbapenemases ("MDR Enterobacteriaceae") colonizing children admitted to a pediatric intensive care unit (PICU). Prospective study. 40-bed PICU. Admission and weekly thereafter rectal surveillance swabs were collected on all pediatric patients during a 6-month study period. Routine phenotypic identification and antibiotic susceptibility testing were performed. Enterobacteriaceae displaying characteristic resistance profiles underwent further molecular characterization to identify genetic determinants of resistance likely to be transmitted on mobile genetic elements and to evaluate relatedness of strains including DNA microarray, multilocus sequence typing, repetitive sequence-based PCR, and hsp60 sequencing typing. Evaluating 854 swabs from unique children, the overall prevalence of colonization with an MDR Enterobacteriaceae upon admission to the PICU based on β-lactamase gene identification was 4.3% (n=37), including 2.8% ESBLs (n=24), 1.3% pAmpCs (n=11), and 0.2% carbapenemases (n=2). Among 157 pediatric patients contributing 603 subsequent weekly swabs, 6 children (3.8%) acquired an incident MDR Enterobacteriaceae during their PICU stay. One child acquired a pAmpC (E. coli containing blaDHA) related to an isolate from another patient. Approximately 4% of children admitted to a PICU were colonized with MDR Enterobacteriaceae (based on β-lactamase gene identification) and an additional 4% of children who remained in the PICU for at least 1 week acquired 1 of these organisms during their PICU stay. The acquired MDR Enterobacteriaceae were relatively heterogeneous, suggesting that a single source was not responsible for the introduction of these resistance mechanisms into the PICU setting.
The Prevalence and Molecular Epidemiology of Multidrug-Resistant Enterobacteriaceae Colonization in a Pediatric Intensive Care Unit

PubMed Central

Suwantarat, Nuntra; Logan, Latania K.; Carroll, Karen C.; Bonomo, Robert A.; Simner, Patricia J.; Rudin, Susan D.; Milstone, Aaron M.; Tekle, Tsigereda; Ross, Tracy; Tamma, Pranita D.

2016-01-01

OBJECTIVE To determine the prevalence and acquisition of extended-spectrum β-lactamases (ESBLs), plasmid-mediated AmpCs (pAmpCs), and carbapenemases (“MDR Enterobacteriaceae”) colonizing children admitted to a pediatric intensive care unit (PICU). DESIGN Prospective study. SETTING 40-bed PICU. METHODS Admission and weekly thereafter rectal surveillance swabs were collected on all pediatric patients during a 6-month study period. Routine phenotypic identification and antibiotic susceptibility testing were performed. Enterobacteriaceae displaying characteristic resistance profiles underwent further molecular characterization to identify genetic determinants of resistance likely to be transmitted on mobile genetic elements and to evaluate relatedness of strains including DNA microarray, multilocus sequence typing, repetitive sequence-based PCR, and hsp60 sequencing typing. RESULTS Evaluating 854 swabs from unique children, the overall prevalence of colonization with an MDR Enterobacteriaceae upon admission to the PICU based on β-lactamase gene identification was 4.3% (n = 37), including 2.8% ESBLs (n = 24), 1.3% pAmpCs (n = 11), and 0.2% carbapenemases (n = 2). Among 157 pediatric patients contributing 603 subsequent weekly swabs, 6 children (3.8%) acquired an incident MDR Enterobacteriaceae during their PICU stay. One child acquired a pAmpC (E. coli containing blaDHA) related to an isolate from another patient. CONCLUSIONS Approximately 4% of children admitted to a PICU were colonized with MDR Enterobacteriaceae (based on β-lactamase gene identification) and an additional 4% of children who remained in the PICU for at least 1 week acquired 1 of these organisms during their PICU stay. The acquired MDR Enterobacteriaceae were relatively heterogeneous, suggesting that a single source was not responsible for the introduction of these resistance mechanisms into the PICU setting. PMID:26856439
Analysis of laser pumping and thermal effects based on element analysis

NASA Astrophysics Data System (ADS)

Cui, Li; Liu, Zhijia; Zhang, Yizhuo; Han, Juan

2018-03-01

The thermal effect is a bottleneck that limits the output of high-power, high-beam-quality lasers, and it worsens as pump power increases. Thermal effects can be reduced through the choice of pumping scheme, laser medium shape, cooling method, and other aspects. In this article, finite element analysis software was used to analyze and compare the thermal effects in Nd:Glass and Nd:YAG laser media. The causes of thermal effects and the factors that influence their distribution in the laser medium were analyzed, including the light source, the laser medium shape, and the working mode. Nd:Glass is more suitable for low-repetition-frequency, high-energy pulsed laser output, due to its large size, line width and so on, and Nd:YAG is more suitable for continuous or high-repetition-rate laser output, due to its higher thermal conductivity.
The devil is in the details: Transposable element analysis of the Tasmanian devil genome.

PubMed

Nilsson, Maria A

2016-01-01

The third marsupial genome was sequenced from the Tasmanian devil (Sarcophilus harrisii), a species that is currently being driven to extinction by a rare transmissible cancer. The transposable element (TE) landscape of the Tasmanian devil genome revealed that the main driver of retrotransposition, the Long INterspersed Element 1 (LINE1), seems to have become inactivated during the past 12 million years. Strangely, the Short INterspersed Elements (SINEs), which normally hijack the LINE1 retrotransposition system, became inactive prior to LINE1, around 30 million years ago. The SINE inactivation was verified in vitro in several species. Here I discuss that the apparent LINE1 inactivation might be caused by a genome assembly artifact. The repetitive fraction of any genome is highly complex to assemble and the observed problems are not unique to the Tasmanian devil genome.
Anaerobically Grown Escherichia coli Has an Enhanced Mutation Rate and Distinct Mutational Spectra

PubMed Central

Shewaramani, Sonal; Finn, Thomas J.; Kassen, Rees; Rainey, Paul B.

2017-01-01

Oxidative stress is a major cause of mutation but little is known about how growth in the absence of oxygen impacts the rate and spectrum of mutations. We employed long-term mutation accumulation experiments to directly measure the rates and spectra of spontaneous mutation events in Escherichia coli populations propagated under aerobic and anaerobic conditions. To detect mutations, whole genome sequencing was coupled with methods of analysis sufficient to identify a broad range of mutational classes, including structural variants (SVs) generated by movement of repetitive elements. The anaerobically grown populations displayed a mutation rate nearly twice that of the aerobic populations, showed distinct asymmetric mutational strand biases, and greater insertion element activity. Consistent with mutation rate and spectra observations, genes for transposition and recombination repair associated with SVs were up-regulated during anaerobic growth. Together, these results define differences in mutational spectra affecting the evolution of facultative anaerobes. PMID:28103245
Genomic insight into the common carp (Cyprinus carpio) genome by sequencing analysis of BAC-end sequences

PubMed Central

2011-01-01

Background Common carp is one of the most important aquaculture teleost fish in the world. Common carp and other closely related Cyprinidae species provide over 30% of aquaculture production in the world. However, common carp genomic resources are still relatively underdeveloped. BAC end sequences (BES) are important resources for genome research on BAC-anchored genetic marker development, linkage map and physical map integration, and whole genome sequence assembling and scaffolding. Result To develop such valuable resources in common carp (Cyprinus carpio), a total of 40,224 BAC clones were sequenced on both ends, generating 65,720 clean BES with an average read length of 647 bp after sequence processing, representing 42,522,168 bp or 2.5% of the common carp genome. The first survey of the common carp genome was conducted with various bioinformatics tools. The common carp genome contains over 17.3% of repetitive elements with GC content of 36.8% and 518 transposon ORFs. To identify and develop BAC-anchored microsatellite markers, a total of 13,581 microsatellites were detected from 10,355 BES. The coding regions of 7,127 genes were recognized from 9,443 BES on 7,453 BACs, with 1,990 BACs having genes on both ends. To evaluate the similarity to the genome of the closely related zebrafish, BES of common carp were aligned against the zebrafish genome. A total of 39,335 BES of common carp have conserved homologs in the zebrafish genome, which demonstrates the high similarity between the zebrafish and common carp genomes and indicates the feasibility of comparative mapping between zebrafish and common carp once a physical map of common carp is available. Conclusion BAC end sequences are great resources for the first genome-wide survey of common carp. The repetitive DNA was estimated to be approximately 28% of the common carp genome, indicating the higher complexity of the genome. Comparative analysis mapped around 40,000 BES to the zebrafish genome and established over 3,100 microsyntenies, covering over 50% of the zebrafish genome. BES of common carp are tremendous tools for comparative mapping between the two closely related species, zebrafish and common carp, which should facilitate both structural and functional genome analysis in common carp. PMID:21492448
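The sequencing figures quoted in the abstract above can be cross-checked with simple arithmetic. The Python sketch below uses only numbers stated in the abstract; the variable names are illustrative, and the implied genome size is a back-calculation from the rounded 2.5% figure, not a value reported by the authors.

```python
# Figures taken from the abstract above.
clean_bes = 65_720          # number of clean BAC end sequences
total_bp = 42_522_168       # total bases in the clean BES
genome_fraction = 0.025     # "2.5% of common carp genome" (rounded in the source)

# Mean read length should agree with the reported 647 bp.
mean_read_length = total_bp / clean_bes

# Genome size implied by the rounded fraction (an estimate only).
implied_genome_bp = total_bp / genome_fraction

print(f"mean read length: {mean_read_length:.0f} bp")
print(f"implied genome size: {implied_genome_bp / 1e9:.2f} Gb")
```

Because the 2.5% figure is rounded, the implied genome size should be read as an order-of-magnitude consistency check rather than a measurement.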
The complete mitochondrial genome structure of snow leopard Panthera uncia.

PubMed

Wei, Lei; Wu, Xiaobing; Jiang, Zhigang

2009-05-01

The complete mitochondrial genome (mtDNA) of snow leopard Panthera uncia was obtained by using the polymerase chain reaction (PCR) technique based on the PCR fragments of 30 primers we designed. The entire mtDNA sequence was 16,773 base pairs (bp) in length, and the base composition was: A-5,357 bp (31.9%); C-4,444 bp (26.5%); G-2,428 bp (14.5%); T-4,544 bp (27.1%). The structural characteristics of the P. uncia mitochondrial genome were highly similar to those of Felis catus, Acinonyx jubatus, Neofelis nebulosa and other mammals. However, we found several distinctive features of the mitochondrial genome of Panthera uncia. First, the termination codon of COIII was TAA, which differed from those of F. catus, A. jubatus and N. nebulosa. Second, tRNA(Ser) (AGY), which lacked the "DHU" arm, could not be folded into the typical cloverleaf-shaped structure. Third, in the control region, a long repetitive sequence (32 bp) was found with 2 repeats in the RS-2 region, while one short repetitive segment (9 bp) was found with 15 repeats in the RS-3 region. We performed phylogenetic analysis based on a 3,816 bp concatenated sequence of 12S rRNA, 16S rRNA, ND2, ND4, ND5, Cyt b and ATP8 for P. uncia and other related species; the result indicated that P. uncia and P. leo were sister species, which differed from previous findings.
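As a consistency check on the base composition reported above, the per-base counts can be summed and converted to percentages. This is a minimal Python sketch using only the counts given in the abstract; the dictionary and variable names are illustrative.

```python
# Base counts as reported in the abstract above.
counts = {"A": 5357, "C": 4444, "G": 2428, "T": 4544}

# The counts should add up to the reported genome length.
total = sum(counts.values())

# Percentage of each base, rounded to one decimal as in the abstract.
percent = {base: round(100 * n / total, 1) for base, n in counts.items()}

print(total)
print(percent)
```

The counts sum to the stated genome length, and the rounded percentages match those quoted in the abstract.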
Preliminary investigation of electrothermal vaporization sample introduction for inductively coupled plasma time-of-flight mass spectrometry.

PubMed

Mahoney, P P; Ray, S J; Li, G; Hieftje, G M

1999-04-01

The coupling of an electrothermal vaporization (ETV) apparatus to an inductively coupled plasma time-of-flight mass spectrometer (ICP-TOFMS) is described. The ability of the ICP-TOFMS to produce complete elemental mass spectra at high repetition rates is experimentally demonstrated. A signal-averaging data acquisition board is employed to rapidly record complete elemental spectra throughout the vaporization stage of the ETV temperature cycle; a solution containing 34 elements is analyzed. The reduction of both molecular and atomic isobaric interferences through the temperature program of the furnace is demonstrated. Isobaric overlaps among the isotopes of cadmium, tin, and indium are resolved by exploiting differences in the vaporization characteristics of the elements. Figures of merit for the system are defined with several different data acquisition schemes capable of operating at the high repetition rate of the TOF instrument. With the use of both ion counting and a boxcar averager, the dynamic range is shown to be linear over a range of at least 6 orders of magnitude. A pair of boxcar averagers are used to measure the isotope ratio for silver with a precision of 1.9% RSD, despite a cycle-to-cycle precision of 19% RSD. Detection limits of 10-80 fg are calculated for seven elements, based upon a 10-microL injection.
Anchoring and ordering NGS contig assemblies by population sequencing (POPSEQ)

PubMed Central

Mascher, Martin; Muehlbauer, Gary J; Rokhsar, Daniel S; Chapman, Jarrod; Schmutz, Jeremy; Barry, Kerrie; Muñoz-Amatriaín, María; Close, Timothy J; Wise, Roger P; Schulman, Alan H; Himmelbach, Axel; Mayer, Klaus FX; Scholz, Uwe; Poland, Jesse A; Stein, Nils; Waugh, Robbie

2013-01-01

Next-generation whole-genome shotgun assemblies of complex genomes are highly useful, but fail to link nearby sequence contigs with each other or provide a linear order of contigs along individual chromosomes. Here, we introduce a strategy based on sequencing progeny of a segregating population that allows de novo production of a genetically anchored linear assembly of the gene space of an organism. We demonstrate the power of the approach by reconstructing the chromosomal organization of the gene space of barley, a large, complex and highly repetitive 5.1 Gb genome. We evaluate the robustness of the new assembly by comparison to a recently released physical and genetic framework of the barley genome, and to various genetically ordered sequence-based genotypic datasets. The method is independent of the need for any prior sequence resources, and will enable rapid and cost-efficient establishment of powerful genomic information for many species. PMID:23998490

Short, interspersed, and repetitive DNA sequences in Spiroplasma species.

PubMed

Nur, I; LeBlanc, D J; Tully, J G

1987-03-01

Small fragments of DNA from an 8-kbp plasmid, pRA1, from a plant pathogenic strain of Spiroplasma citri were shown previously to be present in the chromosomal DNA of at least two species of Spiroplasma. We describe here the shot-gun cloning of chromosomal DNA from S. citri Maroc and the identification of two distinct sequences exhibiting homology to pRA1. Further subcloning experiments provided specific molecular probes for the identification of these two sequences in chromosomal DNA from three distinct plant pathogenic species of Spiroplasma. The results of Southern blot hybridization indicated that each of the pRA1-associated sequences is present as multiple copies in short, dispersed, and repetitive sequences in the chromosomes of these three strains. None of the sequences was detectable in chromosomal DNA from an additional nine Spiroplasma strains examined.
Transcription of Gypsy Elements in a Y-Chromosome Male Fertility Gene of Drosophila Hydei

PubMed Central

Hochstenbach, R.; Harhangi, H.; Schouren, K.; Bindels, P.; Suijkerbuijk, R.; Hennig, W.

1996-01-01

We have found that defective gypsy retrotransposons are a major constituent of the lampbrush loop pair Nooses in the short arm of the Y chromosome of Drosophila hydei. The loop pair is formed by male fertility gene Q during the primary spermatocyte stage of spermatogenesis, each loop being a single transcription unit with an estimated length of 260 kb. Using fluorescent in situ hybridization, we show that throughout the loop transcripts gypsy elements are interspersed with blocks of a tandemly repetitive Y-specific DNA sequence, ay1. Nooses transcripts containing both sequence types show a wide size range on Northern blots, do not migrate to the cytoplasm, and are degraded just before the first meiotic division. Only one strand of ay1 and only the coding strand of gypsy can be detected in the loop transcripts. However, as cloned genomic DNA fragments also display opposite orientations of ay1 and gypsy, such DNA sections cannot be part of the Nooses. Hence, they are most likely derived from the flanking heterochromatin. The direction of transcription of ay1 and gypsy thus appears to be of a functional significance. PMID:8852843
Genomic Sequence around Butterfly Wing Development Genes: Annotation and Comparative Analysis

PubMed Central

Conceição, Inês C.; Long, Anthony D.; Gruber, Jonathan D.; Beldade, Patrícia

2011-01-01

Background Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. Methodology/Principal Findings We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations) and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes). Conclusions The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1) the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2) the high conservation of non-coding sequence around the genes wingless and Ecdysone receptor, both involved in multiple developmental processes including wing pattern formation. PMID:21909358
Effects of environmental enrichment on repetitive behaviors in the BTBR T+tf/J mouse model of autism.

PubMed

Reynolds, Stacey; Urruela, Meagan; Devine, Darragh P

2013-10-01

Lower order and higher order repetitive behaviors have been documented in the BTBR T+tf/J (BTBR) mouse strain, a mouse model that exhibits all three core behavioral domains that define autism. The purpose of this study was to evaluate the effectiveness of environmental enrichment for reducing repetitive behaviors in BTBR mice. Lower order behaviors were captured by assaying the time and sequence of grooming, while higher order behaviors were measured using pattern analysis of an object exploration task from digital recordings. Baseline scores were established at 7 weeks of age, followed by 30 days of housing in either a standard or enriched cage. As expected, BTBR mice spent significantly more time grooming and had a more rigid grooming sequence than control C57BL/6J mice did at baseline. After 30 days of enrichment housing, BTBR mice demonstrated a significant reduction in time spent grooming, resulting in levels that were lower than those exhibited by BTBR mice in standard housing. However, no changes were noted in the rigidity of their grooming sequence. In contrast to previous findings, there was no difference in repetitive patterns of exploration at baseline between BTBR and C57BL/6J mice in the object exploration test. Subsequently, enrichment did not significantly alter the number of repetitive patterns at posttest. Overall, the results suggest that environmental enrichment may be beneficial for reducing the time spent engaging in lower order repetitive behaviors, but may not change the overall quality of the behaviors when they do manifest. © 2013 International Society for Autism Research, Wiley Periodicals, Inc.
ElemeNT: a computational tool for detecting core promoter elements.

PubMed

Sloutskin, Anna; Danino, Yehuda M; Orenstein, Yaron; Zehavi, Yonathan; Doniger, Tirza; Shamir, Ron; Juven-Gershon, Tamar

2015-01-01

Core promoter elements play a pivotal role in the transcriptional output, yet they are often detected manually within sequences of interest. Here, we present 2 contributions to the detection and curation of core promoter elements within given sequences. First, the Elements Navigation Tool (ElemeNT) is a user-friendly web-based, interactive tool for prediction and display of putative core promoter elements and their biologically-relevant combinations. Second, the CORE database summarizes ElemeNT-predicted core promoter elements near CAGE and RNA-seq-defined Drosophila melanogaster transcription start sites (TSSs). ElemeNT's predictions are based on biologically-functional core promoter elements, and can be used to infer core promoter compositions. ElemeNT does not assume prior knowledge of the actual TSS position, and can therefore assist in annotation of any given sequence. These resources, freely accessible at http://lifefaculty.biu.ac.il/gershon-tamar/index.php/resources, facilitate the identification of core promoter elements as active contributors to gene expression.
A High-Density Genetic Map with Array-Based Markers Facilitates Structural and Quantitative Trait Locus Analyses of the Common Wheat Genome

PubMed Central

Iehisa, Julio Cesar Masaru; Ohno, Ryoko; Kimura, Tatsuro; Enoki, Hiroyuki; Nishimura, Satoru; Okamoto, Yuki; Nasuda, Shuhei; Takumi, Shigeo

2014-01-01

The large genome and allohexaploidy of common wheat have complicated construction of a high-density genetic map. Although improvements in the throughput of next-generation sequencing (NGS) technologies have made it possible to obtain a large amount of genotyping data for an entire mapping population by direct sequencing, including hexaploid wheat, a significant number of missing data points are often apparent due to the low coverage of sequencing. In the present study, a microarray-based polymorphism detection system was developed using NGS data obtained from complexity-reduced genomic DNA of two common wheat cultivars, Chinese Spring (CS) and Mironovskaya 808. After design and selection of polymorphic probes, 13,056 new markers were added to the linkage map of a recombinant inbred mapping population between CS and Mironovskaya 808. On average, 2.49 missing data points per marker were observed in the 201 recombinant inbred lines, with a maximum of 42. Around 40% of the new markers were derived from genic regions and 11% from repetitive regions. The low number of retroelements indicated that the new polymorphic markers were mainly derived from the less repetitive region of the wheat genome. Around 25% of the mapped sequences were useful for alignment with the physical map of barley. Quantitative trait locus (QTL) analyses of 14 agronomically important traits related to flowering, spikes, and seeds demonstrated that the new high-density map showed improved QTL detection, resolution, and accuracy over the original simple sequence repeat map. PMID:24972598
A high-density genetic map with array-based markers facilitates structural and quantitative trait locus analyses of the common wheat genome.

PubMed

Iehisa, Julio Cesar Masaru; Ohno, Ryoko; Kimura, Tatsuro; Enoki, Hiroyuki; Nishimura, Satoru; Okamoto, Yuki; Nasuda, Shuhei; Takumi, Shigeo

2014-10-01

The large genome and allohexaploidy of common wheat have complicated construction of a high-density genetic map. Although improvements in the throughput of next-generation sequencing (NGS) technologies have made it possible to obtain a large amount of genotyping data for an entire mapping population by direct sequencing, including hexaploid wheat, a significant number of missing data points are often apparent due to the low coverage of sequencing. In the present study, a microarray-based polymorphism detection system was developed using NGS data obtained from complexity-reduced genomic DNA of two common wheat cultivars, Chinese Spring (CS) and Mironovskaya 808. After design and selection of polymorphic probes, 13,056 new markers were added to the linkage map of a recombinant inbred mapping population between CS and Mironovskaya 808. On average, 2.49 missing data points per marker were observed in the 201 recombinant inbred lines, with a maximum of 42. Around 40% of the new markers were derived from genic regions and 11% from repetitive regions. The low number of retroelements indicated that the new polymorphic markers were mainly derived from the less repetitive region of the wheat genome. Around 25% of the mapped sequences were useful for alignment with the physical map of barley. Quantitative trait locus (QTL) analyses of 14 agronomically important traits related to flowering, spikes, and seeds demonstrated that the new high-density map showed improved QTL detection, resolution, and accuracy over the original simple sequence repeat map. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Identification of a high frequency transposon induced by tissue culture, nDaiZ, a member of the hAT family in rice.

PubMed

Huang, Jian; Zhang, Kewei; Shen, Yi; Huang, Zejun; Li, Ming; Tang, Ding; Gu, Minghong; Cheng, Zhukuan

2009-03-01

Recent completion of rice genome sequencing has revealed that more than 40% of its genome consists of repetitive sequences, and most of them are related to inactive transposable elements. In the present study, a transposable element, nDaiZ0, which is induced by tissue culture with high frequency, was identified by sequence analysis of an allelic line of the golden hull and internode 2 (gh2) mutant, in which it had integrated into the fourth exon of GH2. The 528-bp nDaiZ0 has 14-bp terminal inverted repeats (TIRs), and generates an 8-bp duplication of its target sites (TSD) during its mobilization. nDaiZs are non-autonomous transposons and have no coding capacity. Bioinformatics analysis and Southern blot hybridization showed that at least 16 copies of nDaiZ elements exist in the japonica cultivar Nipponbare genome and 11 copies in the indica cultivar 93-11 genome. During tissue culture, only one copy, nDaiZ9, located on chromosome 5 in the genome of Nipponbare can be activated with its transposable frequency reaching 30%. However, nDaiZ9 was not present in the 93-11 genome. The larger elements, DaiZs, were further identified by database searching using nDaiZ0 as a query because they share similar TIRs and subterminal sequences. DaiZ can also generate an 8-bp TSD. DaiZ elements contain a conserved region with a high similarity to the hAT dimerization motif, suggesting that the nDaiZ-DaiZ transposon system probably belongs to the hAT superfamily of class II transposons. Phylogenetic analysis indicated that it is a new type of plant hAT-like transposon. Although nDaiZ is activated by tissue culture, the high transposable frequency indicates that it could become a useful gene tagging system for rice functional genomic studies. In addition, the mechanism of the high transposable ability of nDaiZ9 is discussed.
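The two structural hallmarks mentioned in the abstract above, 14-bp terminal inverted repeats and an 8-bp target site duplication, can be illustrated with a short Python sketch. It is not the authors' method: it checks only for perfect TIRs, whereas real TIRs may be imperfect, and the function names and toy sequences are hypothetical rather than nDaiZ data.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def has_perfect_tir(element: str, tir_len: int = 14) -> bool:
    """True if the last tir_len bases are the reverse complement of the first tir_len."""
    return element[-tir_len:] == reverse_complement(element[:tir_len])


def has_tsd(left_flank: str, right_flank: str, tsd_len: int = 8) -> bool:
    """True if the tsd_len bases directly flanking the element on each side are identical."""
    return left_flank[-tsd_len:] == right_flank[:tsd_len]


# Toy sequences, made up for illustration only.
toy_tir = "CAGTGTTACCGGAT"                        # 14 bp
toy_element = toy_tir + "TTTTGGGGCCCCAAAA" + reverse_complement(toy_tir)
toy_tsd = "ATGCTTGA"                              # 8 bp

tir_ok = has_perfect_tir(toy_element)
tsd_ok = has_tsd("GGG" + toy_tsd, toy_tsd + "CCC")
print(tir_ok, tsd_ok)
```

A practical search would tolerate mismatches in the TIRs, for example by counting differing positions against a threshold.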
Pixel-based characterisation of CMOS high-speed camera systems

NASA Astrophysics Data System (ADS)

Weber, V.; Brübach, J.; Gordon, R. L.; Dreizler, A.

2011-05-01

Quantifying high-repetition rate laser diagnostic techniques for measuring scalars in turbulent combustion relies on a complete description of the relationship between detected photons and the signal produced by the detector. CMOS-chip based cameras are becoming an accepted tool for capturing high frame rate cinematographic sequences for laser-based techniques such as Particle Image Velocimetry (PIV) and Planar Laser Induced Fluorescence (PLIF) and can be used with thermographic phosphors to determine surface temperatures. At low repetition rates, imaging techniques have benefitted from significant developments in the quality of CCD-based camera systems, particularly with the uniformity of pixel response and minimal non-linearities in the photon-to-signal conversion. The state of the art in CMOS technology displays a significant number of technical aspects that must be accounted for before these detectors can be used for quantitative diagnostics. This paper addresses these issues.
The effects of rest interval length manipulation of the first upper-body resistance exercise in sequence on acute performance of subsequent exercises in men and women.

PubMed

Ratamess, Nicholas A; Chiarello, Christina M; Sacco, Anthony J; Hoffman, Jay R; Faigenbaum, Avery D; Ross, Ryan E; Kang, Jie

2012-11-01

The purpose of the present study was to investigate the effects of manipulating rest interval (RI) length of the first upper-body exercise in sequence on subsequent resistance exercise performance. Twenty-two men and women with at least 1 year of resistance training experience performed resistance exercise protocols on 3 occasions in random order. Each protocol consisted of performing 4 barbell upper-body exercises in the same sequence (bench press, incline bench press, shoulder press, and bent-over row) for 3 sets of up to 10 repetitions with 75% of 1 repetition maximum. Bench press RIs were 1, 2, or 3 minutes, whereas other exercises were performed with a standard 2-minute rest interval. The number of repetitions completed, average power, and velocity for each set of each exercise were recorded. Gender differences were observed during the bench press and incline press as women performed significantly (p ≤ 0.05) more repetitions than men during all RIs. The magnitude of decline in velocity and power over 3 sets of the bench press and incline press was significantly higher in men than women. Manipulation of RI length during the bench press did not affect performance of the remaining exercises in men. However, significantly more repetitions were performed by women during the first set of the incline press using 3-minute rest interval than 1-minute rest interval. In men and women, performance of the incline press and shoulder press was compromised compared with baseline performances. Manipulation of RI length of the first exercise affected performance of only the first set of 1 subsequent exercise in women. All RIs led to comparable levels of fatigue in men, indicating that reductions in load are necessary for subsequent exercises performed in sequence that stress similar agonist muscle groups when 10 repetitions are desired.
Whole-comparative genomic hybridization in domestic sheep (Ovis aries) breeds.

PubMed

Dávila-Rodríguez, M I; Cortés-Gutiérrez, E I; López-Fernández, C; Pita, M; Mezzanotte, R; Gosálvez, J

2009-01-01

Whole-comparative genomic hybridization (W-CGH) allows identification of chromosomal polymorphisms related to highly repetitive DNA sequences localized in constitutive heterochromatin. Such polymorphisms are detected establishing competition between genomic DNAs in an in situ hybridization environment without subtraction of highly repetitive DNA sequences, when comparing two species from closely related taxa (same species, sub-species, or breeds) or somewhat related taxa. This experimental approach was applied to investigating differences in highly repetitive sequences of three sheep breeds (Castellana, Ojalada, and Assaf). To this end, W-CGH was carried out using mouflon (sheep ancestor) chromosomes as a common target to co-hybridize equimolar quantities of two genomic DNAs obtained from either Castellana, Ojalada or Assaf sheep breeds. The results showed that the amount of constitutive heterochromatin is greater in all pericentromeric heterochromatin regions of acrocentric chromosomes than in metacentric or sex chromosomes. Additionally, when W-CGH was performed using DNAs from the Iberian breeds Castellana and Ojalada, chromosomal pericentromeric regions revealed quantitatively and qualitatively a presence of DNA families similar to that obtained from any of the above-cited breeds. On the contrary, when the DNA used in W-CGH experiments was obtained from Assaf, as compared to either Castellana or Ojalada, two different pericentromeric DNA families of highly repetitive sequences could be detected. Lastly, sex chromosomes were shown to be homogeneous among all breeds and thus revealed no detectable constitutive heterochromatin. W-CGH results were confirmed using DNA breakage detection-FISH experiments (DBD-FISH) carried out on lymphocytes. As a whole, the results showed that two different repetitive DNA families are present in the pericentromeric heterochromatin of the sheep breeds studied here. Additionally, they suggest a differential presence of these distinct repetitive DNA families in Castellana and Ojalada breeds as compared to the Assaf breed. Finally, the results of W-CGH after using mouflon as the targeted chromosomes also show that the two DNA families are present in the ancestor. Copyright 2009 S. Karger AG, Basel.
First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Traditional Medicinal Plant

PubMed Central

Liu, Di; Zeng, Shao-Hua; Chen, Jian-Jun; Zhang, Yan-Jun; Xiao, Gong; Zhu, Lin-Yao; Wang, Ying

2013-01-01

Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant. PMID:23807511
Accuracy and Reproducibility of Adipose Tissue Measurements in Young Infants by Whole Body Magnetic Resonance Imaging

PubMed Central

Bauer, Jan Stefan; Noël, Peter Benjamin; Vollhardt, Christiane; Much, Daniela; Degirmenci, Saliha; Brunner, Stefanie; Rummeny, Ernst Josef; Hauner, Hans

2015-01-01

Purpose MR might be well suited to obtain reproducible and accurate measures of fat tissues in infants. This study evaluates MR-measurements of adipose tissue in young infants in vitro and in vivo. Material and Methods MR images of ten phantoms simulating subcutaneous fat of an infant’s torso were obtained using a 1.5T MR scanner with and without simulated breathing. Scans consisted of a cartesian water-suppression turbo spin echo (wsTSE) sequence, and a PROPELLER wsTSE sequence. Fat volume was quantified directly and by MR imaging using k-means clustering and threshold-based segmentation procedures to calculate accuracy in vitro. Whole body MR was obtained in sleeping young infants (average age 67±30 days). This study was approved by the local review board. All parents gave written informed consent. To obtain reproducibility in vivo, cartesian and PROPELLER wsTSE sequences were repeated in seven and four young infants, respectively. Overall, 21 repetitions were performed for the cartesian sequence and 13 repetitions for the PROPELLER sequence. Results In vitro accuracy errors depended on the chosen segmentation procedure, ranging from 5.4% to 76%, while the sequence showed no significant influence. Artificial breathing increased the minimal accuracy error to 9.1%. In vivo reproducibility errors for total fat volume of the sleeping infants ranged from 2.6% to 3.4%. Neither segmentation nor sequence significantly influenced reproducibility. Conclusion With both cartesian and PROPELLER sequences an accurate and reproducible measure of body fat was achieved. Adequate segmentation was mandatory for high accuracy. PMID:25706876
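The abstract above names k-means clustering as one of the segmentation procedures used to quantify fat volume. The Python sketch below illustrates the general idea on a one-dimensional list of voxel intensities. It is not the authors' pipeline: it assumes that fat voxels are the bright class in water-suppressed images, and the function names, intensity values and voxel volume are all hypothetical.

```python
def kmeans_1d(values, k=2, iters=20):
    """Plain 1D k-means; centers start evenly spaced between min and max."""
    lo, hi = min(values), max(values)
    centers = [lo + (hi - lo) * i / (k - 1) for i in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda j: abs(v - centers[j]))
            clusters[nearest].append(v)
        # Keep the old center if a cluster ends up empty.
        centers = [sum(c) / len(c) if c else centers[j]
                   for j, c in enumerate(clusters)]
    return centers


def fat_volume_ml(intensities, voxel_volume_ml):
    """Count voxels above the midpoint of the two cluster centers (assumed fat)."""
    dark, bright = sorted(kmeans_1d(intensities, k=2))
    threshold = (dark + bright) / 2
    n_fat = sum(1 for v in intensities if v > threshold)
    return n_fat * voxel_volume_ml


# Toy intensities (arbitrary units) and a made-up voxel volume.
toy = [10, 12, 11, 200, 210, 190, 9, 205]
volume = fat_volume_ml(toy, voxel_volume_ml=0.5)
print(volume)
```

With two clusters, the midpoint rule makes this equivalent to a data-driven threshold, which is one way to see why the choice of segmentation procedure can dominate the accuracy error.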
Accuracy and reproducibility of adipose tissue measurements in young infants by whole body magnetic resonance imaging.

PubMed

Bauer, Jan Stefan; Noël, Peter Benjamin; Vollhardt, Christiane; Much, Daniela; Degirmenci, Saliha; Brunner, Stefanie; Rummeny, Ernst Josef; Hauner, Hans

2015-01-01

MR might be well suited to obtain reproducible and accurate measures of fat tissues in infants. This study evaluates MR-measurements of adipose tissue in young infants in vitro and in vivo. MR images of ten phantoms simulating subcutaneous fat of an infant's torso were obtained using a 1.5T MR scanner with and without simulated breathing. Scans consisted of a cartesian water-suppression turbo spin echo (wsTSE) sequence, and a PROPELLER wsTSE sequence. Fat volume was quantified directly and by MR imaging using k-means clustering and threshold-based segmentation procedures to calculate accuracy in vitro. Whole body MR was obtained in sleeping young infants (average age 67±30 days). This study was approved by the local review board. All parents gave written informed consent. To obtain reproducibility in vivo, cartesian and PROPELLER wsTSE sequences were repeated in seven and four young infants, respectively. Overall, 21 repetitions were performed for the cartesian sequence and 13 repetitions for the PROPELLER sequence. In vitro accuracy errors depended on the chosen segmentation procedure, ranging from 5.4% to 76%, while the sequence showed no significant influence. Artificial breathing increased the minimal accuracy error to 9.1%. In vivo reproducibility errors for total fat volume of the sleeping infants ranged from 2.6% to 3.4%. Neither segmentation nor sequence significantly influenced reproducibility. With both cartesian and PROPELLER sequences an accurate and reproducible measure of body fat was achieved. Adequate segmentation was mandatory for high accuracy.
Functional noncoding sequences derived from SINEs in the mammalian genome.

PubMed

Nishihara, Hidenori; Smit, Arian F A; Okada, Norihiro

2006-07-01

Recent comparative analyses of mammalian sequences have revealed that a large number of nonprotein-coding genomic regions are under strong selective constraint. Here, we report that some of these loci have been derived from a newly defined family of ancient SINEs (short interspersed repetitive elements). This is a surprising result, as SINEs and other transposable elements are commonly thought to be genomic parasites. We named the ancient SINE family AmnSINE1, for Amniota SINE1, because we found it to be present in mammals as well as in birds, and some copies predate the mammalian-bird split 310 million years ago (Mya). AmnSINE1 has a chimeric structure of a 5S rRNA and a tRNA-derived SINE, and is related to five tRNA-derived SINE families that we characterized here in the coelacanth, dogfish shark, hagfish, and amphioxus genomes. All of the newly described SINE families have a common central domain that is also shared by zebrafish SINE3, and we collectively name them the DeuSINE (Deuterostomia SINE) superfamily. Notably, of the approximately 1000 still identifiable copies of AmnSINE1 in the human genome, 105 correspond to loci phylogenetically highly conserved among mammalian orthologs. The conservation is strongest over the central domain. Thus, AmnSINE1 appears to be the best example of a transposable element of which a significant fraction of the copies have acquired genomic functionality.
Human intron-encoded Alu RNAs are processed and packaged into Wdr79-associated nucleoplasmic box H/ACA RNPs

PubMed Central

Jády, Beáta E.; Ketele, Amandine; Kiss, Tamás

2012-01-01

Alu repetitive sequences are the most abundant short interspersed DNA elements in the human genome. Full-length Alu elements are composed of two tandem sequence monomers, the left and right Alu arms, both derived from the 7SL signal recognition particle RNA. Since Alu elements are common in protein-coding genes, they are frequently transcribed into pre-mRNAs. Here, we demonstrate that the right arms of nascent Alu transcripts synthesized within pre-mRNA introns are processed into metabolically stable small RNAs. The intron-encoded Alu RNAs, termed AluACA RNAs, are structurally highly reminiscent of box H/ACA small Cajal body (CB) RNAs (scaRNAs). They are composed of two hairpin units followed by the essential H (AnAnnA) and ACA box motifs. The mature AluACA RNAs associate with the four H/ACA core proteins: dyskerin, Nop10, Nhp2, and Gar1. Moreover, the 3′ hairpin of AluACA RNAs carries two closely spaced CB localization motifs, CAB boxes (UGAG), which bind Wdr79 in a cumulative fashion. In contrast to canonical H/ACA scaRNPs, which concentrate in CBs, the AluACA RNPs accumulate in the nucleoplasm. Identification of 348 human AluACA RNAs demonstrates that intron-encoded AluACA RNAs represent a novel, large subgroup of H/ACA RNAs, which are apparently confined to human or primate cells. PMID:22892240
Automated design of genomic Southern blot probes

PubMed Central

2010-01-01

Background Southern blotting is a DNA analysis technique that has found widespread application in molecular biology. It has been used for gene discovery and mapping and has diagnostic and forensic applications, including mutation detection in patient samples and DNA fingerprinting in criminal investigations. Southern blotting has been employed as the definitive method for detecting transgene integration, and successful homologous recombination in gene targeting experiments. The technique employs a labeled DNA probe to detect a specific DNA sequence in a complex DNA sample that has been separated by restriction-digest and gel electrophoresis. Critically, for the technique to succeed, the probe must be unique to the target locus so as not to cross-hybridize to other endogenous DNA within the sample. Investigators routinely employ a manual approach to probe design. A genome browser is used to extract DNA sequence from the locus of interest, which is searched against the target genome using a BLAST-like tool. Ideally a single perfect match is obtained to the target, with little cross-reactivity caused by homologous DNA sequence present in the genome and/or repetitive and low-complexity elements in the candidate probe. This is a labor-intensive process often requiring several attempts to find a suitable probe for laboratory testing. Results We have written an informatic pipeline to automatically design genomic Southern blot probes that specifically attempts to optimize the resultant probe, employing a brute-force strategy of generating many candidate probes of acceptable length in the user-specified design window, searching all against the target genome, then scoring and ranking the candidates by uniqueness and repetitive DNA element content. Using these in silico measures we can automatically design probes that we predict to perform as well, or better, than our previous manual designs, while considerably reducing design time.
We went on to experimentally validate a number of these automated designs by Southern blotting. The majority of probes we tested performed well confirming our in silico prediction methodology and the general usefulness of the software for automated genomic Southern probe design. Conclusions Software and supplementary information are freely available at: http://www.genes2cognition.org/software/southern_blot PMID:20113467
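The brute-force strategy described in this abstract (enumerate candidate probes in a design window, search each against the genome, rank by uniqueness and repeat content) can be sketched roughly as follows. This is a minimal illustration and not the authors' pipeline: the function names are hypothetical, exact substring counting stands in for the BLAST-like search, and lowercase (soft-masked) bases are assumed to mark repetitive sequence.

```python
def candidate_probes(window, min_len, max_len, step=1):
    """Yield (start, sequence) for every candidate probe in the design window."""
    for length in range(min_len, max_len + 1):
        for start in range(0, len(window) - length + 1, step):
            yield start, window[start:start + length]


def count_hits(probe, genome):
    """Count exact occurrences of probe in genome, overlaps included.

    Assumption: a stand-in for a BLAST-like search, which would also
    report inexact matches.
    """
    hits, pos = 0, genome.find(probe)
    while pos != -1:
        hits += 1
        pos = genome.find(probe, pos + 1)
    return hits


def repeat_fraction(probe):
    """Fraction of lowercase bases (assumed to be soft-masked repeats)."""
    return sum(base.islower() for base in probe) / len(probe)


def rank_probes(window, genome, min_len, max_len):
    """Rank candidates: fewest genome hits first, then lowest repeat content."""
    genome_upper = genome.upper()
    scored = []
    for start, probe in candidate_probes(window, min_len, max_len):
        hits = count_hits(probe.upper(), genome_upper)
        scored.append((hits, repeat_fraction(probe), start, probe))
    return sorted(scored)
```

Sorting on the tuple (hits, repeat fraction) is one simple way to combine the two criteria; a weighted score would be an equally plausible choice.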
Analysis of magnesium and copper in aluminum alloys with high repetition rate laser-ablation spark-induced breakdown spectroscopy

NASA Astrophysics Data System (ADS)

He, Xiaoyong; Dong, Bo; Chen, Yuqi; Li, Runhua; Wang, Fujuan; Li, Jiaoyang; Cai, Zhigang

2018-03-01

In order to improve the analytical speed and performance of laser-ablation based atomic emission spectroscopy, high repetition rate laser-ablation spark-induced breakdown spectroscopy (HRR LA-SIBS) was first developed. Magnesium and copper in aluminum alloys were analyzed with this technique. In the experiments, the fundamental output of an acousto-optically Q-switched Nd:YAG laser operated at 1 kHz repetition rate with low pulse energy and 120 ns pulse width was used to ablate the samples and the plasma emission was enhanced by spark discharge. The spectra were recorded with a compact fiber spectrometer with non-intensified charge-coupled device in non-gating mode. Different parameters related to analytical performance, such as capacitance, voltage, and laser pulse energy, were optimized. Under current experimental conditions, calibration curves of magnesium and copper in aluminum alloys were built and their limits of detection were determined to be 14.0 and 9.9 ppm by HRR LA-SIBS, respectively, which were 8-12-fold better than those achieved by HRR LA under similar experimental conditions without spark discharge. The analytical sensitivities are close to those obtained with conventional LIBS but with improved analytical speed as well as the possibility of using a compact fiber spectrometer. Under high repetition rate operation, the noise level can be decreased and the analytical reproducibility can be clearly improved by averaging multiple measurements within a short time. High repetition rate operation of laser-ablation spark-induced breakdown spectroscopy is very helpful for improving analytical speed. It may find applications in fast elemental analysis, especially fast two-dimensional elemental mapping of solid samples.
Molecular identification and characterization of clustered regularly interspaced short palindromic repeats (CRISPRs) in a urease-positive thermophilic Campylobacter sp. (UPTC).

PubMed

Tasaki, E; Hirayama, J; Tazumi, A; Hayashi, K; Hara, Y; Ueno, H; Moore, J E; Millar, B C; Matsuda, M

2012-02-01

Novel clustered regularly-interspaced short palindromic repeats (CRISPRs) locus [7,500 base pairs (bp) in length] occurred in the urease-positive thermophilic Campylobacter (UPTC) Japanese isolate, CF89-12. The 7,500 bp gene loci consisted of the 5'-methylaminomethyl-2-thiouridylate methyltransferase gene, putative (P) CRISPR associated (p-Cas), putative open reading frames, Cas1 and Cas2, leader sequence region (146 bp), 12 CRISPRs consensus sequence repeats (each 36 bp) separated by a non-repetitive unique spacer region of similar length (26-31 bp) and the phosphatidyl glycerophosphatase A gene. When the CRISPRs loci in the UPTC CF89-12 and five C. jejuni isolates were compared with one another, these six isolates contained p-Cas, Cas1 and Cas2 within the loci. Four to 12 CRISPRs consensus sequence repeats separated by a non-repetitive unique spacer region occurred in six isolates and the nucleotide sequences of those repeats gave approximately 92-100% similarity with each other. However, no sequence similarity occurred in the unique spacer regions among these isolates. The putative σ(70) transcriptional promoter and the hypothetical ρ-independent terminator structures for the CRISPRs and Cas were detected. No in vivo transcription of p-Cas, Cas1 and Cas2 was confirmed in the UPTC cells.
Kangaroo – A pattern-matching program for biological sequences

PubMed Central

2002-01-01

Background Biologists are often interested in performing a simple database search to identify proteins or genes that contain a well-defined sequence pattern. Many databases do not provide straightforward or readily available query tools to perform simple searches, such as identifying transcription binding sites, protein motifs, or repetitive DNA sequences. However, in many cases simple pattern-matching searches can reveal a wealth of information. We present in this paper a regular expression pattern-matching tool that was used to identify short repetitive DNA sequences in human coding regions for the purpose of identifying potential mutation sites in mismatch repair deficient cells. Results Kangaroo is a web-based regular expression pattern-matching program that can search for patterns in DNA, protein, or coding region sequences in ten different organisms. The program is implemented to facilitate a wide range of queries with no restriction on the length or complexity of the query expression. The program is accessible on the web at http://bioinfo.mshri.on.ca/kangaroo/ and the source code is freely distributed at http://sourceforge.net/projects/slritools/. Conclusion A low-level simple pattern-matching application can prove to be a useful tool in many research settings. For example, Kangaroo was used to identify potential genetic targets in a human colorectal cancer variant that is characterized by a high frequency of mutations in coding regions containing mononucleotide repeats. PMID:12150718
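The kind of query this abstract describes, finding mononucleotide repeats in coding sequences with a regular expression, can be illustrated with a short sketch. It uses Python's standard `re` module rather than Kangaroo itself; the function name and the default run length of 8 are assumptions chosen for illustration only.

```python
import re


def mononucleotide_runs(sequence, min_length=8):
    """Return (start, base, run_length) for each run of one repeated base.

    The backreference \\1 requires every following base to equal the first,
    so the pattern matches A{n,}, C{n,}, G{n,} or T{n,} in a single pass.
    """
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_length - 1))
    return [
        (match.start(), match.group(1), len(match.group(0)))
        for match in pattern.finditer(sequence.upper())
    ]
```

For example, calling `mononucleotide_runs(cds, 8)` on a coding sequence `cds` would list every homopolymer tract of at least eight bases, the sort of site the abstract associates with mutations in mismatch repair deficient cells.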

Identification of novel MITEs (miniature inverted-repeat transposable elements) in Coxiella burnetii: implications for protein and small RNA evolution.

PubMed

Wachter, Shaun; Raghavan, Rahul; Wachter, Jenny; Minnick, Michael F

2018-04-11

Coxiella burnetii is a Gram-negative gammaproteobacterium and zoonotic agent of Q fever. C. burnetii's genome contains an abundance of pseudogenes and numerous selfish genetic elements. MITEs (miniature inverted-repeat transposable elements) are non-autonomous transposons that occur in all domains of life and are thought to be insertion sequences (ISs) that have lost their transposase function. Like most transposable elements (TEs), MITEs are thought to play an active role in evolution by altering gene function and expression through insertion and deletion activities. However, information regarding bacterial MITEs is limited. We describe two MITE families discovered during research on small non-coding RNAs (sRNAs) of C. burnetii. Two sRNAs, Cbsr3 and Cbsr13, were found to originate from a novel MITE family, termed QMITE1. Another sRNA, CbsR16, was found to originate from a separate and novel MITE family, termed QMITE2. Members of each family occur ~ 50 times within the strains evaluated. QMITE1 is a typical MITE of 300-400 bp with short (2-3 nt) direct repeats (DRs) of variable sequence and is often found overlapping annotated open reading frames (ORFs). Additionally, QMITE1 elements possess sigma-70 promoters and are transcriptionally active at several loci, potentially influencing expression of nearby genes. QMITE2 is smaller (150-190 bps), but has longer (7-11 nt) DRs of variable sequences and is mainly found in the 3' untranslated region of annotated ORFs and intergenic regions. QMITE2 contains a GTAG repetitive extragenic palindrome (REP) that serves as a target for IS1111 TE insertion. Both QMITE1 and QMITE2 display inter-strain linkage and sequence conservation, suggesting that they are adaptive and existed before divergence of C. burnetii strains. We have discovered two novel MITE families of C. burnetii. Our finding that MITEs serve as a source for sRNAs is novel. 
QMITE2 has a unique structure and occurs in large or small versions with unique DRs that display linkage and sequence conservation between strains, allowing for tracking of genomic rearrangements. QMITE1 and QMITE2 copies are hypothesized to influence expression of neighboring genes involved in DNA repair and virulence through transcriptional interference and ribonuclease processing.
Methylation guide RNA evolution in archaea: structure, function and genomic organization of 110 C/D box sRNA families across six Pyrobaculum species.

PubMed

Lui, Lauren M; Uzilov, Andrew V; Bernick, David L; Corredor, Andrea; Lowe, Todd M; Dennis, Patrick P

2018-05-16

Archaeal homologs of eukaryotic C/D box small nucleolar RNAs (C/D box sRNAs) guide precise 2'-O-methyl modification of ribosomal and transfer RNAs. Although C/D box sRNA genes constitute one of the largest RNA gene families in archaeal thermophiles, most genomes have incomplete sRNA gene annotation because reliable, fully automated detection methods are not available. We expanded and curated a comprehensive gene set across six species of the crenarchaeal genus Pyrobaculum, particularly rich in C/D box sRNA genes. Using high-throughput small RNA sequencing, specialized computational searches and comparative genomics, we analyzed 526 Pyrobaculum C/D box sRNAs, organizing them into 110 families based on synteny and conservation of guide sequences which determine methylation targets. We examined gene duplications and rearrangements, including one family that has expanded in a pattern similar to retrotransposed repetitive elements in eukaryotes. New training data and inclusion of kink-turn secondary structural features enabled creation of an improved search model. Our analyses provide the most comprehensive, dynamic view of C/D box sRNA evolutionary history within a genus, in terms of modification function, feature plasticity, and gene mobility.
Nothing in Evolution Makes Sense Except in the Light of Genomics: Read-Write Genome Evolution as an Active Biological Process.

PubMed

Shapiro, James A

2016-06-08

The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess "Read-Write Genomes" they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification.
Nothing in Evolution Makes Sense Except in the Light of Genomics: Read–Write Genome Evolution as an Active Biological Process

PubMed Central

Shapiro, James A.

2016-01-01

The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess “Read–Write Genomes” they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification. PMID:27338490
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.

PubMed

Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
The Reliability and Validity of Tasks Measuring Perception of Rapid Sequences in Children with Dyslexia

ERIC Educational Resources Information Center

Heath, Steve M.; Hogben, John H.

2004-01-01

Background: Claims that children with reading and oral language deficits have impaired perception of sequential sounds are usually based on psychophysical measures of auditory temporal processing (ATP) designed to characterise group performance. If we are to use these measures (e.g., the Tallal, 1980, Repetition Test) as the basis for intervention…
Note: All solid-state high repetitive sub-nanosecond risetime pulse generator based on bulk gallium arsenide avalanche semiconductor switches.

PubMed

Hu, Long; Su, Jiancang; Ding, Zhenjie; Hao, Qingsong; Fan, Yajun; Liu, Chunliang

2016-08-01

An all solid-state high repetitive sub-nanosecond risetime pulse generator featuring low-energy-triggered bulk gallium arsenide (GaAs) avalanche semiconductor switches and a step-type transmission line is presented. The step-type transmission line with two stages is charged to a potential of 5.0 kV also biasing at the switches. The bulk GaAs avalanche semiconductor switch closes within sub-nanosecond range when illuminated with approximately 87 nJ of laser energy at 905 nm in a single pulse. An asymmetric dipolar pulse with peak-to-peak amplitude of 9.6 kV and risetime of 0.65 ns is produced on a resistive load of 50 Ω. A technique that allows for repetition-rate multiplication of pulse trains experimentally demonstrated that the parallel-connected bulk GaAs avalanche semiconductor switches are triggered in sequence. The highest repetition rate is decided by recovery time of the bulk GaAs avalanche semiconductor switch, and the operating result of 100 kHz of the generator is discussed.
Pre-lexical disorders in repetition conduction aphasia.

PubMed

Sidiropoulos, Kyriakos; de Bleser, Ria; Ackermann, Hermann; Preilowski, Bruno

2008-12-01

At the level of clinical speech/language evaluation, the repetition type of conduction aphasia is characterized by repetition difficulties concomitant with reduced short-term memory capacities, in the presence of fluent spontaneous speech as well as unimpaired naming and reading abilities. It is still unsettled which dysfunctions of the pre-lexical processing stage of spoken word recognition contribute to this syndrome and whether there is any relevant top-down impact of the mental lexicon upon the phonetic/phonological level of speech perception. In order to further specify the underlying pathomechanisms, a comprehensive battery of psycholinguistic tests was applied to a patient suffering from repetition conduction aphasia. The obtained results point at a pre-lexical disorder in this subject. To further specify the assumed pre-lexical dysfunction, computer simulations of single-word processing, based upon an interactive activation model (IAM), were conducted. An attenuation of the features-to-phonemes inhibition value was found to simulate the observed profile of psycholinguistic deficits. Conceivably, these pre-lexical disorders interfere with the task-dependent adjustment of the temporal windows of signal analysis, giving rise to compromised sequencing of auditory-verbal information.
Gene-enriched draft genome of the cattle tick Rhipicephalus microplus: Assembly by the hybrid Pacific Biosciences/Illumina approach enabled analysis of the highly repetitive genome

USDA-ARS's Scientific Manuscript database

The genome of the cattle tick R. microplus, an ectoparasite with global distribution, is estimated to be 7.1 Gbp and consists of ~70% repetitive DNA. We report the first assembly of a tick genome that utilized a hybrid sequencing and assembly approach to capture the repetitive fractions of the genom...
DNA sequence analysis of ARS elements from chromosome III of Saccharomyces cerevisiae: identification of a new conserved sequence.

PubMed Central

Palzkill, T G; Oliver, S G; Newlon, C S

1986-01-01

Four fragments of Saccharomyces cerevisiae chromosome III DNA which carry ARS elements have been sequenced. Each fragment contains multiple copies of sequences that have at least 10 out of 11 bases of homology to a previously reported 11 bp core consensus sequence. A survey of these new ARS sequences and previously reported sequences revealed the presence of an additional 11 bp conserved element located on the 3' side of the T-rich strand of the core consensus. Subcloning analysis as well as deletion and transposon insertion mutagenesis of ARS fragments support a role for 3' conserved sequence in promoting ARS activity. PMID:3529036
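The matching criterion in this abstract, at least 10 of 11 bases identical to a core consensus, amounts to a sliding-window comparison with a mismatch allowance. The sketch below is only an illustration: the function name is hypothetical, the consensus is passed in as a plain string of unambiguous bases (handling degenerate positions is not attempted here), and only the given strand is scanned.

```python
def near_matches(sequence, consensus, min_identical=10):
    """Return (position, window, identical_count) for qualifying windows.

    A window qualifies when at least min_identical of its bases agree
    with the consensus at the same offsets.
    """
    k = len(consensus)
    results = []
    for i in range(len(sequence) - k + 1):
        window = sequence[i:i + k]
        identical = sum(a == b for a, b in zip(window, consensus))
        if identical >= min_identical:
            results.append((i, window, identical))
    return results
```

Scanning the reverse complement as well would be needed to find matches on both strands.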
Relative Suffix Trees.

PubMed

Farruggia, Andrea; Gagie, Travis; Navarro, Gonzalo; Puglisi, Simon J; Sirén, Jouni

2018-05-01

Suffix trees are one of the most versatile data structures in stringology, with many applications in bioinformatics. Their main drawback is their size, which can be tens of times larger than the input sequence. Much effort has been put into reducing the space usage, leading ultimately to compressed suffix trees. These compressed data structures can efficiently simulate the suffix tree, while using space proportional to a compressed representation of the sequence. In this work, we take a new approach to compressed suffix trees for repetitive sequence collections, such as collections of individual genomes. We compress the suffix trees of individual sequences relative to the suffix tree of a reference sequence. These relative data structures provide competitive time/space trade-offs, being almost as small as the smallest compressed suffix trees for repetitive collections, and competitive in time with the largest and fastest compressed suffix trees.
Relative Suffix Trees

PubMed Central

Farruggia, Andrea; Gagie, Travis; Navarro, Gonzalo; Puglisi, Simon J; Sirén, Jouni

2018-01-01

Abstract Suffix trees are one of the most versatile data structures in stringology, with many applications in bioinformatics. Their main drawback is their size, which can be tens of times larger than the input sequence. Much effort has been put into reducing the space usage, leading ultimately to compressed suffix trees. These compressed data structures can efficiently simulate the suffix tree, while using space proportional to a compressed representation of the sequence. In this work, we take a new approach to compressed suffix trees for repetitive sequence collections, such as collections of individual genomes. We compress the suffix trees of individual sequences relative to the suffix tree of a reference sequence. These relative data structures provide competitive time/space trade-offs, being almost as small as the smallest compressed suffix trees for repetitive collections, and competitive in time with the largest and fastest compressed suffix trees. PMID:29795706
Kinetic processes determining attainable pulse repetition rate in pulsed metal vapor lasers

NASA Astrophysics Data System (ADS)

Petrash, Gueorgii G.

1998-06-01

A review of the investigations of the main processes determining the attainable pulse repetition rate of elemental metal vapor pulsed gas discharge self-terminating lasers, such as copper vapor laser, gold vapor laser, lead vapor laser, is given. Kinetic processes during an excitation pulse and interpulse period are considered as well as experiments with lasers operating at high repetition rate.
Exact and conceptual repetition dissociate conceptual memory tests: problems for transfer appropriate processing theory.

PubMed

McDermott, K B; Roediger, H L

1996-03-01

Three experiments examined whether a conceptual implicit memory test (specifically, category instance generation) would exhibit repetition effects similar to those found in free recall. The transfer appropriate processing account of dissociations among memory tests led us to predict that the tests would show parallel effects; this prediction was based upon the theory's assumption that conceptual tests will behave similarly as a function of various independent variables. In Experiment 1, conceptual repetition (i.e., following a target word [e.g., puzzles] with an associate [e.g., jigsaw]) did not enhance priming on the instance generation test relative to the condition of simply presenting the target word once, although this manipulation did affect free recall. In Experiment 2, conceptual repetition was achieved by following a picture with its corresponding word (or vice versa). In this case, there was an effect of conceptual repetition on free recall but no reliable effect on category instance generation or category cued recall. In addition, we obtained a picture superiority effect in free recall but not in category instance generation. In the third experiment, when the same study sequence was used as in Experiment 1, but with instructions that encouraged relational processing, priming on the category instance generation task was enhanced by conceptual repetition. Results demonstrate that conceptual memory tests can be dissociated and present problems for Roediger's (1990) transfer appropriate processing account of dissociations between explicit and implicit tests.
B chromosome dynamics in Prochilodus costatus (Teleostei, Characiformes) and comparisons with supernumerary chromosome system in other Prochilodus species

PubMed Central

Melo, Silvana; Utsunomia, Ricardo; Penitente, Manolo; Sobrinho-Scudeler, Patrícia Elda; Porto-Foresti, Fábio; Oliveira, Claudio; Foresti, Fausto; Dergam, Jorge Abdala

2017-01-01

Abstract Within the genus Prochilodus Agassiz, 1829, five species are known to carry B chromosomes, i.e. chromosomes beyond the usual diploid number that have been traditionally considered as accessory for the genome. Chromosome microdissection and mapping of repetitive DNA sequences are effective tools to assess the DNA content and allow a better understanding about the origin and composition of these elements in an array of species. In this study, a novel characterization of B chromosomes in Prochilodus costatus Valenciennes, 1850 (2n=54) was reported for the first time and their sequence complementarity with the supernumerary chromosomes observed in Prochilodus lineatus (Valenciennes, 1836) and Prochilodus argenteus Agassiz, 1829 was investigated. The hybridization patterns obtained with chromosome painting using the micro B probe of P. costatus and the satDNA SATH1 mapping made it possible to assume homology of sequences between the B chromosomes of these congeneric species. Our results suggest that the origin of B chromosomes in the genus Prochilodus is a phylogenetically old event. PMID:28919971
The combination of direct and paired link graphs can boost repetitive genome assembly

PubMed Central

Shi, Wenyu; Ji, Peifeng

2017-01-01

Abstract Currently, most paired link based scaffolding algorithms intrinsically mask the sequences between two linked contigs and bypass their direct link information embedded in the original de Bruijn assembly graph. This disadvantage substantially complicates the scaffolding process and leads to the inability to resolve repetitive contig assembly. Here we present a novel algorithm, inGAP-sf, for effectively generating high-quality and continuous scaffolds. inGAP-sf achieves this by using a new strategy based on the combination of direct link and paired link graphs, in which direct link is used to increase graph connectivity and to decrease graph complexity and paired link is employed to supervise the traversing process on the direct link graph. This advantage greatly facilitates the assembly of short-repeat enriched regions. Moreover, a new comprehensive decision model is developed to eliminate the noise routes accompanying the introduced direct link. Through extensive evaluations on both simulated and real datasets, we demonstrated that inGAP-sf outperforms most of the genome scaffolding algorithms by generating more accurate and continuous assembly, especially for short repetitive regions. PMID:27924003
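As background for the de Bruijn assembly graph mentioned in this abstract, the following sketch builds a small graph from reads and reports the nodes where the path branches, which is where repeated sequence makes traversal ambiguous. It is not inGAP-sf and does not model direct or paired links; the function names and the toy representation (a dictionary from each (k-1)-mer to its successors) are assumptions for illustration.

```python
from collections import defaultdict


def de_bruijn_graph(reads, k):
    """Map each (k-1)-mer to the sorted (k-1)-mers that follow it in a read."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Each k-mer contributes one edge: prefix -> suffix.
            graph[kmer[:-1]].add(kmer[1:])
    return {node: sorted(successors) for node, successors in graph.items()}


def branching_nodes(graph):
    """Nodes with more than one successor, i.e. ambiguous extension points."""
    return sorted(node for node, successors in graph.items() if len(successors) > 1)
```

In a real assembler, extra evidence such as read pairs is what would be used to choose among the successors of a branching node.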
Draft genome of the gayal, Bos frontalis

PubMed Central

Wang, Ming-Shan; Zeng, Yan; Wang, Xiao; Nie, Wen-Hui; Wang, Jin-Huan; Su, Wei-Ting; Xiong, Zi-Jun; Wang, Sheng; Qu, Kai-Xing; Yan, Shou-Qing; Yang, Min-Min; Wang, Wen; Dong, Yang; Zhang, Ya-Ping

2017-01-01

Abstract Gayal (Bos frontalis), also known as mithan or mithun, is a large endangered semi-domesticated bovine that has a limited geographical distribution in the hill-forests of China, Northeast India, Bangladesh, Myanmar, and Bhutan. Many questions about the gayal such as its origin, population history, and genetic basis of local adaptation remain largely unresolved. De novo sequencing and assembly of the whole gayal genome provides an opportunity to address these issues. We report a high-depth sequencing, de novo assembly, and annotation of a female Chinese gayal genome. Based on the Illumina genomic sequencing platform, we have generated 350.38 Gb of raw data from 16 different insert-size libraries. A total of 276.86 Gb of clean data is retained after quality control. The assembled genome is about 2.85 Gb with scaffold and contig N50 sizes of 2.74 Mb and 14.41 kb, respectively. Repetitive elements account for 48.13% of the genome. Gene annotation has yielded 26 667 protein-coding genes, of which 97.18% have been functionally annotated. BUSCO assessment shows that our assembly captures 93% (3183 of 4104) of the core eukaryotic genes and 83.1% of vertebrate universal single-copy orthologs. We provide the first comprehensive de novo genome of the gayal. This genetic resource is integral for investigating the origin of the gayal and performing comparative genomic studies to improve understanding of the speciation and divergence of bovine species. The assembled genome could be used as reference in future population genetic studies of gayal. PMID:29048483
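The scaffold and contig N50 sizes quoted in this abstract are a standard summary statistic: the length L such that sequences of length L or longer together contain at least half of the assembled bases. A minimal sketch of the calculation follows; the function name is hypothetical, and the input is assumed to be a list of positive sequence lengths.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the bases are covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("lengths must be non-empty")
```

Because N50 depends only on the length distribution, it describes contiguity and says nothing about whether the assembled sequence is correct.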
PROSPECT improves cis-acting regulatory element prediction by integrating expression profile data with consensus pattern searches

PubMed Central

Fujibuchi, Wataru; Anderson, John S. J.; Landsman, David

2001-01-01

Consensus pattern and matrix-based searches designed to predict cis-acting transcriptional regulatory sequences have historically been subject to large numbers of false positives. We sought to decrease false positives by incorporating expression profile data into a consensus pattern-based search method. We have systematically analyzed the expression phenotypes of over 6000 yeast genes, across 121 expression profile experiments, and correlated them with the distribution of 14 known regulatory elements over sequences upstream of the genes. Our method is based on a metric we term probabilistic element assessment (PEA), which is a ranking of potential sites based on sequence similarity in the upstream regions of genes with similar expression phenotypes. For eight of the 14 known elements that we examined, our method had a much higher selectivity than a naïve consensus pattern search. Based on our analysis, we have developed a web-based tool called PROSPECT, which allows consensus pattern-based searching of gene clusters obtained from microarray data. PMID:11574681
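For orientation, the naive consensus pattern search that this record uses as its baseline can be sketched as a scan of upstream sequence for an IUPAC-coded motif. The sketch below is only that baseline, not PROSPECT or the PEA metric; the function name `find_sites`, the motif and the test sequence are hypothetical, and only the given strand is scanned.

```python
import re

# Standard IUPAC nucleotide codes expanded to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def find_sites(consensus, sequence):
    """Return 0-based start positions of every match of an IUPAC consensus.

    A zero-width lookahead is used so that overlapping sites are reported.
    Only the strand given in `sequence` is searched.
    """
    body = "".join(IUPAC[base] for base in consensus.upper())
    lookahead = re.compile("(?=" + body + ")")
    return [match.start() for match in lookahead.finditer(sequence.upper())]


# Hypothetical motif and upstream fragment, for illustration only.
toy_hits = find_sites("TGAYTC", "AATGACTCGGTGATTCA")
```

A search of this kind reports every sequence match regardless of context, which is consistent with the large numbers of false positives the abstract attributes to consensus-only methods.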
Molecular tandem repeat strategy for elucidating mechanical properties of high-strength proteins

PubMed Central

Jung, Huihun; Pena-Francesch, Abdon; Saadat, Alham; Sebastian, Aswathy; Kim, Dong Hwan; Hamilton, Reginald F.; Albert, Istvan; Allen, Benjamin D.; Demirel, Melik C.

2016-01-01

Many globular and structural proteins have repetitions in their sequences or structures. However, a clear relationship between these repeats and their contribution to the mechanical properties remains elusive. We propose a new approach for the design and production of synthetic polypeptides that comprise one or more tandem copies of a single unit with distinct amorphous and ordered regions. Our designed sequences are based on a structural protein produced in squid suction cups that has a segmented copolymer structure with amorphous and crystalline domains. We produced segmented polypeptides with varying repeat number, while keeping the lengths and compositions of the amorphous and crystalline regions fixed. We showed that mechanical properties of these synthetic proteins could be tuned by modulating their molecular weights. Specifically, the toughness and extensibility of synthetic polypeptides increase as a function of the number of tandem repeats. This result suggests that the repetitions in native squid proteins could have a genetic advantage for increased toughness and flexibility. PMID:27222581
Genetic Variation in Cardiomyopathy and Cardiovascular Disorders.

PubMed

McNally, Elizabeth M; Puckelwartz, Megan J

2015-01-01

With the wider deployment of massively-parallel, next-generation sequencing, it is now possible to survey human genome data for research and clinical purposes. The reduced cost of producing short-read sequencing has now shifted the burden to data analysis. Analysis of genome sequencing remains challenged by the complexity of the human genome, including redundancy and the repetitive nature of genome elements and the large amount of variation in individual genomes. Public databases of human genome sequences greatly facilitate interpretation of common and rare genetic variation, although linking database sequence information to detailed clinical information is limited by privacy and practical issues. Genetic variation is a rich source of knowledge for cardiovascular disease because many, if not all, cardiovascular disorders are highly heritable. The role of rare genetic variation in predicting risk and complications of cardiovascular diseases has been well established for hypertrophic and dilated cardiomyopathy, where the number of genes that are linked to these disorders is growing. Bolstered by family data, where genetic variants segregate with disease, rare variation can be linked to specific genetic variation that offers profound diagnostic information. Understanding genetic variation in cardiomyopathy is likely to help stratify forms of heart failure and guide therapy. Ultimately, genetic variation may be amenable to gene correction and gene editing strategies.

Molecular characterization of long direct repeat (LDR) sequences expressing a stable mRNA encoding for a 35-amino-acid cell-killing peptide and a cis-encoded small antisense RNA in Escherichia coli.

PubMed

Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada

2002-07-01

Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not present on any known plasmids but are conserved in Salmonella and other enterobacterial species.
Bov-tA short interspersed nucleotide element sequences in circulating nucleic acids from sera of cattle with bovine spongiform encephalopathy (BSE) and sera of cattle exposed to BSE.

PubMed

Schütz, Ekkehard; Urnovitz, Howard B; Iakoubov, Leonid; Schulz-Schaeffer, Walter; Wemheuer, Wilhelm; Brenig, Bertram

2005-07-01

Circulating nucleic acids (CNA) are known to be enriched in repetitive DNA sequences in humans. Here, bovine sera CNA were analyzed to determine if cell stress-related short interspersed nucleotide elements (SINEs) could be detected in sera from cattle associated with bovine spongiform encephalopathy (BSE). Nucleic acids were extracted, amplified, cloned, and sequenced from the sera of protease-resistant prion protein (PrP(res))-positive cattle (n = 2) and sera from BSE-cohort cows (n = 6); 150 out of 163 clones revealed the presence of, on average, an 80-bp sequence from the 3' region of Bov-tA SINE. A PCR protocol was developed that differentially identified SINE-associated CNA in BSE-exposed versus normal cattle. CNA were extracted from a serum vesicular fraction after controlled blood collection and processing procedures. Sera from four confirmed cases of BSE, 137 BSE-exposed cohort animals associated with eight confirmed BSE cases, and 845 healthy, PrP(res)-negative control cows were tested. All four sera from confirmed BSE cases were repeatedly reactive in the assay. BSE-exposed cohorts had a 100-fold higher occurrence of repeatedly reactive individuals per cohort (average = 63%; range = 33% to 91%), compared to healthy controls (average = 0.6%; P < 0.001). This study shows that BSE-confirmed and cohort animals possess a unique profile of SINE-associated serum CNA that can be utilized as a marker that highly correlates to BSE exposure.
Bov-tA Short Interspersed Nucleotide Element Sequences in Circulating Nucleic Acids from Sera of Cattle with Bovine Spongiform Encephalopathy (BSE) and Sera of Cattle Exposed to BSE

PubMed Central

Schütz, Ekkehard; Urnovitz, Howard B.; Iakoubov, Leonid; Schulz-Schaeffer, Walter; Wemheuer, Wilhelm; Brenig, Bertram

2005-01-01

Circulating nucleic acids (CNA) are known to be enriched in repetitive DNA sequences in humans. Here, bovine sera CNA were analyzed to determine if cell stress-related short interspersed nucleotide elements (SINEs) could be detected in sera from cattle associated with bovine spongiform encephalopathy (BSE). Nucleic acids were extracted, amplified, cloned, and sequenced from the sera of protease-resistant prion protein (PrPres)-positive cattle (n = 2) and sera from BSE-cohort cows (n = 6); 150 out of 163 clones revealed the presence of, on average, an 80-bp sequence from the 3′ region of Bov-tA SINE. A PCR protocol was developed that differentially identified SINE-associated CNA in BSE-exposed versus normal cattle. CNA were extracted from a serum vesicular fraction after controlled blood collection and processing procedures. Sera from four confirmed cases of BSE, 137 BSE-exposed cohort animals associated with eight confirmed BSE cases, and 845 healthy, PrPres-negative control cows were tested. All four sera from confirmed BSE cases were repeatedly reactive in the assay. BSE-exposed cohorts had a 100-fold higher occurrence of repeatedly reactive individuals per cohort (average = 63%; range = 33% to 91%), compared to healthy controls (average = 0.6%; P < 0.001). This study shows that BSE-confirmed and cohort animals possess a unique profile of SINE-associated serum CNA that can be utilized as a marker that highly correlates to BSE exposure. PMID:16002628
Stress induced gene expression drives transient DNA methylation changes at adjacent repetitive elements

PubMed Central

Secco, David; Wang, Chuang; Shou, Huixia; Schultz, Matthew D; Chiarenza, Serge; Nussaume, Laurent; Ecker, Joseph R; Whelan, James; Lister, Ryan

2015-01-01

Cytosine DNA methylation (mC) is a genome modification that can regulate the expression of coding and non-coding genetic elements. However, little is known about the involvement of mC in response to environmental cues. Using whole genome bisulfite sequencing to assess the spatio-temporal dynamics of mC in rice grown under phosphate starvation and recovery conditions, we identified widespread phosphate starvation-induced changes in mC, preferentially localized in transposable elements (TEs) close to highly induced genes. These changes in mC occurred after changes in nearby gene transcription, were mostly DCL3a-independent, and could partially be propagated through mitosis, however no evidence of meiotic transmission was observed. Similar analyses performed in Arabidopsis revealed a very limited effect of phosphate starvation on mC, suggesting a species-specific mechanism. Overall, this suggests that TEs in proximity to environmentally induced genes are silenced via hypermethylation, and establishes the temporal hierarchy of transcriptional and epigenomic changes in response to stress. DOI: http://dx.doi.org/10.7554/eLife.09343.001 PMID:26196146
GrTEdb: the first web-based database of transposable elements in cotton (Gossypium raimondii).

PubMed

Xu, Zhenzhen; Liu, Jing; Ni, Wanchao; Peng, Zhen; Guo, Yue; Ye, Wuwei; Huang, Fang; Zhang, Xianggui; Xu, Peng; Guo, Qi; Shen, Xinlian; Du, Jianchang

2017-01-01

Although several diploid and tetraploid Gossypium species genomes have been sequenced, a well-annotated web-based transposable element (TE) database is lacking. To better understand the roles of TEs in the structural, functional and evolutionary dynamics of the cotton genome, a comprehensive, specific, and user-friendly web-based database, the Gossypium raimondii transposable elements database (GrTEdb), was constructed. A total of 14 332 TEs were structurally annotated and clearly categorized in the G. raimondii genome, and these elements have been classified into seven distinct superfamilies based on the order of protein-coding domains, structures and/or sequence similarity, including 2929 Copia-like elements, 10 368 Gypsy-like elements, 299 L1, 12 Mutators, 435 PIF-Harbingers, 275 CACTAs and 14 Helitrons. Meanwhile, web-based sequence browsing, searching, downloading and BLAST tools were implemented to help users easily and effectively annotate the TEs or TE fragments in genomic sequences from G. raimondii and other closely related Gossypium species. GrTEdb provides resources and information related to TEs in G. raimondii, and will facilitate gene and genome analyses within or across Gossypium species, evaluating the impact of TEs on their host genomes, and investigating the potential interaction between TEs and protein-coding genes in Gossypium species. http://www.grtedb.org/. © The Author(s) 2017. Published by Oxford University Press.
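As a quick consistency check, the seven superfamily counts listed in this record can be summed and compared with the stated total of 14 332 annotated TEs. The short sketch below uses only the figures quoted in the abstract; the variable names are illustrative.

```python
# Superfamily counts exactly as quoted in the abstract.
superfamily_counts = {
    "Copia-like": 2929,
    "Gypsy-like": 10368,
    "L1": 299,
    "Mutator": 12,
    "PIF-Harbinger": 435,
    "CACTA": 275,
    "Helitron": 14,
}

# Total number of annotated TEs implied by the per-superfamily figures.
total_tes = sum(superfamily_counts.values())

# Share of the annotation contributed by each superfamily.
shares = {name: count / total_tes for name, count in superfamily_counts.items()}
```

The sum reproduces the stated total, and Gypsy-like elements alone account for roughly 72% of the annotated TEs.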
Cerebellar activation during motor sequence learning is associated with subsequent transfer to new sequences.

PubMed

Shimizu, Renee E; Wu, Allan D; Knowlton, Barbara J

2016-12-01

Effective learning results not only in improved performance on a practiced task, but also in the ability to transfer the acquired knowledge to novel, similar tasks. Using a modified serial reaction time (RT) task, the authors examined the ability to transfer to novel sequences after practicing sequences in a repetitive order versus a nonrepeating interleaved order. Interleaved practice resulted in better performance on new sequences than repetitive practice. In a second study, participants practiced interleaved sequences in a functional MRI (fMRI) scanner and received a transfer test of novel sequences. Transfer ability was positively correlated with cerebellar blood oxygen level dependent activity during practice, indicating that greater cerebellar engagement during training resulted in better subsequent transfer performance. Interleaved practice may thus result in a more generalized representation that is robust to interference, and the degree of activation in the cerebellum may be a reflection of the instantiation and engagement of internal models. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
ALUminating the Path of Atherosclerosis Progression: Chaos Theory Suggests a Role for Alu Repeats in the Development of Atherosclerotic Vascular Disease.

PubMed

Hueso, Miguel; Cruzado, Josep M; Torras, Joan; Navarro, Estanislao

2018-06-12

Atherosclerosis (ATH) and coronary artery disease (CAD) are chronic inflammatory diseases with an important genetic background; they derive from the cumulative effect of multiple common risk alleles, most of which are located in genomic noncoding regions. These complex diseases behave as nonlinear dynamical systems that show a high dependence on their initial conditions; thus, long-term predictions of disease progression are unreliable. One likely possibility is that the nonlinear nature of ATH could be dependent on nonlinear correlations in the structure of the human genome. In this review, we show how chaos theory analysis has highlighted genomic regions that have shared specific structural constraints, which could have a role in ATH progression. These regions were shown to be enriched with repetitive sequences of the Alu family, genomic parasites that have colonized the human genome, which show a particular secondary structure and are involved in the regulation of gene expression. Here, we show the impact of Alu elements on the mechanisms that regulate gene expression, especially highlighting the molecular mechanisms via which the Alu elements alter the inflammatory response. We devote special attention to their relationship with the long noncoding RNA (lncRNA) antisense noncoding RNA in the INK4 locus (ANRIL), a risk factor for ATH; their role as microRNA (miRNA) sponges; and their ability to interfere with the regulatory circuitry of the nuclear factor kappa B (NF-κB) response. We aim to characterize ATH as a nonlinear dynamic system, in which small initial alterations in the expression of a number of repetitive elements are somehow amplified to reach phenotypic significance.
Stabilization and cytoskeletal-association of LDL receptor mRNA are mediated by distinct domains in its 3' untranslated region.

PubMed

Wilson, G M; Vasa, M Z; Deeley, R G

1998-05-01

The mRNA encoding the human low density lipoprotein (LDL) receptor is transiently stabilized after phorbol ester treatment of HepG2 cells and has been shown to associate with components of the cytoskeleton in this cell line (G. M. Wilson, E. A. Roberts, and R. G. Deeley, J. Lipid Res. 1997. 38: 437-446). Using an episomal expression system, fragments of the 3' untranslated region (3'UTR) of LDL receptor mRNA were transcribed in fusion with the coding region of beta-globin mRNA in HepG2 cells. Analyses of the decay kinetics of these beta-globin-LDL receptor fusion mRNA deletion mutants showed that sequences in the proximal 3'UTR of LDL receptor mRNA including several AU-rich elements (AREs) were sufficient to confer short constitutive mRNA half-life in the heterologous system. Stabilization of LDL receptor mRNA in the presence of PMA required sequences in the distal 3'UTR, at or near three Alu-like repetitive elements. Furthermore, the 3'UTR of LDL receptor mRNA conferred cytoskeletal association on the otherwise unassociated beta-globin mRNA, by a mechanism involving at least two distinct RNA elements. Comparisons of decay kinetics and subcellular localization of endogenous LDL receptor mRNA and beta-globin-LDL receptor mRNA fusions in HepG2 cells have demonstrated that several cis-acting elements in the receptor 3'UTR contribute to post-transcriptional regulation of receptor expression, and provide further support for involvement of the cytoskeleton in the regulation of LDL receptor mRNA turnover.
Identification of a unique library of complex, but ordered, arrays of repetitive elements in the human genome and implication of their potential involvement in pathobiology.

PubMed

Lee, Kang-Hoon; Lee, Young-Kwan; Kwon, Deug-Nam; Chiu, Sophia; Chew, Victoria; Rah, Hyungchul; Kujawski, Gregory; Melhem, Ramzi; Hsu, Karen; Chung, Cecilia; Greenhalgh, David G; Cho, Kiho

2011-06-01

Approximately 2% of the human genome is reported to be occupied by genes. Various forms of repetitive elements (REs), both characterized and uncharacterized, are presumed to make up the vast majority of the rest of the genomes of human and other species. In conjunction with a comprehensive annotation of genes, information regarding components of genome biology, such as gene polymorphisms, non-coding RNAs, and certain REs, is found in human genome databases. However, the genome-wide profile of unique RE arrangements formed by different groups of REs has not been fully characterized yet. In this study, the entire human genome was subjected to an unbiased RE survey to establish a whole-genome profile of REs and their arrangements. Due to the limitation in query size within the bl2seq alignment program (National Center for Biotechnology Information [NCBI]) utilized for the RE survey, the entire NCBI reference human genome was fragmented into 6206 units of 0.5M nucleotides. A number of RE arrangements with varying complexities and patterns were identified throughout the genome. Each chromosome had unique profiles of RE arrangements and density, and high levels of RE density were measured near the centromere regions. Subsequently, 175 complex RE arrangements, which were selected throughout the genome, were subjected to a comparison analysis using five different human genome sequences. Interestingly, three of the five human genome databases shared exactly the same arrangement patterns and sequences for all 175 RE arrangement regions (a total of 12,765,625 nucleotides). The findings from this study demonstrate that a substantial fraction of REs in the human genome are clustered into various forms of ordered structures. Further investigations are needed to examine whether some of these ordered RE arrangements contribute to human pathobiology as a functional genome unit. Copyright © 2011 Elsevier Inc. All rights reserved.
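The fragmentation step described in this record, cutting the reference genome into units of 0.5M nucleotides to fit a query-size limit, can be illustrated with a small windowing helper. This is a sketch under stated assumptions: the abstract does not say whether units were cut per chromosome or how the final partial unit was handled, so the per-sequence, half-open windows below and the name `fragment_spans` are hypothetical.

```python
def fragment_spans(sequence_length, unit=500_000):
    """Return half-open (start, end) spans covering a sequence in fixed-size units.

    The last span is shorter when the length is not a multiple of `unit`
    (an assumption; the source does not describe this detail).
    """
    if sequence_length < 0 or unit <= 0:
        raise ValueError("sequence_length must be >= 0 and unit must be > 0")
    return [
        (start, min(start + unit, sequence_length))
        for start in range(0, sequence_length, unit)
    ]


# Hypothetical 1.2 Mb sequence split into 0.5 Mb units.
toy_spans = fragment_spans(1_200_000)

# Upper bound on the nucleotides covered by the 6206 units quoted in the abstract.
max_covered = 6206 * 500_000
```

With 6206 units of at most 0.5M nucleotides each, the fragments cover at most 3.103 billion nucleotides, which is in line with the size of a human reference genome.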
Coincidence or not? Interconnected gas/fluid migration and ocean-atmosphere oscillations in the Levant Basin

NASA Astrophysics Data System (ADS)

Lazar, Michael; Lang, Guy; Schattner, Uri

2016-08-01

A growing number of studies on shallow marine gas/fluid systems from across the globe indicate their abundance throughout geological epochs. However, these episodic events have not been fully integrated into the fundamental concepts of continental margin development, which are thought to be dictated by three elements: tectonics, sedimentation and eustasy. The current study focuses on the passive sector of the Levant Basin on the eastern Mediterranean continental margin where these elements are well constrained, in order to isolate the contribution of gas/fluid systems. Single-channel, multichannel and 3D seismic reflection data are interpreted in terms of variance, chaos, envelope and sweetness attributes. Correlation with the Romi-1 borehole and sequence boundaries constrains interpretation of seismic stratigraphy. Results show a variety of fluid- or gas-related features such as seafloor and subsurface pockmarks, volumes of acoustic blanking, bright spots, conic pinnacle mounds, gas chimneys and high sweetness zones that represent possible secondary reservoirs. It is suggested that gas/fluid migrate upwards along lithological conduits such as falling-stage systems tracts and sequence boundaries during both highstands and lowstands. In all, 13 mid-late Pleistocene sequence boundaries are accompanied by independent evidence of 13 eustatic sea-level drops. Whether this connection is coincidental or not requires further research. These findings fill gaps between previously reported sporadic appearances throughout the Levant Basin and margin and throughout geological time from the Messinian until the present day, and create a unified framework for understanding the system as a whole. Repetitive appearance of these features suggests that their role in the morphodynamics of continental margins is more important than previously thought and thus may constitute one of the key elements of continental margin development.
An Ancient Transkingdom Horizontal Transfer of Penelope-Like Retroelements from Arthropods to Conifers.

PubMed

Lin, Xuan; Faridi, Nurul; Casola, Claudio

2016-05-02

Comparative genomics analyses empowered by the wealth of sequenced genomes have revealed numerous instances of horizontal DNA transfers between distantly related species. In eukaryotes, repetitive DNA sequences known as transposable elements (TEs) are especially prone to move across species boundaries. Such horizontal transposon transfers, or HTTs, are relatively common within major eukaryotic kingdoms, including animals, plants, and fungi, while rarely occurring across these kingdoms. Here, we describe the first case of HTT from animals to plants, involving TEs known as Penelope-like elements, or PLEs, a group of retrotransposons closely related to eukaryotic telomerases. Using a combination of in situ hybridization on chromosomes, polymerase chain reaction experiments, and computational analyses we show that the predominant PLE lineage, EN(+)PLEs, is highly diversified in loblolly pine and other conifers, but appears to be absent in other gymnosperms. Phylogenetic analyses of both protein and DNA sequences reveal that conifers EN(+)PLEs, or Dryads, form a monophyletic group clustering within a clade of primarily arthropod elements. Additionally, no EN(+)PLEs were detected in 1,928 genome assemblies from 1,029 nonmetazoan and nonconifer genomes from 14 major eukaryotic lineages. These findings indicate that Dryads emerged following an ancient horizontal transfer of EN(+)PLEs from arthropods to a common ancestor of conifers approximately 340 Ma. This represents one of the oldest known interspecific transmissions of TEs, and the most conspicuous case of DNA transfer between animals and plants. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2016. This work is written by US Government employees and is in the public domain in the US.
G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences.

PubMed

Lenis, Vasileios Panagiotis E; Swain, Martin; Larkin, Denis M

2018-05-01

Cross-species whole-genome sequence alignment is a critical first step for genome comparative analyses, ranging from the detection of sequence variants to studies of chromosome evolution. Animal genomes are large and complex, and whole-genome alignment is a computationally intense process, requiring expensive high-performance computing systems due to the need to explore extensive local alignments. With hundreds of sequenced animal genomes available from multiple projects, there is an increasing demand for genome comparative analyses. Here, we introduce G-Anchor, a new, fast, and efficient pipeline that uses a strictly limited but highly effective set of local sequence alignments to anchor (or map) an animal genome to another species' reference genome. G-Anchor makes novel use of a databank of highly conserved DNA sequence elements. We demonstrate how these elements may be aligned to a pair of genomes, creating anchors. These anchors enable the rapid mapping of scaffolds from a de novo assembled genome to chromosome assemblies of a reference species. Our results demonstrate that G-Anchor can successfully anchor a vertebrate genome onto a phylogenetically related reference species genome using a desktop or laptop computer within a few hours and with comparable accuracy to that achieved by a highly accurate whole-genome alignment tool such as LASTZ. G-Anchor thus makes whole-genome comparisons accessible to researchers with limited computational resources. G-Anchor is a ready-to-use tool for anchoring a pair of vertebrate genomes. It may be used with large genomes that contain a significant fraction of evolutionarily conserved DNA sequences and that are not highly repetitive, polyploid, or excessively fragmented. G-Anchor is not a substitute for whole-genome aligning software but can be used for fast and accurate initial genome comparisons. G-Anchor is freely available and a ready-to-use tool for the pairwise comparison of two genomes.
Coiled-coil length: Size does matter.

PubMed

Surkont, Jaroslaw; Diekmann, Yoan; Ryder, Pearl V; Pereira-Leal, Jose B

2015-12-01

Protein evolution is governed by processes that alter primary sequence but also the length of proteins. Protein length may change in different ways, but insertions, deletions and duplications are the most common. An optimal protein size is a trade-off between sequence extension, which may change protein stability or lead to acquisition of a new function, and shrinkage that decreases metabolic cost of protein synthesis. Despite the general tendency for length conservation across orthologous proteins, the propensity to accept insertions and deletions is heterogeneous along the sequence. For example, protein regions rich in repetitive peptide motifs are well known to extensively vary their length across species. Here, we analyze length conservation of coiled-coils, domains formed by a ubiquitous, repetitive peptide motif present in all domains of life, that frequently plays a structural role in the cell. We observed that, despite the repetitive nature, the length of coiled-coil domains is generally highly conserved throughout the tree of life, even when the remaining parts of the protein change, including globular domains. Length conservation is independent of primary amino acid sequence variation, and represents a conservation of domain physical size. This suggests that the conservation of domain size is due to functional constraints. © 2015 Wiley Periodicals, Inc.
Repetitive DNAs and shrink genomes: A chromosomal analysis in nine Columbidae species (Aves, Columbiformes).

PubMed

Kretschmer, Rafael; de Oliveira, Thays Duarte; de Oliveira Furo, Ivanete; Oliveira Silva, Fabio Augusto; Gunski, Ricardo José; Del Valle Garnero, Analía; de Bello Cioffi, Marcelo; de Oliveira, Edivaldo Herculano Corrêa; de Freitas, Thales Renato Ochotorena

2018-01-01

An extensive karyotype variation is found among species belonging to the Columbidae family of birds (Columbiformes), both in diploid number and chromosomal morphology. Although clusters of repetitive DNA sequences play an important role in chromosomal instability, and therefore in chromosomal rearrangements, little is known about their distribution and amount in avian genomes. The aim of this study was to analyze the distribution of 11 distinct microsatellite sequences, as well as clusters of 18S rDNA, in nine different Columbidae species, correlating their distribution with the occurrence of chromosomal rearrangements. We found 2n values ranging from 76 to 86 and nine out of 11 microsatellite sequences showed distinct hybridization signals among the analyzed species. The accumulation of microsatellite repeats was found preferentially in the centromeric region of macro and microchromosomes, and in the W chromosome. Additionally, pair 2 showed the accumulation of several microsatellites in different combinations and locations in the distinct species, suggesting the occurrence of intrachromosomal rearrangements, as well as a possible fission of this pair in Geotrygon species. Therefore, although birds have a smaller amount of repetitive sequences when compared to other Tetrapoda, these seem to play an important role in the karyotype evolution of these species.
Comparative molecular cytogenetic analyses of a major tandemly repeated DNA family and retrotransposon sequences in cultivated jute Corchorus species (Malvaceae).

PubMed

Begum, Rabeya; Zakrzewski, Falk; Menzel, Gerhard; Weber, Beatrice; Alam, Sheikh Shamimul; Schmidt, Thomas

2013-07-01

The cultivated jute species Corchorus olitorius and Corchorus capsularis are important fibre crops. The analysis of repetitive DNA sequences, comprising a major part of plant genomes, has not been carried out in jute but is useful to investigate the long-range organization of chromosomes. The aim of this study was the identification of repetitive DNA sequences to facilitate comparative molecular and cytogenetic studies of two jute cultivars and to develop a fluorescent in situ hybridization (FISH) karyotype for chromosome identification. A plasmid library was generated from C. olitorius and C. capsularis with genomic restriction fragments of 100-500 bp, which was complemented by targeted cloning of satellite DNA by PCR. The diversity of the repetitive DNA families was analysed comparatively. The genomic abundance and chromosomal localization of different repeat classes were investigated by Southern analysis and FISH, respectively. The cytosine methylation of satellite arrays was studied by immunolabelling. Major satellite repeats and retrotransposons have been identified from C. olitorius and C. capsularis. The satellite family CoSat I forms two undermethylated species-specific subfamilies, while the long terminal repeat (LTR) retrotransposons CoRetro I and CoRetro II show similarity to the Metaviridae of plant retroelements. FISH karyotypes were developed by multicolour FISH using these repetitive DNA sequences in combination with 5S and 18S-5·8S-25S rRNA genes which enable the unequivocal chromosome discrimination in both jute species. The analysis of the structure and diversity of the repeated DNA is crucial for genome sequence annotation. The reference karyotypes will be useful for breeding of jute and provide the basis for karyotyping homeologous chromosomes of wild jute species to reveal the genetic and evolutionary relationship between cultivated and wild Corchorus species.
[Efficient identification and analysis of low and medium frequency repeats]. Progress report

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jurka, J.

1991-08-28

The effective starting date of this grant was May 15. In the first three months of this project we focused primarily on organizational and technical aspects of our research which included: organization of the database of repeats in primates; preparation of software for rapid and sensitive search of novel repetitive elements in GenBank; purchase and installation of the Sun workstation; and research on the mammal-specific MAR1 family of repetitive elements (to be communicated in October).
Molecular Epidemiology of Carbapenem-Resistant Acinetobacter baumannii Isolates in the Gulf Cooperation Council States: Dominance of OXA-23-Type Producers

PubMed Central

Sartor, Anna L.; Sidjabat, Hanna E.; Balkhy, Hanan H.; Walsh, Timothy R.; Al Johani, Sameera M.; AlJindan, Reem Y.; Alfaresi, Mubarak; Ibrahim, Emad; Al-Jardani, Amina; Al Salman, Jameela; Dashti, Ali A.; Johani, Khalid; Paterson, David L.

2015-01-01

The molecular epidemiology and mechanisms of resistance of carbapenem-resistant Acinetobacter baumannii (CRAB) were determined in hospitals in the states of the Cooperation Council for the Arab States of the Gulf (Gulf Cooperation Council [GCC]), namely, Saudi Arabia, United Arab Emirates, Oman, Qatar, Bahrain, and Kuwait. Isolates were subjected to PCR-based detection of antibiotic resistance genes and repetitive sequence-based PCR (rep-PCR) assessments of clonality. Selected isolates were subjected to multilocus sequence typing (MLST). We investigated 117 isolates resistant to carbapenem antibiotics (either imipenem or meropenem). All isolates were positive for OXA-51. The most common carbapenemases were the OXA-23-type, found in 107 isolates, followed by OXA-40-type (OXA-24-type), found in 5 isolates; 3 isolates carried the ISAba1 element upstream of blaOXA-51-type. No OXA-58-type, NDM-type, VIM-type, or IMP-type producers were detected. Multiple clones were detected with 16 clusters of clonally related CRAB. Some clusters involved hospitals in different states. MLST analysis of 15 representative isolates from different clusters identified seven different sequence types (ST195, ST208, ST229, ST436, ST450, ST452, and ST499), as well as three novel STs. The vast majority (84%) of the isolates in this study were associated with health care exposure. Awareness of multidrug-resistant organisms in GCC states has important implications for optimizing infection control practices; establishing antimicrobial stewardship programs within hospital, community, and agricultural settings; and emphasizing the need for establishing regional active surveillance systems. This will help to control the spread of CRAB in the Middle East and in hospitals accommodating transferred patients from this region. PMID:25568439
Molecular epidemiology of carbapenem-resistant Acinetobacter baumannii isolates in the Gulf Cooperation Council States: dominance of OXA-23-type producers.

PubMed

Zowawi, Hosam M; Sartor, Anna L; Sidjabat, Hanna E; Balkhy, Hanan H; Walsh, Timothy R; Al Johani, Sameera M; AlJindan, Reem Y; Alfaresi, Mubarak; Ibrahim, Emad; Al-Jardani, Amina; Al Salman, Jameela; Dashti, Ali A; Johani, Khalid; Paterson, David L

2015-03-01

The molecular epidemiology and mechanisms of resistance of carbapenem-resistant Acinetobacter baumannii (CRAB) were determined in hospitals in the states of the Cooperation Council for the Arab States of the Gulf (Gulf Cooperation Council [GCC]), namely, Saudi Arabia, United Arab Emirates, Oman, Qatar, Bahrain, and Kuwait. Isolates were subjected to PCR-based detection of antibiotic resistance genes and repetitive sequence-based PCR (rep-PCR) assessments of clonality. Selected isolates were subjected to multilocus sequence typing (MLST). We investigated 117 isolates resistant to carbapenem antibiotics (either imipenem or meropenem). All isolates were positive for OXA-51. The most common carbapenemases were the OXA-23-type, found in 107 isolates, followed by OXA-40-type (OXA-24-type), found in 5 isolates; 3 isolates carried the ISAba1 element upstream of blaOXA-51-type. No OXA-58-type, NDM-type, VIM-type, or IMP-type producers were detected. Multiple clones were detected with 16 clusters of clonally related CRAB. Some clusters involved hospitals in different states. MLST analysis of 15 representative isolates from different clusters identified seven different sequence types (ST195, ST208, ST229, ST436, ST450, ST452, and ST499), as well as three novel STs. The vast majority (84%) of the isolates in this study were associated with health care exposure. Awareness of multidrug-resistant organisms in GCC states has important implications for optimizing infection control practices; establishing antimicrobial stewardship programs within hospital, community, and agricultural settings; and emphasizing the need for establishing regional active surveillance systems. This will help to control the spread of CRAB in the Middle East and in hospitals accommodating transferred patients from this region. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Emergence of multidrug-resistant Proteus mirabilis in a long-term care facility in Croatia.

PubMed

Bedenić, Branka; Firis, Nataša; Elveđi-Gašparović, Vesna; Krilanović, Marija; Matanović, Krešimir; Štimac, Iva; Luxner, Josefa; Vraneš, Jasmina; Meštrović, Tomislav; Zarfel, Gernot; Grisold, Andrea

2016-06-01

An increased frequency of Proteus mirabilis isolates resistant to expanded-spectrum cephalosporins was observed recently in a long-term care facility in Zagreb (Godan). The aim of this study was the molecular characterization of resistance mechanisms to new cephalosporins in P. mirabilis isolates from this nursing home. Thirty-eight isolates collected from 2013-2015 showing reduced susceptibility to ceftazidime were investigated. Antibiotic susceptibilities were determined by broth microdilution method. Inhibitor-based tests were performed to detect extended-spectrum (ESBLs) and AmpC β-lactamases. AmpC β-lactamases were characterized by polymerase chain reaction (PCR) followed by sequencing of bla ampC genes. Quinolone resistance determinants (qnr genes) were characterized by PCR. Genotyping of the isolates was performed by repetitive element sequence (rep)-PCR and pulsed-field gel electrophoresis (PFGE). Presence of an AmpC β-lactamase was confirmed in all isolates by combined-disk test with phenylboronic acid. All isolates were resistant to amoxicillin alone and combined with clavulanate, cefotaxime, ceftriaxone, cefoxitin, and ciprofloxacin; but susceptible to cefepime, imipenem, and meropenem. PCR followed by sequencing using primers targeting bla ampC genes revealed CMY-16 β-lactamase in all but one strain. The bla CMY-16 gene was carried by a non-conjugative plasmid which did not belong to any known plasmid-based replicon typing (PBRT) group. Rep-PCR identified one large clone consisting of 15 isolates, three pairs of related isolates, one triplet, and four singletons. PFGE confirmed the clonality of the isolates. This is the first report of multidrug-resistant P. mirabilis in a nursing home in Croatia. Cephalosporin resistance was due to plasmid-mediated AmpC β-lactamase CMY-16.

Chompy: an infestation of MITE-like repetitive elements in the crocodilian genome.

PubMed

Ray, David A; Hedges, Dale J; Herke, Scott W; Fowlkes, Justin D; Barnes, Erin W; LaVie, Daniel K; Goodwin, Lindsey M; Densmore, Llewellyn D; Batzer, Mark A

2005-12-05

Interspersed repeats are a major component of most eukaryotic genomes and have an impact on genome size and stability, but the repetitive element landscape of crocodilian genomes has not yet been fully investigated. In this report, we provide the first detailed characterization of an interspersed repeat element in any crocodilian genome. Chompy is a putative miniature inverted-repeat transposable element (MITE) family initially recovered from the genome of Alligator mississippiensis (American alligator) but also present in the genomes of Crocodylus moreletii (Morelet's crocodile) and Gavialis gangeticus (Indian gharial). The element has all of the hallmarks of MITEs including terminal inverted repeats, possible target site duplications, and a tendency to form secondary structures. We estimate the copy number in the alligator genome to be approximately 46,000 copies. As a result of their size and unique properties, Chompy elements may provide a useful source of genomic variation for crocodilian comparative genomics.
Task switching in a hierarchical task structure: evidence for the fragility of the task repetition benefit.

PubMed

Lien, Mei-Ching; Ruthruff, Eric

2004-05-01

This study examined how task switching is affected by hierarchical task organization. Traditional task-switching studies, which use a constant temporal and spatial distance between each task element (defined as a stimulus requiring a response), promote a flat task structure. Using this approach, Experiment 1 revealed a large switch cost of 238 ms. In Experiments 2-5, adjacent task elements were grouped temporally and/or spatially (forming an ensemble) to create a hierarchical task organization. Results indicate that the effect of switching at the ensemble level dominated the effect of switching at the element level. Experiments 6 and 7, using an ensemble of 3 task elements, revealed that the element-level switch cost was virtually absent between ensembles but was large within an ensemble. The authors conclude that the element-level task repetition benefit is fragile and can be eliminated in a hierarchical task organization.
Task switching in a hierarchical task structure: evidence for the fragility of the task repetition benefit

NASA Technical Reports Server (NTRS)

Lien, Mei-Ching; Ruthruff, Eric

2004-01-01

This study examined how task switching is affected by hierarchical task organization. Traditional task-switching studies, which use a constant temporal and spatial distance between each task element (defined as a stimulus requiring a response), promote a flat task structure. Using this approach, Experiment 1 revealed a large switch cost of 238 ms. In Experiments 2-5, adjacent task elements were grouped temporally and/or spatially (forming an ensemble) to create a hierarchical task organization. Results indicate that the effect of switching at the ensemble level dominated the effect of switching at the element level. Experiments 6 and 7, using an ensemble of 3 task elements, revealed that the element-level switch cost was virtually absent between ensembles but was large within an ensemble. The authors conclude that the element-level task repetition benefit is fragile and can be eliminated in a hierarchical task organization.
Evidence-Based Clinical Recommendations for the Administration of the Sequential Motion Rates Task

ERIC Educational Resources Information Center

Icht, Michal; Ben-David, Boaz M.

2018-01-01

The sequential motion rates (SMR) task, that involves rapid and accurate repetitions of a syllable sequence, /pataka/, is a commonly used evaluation tool for oro-motor abilities. Although the SMR is a well-known tool, some aspects of its administration protocol are unspecified. We address the following factors and their role in the SMR protocol:…
Identification and molecular epidemiology of dermatophyte isolates by repetitive-sequence-PCR-based DNA fingerprinting using the DiversiLab system in Turkey.

PubMed

Koc, A Nedret; Atalay, Mustafa A; Inci, Melek; Sariguzel, Fatma M; Sav, Hafize

2017-05-01

The isolation and identification of dermatophyte species in clinical samples remain difficult and time-consuming. The identification and molecular epidemiology of dermatophytes commonly isolated in a clinical laboratory in Turkey by repetitive sequence-based PCR (rep-PCR) were assessed by comparing the results with those of reference identification. A total of 44 dermatophytes isolated from various clinical specimens of 20 patients with superficial mycoses in Kayseri and 24 patients in Hatay were studied. The identification of dermatophyte isolates was based on the reference identification and rep-PCR using the DiversiLab System (BioMerieux). The genotyping of dermatophyte isolates from different patients was determined by rep-PCR. In the identification of dermatophyte isolates, agreement between rep-PCR and conventional methods was 87.8% (36 of 41). The dermatophyte strains belonged to four clones (A-D), which were determined by the use of rep-PCR. The dermatophyte strains in Clones B and D showed identical patterns with respect to the region. In conclusion, rep-PCR appears to be useful for evaluation of the identification and clonal relationships between Trichophyton rubrum species complex and Trichophyton mentagrophytes species complex isolates. The similarity and diversity of these isolates may be assessed according to different regions by rep-PCR. © 2017 Blackwell Verlag GmbH.
Spectral negentropy based sidebands and demodulation analysis for planet bearing fault diagnosis

NASA Astrophysics Data System (ADS)

Feng, Zhipeng; Ma, Haoqun; Zuo, Ming J.

2017-12-01

Planet bearing vibration signals are highly complex due to intricate kinematics (involving both revolution and spinning) and strong multiple modulations (including not only the fault induced amplitude modulation and frequency modulation, but also additional amplitude modulations due to load zone passing, time-varying vibration transfer path, and time-varying angle between the gear pair mesh lines of action and fault impact force vector), leading to difficulty in fault feature extraction. Rolling element bearing fault diagnosis essentially relies on detection of fault induced repetitive impulses carried by resonance vibration, but they are usually contaminated by noise and therefore are hard to detect. This further adds complexity to planet bearing diagnostics. Spectral negentropy is able to reveal the frequency distribution of repetitive transients, thus providing an approach to identify the optimal frequency band of a filter for separating repetitive impulses. In this paper, we find the informative frequency band (including the center frequency and bandwidth) of bearing fault induced repetitive impulses using the spectral negentropy based infogram. In Fourier spectrum, we identify planet bearing faults according to sideband characteristics around the center frequency. For demodulation analysis, we filter out the sensitive component based on the informative frequency band revealed by the infogram. In amplitude demodulated spectrum (squared envelope spectrum) of the sensitive component, we diagnose planet bearing faults by matching the present peaks with the theoretical fault characteristic frequencies. We further decompose the sensitive component into mono-component intrinsic mode functions (IMFs) to estimate their instantaneous frequencies, and select a sensitive IMF with an instantaneous frequency fluctuating around the center frequency for frequency demodulation analysis. In the frequency demodulated spectrum (Fourier spectrum of instantaneous frequency) of selected IMF, we discern planet bearing fault reasons according to the present peaks. The proposed spectral negentropy infogram based spectrum and demodulation analysis method is illustrated via a numerically simulated signal analysis. Considering the unique load bearing feature of planet bearings, experimental validations under both no-load and loading conditions are done to verify the derived fault symptoms and the proposed method. The localized faults on outer race, rolling element and inner race are successfully diagnosed.
Nucleotide sequences of Dictyostelium discoideum developmentally regulated cDNAs rich in (AAC) imply proteins that contain clusters of asparagine, glutamine, or threonine.

PubMed

Shaw, D R; Richter, H; Giorda, R; Ohmachi, T; Ennis, H L

1989-09-01

A Dictyostelium discoideum repetitive element composed of long repeats of the codon (AAC) is found in developmentally regulated transcripts. The concentration of (AAC) sequences is low in mRNA from dormant spores and growing cells and increases markedly during spore germination and multicellular development. The sequence hybridizes to many different sized Dictyostelium DNA restriction fragments indicating that it is scattered throughout the genome. Four cDNA clones isolated contain (AAC) sequences in the deduced coding region. Interestingly, the (AAC)-rich sequences are present in all three reading frames in the deduced proteins, i.e., AAC (asparagine), ACA (threonine) and CAA (glutamine). Three of the clones contain only one of these in-frame so that the individual proteins carry either asparagine, threonine, or glutamine clusters, not mixtures. However, one clone is both glutamine- and asparagine-rich. The (AAC) portion of the transcripts are reiterated 300 times in the haploid genome while the other portions of the cDNAs represent single copy genes, whose sequences show no similarity other than the (AAC) repeats. The repeated sequence is similar to the opa or M sequence found in Drosophila melanogaster notch and homeo box genes and in fly developmentally regulated transcripts. The transcripts are present on polysomes suggesting that they are translated. Although the function of these repeats is unknown, long amino acid repeats are a characteristic feature of extracellular proteins of lower eukaryotes.
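The reading-frame observation in this abstract can be illustrated with a minimal Python sketch. It assumes a synthetic (AAC) repeat rather than any of the actual cDNA sequences, it uses only the three codons named in the abstract instead of a full genetic code table, and the function name `translate_frames` is hypothetical.

```python
# Only the three codons named in the abstract; this is not a full codon table.
CODONS = {"AAC": "N", "ACA": "T", "CAA": "Q"}  # Asn, Thr, Gln


def translate_frames(dna):
    """Translate a DNA string in the three forward reading frames (0, 1, 2)."""
    frames = []
    for offset in range(3):
        # Take complete codons only, starting at the given frame offset.
        codons = [dna[i:i + 3] for i in range(offset, len(dna) - 2, 3)]
        # Codons outside the small table are shown as "?".
        frames.append("".join(CODONS.get(c, "?") for c in codons))
    return frames


repeat = "AAC" * 6  # synthetic repeat, an assumption for illustration
for offset, peptide in enumerate(translate_frames(repeat)):
    print(offset, peptide)
```

Each frame of the same repeat yields a homopolymeric run of a different amino acid, which is consistent with the abstract's point that individual proteins carry asparagine, threonine, or glutamine clusters depending on the frame in use.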
Genetic diversity of the captive Asian tapir population in Thailand, based on mitochondrial control region sequence data and the comparison of its nucleotide structure with Brazilian tapir.

PubMed

Muangkram, Yuttamol; Amano, Akira; Wajjwalku, Worawidh; Pinyopummintr, Tanu; Thongtip, Nikorn; Kaolim, Nongnid; Sukmak, Manakorn; Kamolnorranath, Sumate; Siriaroonrat, Boripat; Tipkantha, Wanlaya; Maikaew, Umaporn; Thomas, Warisara; Polsrila, Kanda; Dongsaard, Kwanreaun; Sanannu, Saowaphang; Wattananorrasate, Anuwat

2017-07-01

The Asian tapir (Tapirus indicus) has been classified as Endangered on the IUCN Red List of Threatened Species (2008). Genetic diversity data provide important information for the management of captive breeding and conservation of this species. We analyzed mitochondrial control region (CR) sequences from 37 captive Asian tapirs in Thailand. Multiple alignments of the full-length CR sequences sized 1268 bp comprised three domains as described in other mammal species. Analysis of 16 parsimony-informative variable sites revealed 11 haplotypes. Furthermore, the phylogenetic analysis using median-joining network clearly showed three clades correlated with our earlier cytochrome b gene study in this endangered species. The repetitive motif is located between first and second conserved sequence blocks, similar to the Brazilian tapir. The highest polymorphic site was located in the extended termination associated sequences domain. The results could be applied for future genetic management based in captivity and wild that shows stable populations.
The CGTCA sequence motif is essential for biological activity of the vasoactive intestinal peptide gene cAMP-regulated enhancer.

PubMed Central

Fink, J S; Verhave, M; Kasper, S; Tsukada, T; Mandel, G; Goodman, R H

1988-01-01

cAMP-regulated transcription of the human vasoactive intestinal peptide gene is dependent upon a 17-base-pair DNA element located 70 base pairs upstream from the transcriptional initiation site. This element is similar to sequences in other genes known to be regulated by cAMP and to sequences in several viral enhancers. We have demonstrated that the vasoactive intestinal peptide regulatory element is an enhancer that depends upon the integrity of two CGTCA sequence motifs for biological activity. Mutations in either of the CGTCA motifs diminish the ability of the element to respond to cAMP. Enhancers containing the CGTCA motif from the somatostatin and adenovirus genes compete for binding of nuclear proteins from C6 glioma and PC12 cells to the vasoactive intestinal peptide enhancer, suggesting that CGTCA-containing enhancers interact with similar transacting factors. PMID:2842787
Sensitive and Specific Target Sequences Selected from Retrotransposons of Schistosoma japonicum for the Diagnosis of Schistosomiasis

PubMed Central

Xu, Jing; Zhu, Xing-Quan; Wang, Sheng-Yue; Xia, Chao-Ming

2012-01-01

Background: Schistosomiasis japonica is a serious debilitating and sometimes fatal disease. Accurate diagnostic tests play a key role in patient management and control of the disease. However, currently available diagnostic methods are not ideal, and the detection of the parasite DNA in blood samples has turned out to be one of the most promising tools for the diagnosis of schistosomiasis. In our previous investigations, a 230-bp sequence from the highly repetitive retrotransposon SjR2 was identified and it showed high sensitivity and specificity for detecting Schistosoma japonicum DNA in the sera of rabbit model and patients. Recently, 29 retrotransposons were found in S. japonicum genome by our group. The present study highlighted the key factors for selecting a new prospective sensitive target DNA sequence for the diagnosis of schistosomiasis, which can serve as an example for other parasitic pathogens. Methodology/Principal Findings: In this study, we demonstrated that the key factors based on the bioinformatic analysis for selecting target sequence are the higher genome proportion, repetitive complete copies and partial copies, and active ESTs than the others in the chromosome genome. New primers based on 25 novel retrotransposons and SjR2 were designed and their sensitivity and specificity for detecting S. japonicum DNA were compared. The results showed that a new 303-bp sequence from non-long terminal repeat (LTR) retrotransposon (SjCHGCS19) had high sensitivity and specificity. The 303-bp target sequence was amplified from the sera of rabbit model at 3 d post-infection by nested-PCR and it became negative at 17 weeks post-treatment. Furthermore, the percentage sensitivity of the nested-PCR was 97.67% in 43 serum samples of S. japonicum-infected patients. Conclusions/Significance: Our findings highlighted the key factors based on the bioinformatic analysis for selecting target sequence from S. japonicum genome, which provide a basis for establishing powerful molecular diagnostic techniques that can be used for monitoring early infection and therapy efficacy to support schistosomiasis control programs. PMID:22479661
Nonword Repetition in Children and Adults: Effects on Movement Coordination

ERIC Educational Resources Information Center

Sasisekaran, Jayanthi; Smith, Anne; Sadagopan, Neeraja; Weber-Fox, Christine

2010-01-01

Hearing and repeating novel phonetic sequences, or novel nonwords, is a task that taps many levels of processing, including auditory decoding, phonological processing, working memory, speech motor planning and execution. Investigations of nonword repetition abilities have been framed within models of psycholinguistic processing, while the motor…
Bilingual Performance on Nonword Repetition in Spanish and English

ERIC Educational Resources Information Center

Summers, Connie; Bohman, Thomas M.; Gillam, Ronald B.; Pena, Elizabeth D.; Bedore, Lisa M.

2010-01-01

Background: Nonword repetition (NWR) involves the ability to perceive, store, recall and reproduce phonological sequences. These same abilities play a role in word and morpheme learning. Cross-linguistic studies of performance on NWR tasks, word learning, and morpheme learning yield patterns of increased performance on all three tasks as a…
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE PAGES

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.; ...

2017-07-18

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

PubMed Central

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Richard A.; Brown, Steven D.

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences. PMID:28769883
Evolutionary diversity and potential recombinogenic role of integration targets of non-LTR retrotransposons

PubMed Central

Gentles, Andrew J.; Kohany, Oleksiy; Jurka, Jerzy

2005-01-01

Short interspersed elements (SINEs) make up a significant fraction of total DNA in mammalian genomes, providing a rich substrate for chromosomal rearrangements by SINE-SINE recombinations. Proliferation of mammalian SINEs is mediated primarily by LINE1 (L1) non-LTR retrotransposons that preferentially integrate at DNA sequence targets with average length ~15 bp and containing conserved endonucleolytic nicking signals at both ends. We report that sequence variations in the first of the two nicking signals, represented by a 5′TT-AAAA consensus sequence, affect the position of the second signal thus leading to target site duplications (TSDs) of different lengths. The length distribution of TSDs appears to be affected also by L1-encoded enzyme variants, since targets with the same 5′ nicking site can be of different average length in different mammalian species. Taking this into account, we re-analyzed the second nicking site and found that it is larger and includes more conserved sites than previously appreciated, with a consensus of 5′ANTNTN-AA. We also studied potential involvement of the nicking sites in stimulating recombinations between SINE elements. We determined that SINE elements retaining TSDs with perfect 5′TT-AAAA nicking sites appear to be lost relatively rapidly from the human and rat genomes, and less rapidly from dog. We speculate that the introduction of single-strand DNA breaks induced by recurring endonucleolytic attacks at these sites, combined with the ubiquitousness of SINEs, may significantly promote recombination between repetitive elements, leading to the observed losses. At the same time new L1 subfamilies may be selected for “incompatibility” with pre-existing targets. This provides a possible driving force for the continual emergence of new L1 subfamilies which, in turn, may affect selection of L1-dependent SINE subfamilies. PMID:15944437
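As a rough illustration of how perfect matches to the first nicking signal might be located, the following Python sketch scans a sequence for the 5′TT-AAAA consensus quoted in this abstract. It is not the authors' method: it reads the hyphen as the nick position (an assumption), searches one strand only, ignores the sequence variants the abstract discusses, and uses a made-up example sequence and a hypothetical function name `find_nick_sites`.

```python
import re

# Consensus quoted in the abstract: 5'-TT|AAAA, nick assumed between TT and AAAA.
# A zero-width lookahead lets overlapping occurrences be reported as well.
FIRST_NICK = re.compile(r"(?=TTAAAA)")


def find_nick_sites(seq):
    """Return 0-based indices of the first A after TT for each perfect match."""
    return [m.start() + 2 for m in FIRST_NICK.finditer(seq.upper())]


example = "GGCTTAAAAGTCCATTAAAATG"  # made-up sequence for illustration
print(find_nick_sites(example))
```

A fuller analysis along the lines described in the abstract would also need to pair each first nicking site with a second site to delimit a target site duplication; that step is omitted here.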
BAC-pool 454-sequencing: A rapid and efficient approach to sequence complex tetraploid cotton genomes

USDA-ARS's Scientific Manuscript database

New and emerging next generation sequencing technologies have been promising in reducing sequencing costs, but not significantly for complex polyploid plant genomes such as cotton. Large and highly repetitive genome of G. hirsutum (~2.5GB) is less amenable and cost-intensive with traditional BAC-by...
Sequencing of 15,622 gene-bearing BACs clarifies the gene-dense regions of the barley genome

USDA-ARS's Scientific Manuscript database

Barley (Hordeum vulgare L.) possesses a large and highly repetitive genome of 5.1 Gb that has hindered the development of a complete sequence. In 2012, the International Barley Sequencing Consortium released a resource integrating whole-genome shotgun sequences with a physical and genetic framework....
Evolution of bird genomes-a transposon's-eye view.

PubMed

Kapusta, Aurélie; Suh, Alexander

2017-02-01

Birds, the most species-rich monophyletic group of land vertebrates, have been subject to some of the most intense sequencing efforts to date, making them an ideal case study for recent developments in genomics research. Here, we review how our understanding of bird genomes has changed with the recent sequencing of more than 75 species from all major avian taxa. We illuminate avian genome evolution from a previously neglected perspective: their repetitive genomic parasites, transposable elements (TEs) and endogenous viral elements (EVEs). We show that (1) birds are unique among vertebrates in terms of their genome organization; (2) information about the diversity of avian TEs and EVEs is changing rapidly; (3) flying birds have smaller genomes yet more TEs than flightless birds; (4) current second-generation genome assemblies fail to capture the variation in avian chromosome number and genome size determined with cytogenetics; (5) the genomic microcosm of bird-TE "arms races" has yet to be explored; and (6) upcoming third-generation genome assemblies suggest that birds exhibit stability in gene-rich regions and instability in TE-rich regions. We emphasize that integration of cytogenetics and single-molecule technologies with repeat-resolved genome assemblies is essential for understanding the evolution of (bird) genomes. © 2016 New York Academy of Sciences.
Functional noncoding sequences derived from SINEs in the mammalian genome

PubMed Central

Nishihara, Hidenori; Smit, Arian F.A.; Okada, Norihiro

2006-01-01

Recent comparative analyses of mammalian sequences have revealed that a large number of nonprotein-coding genomic regions are under strong selective constraint. Here, we report that some of these loci have been derived from a newly defined family of ancient SINEs (short interspersed repetitive elements). This is a surprising result, as SINEs and other transposable elements are commonly thought to be genomic parasites. We named the ancient SINE family AmnSINE1, for Amniota SINE1, because we found it to be present in mammals as well as in birds, and some copies predate the mammalian-bird split 310 million years ago (Mya). AmnSINE1 has a chimeric structure of a 5S rRNA and a tRNA-derived SINE, and is related to five tRNA-derived SINE families that we characterized here in the coelacanth, dogfish shark, hagfish, and amphioxus genomes. All of the newly described SINE families have a common central domain that is also shared by zebrafish SINE3, and we collectively name them the DeuSINE (Deuterostomia SINE) superfamily. Notably, of the ∼1000 still identifiable copies of AmnSINE1 in the human genome, 105 correspond to loci phylogenetically highly conserved among mammalian orthologs. The conservation is strongest over the central domain. Thus, AmnSINE1 appears to be the best example of a transposable element of which a significant fraction of the copies have acquired genomic functionality. PMID:16717141
