Sample records for coding tandem repeats

  1. Complete mitochondrial genome of the whiter-spotted flower chafer, Protaetia brevitarsis (Coleoptera: Scarabaeidae).

    PubMed

    Kim, Min Jee; Im, Hyun Hwak; Lee, Kwang Youll; Han, Yeon Soo; Kim, Iksoo

    2014-06-01

    Abstract The complete nucleotide sequences of the mitochondrial genome from the whiter-spotted flower chafer, Protaetia brevitarsis (Coleoptera: Scarabaeidae), was determined. The 20,319-bp long circular genome is the longest among completely sequenced Coleoptera. As is typical in animals, the P. brevitarsis genome consisted of two ribosomal RNAs, 22 transfer RNAs, 13 protein-coding genes and one A + T-rich region. Although the size of the coding genes was typical, the non-coding A + T-rich region was 5654 bp, which is the longest in insects. The extraordinary length of this region was composed of 28,117-bp tandem repeats and 782-bp tandem repeats. These repeat sequences were encompassed by three non-repeat sequences constituting 1804 bp.

  2. Multiple independent insertions of 5S rRNA genes in the spliced-leader gene family of trypanosome species.

    PubMed

    Beauparlant, Marc A; Drouin, Guy

    2014-02-01

    Analyses of the 5S rRNA genes found in the spliced-leader (SL) gene repeat units of numerous trypanosome species suggest that such linkages were not inherited from a common ancestor, but were the result of independent 5S rRNA gene insertions. In trypanosomes, 5S rRNA genes are found either in the tandemly repeated units coding for SL genes or in independent tandemly repeated units. Given that trypanosome species where 5S rRNA genes are within the tandemly repeated units coding for SL genes are phylogenetically related, one might hypothesize that this arrangement is the result of an ancestral insertion of 5S rRNA genes into the tandemly repeated SL gene family of trypanosomes. Here, we use the types of 5S rRNA genes found associated with SL genes, the flanking regions of the inserted 5S rRNA genes and the position of these insertions to show that most of the 5S rRNA genes found within SL gene repeat units of trypanosome species were not acquired from a common ancestor but are the results of independent insertions. These multiple 5S rRNA genes insertion events in trypanosomes are likely the result of frequent founder events in different hosts and/or geographical locations in species having short generation times.

  3. Complete mitochondrial genome of the Tyto longimembris (Strigiformes: Tytonidae).

    PubMed

    Xu, Peng; Li, Yankuo; Miao, Lujun; Xie, Guangyong; Huang, Yan

    2016-07-01

    The complete mitochondrial genome of Tyto longimembris has been determined in this study. It is 18,466 bp in length and consists of 13 protein-coding genes, 22 transfer RNA (tRNA) genes, 2 ribosomal RNA (rRNA) genes and a non-coding control region (D-loop). The overall base composition of the heavy strand of the T. longimembris mitochondrial genome is A: 30.1%, T: 23.5%, C: 31.8% and G: 14.6%. The structure of control region should be characterized by a region containing tandem repeats as two definitely separated clusters of tandem repeats were found. This study provided an important data set for phylogenetic and taxonomic analyses of Tyto species.

  4. Whole-genome sequencing reveals a coding non-pathogenic variant tagging a non-coding pathogenic hexanucleotide repeat expansion in C9orf72 as cause of amyotrophic lateral sclerosis.

    PubMed

    Herdewyn, Sarah; Zhao, Hui; Moisse, Matthieu; Race, Valérie; Matthijs, Gert; Reumers, Joke; Kusters, Benno; Schelhaas, Helenius J; van den Berg, Leonard H; Goris, An; Robberecht, Wim; Lambrechts, Diether; Van Damme, Philip

    2012-06-01

    Motor neuron degeneration in amyotrophic lateral sclerosis (ALS) has a familial cause in 10% of patients. Despite significant advances in the genetics of the disease, many families remain unexplained. We performed whole-genome sequencing in five family members from a pedigree with autosomal-dominant classical ALS. A family-based elimination approach was used to identify novel coding variants segregating with the disease. This list of variants was effectively shortened by genotyping these variants in 2 additional unaffected family members and 1500 unrelated population-specific controls. A novel rare coding variant in SPAG8 on chromosome 9p13.3 segregated with the disease and was not observed in controls. Mutations in SPAG8 were not encountered in 34 other unexplained ALS pedigrees, including 1 with linkage to chromosome 9p13.2-23.3. The shared haplotype containing the SPAG8 variant in this small pedigree was 22.7 Mb and overlapped with the core 9p21 linkage locus for ALS and frontotemporal dementia. Based on differences in coverage depth of known variable tandem repeat regions between affected and non-affected family members, the shared haplotype was found to contain an expanded hexanucleotide (GGGGCC)(n) repeat in C9orf72 in the affected members. Our results demonstrate that rare coding variants identified by whole-genome sequencing can tag a shared haplotype containing a non-coding pathogenic mutation and that changes in coverage depth can be used to reveal tandem repeat expansions. It also confirms (GGGGCC)n repeat expansions in C9orf72 as a cause of familial ALS.

  5. Identification of presumed ancestral DNA sequences of phaseolin in Phaseolus vulgaris.

    PubMed Central

    Kami, J; Velásquez, V B; Debouck, D G; Gepts, P

    1995-01-01

    Common bean (Phaseolus vulgaris) consists of two major geographic gene pools, one distributed in Mexico, Central America, and Colombia and the other in the southern Andes (southern Peru, Bolivia, and Argentina). Amplification and sequencing of members of the multigene family coding for phaseolin, the major seed storage protein of the common bean, provide evidence for accumulation of tandem direct repeats in both introns and exons during evolution of the multigene family in this species. The presumed ancestral phaseolin sequences, without tandem repeats, were found in recently discovered but nearly extinct wild common bean populations of Ecuador and northern Peru that are intermediate between the two major gene pools of the species based on geographical and molecular arguments. Our results illustrate the usefulness of tandem direct repeats in establishing the polarity of DNA sequence divergence and therefore in proposing phylogenies. Images Fig. 1 Fig. 3 PMID:7862642

  6. A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

    PubMed Central

    2010-01-01

    Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the repeat may be disseminated by HGT and intra-genomic shuffling. Conclusions We describe novel features of PARCELs (Palindromic Amphipathic Repeat Coding ELements), a set of widely distributed repeat protein domains and coding sequences that were likely acquired through HGT by diverse unicellular microbes, further mobilized and diversified within genomes, and co-opted for expression in the membrane proteome of some taxa. Disseminated by multiple gene-centric vehicles, ORFs harboring these elements enhance accessory gene pools as part of the "mobilome" connecting genomes of various clades, in taxa sharing common niches. PMID:20626840

  7. A Dual Origin of the Xist Gene from a Protein-Coding Gene and a Set of Transposable Elements

    PubMed Central

    Elisaphenko, Eugeny A.; Kolesnikov, Nikolay N.; Shevchenko, Alexander I.; Rogozin, Igor B.; Nesterova, Tatyana B.; Brockdorff, Neil; Zakian, Suren M.

    2008-01-01

    X-chromosome inactivation, which occurs in female eutherian mammals is controlled by a complex X-linked locus termed the X-inactivation center (XIC). Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons including those with simple tandem repeats detectable in their structure have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA. PMID:18575625

  8. Novel variants of the 5S rRNA genes in Eruca sativa.

    PubMed

    Singh, K; Bhatia, S; Lakshmikumaran, M

    1994-02-01

    The 5S ribosomal RNA (rRNA) genes of Eruca sativa were cloned and characterized. They are organized into clusters of tandemly repeated units. Each repeat unit consists of a 119-bp coding region followed by a noncoding spacer region that separates it from the coding region of the next repeat unit. Our study reports novel gene variants of the 5S rRNA genes in plants. Two families of the 5S rDNA, the 0.5-kb size family and the 1-kb size family, coexist in the E. sativa genome. The 0.5-kb size family consists of the 5S rRNA genes (S4) that have coding regions similar to those of other reported plant 5S rDNA sequences, whereas the 1-kb size family consists of the 5S rRNA gene variants (S1) that exist as 1-kb BamHI tandem repeats. S1 is made up of two variant units (V1 and V2) of 5S rDNA where the BamHI site between the two units is mutated. Sequence heterogeneity among S4, V1, and V2 units exists throughout the sequence and is not limited to the noncoding spacer region only. The coding regions of V1 and V2 show approximately 20% dissimilarity to the coding regions of S4 and other reported plant 5S rDNA sequences. Such a large variation in the coding regions of the 5S rDNA units within the same plant species has been observed for the first time. Restriction site variation is observed between the two size classes of 5S rDNA in E. sativa.(ABSTRACT TRUNCATED AT 250 WORDS)

  9. Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts.

    PubMed

    Trofimova, Irina; Krasikova, Alla

    2016-12-01

    Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription.

  10. Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts

    PubMed Central

    Krasikova, Alla

    2016-01-01

    ABSTRACT Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription. PMID:27763817

  11. Variability of CAG tandem repeats in exon 1 of the androgen receptor gene is not related with dog intersexuality.

    PubMed

    Nowacka-Woszuk, J; Switonski, M

    2010-02-01

    Numerous mutations of the human androgen receptor (AR) gene cause an intersexual phenotype, called the androgen insensitivity syndrome. The intersexual phenotype is also quite often diagnosed in dogs. The aim of this study was to conduct a comparative analysis of the entire coding sequence (eight exons) of the AR gene in healthy and four intersex dogs, as well as in three other canids (the red fox, arctic fox and Chinese raccoon dog). The coding sequence of the studied species appeared to be conserved (similarity above 97%) and polymorphism was found in exon 1 only. Altogether, 2 SNPs were identified in healthy dogs, 14 in red foxes, 16 in arctic foxes and 6 were found in Chinese raccoon dogs, respectively. Moreover, a variable number of tandem repeats (CAG and CAA), encoding an array of glutamines, was also observed in this exon. The CAA codon numbers were invariable within species, but the CAG repeats were polymorphic. The highest number of the CAG and CAA repeats was found in dogs (from 40 to 42) and the observed variability was similar in intersex and healthy dogs. In the other canids the variability fell within the following ranges: 29-37 (red fox), 37-39 (arctic fox) and 29-32 (Chinese raccoon dog). In addition, a polymorphic microsatellite marker in intron 2 was found in the dog, red fox and Chinese raccoon dog. It was concluded that the polymorphism level of the AR gene in the dog was lower than in the other canids and none of the detected polymorphisms, including variability of the CAG tandem repeats, could be related with the intersexual phenotype of the studied dogs.

  12. Complete mitochondrial genome of the yellowtail clownfish Amphiprion clarkii (Pisces: Perciformes, Pomacentridae).

    PubMed

    Tao, Yong; Li, Jian-Long; Liu, Min; Hu, Xue-Yi

    2016-01-01

    In this study we determined the complete mitochondrial (mt) genome of the yellowtail clownfish Amphiprion clarkii using eight consensus primer pairs with a long PCR technique. The circular mtDNA molecule was 16,976 bp in size and the overall nucleotide composition of the H-strand was 29.15% A, 26.15% T, 15.67% G and 29.03% C, with an A + T bias. The complete mitogenome contained 13 protein-coding genes, 2 rRNAs, 22 tRNAs and 1 control region (D-loop), and the gene order was typical of vertebrate mitogenomes. We determined five complete continuity tandem repeat units and one imperfect tandem repeat, all located downstream in the control region.

  13. Identification and characterization of tandem repeats in exon III of dopamine receptor D4 (DRD4) genes from different mammalian species.

    PubMed

    Larsen, Svend Arild; Mogensen, Line; Dietz, Rune; Baagøe, Hans Jørgen; Andersen, Mogens; Werge, Thomas; Rasmussen, Henrik Berg

    2005-12-01

    In this study we have identified and characterized dopamine receptor D4 (DRD4) exon III tandem repeats in 33 public available nucleotide sequences from different mammalian species. We found that the tandem repeat in canids could be described in a novel and simple way, namely, as a structure composed of 15- and 12- bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, and donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp tandem repeat, while a tandem repeat consisting of 27-bp modules was identified in a sequence from European badger. Both these tandem repeats were composed of 9-bp basic units, which were closely related with the 9-bp repeat modules identified in the mink and ferret. Tandem repeats could not be identified in sequences from rodents. All tandem repeats possessed a high GC content with a strong bias for C. On phylogenetic analysis of the tandem repeats evolutionary related species were clustered into the same groups. The degree of conservation of the tandem repeats varied significantly between species. The deduced amino acid sequences of most of the tandem repeats exhibited a high propensity for disorder. This was also the case with an amino acid sequence of the human DRD4 exon III tandem repeat, which was included in the study for comparative purposes. We identified proline-containing motifs for SH3 and WW domain binding proteins, potential phosphorylation sites, PDZ domain binding motifs, and FHA domain binding motifs in the amino acid sequences of the tandem repeats. The numbers of potential functional sites varied pronouncedly between species. Our observations provide a platform for future studies of the architecture and evolution of the DRD4 exon III tandem repeat, and they suggest that differences in the structure of this tandem repeat contribute to specialization and generation of diversity in receptor function.

  14. Histone and ribosomal RNA repetitive gene clusters of the boll weevil are linked in a tandem array.

    PubMed

    Roehrdanz, R; Heilmann, L; Senechal, P; Sears, S; Evenson, P

    2010-08-01

    Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and the clusters are tandemly repeated. Ribosomal DNA contains a cluster of the rRNA sequences 18S, 5.8S and 28S. The rRNA genes are separated by the spacers ITS1, ITS2 and IGS. This cluster is also tandemly repeated. We found that the ribosomal RNA repeat unit of at least two species of Anthonomine weevils, Anthonomus grandis and Anthonomus texanus (Coleoptera: Curculionidae), is interspersed with a block containing the histone gene quintet. The histone genes are situated between the rRNA 18S and 28S genes in what is known as the intergenic spacer region (IGS). The complete reiterated Anthonomus grandis histone-ribosomal sequence is 16,248 bp.

  15. Identification of common, unique and polymorphic microsatellites among 73 cyanobacterial genomes.

    PubMed

    Kabra, Ritika; Kapil, Aditi; Attarwala, Kherunnisa; Rai, Piyush Kant; Shanker, Asheesh

    2016-04-01

    Microsatellites also known as Simple Sequence Repeats are short tandem repeats of 1-6 nucleotides. These repeats are found in coding as well as non-coding regions of both prokaryotic and eukaryotic genomes and play a significant role in the study of gene regulation, genetic mapping, DNA fingerprinting and evolutionary studies. The availability of 73 complete genome sequences of cyanobacteria enabled us to mine and statistically analyze microsatellites in these genomes. The cyanobacterial microsatellites identified through bioinformatics analysis were stored in a user-friendly database named CyanoSat, which is an efficient data representation and query system designed using ASP.net. The information in CyanoSat comprises of perfect, imperfect and compound microsatellites found in coding, non-coding and coding-non-coding regions. Moreover, it contains PCR primers with 200 nucleotides long flanking region. The mined cyanobacterial microsatellites can be freely accessed at www.compubio.in/CyanoSat/home.aspx. In addition to this 82 polymorphic, 13,866 unique and 2390 common microsatellites were also detected. These microsatellites will be useful in strain identification and genetic diversity studies of cyanobacteria.

  16. Complete mitochondrial genome of the Asian pencil halfbeak Hyporhamphus intermedius (Beloniformes, Hemirhamphidae).

    PubMed

    Song, Chao; Hu, Gengdong; Qiu, Liping; Fan, Limin; Meng, Shunlong; Chen, Jiazhang

    2016-11-01

    The complete mitochondrial genome of Hyporhamphus intermedius was determined to be 16,720 bp in length with (A + T) content of 56.3%, and it consists of 13 protein-coding genes, 22 tRNAs, two ribosomal RNAs, and a control region. The gene composition and the structural arrangement of the H. intermedius complete mtDNA were identical to most of the other vertebrates. Interestingly, two tandem repeat units were identified across tRNA-Pro and control region (2*41 bp), while in most of the fishes the tandem repeat units are located in the control region. The molecular data we presented here could play a useful role to study the evolutionary relationships and population genetics of Hemirhamphidae fish.

  17. Mitochondrial genome of the tomato clownfish Amphiprion frenatus (Pomacentridae, Amphiprioninae).

    PubMed

    Ye, Le; Hu, Jing; Wu, Kaichang; Wang, Yu; Li, Jianlong

    2016-01-01

    The complete mitochondrial (mt) genome of the tomato clownfish Amphiprion frenatus was obtained in this study. The circular mtDNA molecule was 16,774 bp in size and the overall nucleotide composition of the H-strand was 29.72% A, 25.81% T, 15.38% G and 29.09% C, with an A + T bias. The complete mitogenome encoded 13 protein-coding genes, 2 rRNAs, 22 tRNAs and a control region (D-loop), with the gene arrangement and translation direction basically identical to other typical vertebrate mitogenomes. The D-loop included termination associated sequence (TAS), central conserved domain (CCD) and conserved sequence block (CSB), and was composed of 6 complete continuity tandem repeat units and an imperfect tandem repeat unit.

  18. Exceptionally long 5' UTR short tandem repeats specifically linked to primates.

    PubMed

    Namdar-Aligoodarzi, P; Mohammadparast, S; Zaker-Kandjani, B; Talebi Kakroodi, S; Jafari Vesiehsari, M; Ohadi, M

    2015-09-10

    We have previously reported genome-scale short tandem repeats (STRs) in the core promoter interval (i.e. -120 to +1 to the transcription start site) of protein-coding genes that have evolved identically in primates vs. non-primates. Those STRs may function as evolutionary switch codes for primate speciation. In the current study, we used the Ensembl database to analyze the 5' untranslated region (5' UTR) between +1 and +60 of the transcription start site of the entire human protein-coding genes annotated in the GeneCards database, in order to identify "exceptionally long" STRs (≥5-repeats), which may be of selective/adaptive advantage. The importance of this critical interval is its function as core promoter, and its effect on transcription and translation. In order to minimize ascertainment bias, we analyzed the evolutionary status of the human 5' UTR STRs of ≥5-repeats in several species encompassing six major orders and superorders across mammals, including primates, rodents, Scandentia, Laurasiatheria, Afrotheria, and Xenarthra. We introduce primate-specific STRs, and STRs which have expanded from mouse to primates. Identical co-occurrence of the identified STRs of rare average frequency between 0.006 and 0.0001 in primates supports a role for those motifs in processes that diverged primates from other mammals, such as neuronal differentiation (e.g. APOD and FGF4), and craniofacial development (e.g. FILIP1L). A number of the identified STRs of ≥5-repeats may be human-specific (e.g. ZMYM3 and DAZAP1). Future work is warranted to examine the importance of the listed genes in primate/human evolution, development, and disease. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. A Trio of Human Molecular Genetics PCR Assays

    ERIC Educational Resources Information Center

    Reinking, Jeffrey L.; Waldo, Jennifer T.; Dinsmore, Jannett

    2013-01-01

    This laboratory exercise demonstrates three different analytical forms of the polymerase chain reaction (PCR) that allow students to genotype themselves at four different loci. Here, we present protocols to allow students to a) genotype a non-coding polymorphic Variable Number of Tandem Repeat (VNTR) locus on human chromosome 5 using conventional…

  20. The complete mitochondrial genome of the Giant Manta ray, Manta birostris.

    PubMed

    Hinojosa-Alvarez, Silvia; Díaz-Jaimes, Pindaro; Marcet-Houben, Marina; Gabaldón, Toni

    2015-01-01

    The complete mitochondrial genome of the giant manta ray (Manta birostris), consists of 18,075 bp with rich A + T and low G content. Gene organization and length is similar to other species of ray. It comprises of 13 protein-coding genes, 2 rRNAs genes, 23 tRNAs genes and 1 non-coding sequence, and the control region. We identified an AT tandem repeat region, similar to that reported in Mobula japanica.

  1. TRedD—A database for tandem repeats over the edit distance

    PubMed Central

    Sokol, Dina; Atagun, Firat

    2010-01-01

    A ‘tandem repeat’ in DNA is a sequence of two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats are common in the genomes of both eukaryotic and prokaryotic organisms. They are significant markers for human identity testing, disease diagnosis, sequence homology and population studies. In this article, we describe a new database, TRedD, which contains the tandem repeats found in the human genome. The database is publicly available online, and the software for locating the repeats is also freely available. The definition of tandem repeats used by TRedD is a new and innovative definition based upon the concept of ‘evolutive tandem repeats’. In addition, we have developed a tool, called TandemGraph, to graphically depict the repeats occurring in a sequence. This tool can be coupled with any repeat finding software, and it should greatly facilitate analysis of results. Database URL: http://tandem.sci.brooklyn.cuny.edu/ PMID:20624712

  2. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution.

    PubMed

    Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L

    2013-01-30

    Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.

  3. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

    PubMed Central

    2013-01-01

    Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705

  4. TRAP: automated classification, quantification and annotation of tandemly repeated sequences.

    PubMed

    Sobreira, Tiago José P; Durham, Alan M; Gruber, Arthur

    2006-02-01

    TRAP, the Tandem Repeats Analysis Program, is a Perl program that provides a unified set of analyses for the selection, classification, quantification and automated annotation of tandemly repeated sequences. TRAP uses the results of the Tandem Repeats Finder program to perform a global analysis of the satellite content of DNA sequences, permitting researchers to easily assess the tandem repeat content for both individual sequences and whole genomes. The results can be generated in convenient formats such as HTML and comma-separated values. TRAP can also be used to automatically generate annotation data in the format of feature table and GFF files.

  5. Complete sequence and gene organization of the mitochondrial genome of Asio flammeus (Strigiformes, strigidae).

    PubMed

    Zhang, Yanan; Song, Tao; Pan, Tao; Sun, Xiaonan; Sun, Zhonglou; Qian, Lifu; Zhang, Baowei

    2016-07-01

    The complete sequence of the mitochondrial genome was determined for Asio flammeus, which is distributed widely in geography. The length of the complete mitochondrial genome was 18,966 bp, containing 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes (PCGs), and 1 non-coding region (D-loop). All the genes were distributed on the H-strand, except for the ND6 subunit gene and eight tRNA genes which were encoded on the L-strand. The D-loop of A. flammeus contained many tandem repeats of varying lengths and repeat numbers. The molecular-based phylogeny showed that our species acted as the sister group to A. capensis and the supported Asio was the monophyletic group.

  6. Isolation of human simple repeat loci by hybridization selection.

    PubMed

    Armour, J A; Neumann, R; Gobert, S; Jeffreys, A J

    1994-04-01

    We have isolated short tandem repeat arrays from the human genome, using a rapid method involving filter hybridization to enrich for tri- or tetranucleotide tandem repeats. About 30% of clones from the enriched library cross-hybridize with probes containing trimeric or tetrameric tandem arrays, facilitating the rapid isolation of large numbers of clones. In an initial analysis of 54 clones, 46 different tandem arrays were identified. Analysis of these tandem repeat loci by PCR showed that 24 were polymorphic in length; substantially higher levels of polymorphism were displayed by the tetrameric repeat loci isolated than by the trimeric repeats. Primary mapping of these loci by linkage analysis showed that they derive from 17 chromosomes, including the X chromosome. We anticipate the use of this strategy for the efficient isolation of tandem repeats from other sources of genomic DNA, including DNA from flow-sorted chromosomes, and from other species.

  7. Short Tandem Repeat DNA Internet Database

    National Institute of Standards and Technology Data Gateway

    SRD 130 Short Tandem Repeat DNA Internet Database (Web, free access)   Short Tandem Repeat DNA Internet Database is intended to benefit research and application of short tandem repeat DNA markers for human identity testing. Facts and sequence information on each STR system, population data, commonly used multiplex STR systems, PCR primers and conditions, and a review of various technologies for analysis of STR alleles have been included.

  8. Complete mitochondrial genome of the larch hawk moth, Sphinx morio (Lepidoptera: Sphingidae).

    PubMed

    Kim, Min Jee; Choi, Sei-Woong; Kim, Iksoo

    2013-12-01

    The larch hawk moth, Sphinx morio, belongs to the lepidopteran family Sphingidae that has long been studied as a family of model insects in a diverse field. In this study, we describe the complete mitochondrial genome (mitogenome) sequences of the species in terms of general genomic features and characteristic short repetitive sequences found in the A + T-rich region. The 15,299-bp-long genome consisted of a typical set of genes (13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes) and one major non-coding A + T-rich region, with the typical arrangement found in Lepidoptera. The 316-bp-long A + T-rich region located between srRNA and tRNA(Met) harbored the conserved sequence blocks that are typically found in lepidopteran insects. Additionally, the A + T-rich region of S. morio contained three characteristic repeat sequences that are rarely found in Lepidoptera: two identical 12-bp repeat, three identical 5-bp-long tandem repeat, and six nearly identical 5-6 bp long repeat sequences.

  9. Evolutionary Conservation of a Coding Function for D4Z4, the Tandem DNA Repeat Mutated in Facioscapulohumeral Muscular Dystrophy

    PubMed Central

    Clapp, Jannine ; Mitchell, Laura M. ; Bolland, Daniel J. ; Fantes, Judy ; Corcoran, Anne E. ; Scotting, Paul J. ; Armour, John A. L. ; Hewitt, Jane E. 

    2007-01-01

    Facioscapulohumeral muscular dystrophy (FSHD) is caused by deletions within the polymorphic DNA tandem array D4Z4. Each D4Z4 repeat unit has an open reading frame (ORF), termed “DUX4,” containing two homeobox sequences. Because there has been no evidence of a transcript from the array, these deletions are thought to cause FSHD by a position effect on other genes. Here, we identify D4Z4 homologues in the genomes of rodents, Afrotheria (superorder of elephants and related species), and other species and show that the DUX4 ORF is conserved. Phylogenetic analysis suggests that primate and Afrotherian D4Z4 arrays are orthologous and originated from a retrotransposed copy of an intron-containing DUX gene, DUXC. Reverse-transcriptase polymerase chain reaction and RNA fluorescence and tissue in situ hybridization data indicate transcription of the mouse array. Together with the conservation of the DUX4 ORF for >100 million years, this strongly supports a coding function for D4Z4 and necessitates re-examination of current models of the FSHD disease mechanism. PMID:17668377

  10. [Polymorphic loci and polymorphism analysis of short tandem repeats within XNP gene].

    PubMed

    Liu, Qi-Ji; Gong, Yao-Qin; Guo, Chen-Hong; Chen, Bing-Xi; Li, Jiang-Xia; Guo, Yi-Shou

    2002-01-01

    To select polymorphic short tandem repeat markers within X-linked nuclear protein (XNP) gene, genomic clones which contain XNP gene were recognized by homologous analysis with XNP cDNA. By comparing the cDNA with genomic DNA, non-exonic sequences were identified, and short tandem repeats were selected from non-exonic sequences by using BCM search Launcher. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five short tandem repeats were identified from XNP gene, two of which were polymorphic. Four and 11 alleles were observed in Chinese population for XNPSTR1 and XNPSTR4, respectively. Heterozygosities were 47% for XNPSTR1 and 70% for XNPSTR4. XNPSTR1 and XNPSTR4 localized within 3' end and intron 10, respectively. Two polymorphic short tandem repeats have been identified within XNP gene and will be useful for linkage analysis and gene diagnosis of XNP gene.

  11. Molecular characterization and distribution of a 145-bp tandem repeat family in the genus Populus.

    PubMed

    Rajagopal, J; Das, S; Khurana, D K; Srivastava, P S; Lakshmikumaran, M

    1999-10-01

    This report aims to describe the identification and molecular characterization of a 145-bp tandem repeat family that accounts for nearly 1.5% of the Populus genome. Three members of this repeat family were cloned and sequenced from Populus deltoides and P. ciliata. The dimers of the repeat were sequenced in order to confirm the head-to-tail organization of the repeat. Hybridization-based analysis using the 145-bp tandem repeat as a probe on genomic DNA gave rise to ladder patterns which were identified to be a result of methylation and (or) sequence heterogeneity. Analysis of the methylation pattern of the repeat family using methylation-sensitive isoschizomers revealed variable methylation of the C residues and lack of methylation of the A residues. Sequence comparisons between the monomers revealed a high degree of sequence divergence that ranged between 6% and 11% in P. deltoides and between 4.2% and 8.3% in P. ciliata. This indicated the presence of sub-families within the 145-bp tandem family of repeats. Divergence was mainly due to the accumulation of point mutations and was concentrated in the central region of the repeat. The 145-bp tandem repeat family did not show significant homology to known tandem repeats from plants. A short stretch of 36 bp was found to show homology of 66.7% to a centromeric repeat from Chironomus plumosus. Dot-blot analysis and Southern hybridization data revealed the presence of the repeat family in 13 of the 14 Populus species examined. The absence of the 145-bp repeat from P. euphratica suggested that this species is relatively distant from other members of the genus, which correlates with taxonomic classifications. The widespread occurrence of the tandem family in the genus indicated that this family may be of ancient origin.

  12. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

    USDA-ARS?s Scientific Manuscript database

    Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres comprise of megabase-scale arrays of tandem repeats. The true prevalence of centromere tandem repeats, and whether they exhibit conserved seque...

  13. Comparative Mitogenomics of the Assassin Bug Genus Peirates (Hemiptera: Reduviidae: Peiratinae) Reveal Conserved Mitochondrial Genome Organization of P. atromaculatus, P. fulvescens and P. turpis

    PubMed Central

    Zhao, Guangyu; Li, Hu; Zhao, Ping; Cai, Wanzhi

    2015-01-01

    In this study, we sequenced four new mitochondrial genomes and presented comparative mitogenomic analyses of five species in the genus Peirates (Hemiptera: Reduviidae). Mitochondrial genomes of these five assassin bugs had a typical set of 37 genes and retained the ancestral gene arrangement of insects. The A+T content, AT- and GC-skews were similar to the common base composition biases of insect mtDNA. Genomic size ranges from 15,702 bp to 16,314 bp and most of the size variation was due to length and copy number of the repeat unit in the putative control region. All of the control region sequences included large tandem repeats present in two or more copies. Our result revealed similarity in mitochondrial genomes of P. atromaculatus, P. fulvescens and P. turpis, as well as the highly conserved genomic-level characteristics of these three species, e.g., the same start and stop codons of protein-coding genes, conserved secondary structure of tRNAs, identical location and length of non-coding and overlapping regions, and conservation of structural elements and tandem repeat unit in control region. Phylogenetic analyses also supported a close relationship between P. atromaculatus, P. fulvescens and P. turpis, which might be recently diverged species. The present study indicates that mitochondrial genome has important implications on phylogenetics, population genetics and speciation in the genus Peirates. PMID:25689825

  14. Origin of the polymorphism of the involucrin gene in Asians.

    PubMed Central

    Djian, P; Delhomme, B; Green, H

    1995-01-01

    The involucrin gene, encoding a protein of the terminally differentiated keratinocyte, is polymorphic in the human. There is polymorphism of marker nucleotides a two positions in the coding region, and there are over eight polymorphic forms based on the number and kind of 10-codon tandem repeats in that part of the coding region most recently added in the human lineage. The involucrin alleles of Caucasians and Africans differ in both nucleotides and repeat patterns. We show that the involucrin alleles of East Asians (Chinese and Japanese) can be divided into two populations according to whether they possess the two marker nucleotides typical of Africans or Caucasians. The Asian population bearing Caucasian-type marker nucleotides has repeat patterns similar to those of Caucasians, whereas Asians bearing African-type marker nucleotides have repeat patterns that resemble those of Africans more than those of Caucasians. The existence of two populations of East Asian involucrin alleles gives support for the existence of a Eurasian stem lineage from which Caucasians and a part of the Asian population originated. PMID:7762559

  15. Molecular characterization and physical localization of highly repetitive DNA sequences from Brazilian Alstroemeria species.

    PubMed

    Kuipers, A G J; Kamstra, S A; de Jeu, M J; Visser, R G F

    2002-01-01

    Highly repetitive DNA sequences were isolated from genomic DNA libraries of Alstroemeria psittacina and A. inodora. Among the repetitive sequences that were isolated, tandem repeats as well as dispersed repeats could be discerned. The tandem repeats belonged to a family of interlinked Sau3A subfragments with sizes varying from 68-127 bp, and constituted a larger HinfI repeat of approximately 400 bp. Southern hybridization showed a similar molecular organization of the tandem repeats in each of the Brazilian Alstroemeria species tested. None of the repeats hybridized with DNA from Chilean Alstroemeria species, which indicates that they are specific for the Brazilian species. In-situ localization studies revealed the tandem repeats to be localized in clusters on the chromosomes of A. inodora and A. psittacina: distal hybridization sites were found on chromosome arms 2PS, 6PL, 7PS, 7PL and 8PL, interstitial sites on chromosome arms 2PL, 3PL, 4PL and 5PL. The applicability of the tandem repeats for cytogenetic analysis of interspecific hybrids and their role in heterochromatin organization are discussed.

  16. Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain

    PubMed Central

    de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

    2014-01-01

    The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163

  17. The Evolution of Dark Matter in the Mitogenome of Seed Beetles

    PubMed Central

    Sayadi, Ahmed; Immonen, Elina; Tellgren-Roth, Christian

    2017-01-01

    Abstract Animal mitogenomes are generally thought of as being economic and optimized for rapid replication and transcription. We use long-read sequencing technology to assemble the remarkable mitogenomes of four species of seed beetles. These are the largest circular mitogenomes ever assembled in insects, ranging from 24,496 to 26,613 bp in total length, and are exceptional in that some 40% consists of non-coding DNA. The size expansion is due to two very long intergenic spacers (LIGSs), rich in tandem repeats. The two LIGSs are present in all species but vary greatly in length (114–10,408 bp), show very low sequence similarity, divergent tandem repeat motifs, a very high AT content and concerted length evolution. The LIGSs have been retained for at least some 45 my but must have undergone repeated reductions and expansions, despite strong purifying selection on protein coding mtDNA genes. The LIGSs are located in two intergenic sites where a few recent studies of insects have also reported shorter LIGSs (>200 bp). These sites may represent spaces that tolerate neutral repeat array expansions or, alternatively, the LIGSs may function to allow a more economic translational machinery. Mitochondrial respiration in adult seed beetles is based almost exclusively on fatty acids, which reduces the need for building complex I of the oxidative phosphorylation pathway (NADH dehydrogenase). One possibility is thus that the LIGSs may allow depressed transcription of NAD genes. RNA sequencing showed that LIGSs are partly transcribed and transcriptional profiling suggested that all seven mtDNA NAD genes indeed show low levels of transcription and co-regulation of transcription across sexes and tissues. PMID:29048527

  18. SYMTRAN - A Time-dependent Symmetric Tandem Mirror Transport Code

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hua, D; Fowler, T

    2004-06-15

    A time-dependent version of the steady-state radial transport model in symmetric tandem mirrors in Ref. [1] has been coded up and first tests performed. Our code, named SYMTRAN, is an adaptation of the earlier SPHERE code for spheromaks, now modified for tandem mirror physics. Motivated by Post's new concept of kinetic stabilization of symmetric mirrors, it is an extension of the earlier TAMRAC rate-equation code omitting radial transport [2], which successfully accounted for experimental results in TMX. The SYMTRAN code differs from the earlier tandem mirror radial transport code TMT in that our code is focused on axisymmetric tandem mirrorsmore » and classical diffusion, whereas TMT emphasized non-ambipolar transport in TMX and MFTF-B due to yin-yang plugs and non-symmetric transitions between the plugs and axisymmetric center cell. Both codes exhibit interesting but different non-linear behavior.« less

  19. Variable-Number Tandem Repeats That Are Useful in Genotyping Isolates of Salmonella enterica subsp. enterica Serovars Typhimurium and Newport▿

    PubMed Central

    Witonski, D. ; Stefanova, R.; Ranganathan, A.; Schutze, G. E.; Eisenach, K. D.; Cave, M. D.

    2006-01-01

    The genome of Salmonella enterica subsp. enterica serovar Typhimurium strain LT2 was analyzed for direct repeats, and 54 sequences containing variable-number tandem repeat loci were identified. Ten primer pairs that anneal upstream and downstream of each selected locus were designed and used to amplify PCR targets in isolates of S. enterica serovars Typhimurium and Newport. Four of the 10 loci did not show polymorphism in the length of products. Six loci were selected for analysis. Isolates of S. enterica serovars Typhimurium and Newport that were related to specific outbreaks and showed identical pulsed-field gel electrophoresis patterns were indistinguishable by the length of the six variable-number tandem repeats. Isolates that differed in their pulsed-field gel electrophoresis patterns showed polymorphism in variable-number tandem repeat profiles. Length of the products was confirmed by DNA sequence analysis. Only 2 of the 10 loci contained exact integers of the direct repeat. Eight loci contained partial copies. The partial copies were maintained at the ends of the variable-number tandem repeat loci in all isolates. In spite of having partial copies that were maintained in all isolates, the number of direct repeats at a locus was polymorphic. Six variable-number tandem repeat loci were useful in distinguishing isolates of S. enterica serovars Typhimurium and Newport that had different pulsed-field gel electrophoresis patterns and in identifying outbreak-associated cases that shared a common pulsed-field gel pattern. PMID:16943354

  20. ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae.

    PubMed

    Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta

    2012-11-07

    Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40 amino acid tandem repeat proteins and also from known cell wall proteins with repeat sequences. Several putative roles in plant physiology can be inferred from the characteristics found.

  1. ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae

    PubMed Central

    2012-01-01

    Background Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. Results ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. Conclusions We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40 amino acid tandem repeat proteins and also from known cell wall proteins with repeat sequences. Several putative roles in plant physiology can be inferred from the characteristics found. PMID:23134664

  2. Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain.

    PubMed

    de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

    2014-06-01

    The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.

    PubMed Central

    Benslimane, A A; Dron, M; Hartmann, C; Rode, A

    1986-01-01

    Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammalians. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. Images PMID:3774553

  4. Complete Chloroplast Genome Sequence of Tartary Buckwheat (Fagopyrum tataricum) and Comparative Analysis with Common Buckwheat (F. esculentum)

    PubMed Central

    Cho, Kwang-Soo; Yun, Bong-Kyoung; Yoon, Young-Ho; Hong, Su-Young; Mekapogu, Manjulatha; Kim, Kyung-Hee; Yang, Tae-Jin

    2015-01-01

    We report the chloroplast (cp) genome sequence of tartary buckwheat (Fagopyrum tataricum) obtained by next-generation sequencing technology and compared this with the previously reported common buckwheat (F. esculentum ssp. ancestrale) cp genome. The cp genome of F. tataricum has a total sequence length of 159,272 bp, which is 327 bp shorter than the common buckwheat cp genome. The cp gene content, order, and orientation are similar to those of common buckwheat, but with some structural variation at tandem and palindromic repeat frequencies and junction areas. A total of seven InDels (around 100 bp) were found within the intergenic sequences and the ycf1 gene. Copy number variation of the 21-bp tandem repeat varied in F. tataricum (four repeats) and F. esculentum (one repeat), and the InDel of the ycf1 gene was 63 bp long. Nucleotide and amino acid have highly conserved coding sequence with about 98% homology and four genes—rpoC2, ycf3, accD, and clpP—have high synonymous (Ks) value. PCR based InDel markers were applied to diverse genetic resources of F. tataricum and F. esculentum, and the amplicon size was identical to that expected in silico. Therefore, these InDel markers are informative biomarkers to practically distinguish raw or processed buckwheat products derived from F. tataricum and F. esculentum. PMID:25966355

  5. Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights

    PubMed Central

    Kumar, Amitha Sampath; Sowpati, Divya Tej; Mishra, Rakesh K.

    2016-01-01

    Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility, stability and act as linker elements between domains. Comparison of SAAR containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. PMID:27893794

  6. GENETIC DIVERSITY OF TYPHA LATIFOLIA (TYPHACEAE) AND THE IMPACT OF POLLUTANTS EXAMINED WITH TANDEM-REPETITIVE DNA PROBES

    EPA Science Inventory

    Genetic diversity at variable-number-tandem-repeat (VNTR) loci was examined in the common cattail, Typha latifolia (Typhaceae), using three synthetic DNA probes composed of tandemly repeated "core" sequences (GACA, GATA, and GCAC). The principal objectives of this investigation w...

  7. An examination of the origin and evolution of additional tandem repeats in the mitochondrial DNA control region of Japanese sika deer (Cervus Nippon).

    PubMed

    Ba, Hengxing; Wu, Lang; Liu, Zongyue; Li, Chunyi

    2016-01-01

    Tandem repeat units are only detected in the left domain of the mitochondrial DNA control region in sika deer. Previous studies showed that Japanese sika deer have more tandem repeat units than its cousins from the Asian continent and Taiwan, which often have only three repeat units. To determine the origin and evolution of these additional repeat units in Japanese sika deer, we obtained the sequence of repeat units from an expanded dataset of the control region from all sika deer lineages. The functional constraint is inferred to act on the first repeat unit because this repeat has the least sequence divergence in comparison to the other units. Based on slipped-strand mispairing mechanisms, the illegitimate elongation model could account for the addition or deletion of these additional repeat units in the Japanese sika deer population. We also report that these additional repeat units could be occurring in the internal positions of tandem repeat regions, possibly via coupling with a homogenization mechanism within and among these lineages. Moreover, the increased number of repeat units in the Japanese sika deer population could reflect a balance between mutation and selection, as well as genetic drift.

  8. A novel species-specific tandem repeat DNA family from Sinapis arvensis: detection of telomere-like sequences.

    PubMed

    Kapila, R; Das, S; Srivastava, P S; Lakshmikumaran, M

    1996-08-01

    DNA sequences representing a tandemly repeated DNA family of the Sinapis arvensis genome were cloned and characterized. The 700-bp tandem repeat family is represented by two clones, pSA35 and pSA52, which are 697 and 709 bp in length, respectively. Dot matrix analysis of the sequences indicates the presence of repeated elements within each monomeric unit. Sequence analysis of the repetitive region of clones pSA35 and pSA52 shows that there are several copies of a 7-bp repeat element organized in tandem. The consensus sequence of this repeat element is 5'-TTTAGGG-3'. These elements are highly mutated and the difference in length between the two clones is due to different copy numbers of these elements. The repetitive region of clone pSA35 has 26 copies of the element TTTAGGG, whereas clone pSA52 has 28 copies. The repetitive region in both clones is flanked on either side by inverted repeats that may be footprints of a transposition event. Sequence comparison indicates that the element TTTAGGG is identical to telomeric repeats present in Arabidopsis, maize, tomato, and other plants. However, Bal31 digestion kinetics indicates non-telomeric localization of the 700-bp tandem repeats. The clones represent a novel repeat family as (i) they contain telomere-like motifs as subrepeats within each unit; and (ii) they do not hybridize to related crucifers and are species-specific in nature.

  9. 5meCpG epigenetic marks neighboring a primate-conserved core promoter short tandem repeat indicate X-chromosome inactivation.

    PubMed

    Machado, Filipe Brum; Machado, Fabricio Brum; Faria, Milena Amendro; Lovatel, Viviane Lamim; Alves da Silva, Antonio Francisco; Radic, Claudia Pamela; De Brasi, Carlos Daniel; Rios, Álvaro Fabricio Lopes; de Sousa Lopes, Susana Marina Chuva; da Silveira, Leonardo Serafim; Ruiz-Miranda, Carlos Ramon; Ramos, Ester Silveira; Medina-Acosta, Enrique

    2014-01-01

    X-chromosome inactivation (XCI) is the epigenetic transcriptional silencing of an X-chromosome during the early stages of embryonic development in female eutherian mammals. XCI assures monoallelic expression in each cell and compensation for dosage-sensitive X-linked genes between females (XX) and males (XY). DNA methylation at the carbon-5 position of the cytosine pyrimidine ring in the context of a CpG dinucleotide sequence (5meCpG) in promoter regions is a key epigenetic marker for transcriptional gene silencing. Using computational analysis, we revealed an extragenic tandem GAAA repeat 230-bp from the landmark CpG island of the human X-linked retinitis pigmentosa 2 RP2 promoter whose 5meCpG status correlates with XCI. We used this RP2 onshore tandem GAAA repeat to develop an allele-specific 5meCpG-based PCR assay that is highly concordant with the human androgen receptor (AR) exonic tandem CAG repeat-based standard HUMARA assay in discriminating active (Xa) from inactive (Xi) X-chromosomes. The RP2 onshore tandem GAAA repeat contains neutral features that are lacking in the AR disease-linked tandem CAG repeat, is highly polymorphic (heterozygosity rates approximately 0.8) and shows minimal variation in the Xa/Xi ratio. The combined informativeness of RP2/AR is approximately 0.97, and this assay excels at determining the 5meCpG status of alleles at the Xp (RP2) and Xq (AR) chromosome arms in a single reaction. These findings are relevant and directly translatable to nonhuman primate models of XCI in which the AR CAG-repeat is monomorphic. We conducted the RP2 onshore tandem GAAA repeat assay in the naturally occurring chimeric New World monkey marmoset (Callitrichidae) and found it to be informative. The RP2 onshore tandem GAAA repeat will facilitate studies on the variable phenotypic expression of dominant and recessive X-linked diseases, epigenetic changes in twins, the physiology of aging hematopoiesis, the pathogenesis of age-related hematopoietic malignancies and the clonality of cancers in human and nonhuman primates.

  10. 5meCpG Epigenetic Marks Neighboring a Primate-Conserved Core Promoter Short Tandem Repeat Indicate X-Chromosome Inactivation

    PubMed Central

    Machado, Filipe Brum; Machado, Fabricio Brum; Faria, Milena Amendro; Lovatel, Viviane Lamim; Alves da Silva, Antonio Francisco; Radic, Claudia Pamela; De Brasi, Carlos Daniel; Rios, Álvaro Fabricio Lopes; de Sousa Lopes, Susana Marina Chuva; da Silveira, Leonardo Serafim; Ruiz-Miranda, Carlos Ramon; Ramos, Ester Silveira; Medina-Acosta, Enrique

    2014-01-01

    X-chromosome inactivation (XCI) is the epigenetic transcriptional silencing of an X-chromosome during the early stages of embryonic development in female eutherian mammals. XCI assures monoallelic expression in each cell and compensation for dosage-sensitive X-linked genes between females (XX) and males (XY). DNA methylation at the carbon-5 position of the cytosine pyrimidine ring in the context of a CpG dinucleotide sequence (5meCpG) in promoter regions is a key epigenetic marker for transcriptional gene silencing. Using computational analysis, we revealed an extragenic tandem GAAA repeat 230-bp from the landmark CpG island of the human X-linked retinitis pigmentosa 2 RP2 promoter whose 5meCpG status correlates with XCI. We used this RP2 onshore tandem GAAA repeat to develop an allele-specific 5meCpG-based PCR assay that is highly concordant with the human androgen receptor (AR) exonic tandem CAG repeat-based standard HUMARA assay in discriminating active (Xa) from inactive (Xi) X-chromosomes. The RP2 onshore tandem GAAA repeat contains neutral features that are lacking in the AR disease-linked tandem CAG repeat, is highly polymorphic (heterozygosity rates approximately 0.8) and shows minimal variation in the Xa/Xi ratio. The combined informativeness of RP2/AR is approximately 0.97, and this assay excels at determining the 5meCpG status of alleles at the Xp (RP2) and Xq (AR) chromosome arms in a single reaction. These findings are relevant and directly translatable to nonhuman primate models of XCI in which the AR CAG-repeat is monomorphic. We conducted the RP2 onshore tandem GAAA repeat assay in the naturally occurring chimeric New World monkey marmoset (Callitrichidae) and found it to be informative. The RP2 onshore tandem GAAA repeat will facilitate studies on the variable phenotypic expression of dominant and recessive X-linked diseases, epigenetic changes in twins, the physiology of aging hematopoiesis, the pathogenesis of age-related hematopoietic malignancies and the clonality of cancers in human and nonhuman primates. PMID:25078280

  11. MicroRNAs in CAG trinucleotide repeat expansion disorders: an integrated review of the literature.

    PubMed

    Dumitrescu, Laura; Popescu, Bogdan O

    2015-01-01

    MicroRNAs are small RNAs involved in gene silencing. They play important roles in transcriptional regulation and are selectively and abundantly expressed in the central nervous system. A considerable amount of the human genome is comprised of tandem repeating nucleotide streams. Several diseases are caused by above-threshold expansion of certain trinucleotide repeats occurring in a protein-coding or non-coding region. Though monogenic, CAG trinucleotide repeat expansion disorders have a complex pathogenesis, various combinations of multiple coexisting pathways resulting in one common final consequence: selective neurodegeneration. Mutant protein and mutant transcript gain of toxic function are considered to be the core pathogenic mechanisms. The profile of microRNAs in CAG trinucleotide repeat disorders is scarcely described, however microRNA dysregulation has been identified in these diseases and microRNA-related intereference with gene expression is considered to be involved in their pathogenesis. Better understanding of microRNAs functions and means of manipulation promises to offer further insights into the pathogenic pathways of CAG repeat expansion disorders, to point out new potential targets for drug intervention and to provide some of the much needed etiopathogenic therapeutic agents. A number of disease-modifying microRNA silencing strategies are under development, but several implementation impediments still have to be resolved. CAG targeting seems feasible and efficient in animal models and is an appealing approach for clinical practice. Preliminary human trials are just beginning.

  12. Ca2+-stabilized adhesin helps an Antarctic bacterium reach out and bind ice.

    PubMed

    Vance, Tyler D R; Olijve, Luuk L C; Campbell, Robert L; Voets, Ilja K; Davies, Peter L; Guo, Shuaiqi

    2014-07-04

    The large size of a 1.5-MDa ice-binding adhesin [MpAFP (Marinomonas primoryensis antifreeze protein)] from an Antarctic Gram-negative bacterium, M. primoryensis, is mainly due to its highly repetitive RII (Region II). MpAFP_RII contains roughly 120 tandem copies of an identical 104-residue repeat. We have previously determined that a single RII repeat folds as a Ca2+-dependent immunoglobulin-like domain. Here, we solved the crystal structure of RII tetra-tandemer (four tandem RII repeats) to a resolution of 1.8 Å. The RII tetra-tandemer reveals an extended (~190-Å × ~25-Å), rod-like structure with four RII-repeats aligned in series with each other. The inter-repeat regions of the RII tetra-tandemer are strengthened by Ca2+ bound to acidic residues. SAXS (small-angle X-ray scattering) profiles indicate the RII tetra-tandemer is significantly rigidified upon Ca2+ binding, and that the protein's solution structure is in excellent agreement with its crystal structure. We hypothesize that >600 Ca2+ help rigidify the chain of ~120 104-residue repeats to form a ~0.6 μm rod-like structure in order to project the ice-binding domain of MpAFP away from the bacterial cell surface. The proposed extender role of RII can help the strictly aerobic, motile bacterium bind ice in the upper reaches of the Antarctic lake where oxygen and nutrients are most abundant. Ca2+-induced rigidity of tandem Ig-like repeats in large adhesins might be a general mechanism used by bacteria to bind to their substrates and help colonize specific niches.

  13. Rational design of alpha-helical tandem repeat proteins with closed architectures

    PubMed Central

    Doyle, Lindsey; Hallinan, Jazmine; Bolduc, Jill; Parmeggiani, Fabio; Baker, David; Stoddard, Barry L.; Bradley, Philip

    2015-01-01

    Tandem repeat proteins, which are formed by repetition of modular units of protein sequence and structure, play important biological roles as macromolecular binding and scaffolding domains, enzymes, and building blocks for the assembly of fibrous materials1,2. The modular nature of repeat proteins enables the rapid construction and diversification of extended binding surfaces by duplication and recombination of simple building blocks3,4. The overall architecture of tandem repeat protein structures – which is dictated by the internal geometry and local packing of the repeat building blocks – is highly diverse, ranging from extended, super-helical folds that bind peptide, DNA, and RNA partners5–9, to closed and compact conformations with internal cavities suitable for small molecule binding and catalysis10. Here we report the development and validation of computational methods for de novo design of tandem repeat protein architectures driven purely by geometric criteria defining the inter-repeat geometry, without reference to the sequences and structures of existing repeat protein families. We have applied these methods to design a series of closed alpha-solenoid11 repeat structures (alpha-toroids) in which the inter-repeat packing geometry is constrained so as to juxtapose the N- and C-termini; several of these designed structures have been validated by X-ray crystallography. Unlike previous approaches to tandem repeat protein engineering12–20, our design procedure does not rely on template sequence or structural information taken from natural repeat proteins and hence can produce structures unlike those seen in nature. As an example, we have successfully designed and validated closed alpha-solenoid repeats with a left-handed helical architecture that – to our knowledge – is not yet present in the protein structure database21. PMID:26675735

  14. Typing Clostridium difficile strains based on tandem repeat sequences

    PubMed Central

    2009-01-01

    Background Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124

  15. GENETIC VARIATION IN RED RASPBERRIES (RUBUS IDAEUS L.; ROSACEAE) FROM SITES DIFFERING IN ORGANIC POLLUTANTS COMPARED WITH SYNTHETIC TANDEM REPEAT DNA PROBES

    EPA Science Inventory

    Two synthetic tandem repetitive DNA probes were used to compare genetic variation at variable-number-tandem-repeat (VNTR) loci among Rubus idaeus L. var. strigosus (Michx.) Maxim. (Rosaceae) individuals sampled at eight sites contaminated by pollutants (N = 39) and eight adjacent...

  16. Comparative and functional characterization of intragenic tandem repeats in 10 Aspergillus genomes.

    PubMed

    Gibbons, John G; Rokas, Antonis

    2009-03-01

    Intragenic tandem repeats (ITRs) are consecutive repeats of three or more nucleotides found in coding regions. ITRs are the underlying cause of several human genetic diseases and have been associated with phenotypic variation, including pathogenesis, in several clades of the tree of life. We have examined the evolution and functional role of ITRs in 10 genomes spanning the fungal genus Aspergillus, a clade of relevance to medicine, agriculture, and industry. We identified several hundred ITRs in each of the species examined. ITR content varied extensively between species, with an average 79% of ITRs unique to a given species. For the fraction of conserved ITR regions, sequence comparisons within species and between close relatives revealed that they were highly variable. ITR-containing proteins were evolutionarily less conserved, compositionally distinct, and overrepresented for domains associated with cell-surface localization and function relative to the rest of the proteome. Furthermore, ITRs were preferentially found in proteins involved in transcription, cellular communication, and cell-type differentiation but were underrepresented in proteins involved in metabolism and energy. Importantly, although ITRs were evolutionarily labile, their functional associations appeared. To be remarkably conserved across eukaryotes. Fungal ITRs likely participate in a variety of developmental processes and cell-surface-associated functions, suggesting that their contribution to fungal lifestyle and evolution may be more general than previously assumed.

  17. Variable Number Of Tandem Repeats (VNTR) and its application in bacterial epidemiology.

    PubMed

    Ramazanzadeh, Rashid; McNerney, Ruth

    2007-08-15

    Molecular epidemiology is the using of molecular techniques to study bacterial distribution in human populations. Recently molecular epidemiologist benefit from several techniques such as Variable Number Tandem Repeat (VNTR) typing method to typing bacterial strains. Variable Number Tandem Repeat (VNTR) typing is a tool for genotyping and provides data in a simple and numeric format based on the number of repetitive sequences. VNTR for first time identified in M. tuberculosis as Mycobacterial Interspersed Repeat Units (MIRUs). General terms of VNTR have now been reported in Bacillus anthracis, Legionella pneumophila, Pseudomonas aeruginosa, Salmonella enterica and Escherichia coli O157.

  18. Biased distribution of IS629 among strains in different lineages of enterohemorrhagic Escherichia coli serovar O157.

    PubMed

    Yokoyama, Eiji; Hashimoto, Ruiko; Etoh, Yoshiki; Ichihara, Sachiko; Horikawa, Kazumi; Uchimura, Masako

    2011-01-01

    The distribution of insertion sequence (IS) 629 among strains of enterohemorrhagic Escherichia coli serovar O157 (O157) was investigated and compared with the strain lineages defined by lineage specific polymorphism assay-6 (LSPA-6) to demonstrate the effectiveness of IS629 analysis for population genetics analysis. Using pulsed-field gel electrophoresis and variable-number tandem repeat typing, 140 strains producing both VT1 and VT2 and 98 strains producing only VT2 were selected from a total of 592 strains isolated from patients and asymptomatic carriers in Chiba Prefecture, Japan, during 2003-2008. By LSPA-6 analysis, six strains had atypical amplicon sizes in their Z5935 loci and five strains had atypical amplicon sizes in their arp-iclR intergenic regions. Sequence analyses of PCR amplified DNAs showed that five of the six loci used for LSPA-6 analysis had tandem repeats and the allele changes were due to changes in the number of tandem repeats. Subculturing and long-term incubation was found to have no detectable effect on the lineages defined by LSPA-6 analysis, demonstrating the robustness of LSPA-6 analysis. Minimum spanning tree analysis reconstruction revealed that strains in lineage I, I/II, and II clustered on separate branches, indicating that the distribution of IS629 was biased among O157 strains in different lineages. Strains with LSPA-6 codes 231111, 211113, and 211114 had atypical amplicon sizes and were clustered in lineage I/II branch, and strains with LSPA-6 codes 212114, 221123, 221223, 222123, 222224, 242123, 252123, and 242222 had atypical amplicon sizes and clustered in lineage II branches. Linkage disequilibrium was observed in strains in every lineage when the standardized index of association was calculated using IS629 distribution data. Therefore, the distribution analysis of IS629 may be effective for population genetics analysis of O157 due to the biased IS629 distribution among strains in the three O157 lineages. Copyright © 2010 Elsevier B.V. All rights reserved.

  19. Slipped-strand mispairing at noncontiguous repeats in Poecilia reticulata: a model for minisatellite birth.

    PubMed Central

    Taylor, J S; Breden, F

    2000-01-01

    The standard slipped-strand mispairing (SSM) model for the formation of variable number tandem repeats (VNTRs) proposes that a few tandem repeats, produced by chance mutations, provide the "raw material" for VNTR expansion. However, this model is unlikely to explain the formation of VNTRs with long motifs (e.g., minisatellites), because the likelihood of a tandem repeat forming by chance decreases rapidly as the length of the repeat motif increases. Phylogenetic reconstruction of the birth of a mitochondrial (mt) DNA minisatellite in guppies suggests that VNTRs with long motifs can form as a consequence of SSM at noncontiguous repeats. VNTRs formed in this manner have motifs longer than the noncontiguous repeat originally formed by chance and are flanked by one unit of the original, noncontiguous repeat. SSM at noncontiguous repeats can therefore explain the birth of VNTRs with long motifs and the "imperfect" or "short direct" repeats frequently observed adjacent to both mtDNA and nuclear VNTRs. PMID:10880490

  20. Tandem repeat regions within the Burkholderia pseudomallei genome and their application for high resolution genotyping.

    PubMed

    U'Ren, Jana M; Schupp, James M; Pearson, Talima; Hornstra, Heidie; Friedman, Christine L Clark; Smith, Kimothy L; Daugherty, Rebecca R Leadem; Rhoton, Shane D; Leadem, Ben; Georgia, Shalamar; Cardon, Michelle; Huynh, Lynn Y; DeShazer, David; Harvey, Steven P; Robison, Richard; Gal, Daniel; Mayo, Mark J; Wagner, David; Currie, Bart J; Keim, Paul

    2007-03-30

    The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays were subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominately located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B.mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 and 0.94. Mutation rates for these loci are comparable (>10-5 per locus per generation) to that of the most diverse tandemly repeated regions found in other less diverse bacteria. The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were identical using previous typing methods. Given the health threat to humans and livestock and the potential for B. pseudomallei to be released intentionally, MLVA could prove to be an important tool for fine-scale epidemiological or forensic tracking of this increasingly important environmental pathogen.

  1. Evidence of birth-and-death evolution of 5S rRNA gene in Channa species (Teleostei, Perciformes).

    PubMed

    Barman, Anindya Sundar; Singh, Mamta; Singh, Rajeev Kumar; Lal, Kuldeep Kumar

    2016-12-01

    In higher eukaryotes, minor rDNA family codes for 5S rRNA that is arranged in tandem arrays and comprises of a highly conserved 120 bp long coding sequence with a variable non-transcribed spacer (NTS). Initially the 5S rDNA repeats are considered to be evolved by the process of concerted evolution. But some recent reports, including teleost fishes suggested that evolution of 5S rDNA repeat does not fit into the concerted evolution model and evolution of 5S rDNA family may be explained by a birth-and-death evolution model. In order to study the mode of evolution of 5S rDNA repeats in Perciformes fish species, nucleotide sequence and molecular organization of five species of genus Channa were analyzed in the present study. Molecular analyses revealed several variants of 5S rDNA repeats (four types of NTS) and networks created by a neighbor net algorithm for each type of sequences (I, II, III and IV) did not show a clear clustering in species specific manner. The stable secondary structure is predicted and upstream and downstream conserved regulatory elements were characterized. Sequence analyses also shown the presence of two putative pseudogenes in Channa marulius. Present study supported that 5S rDNA repeats in genus Channa were evolved under the process of birth-and-death.

  2. A Mitochondrial Genome of Rhyparochromidae (Hemiptera: Heteroptera) and a Comparative Analysis of Related Mitochondrial Genomes.

    PubMed

    Li, Teng; Yang, Jie; Li, Yinwan; Cui, Ying; Xie, Qiang; Bu, Wenjun; Hillis, David M

    2016-10-19

    The Rhyparochromidae, the largest family of Lygaeoidea, encompasses more than 1,850 described species, but no mitochondrial genome has been sequenced to date. Here we describe the first mitochondrial genome for Rhyparochromidae: a complete mitochondrial genome of Panaorus albomaculatus (Scott, 1874). This mitochondrial genome is comprised of 16,345 bp, and contains the expected 37 genes and control region. The majority of the control region is made up of a large tandem-repeat region, which has a novel pattern not previously observed in other insects. The tandem-repeats region of P. albomaculatus consists of 53 tandem duplications (including one partial repeat), which is the largest number of tandem repeats among all the known insect mitochondrial genomes. Slipped-strand mispairing during replication is likely to have generated this novel pattern of tandem repeats. Comparative analysis of tRNA gene families in sequenced Pentatomomorpha and Lygaeoidea species shows that the pattern of nucleotide conservation is markedly higher on the J-strand. Phylogenetic reconstruction based on mitochondrial genomes suggests that Rhyparochromidae is not the sister group to all the remaining Lygaeoidea, and supports the monophyly of Lygaeoidea.

  3. Tandemly repeated sequences in mtDNA control region of whitefish, Coregonus lavaretus.

    PubMed

    Brzuzan, P

    2000-06-01

    Length variation of the mitochondrial DNA control region was observed with PCR amplification of a sample of 138 whitefish (Coregonus lavaretus). Nucleotide sequences of representative PCR products showed that the variation was due to the presence of an approximately 100-bp motif tandemly repeated two, three, or five times in the region between the conserved sequence block-3 (CSB-3) and the gene for phenylalanine tRNA. This is the first report on the tandem array composed of long repeat units in mitochondrial DNA of salmonids.

  4. Acquisition and amplification of a testis-expressed autosomal gene, SSL, by the Drosophila Y chromosome

    PubMed Central

    Kalmykova, Alla I.; Shevelyov, Yury Y.; Dobritsa, Anna A.; Gvozdev, Vladimir A.

    1997-01-01

    The acquisition of autosomal fertility genes has been proposed to be an important process in human Y chromosome evolution. For example, the Y-linked fertility factor DAZ (Deleted in Azoospermia) appears to have arisen after the transposition and tandem amplification of the autosomal DAZH gene. The Drosophila melanogaster Y chromosome contains tandemly repeated Su(Ste) units that are thought to affect male fertility as suppressors of the homologous X-linked Stellate repeats. Here we report the detection of a testis-expressed autosomal gene, SSL [Su(Ste)-like], that appears to be an ancestor of the Y-linked Su(Ste) units. SSL encodes a casein kinase 2 (CK2) β-subunit-like protein. Its putative ORF shares extensive (45%) homology with the genuine β-subunit of CK2 and retains the conserved C-terminal and Glu/Asp-rich domains that are essential for CK2 holoenzyme regulation. SSL maps within region 60D1–2 of D. melanogaster and D. simulans polytene chromosomes. We present evidence that SSL was derived from the genuine βCK2 gene by reverse transcription. This event resulted in the loss of the first three introns in the coding region of the SSL ancestor gene. Evolutionary analysis indicates that SSL has evolved under selective pressure at the translational level. Its sequence, especially in the 3′ region, is much closer to the Y-linked Su(Ste) tandem repeats than to the βCK2 gene. These results suggest that the acquisition of testis-specific autosomal genes may be important for the evolution of Drosophila as well as human Y chromosomes. PMID:9177211

  5. [Analysis on genetic polymorphism of 5 STR loci selected from X chromosome].

    PubMed

    Liu, Qi-ji; Gong, Yao-qin; Zhang, Xi-yu; Gao, Gui-min; Li, Jiang-xia; Guo, Yi-shou

    2005-02-01

    To select short tandem repeats(STR) from X chromosome. STR is a universal genetic marker that has changeable polymorphism and stable heredity in human genome. It is a specific DNA segment composed of 2-6 base pairs as its core sequence. It is an ideal DNA marker used in linkage analysis and gene mapping. In this study, 8 short tandem repeats were selected from two genomic clones on X chromosome by using BCM Search Launcher. Primers amplifying the STR loci were designed by using Primer 3.0 according to the unique sequence flanking the STRs. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five of these STRs were polymorphic. Chi-square test indicated that the distribution of genotypes agreed with Hardy-Weinberg equilibrium (P>0.05). Five polymorphic short tandem repeats have been identified on chromosome X and will be useful for linkage analysis and gene mapping.

  6. Concerted evolution of the tandem array encoding primate U2 snRNA occurs in situ, without changing the cytological context of the RNU2 locus.

    PubMed Central

    Pavelitz, T; Rusché, L; Matera, A G; Scharf, J M; Weiner, A M

    1995-01-01

    In primates, the tandemly repeated genes encoding U2 small nuclear RNA evolve concertedly, i.e. the sequence of the U2 repeat unit is essentially homogeneous within each species but differs somewhat between species. Using chromosome painting and the NGFR gene as an outside marker, we show that the U2 tandem array (RNU2) has remained at the same chromosomal locus (equivalent to human 17q21) through multiple speciation events over > 35 million years leading to the Old World monkey and hominoid lineages. The data suggest that the U2 tandem repeat, once established in the primate lineage, contained sequence elements favoring perpetuation and concerted evolution of the array in situ, despite a pericentric inversion in chimpanzee, a reciprocal translocation in gorilla and a paracentric inversion in orang utan. Comparison of the 11 kb U2 repeat unit found in baboon and other Old World monkeys with the 6 kb U2 repeat unit in humans and other hominids revealed that an ancestral U2 repeat unit was expanded by insertion of a 5 kb retrovirus bearing 1 kb long terminal repeats (LTRs). Subsequent excision of the provirus by homologous recombination between the LTRs generated a 6 kb U2 repeat unit containing a solo LTR. Remarkably, both junctions between the human U2 tandem array and flanking chromosomal DNA at 17q21 fall within the solo LTR sequence, suggesting a role for the LTR in the origin or maintenance of the primate U2 array. Images PMID:7828589

  7. MSDB: A Comprehensive Database of Simple Sequence Repeats

    PubMed Central

    Avvaru, Akshay Kumar; Saxena, Saketh; Mishra, Rakesh Kumar

    2017-01-01

    Abstract Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1–6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb. PMID:28854643

  8. Stabilization of perfect and imperfect tandem repeats by single-strand DNA exonucleases

    PubMed Central

    Feschenko, Vladimir V.; Rajman, Luis A.; Lovett, Susan T.

    2003-01-01

    Rearrangements between tandemly repeated DNA sequences are a common source of genetic instability. Such rearrangements underlie several human genetic diseases. In many organisms, the mismatch-repair (MMR) system functions to stabilize repeats when the repeat unit is short or when sequence imperfections are present between the repeats. We show here that the action of single-stranded DNA (ssDNA) exonucleases plays an additional, important role in stabilizing tandem repeats, independent of their role in MMR. For perfect repeats of ≈100 bp in Escherichia coli that are not susceptible to MMR, exonuclease (Exo)-I, ExoX, and RecJ exonuclease redundantly inhibit deletion. Our data suggest that >90% of potential deletion events are avoided by the combined action of these three exonucleases. Imperfect tandem repeats, less prone to rearrangements, are stabilized by both the MMR-pathway and ssDNA-specific exonucleases. For 100-bp repeats containing four mispairs, ExoI alone aborts most deletion events, even in the presence of a functional MMR system. By genetic analysis, we show that the inhibitory effect of ssDNA exonucleases on deletion formation is independent of the MutS and UvrD proteins. Exonuclease degradation of DNA displaced during the deletion process may abort slipped misalignment. Exonuclease action is therefore a significant force in genetic stabilization of many forms of repetitive DNA. PMID:12538867

  9. Tandem-repeat protein domains across the tree of life.

    PubMed

    Jernigan, Kristin K; Bordenstein, Seth R

    2015-01-01

    Tandem-repeat protein domains, composed of repeated units of conserved stretches of 20-40 amino acids, are required for a wide array of biological functions. Despite their diverse and fundamental functions, there has been no comprehensive assessment of their taxonomic distribution, incidence, and associations with organismal lifestyle and phylogeny. In this study, we assess for the first time the abundance of armadillo (ARM) and tetratricopeptide (TPR) repeat domains across all three domains in the tree of life and compare the results to our previous analysis on ankyrin (ANK) repeat domains in this journal. All eukaryotes and a majority of the bacterial and archaeal genomes analyzed have a minimum of one TPR and ARM repeat. In eukaryotes, the fraction of ARM-containing proteins is approximately double that of TPR and ANK-containing proteins, whereas bacteria and archaea are enriched in TPR-containing proteins relative to ARM- and ANK-containing proteins. We show in bacteria that phylogenetic history, rather than lifestyle or pathogenicity, is a predictor of TPR repeat domain abundance, while neither phylogenetic history nor lifestyle predicts ARM repeat domain abundance. Surprisingly, pathogenic bacteria were not enriched in TPR-containing proteins, which have been associated within virulence factors in certain species. Taken together, this comparative analysis provides a newly appreciated view of the prevalence and diversity of multiple types of tandem-repeat protein domains across the tree of life. A central finding of this analysis is that tandem repeat domain-containing proteins are prevalent not just in eukaryotes, but also in bacterial and archaeal species.

  10. Ten tandem repeats of {beta}-hCG 109-118 enhance immunogenicity and anti-tumor effects of {beta}-hCG C-terminal peptide carried by mycobacterial heat-shock protein HSP65

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang Yankai; Yan Rong; He Yi

    2006-07-14

    The {beta}-subunit of human chorionic gonadotropin ({beta}-hCG) is secreted by many kinds of tumors and it has been used as an ideal target antigen to develop vaccines against tumors. In view of the low immunogenicity of this self-peptide,we designed a method based on isocaudamer technique to repeat tandemly the 10-residue sequence X of {beta}-hCG (109-118), then 10 tandemly repeated copies of the 10-residue sequence combined with {beta}-hCG C-terminal 37 peptides were fused to mycobacterial heat-shock protein 65 to construct a fusion protein HSP65-X10-{beta}hCGCTP37 as an immunogen. In this study, we examined the effect of the tandem repeats of this 10-residuemore » sequence in eliciting an immune by comparing the immunogenicity and anti-tumor effects of the two immunogens, HSP65-X10-{beta}hCGCTP37 and HSP65-{beta}hCGCTP37 (without the 10 tandem repeats). Immunization of mice with the fusion protein HSP65-X10-{beta}hCGCTP37 elicited much higher levels of specific anti-{beta}-hCG antibodies and more effectively inhibited the growth of Lewis lung carcinoma (LLC) in vivo than with HSP65-{beta}hCGCTP37, which should suggest that HSP65-X10-{beta}hCGCTP37 may be an effective protein vaccine for the treatment of {beta}-hCG-dependent tumors and multiple tandem repeats of a certain epitope are an efficient method to overcome the low immunogenicity of self-peptide antigens.« less

  11. The central domain of bovine submaxillary mucin consists of over 50 tandem repeats of 329 amino acids. Chromosomal localization of the BSM1 gene and relations to ovine and porcine counterparts.

    PubMed

    Jiang, W; Gupta, D; Gallagher, D; Davis, S; Bhavanandan, V P

    2000-04-01

    We previously elucidated five distinct protein domains (I-V) for bovine submaxillary mucin, which is encoded by two genes, BSM1 and BSM2. Using Southern blot analysis, genomic cloning and sequencing of the BSM1 gene, we now show that the central domain (V) consists of approximately 55 tandem repeats of 329 amino acids and that domains III-V are encoded by a 58.4-kb exon, the largest exon known for all genes to date. The BSM1 gene was mapped by fluorescence in situ hybridization to the proximal half of chromosome 5 at bands q2. 2-q2.3. The amino-acid sequence of six tandem repeats (two full and four partial) were found to have only 92-94% identities. We propose that the variability in the amino-acid sequences of the mucin tandem repeat is important for generating the combinatorial library of saccharides that are necessary for the protective function of mucins. The deduced peptide sequences of the central domain match those determined from the purified bovine submaxillary mucin and also show 68-94% identity to published peptide sequences of ovine submaxillary mucin. This indicates that the core protein of ovine submaxillary mucin is closely related to that of bovine submaxillary mucin and contains similar tandem repeats in the central domain. In contrast, the central domain of porcine submaxillary mucin is reported to consist of 81-amino-acid tandem repeats. However, both bovine submaxillary mucin and porcine submaxillary mucin contain similar N-terminal and C-terminal domains and the corresponding genes are in the conserved linkage regions of the respective genomes.

  12. Tandem-repeat protein domains across the tree of life

    PubMed Central

    Jernigan, Kristin K.

    2015-01-01

    Tandem-repeat protein domains, composed of repeated units of conserved stretches of 20–40 amino acids, are required for a wide array of biological functions. Despite their diverse and fundamental functions, there has been no comprehensive assessment of their taxonomic distribution, incidence, and associations with organismal lifestyle and phylogeny. In this study, we assess for the first time the abundance of armadillo (ARM) and tetratricopeptide (TPR) repeat domains across all three domains in the tree of life and compare the results to our previous analysis on ankyrin (ANK) repeat domains in this journal. All eukaryotes and a majority of the bacterial and archaeal genomes analyzed have a minimum of one TPR and ARM repeat. In eukaryotes, the fraction of ARM-containing proteins is approximately double that of TPR and ANK-containing proteins, whereas bacteria and archaea are enriched in TPR-containing proteins relative to ARM- and ANK-containing proteins. We show in bacteria that phylogenetic history, rather than lifestyle or pathogenicity, is a predictor of TPR repeat domain abundance, while neither phylogenetic history nor lifestyle predicts ARM repeat domain abundance. Surprisingly, pathogenic bacteria were not enriched in TPR-containing proteins, which have been associated within virulence factors in certain species. Taken together, this comparative analysis provides a newly appreciated view of the prevalence and diversity of multiple types of tandem-repeat protein domains across the tree of life. A central finding of this analysis is that tandem repeat domain-containing proteins are prevalent not just in eukaryotes, but also in bacterial and archaeal species. PMID:25653910

  13. A TALE-inspired computational screen for proteins that contain approximate tandem repeats.

    PubMed

    Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias

    2017-01-01

    TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen.

  14. A TALE-inspired computational screen for proteins that contain approximate tandem repeats

    PubMed Central

    Krwawicz, Joanna

    2017-01-01

    TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen. PMID:28617832

  15. Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm

    PubMed Central

    Glunčić, Matko; Paar, Vladimir

    2013-01-01

    The main feature of global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram. In this way, we obtain very fast, efficient and highly automatized repeat finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of ‘magnifying glass’ effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes). PMID:22977183

  16. Tandem Repeated Irritation Test (TRIT) Studies and Clinical Relevance: Post 2006.

    PubMed

    Reddy, Rasika; Maibach, Howard

    2018-06-11

    Single or multiple applications of irritants can lead to occupational contact dermatitis, and most commonly irritant contact dermatitis (ICD). Tandem irritation, the sequential application of two irritants to a target skin area, has been studied using the Tandem Repeated Irritation Test (TRIT) to provide a more accurate representation of skin irritation. Here we present an update to Kartono's review on tandem irritation studies since 2006 [1]. We surveyed the literature available on PubMed, Embase, Google Scholar, and the UCSF Dermatology library databases since 2006. The studies included discuss the tandem effects of common chemical irritants, organic solvents, occlusion as well as clinical relevance - and enlarge our ability to discern whether multiple chemical exposures are more or less likely to enhance irritation.

  17. Human minisatellite alleles detectable only after PCR amplification.

    PubMed

    Armour, J A; Crosier, M; Jeffreys, A J

    1992-01-01

    We present evidence that a proportion of alleles at two human minisatellite loci is undetected by standard Southern blot hybridization. In each case the missing allele(s) can be identified after PCR amplification and correspond to tandem arrays too short to detect by hybridization. At one locus, there is only one undetected allele (population frequency 0.3), which contains just three repeat units. At the second locus, there are at least five undetected alleles (total population frequency 0.9) containing 60-120 repeats; they are not detected because these tandem repeats give very poor signals when used as a probe in standard Southern blot hybridization, and also cross-hybridize with other sequences in the genome. Under these circumstances only signals from the longest tandemly repeated alleles are detectable above the nonspecific background. The structures of these loci have been compared in human and primate DNA, and at one locus the short human allele containing three repeat units is shown to be an intermediate state in the expansion of a monomeric precursor allele in primates to high copy number in the longer human arrays. We discuss the implications of such loci for studies of human populations, minisatellite isolation by cloning, and the evolution of highly variable tandem arrays.

  18. The evolution of filamin – A protein domain repeat perspective

    PubMed Central

    Light, Sara; Sagit, Rauan; Ithychanda, Sujay S.; Qin, Jun; Elofsson, Arne

    2013-01-01

    Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates. PMID:22414427

  19. The evolution of filamin-a protein domain repeat perspective.

    PubMed

    Light, Sara; Sagit, Rauan; Ithychanda, Sujay S; Qin, Jun; Elofsson, Arne

    2012-09-01

    Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates. Copyright © 2012 Elsevier Inc. All rights reserved.

  20. The complete mitochondrial genome of the African palm civet, Nandinia binotata, the only representative of the family Nandiniidae (Mammalia, Carnivora).

    PubMed

    Hassanin, Alexandre

    2016-01-01

    Here I report the complete mitochondrial genome of the African palm civet, (Nandinia binotata) as sequenced from overlapping PCR products. The genome is 17,103 bp in length and contains the 37 genes found in a typical mammalian genome: 13 protein-coding genes, 22 transfer RNA genes and 2 ribosomal RNA genes. The control region of N. binotata includes both RS2 and RS3 tandem repeats. The overall base composition on the L-strand is A: 33.6%, C: 27.3%, G: 13.0%, and T: 26.1%.

  1. Intratypic variability of a tandem repeat locus within the DNA polymerase gene of human herpes simplex virus type 2.

    PubMed

    Sun, Yongjiang; Chan, Roy Kum Wah; Tan, Suat Hoon

    2004-01-01

    In this study, the irntratypic variability of a tandem repeat locus within the DNA polymerase (pol) gene of human herpes simplex virus type 2 (HSV2) was uncovered. The locus contained variable numbers of tandem dodecanucleotide (5'-GAC GAG GAC GGG-3') repetitive units. Our result showed that approximately 95% of analyzed HSV2 clinical isolates and the current GenBank HSV2 strains contained two copies of the repetitive units. From genital herpes specimens, three new HSV2 strains, which respectively contained 1, 3, and 4 copies of the repetitive units, were identified. This variable number of tandem repeat (VNTR) locus is absent in HSV1, and thus it also contributes to the intertypic variability of HSV1 and HSV2. The intratypic variability of the locus may be useful for HSV2 strain genotyping and this application is discussed.

  2. Repetition as the essence of life on this earth: music and genes.

    PubMed

    Ohno, S

    1987-01-01

    In prebiotic nucleic acid replication, templates appear to have been in short supply. A single round of tandem duplication of existing oligomers assured progressive extension of templates to the length adequate for encoding of polypeptide chains. Thus, the first set of coding sequences had to be repeats of base oligomers encoding polypeptide chains of various periodicities. On one hand, the readiness of these periodical polypeptide chains to assume alpha-helical and/or beta-sheet secondary structures contributed to the extremely rapid initial functional diversification of these polypeptide chains. It would be recalled that most, if not all, of the sugar-metabolizing enzymes had already achieved the inviolable functional competence before the division of prokaryotes from eukaryotes. On the other hand, a certain (dipeptidic?) of the peptidic periodicities was apparently chosen as the timekeeping unit by the biological clock. Musical compositions too apparently evolved originally as a timekeeping device. Accordingly, repetitiousness is evident in all musical compositions. Evolution of musical compositions from the early Baroque to the late Romantic parallels that of coding sequences from rather exact repeats of base oligomers to more complex modern coding sequences in which repetitious elements are less conspicuous and more varied. Inasmuch as the earth is governed by the hierarchy of periodicities (days, months and years), such reliance on periodicities is rather expected.

  3. Variable-number tandem repeats as molecular markers for biotypes of Pasteuria ramosa in Daphnia spp.

    PubMed

    Mouton, Laurence; Nong, Guang; Preston, James F; Ebert, Dieter

    2007-06-01

    Variable-number tandem repeats (VNTRs) have been identified in populations of Pasteuria ramosa, a castrating endobacterium of Daphnia species. The allelic polymorphisms at 14 loci in laboratory and geographically diverse soil samples showed that VNTRs may serve as biomarkers for the genetic characterization of P. ramosa isolates.

  4. Diversity and structure of PIF/Harbinger-like elements in the genome of Medicago truncatula

    PubMed Central

    Grzebelus, Dariusz; Lasota, Slawomir; Gambin, Tomasz; Kucherov, Gregory; Gambin, Anna

    2007-01-01

    Background Transposable elements constitute a significant fraction of plant genomes. The PIF/Harbinger superfamily includes DNA transposons (class II elements) carrying terminal inverted repeats and producing a 3 bp target site duplication upon insertion. The presence of an ORF coding for the DDE/DDD transposase, required for transposition, is characteristic for the autonomous PIF/Harbinger-like elements. Based on the above features, PIF/Harbinger-like elements were identified in several plant genomes and divided into several evolutionary lineages. Availability of a significant portion of Medicago truncatula genomic sequence allowed for mining PIF/Harbinger-like elements, starting from a single previously described element MtMaster. Results Twenty two putative autonomous, i.e. carrying an ORF coding for TPase and complete terminal inverted repeats, and 67 non-autonomous PIF/Harbinger-like elements were found in the genome of M. truncatula. They were divided into five families, MtPH-A5, MtPH-A6, MtPH-D,MtPH-E, and MtPH-M, corresponding to three previously identified and two new lineages. The largest families, MtPH-A6 and MtPH-M were further divided into four and three subfamilies, respectively. Non-autonomous elements were usually direct deletion derivatives of the putative autonomous element, however other types of rearrangements, including inversions and nested insertions were also observed. An interesting structural characteristic – the presence of 60 bp tandem repeats – was observed in a group of elements of subfamily MtPH-A6-4. Some families could be related to miniature inverted repeat elements (MITEs). The presence of empty loci (RESites), paralogous to those flanking the identified transposable elements, both autonomous and non-autonomous, as well as the presence of transposon insertion related size polymorphisms, confirmed that some of the mined elements were capable for transposition. Conclusion The population of PIF/Harbinger-like elements in the genome of M. truncatula is diverse. A detailed intra-family comparison of the elements' structure proved that they proliferated in the genome generally following the model of abortive gap repair. However, the presence of tandem repeats facilitated more pronounced rearrangements of the element internal regions. The insertion polymorphism of the MtPH elements and related MITE families in different populations of M. truncatula, if further confirmed experimentally, could be used as a source of molecular markers complementary to other marker systems. PMID:17996080

  5. A Comparative Proteomic Analysis of the Simple Amino Acid Repeat Distributions in Plasmodia Reveals Lineage Specific Amino Acid Selection

    PubMed Central

    Dalby, Andrew R.

    2009-01-01

    Background Microsatellites have been used extensively in the field of comparative genomics. By studying microsatellites in coding regions we have a simple model of how genotypic changes undergo selection as they are directly expressed in the phenotype as altered proteins. The simplest of these tandem repeats in coding regions are the tri-nucleotide repeats which produce a repeat of a single amino acid when translated into proteins. Tri-nucleotide repeats are often disease associated, and are also known to be unstable to both expansion and contraction. This makes them sensitive markers for studying proteome evolution, in closely related species. Results The evolutionary history of the family of malarial causing parasites Plasmodia is complex because of the life-cycle of the organism, where it interacts with a number of different hosts and goes through a series of tissue specific stages. This study shows that the divergence between the primate and rodent malarial parasites has resulted in a lineage specific change in the simple amino acid repeat distribution that is correlated to A–T content. The paper also shows that this altered use of amino acids in SAARs is consistent with the repeat distributions being under selective pressure. Conclusions The study shows that simple amino acid repeat distributions can be used to group related species and to examine their phylogenetic relationships. This study also shows that an outgroup species with a similar A–T content can be distinguished based only on the amino acid usage in repeats, and suggest that this might be a useful feature for proteome clustering. The lineage specific use of amino acids in repeat regions suggests that comparative studies of SAAR distributions between proteomes gives an insight into the mechanisms of expansion and the selective pressures acting on the organism. PMID:19597555

  6. MSDB: A Comprehensive Database of Simple Sequence Repeats.

    PubMed

    Avvaru, Akshay Kumar; Saxena, Saketh; Sowpati, Divya Tej; Mishra, Rakesh Kumar

    2017-06-01

    Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1-6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Mimosoid legume plastome evolution: IR expansion, tandem repeat expansions, and accelerated rate of evolution in clpP.

    PubMed

    Dugas, Diana V; Hernandez, David; Koenen, Erik J M; Schwarz, Erika; Straub, Shannon; Hughes, Colin E; Jansen, Robert K; Nageswara-Rao, Madhugiri; Staats, Martijn; Trujillo, Joshua T; Hajrah, Nahid H; Alharbi, Njud S; Al-Malki, Abdulrahman L; Sabir, Jamal S M; Bailey, C Donovan

    2015-11-23

    The Leguminosae has emerged as a model for studying angiosperm plastome evolution because of its striking diversity of structural rearrangements and sequence variation. However, most of what is known about legume plastomes comes from few genera representing a subset of lineages in subfamily Papilionoideae. We investigate plastome evolution in subfamily Mimosoideae based on two newly sequenced plastomes (Inga and Leucaena) and two recently published plastomes (Acacia and Prosopis), and discuss the results in the context of other legume and rosid plastid genomes. Mimosoid plastomes have a typical angiosperm gene content and general organization as well as a generally slow rate of protein coding gene evolution, but they are the largest known among legumes. The increased length results from tandem repeat expansions and an unusual 13 kb IR-SSC boundary shift in Acacia and Inga. Mimosoid plastomes harbor additional interesting features, including loss of clpP intron1 in Inga, accelerated rates of evolution in clpP for Acacia and Inga, and dN/dS ratios consistent with neutral and positive selection for several genes. These new plastomes and results provide important resources for legume comparative genomics, plant breeding, and plastid genetic engineering, while shedding further light on the complexity of plastome evolution in legumes and angiosperms.

  8. PSSRdb: a relational database of polymorphic simple sequence repeats extracted from prokaryotic genomes.

    PubMed

    Kumar, Pankaj; Chaitanya, Pasumarthy S; Nagarajaram, Hampapathalu A

    2011-01-01

    PSSRdb (Polymorphic Simple Sequence Repeats database) (http://www.cdfd.org.in/PSSRdb/) is a relational database of polymorphic simple sequence repeats (PSSRs) extracted from 85 different species of prokaryotes. Simple sequence repeats (SSRs) are the tandem repeats of nucleotide motifs of the sizes 1-6 bp and are highly polymorphic. SSR mutations in and around coding regions affect transcription and translation of genes. Such changes underpin phase variations and antigenic variations seen in some bacteria. Although SSR-mediated phase variation and antigenic variations have been well-studied in some bacteria there seems a lot of other species of prokaryotes yet to be investigated for SSR mediated adaptive and other evolutionary advantages. As a part of our on-going studies on SSR polymorphism in prokaryotes we compared the genome sequences of various strains and isolates available for 85 different species of prokaryotes and extracted a number of SSRs showing length variations and created a relational database called PSSRdb. This database gives useful information such as location of PSSRs in genomes, length variation across genomes, the regions harboring PSSRs, etc. The information provided in this database is very useful for further research and analysis of SSRs in prokaryotes.

  9. Genome-wide analysis of tandem repeats in plants and green algae

    Treesearch

    Zhixin Zhao; Cheng Guo; Sreeskandarajan Sutharzan; Pei Li; Craig Echt; Jie Zhang; Chun Liang

    2014-01-01

    Tandem repeats (TRs) extensively exist in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs in a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among...

  10. Versatile communication strategies among tandem WW domain repeats

    PubMed Central

    Dodson, Emma Joy; Fishbain-Yoskovitz, Vered; Rotem-Bamberger, Shahar

    2015-01-01

    Interactions mediated by short linear motifs in proteins play major roles in regulation of cellular homeostasis since their transient nature allows for easy modulation. We are still far from a full understanding and appreciation of the complex regulation patterns that can be, and are, achieved by this type of interaction. The fact that many linear-motif-binding domains occur in tandem repeats in proteins indicates that their mutual communication is used extensively to obtain complex integration of information toward regulatory decisions. This review is an attempt to overview, and classify, different ways by which two and more tandem repeats cooperate in binding to their targets, in the well-characterized family of WW domains and their corresponding polyproline ligands. PMID:25710931

  11. Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.

    PubMed

    Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru

    2015-01-01

    The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.

  12. Evolution of short inverted repeat in cupressophytes, transfer of accD to nucleus in Sciadopitys verticillata and phylogenetic position of Sciadopityaceae.

    PubMed

    Li, Jia; Gao, Lei; Chen, Shanshan; Tao, Ke; Su, Yingjuan; Wang, Ting

    2016-02-11

    Sciadopitys verticillata is an evergreen conifer and an economically valuable tree used in construction, which is the only member of the family Sciadopityaceae. Acquisition of the S. verticillata chloroplast (cp) genome will be useful for understanding the evolutionary mechanism of conifers and phylogenetic relationships among gymnosperm. In this study, we have first reported the complete chloroplast genome of S. verticillata. The total genome is 138,284 bp in length, consisting of 118 unique genes. The S. verticillata cp genome has lost one copy of the canonical inverted repeats and shown distinctive genomic structure comparing with other cupressophytes. Fifty-three simple sequence repeat loci and 18 forward tandem repeats were identified in the S. verticillata cp genome. According to the rearrangement of cupressophyte cp genome, we proposed one mechanism for the formation of inverted repeat: tandem repeat occured first, then rearrangement divided the tandem repeat into inverted repeats located at different regions. Phylogenetic estimates inferred from 59-gene sequences and cpDNA organizations have both shown that S. verticillata was sister to the clade consisting of Cupressaceae, Taxaceae, and Cephalotaxaceae. Moreover, accD gene was found to be lost in the S. verticillata cp genome, and a nucleus copy was identified from two transcriptome data.

  13. Revisiting the TALE repeat.

    PubMed

    Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng

    2014-04-01

    Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.

  14. Whole genome evaluation of tandem repeat polymorphisms between two pathogenically similar strains of Xylella fastidiosa isolated from almond and grape in California

    USDA-ARS?s Scientific Manuscript database

    Whole genome tandem repeat polymorphisms were evaluated between two closely related Xylella fastidiosa strains, M23 and Temecula1, both cause almond leaf scorch disease (ALSD) and grape Pierce’s disease (PD) in California. Strain M23 was isolated from almond and the genome was sequenced in this stu...

  15. DNA Fingerprint Analysis of Three Short Tandem Repeat (STR) Loci for Biochemistry and Forensic Science Laboratory Courses

    ERIC Educational Resources Information Center

    McNamara-Schroeder, Kathleen; Olonan, Cheryl; Chu, Simon; Montoya, Maria C.; Alviri, Mahta; Ginty, Shannon; Love, John J.

    2006-01-01

    We have devised and implemented a DNA fingerprinting module for an upper division undergraduate laboratory based on the amplification and analysis of three of the 13 short tandem repeat loci that are required by the Federal Bureau of Investigation Combined DNA Index System (FBI CODIS) data base. Students first collect human epithelial (cheek)…

  16. Two different size classes of 5S rDNA units coexisting in the same tandem array in the razor clam Ensis macha: is this region suitable for phylogeographic studies?

    PubMed

    Fernández-Tajes, Juan; Méndez, Josefina

    2009-12-01

    For a study of 5S ribosomal genes (rDNA) in the razor clam Ensis macha, the 5S rDNA region was amplified and sequenced. Two variants, so-called type I or short repeat (approximately 430 bp) and type II or long repeat (approximately 735 bp), appeared to be the main components of the 5S rDNA of this species. Their spacers differed markedly, both in length and nucleotide composition. The organization of the two variants was investigated by amplifying the genomic DNA with primers based on the sequence of the type I and type II spacers. PCR amplification products with primers EMLbF and EMSbR showed that the long and short repeats are associated within the same tandem array, suggesting an intermixed arrangement of both spacers. Nevertheless, amplifications carried out with inverse primers EMSinvF/R and EMLinvF/R revealed that some short and long repeats are contiguous in the same tandem array. This is the first report of the coexistence of two variable spacers in the same tandem array in bivalve mollusks.

  17. The complete mitochondrial genome sequence of the Tibetan red fox (Vulpes vulpes montana).

    PubMed

    Zhang, Jin; Zhang, Honghai; Zhao, Chao; Chen, Lei; Sha, Weilai; Liu, Guangshuai

    2015-01-01

    In this study, the complete mitochondrial genome of the Tibetan red fox (Vulpes Vulpes montana) was sequenced for the first time using blood samples obtained from a wild female red fox captured from Lhasa in Tibet, China. Qinghai--Tibet Plateau is the highest plateau in the world with an average elevation above 3500 m. Sequence analysis showed it contains 12S rRNA gene, 16S rRNA gene, 22 tRNA genes, 13 protein-coding genes and 1 control region (CR). The variable tandem repeats in CR is the main reason of the length variability of mitochondrial genome among canide animals.

  18. Molecular basis of the polydispersity of mucins: implications for the generation of saccharide diversity.

    PubMed

    Bhavanandan, V P; Gupta, D; Woitach, J; Guo, X; Jiang, W

    1999-06-01

    Secreted epithelial mucins are large macromolecules which exhibit extreme polydispersity, the molecular basis of which is not fully understood. We have obtained partial sequences of two genes (BSM1 and BSM2) coding for two distinct molecules. This is the first time that such closely-related genes have been identified for any mucin from an animal. We propose that a combination of multiple homologous genes, alternative splicing, differential glycosylation, and additional post-translational processing all contribute to the extreme polydispersity of mucins. The multiple domain structure and non-identical tandem repeats are also very important for the generation of the saccharide diversities of mucins.

  19. Assessing the 5S ribosomal RNA heterogeneity in Arabidopsis thaliana using short RNA next generation sequencing data.

    PubMed

    Szymanski, Maciej; Karlowski, Wojciech M

    2016-01-01

    In eukaryotes, ribosomal 5S rRNAs are products of multigene families organized within clusters of tandemly repeated units. Accumulation of genomic data obtained from a variety of organisms demonstrated that the potential 5S rRNA coding sequences show a large number of variants, often incompatible with folding into a correct secondary structure. Here, we present results of an analysis of a large set of short RNA sequences generated by the next generation sequencing techniques, to address the problem of heterogeneity of the 5S rRNA transcripts in Arabidopsis and identification of potentially functional rRNA-derived fragments.

  20. A further analysis of the relationship between yellow ripe-fruit color and the capsanthin-capsorubin synthase gene in pepper (Capsicum sp.) indicated a new mutant variant in C. annuum and a tandem repeat structure in promoter region.

    PubMed

    Li, Zheng; Wang, Shu; Gui, Xiao-Ling; Chang, Xiao-Bei; Gong, Zhen-Hui

    2013-01-01

    Mature pepper (Capsicum sp.) fruits come in a variety of colors, including red, orange, yellow, brown, and white. To better understand the genetic and regulatory relationships between the yellow fruit phenotype and the capsanthin-capsorubin synthase gene (Ccs), we examined 156 Capsicum varieties, most of which were collected from Northwest Chinese landraces. A new ccs variant was identified in the yellow fruit cultivar CK7. Cluster analysis revealed that CK7, which belongs to the C. annuum species, has low genetic similarity to other yellow C. annuum varieties. In the coding sequence of this ccs allele, we detected a premature stop codon derived from a C to G change, as well as a downstream frame-shift caused by a 1-bp nucleotide deletion. In addition, the expression of the gene was detected in mature CK7 fruit. Furthermore, the promoter sequences of Ccs from some pepper varieties were examined, and we detected a 176-bp tandem repeat sequence in the promoter region. In all C. annuum varieties examined in this study, the repeat number was three, compared with four in two C. chinense accessions. The sequence similarity ranged from 84.8% to 97.7% among the four types of repeats, and some putative cis-elements were also found in every repeat. This suggests that the transcriptional regulation of Ccs expression is complex. Based on the analysis of the novel C. annuum mutation reported here, along with the studies of three mutation types in yellow C. annuum and C. chinense accessions, we suggest that the mechanism leading to the production of yellow color fruit may be not as complex as that leading to orange fruit production.

  1. A Further Analysis of the Relationship between Yellow Ripe-Fruit Color and the Capsanthin-Capsorubin Synthase Gene in Pepper (Capsicum sp.) Indicated a New Mutant Variant in C. annuum and a Tandem Repeat Structure in Promoter Region

    PubMed Central

    Gui, Xiao-Ling; Chang, Xiao-Bei; Gong, Zhen-Hui

    2013-01-01

    Mature pepper (Capsicum sp.) fruits come in a variety of colors, including red, orange, yellow, brown, and white. To better understand the genetic and regulatory relationships between the yellow fruit phenotype and the capsanthin-capsorubin synthase gene (Ccs), we examined 156 Capsicum varieties, most of which were collected from Northwest Chinese landraces. A new ccs variant was identified in the yellow fruit cultivar CK7. Cluster analysis revealed that CK7, which belongs to the C. annuum species, has low genetic similarity to other yellow C. annuum varieties. In the coding sequence of this ccs allele, we detected a premature stop codon derived from a C to G change, as well as a downstream frame-shift caused by a 1-bp nucleotide deletion. In addition, the expression of the gene was detected in mature CK7 fruit. Furthermore, the promoter sequences of Ccs from some pepper varieties were examined, and we detected a 176-bp tandem repeat sequence in the promoter region. In all C. annuum varieties examined in this study, the repeat number was three, compared with four in two C. chinense accessions. The sequence similarity ranged from 84.8% to 97.7% among the four types of repeats, and some putative cis-elements were also found in every repeat. This suggests that the transcriptional regulation of Ccs expression is complex. Based on the analysis of the novel C. annuum mutation reported here, along with the studies of three mutation types in yellow C. annuum and C. chinense accessions, we suggest that the mechanism leading to the production of yellow color fruit may be not as complex as that leading to orange fruit production. PMID:23637942

  2. Functional centromeres in Astragalus sinicus include a compact centromere-specific histone H3 and a 20-bp tandem repeat.

    PubMed

    Tek, Ahmet L; Kashihara, Kazunari; Murata, Minoru; Nagaki, Kiyotaka

    2011-11-01

    The centromere plays an essential role for proper chromosome segregation during cell division and usually harbors long arrays of tandem repeated satellite DNA sequences. Although this function is conserved among eukaryotes, the sequences of centromeric DNA repeats are variable. Most of our understanding of functional centromeres, which are defined by localization of a centromere-specific histone H3 (CENH3) protein, comes from model organisms. The components of the functional centromere in legumes are poorly known. The genus Astragalus is a member of the legumes and bears the largest numbers of species among angiosperms. Therefore, we studied the components of centromeres in Astragalus sinicus. We identified the CenH3 homolog of A. sinicus, AsCenH3 that is the most compact in size among higher eukaryotes. A CENH3-based assay revealed the functional centromeric DNA sequences from A. sinicus, called CentAs. The CentAs repeat is localized in A. sinicus centromeres, and comprises an AT-rich tandem repeat with a monomer size of 20 nucleotides.

  3. Multi-locus variable number tandem repeat analysis for Escherichia coli causing extraintestinal infections.

    PubMed

    Manges, Amee R; Tellis, Patricia A; Vincent, Caroline; Lifeso, Kimberley; Geneau, Geneviève; Reid-Smith, Richard J; Boerlin, Patrick

    2009-11-01

    Discriminatory genotyping methods for the analysis of Escherichia coli other than O157:H7 are necessary for public health-related activities. A new multi-locus variable number tandem repeat analysis protocol is presented; this method achieves an index of discrimination of 99.5% and is reproducible and valid when tested on a collection of 836 diverse E. coli.

  4. The profile of repeat-associated histone lysine methylation states in the mouse epigenome

    PubMed Central

    Martens, Joost H A; O'Sullivan, Roderick J; Braunschweig, Ulrich; Opravil, Susanne; Radolf, Martin; Steinlein, Peter; Jenuwein, Thomas

    2005-01-01

    Histone lysine methylation has been shown to index silenced chromatin regions at, for example, pericentric heterochromatin or of the inactive X chromosome. Here, we examined the distribution of repressive histone lysine methylation states over the entire family of DNA repeats in the mouse genome. Using chromatin immunoprecipitation in a cluster analysis representing repetitive elements, our data demonstrate the selective enrichment of distinct H3-K9, H3-K27 and H4-K20 methylation marks across tandem repeats (e.g. major and minor satellites), DNA transposons, retrotransposons, long interspersed nucleotide elements and short interspersed nucleotide elements. Tandem repeats, but not the other repetitive elements, give rise to double-stranded (ds) RNAs that are further elevated in embryonic stem (ES) cells lacking the H3-K9-specific Suv39h histone methyltransferases. Importantly, although H3-K9 tri- and H4-K20 trimethylation appear stable at the satellite repeats, many of the other repeat-associated repressive marks vary in chromatin of differentiated ES cells or of embryonic trophoblasts and fibroblasts. Our data define a profile of repressive histone lysine methylation states for the repetitive complement of four distinct mouse epigenomes and suggest tandem repeats and dsRNA as primary triggers for more stable chromatin imprints. PMID:15678104

  5. Two tandemly repeated telomere-associated sequences in Nicotiana plumbaginifolia.

    PubMed

    Chen, C M; Wang, C T; Wang, C J; Ho, C H; Kao, Y Y; Chen, C C

    1997-12-01

    Two tandemly repeated telomere-associated sequences, NP3R and NP4R, have been isolated from Nicotiana plumbaginifolia. The length of a repeating unit for NP3R and NP4R is 165 and 180 nucleotides respectively. The abundance of NP3R, NP4R and telomeric repeats is, respectively, 8.4 x 10(4), 6 x 10(3) and 1.5 x 10(6) copies per haploid genome of N. plumbaginifolia. Fluorescence in situ hybridization revealed that NP3R is located at the ends and/or in interstitial regions of all 10 chromosomes and NP4R on the terminal regions of three chromosomes in the haploid genome of N. plumbaginifolia. Sequence homology search revealed that not only are NP3R and NP4R homologous to HRS60 and GRS, respectively, two tandem repeats isolated from N. tabacum, but that NP3R and NP4R are also related to each other, suggesting that they originated from a common ancestral sequence. The role of these repeated sequences in chromosome healing is discussed based on the observation that two to three copies of a telomere-similar sequence were present in each repeating unit of NP3R and NP4R.

  6. Molecular tandem repeat strategy for elucidating mechanical properties of high-strength proteins

    PubMed Central

    Jung, Huihun; Pena-Francesch, Abdon; Saadat, Alham; Sebastian, Aswathy; Kim, Dong Hwan; Hamilton, Reginald F.; Albert, Istvan; Allen, Benjamin D.; Demirel, Melik C.

    2016-01-01

    Many globular and structural proteins have repetitions in their sequences or structures. However, a clear relationship between these repeats and their contribution to the mechanical properties remains elusive. We propose a new approach for the design and production of synthetic polypeptides that comprise one or more tandem copies of a single unit with distinct amorphous and ordered regions. Our designed sequences are based on a structural protein produced in squid suction cups that has a segmented copolymer structure with amorphous and crystalline domains. We produced segmented polypeptides with varying repeat number, while keeping the lengths and compositions of the amorphous and crystalline regions fixed. We showed that mechanical properties of these synthetic proteins could be tuned by modulating their molecular weights. Specifically, the toughness and extensibility of synthetic polypeptides increase as a function of the number of tandem repeats. This result suggests that the repetitions in native squid proteins could have a genetic advantage for increased toughness and flexibility. PMID:27222581

  7. The complete mitochondrial genome of the cryptic "lineage B" big-fin reef squid, Sepioteuthis lessoniana (Cephalopoda: Loliginidae) in Indo-West Pacific.

    PubMed

    Shen, Kang-Ning; Yen, Ta-Chi; Chen, Ching-Hung; Ye, Jeng-Jia; Hsiao, Chung-Der

    2016-05-01

    In this study, the complete mitogenome sequence of the cryptic "lineage B" big-fin reef squid, Sepioteuthis lessoniana (Cephalopoda: Loliginidae) has been sequenced by next-generation sequencing method. The assembled mitogenome consisting of 16,694 bp, includes 13 protein coding genes, 25 transfer RNAs, 2 ribosomal RNAs genes. The overall base composition of "lineage B" S. lessoniana is 36.7% for A, 18.9 % for C, 34.5 % for T and 9.8 % for G and show 90% identities to "lineage C" S. lessoniana. It is also exhibits high T + A content (71.2%), two non-coding regions with TA tandem repeats. The complete mitogenome of the cryptic "lineage B" S. lessoniana provides essential and important DNA molecular data for further phylogeography and evolutionary analysis for big-fin reef squid species complex.

  8. The complete mitochondrial genome of the cryptic "lineage A" big-fin reef squid, Sepioteuthis lessoniana (Cephalopoda: Loliginidae) in Indo-West Pacific.

    PubMed

    Hsiao, Chung-Der; Shen, Kang-Ning; Ching, Tzu-Yun; Wang, Ya-Hsien; Ye, Jeng-Jia; Tsai, Shiou-Yi; Wu, Shan-Chun; Chen, Ching-Hung; Wang, Chia-Hui

    2016-07-01

    In this study, the complete mitogenome sequence of the cryptic "lineage A" big-fin reef squid, Sepioteuthis lessoniana (Cephalopoda: Loliginidae) has been sequenced by the next-generation sequencing method. The assembled mitogenome consists of 16,605 bp, which includes 13 protein-coding genes, 22 transfer RNAs, and 2 ribosomal RNAs genes. The overall base composition of "lineage A" S. lessoniana is 37.5% for A, 17.4% for C, 9.1% for G, and 35.9% for T and shows 87% identities to "lineage C" S. lessoniana. It is also noticed by its high T + A content (73.4%), two non-coding regions with TA tandem repeats. The complete mitogenome of the cryptic "lineage A" S. lessoniana provides essential and important DNA molecular data for further phylogeography and evolutionary analysis for big-fin reef squid species complex.

  9. Molecular basis of length polymorphism in the human zeta-globin gene complex.

    PubMed Central

    Goodbourn, S E; Higgs, D R; Clegg, J B; Weatherall, D J

    1983-01-01

    The length polymorphism between the human zeta-globin gene and its pseudogene is caused by an allele-specific variation in the copy number of a tandemly repeating 36-base-pair sequence. This sequence is related to a tandemly repeated 14-base-pair sequence in the 5' flanking region of the human insulin gene, which is known to cause length polymorphism, and to a repetitive sequence in intervening sequence (IVS) 1 of the pseudo-zeta-globin gene. Evidence is presented that the latter is also of variable length, probably because of differences in the copy number of the tandem repeat. The homology between the three length polymorphisms may be an indication of the presence of a more widespread group of related sequences in the human genome, which might be useful for generalized linkage studies. PMID:6308667

  10. The minisatellite of the GPI/AMF/NLK/MF gene: interspecies conservation and transcriptional activity.

    PubMed

    Williams, R R; Hassan-Walker, A F; Lavender, F L; Morgan, M; Faik, P; Ragoussis, J

    2001-05-16

    Minisatellites are tandemly repeated DNA sequences found throughout the genomes of all eukaryotes. They are regions often prone to instability and hence hypervariability; thus repeat unit sequence is generally not conserved beyond closely related species. We have studied the minisatellite located in intron 9 of the human glucose phosphate isomerase (GPI) gene (also known as neuroleukin, autocrine motility factor, maturation and differentiation factor) and have found, by Zoo blotting coupled with PCR amplification and DNA sequencing, that similar repeat units are present in seven other species of mammal. There is also evidence for the presence of the minisatellite in chicken. The repeat unit does not appear to be present at any other locus in these genomes. Minisatellite DNA has been reported to be involved in recombination activity, control of gene expression of nearby gene(s) (both transcriptional and translational), whilst others form protein coding regions. The high level of conservation exhibited by the GPI minisatellite, coupled with the unique location, strongly suggests a functional role. Our results from transient and stable transfections using luciferase reporter constructs have shown that the GPI minisatellite region can act to increase transcription from the SV40 promoter, CMV promoter and the human GPI promoter.

  11. Clustering of Tuberculosis Cases Based on Variable-Number Tandem-Repeat Typing in Relation to the Population Structure of Mycobacterium tuberculosis in the Netherlands

    PubMed Central

    Sloot, Rosa; Borgdorff, Martien W.; de Beer, Jessica L.; van Ingen, Jakko; Supply, Philip

    2013-01-01

    The population structure of 3,776 Mycobacterium tuberculosis isolates was determined using variable-number tandem-repeat (VNTR) typing. The degree of clonality was so high that a more relaxed definition of clustering cannot be applied. Among recent immigrants with non-Euro-American isolates, transmission is overestimated if based on identical VNTR patterns. PMID:23658260

  12. Intergenic Variable-Number Tandem-Repeat Polymorphism Upstream of rocA Alters Toxin Production and Enhances Virulence in Streptococcus pyogenes.

    PubMed

    Zhu, Luchang; Olsen, Randall J; Horstmann, Nicola; Shelburne, Samuel A; Fan, Jia; Hu, Ye; Musser, James M

    2016-07-01

    Variable-number tandem-repeat (VNTR) polymorphisms are ubiquitous in bacteria. However, only a small fraction of them has been functionally studied. Here, we report an intergenic VNTR polymorphism that confers an altered level of toxin production and increased virulence in Streptococcus pyogenes The nature of the polymorphism is a one-unit deletion in a three-tandem-repeat locus upstream of the rocA gene encoding a sensor kinase. S. pyogenes strains with this type of polymorphism cause human infection and produce significantly larger amounts of the secreted cytotoxins S. pyogenes NADase (SPN) and streptolysin O (SLO). Using isogenic mutant strains, we demonstrate that deleting one or more units of the tandem repeats abolished RocA production, reduced CovR phosphorylation, derepressed multiple CovR-regulated virulence factors (such as SPN and SLO), and increased virulence in a mouse model of necrotizing fasciitis. The phenotypic effect of the VNTR polymorphism was nearly the same as that of inactivating the rocA gene. In summary, we identified and characterized an intergenic VNTR polymorphism in S. pyogenes that affects toxin production and virulence. These new findings enhance understanding of rocA biology and the function of VNTR polymorphisms in S. pyogenes. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  13. Development of Multiple-Locus Variable-Number Tandem-Repeat Analysis for Molecular Subtyping of Campylobacter jejuni by Using Capillary Electrophoresis

    PubMed Central

    Techaruvichit, Punnida; Vesaratchavest, Mongkol; Keeratipibul, Suwimon; Kuda, Takashi; Kimura, Bon

    2015-01-01

    Campylobacter jejuni is a common cause of the frequently reported food-borne diseases in developed and developing nations. This study describes the development of multiple-locus variable-number tandem-repeat (VNTR) analysis (MLVA) using capillary electrophoresis as a novel typing method for microbial source tracking and epidemiological investigation of C. jejuni. Among 36 tandem repeat loci detected by the Tandem Repeat Finder program, 7 VNTR loci were selected and used for characterizing 60 isolates recovered from chicken meat samples from retail shops, samples from chicken meat processing factory, and stool samples. The discrimination ability of MLVA was compared with that of multilocus sequence typing (MLST). MLVA (diversity index of 0.97 with 31 MLVA types) provided slightly higher discrimination than MLST (diversity index of 0.95 with 25 MLST types). The overall concordance between MLVA and MLST was estimated at 63% by adjusted Rand coefficient. MLVA predicted MLST type better than MLST predicted MLVA type, as reflected by Wallace coefficient (Wallace coefficient for MLVA to MLST versus MLST to MLVA, 86% versus 51%). MLVA is a useful tool and can be used for effective monitoring of C. jejuni and investigation of epidemics caused by C. jejuni. PMID:26025899

  14. RepeatsDB-lite: a web server for unit annotation of tandem repeat proteins.

    PubMed

    Hirsh, Layla; Paladin, Lisanna; Piovesan, Damiano; Tosatto, Silvio C E

    2018-05-09

    RepeatsDB-lite (http://protein.bio.unipd.it/repeatsdb-lite) is a web server for the prediction of repetitive structural elements and units in tandem repeat (TR) proteins. TRs are a widespread but poorly annotated class of non-globular proteins carrying heterogeneous functions. RepeatsDB-lite extends the prediction to all TR types and strongly improves the performance both in terms of computational time and accuracy over previous methods, with precision above 95% for solenoid structures. The algorithm exploits an improved TR unit library derived from the RepeatsDB database to perform an iterative structural search and assignment. The web interface provides tools for analyzing the evolutionary relationships between units and manually refine the prediction by changing unit positions and protein classification. An all-against-all structure-based sequence similarity matrix is calculated and visualized in real-time for every user edit. Reviewed predictions can be submitted to RepeatsDB for review and inclusion.

  15. Evolution of Transcription Activator-Like Effectors in Xanthomonas oryzae

    PubMed Central

    Erkes, Annett; Reschke, Maik; Boch, Jens

    2017-01-01

    Abstract Transcription activator-like effectors (TALEs) are secreted by plant–pathogenic Xanthomonas bacteria into plant cells where they act as transcriptional activators and, hence, are major drivers in reprogramming the plant for the benefit of the pathogen. TALEs possess a highly repetitive DNA-binding domain of typically 34 amino acid (AA) tandem repeats, where AA 12 and 13, termed repeat variable di-residue (RVD), determine target specificity. Different Xanthomonas strains possess different repertoires of TALEs. Here, we study the evolution of TALEs from the level of RVDs determining target specificity down to the level of DNA sequence with focus on rice-pathogenic Xanthomonas oryzae pv. oryzae (Xoo) and Xanthomonas oryzae pv. oryzicola (Xoc) strains. We observe that codon pairs coding for individual RVDs are conserved to a similar degree as the flanking repeat sequence. We find strong indications that TALEs may evolve 1) by base substitutions in codon pairs coding for RVDs, 2) by recombination of N-terminal or C-terminal regions of existing TALEs, or 3) by deletion of individual TALE repeats, and we propose possible mechanisms. We find indications that the reassortment of TALE genes in clusters is mediated by an integron-like mechanism in Xoc. We finally study the effect of the presence/absence and evolutionary modifications of TALEs on transcriptional activation of putative target genes in rice, and find that even single RVD swaps may lead to considerable differences in activation. This correlation allowed a refined prediction of TALE targets, which is the crucial step to decipher their virulence activity. PMID:28637323

  16. Coherent Somatic Mutation in Autoimmune Disease

    PubMed Central

    Ross, Kenneth Andrew

    2014-01-01

    Background Many aspects of autoimmune disease are not well understood, including the specificities of autoimmune targets, and patterns of co-morbidity and cross-heritability across diseases. Prior work has provided evidence that somatic mutation caused by gene conversion and deletion at segmentally duplicated loci is relevant to several diseases. Simple tandem repeat (STR) sequence is highly mutable, both somatically and in the germ-line, and somatic STR mutations are observed under inflammation. Results Protein-coding genes spanning STRs having markers of mutability, including germ-line variability, high total length, repeat count and/or repeat similarity, are evaluated in the context of autoimmunity. For the initiation of autoimmune disease, antigens whose autoantibodies are the first observed in a disease, termed primary autoantigens, are informative. Three primary autoantigens, thyroid peroxidase (TPO), phogrin (PTPRN2) and filaggrin (FLG), include STRs that are among the eleven longest STRs spanned by protein-coding genes. This association of primary autoantigens with long STR sequence is highly significant (). Long STRs occur within twenty genes that are associated with sixteen common autoimmune diseases and atherosclerosis. The repeat within the TTC34 gene is an outlier in terms of length and a link with systemic lupus erythematosus is proposed. Conclusions The results support the hypothesis that many autoimmune diseases are triggered by immune responses to proteins whose DNA sequence mutates somatically in a coherent, consistent fashion. Other autoimmune diseases may be caused by coherent somatic mutations in immune cells. The coherent somatic mutation hypothesis has the potential to be a comprehensive explanation for the initiation of many autoimmune diseases. PMID:24988487

  17. The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.

    PubMed

    Qian, Jun; Song, Jingyuan; Gao, Huanhuan; Zhu, Yingjie; Xu, Jiang; Pang, Xiaohui; Yao, Hui; Sun, Chao; Li, Xian'en; Li, Chuyuan; Liu, Juyan; Xu, Haibin; Chen, Shilin

    2013-01-01

    Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contribute to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.

  18. A novel tandem repeat sequence located on human chromosome 4p: isolation and characterization.

    PubMed

    Kogi, M; Fukushige, S; Lefevre, C; Hadano, S; Ikeda, J E

    1997-06-01

    In an effort to analyze the genomic region of the distal half of human chromosome 4p, to where Huntington disease and other diseases have been mapped, we have isolated the cosmid clone (CRS447) that was likely to contain a region with specific repeat sequences. Clone CRS447 was subjected to detailed analysis, including chromosome mapping, restriction mapping, and DNA sequencing. Chromosome mapping by both a human-CHO hybrid cell panel and FISH revealed that CRS447 was predominantly located in the 4p15.1-15.3 region. CRS447 was shown to consist of tandem repeats of 4.7-kb units present on chromosome 4p. A single EcoRI unit was subcloned (pRS447), and the complete sequence was determined as 4752 nucleotides. When pRS447 was used as a probe, the number of copies of this repeat per haploid genome was estimated to be 50-70. Sequence analysis revealed that it contained two internal CA repeats and one putative ORF. Database search established that this sequence was unreported. However, two homologous STS markers were found in the database. We concluded that CRS447/pRS447 is a novel tandem repeat sequence that is mainly specific to human chromosome 4p.

  19. A complete mitochondrial genome sequence of Asian black bear Sichuan subspecies (Ursus thibetanus mupinensis)

    PubMed Central

    Hou, Wan-ru; Chen, Yu; Wu, Xia; Hu, Jin-chu; Peng, Zheng-song; Yang, Jung; Tang, Zong-xiang; Zhou, Cai-Quan; Li, Yu-ming; Yang, Shi-kui; Du, Yu-jie; Kong, Ling-lu; Ren, Zheng-long; Zhang, Huai-yu; Shuai, Su-rong

    2007-01-01

    We obtained the complete mitochondrial genome of U.thibetanus mupinensis by DNA sequencing based on the PCR fragments of 18 primers we designed. The results indicate that the mtDNA is 16 868 bp in size, encodes 13 protein genes, 22 tRNA genes, and 2 rRNA genes, with an overall H-strand base composition of 31.2% A, 25.4% C, 15.5% G and 27.9% T. The sequence of the control region (CR) located between tRNA-Pro and tRNA-Phe is 1422 bp in size, consists of 8.43% of the whole genome, GC content is 51.9% and has a 6bp tandem repeat and two 10bp tandem repeats identified by using the Tandem Repeats Finder. U. thibetanus mupinensis mitochondrial genome shares high similarity with those of three other Ursidae: U. americanus (91.46%), U. arctos (89.25%) and U. maritimus (87.66%). PMID:17205108

  20. Chicken microsatellite markers isolated from libraries enriched for simple tandem repeats.

    PubMed

    Gibbs, M; Dawson, D A; McCamley, C; Wardle, A F; Armour, J A; Burke, T

    1997-12-01

    The total number of microsatellite loci is considered to be at least 10-fold lower in avian species than in mammalian species. Therefore, efficient large-scale cloning of chicken microsatellites, as required for the construction of a high-resolution linkage map, is facilitated by the construction of libraries using an enrichment strategy. In this study, a plasmid library enriched for tandem repeats was constructed from chicken genomic DNA by hybridization selection. Using this technique the proportion of recombinant clones that cross-hybridized to probes containing simple tandem repeats was raised to 16%, compared with < 0.1% in a non-enriched library. Primers were designed from 121 different sequences. Polymerase chain reaction (PCR) analysis of two chicken reference pedigrees enabled 72 loci to be localized within the collaborative chicken genetic map, and at least 30 of the remaining loci have been shown to be informative in these or other crosses.

  1. [Molecular cloning and characterization of a novel Clonorchis sinensis antigenic protein containing tandem repeat sequences].

    PubMed

    Liu, Qian; Xu, Xue-Nian; Zhou, Yan; Cheng, Na; Dong, Yu-Ting; Zheng, Hua-Jun; Zhu, Yong-Qiang; Zhu, Yong-Qiang

    2013-08-01

    To find and clone new antigen genes from the lambda-ZAP cDNA expression library of adult Clonorchis sinensis, and determine the immunological characteristics of the recombinant proteins. The cDNA expression library of adult C. sinensis was screened by pooled sera of clonorchiasis patients. The sequences of the positive phage clones were compared with the sequences in EST database, and the full-length sequence of the gene (Cs22 gene) was obtained by RT-PCR. cDNA fragments containing 2 and 3 times tandem repeat sequences were generated by jumping PCR. The sequence encoding the mature peptide or the tandem repeat sequence was respectively cloned into the prokaryotic expression vector pET28a (+), and then transformed into E. coli Rosetta DE3 cells for expression. The recombinant proteins (rCs22-2r, rCs22-3r, rCs22M-2r, and rCs22M-3r) were purified by His-bind-resin (Ni-NTA) affinity chromatography. The immunogenicity of rCs22-2r and rCs22-3r was identified by ELISA. To evaluate the immunological diagnostic value of rCs22-2r and rCs22-3r, serum samples from 35 clonorchiasis patients, 31 healthy individuals, 15 schistosomiasis patients, 15 paragonimiasis westermani patients and 13 cysticercosis patients were examined by ELISA. To locate antigenic determinants, the pooled sera of clonorchiasis patients and healthy persons were analyzed for specific antibodies by ELISA with recombinant protein rCs22M-2r and rCs22M-3r containing the tandem repeat sequences. The full-length sequence of Cs22 antigen gene of C. sinensis was obtained. It contained 13 times tandem repeat sequences of EQQDGDEEGMGGDGGRGKEKGKVEGEDGAGEQKEQA. Bioinformatics analysis indicated that the protein (Cs22) belonged to GPI-anchored proteins family. The recombinant proteins rCs22-2r and rCs22-3r showed a certain level of immunogenicity. The positive rate by ELISA coated with the purified PrCs22-2r and PrCs22-3r for sera of clonorchiasis patients both were 45.7% (16/35), and 3.2% (1/31) for those of healthy persons. There was no cross reaction with sera of schistosomiasis and cysticercosis patients. The cross reaction with sera of paragonimiasis westermani patients was 1/15. The recombinant proteins rCs22M-2r and rCs22M-3r which only contained tandem repeats were specifically recognized by pooled sera of clonorchiasis patients. The Cs22 antigen gene of Clonorchis sinensis is obtained, and the recombinant proteins have certain diagnostic value. The antigenic determinant is located in tandem repeat sequences.

  2. Detecting long tandem duplications in genomic sequences.

    PubMed

    Audemard, Eric; Schiex, Thomas; Faraut, Thomas

    2012-05-08

    Detecting duplication segments within completely sequenced genomes provides valuable information to address genome evolution and in particular the important question of the emergence of novel functions. The usual approach to gene duplication detection, based on all-pairs protein gene comparisons, provides only a restricted view of duplication. In this paper, we introduce ReD Tandem, a software using a flow based chaining algorithm targeted at detecting tandem duplication arrays of moderate to longer length regions, with possibly locally weak similarities, directly at the DNA level. On the A. thaliana genome, using a reference set of tandem duplicated genes built using TAIR,(a) we show that ReD Tandem is able to predict a large fraction of recently duplicated genes (dS  <  1) and that it is also able to predict tandem duplications involving non coding elements such as pseudo-genes or RNA genes. ReD Tandem allows to identify large tandem duplications without any annotation, leading to agnostic identification of tandem duplications. This approach nicely complements the usual protein gene based which ignores duplications involving non coding regions. It is however inherently restricted to relatively recent duplications. By recovering otherwise ignored events, ReD Tandem gives a more comprehensive view of existing evolutionary processes and may also allow to improve existing annotations.

  3. The complete sequence of the mitochondrial genome of Arctic fox (Alopex lagopus).

    PubMed

    Yan, Shou-Qing; Guo, Peng-Cheng; Yue, Yuan; Li, Wan-Hong; Bai, Chun-Yan; Li, Yu-Mei; Sun, Jin-Hai; Zhao, Zhi-Hui

    2016-11-01

    In the present study, the complete mitochondrial genome sequence of Arctic fox (Alopex lagopus) was determined for the first time. It has a total length of 16,656 bp, and contains 13 protein-coding genes, 22 tRNA genes, 2 ribosome RNA genes and 1 control region. The nucleotide composition is 31.3% for A, 26.2% for C, 14.8% for G and 27.7% for T, respectively. The D-loop region located between tRNA Pro and tRNA Phe contains a (ACACGTACACGCAT) 18 tandem repeat array. The data will be useful for the investigation of the genetic structure and diversity in the natural and farmed population of Arctic foxes.

  4. Heme oxygenase-1 gene promoter microsatellite polymorphism is associated with progressive atherosclerosis and incident cardiovascular disease.

    PubMed

    Pechlaner, Raimund; Willeit, Peter; Summerer, Monika; Santer, Peter; Egger, Georg; Kronenberg, Florian; Demetz, Egon; Weiss, Günter; Tsimikas, Sotirios; Witztum, Joseph L; Willeit, Karin; Iglseder, Bernhard; Paulweber, Bernhard; Kedenko, Lyudmyla; Haun, Margot; Meisinger, Christa; Gieger, Christian; Müller-Nurasyid, Martina; Peters, Annette; Willeit, Johann; Kiechl, Stefan

    2015-01-01

    The enzyme heme oxygenase-1 (HO-1) exerts cytoprotective effects in response to various cellular stressors. A variable number tandem repeat polymorphism in the HO-1 gene promoter region has previously been linked to cardiovascular disease. We examined this association prospectively in the general population. Incidence of stroke, myocardial infarction, or vascular death was registered between 1995 and 2010 in 812 participants of the Bruneck Study aged 45 to 84 years (49.4% males). Carotid atherosclerosis progression was quantified by high-resolution ultrasound. HO-1 variable number tandem repeat length was determined by polymerase chain reaction. Subjects with ≥32 tandem repeats on both HO-1 alleles compared with the rest of the population (recessive trait) featured substantially increased cardiovascular disease risk (hazard ratio [95% confidence interval], 5.45 [2.39, 12.42]; P<0.0001), enhanced atherosclerosis progression (median difference in atherosclerosis score [interquartile range], 2.1 [0.8, 5.6] versus 0.0 [0.0, 2.2] mm; P=0.0012), and a trend toward higher levels of oxidized phospholipids on apolipoprotein B-100 (median oxidized phospholipids/apolipoprotein B level [interquartile range], 11364 [4160, 18330] versus 4844 [3174, 12284] relative light units; P=0.0554). Increased cardiovascular disease risk in those homozygous for ≥32 repeats was also detected in a pooled analysis of 7848 participants of the Bruneck, SAPHIR, and KORA prospective studies (hazard ratio [95% confidence interval], 3.26 [1.50, 7.33]; P=0.0043). This study found a strong association between the HO-1 variable number tandem repeat polymorphism and cardiovascular disease risk confined to subjects with a high number of repeats on both HO-1 alleles and provides evidence for accelerated atherogenesis and decreased antioxidant defense in this vascular high-risk group. © 2014 American Heart Association, Inc.

  5. Fingerprinting of Cyanobacteria Based on PCR with Primers Derived from Short and Long Tandemly Repeated Repetitive Sequences

    PubMed Central

    Rasmussen, Ulla; Svenning, Mette M.

    1998-01-01

    The presence of repeated DNA (short tandemly repeated repetitive [STRR] and long tandemly repeated repetitive [LTRR]) sequences in the genome of cyanobacteria was used to generate a fingerprint method for symbiotic and free-living isolates. Primers corresponding to the STRR and LTRR sequences were used in the PCR, resulting in a method which generate specific fingerprints for individual isolates. The method was useful both with purified DNA and with intact cyanobacterial filaments or cells as templates for the PCR. Twenty-three Nostoc isolates from a total of 35 were symbiotic isolates from the angiosperm Gunnera species, including isolates from the same Gunnera species as well as from different species. The results show a genetic similarity among isolates from different Gunnera species as well as a genetic heterogeneity among isolates from the same Gunnera species. Isolates which have been postulated to be closely related or identical revealed similar results by the PCR method, indicating that the technique is useful for clustering of even closely related strains. The method was applied to nonheterocystus cyanobacteria from which a fingerprint pattern was obtained. PMID:16349487

  6. Application of multilocus variable number tandem repeat analysis to monitor Verocytotoxin-producing Escherichia coli O157 phage type 8 in England and Wales: emergence of a profile associated with a national outbreak.

    PubMed

    Perry, N; Cheasty, T; Dallman, T; Launders, N; Willshaw, G

    2013-10-01

    Evaluation of multilocus variable number tandem repeat analysis (MLVA) to subtype all isolates of Vero cytotoxin-producing Escherichia coli O157 phage type 8 in England and Wales. Over a 13 month period from December 2010, 483 isolates of VTEC O157 PT8 were tested by MLVA; 39% were received in the first 4 months of 2011, when infections are generally low. One profile, or single locus variants of it, was present in 249 (52%) isolates but was not common previously. These cases represented a national increase in PT8, associated epidemiologically with soil-contaminated vegetables. Most of the 177 other MLVA profiles were unique to a single isolate. Profiles shared by >1 isolate included cases from two small community, food-borne outbreaks and 11 households. Several shared profiles were found among 23 isolates without known links. Apart from one group, isolates linked to travel abroad had very diverse profiles. Multilocus variable number tandem repeat analysis discriminated apparent sporadic isolates of the same PT and assisted in detection of cases in an emerging national outbreak. Multilocus variable number tandem repeat analysis is an epidemiologically valid complement to surveillance and applicable as a rapid, practical test for large numbers of isolates. © 2013 The Society for Applied Microbiology.

  7. Tandem repeats of the 5' non-transcribed spacer of Tetrahymena rDNA function as high copy number autonomous replicons in the macronucleus but do not prevent rRNA gene dosage regulation.

    PubMed Central

    Pan, W J; Blackburn, E H

    1995-01-01

    The rRNA genes in the somatic macronucleus of Tetrahymena thermophila are normally on 21 kb linear palindromic molecules (rDNA). We examined the effect on rRNA gene dosage of transforming T.thermophila macronuclei with plasmid constructs containing a pair of tandemly repeated rDNA replication origin regions unlinked to the rRNA gene. A significant proportion of the plasmid sequences were maintained as high copy circular molecules, eventually consisting solely of tandem arrays of origin regions. As reported previously for cells transformed by a construct in which the same tandem rDNA origins were linked to the rRNA gene [Yu, G.-L. and Blackburn, E. H. (1990) Mol. Cell. Biol., 10, 2070-2080], origin sequences recombined to form linear molecules bearing several tandem repeats of the origin region, as well as rRNA genes. The total number of rDNA origin sequences eventually exceeded rRNA gene copies by approximately 20- to 40-fold and the number of circular replicons carrying only rDNA origin sequences exceeded rRNA gene copies by 2- to 3-fold. However, the rRNA gene dosage was unchanged. Hence, simply monitoring the total number of rDNA origin regions is not sufficient to regulate rRNA gene copy number. Images PMID:7784211

  8. TANDEM: matching proteins with tandem mass spectra.

    PubMed

    Craig, Robertson; Beavis, Ronald C

    2004-06-12

    Tandem mass spectra obtained from fragmenting peptide ions contain some peptide sequence specific information, but often there is not enough information to sequence the original peptide completely. Several proprietary software applications have been developed to attempt to match the spectra with a list of protein sequences that may contain the sequence of the peptide. The application TANDEM was written to provide the proteomics research community with a set of components that can be used to test new methods and algorithms for performing this type of sequence-to-data matching. The source code and binaries for this software are available at http://www.proteome.ca/opensource.html, for Windows, Linux and Macintosh OSX. The source code is made available under the Artistic License, from the authors.

  9. Mitochondrial genomes of the green macroalga Ulva pertusa (Ulvophyceae, Chlorophyta): novel insights into the evolution of mitogenomes in the Ulvophyceae.

    PubMed

    Liu, Feng; Melton, James T; Bi, Yuping

    2017-10-01

    To further understand the trends in the evolution of mitochondrial genomes (mitogenomes or mtDNAs) in the Ulvophyceae, the mitogenomes of two separate thalli of Ulva pertusa were sequenced. Two U. pertusa mitogenomes (Up1 and Up2) were 69,333 bp and 64,602 bp in length. These mitogenomes shared two ribosomal RNAs (rRNAs), 28 transfer RNAs (tRNAs), 29 protein-coding genes, and 12 open reading frames. The 4.7 kb difference in size was attributed to variation in intron content and tandem repeat regions. A total of six introns were present in the smaller U. pertusa mtDNA (Up2), while the larger mtDNA (Up1) had eight. The larger mtDNA had two additional group II introns in two genes (cox1 and cox2) and tandem duplication mutations in noncoding regions. Our results showed the first case of intraspecific variation in chlorophytan mitogenomes and provided further genomic data for the undersampled Ulvophyceae. © 2017 Phycological Society of America.

  10. Guide to the TANDEM System for the Modern Languages Department Tape Library: A Non-Technical Guide for Teachers.

    ERIC Educational Resources Information Center

    Hounsell, D.; And Others

    This guide for teachers to the tape indexing system (TANDEM) in use at the Modern Languages Department at Portsmouth Polytechnic focuses on tape classification, numbering, labeling, and shelving system procedures. The appendixes contain information on: (1) the classification system and related codes, (2) color and letter codes, (3) marking of tape…

  11. Satellite DNA Modulates Gene Expression in the Beetle Tribolium castaneum after Heat Stress

    PubMed Central

    Feliciello, Isidoro; Akrap, Ivana; Ugarković, Đurđica

    2015-01-01

    Non-coding repetitive DNAs have been proposed to perform a gene regulatory role, however for tandemly repeated satellite DNA no such role was defined until now. Here we provide the first evidence for a role of satellite DNA in the modulation of gene expression under specific environmental conditions. The major satellite DNA TCAST1 in the beetle Tribolium castaneum is preferentially located within pericentromeric heterochromatin but is also dispersed as single repeats or short arrays in the vicinity of protein-coding genes within euchromatin. Our results show enhanced suppression of activity of TCAST1-associated genes and slower recovery of their activity after long-term heat stress relative to the same genes without associated TCAST1 satellite DNA elements. The level of gene suppression is not influenced by the distance of TCAST1 elements from the associated genes up to 40 kb from the genes’ transcription start sites, but it does depend on the copy number of TCAST1 repeats within an element, being stronger for the higher number of copies. The enhanced gene suppression correlates with the enrichment of the repressive histone marks H3K9me2/3 at dispersed TCAST1 elements and their flanking regions as well as with increased expression of TCAST1 satellite DNA. The results reveal transient, RNAi based heterochromatin formation at dispersed TCAST1 repeats and their proximal regions as a mechanism responsible for enhanced silencing of TCAST1-associated genes. Differences in the pattern of distribution of TCAST1 elements contribute to gene expression diversity among T. castaneum strains after long-term heat stress and might have an impact on adaptation to different environmental conditions. PMID:26275223

  12. No evidence of radiation effect on mutation rates at hypervariable minisatellite loci in the germ cells of atomic bomb survivors.

    PubMed

    Kodaira, Mieko; Izumi, Shizue; Takahashi, Norio; Nakamura, Nori

    2004-10-01

    Human minisatellites consist of tandem arrays of short repeat sequences, and some are highly polymorphic in numbers of repeats among individuals. Since these loci mutate much more frequently than coding sequences, they make attractive markers for screening populations for genetic effects of mutagenic agents. Here we report the results of our analysis of mutations at eight hypervariable minisatellite loci in the offspring (61 from exposed families in 60 of which only one parent was exposed, and 58 from unexposed parents) of atomic bomb survivors with mean doses of >1 Sv. We found 44 mutations in paternal alleles and eight mutations in maternal alleles with no indication that the high doses of acutely applied radiation had caused significant genetic effects. Our finding contrasts with those of some other studies in which much lower radiation doses, applied chronically, caused significantly increased mutation rates. Possible reasons for this discrepancy are discussed.

  13. An improved genome assembly uncovers prolific tandem repeats in Atlantic cod.

    PubMed

    Tørresen, Ole K; Star, Bastiaan; Jentoft, Sissel; Reinar, William B; Grove, Harald; Miller, Jason R; Walenz, Brian P; Knight, James; Ekholm, Jenny M; Peluso, Paul; Edvardsen, Rolf B; Tooming-Klunderud, Ave; Skage, Morten; Lien, Sigbjørn; Jakobsen, Kjetill S; Nederbragt, Alexander J

    2017-01-18

    The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance.

  14. Tandem repeated application of organic solvents and sodium lauryl sulphate enhances cumulative skin irritation.

    PubMed

    Schliemann, Sibylle; Schmidt, Christina; Elsner, Peter

    2014-01-01

    The objective of our study was to investigate the tandem irritation potential of two organic solvents with concurrent exposure to the hydrophilic detergent irritant sodium lauryl sulphate (SLS). A tandem repeated irritation test was performed with two undiluted organic solvents, cumene (C) and octane (O), with either alternating application with SLS 0.5% or twice daily application of each irritant alone in 27 volunteers on the skin of the back. The cumulative irritation induced over 4 days was quantified using visual scoring and non-invasive bioengineering measurements (skin colour reflectance, skin hydration and transepidermal water loss). Repeated application of C/SLS and O/SLS induced more decline of stratum corneum hydration and higher degrees of clinical irritation and erythema compared to each irritant alone. Our results demonstrate a further example of additive harmful skin effects induced by particular skin irritants and indicate that exposure to organic solvents together with detergents may increase the risk of acquiring occupational contact dermatitis. © 2014 S. Karger AG, Basel.

  15. Thermal denaturation of the BRCT tandem repeat region of human tumour suppressor gene product BRCA1.

    PubMed

    Pyrpassopoulos, Serapion; Ladopoulou, Angela; Vlassi, Metaxia; Papanikolau, Yannis; Vorgias, Constantinos E; Yannoukakos, Drakoulis; Nounesis, George

    2005-04-01

    Reduced stability of the tandem BRCT domains of human BReast CAncer 1 (BRCA1) due to missense mutations may be critical for loss of function in DNA repair and damage-induced checkpoint control. In the present thermal denaturation study of the BRCA1 BRCT region, high-precision differential scanning calorimetry (DSC) and circular dichroism (CD) spectroscopy provide evidence for the existence of a denatured state that is structurally very similar to the native. Consistency between theoretical structure-based estimates of the enthalpy (DeltaH) and heat capacity change (DeltaCp) and the calorimetric results is obtained when considering partial thermal unfolding contained in the region of the conserved hydrophobic pocket formed at the interface of the two BRCT repeats. The structural integrity of this region has been shown to be crucial for the interaction of BRCA1 with phosphorylated peptides. In addition, cancer-causing missense mutations located at the inter-BRCT-repeat interface have been linked to the destabilization of the tandem BRCT structure.

  16. STRBase: a short tandem repeat DNA database for the human identity testing community

    PubMed Central

    Ruitberg, Christian M.; Reeder, Dennis J.; Butler, John M.

    2001-01-01

    The National Institute of Standards and Technology (NIST) has compiled and maintained a Short Tandem Repeat DNA Internet Database (http://www.cstl.nist.gov/biotech/strbase/) since 1997 commonly referred to as STRBase. This database is an information resource for the forensic DNA typing community with details on commonly used short tandem repeat (STR) DNA markers. STRBase consolidates and organizes the abundant literature on this subject to facilitate on-going efforts in DNA typing. Observed alleles and annotated sequence for each STR locus are described along with a review of STR analysis technologies. Additionally, commercially available STR multiplex kits are described, published polymerase chain reaction (PCR) primer sequences are reported, and validation studies conducted by a number of forensic laboratories are listed. To supplement the technical information, addresses for scientists and hyperlinks to organizations working in this area are available, along with the comprehensive reference list of over 1300 publications on STRs used for DNA typing purposes. PMID:11125125

  17. Molecular typing of Chinese Streptococcus pyogenes isolates.

    PubMed

    You, Yuanhai; Wang, Haibin; Bi, Zhenwang; Walker, Mark; Peng, Xianhui; Hu, Bin; Zhou, Haijian; Song, Yanyan; Tao, Xiaoxia; Kou, Zengqiang; Meng, Fanliang; Zhang, Menghan; Bi, Zhenqiang; Luo, Fengji; Zhang, Jianzhong

    2015-06-01

    Streptococcus pyogenes causes human infections ranging from mild pharyngitis and impetigo to serious diseases including necrotizing fasciitis and streptococcal toxic shock syndrome. The objective of this study was to compare molecular emm typing and pulsed field gel electrophoresis (PFGE) with multiple-locus variable-number tandem-repeat analysis (MLVA) for genotyping of Chinese S. pyogenes isolates. Molecular emm typing and PFGE were performed using standard protocols. Seven variable number tandem repeat (VNTR) loci reported in a previous study were used to genotype 169 S. pyogenes geographically-diverse isolates from China isolated from a variety of disease syndromes. Multiple-locus variable-number tandem-repeat analysis provided greater discrimination between isolates when compared to emm typing and PFGE. Removal of a single VNTR locus (Spy2) reduced the sensitivity by only 0.7%, which suggests that Spy2 was not informative for the isolates screened. The results presented support the use of MLVA as a powerful epidemiological tool for genotyping S. pyogenes clinical isolates. Copyright © 2015 Elsevier Ltd. All rights reserved.

  18. Full-length model of the human galectin-4 and insights into dynamics of inter-domain communication

    NASA Astrophysics Data System (ADS)

    Rustiguel, Joane K.; Soares, Ricardo O. S.; Meisburger, Steve P.; Davis, Katherine M.; Malzbender, Kristina L.; Ando, Nozomi; Dias-Baruffi, Marcelo; Nonato, Maria Cristina

    2016-09-01

    Galectins are proteins involved in diverse cellular contexts due to their capacity to decipher and respond to the information encoded by β-galactoside sugars. In particular, human galectin-4, normally expressed in the healthy gastrointestinal tract, displays differential expression in cancerous tissues and is considered a potential drug target for liver and lung cancer. Galectin-4 is a tandem-repeat galectin characterized by two carbohydrate recognition domains connected by a linker-peptide. Despite their relevance to cell function and pathogenesis, structural characterization of full-length tandem-repeat galectins has remained elusive. Here, we investigate galectin-4 using X-ray crystallography, small- and wide-angle X-ray scattering, molecular modelling, molecular dynamics simulations, and differential scanning fluorimetry assays and describe for the first time a structural model for human galectin-4. Our results provide insight into the structural role of the linker-peptide and shed light on the dynamic characteristics of the mechanism of carbohydrate recognition among tandem-repeat galectins.

  19. A Lossy Compression Technique Enabling Duplication-Aware Sequence Alignment

    PubMed Central

    Freschi, Valerio; Bogliolo, Alessandro

    2012-01-01

    In spite of the recognized importance of tandem duplications in genome evolution, commonly adopted sequence comparison algorithms do not take into account complex mutation events involving more than one residue at the time, since they are not compliant with the underlying assumption of statistical independence of adjacent residues. As a consequence, the presence of tandem repeats in sequences under comparison may impair the biological significance of the resulting alignment. Although solutions have been proposed, repeat-aware sequence alignment is still considered to be an open problem and new efficient and effective methods have been advocated. The present paper describes an alternative lossy compression scheme for genomic sequences which iteratively collapses repeats of increasing length. The resulting approximate representations do not contain tandem duplications, while retaining enough information for making their comparison even more significant than the edit distance between the original sequences. This allows us to exploit traditional alignment algorithms directly on the compressed sequences. Results confirm the validity of the proposed approach for the problem of duplication-aware sequence alignment. PMID:22518086

  20. Multiple-locus variable-number tandem repeat analysis of Salmonella Enteritidis isolates from human and non-human sources using a single multiplex PCR

    PubMed Central

    Cho, Seongbeom; Boxrud, David J; Bartkus, Joanne M; Whittam, Thomas S; Saeed, Mahdi

    2007-01-01

    Simplified multiple-locus variable-number tandem repeat analysis (MLVA) was developed using one-shot multiplex PCR for seven variable-number tandem repeats (VNTR) markers with high diversity capacity. MLVA, phage typing, and PFGE methods were applied on 34 diverse Salmonella Enteritidis isolates from human and non-human sources. MLVA detected allelic variations that helped to classify the S. Enteritidis isolates into more evenly distributed subtypes than other methods. MLVA-based S. Enteritidis clonal groups were largely associated with sources of the isolates. Nei's diversity indices for polymorphism ranged from 0.25 to 0.70 for seven VNTR loci markers. Based on Simpson's and Shannon's diversity indices, MLVA had a higher discriminatory power than pulsed field gel electrophoresis (PFGE), phage typing, or multilocus enzyme electrophoresis. Therefore, MLVA may be used along with PFGE to enhance the effectiveness of the molecular epidemiologic investigation of S. Enteritidis infections. PMID:17692097

  1. Topological characteristics of helical repeat proteins.

    PubMed

    Groves, M R; Barford, D

    1999-06-01

    The recent elucidation of protein structures based upon repeating amino acid motifs, including the armadillo motif, the HEAT motif and tetratricopeptide repeats, reveals that they belong to the class of helical repeat proteins. These proteins share the common property of being assembled from tandem repeats of an alpha-helical structural unit, creating extended superhelical structures that are ideally suited to create a protein recognition interface.

  2. Multiple-Locus Variable-Number Tandem-Repeat Analysis in Genotyping Yersinia enterocolitica Strains from Human and Porcine Origins

    PubMed Central

    Laukkanen-Ninios, R.; Ortiz Martínez, P.; Siitonen, A.; Fredriksson-Ahomaa, M.; Korkeala, H.

    2013-01-01

    Sporadic and epidemiologically linked Yersinia enterocolitica strains (n = 379) isolated from fecal samples from human patients, tonsil or fecal samples from pigs collected at slaughterhouses, and pork samples collected at meat stores were genotyped using multiple-locus variable-number tandem-repeat analysis (MLVA) with six loci, i.e., V2A, V4, V5, V6, V7, and V9. In total, 312 different MLVA types were found. Similar types were detected (i) in fecal samples collected from human patients over 2 to 3 consecutive years, (ii) in samples from humans and pigs, and (iii) in samples from pigs that originated from the same farms. Among porcine strains, we found farm-specific MLVA profiles. Variations in the numbers of tandem repeats from one to four for variable-number tandem-repeat (VNTR) loci V2A, V5, V6, and V7 were observed within a farm. MLVA was applicable for serotypes O:3, O:5,27, and O:9 and appeared to be a highly discriminating tool for distinguishing sporadic and outbreak-related strains. With long-term use, interpretation of the results became more challenging due to variations in more-discriminating loci, as was observed for strains originating from pig farms. Additionally, we encountered unexpectedly short V2A VNTR fragments and sequenced them. According to the sequencing results, updated guidelines for interpreting V2A VNTR results were prepared. PMID:23637293

  3. Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies.

    PubMed Central

    Wincker, P; Jubier-Maurin, V; Roizès, G

    1987-01-01

    Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies. Images PMID:3684566

  4. Plant chromosomes from end to end: telomeres, heterochromatin and centromeres.

    PubMed

    Lamb, Jonathan C; Yu, Weichang; Han, Fangpu; Birchler, James A

    2007-04-01

    Recent evidence indicates that heterochromatin in plants is composed of heterogeneous sequences, which are usually composed of transposable elements or tandem repeat arrays. These arrays are associated with chromatin modifications that produce a closed configuration that limits transcription. Centromere sequences in plants are usually composed of tandem repeat arrays that are homogenized across the genome. Analysis of such arrays in closely related taxa suggests a rapid turnover of the repeat unit that is typical of a particular species. In addition, two lines of evidence for an epigenetic component of centromere specification have been reported, namely an example of a neocentromere formed over sequences without the typical repeat array and examples of centromere inactivation. Although the telomere repeat unit is quite prevalent in the plant kingdom, unusual repeats have been found in some families. Recently, it was demonstrated that the introduction of telomere sequences into plants cells causes truncation of the chromosomes, and that this technique can be used to produce artificial chromosome platforms.

  5. The solution structure of the pentatricopeptide repeat protein PPR10 upon binding atpH RNA

    PubMed Central

    Gully, Benjamin S.; Cowieson, Nathan; Stanley, Will A.; Shearston, Kate; Small, Ian D.; Barkan, Alice; Bond, Charles S.

    2015-01-01

    The pentatricopeptide repeat (PPR) protein family is a large family of RNA-binding proteins that is characterized by tandem arrays of a degenerate 35-amino-acid motif which form an α-solenoid structure. PPR proteins influence the editing, splicing, translation and stability of specific RNAs in mitochondria and chloroplasts. Zea mays PPR10 is amongst the best studied PPR proteins, where sequence-specific binding to two RNA transcripts, atpH and psaJ, has been demonstrated to follow a recognition code where the identity of two amino acids per repeat determines the base-specificity. A recently solved ZmPPR10:psaJ complex crystal structure suggested a homodimeric complex with considerably fewer sequence-specific protein–RNA contacts than inferred previously. Here we describe the solution structure of the ZmPPR10:atpH complex using size-exclusion chromatography-coupled synchrotron small-angle X-ray scattering (SEC-SY-SAXS). Our results support prior evidence that PPR10 binds RNA as a monomer, and that it does so in a manner that is commensurate with a canonical and predictable RNA-binding mode across much of the RNA–protein interface. PMID:25609698

  6. The proliferation marker pKi-67 becomes masked to MIB-1 staining after expression of its tandem repeats.

    PubMed

    Schmidt, Mirko H H; Broll, Rainer; Bruch, Hans-Peter; Duchrow, Michael

    2002-11-01

    The Ki-67 antigen, pKi-67, is one of the most commonly used markers of proliferating cells. The protein can only be detected in dividing cells (G(1)-, S-, G(2)-, and M-phase) but not in quiescent cells (G(0)). The standard antibody to detect pKi-67 is MIB-1, which detects the so-called 'Ki-67 motif' FKELF in 9 of the protein's 16 tandem repeats. To investigate the function of these repeats we expressed three of them in an inducible gene expression system in HeLa cells. Surprisingly, addition of a nuclear localization sequence led to a complete absence of signal in the nuclei of MIB-1-stained cells. At the same time antibodies directed against different epitopes of pKi-67 did not fail to detect the protein. We conclude that the overexpression of the 'Ki-67 motif', which is present in the repeats, can lead to inability of MIB-1 to detect its antigen as demonstrated in adenocarcinoma tissue samples. Thereafter, in order to prevent the underestimation of Ki-67 proliferation indices in MIB-1-labeled preparations, additional antibodies (for example, MIB-21) should be used. Additionally, we could show in a mammalian two-hybrid assay that recombinant pKi-67 repeats are capable of self-associating with endogenous pKi-67. Speculating that the tandem repeats are intimately involved in its protein-protein interactions, this offers new insights in how access to these repeats is regulated by pKi-67 itself.

  7. Medium-sized tandem repeats represent an abundant component of the Drosophila virilis genome.

    PubMed

    Abdurashitov, Murat A; Gonchar, Danila A; Chernukhin, Valery A; Tomilov, Victor N; Tomilova, Julia E; Schostak, Natalia G; Zatsepina, Olga G; Zelentsova, Elena S; Evgen'ev, Michael B; Degtyarev, Sergey K H

    2013-11-09

    Previously, we developed a simple method for carrying out a restriction enzyme analysis of eukaryotic DNA in silico, based on the known DNA sequences of the genomes. This method allows the user to calculate lengths of all DNA fragments that are formed after a whole genome is digested at the theoretical recognition sites of a given restriction enzyme. A comparison of the observed peaks in distribution diagrams with the results from DNA cleavage using several restriction enzymes performed in vitro have shown good correspondence between the theoretical and experimental data in several cases. Here, we applied this approach to the annotated genome of Drosophila virilis which is extremely rich in various repeats. Here we explored the combined approach to perform the restriction analysis of D. virilis DNA. This approach enabled to reveal three abundant medium-sized tandem repeats within the D. virilis genome. While the 225 bp repeats were revealed previously in intergenic non-transcribed spacers between ribosomal genes of D. virilis, two other families comprised of 154 bp and 172 bp repeats were not described. Tandem Repeats Finder search demonstrated that 154 bp and 172 bp units are organized in multiple clusters in the genome of D. virilis. Characteristically, only 154 bp repeats derived from Helitron transposon are transcribed. Using in silico digestion in combination with conventional restriction analysis and sequencing of repeated DNA fragments enabled us to isolate and characterize three highly abundant families of medium-sized repeats present in the D. virilis genome. These repeats comprise a significant portion of the genome and may have important roles in genome function and structural integrity. Therefore, we demonstrated an approach which makes possible to investigate in detail the gross arrangement and expression of medium-sized repeats basing on sequencing data even in the case of incompletely assembled and/or annotated genomes.

  8. Characterization of toxin-producing cyanobacteria by using an oligonucleotide probe containing a tandemly repeated heptamer.

    PubMed Central

    Rouhiainen, L; Sivonen, K; Buikema, W J; Haselkorn, R

    1995-01-01

    Cyanobacteria produce toxins that kill animals. The two main classes of cyanobacterial toxins are cyclic peptides that cause liver damage and alkaloids that block nerve transmission. Many toxin-producing strains from Finnish lakes were brought into axenic culture, and their toxins were characterized. Restriction fragment length polymorphism analysis, probing with a short tandemly repeated DNA sequence found at many locations in the chromosome of Anabaena sp. strain PCC 7120, distinguishes hepatotoxic Anabaena isolates from neurotoxin-producing strains and from Nostoc spp. PMID:7592362

  9. Analysis of tandem repeat units of the promoter of capsanthin/capsorubin synthase (Ccs) gene in pepper fruit.

    PubMed

    Tian, Shi-Lin; Li, Zheng; Li, Li; Shah, S N M; Gong, Zhen-Hui

    2017-07-01

    Capsanthin/capsorubin synthase ( Ccs ) gene is a key gene that regulates the synthesis of capsanthin and the development of red coloration in pepper fruits. There are three tandem repeat units in the promoter region of Ccs , but the potential effects of the number of repetitive units on the transcriptional regulation of Ccs has been unclear. In the present study, expression vectors carrying different numbers of repeat units of the Ccs promoter were constructed, and the transient expression of the β-glucuronidase ( GUS ) gene was used to detect differences in expression levels associated with the promoter fragments. These repeat fragments and the plant expression vector PBI121 containing the 35s CaMV promoter were ligated to form recombinant vectors that were transfected into Agrobacterium tumefaciens GV3101. A fluorescence spectrophotometer was used to analyze the expression associated with the various repeat units. It was concluded that the constructs containing at least one repeat were associated with GUS expression, though they did not differ from one another. This repeating unit likely plays a role in transcription and regulation of Ccs expression.

  10. Variation of Cats under Domestication: Genetic Assignment of Domestic Cats to Breeds and Worldwide Random Bred Populations

    PubMed Central

    Kurushima, J. D.; Lipinski, M. J.; Gandolfi, B.; Froenicke, L.; Grahn, J. C.; Grahn, R. A.; Lyons, L. A.

    2012-01-01

    Summary Both cat breeders and the lay public have interests in the origins of their pets, not only in the genetic identity of the purebred individuals, but also the historical origins of common household cats. The cat fancy is a relatively new institution with over 85% of its 40–50 breeds arising only in the past 75 years, primarily through selection on single-gene aesthetic traits. The short, yet intense cat breed history poses a significant challenge to the development of a genetic marker-based breed identification strategy. Using different breed assignment strategies and methods, 477 cats representing 29 fancy breeds were analysed with 38 short tandem repeats, 148 intergenic and five phenotypic single nucleotide polymorphisms. Results suggest the frequentist method of Paetkau (accuracy single nucleotide polymorphisms = 0.78, short tandem repeats = 0.88) surpasses the Bayesian method of Rannala and Mountain (single nucleotide polymorphisms = 0.56, short tandem repeats = 0.83) for accurate assignment of individuals to the correct breed. Additionally, a post-assignment verification step with the five phenotypic single nucleotide polymorphisms accurately identified between 0.31 and 0.58 of the mis-assigned individuals raising the sensitivity of assignment with the frequentist method to 0.89 and 0.92 single nucleotide polymorphisms and short tandem repeats respectively. This study provides a novel multi-step assignment strategy and suggests that, despite their short breed history and breed family groupings, a majority of cats can be assigned to their proper breed or population of origin, i.e. race. PMID:23171373

  11. Diversity and evolution of centromere repeats in the maize genome.

    PubMed

    Bilinski, Paul; Distor, Kevin; Gutierrez-Lopez, Jose; Mendoza, Gabriela Mendoza; Shi, Jinghua; Dawe, R Kelly; Ross-Ibarra, Jeffrey

    2015-03-01

    Centromere repeats are found in most eukaryotes and play a critical role in kinetochore formation. Though centromere repeats exhibit considerable diversity both within and among species, little is understood about the mechanisms that drive centromere repeat evolution. Here, we use maize as a model to investigate how a complex history involving polyploidy, fractionation, and recent domestication has impacted the diversity of the maize centromeric repeat CentC. We first validate the existence of long tandem arrays of repeats in maize and other taxa in the genus Zea. Although we find considerable sequence diversity among CentC copies genome-wide, genetic similarity among repeats is highest within these arrays, suggesting that tandem duplications are the primary mechanism for the generation of new copies. Nonetheless, clustering analyses identify similar sequences among distant repeats, and simulations suggest that this pattern may be due to homoplasious mutation. Although the two ancestral subgenomes of maize have contributed nearly equal numbers of centromeres, our analysis shows that the majority of all CentC repeats derive from one of the parental genomes, with an even stronger bias when examining the largest assembled contiguous clusters. Finally, by comparing maize with its wild progenitor teosinte, we find that the abundance of CentC likely decreased after domestication, while the pericentromeric repeat Cent4 has drastically increased.

  12. Laser mass spectrometry for DNA fingerprinting for forensic applications

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, C.H.; Tang, K.; Taranenko, N.I.

    The application of DNA fingerprinting has become very broad in forensic analysis, patient identification, diagnostic medicine, and wildlife poaching, since every individual`s DNA structure is identical within all tissues of their body. DNA fingerprinting was initiated by the use of restriction fragment length polymorphisms (RFLP). In 1987, Nakamura et al. found that a variable number of tandem repeats (VNTR) often occurred in the alleles. The probability of different individuals having the same number of tandem repeats in several different alleles is very low. Thus, the identification of VNTR from genomic DNA became a very reliable method for identification of individuals.more » DNA fingerprinting is a reliable tool for forensic analysis. In DNA fingerprinting, knowledge of the sequence of tandem repeats and restriction endonuclease sites can provide the basis for identification. The major steps for conventional DNA fingerprinting include (1) specimen processing (2) amplification of selected DNA segments by PCR, and (3) gel electrophoresis to do the final DNA analysis. In this work we propose to use laser desorption mass spectrometry for fast DNA fingerprinting. The process and advantages are discussed.« less

  13. Combined deficiency of MSH2 and Sμ region abolishes class switch recombination.

    PubMed

    Leduc, Claire; Haddad, Dania; Laviolette-Malirat, Nathalie; Nguyen Huu, Ngoc-Sa; Khamlichi, Ahmed Amine

    2010-10-01

    Class switch recombination (CSR) is mediated by G-rich tandem repeated sequences termed switch regions. Transcription of switch regions generates single-stranded R loops that provide substrates for activation-induced cytidine deaminase. Mice deficient in MSH2 have a mild defect in CSR and analysis of their switch junctions has led to a model in which MSH2 is more critical for switch recombination events outside than within the tandem repeats. It is also known that deletion of the whole Sμ region severely impairs but does not abrogate CSR despite the lack of detectable R loops. Here, we demonstrate that deficiency of both MSH2 and the Sμ region completely abolishes CSR and that the abrogation occurs at the genomic level. This finding further supports the crucial role of MSH2 outside the tandem repeats. It also indicates that during CSR, MSH2 has access to activation-induced cytidine deaminase targets in R-loop-deficient Iμ-Cμ sequences rarely used in CSR, suggesting an MSH2-dependent DNA processing activity at the Iμ exon that may decrease with transcription elongation across the Sμ region.

  14. MULTIPLE-LOCUS VARIABLE-NUMBER TANDEM REPEAT ANALYSIS OF BRUCELLA ISOLATES FROM THAILAND.

    PubMed

    Kumkrong, Khurawan; Chankate, Phanita; Tonyoung, Wittawat; Intarapuk, Apiradee; Kerdsin, Anusak; Kalambaheti, Thareerat

    2017-01-01

    Brucellosis-induced abortion can result in significant economic loss to farm animals. Brucellosis can be transmitted to humans during slaughter of infected animals or via consumption of contaminated food products. Strain identification of Brucella isolates can reveal the route of transmission. Brucella strains were isolated from vaginal swabs of farm animal, cow milk and from human blood cultures. Multiplex PCR was used to identify Brucella species, and owing to high DNA homology among Brucella isolates, multiple-locus variable-number tandem repeat analysis (MLVA) based on the number of tandem repeats at 16 different genomic loci was used for strain identification. Multiplex PCR categorized the isolates into B. abortus (n = 7), B. melitensis (n = 37), B. suis (n = 3), and 5 of unknown Brucella spp. MLVA-16 clustering analysis differentiated the strains into various genotypes, with Brucella isolates from the same geographic region being closely related, and revealed that the Thai isolates were phylogenetically distinct from those in other countries, including within the Southeast Asian region. Thus, MLVA-16 typing has utility in epidemiological studies.

  15. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zweifel,M.; Leahy, D.; Barrick, D.

    Deltex is a cytosolic effector of Notch signaling thought to bind through its N-terminal domain to the Notch receptor. Here we report the structure of the Drosophila Deltex N-terminal domain, which contains two tandem WWE sequence repeats. The WWE repeats, which adopt a novel fold, are related by an approximate two-fold axis of rotation. Although the WWE repeats are structurally distinct, they interact extensively and form a deep cleft at their junction that appears well suited for ligand binding. The two repeats are thermodynamically coupled; this coupling is mediated in part by a conserved segment that is immediately C-terminal tomore » the second WWE domain. We demonstrate that although the Deltex WWE tandem is monomeric in solution, it forms a heterodimer with the ankyrin domain of the Notch receptor. These results provide structural and functional insight into how Deltex modulates Notch signaling, and how WWE modules recognize targets for ubiquitination.« less

  16. Construction of a self-cloning sake yeast that overexpresses alcohol acetyltransferase gene by a two-step gene replacement protocol.

    PubMed

    Hirosawa, I; Aritomi, K; Hoshida, H; Kashiwagi, S; Nishizawa, Y; Akada, R

    2004-07-01

    The commercial application of genetically modified industrial microorganisms has been problematic due to public concerns. We constructed a "self-cloning" sake yeast strain that overexpresses the ATF1 gene encoding alcohol acetyltransferase, to improve the flavor profile of Japanese sake. A constitutive yeast overexpression promoter, TDH3p, derived from the glyceraldehyde-3-phosphate dehydrogenase gene from sake yeast was fused to ATF1; and the 5' upstream non-coding sequence of ATF1 was further fused to TDH3p-ATF1. The fragment was placed on a binary vector, pGG119, containing a drug-resistance marker for transformation and a counter-selection marker for excision of unwanted DNA. The plasmid was integrated into the ATF1 locus of a sake yeast strain. This integration constructed tandem repeats of ATF1 and TDH3p-ATF1 sequences, between which the plasmid was inserted. Loss of the plasmid, which occurs through homologous recombination between either the TDH3p downstream ATF1 repeats or the TDH3p upstream repeat sequences, was selected by growing transformants on counter-selective medium. Recombination between the downstream repeats led to reversion to a wild type strain, but that between the upstream repeats resulted in a strain that possessed TDH3p-ATF1 without the extraneous DNA sequences. The self-cloning TDH3p-ATF1 yeast strain produced a higher amount of isoamyl acetate. This is the first expression-controlled self-cloning industrial yeast.

  17. Age-related change in the association between a polymorphism in the PER3 gene and preferred timing of sleep and waking activities.

    PubMed

    Jones, Kay H S; Ellis, Jason; von Schantz, Malcolm; Skene, Debra J; Dijk, Derk-Jan; Archer, Simon N

    2007-03-01

    The objective of this study was to investigate the effect of age on the association between preferred timing of sleep and waking activities and a coding-region variable number tandem repeat (VNTR) polymorphism in the clock gene PER3. We have previously reported this polymorphism to associate with diurnal preference and delayed sleep phase syndrome (DSPS). Participants (n = 1590; 707 males and 883 females) completed the Horne-Ostberg (HO) questionnaire for diurnal preference and provided a DNA sample. Overall HO scores were plotted against age. The 5% extremes and intermediates were selected for genotyping. Frequencies of the PER3 4- and 5-repeat alleles were examined in separate age groups (18-29, 30-39, 40-49 and 50+ years of age). The 4-repeat allele was significantly more frequent in evening types, and the 5-repeat allele more frequent in morning types (Fisher's exact test, P = 0.016). Analysis in the four age groupings revealed that the strength of this association attenuated with age and was significant only in the youngest group (18-29 years). These results extend our previous finding of an association between the PER3 VNTR and diurnal preference. They also demonstrate that diurnal preference in young people is more closely associated with this polymorphism than it is in other age groups.

  18. The repeat organizer, a specialized insulator element within the intergenic spacer of the Xenopus rRNA genes.

    PubMed Central

    Robinett, C C; O'Connor, A; Dunaway, M

    1997-01-01

    We have identified a novel activity for the region of the intergenic spacer of the Xenopus laevis rRNA genes that contains the 35- and 100-bp repeats. We devised a new assay for this region by constructing DNA plasmids containing a tandem repeat of rRNA reporter genes that were separated by the 35- and 100-bp repeat region and a rRNA gene enhancer. When the 35- and 100-bp repeat region is present in its normal position and orientation at the 3' end of the rRNA reporter genes, the enhancer activates the adjacent downstream promoter but not the upstream rRNA promoter on the same plasmid. Because this element can restrict the range of an enhancer's activity in the context of tandem genes, we have named it the repeat organizer (RO). The ability to restrict enhancer action is a feature of insulator elements, but unlike previously described insulator elements the RO does not block enhancer action in a simple enhancer-blocking assay. Instead, the activity of the RO requires that it be in its normal position and orientation with respect to the other sequence elements of the rRNA genes. The enhancer-binding transcription factor xUBF also binds to the repetitive sequences of the RO in vitro, but these sequences do not activate transcription in vivo. We propose that the RO is a specialized insulator element that organizes the tandem array of rRNA genes into single-gene expression units by promoting activation of a promoter by its proximal enhancers. PMID:9111359

  19. High Quality Maize Centromere 10 Sequence Reveals Evidence of Frequent Recombination Events

    PubMed Central

    Wolfgruber, Thomas K.; Nakashima, Megan M.; Schneider, Kevin L.; Sharma, Anupma; Xie, Zidian; Albert, Patrice S.; Xu, Ronghui; Bilinski, Paul; Dawe, R. Kelly; Ross-Ibarra, Jeffrey; Birchler, James A.; Presting, Gernot G.

    2016-01-01

    The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR) has presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here, we report a nearly complete assembly of the 1.85 Mb maize centromere 10 from inbred B73 using PacBio technology and BACs from the reference genome project. The error rates estimated from overlapping BAC sequences are 7 × 10−6 and 5 × 10−5 for mismatches and indels, respectively. The number of gaps in the region covered by the reassembly was reduced from 140 in the reference genome to three. Three expressed genes are located between 92 and 477 kb from the inferred ancestral CentC cluster, which lies within the region of highest centromeric repeat density. The improved assembly increased the count of full-length CR from 5 to 55 and revealed a 22.7 kb segmental duplication that occurred approximately 121,000 years ago. Our analysis provides evidence of frequent recombination events in the form of partial retrotransposons, deletions within retrotransposons, chimeric retrotransposons, segmental duplications including higher order CentC repeats, a deleted CentC monomer, centromere-proximal inversions, and insertion of mitochondrial sequences. Double-strand DNA break (DSB) repair is the most plausible mechanism for these events and may be the major driver of centromere repeat evolution and diversity. In many cases examined here, DSB repair appears to be mediated by microhomology, suggesting that tandem repeats may have evolved to efficiently repair frequent DSBs in centromeres. PMID:27047500

  20. Family-based association study between monoamine oxidase A (MAOA) gene promoter VNTR polymorphism and Tourette's syndrome in Chinese Han population.

    PubMed

    Liu, Shiguo; Wang, Xueqin; Xu, Longqiang; Zheng, Lanlan; Ge, Yinlin; Ma, Xu

    2015-02-01

    To clarify the association of monoamine oxidase A- variable number of tandem repeat (MAOA-pVNTR) with susceptibility to Tourette's syndrome (TS) in Chinese Han population we discuss the genetic contribution of MAOA-VNTR in 141 TS patients including all their parents in Chinese Han population using transmission disequilibrium test (TDT) design. Our results revealed that no significant association was found in the MAOA gene promoter VNTR polymorphism and TS in Chinese Han population (TDT = 1.515, df = 1, p > 0.05). The negative result may be mainly due to the small sample size, but we don't deny the role of gene coding serotonergic or monoaminergic structures in the etiology of TS.

  1. A Dynamic Tandem Repeat in Monocotyledons Inferred from a Comparative Analysis of Chloroplast Genomes in Melanthiaceae.

    PubMed

    Do, Hoang Dang Khoa; Kim, Joo-Hwan

    2017-01-01

    Chloroplast genomes (cpDNA) are highly valuable resources for evolutionary studies of angiosperms, since they are highly conserved, are small in size, and play critical roles in plants. Slipped-strand mispairing (SSM) was assumed to be a mechanism for generating repeat units in cpDNA. However, research on the employment of different small repeated sequences through SSM events, which may induce the accumulation of distinct types of repeats within the same region in cpDNA, has not been documented. Here, we sequenced two chloroplast genomes from the endemic species Heloniopsis tubiflora (Korea) and Xerophyllum tenax (USA) to cover the gap between molecular data and explore "hot spots" for genomic events in Melanthiaceae. Comparative analysis of 23 complete cpDNA sequences revealed that there were different stages of deletion in the rps16 region across the Melanthiaceae. Based on the partial or complete loss of rps16 gene in cpDNA, we have firstly reported potential molecular markers for recognizing two sections ( Veratrum and Fuscoveratrum ) of Veratrum . Melathiaceae exhibits a significant change in the junction between large single copy and inverted repeat regions, ranging from trnH_GUG to a part of rps3 . Our results show an accumulation of tandem repeats in the rpl23-ycf2 regions of cpDNAs. Small conserved sequences exist and flank tandem repeats in further observation of this region across most of the examined taxa of Liliales. Therefore, we propose three scenarios in which different small repeated sequences were used during SSM events to generate newly distinct types of repeats. Occasionally, prior to the SSM process, point mutation event and double strand break repair occurred and induced the formation of initial repeat units which are indispensable in the SSM process. SSM may have likely occurred more frequently for short repeats than for long repeat sequences in tribe Parideae (Melanthiaceae, Liliales). Collectively, these findings add new evidence of dynamic results from SSM in chloroplast genomes which can be useful for further evolutionary studies in angiosperms. Additionally, genomics events in cpDNA are potential resources for mining molecular markers in Liliales.

  2. Inheritance patterns of ATCCT repeat interruptions in spinocerebellar ataxia type 10 (SCA10) expansions.

    PubMed

    Landrian, Ivette; McFarland, Karen N; Liu, Jilin; Mulligan, Connie J; Rasmussen, Astrid; Ashizawa, Tetsuo

    2017-01-01

    Spinocerebellar ataxia type 10 (SCA10), an autosomal dominant cerebellar ataxia disorder, is caused by a non-coding ATTCT microsatellite repeat expansion in the ataxin 10 gene. In a subset of SCA10 families, the 5'-end of the repeat expansion contains a complex sequence of penta- and heptanucleotide interruption motifs which is followed by a pure tract of tandem ATCCT repeats of unknown length at its 3'-end. Intriguingly, expansions that carry these interruption motifs correlate with an epileptic seizure phenotype and are unstable despite the theory that interruptions are expected to stabilize expanded repeats. To examine the apparent contradiction of unstable, interruption-positive SCA10 expansion alleles and to determine whether the instability originates outside of the interrupted region, we sequenced approximately 1 kb of the 5'-end of SCA10 expansions using the ATCCT-PCR product in individuals across multiple generations from four SCA10 families. We found that the greatest instability within this region occurred in paternal transmissions of the allele in stretches of pure ATTCT motifs while the intervening interrupted sequences were stable. Overall, the ATCCT interruption changes by only one to three repeat units and therefore cannot account for the instability across the length of the disease allele. We conclude that the AT-rich interruptions locally stabilize the SCA10 expansion at the 5'-end but do not completely abolish instability across the entire span of the expansion. In addition, analysis of the interruption alleles across these families support a parsimonious single origin of the mutation with a shared distant ancestor.

  3. Complete chloroplast genome sequence of MD-2 pineapple and its comparative analysis among nine other plants from the subclass Commelinidae.

    PubMed

    Redwan, R M; Saidin, A; Kumar, S V

    2015-08-12

    Pineapple (Ananas comosus var. comosus) is known as the king of fruits for its crown and is the third most important tropical fruit after banana and citrus. The plant, which is indigenous to South America, is the most important species in the Bromeliaceae family and is largely traded for fresh fruit consumption. Here, we report the complete chloroplast sequence of the MD-2 pineapple that was sequenced using the PacBio sequencing technology. In this study, the high error rate of PacBio long sequence reads of A. comosus's total genomic DNA were improved by leveraging on the high accuracy but short Illumina reads for error-correction via the latest error correction module from Novocraft. Error corrected long PacBio reads were assembled by using a single tool to produce a contig representing the pineapple chloroplast genome. The genome of 159,636 bp in length is featured with the conserved quadripartite structure of chloroplast containing a large single copy region (LSC) with a size of 87,482 bp, a small single copy region (SSC) with a size of 18,622 bp and two inverted repeat regions (IRA and IRB) each with the size of 26,766 bp. Overall, the genome contained 117 unique coding regions and 30 were repeated in the IR region with its genes contents, structure and arrangement similar to its sister taxon, Typha latifolia. A total of 35 repeats structure were detected in both the coding and non-coding regions with a majority being tandem repeats. In addition, 205 SSRs were detected in the genome with six protein-coding genes contained more than two SSRs. Comparative chloroplast genomes from the subclass Commelinidae revealed a conservative protein coding gene albeit located in a highly divergence region. Analysis of selection pressure on protein-coding genes using Ka/Ks ratio showed significant positive selection exerted on the rps7 gene of the pineapple chloroplast with P less than 0.05. Phylogenetic analysis confirmed the recent taxonomical relation among the member of commelinids which support the monophyly relationship between Arecales and Dasypogonaceae and between Zingiberales to the Poales, which includes the A. comosus. The complete sequence of the chloroplast of pineapple provides insights to the divergence of genic chloroplast sequences from the members of the subclass Commelinidae. The complete pineapple chloroplast will serve as a reference for in-depth taxonomical studies in the Bromeliaceae family when more species under the family are sequenced in the future. The genetic sequence information will also make feasible other molecular applications of the pineapple chloroplast for plant genetic improvement.

  4. ARA type protograph codes

    NASA Technical Reports Server (NTRS)

    Divsalar, Dariush (Inventor); Abbasfar, Aliazam (Inventor); Jones, Christopher R. (Inventor); Dolinar, Samuel J. (Inventor); Thorpe, Jeremy C. (Inventor); Andrews, Kenneth S. (Inventor); Yao, Kung (Inventor)

    2008-01-01

    An apparatus and method for encoding low-density parity check codes. Together with a repeater, an interleaver and an accumulator, the apparatus comprises a precoder, thus forming accumulate-repeat-accumulate (ARA codes). Protographs representing various types of ARA codes, including AR3A, AR4A and ARJA codes, are described. High performance is obtained when compared to the performance of current repeat-accumulate (RA) or irregular-repeat-accumulate (IRA) codes.

  5. PUF Proteins: Cellular Functions and Potential Applications.

    PubMed

    Kiani, Seyed Jalal; Taheri, Tahereh; Rafati, Sima; Samimi-Rad, Katayoun

    2017-01-01

    RNA-binding proteins play critical roles in the regulation of gene expression. Among several families of RNA-binding proteins, PUF (Pumilio and FBF) proteins have been the subject of extensive investigations, as they can bind RNA in a sequence-specific manner and they are evolutionarily conserved among a wide range of organisms. The outstanding feature of these proteins is a highly conserved RNA-binding domain, which is known as the Pumilio-homology domain (PUM-HD) that mostly consists of eight tandem repeats. Each repeat recognizes an RNA base with a simple three-letter code that can be programmed in order to change the sequence-specificity of the protein. Using this tailored architecture, researchers have been able to change the specificity of the PUM-HD and target desired transcripts in the cell, even in subcellular compartments. The potential applications of this versatile tool in molecular cell biology seem unbounded and the use of these factors in pharmaceutics might be an interesting field of study in near future. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  6. Complete mitochondrial genome of Camponotus atrox (Hymenoptera: Formicidae): a new tRNA arrangement in Hymenoptera.

    PubMed

    Kim, Min Jee; Hong, Eui Jeong; Kim, Iksoo

    2016-01-01

    We sequenced the complete mitochondrial (mt) genome of Camponotus atrox (Hymenoptera: Formicidae), which is only distributed in Korea. The genome was 16 540 bp in size and contained typical sets of genes (13 protein-coding genes, 22 tRNAs, and 2 rRNAs). The C. atrox A+T-rich region, at 1402 bp, was the longest of all sequenced ant genomes and was composed of an identical tandem repeat consisting of six 100-bp copies and one 96-bp copy. A total of 315 bp of intergenic spacer sequence was spread over 23 regions. An alignment of the spacer sequences in ants was largely feasible among congeneric species, and there was substantial sequence divergence, indicating their potential use as molecular markers for congeneric species. The A/T contents at the first and second codon positions of protein-coding genes (PCGs) were similar for ant species, including C. atrox (73.9% vs. 72.3%, on average). With increased taxon sampling among hymenopteran superfamilies, differences in the divergence rates (i.e., the non-synonymous substitution rates) between the suborders Symphyta and Apocrita were detected, consistent with previous results. The C. atrox mt genome had a unique gene arrangement, trnI-trnM-trnQ, at the A+T-rich region and ND2 junction (underline indicates inverted gene). This may have originated from a tandem duplication of trnM-trnI, resulting in trnM-trnI-trnM-trnI-trnQ, and the subsequent loss of the first trnM and second trnI, resulting in trnI-trnM-trnQ.

  7. Altered Methylation in Tandem Repeat Element and Elemental Component Levels in Inhalable Air Particles

    PubMed Central

    Hou, Lifang; Zhang, Xiao; Zheng, Yinan; Wang, Sheng; Dou, Chang; Guo, Liqiong; Byun, Hyang-Min; Motta, Valeria; McCracken, John; Díaz, Anaité; Kang, Choong-Min; Koutrakis, Petros; Bertazzi, Pier Alberto; Li, Jingyun; Schwartz, Joel; Baccarelli, Andrea A.

    2014-01-01

    Exposure to particulate matter (PM) has been associated with lung cancer risk in epidemiology investigations. Elemental components of PM have been suggested to have critical roles in PM toxicity, but the molecular mechanisms underlying their association with cancer risks remain poorly understood. DNA methylation has emerged as a promising biomarker for environmental-related diseases, including lung cancer. In this study, we evaluated the effects of PM elemental components on methylation of three tandem repeats in a highly-exposed population in Beijing, China. The Beijing Truck Driver Air Pollution Study was conducted shortly before the 2008 Beijing Olympic Games (June 15-July 27, 2008) and included 60 truck drivers and 60 office workers. On two days separated by 1-2 weeks, we measured blood DNA methylation of SATα, NBL2, D4Z4, and personal exposure to eight elemental components in PM2.5, including aluminum (Al), silicon (Si), sulfur (S), potassium (K), calcium (Ca) titanium (Ti), iron (Fe), and zinc (Zn). We estimated the associations of individual elemental component with each tandem repeat methylation in generalized estimating equations (GEE) models adjusted for PM2.5 mass and other covariates. Out of the eight examined elements, NBL2 methylation was positively associated with concentrations of Si (0.121, 95%CI: 0.030; 0.212, FDR=0.047) and Ca (0.065, 95%CI: 0.014; 0.115, FDR=0.047) in truck drivers. In office workers, SATα methylation was positively associated with concentrations of S (0.115, 95%CI: 0.034; 0.196, FDR=0.042). PM-associated differences in blood tandem-repeat methylation may help detect biological effects of the exposure and identify individuals who may eventually experience higher lung cancer risk. PMID:24273195

  8. The production and characterization of novel heavy-chain antibodies against the tandem repeat region of MUC1 mucin.

    PubMed

    Rahbarizadeh, Fatemeh; Rasaee, Mohammad J; Forouzandeh, Mehdi; Allameh, Abdolamir; Sarrami, Ramin; Nasiry, Habib; Sadeghizadeh, Majid

    2005-01-01

    Camelidae are known to produce immunoglobulins (Igs) devoid of light chains and constant heavy-chain domains (CH1). Antigen-specific fragments of these heavy-chain IgGs (VHH) are of great interest in biotechnology applications. This paper describes the first example of successfully raised heavy-chain antibodies in Camelus dromedarius (single-humped camel) and Camelus bactrianus (two-humped camel) against a MUC1 related peptide that is found to be an important epitope expressed in cancerous tissue. Camels were immunized against a synthetic peptide corresponding to the tandem repeat region of MUC1 mucin and cancerous tissue preparation obtained from patients suffering from breast carcinoma. Three IgG subclasses with different binding properties to protein A and G were purified by affinity chromatography. Both conventional and heavy-chain IgG antibodies were produced in response to MUC1-related peptide. The elicited antibodies could react specifically with the tandem repeat region of MUC1 mucin in an enzyme linked immunosorbant assay (ELISA). Anti-peptide antibodies were purified after passing antiserum over two affinity chromatography columns. Using ELISA, immunocytochemistry and Western blotting, the interaction of purified antibodies with different antigens was evaluated. The antibodies were observed to be selectively bound to antigens namely: MUC1 peptide (tandem repeat region), human milk fat globule membrane (HMFG), deglycosylated human milk fat globule membrane (D-HMFG), homogenized cancerous breast tissue and a native MUC1 purified from ascitic fluid. Ka values of specific polyclonal antipeptide antibodies were estimated in C. dromedarius and C. bactrianus, as 7 x 10(10) M(-1) and 1.4 x 10(10) M(-1) respectively.

  9. Tandem Repeats in Proteins: Prediction Algorithms and Biological Role.

    PubMed

    Pellegrini, Marco

    2015-01-01

    Tandem repetitions in protein sequence and structure is a fascinating subject of research which has been a focus of study since the late 1990s. In this survey, we give an overview on the multi-faceted aspects of research on protein tandem repeats (PTR for short), including prediction algorithms, databases, early classification efforts, mechanisms of PTR formation and evolution, and synthetic PTR design. We also touch on the rather open issue of the relationship between PTR and flexibility (or disorder) in proteins. Detection of PTR either from protein sequence or structure data is challenging due to inherent high (biological) signal-to-noise ratio that is a key feature of this problem. As early in silico analytic tools have been key enablers for starting this field of study, we expect that current and future algorithmic and statistical breakthroughs will have a high impact on the investigations of the biological role of PTR.

  10. Concerted evolution of the tandemly repeated genes encoding primate U2 small nuclear RNA (the RNU2 locus) does not prevent rapid diversification of the (CT){sub n} {center_dot} (GA){sub n} microsatellite embedded within the U2 repeat unit

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Liao, D.; Weiner, A.M.

    1995-12-10

    The RNU2 locus encoding human U2 small nuclear RNA (snRNA) is organized as a nearly perfect tandem array containing 5 to 22 copies of a 5.8-kb repeat unit. Just downstream of the U2 snRNA gene in each 5.8-kb repeat unit lies a large (CT){sub n}{center_dot}(GA){sub n} dinucleotide repeat (n {approx} 70). This form of genomic organization, in which one repeat is embedded within another, provides an unusual opportunity to study the balance of forces maintaining the homogeneity of both kinds of repeats. Using a combination of field inversion gel electrophoresis and polymerase chain reaction, we have been able to studymore » the CT microsatellites within individual U2 tandem arrays. We find that the CT microsatellites within an RNU2 allele exhibit significant length polymorphism, despite the remarkable homogeneity of the surrounding U2 repeat units. Length polymorphism is due primarily to loss or gain of CT dinucleotide repeats, but other types of deletions, insertions, and substitutions are also frequent. Polymorphism is greatly reduced in regions where pure (CT){sub n} tracts are interrupted by occasional G residues, suggesting that irregularities stabilize both the length and the sequence of the dinucleotide repeat. We further show that the RNU2 loci of other catarrhine primates (gorilla, chimpanzee, ogangutan, and baboon) contain orthologous CT microsatellites; these also exhibit length polymorphism, but are highly divergent from each other. Thus, although the CT microsatellite is evolving far more rapidly than the rest of the U2 repeat unit, it has persisted through multiple speciation events spanning >35 Myr. The persistence of the CT microsatellite, despite polymorphism and rapid evolution, suggests that it might play a functional role in concerted evolution of the RNU2 loci, perhaps as an initiation site for recombination and/or gene conversion. 70 refs., 5 figs.« less

  11. The paradox of MHC-DRB exon/intron evolution: alpha-helix and beta-sheet encoding regions diverge while hypervariable intronic simple repeats coevolve with beta-sheet codons.

    PubMed

    Schwaiger, F W; Weyers, E; Epplen, C; Brün, J; Ruff, G; Crawford, A; Epplen, J T

    1993-09-01

    Twenty-one different caprine and 13 ovine MHC-DRB exon 2 sequences were determined including part of the adjacent introns containing simple repetitive (gt)n(ga)m elements. The positions for highly polymorphic DRB amino acids vary slightly among ungulates and other mammals. From man and mouse to ungulates the basic (gt)n(ga)m structure is fixed in evolution for 7 x 10(7) years whereas ample variations exist in the tandem (gt)n and (ga)m dinucleotides and especially their "degenerated" derivatives. Phylogenetic trees for the alpha-helices and beta-pleated sheets of the ungulate DRB sequences suggest different evolutionary histories. In hoofed animals as well as in humans DRB beta-sheet encoding sequences and adjacent intronic repeats can be assembled into virtually identical groups suggesting coevolution of noncoding as well as coding DNA. In contrast alpha-helices and C-terminal parts of the first DRB domain evolve distinctly. In the absence of a defined mechanism causing specific, site-directed mutations, double-recombination or gene-conversion-like events would readily explain this fact. The role of the intronic simple (gt)n(ga)m repeat is discussed with respect to these genetic exchange mechanisms during evolution.

  12. Ligand binding by repeat proteins: natural and designed

    PubMed Central

    Grove, Tijana Z; Cortajarena, Aitziber L; Regan, Lynne

    2012-01-01

    Repeat proteins contain tandem arrays of small structural motifs. As a consequence of this architecture, they adopt non-globular, extended structures that present large, highly specific surfaces for ligand binding. Here we discuss recent advances toward understanding the functional role of this unique modular architecture. We showcase specific examples of natural repeat proteins interacting with diverse ligands and also present examples of designed repeat protein–ligand interactions. PMID:18602006

  13. Exploring the repeat protein universe through computational protein design

    DOE PAGES

    Brunette, TJ; Parmeggiani, Fabio; Huang, Po-Ssu; ...

    2015-12-16

    A central question in protein evolution is the extent to which naturally occurring proteins sample the space of folded structures accessible to the polypeptide chain. Repeat proteins composed of multiple tandem copies of a modular structure unit are widespread in nature and have critical roles in molecular recognition, signalling, and other essential biological processes. Naturally occurring repeat proteins have been re-engineered for molecular recognition and modular scaffolding applications. In this paper, we use computational protein design to investigate the space of folded structures that can be generated by tandem repeating a simple helix–loop–helix–loop structural motif. Eighty-three designs with sequences unrelatedmore » to known repeat proteins were experimentally characterized. Of these, 53 are monomeric and stable at 95 °C, and 43 have solution X-ray scattering spectra consistent with the design models. Crystal structures of 15 designs spanning a broad range of curvatures are in close agreement with the design models with root mean square deviations ranging from 0.7 to 2.5 Å. Finally, our results show that existing repeat proteins occupy only a small fraction of the possible repeat protein sequence and structure space and that it is possible to design novel repeat proteins with precisely specified geometries, opening up a wide array of new possibilities for biomolecular engineering.« less

  14. Characterization of rat serum amyloid A4 (SAA4): A novel member of the SAA superfamily

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rossmann, Christine; Windpassinger, Christian; Brunner, Daniela

    2014-08-08

    Highlights: • The full length rat SAA4 (rSAA4) mRNA was characterized by rapid amplification of cDNA ends. • rSAA4 mRNA has 1830 bases including a GA dinucleotide tandem repeat in the 5′UTR. • Three consecutive C/EBP promoter elements are crucial for transcription of rSAA4. • rSAA4 is abundantly expressed in the liver on mRNA and protein level. - Abstract: The serum amyloid A (SAA) family of proteins is encoded by multiple genes, which display allelic variation and a high degree of homology in mammals. The SAA1/2 genes code for non-glycosylated acute-phase SAA1/2 proteins, that may increase up to 1000-fold duringmore » inflammation. The SAA4 gene, well characterized in humans (hSAA4) and mice (mSaa4) codes for a SAA4 protein that is glycosylated only in humans. We here report on a previously uncharacterized SAA4 gene (rSAA4) and its product in Rattus norvegicus, the only mammalian species known not to express acute-phase SAA. The exon/intron organization of rSAA4 is similar to that reported for hSAA4 and mSaa4. By performing 5′- and 3′RACE, we identified a 1830-bases containing rSAA4 mRNA (including a GA-dinucleotide tandem repeat). Highest rSAA4 mRNA expression was detected in rat liver. In McA-RH7777 rat hepatoma cells, rSAA4 transcription was significantly upregulated in response to LPS and IL-6 while IL-1α/β and TNFα were without effect. Luciferase assays with promoter-truncation constructs identified three proximal C/EBP-elements that mediate expression of rSAA4 in McA-RH7777 cells. In line with sequence prediction a 14-kDa non-glycosylated SAA4 protein is abundantly expressed in rat liver. Fluorescence microscopy revealed predominant localization of rSAA4-GFP-tagged fusion protein in the ER.« less

  15. De novo transcriptome sequencing reveals a considerable bias in the incidence of simple sequence repeats towards the downstream of 'Pre-miRNAs' of black pepper.

    PubMed

    Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

    2013-01-01

    Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of '43 pre-miRNA candidates bearing different types of SSR motifs'. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted 'pre-miRNA candidates bearing SSRs'. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted 'pre-miRNA candidates'. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of 'tandem repeats' in miRNAs.

  16. A naturally occurring, noncanonical GTP aptamer made of simple tandem repeats

    PubMed Central

    Curtis, Edward A; Liu, David R

    2014-01-01

    Recently, we used in vitro selection to identify a new class of naturally occurring GTP aptamer called the G motif. Here we report the discovery and characterization of a second class of naturally occurring GTP aptamer, the “CA motif.” The primary sequence of this aptamer is unusual in that it consists entirely of tandem repeats of CA-rich motifs as short as three nucleotides. Several active variants of the CA motif aptamer lack the ability to form consecutive Watson-Crick base pairs in any register, while others consist of repeats containing only cytidine and adenosine residues, indicating that noncanonical interactions play important roles in its structure. The circular dichroism spectrum of the CA motif aptamer is distinct from that of A-form RNA and other major classes of nucleic acid structures. Bioinformatic searches indicate that the CA motif is absent from most archaeal and bacterial genomes, but occurs in at least 70 percent of approximately 400 eukaryotic genomes examined. These searches also uncovered several phylogenetically conserved examples of the CA motif in rodent (mouse and rat) genomes. Together, these results reveal the existence of a second class of naturally occurring GTP aptamer whose sequence requirements, like that of the G motif, are not consistent with those of a canonical secondary structure. They also indicate a new and unexpected potential biochemical activity of certain naturally occurring tandem repeats. PMID:24824832

  17. Development of a Multiple-Locus Variable number of tandem repeat Analysis (MLVA) for Leptospira interrogans and its application to Leptospira interrogans serovar Australis isolates from Far North Queensland, Australia

    PubMed Central

    Slack, Andrew T; Dohnt, Michael F; Symonds, Meegan L; Smythe, Lee D

    2005-01-01

    Background Leptospirosis is a zoonotic disease caused by the genus, Leptospira. Leptospira interrogans is the most common genomospecies implicated in the disease. Epidemiological investigations are needed to distinguish outbreak situations or to trace reservoirs of the organisms. Current methodologies used for typing Leptospira have significant drawbacks. The development of an easy to perform yet high resolution method is needed for this organism. Methods In this study we have searched the available genomic sequence of L. interrogans serovar Copenhageni strain Fiocruz L1-130 for the presence of tandem repeats [1]. These repeats were evaluated against reference strains for diversity. Six loci were selected to create a Multiple Locus Variable Number of Tandem Repeats (VNTR) Analysis (MLVA) to explore the genetic diversity within L. interrogans serovar Australis clinical isolates from Far North Queensland. Results The 39 reference strains used for the development of the method displayed 39 distinct patterns. Diversity Indexes for the loci varied between 0.80 and 0.93 and the number of repeat units at each locus varied between less than one to 52 repeats. When the MLVA was applied to serovar Australis isolates three large clusters were distinguishable, each comprising various hosts including Rattus species, human and canines. Conclusion The MLVA described in this report, was easy to perform, analyse and was reproducible. The loci selected had high diversity allowing discrimination between serovars and also between strains within a serovar. This method provides a starting point on which improvements to the method and comparisons to other techniques can be made. PMID:15987533

  18. Correlation of FCGRT genomic structure with serum immunoglobulin, albumin and farletuzumab pharmacokinetics in patients with first relapsed ovarian cancer.

    PubMed

    O'Shannessy, Daniel J; Bendas, Katie; Schweizer, Charles; Wang, Wenquan; Albone, Earl; Somers, Elizabeth B; Weil, Susan; Meredith, Rhonda K; Wustner, Jason; Grasso, Luigi; Landers, Mark; Nicolaides, Nicholas C

    2017-07-01

    Farletuzumab (FAR) is a humanized monoclonal antibody (mAb) that binds to folate receptor alpha. A Ph3 trial in ovarian cancer patients treated with carboplatin/taxane plus FAR or placebo did not meet the primary statistical endpoint. Subgroup analysis demonstrated that subjects with high FAR exposure levels (Cmin>57.6μg/mL) showed statistically significant improvements in PFS and OS. The neonatal Fc receptor (fcgrt) plays a central role in albumin/IgG stasis and mAb pharmacokinetics (PK). Here we evaluated fcgrt sequence and association of its promoter variable number tandem repeats (VNTR) and coding single nucleotide variants (SNV) with albumin/IgG levels and FAR PK in the Ph3 patients. A statistical correlation existed between high FAR Cmin and AUC in patients with the highest quartile of albumin and lowest quartile of IgG1. Analysis of fcgrt identified 5 different VNTRs in the promoter region and 9 SNVs within the coding region, 4 which are novel. Copyright © 2017. Published by Elsevier Inc.

  19. Length Variation in Mitochondrial DNA of the Minnow Cyprinella Spiloptera

    PubMed Central

    Broughton, R. E.; Dowling, T. E.

    1994-01-01

    Length differences in animal mitochondrial DNA (mtDNA) are common, frequently due to variation in copy number of direct tandem duplications. While such duplications appear to form without great difficulty in some taxonomic groups, they appear to be relatively short-lived, as typical duplication products are geographically restricted within species and infrequently shared among species. To better understand such length variation, we have studied a tandem and direct duplication of approximately 260 bp in the control region of the cyprinid fish, Cyprinella spiloptera. Restriction site analysis of 38 individuals was used to characterize population structure and the distribution of variation in repeat copy number. This revealed two length variants, including individuals with two or three copies of the repeat, and little geographic structure among populations. No standard length (single copy) genomes were found and heteroplasmy, a common feature of length variation in other taxa, was absent. Nucleotide sequence of tandem duplications and flanking regions localized duplication junctions in the phenylalanine tRNA and near the origin of replication. The locations of these junctions and the stability of folded repeat copies support the hypothesized importance of secondary structures in models of duplication formation. PMID:8001785

  20. A Legionella pneumophila collagen-like protein encoded by a gene with a variable number of tandem repeats is involved in the adherence and invasion of host cells.

    PubMed

    Vandersmissen, Liesbeth; De Buck, Emmy; Saels, Veerle; Coil, David A; Anné, Jozef

    2010-05-01

    Legionella pneumophila is a Gram-negative, facultative intracellular pathogen and the causative agent of Legionnaires' disease, a severe pneumonia in humans. Analysis of the Legionella sequenced genomes revealed a gene with a variable number of tandem repeats (VNTRs), whose number varies between strains. We examined the strain distribution of this gene among a collection of 108 clinical, environmental and hot spring serotype I strains. Twelve variants were identified, but no correlation was observed between the number of repeat units and clinical and environmental strains. The encoded protein contains the C-terminal consensus motif of outer membrane proteins and has a large region of collagen-like repeats that is encoded by the VNTR region. We have therefore annotated this protein Lcl for Legionella collagen-like protein. Lcl was shown to contribute to the adherence and invasion of host cells and it was demonstrated that the number of repeat units present in lcl had an influence on these adhesion characteristics.

  1. Condensin loaded onto the replication fork barrier site in the rRNA gene repeats during S phase in a FOB1-dependent fashion to prevent contraction of a long repetitive array in Saccharomyces cerevisiae.

    PubMed

    Johzuka, Katsuki; Terasawa, Masahiro; Ogawa, Hideyuki; Ogawa, Tomoko; Horiuchi, Takashi

    2006-03-01

    An average of 200 copies of the rRNA gene (rDNA) is clustered in a long tandem array in Saccharomyces cerevisiae. FOB1 is known to be required for expansion/contraction of the repeats by stimulating recombination, thereby contributing to the maintenance of the average copy number. In Deltafob1 cells, the repeats are still maintained without any fluctuation in the copy number, suggesting that another, unknown system acts to prevent repeat contraction. Here, we show that condensin acts together with FOB1 in a functionally complemented fashion to maintain the long tandem repeats. Six condensin mutants possessing severely contracted rDNA repeats were isolated in Deltafob1 cells but not in FOB1+ cells. We also found that the condensin complex associated with the nontranscribed spacer region of rDNA with a major peak coincided with the replication fork barrier (RFB) site in a FOB1-dependent fashion. Surprisingly, condensin association with the RFB site was established during S phase and was maintained until anaphase. These results indicate that FOB1 plays a novel role in preventing repeat contraction by regulating condensin association and suggest a link between replication termination and chromosome condensation and segregation.

  2. The complete mitochondrial genome of the mudsnail Cipangopaludina cathayensis (Gastropoda: Viviparidae).

    PubMed

    Yang, Huirong; Zhang, Jia-En; Luo, Hao; Luo, Mingzhu; Guo, Jing; Deng, Zhixin; Zhao, Benliang

    2016-05-01

    We present the complete mitochondrial genome of Cipangopaludina cathayensis in this study. The mitochondrial genome is 17,157 bp in length, containing 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes. All of them are encoded on the heavy strand except 7 tRNA genes on the light strand. Overall nucleotide compositions of the light strand are 44.51% of A, 26.74% of T, 20.48% of C and 8.28% of G. All the protein-coding genes start with ATG initiation codon except ATP6 with ATA and ND4 with TTG, and 2 types of termination codons are TAA (ATP6, ND2, COX1, COX2, ATP8, ND1, ND6, Cytb, COX3, ND4) and TAG (ND4L, ND5, ND3). There are 29 intergenic spacers and 5 gene overlaps. The tandem repeat sequences are observed in COX2, tRNA(Asp), ATP6, tRNA(Cys), S-rRNA, ND1, Cytb, ND4 and COX3 genes. Gene arrangement and distribution are different from the typical vertebrates. The absence of D-loop is consistent with the Gastropoda, but at least one lengthy non-coding region is essential regulatory element for the initiation of transcription and replication.

  3. [DNA prints instead of plantar prints in neonatal identification].

    PubMed

    Rodríguez-Alarcón Gómez, J; Martińez de Pancorbo Gómez, M; Santillana Ferrer, L; Castro Espido, A; Melchor Maros, J C; Linares Uribe, M A; Fernández-Llebrez del Rey, L; Aranguren Dúo, G

    1996-06-22

    To check the possible usefulness in studying DNA in dried blood spots taken on filter paper blotters for newborn identification. It set out to establish: 1. The validity of the method for analysis; 2. The validity of all stored samples (such as those kept in clinical records); 3. Guarantee of non-intrusion in the genetic code; 4. Acceptable price and execution time. Forty (40) anonymous 13-year-old samples of 20 subjects (2 per subject) were studied. DNA was extracted using Chelex resin and the STR ("small tandem repeat") of microsatellite DNA was studies using the "polimerase chain reaction method" (PCR). Three non coding DNA loci (CSF1PO, TPOX and THO1) were analyzed by Multiplex amplification. It was possible to type 39 samples, making it possible to match the 20 cases (one by exclusion). The complete procedure yielded the results within 24 hours in all cases. The estimated final cost was found to be a fifth of that conventional maternity/paternity tests. The study carried out made matching possible in all 20 cases (directly in 19 cases). It was not necessary to study DNA coding areas. The validity of the method for analyzing samples stored for 13 years without any special care was also demonstrated. The technic was fast, producing the results within 24 hours, and at reasonable cost.

  4. Analysis of variable sites between two complete South China tiger (Panthera tigris amoyensis) mitochondrial genomes.

    PubMed

    Zhang, Wenping; Yue, Bisong; Wang, Xiaofang; Zhang, Xiuyue; Xie, Zhong; Liu, Nonglin; Fu, Wenyuan; Yuan, Yaohua; Chen, Daqing; Fu, Danghua; Zhao, Bo; Yin, Yuzhong; Yan, Xiahui; Wang, Xinjing; Zhang, Rongying; Liu, Jie; Li, Maoping; Tang, Yao; Hou, Rong; Zhang, Zhihe

    2011-10-01

    In order to investigate the mitochondrial genome of Panthera tigris amoyensis, two South China tigers (P25 and P27) were analyzed following 15 cymt-specific primer sets. The entire mtDNA sequence was found to be 16,957 bp and 17,001 bp long for P25 and P27 respectively, and this difference in length between P25 and P27 occurred in the number of tandem repeats in the RS-3 segment of the control region. The structural characteristics of complete P. t. amoyensis mitochondrial genomes were also highly similar to those of P. uncia. Additionally, the rate of point mutation was only 0.3% and a total of 59 variable sites between P25 and P27 were found. Out of the 59 variable sites, 6 were located in 6 different tRNA genes, 6 in the 2 rRNA genes, 7 in non-coding regions (one located between tRNA-Asn and tRNA-Tyr and six in the D-loop), and 40 in 10 protein-coding genes. COI held the largest amount of variable sites (9 sites) and Cytb contained the highest variable rate (0.7%) in the complete sequences. Moreover, out of the 40 variable sites located in 10 protein-coding genes, 12 sites were nonsynonymous.

  5. Surface display of monkey metallothionein α tandem repeats and EGFP fusion protein on Pseudomonas putida X4 for biosorption and detection of cadmium.

    PubMed

    He, Xiaochuan; Chen, Wenli; Huang, Qiaoyun

    2012-09-01

    Monkey metallothionein α domain tandem repeats (4mMTα), which exhibit high cadmium affinity, have been displayed for the first time on the surface of a bacterium using ice nucleation protein N-domain (inaXN) protein from the Xanthomonas campestris pv (ACCC-10049) as an anchoring motif. The shuttle vector pIME, which codes for INAXN-4mMTα-EGFP fusion, was constructed and used to target 4mMTα and EGFP on the surface of Pseudomonas putida X4 (CCTCC-209319). The surface location of the INAXN-4mMTα-EGFP fusion was further verified by western blot analysis and immunofluorescence microscopy. The growth of X4 showed resistance to cadmium presence. The presence of surface-exposed 4mMTα on the engineered strains was four times higher than that of the wild-type X4. The Cd²⁺ accumulation by X4/pIME was not only four times greater than that of the original host bacterial cells but was also remarkably unaffected by the presence of Cu²⁺ and Zn²⁺. Moreover, the surface-engineered strains could effectively bind Cd²⁺ under a wide range of pH levels, from 4 to 7. P. putida X4/pIME with surface-expressed 4mMTα-EGFP had twice the cadmium binding capacity as well as 1.4 times the fluorescence as the cytoplasmic 4mMTa-EGFP. These results suggest that P. putida X4 expressing 4mMTα-EGFP with the INAXN anchor motif on the surface would be a useful tool for the remediation and biodetection of environmental cadmium contaminants.

  6. ACCA phosphopeptide recognition by the BRCT repeats of BRCA1.

    PubMed

    Ray, Hind; Moreau, Karen; Dizin, Eva; Callebaut, Isabelle; Venezia, Nicole Dalla

    2006-06-16

    The tumour suppressor gene BRCA1 encodes a 220 kDa protein that participates in multiple cellular processes. The BRCA1 protein contains a tandem of two BRCT repeats at its carboxy-terminal region. The majority of disease-associated BRCA1 mutations affect this region and provide to the BRCT repeats a central role in the BRCA1 tumour suppressor function. The BRCT repeats have been shown to mediate phospho-dependant protein-protein interactions. They recognize phosphorylated peptides using a recognition groove that spans both BRCT repeats. We previously identified an interaction between the tandem of BRCA1 BRCT repeats and ACCA, which was disrupted by germ line BRCA1 mutations that affect the BRCT repeats. We recently showed that BRCA1 modulates ACCA activity through its phospho-dependent binding to ACCA. To delineate the region of ACCA that is crucial for the regulation of its activity by BRCA1, we searched for potential phosphorylation sites in the ACCA sequence that might be recognized by the BRCA1 BRCT repeats. Using sequence analysis and structure modelling, we proposed the Ser1263 residue as the most favourable candidate among six residues, for recognition by the BRCA1 BRCT repeats. Using experimental approaches, such as GST pull-down assay with Bosc cells, we clearly showed that phosphorylation of only Ser1263 was essential for the interaction of ACCA with the BRCT repeats. We finally demonstrated by immunoprecipitation of ACCA in cells, that the whole BRCA1 protein interacts with ACCA when phosphorylated on Ser1263.

  7. Inter-laboratory comparison of multi-locus variable-number tandem repeat analysis (MLVA) for verocytotoxin-producing Escherichia coli O157 to facilitate data sharing.

    PubMed

    Holmes, A; Perry, N; Willshaw, G; Hanson, M; Allison, L

    2015-01-01

    Multi-locus variable number tandem repeat analysis (MLVA) is used in clinical and reference laboratories for subtyping verocytotoxin-producing Escherichia coli O157 (VTEC O157). However, as yet there is no common allelic or profile nomenclature to enable laboratories to easily compare data. In this study, we carried out an inter-laboratory comparison of an eight-loci MLVA scheme using a set of 67 isolates of VTEC O157. We found all but two isolates were identical in profile in the two laboratories, and repeat units were homogeneous in size but some were incomplete. A subset of the isolates (n = 17) were sequenced to determine the actual copy number of representative alleles, thereby enabling alleles to be named according to international consensus guidelines. This work has enabled us to realize the potential of MLVA as a portable, highly discriminatory and convenient subtyping method.

  8. The Effective Mutation Rate at Y Chromosome Short Tandem Repeats, with Application to Human Population-Divergence Time

    PubMed Central

    Zhivotovsky, Lev A.; Underhill, Peter A.; Cinnioğlu, Cengiz; Kayser, Manfred; Morar, Bharti; Kivisild, Toomas; Scozzari, Rosaria; Cruciani, Fulvio; Destro-Bisol, Giovanni; Spedini, Gabriella; Chambers, Geoffrey K.; Herrera, Rene J.; Yong, Kiau Kiun; Gresham, David; Tournev, Ivailo; Feldman, Marcus W.; Kalaydjieva, Luba

    2004-01-01

    We estimate an effective mutation rate at an average Y chromosome short-tandem repeat locus as 6.9×10-4 per 25 years, with a standard deviation across loci of 5.7×10-4, using data on microsatellite variation within Y chromosome haplogroups defined by unique-event polymorphisms in populations with documented short-term histories, as well as comparative data on worldwide populations at both the Y chromosome and various autosomal loci. This value is used to estimate the times of the African Bantu expansion, the divergence of Polynesian populations (the Maoris, Cook Islanders, and Samoans), and the origin of Gypsy populations from Bulgaria. PMID:14691732

  9. Characterization and assessment of an avian repetitive DNA sequence as an icterid phylogenetic marker.

    PubMed

    Quinn, J S; Guglich, E; Seutin, G; Lau, R; Marsolais, J; Parna, L; Boag, P T; White, B N

    1992-02-01

    The first tandemly repeated sequence examined in a passerine bird, a 431-bp PstI fragment named pMAT1, has been cloned from the genome of the brown-headed cowbird (Molothrus ater). The sequence represents about 5-10% of the genome (about 4 x 10(5) copies) and yields prominent ethidium bromide stained bands when genomic DNA cut with a variety of restriction enzymes is electrophoresed in agarose gels. A particularly striking ladder of fragments is apparent when the DNA is cut with HinfI, indicative of a tandem arrangement of the monomer. The cloned PstI monomer has been sequenced, revealing no internal repeated structure. There are sequences that hybridize with pMAT1 found in related nine-primaried oscines but not in more distantly related oscines, suboscines, or nonpasserine species. Little sequence similarity to tandemly repeated PstI cut sequences from the merlin (Falco columbarius), saurus crane (Grus antigone), or Puerto Rican parrot (Amazona vittata) or to HinfI digested sequence from the Toulouse goose (Anser anser) was detected. The isolated sequence was used as a probe to examine DNA samples of eight members of the tribe Icterini. This examination revealed phylogenetically informative characters. The repeat contains cutting sites from a number of restriction enzymes, which, if sufficiently polymorphic, would provide new phylogenetic characters. Sequences like these, conserved within a species, but variable between closely related species, may be very useful for phylogenetic studies of closely related taxa.

  10. Characterization of the variable-number tandem repeats in vrrA from different Bacillus anthracis isolates

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jackson, P.J.; Walthers, E.A.; Richmond, K.L.

    1997-04-01

    PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats aremore » generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.« less

  11. Tandem Repeat Proteins Inspired By Squid Ring Teeth

    NASA Astrophysics Data System (ADS)

    Pena-Francesch, Abdon

    Proteins are large biomolecules consisting of long chains of amino acids that hierarchically assemble into complex structures, and provide a variety of building blocks for biological materials. The repetition of structural building blocks is a natural evolutionary strategy for increasing the complexity and stability of protein structures. However, the relationship between amino acid sequence, structure, and material properties of protein systems remains unclear due to the lack of control over the protein sequence and the intricacies of the assembly process. In order to investigate the repetition of protein building blocks, a recently discovered protein from squids is examined as an ideal protein system. Squid ring teeth are predatory appendages located inside the suction cups that provide a strong grasp of prey, and are solely composed of a group of proteins with tandem repetition of building blocks. The objective of this thesis is the understanding of sequence, structure and property relationship in repetitive protein materials inspired in squid ring teeth for the first time. Specifically, this work focuses on squid-inspired structural proteins with tandem repeat units in their sequence (i.e., repetition of alternating building blocks) that are physically cross-linked via beta-sheet structures. The research work presented here tests the hypothesis that, in these systems, increasing the number of building blocks in the polypeptide chain decreases the protein network defects and improves the material properties. Hence, the sequence, nanostructure, and properties (thermal, mechanical, and conducting) of tandem repeat squid-inspired protein materials are examined. Spectroscopic structural analysis, advanced materials characterization, and entropic elasticity theory are combined to elucidate the structure and material properties of these repetitive proteins. This approach is applied not only to native squid proteins but also to squid-inspired synthetic polypeptides that allow for a fine control of the sequence and network morphology. The results provided in this work establish a clear dependence between the repetitive building blocks, the network morphology, and the properties of squid-inspired repetitive protein materials. Increasing the number of tandem repeat units in SRT-inspired proteins led to more effective protein networks with superior properties. Through increasing tandem repetition and optimization of network morphology, highly efficient protein materials capable of withstanding deformations up to 400% of their original length, with MPa-GPa modulus, high energy absorption (50 MJ m-3), peak proton conductivity of 3.7 mS cm-1 (at pH 7, highest reported to date for biological materials), and peak thermal conductivity of 1.4 W m-1 K -1 (which exceeds that of most polymer materials) were developed. These findings introduce new design rules in the engineering of proteins based on tandem repetition and morphology control, and provide a novel framework for tailoring and optimizing the properties of protein-based materials.

  12. Molecular analysis of two phytohemagglutinin genes and their expression in Phaseolus vulgaris cv. Pinto, a lectin-deficient cultivar of the bean.

    PubMed

    Voelker, T A; Staswick, P; Chrispeels, M J

    1986-12-01

    Phytohemagglutinin (PHA), the seed lectin of the common bean, Phaseolus vulgaris, is encoded by two highly homologous, tandemly linked genes, dlec1 and dlec2, which are coordinately expressed at high levels in developing cotyledons. Their respective transcripts translate into closely related polypeptides, PHA-E and PHA-L, constituents of the tetrameric lectin which accumulates at high levels in developing seeds. In the bean cultivar Pinto UI111, PHA-E is not detectable, and PHA-L accumulates at very reduced levels. To investigate the cause of the Pinto phenotype, we cloned and sequenced the two PHA genes of Pinto, called Pdlec1 and Pdlec2, and determined the abundance of their respective mRNAs in developing cotyledons. Both genes are more than 90% homologous to the normal PHA genes found in other cultivars. Pdlec1 carries a 1-bp frameshift mutation close to the 5' end of its coding sequence. Only very truncated polypeptides could be made from its mRNA. The gene Pdlec2 encodes a polypeptide, which resembles PHA-L and its predicted amino acid sequence agrees with the available Pinto PHA amino acid sequence data. Analysis of the mRNA of developing cotyledons revealed that the Pdlec1 message is reduced 600-fold, and Pdlec2 mRNA is reduced 20-fold with respect to mRNA levels in normal cultivars. A comparison of the sequences which are upstream from the coding sequence shows that Pdlec2 has a 100-bp deletion compared to the other genes (dlec1, dlec2 and Pdlec1). This deletion which contains a large tandem repeat may be responsible for the low level of expression of Pdlec2. The very low expression of Pdlec1 is as yet unexplained.

  13. Stability of Tandem Repeats in the Drosophila Melanogaster HSR-Omega Nuclear RNA

    PubMed Central

    Hogan, N. C.; Slot, F.; Traverse, K. L.; Garbe, J. C.; Bendena, W. G.; Pardue, M. L.

    1995-01-01

    The Drosophila melanogaster Hsr-omega locus produces a nuclear RNA containing >5 kb of tandem repeat sequences. These repeats are unique to Hsr-omega and show concerted evolution similar to that seen with classical satellite DNAs. In D. melanogaster the monomer is ~280 bp. Sequences of 191/2 monomers differ by 8 +/- 5% (mean +/- SD), when all pairwise comparisons are considered. Differences are single nucleotide substitutions and 1-3 nucleotide deletions/insertions. Changes appear to be randomly distributed over the repeat unit. Outer repeats do not show the decrease in monomer homogeneity that might be expected if homogeneity is maintained by recombination. However, just outside the last complete repeat at each end, there are a few fragments of sequence similar to the monomer. The sequences in these flanking regions are not those predicted for sequences decaying in the absence of recombination. Instead, the fragmentation of the sequence homology suggests that flanking regions have undergone more severe disruptions, possibly during an insertion or amplification event. Hsr-omega alleles differing in the number of repeats are detected and appear to be stable over a few thousand generations; however, both increases and decreases in repeat numbers have been observed. The new alleles appear to be as stable as their predecessors. No alleles of less than ~5 kb nor more than ~16 kb of repeats were seen in any stocks examined. The evidence that there is a limit on the minimum number of repeats is consistent with the suggestion that these repeats are important in the function of the unusual Hsr-omega nuclear RNA. PMID:7540581

  14. Structure, organization, and sequence of alpha satellite DNA from human chromosome 17: evidence for evolution by unequal crossing-over and an ancestral pentamer repeat shared with the human X chromosome.

    PubMed

    Waye, J S; Willard, H F

    1986-09-01

    The centromeric regions of all human chromosomes are characterized by distinct subsets of a diverse tandemly repeated DNA family, alpha satellite. On human chromosome 17, the predominant form of alpha satellite is a 2.7-kilobase-pair higher-order repeat unit consisting of 16 alphoid monomers. We present the complete nucleotide sequence of the 16-monomer repeat, which is present in 500 to 1,000 copies per chromosome 17, as well as that of a less abundant 15-monomer repeat, also from chromosome 17. These repeat units were approximately 98% identical in sequence, differing by the exclusion of precisely 1 monomer from the 15-monomer repeat. Homologous unequal crossing-over is suggested as a probable mechanism by which the different repeat lengths on chromosome 17 were generated, and the putative site of such a recombination event is identified. The monomer organization of the chromosome 17 higher-order repeat unit is based, in part, on tandemly repeated pentamers. A similar pentameric suborganization has been previously demonstrated for alpha satellite of the human X chromosome. Despite the organizational similarities, substantial sequence divergence distinguishes these subsets. Hybridization experiments indicate that the chromosome 17 and X subsets are more similar to each other than to the subsets found on several other human chromosomes. We suggest that the chromosome 17 and X alpha satellite subsets may be related components of a larger alphoid subfamily which have evolved from a common ancestral repeat into the contemporary chromosome-specific subsets.

  15. MR-Tandem: parallel X!Tandem using Hadoop MapReduce on Amazon Web Services.

    PubMed

    Pratt, Brian; Howbert, J Jeffry; Tasman, Natalie I; Nilsson, Erik J

    2012-01-01

    MR-Tandem adapts the popular X!Tandem peptide search engine to work with Hadoop MapReduce for reliable parallel execution of large searches. MR-Tandem runs on any Hadoop cluster but offers special support for Amazon Web Services for creating inexpensive on-demand Hadoop clusters, enabling search volumes that might not otherwise be feasible with the compute resources a researcher has at hand. MR-Tandem is designed to drop in wherever X!Tandem is already in use and requires no modification to existing X!Tandem parameter files, and only minimal modification to X!Tandem-based workflows. MR-Tandem is implemented as a lightly modified X!Tandem C++ executable and a Python script that drives Hadoop clusters including Amazon Web Services (AWS) Elastic Map Reduce (EMR), using the modified X!Tandem program as a Hadoop Streaming mapper and reducer. The modified X!Tandem C++ source code is Artistic licensed, supports pluggable scoring, and is available as part of the Sashimi project at http://sashimi.svn.sourceforge.net/viewvc/sashimi/trunk/trans_proteomic_pipeline/extern/xtandem/. The MR-Tandem Python script is Apache licensed and available as part of the Insilicos Cloud Army project at http://ica.svn.sourceforge.net/viewvc/ica/trunk/mr-tandem/. Full documentation and a windows installer that configures MR-Tandem, Python and all necessary packages are available at this same URL. brian.pratt@insilicos.com

  16. Genetic variation and willingness to participate in epidemiologic research: data from three studies.

    PubMed

    Bhatti, Parveen; Sigurdson, Alice J; Wang, Sophia S; Chen, Jinbo; Rothman, Nathaniel; Hartge, Patricia; Bergen, Andrew W; Landi, Maria Teresa

    2005-10-01

    The differences in common genetic polymorphism frequencies by willingness to participate in epidemiologic studies are unexplored, but the same threats to internal validity operate as for studies with nongenetic information. We analyzed single nucleotide polymorphism genotypes, haplotypes, and short tandem repeats among control groups from three studies with different recruitment designs that included early, late, and never questionnaire responders, one or more participation incentives, and blood or buccal DNA collection. Among 2,955 individuals, we compared 108 genotypes, 8 haplotypes, and 9 to 15 short tandem repeats by respondent type. Among our main comparisons, single nucleotide polymorphism genotype frequencies differed significantly (P < 0.05) between respondent groups in six instances, with 13 expected by chance alone. When comparing the odds of carrying a variant among the various response groups, 19 odds ratios were /=1.40, levels that might be notably different. Among the various respondent group comparisons, haplotype and short tandem repeat frequencies were not significantly different by willingness to participate. We observed little evidence to suggest that genotype differences underlie response characteristics in molecular epidemiologic studies, but a greater variety of genes should be examined, including those related to behavioral traits potentially associated with willingness to participate. To the extent possible, investigators should evaluate their own genetic data for bias in response categories.

  17. Production of monoclonal antibody, PR81, recognizing the tandem repeat region of MUC1 mucin.

    PubMed

    Paknejad, M; Rasaee, M J; Tehrani, F Karami; Kashanian, S; Mohagheghi, M A; Omidfar, K; Bazl, M Rajabi

    2003-06-01

    A monoclonal antibody (MAb) was generated by immunizing BALB/c mice with homogenized breast cancerous tissues. This antibody (PR81) was found to be of IgG(1) class and subclass, containing kappa light chain. PR81 reacted with either the membrane extracts of several breast cancerous tissues or the cell surface of some MUC1 positive cell lines (MCF-7, BT-20 and T-47D) tested by enzyme immunoassay and for MCF-7 by immunofluorescence method. PR81 also reacted with two synthetic 27 and 16-amino acid peptides, TSA-P1-24 and A-P1-15, respectively, which included the core tandem repeat sequence of MUC1. However, this antibody did not react with a synthetic 14 amino acid peptide that has no similarity with tandem repeat found in MUC1. The generated antibody had good and similar affinities (2.19 x 10(8) M(-1)) toward TSA-P1-24 and A-P1-15, which are mainly shared in the hydrophilic sequence of PDTRPAP. Through Western blot analysis of homogenized breast tissues, PR81 recognized only a major band of 250 kDa. This band is stronger in malignant tissue than benign and normal tissues.

  18. Deep landscape update of dispersed and tandem repeats in the genome model of the red jungle fowl, Gallus gallus, using a series of de novo investigating tools.

    PubMed

    Guizard, Sébastien; Piégu, Benoît; Arensburger, Peter; Guillou, Florian; Bigot, Yves

    2016-08-19

    The program RepeatMasker and the database Repbase-ISB are part of the most widely used strategy for annotating repeats in animal genomes. They have been used to show that avian genomes have a lower repeat content (8-12 %) than the sequenced genomes of many vertebrate species (30-55 %). However, the efficiency of such a library-based strategies is dependent on the quality and completeness of the sequences in the database that is used. An alternative to these library based methods are methods that identify repeats de novo. These alternative methods have existed for a least a decade and may be more powerful than the library based methods. We have used an annotation strategy involving several complementary de novo tools to determine the repeat content of the model genome galGal4 (1.04 Gbp), including identifying simple sequence repeats (SSRs), tandem repeats and transposable elements (TEs). We annotated over one Gbp. of the galGal4 genome and showed that it is composed of approximately 19 % SSRs and TEs repeats. Furthermore, we estimate that the actual genome of the red jungle fowl contains about 31-35 % repeats. We find that library-based methods tend to overestimate TE diversity. These results have a major impact on the current understanding of repeats distributions throughout chromosomes in the red jungle fowl. Our results are a proof of concept of the reliability of using de novo tools to annotate repeats in large animal genomes. They have also revealed issues that will need to be resolved in order to develop gold-standard methodologies for annotating repeats in eukaryote genomes.

  19. New paradigm in ankyrin repeats: Beyond protein-protein interaction module.

    PubMed

    Islam, Zeyaul; Nagampalli, Raghavendra Sashi Krishna; Fatima, Munazza Tamkeen; Ashraf, Ghulam Md

    2018-04-01

    Classically, ankyrin repeat (ANK) proteins are built from tandems of two or more repeats and form curved solenoid structures that are associated with protein-protein interactions. These are short, widespread structural motif of around 33 amino acids repeats in tandem, having a canonical helix-loop-helix fold, found individually or in combination with other domains. The multiplicity of structural pattern enables it to form assemblies of diverse sizes, required for their abilities to confer multiple binding and structural roles of proteins. Three-dimensional structures of these repeats determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. Recent work on the ANK has proposed novel structural information, especially protein-lipid, protein-sugar and protein-protein interaction. Self-assembly of these repeats was also shown to prevent the associated protein in forming filaments. In this review, we summarize the latest findings and how the new structural information has increased our understanding of the structural determinants of ANK proteins. We discussed latest findings on how these proteins participate in various interactions to diversify the ANK roles in numerous biological processes, and explored the emerging and evolving field of designer ankyrins and its framework for protein engineering emphasizing on biotechnological applications. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Population-scale whole genome sequencing identifies 271 highly polymorphic short tandem repeats from Japanese population.

    PubMed

    Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao

    2018-05-01

    Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We have analyzed 843,473 STR loci with two to six basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to either the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we have also purified the suitable 218 STR loci with four basepair repeat units and 53 loci with five basepair repeat units both for short read sequencing and PCR based technologies, which would be candidates to the actual forensic DNA typing in Japanese population.

  1. Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.

    PubMed

    Fungtammasan, Arkarachai; Ananda, Guruprasad; Hile, Suzanne E; Su, Marcia Shu-Wei; Sun, Chen; Harris, Robert; Medvedev, Paul; Eckert, Kristin; Makova, Kateryna D

    2015-05-01

    Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using flank-based mapping, a computational pipeline that can detect the full spectrum of STR alleles from short-read data, can adapt to emerging read-mapping algorithms, and can be applied to heterogeneous genetic samples (e.g., tumors, viruses, and genomes of organelles). We used STR-FM to study STR error rates and patterns in publicly available human and in-house generated ultradeep plasmid sequencing data sets. We discovered that STRs sequenced with a PCR-free protocol have up to ninefold fewer errors than those sequenced with a PCR-containing protocol. We constructed an error correction model for genotyping STRs that can distinguish heterozygous alleles containing STRs with consecutive repeat numbers. Applying our model and pipeline to Illumina sequencing data with 100-bp reads, we could confidently genotype several disease-related long trinucleotide STRs. Utilizing this pipeline, for the first time we determined the genome-wide STR germline mutation rate from a deeply sequenced human pedigree. Additionally, we built a tool that recommends minimal sequencing depth for accurate STR genotyping, depending on repeat length and sequencing read length. The required read depth increases with STR length and is lower for a PCR-free protocol. This suite of tools addresses the pressing challenges surrounding STR genotyping, and thus is of wide interest to researchers investigating disease-related STRs and STR evolution. © 2015 Fungtammasan et al.; Published by Cold Spring Harbor Laboratory Press.

  2. A Hybrid Approach To Tandem Cylinder Noise

    NASA Technical Reports Server (NTRS)

    Lockard, David P.

    2004-01-01

    Aeolian tone generation from tandem cylinders is predicted using a hybrid approach. A standard computational fluid dynamics (CFD) code is used to compute the unsteady flow around the cylinders, and the acoustics are calculated using the acoustic analogy. The CFD code is nominally second order in space and time and includes several turbulence models, but the SST k - omega model is used for most of the calculations. Significant variation is observed between laminar and turbulent cases, and with changes in the turbulence model. A two-dimensional implementation of the Ffowcs Williams-Hawkings (FW-H) equation is used to predict the far-field noise.

  3. Revisiting the Plastid Phylogenomics of Pinaceae with Two Complete Plastomes of Pseudolarix and Tsuga

    PubMed Central

    Sudianto, Edi; Wu, Chung-Shien; Lin, Ching-Ping; Chaw, Shu-Miaw

    2016-01-01

    Phylogeny of the ten Pinaceous genera has long been contentious. Plastid genomes (plastomes) provide an opportunity to resolve this problem because they contain rich evolutionary information. To comprehend the plastid phylogenomics of all ten Pinaceous genera, we sequenced the plastomes of two previously unavailable genera, Pseudolarix amabilis (122,234 bp) and Tsuga chinensis (120,859 bp). Both plastomes share similar gene repertoire and order. Here for the first time we report a unique insertion of tandem repeats in accD of T. chinensis. From the 65 plastid protein-coding genes common to all Pinaceous genera, we re-examined the phylogenetic relationship among all Pinaceous genera. Our two phylogenetic trees are congruent in an identical tree topology, with the five genera of the Abietoideae subfamily constituting a monophyletic clade separate from the other three subfamilies: Pinoideae, Piceoideae, and Laricoideae. The five genera of Abietoideae were grouped into two sister clades consisting of (1) Cedrus alone and (2) two sister subclades of Pseudolarix—Tsuga and Abies—Keteleeria, with the former uniquely losing the gene psaM and the latter specifically excluding the 3 psbA from the residual inverted repeat. PMID:27352945

  4. Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese

    PubMed Central

    Ebstein, Richard P.; Monakhov, Mikhail V.; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

    2015-01-01

    Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal–conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. PMID:26246555

  5. Repeat-containing protein effectors of plant-associated organisms

    PubMed Central

    Mesarich, Carl H.; Bowen, Joanna K.; Hamiaux, Cyril; Templeton, Matthew D.

    2015-01-01

    Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs) that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regards to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms. PMID:26557126

  6. Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese.

    PubMed

    Ebstein, Richard P; Monakhov, Mikhail V; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

    2015-08-22

    Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal-conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. © 2015 The Author(s).

  7. Repeat-containing protein effectors of plant-associated organisms.

    PubMed

    Mesarich, Carl H; Bowen, Joanna K; Hamiaux, Cyril; Templeton, Matthew D

    2015-01-01

    Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs) that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regards to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms.

  8. Ehrlichia chaffeensis Tandem Repeat Proteins and Ank200 are Type 1 Secretion System Substrates Related to the Repeats-in-Toxin Exoprotein Family

    PubMed Central

    Wakeel, Abdul; den Dulk-Ras, Amke; Hooykaas, Paul J. J.; McBride, Jere W.

    2011-01-01

    Ehrlichia chaffeensis has type 1 and 4 secretion systems (T1SS and T4SS), but the substrates have not been identified. Potential substrates include secreted tandem repeat protein (TRP) 47, TRP120, and TRP32, and the ankyrin repeat protein, Ank200, that are involved in molecular host–pathogen interactions including DNA binding and a network of protein–protein interactions with host targets associated with signaling, transcriptional regulation, vesicle trafficking, and apoptosis. In this study we report that E. chaffeensis TRP47, TRP32, TRP120, and Ank200 were not secreted in the Agrobacterium tumefaciens Cre recombinase reporter assay routinely used to identify T4SS substrates. In contrast, all TRPs and the Ank200 proteins were secreted by the Escherichia coli complemented with the hemolysin secretion system (T1SS), and secretion was reduced in a T1SS mutant (ΔTolC), demonstrating that these proteins are T1SS substrates. Moreover, T1SS secretion signals were identified in the C-terminal domains of the TRPs and Ank200, and a detailed bioinformatic analysis of E. chaffeensis TRPs and Ank200 revealed features consistent with those described in the repeats-in-toxins (RTX) family of exoproteins, including glycine- and aspartate-rich tandem repeats, homology with ATP-transporters, a non-cleavable C-terminal T1SS signal, acidic pIs, and functions consistent with other T1SS substrates. Using a heterologous E. coli T1SS, this investigation has identified the first Ehrlichia T1SS substrates supporting the conclusion that the T1SS and corresponding substrates are involved in molecular host–pathogen interactions that contribute to Ehrlichia pathobiology. Further investigation of the relationship between Ehrlichia TRPs, Ank200, and the RTX exoprotein family may lead to a greater understanding of the importance of T1SS substrates and specific functions of T1SS in the pathobiology of obligately intracellular bacteria. PMID:22919588

  9. De novo generation of plant centromeres at tandem repeats.

    PubMed

    Teo, Chee How; Lermontova, Inna; Houben, Andreas; Mette, Michael Florian; Schubert, Ingo

    2013-06-01

    Artificial minichromosomes are highly desirable tools for basic research, breeding, and biotechnology purposes. We present an option to generate plant artificial minichromosomes via de novo engineering of plant centromeres in Arabidopsis thaliana by targeting kinetochore proteins to tandem repeat arrays at non-centromeric positions. We employed the bacterial lactose repressor/lactose operator system to guide derivatives of the centromeric histone H3 variant cenH3 to LacO operator sequences. Tethering of cenH3 to non-centromeric loci led to de novo assembly of kinetochore proteins and to dicentric carrier chromosomes which potentially form anaphase bridges. This approach will be further developed and may contribute to generating minichromosomes from preselected genomic regions, potentially even in a diploid background.

  10. Genetic characterization of the UCS and Kex1 loci of Pneumocystis jirovecii.

    PubMed

    Esteves, F; Tavares, A; Costa, M C; Gaspar, J; Antunes, F; Matos, O

    2009-02-01

    Nucleotide variation in the Pneumocystis jirovecii upstream conserved sequence (UCS) and kexin-like serine protease (Kex1) loci was studied in pulmonary specimens from Portuguese HIV-positive patients. DNA was extracted and used for specific molecular sequence analysis. The number of UCS tandem repeats detected in 13 successfully sequenced isolates ranged from three (9 isolates, 69%) to four (4 isolates, 31%). A novel tandem repeat pattern and two novel polymorphisms were detected in the UCS region. For the Kex1 gene, the wild-type (24 isolates, 86%) was the most frequent sequence detected among the 28 sequenced isolates. Nevertheless, a nonsynonymous (1 isolate, 3%) and three synonymous (3 isolates, 11%) polymorphisms were detected and are described here for the first time.

  11. High occurrence of mitochondrial heteroplasmy in nepalese indigenous sheep (Ovis aries) compared to chinese sheep.

    PubMed

    Gorkhali, Neena Amatya; Jiang, Lin; Shrestha, Bhola Shankar; He, Xiao-Hong; Junzhao, Qian; Han, Jian-Lin; Ma, Yue-Hui

    2016-07-01

    Heteroplasmy due to length polymorphism with tandem repeats in mtDNAs within individual was hardly studied in domestic animals. In the present study, we identified intra-individual length variation in the control region of mtDNAs in Nepalese sheep by molecular cloning and sequencing techniques. We observed one to four tandem repeats of a 75-bp nucleotide sequences in the mtDNA control region in 45% of the total Nepalese sheep sampled in contrast to the Chinese sheep, indicating that the heteroplasmy is specific to Nepalese sheep. The high rate of heteroplasmy in Nepalese sheep could be a resultant of the mtDNA mutation and independent segregation at intra-individual level or a strand slippage and mispairing during the replication.

  12. Accumulate repeat accumulate codes

    NASA Technical Reports Server (NTRS)

    Abbasfar, A.; Divsalar, D.; Yao, K.

    2004-01-01

    In this paper we propose an innovative channel coding scheme called Accumulate Repeat Accumulate codes. This class of codes can be viewed as trubo-like codes, namely a double serial concatenation of a rate-1 accumulator as an outer code, a regular or irregular repetition as a middle code, and a punctured accumulator as an inner code.

  13. The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng.

    PubMed

    Chen, Jinhui; Hao, Zhaodong; Xu, Haibin; Yang, Liming; Liu, Guangxin; Sheng, Yu; Zheng, Chen; Zheng, Weiwei; Cheng, Tielong; Shi, Jisen

    2015-01-01

    Metasequoia glyptostroboides Hu et Cheng is the only species in the genus Metasequoia Miki ex Hu et Cheng, which belongs to the Cupressaceae family. There were around 10 species in the Metasequoia genus, which were widely spread across the Northern Hemisphere during the Cretaceous of the Mesozoic and in the Cenozoic. M. glyptostroboides is the only remaining representative of this genus. Here, we report the complete chloroplast (cp) genome sequence and the cp genomic features of M. glyptostroboides. The M. glyptostroboides cp genome is 131,887 bp in length, with a total of 117 genes comprised of 82 protein-coding genes, 31 tRNA genes and four rRNA genes. In this genome, 11 forward repeats, nine palindromic repeats, and 15 tandem repeats were detected. A total of 188 perfect microsatellites were detected through simple sequence repeat (SSR) analysis and these were distributed unevenly within the cp genome. Comparison of the cp genome structure and gene order to those of several other land plants indicated that a copy of the inverted repeat (IR) region, which was found to be IR region A (IRA), was lost in the M. glyptostroboides cp genome. The five most divergent and five most conserved genes were determined and further phylogenetic analysis was performed among plant species, especially for related species in conifers. Finally, phylogenetic analysis demonstrated that M. glyptostroboides is a sister species to Cryptomeria japonica (L. F.) D. Don and to Taiwania cryptomerioides Hayata. The complete cp genome sequence information of M. glyptostroboides will be great helpful for further investigations of this endemic relict woody plant and for in-depth understanding of the evolutionary history of the coniferous cp genomes, especially for the position of M. glyptostroboides in plant systematics and evolution.

  14. The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng

    PubMed Central

    Chen, Jinhui; Hao, Zhaodong; Xu, Haibin; Yang, Liming; Liu, Guangxin; Sheng, Yu; Zheng, Chen; Zheng, Weiwei; Cheng, Tielong; Shi, Jisen

    2015-01-01

    Metasequoia glyptostroboides Hu et Cheng is the only species in the genus Metasequoia Miki ex Hu et Cheng, which belongs to the Cupressaceae family. There were around 10 species in the Metasequoia genus, which were widely spread across the Northern Hemisphere during the Cretaceous of the Mesozoic and in the Cenozoic. M. glyptostroboides is the only remaining representative of this genus. Here, we report the complete chloroplast (cp) genome sequence and the cp genomic features of M. glyptostroboides. The M. glyptostroboides cp genome is 131,887 bp in length, with a total of 117 genes comprised of 82 protein-coding genes, 31 tRNA genes and four rRNA genes. In this genome, 11 forward repeats, nine palindromic repeats, and 15 tandem repeats were detected. A total of 188 perfect microsatellites were detected through simple sequence repeat (SSR) analysis and these were distributed unevenly within the cp genome. Comparison of the cp genome structure and gene order to those of several other land plants indicated that a copy of the inverted repeat (IR) region, which was found to be IR region A (IRA), was lost in the M. glyptostroboides cp genome. The five most divergent and five most conserved genes were determined and further phylogenetic analysis was performed among plant species, especially for related species in conifers. Finally, phylogenetic analysis demonstrated that M. glyptostroboides is a sister species to Cryptomeria japonica (L. F.) D. Don and to Taiwania cryptomerioides Hayata. The complete cp genome sequence information of M. glyptostroboides will be great helpful for further investigations of this endemic relict woody plant and for in-depth understanding of the evolutionary history of the coniferous cp genomes, especially for the position of M. glyptostroboides in plant systematics and evolution. PMID:26136762

  15. One-way quantum repeaters with quantum Reed-Solomon codes

    NASA Astrophysics Data System (ADS)

    Muralidharan, Sreraman; Zou, Chang-Ling; Li, Linshu; Jiang, Liang

    2018-05-01

    We show that quantum Reed-Solomon codes constructed from classical Reed-Solomon codes can approach the capacity on the quantum erasure channel of d -level systems for large dimension d . We study the performance of one-way quantum repeaters with these codes and obtain a significant improvement in key generation rate compared to previously investigated encoding schemes with quantum parity codes and quantum polynomial codes. We also compare the three generations of quantum repeaters using quantum Reed-Solomon codes and identify parameter regimes where each generation performs the best.

  16. MR-Tandem: parallel X!Tandem using Hadoop MapReduce on Amazon Web Services

    PubMed Central

    Pratt, Brian; Howbert, J. Jeffry; Tasman, Natalie I.; Nilsson, Erik J.

    2012-01-01

    Summary: MR-Tandem adapts the popular X!Tandem peptide search engine to work with Hadoop MapReduce for reliable parallel execution of large searches. MR-Tandem runs on any Hadoop cluster but offers special support for Amazon Web Services for creating inexpensive on-demand Hadoop clusters, enabling search volumes that might not otherwise be feasible with the compute resources a researcher has at hand. MR-Tandem is designed to drop in wherever X!Tandem is already in use and requires no modification to existing X!Tandem parameter files, and only minimal modification to X!Tandem-based workflows. Availability and implementation: MR-Tandem is implemented as a lightly modified X!Tandem C++ executable and a Python script that drives Hadoop clusters including Amazon Web Services (AWS) Elastic Map Reduce (EMR), using the modified X!Tandem program as a Hadoop Streaming mapper and reducer. The modified X!Tandem C++ source code is Artistic licensed, supports pluggable scoring, and is available as part of the Sashimi project at http://sashimi.svn.sourceforge.net/viewvc/sashimi/trunk/trans_proteomic_pipeline/extern/xtandem/. The MR-Tandem Python script is Apache licensed and available as part of the Insilicos Cloud Army project at http://ica.svn.sourceforge.net/viewvc/ica/trunk/mr-tandem/. Full documentation and a windows installer that configures MR-Tandem, Python and all necessary packages are available at this same URL. Contact: brian.pratt@insilicos.com PMID:22072385

  17. Accumulate-Repeat-Accumulate-Accumulate Codes

    NASA Technical Reports Server (NTRS)

    Divsalar, Dariush; Dolinar, Samuel; Thorpe, Jeremy

    2007-01-01

    Accumulate-repeat-accumulate-accumulate (ARAA) codes have been proposed, inspired by the recently proposed accumulate-repeat-accumulate (ARA) codes. These are error-correcting codes suitable for use in a variety of wireless data-communication systems that include noisy channels. ARAA codes can be regarded as serial turbolike codes or as a subclass of low-density parity-check (LDPC) codes, and, like ARA codes they have projected graph or protograph representations; these characteristics make it possible to design high-speed iterative decoders that utilize belief-propagation algorithms. The objective in proposing ARAA codes as a subclass of ARA codes was to enhance the error-floor performance of ARA codes while maintaining simple encoding structures and low maximum variable node degree.

  18. Complete Mitochondrial Genome of the Red Fox (Vuples vuples) and Phylogenetic Analysis with Other Canid Species.

    PubMed

    Zhong, Hua-Ming; Zhang, Hong-Hai; Sha, Wei-Lai; Zhang, Cheng-De; Chen, Yu-Cai

    2010-04-01

    The whole mitochondrial genome sequence of red fox (Vuples vuples) was determined. It had a total length of 16 723 bp. As in most mammal mitochondrial genome, it contained 13 protein coding genes, two ribosome RNA genes, 22 transfer RNA genes and one control region. The base composition was 31.3% A, 26.1% C, 14.8% G and 27.8% T, respectively. The codon usage of red fox, arctic fox, gray wolf, domestic dog and coyote followed the same pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 3 gene in the red fox. A long tandem repeat rich in AC was found between conserved sequence block 1 and 2 in the control region. In order to confirm the phylogenetic relationships of red fox to other canids, phylogenetic trees were reconstructed by neighbor-joining and maximum parsimony methods using 12 concatenated heavy-strand protein-coding genes. The result indicated that arctic fox was the sister group of red fox and they both belong to the red fox-like clade in family Canidae, while gray wolf, domestic dog and coyote belong to wolf-like clade. The result was in accordance with existing phylogenetic results.

  19. Establishing a reference dataset for the authentication of spinal muscular atrophy cell lines using STR profiling and digital PCR.

    PubMed

    Stabley, Deborah L; Holbrook, Jennifer; Harris, Ashlee W; Swoboda, Kathryn J; Crawford, Thomas O; Sol-Church, Katia; Butchbach, Matthew E R

    2017-05-01

    Fibroblasts and lymphoblastoid cell lines (LCLs) derived from individuals with spinal muscular atrophy (SMA) have been and continue to be essential for translational SMA research. Authentication of cell lines helps ensure reproducibility and rigor in biomedical research. This quality control measure identifies mislabeling or cross-contamination of cell lines and prevents misinterpretation of data. Unfortunately, authentication of SMA cell lines used in various studies has not been possible because of a lack of a reference. In this study, we provide said reference so that SMA cell lines can be subsequently authenticated. We use short tandem repeat (STR) profiling and digital PCR (dPCR), which quantifies SMN1 and SMN2 copy numbers, to generate molecular identity codes for fibroblasts and LCLs that are commonly used in SMA research. Using these molecular identity codes, we clarify the familial relationships within a set of fibroblasts commonly used in SMA research. This study presents the first cell line reference set for the SMA research community and demonstrates its usefulness for re-identification and authentication of lines commonly used as in vitro models for future studies. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Axisymmetric Tandem Mirrors: Stabilization and Confinement Studies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Post, R.F.; Fowler, T.K.; Bulmer, R.

    2005-01-15

    The 'Kinetic Stabilizer' has been proposed as a means of MHD stabilizing an axisymmetric tandem mirror system. The K-S concept is based on theoretical studies by Ryutov, confirmed experimentally in the Gas Dynamic Trap experiment in Novosibirsk. In the K-S beams of ions are directed into the end of an 'expander' region outside the outer mirror of a tandem mirror. These ions, slowed, stagnated, and reflected as they move up the magnetic gradient, produce a low-density stabilizing plasma.At the Lawrence Livermore National Laboratory we have been conducting theoretical and computational studies of the K-S Tandem Mirror. These studies have employedmore » a low-beta code written especially to analyze the beam injection/stabilization process,and a new code SYMTRAN (by Hua and Fowler)that solves the coupled radial and axial particle and energy transport in a K-S T-M. Also, a 'legacy' MHD stability code, FLORA, has been upgraded and employed to benchmark the injection/stabilization code and to extend its results to high beta values.The FLORA code studies so far have confirmed the effectiveness of the K-S in stabilizing high-beta (40%) plasmas with stabilizer plasmas the peak pressures of which are several orders of magnitude smaller than those of the confined plasma.Also the SYMTRAN code has shown D-T plasma ignition from alpha particle energy deposition in T-M regimes with strong end plugging.Our studies have confirmed the viability of the K-S T-M concept with respect to MHD stability and radial and axial confinement. We are continuing these studies in order to optimize the parameters and to examine means for the stabilization of possible residual instability modes, such as drift modes and 'trapped-particle' modes. These modes may in principle be controlled by tailoring the stabilizer plasma distribution and/or the radial potential distribution.In the paper the results to date of our studies are summarized and projected to scope out possible fusion-power versions of the K-S T-M.« less

  1. Axisymmetric Tandem Mirrors: Stabilization and Confinement Studies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Post, R F; Fowler, T K; Bulmer, R

    2004-07-15

    The 'Kinetic Stabilizer' has been proposed as a means of MHD stabilizing an axisymmetric tandem mirror system. The K-S concept is based on theoretical studies by Ryutov, confirmed experimentally in the Gas Dynamic Trap experiment in Novosibirsk. In the K-S beams of ions are directed into the end of an 'expander' region outside the outer mirror of a tandem mirror. These ions, slowed, stagnated, and reflected as they move up the magnetic gradient, produce a low-density stabilizing plasma. At the Lawrence Livermore National Laboratory we have been conducting theoretical and computational studies of the K-S Tandem Mirror. These studies havemore » employed a low-beta code written especially to analyze the beam injection/stabilization process, and a new code SYMTRAN (by Hua and Fowler) that solves the coupled radial and axial particle and energy transport in a K-S TM. Also, a 'legacy' MHD stability code, FLORA, has been upgraded and employed to benchmark the injection/stabilization code and to extend its results to high beta values. The FLORA code studies so far have confirmed the effectiveness of the K-S in stabilizing high-beta (40%) plasmas with stabilizer plasmas the peak pressures of which are several orders of magnitude smaller than those of the confined plasma. Also the SYMTRAN code has shown D-T plasma ignition from alpha particle energy deposition in T-M regimes with strong end plugging. Our studies have confirmed the viability of the K-S-T-M concept with respect to MHD stability and radial and axial confinement. We are continuing these studies in order to optimize the parameters and to examine means for the stabilization of possible residual instability modes, such as drift modes and 'trapped-particle' modes. These modes may in principle be controlled by tailoring the stabilizer plasma distribution and/or the radial potential distribution. In the paper the results to date of our studies are summarized and projected to scope out possible fusion-power versions of the K-S T-M« less

  2. Association study of ERβ, AR, and CYP19A1 genes and MtF transsexualism.

    PubMed

    Fernández, Rosa; Esteva, Isabel; Gómez-Gil, Esther; Rumbo, Teresa; Almaraz, Mari Cruz; Roda, Ester; Haro-Mora, Juan-Jesús; Guillamón, Antonio; Pásaro, Eduardo

    2014-12-01

    The etiology of male-to-female (MtF) transsexualism is unknown. Both genetic and neurological factors may play an important role. To investigate the possible influence of the genetic factor on the etiology of MtF transsexualism. We carried out a cytogenetic and molecular analysis in 442 MtFs and 473 healthy, age- and geographical origin-matched XY control males. The karyotype was investigated by G-banding and by high-density array in the transsexual group. The molecular analysis involved three tandem variable regions of genes estrogen receptor β (ERβ) (CA tandem repeats in intron 5), androgen receptor (AR) (CAG tandem repeats in exon 1), and CYP19A1 (TTTA tandem repeats in intron 4). The allele and genotype frequencies, after division into short and long alleles, were obtained. We investigated the association between genotype and transsexualism by performing a molecular analysis of three variable regions of genes ERβ, AR, and CYP19A1 in 915 individuals (442 MtFs and 473 control males). Most MtFs showed an unremarkable 46,XY karyotype (97.96%). No specific chromosome aberration was associated with MtF transsexualism, and prevalence of aneuploidy (2.04%) was slightly higher than in the general population. Molecular analyses showed no significant difference in allelic or genotypic distribution of the genes examined between MtFs and controls. Moreover, molecular findings presented no evidence of an association between the sex hormone-related genes (ERβ, AR, and CYP19A1) and MtF transsexualism. The study suggests that the analysis of karyotype provides limited information in these subjects. Variable regions analyzed from ERβ, AR, and CYP19A1 are not associated with MtF transsexualism. Nevertheless, this does not exclude other polymorphic regions not analyzed. © 2014 International Society for Sexual Medicine.

  3. An annotated genetic map of loblolly pine based on microsatellite and cDNA markers

    USDA-ARS?s Scientific Manuscript database

    Previous loblolly pine (Pinus taeda L.) genetic linkage maps have been based on a variety of DNA polymorphisms, such as AFLPs, RAPDs, RFLPs, and ESTPs, but only a few SSRs (simple sequence repeats), also known as simple tandem repeats or microsatellites, have been mapped in P. taeda. The objective o...

  4. Salmonella enterica serotype enteritidis in French Polynesia, South Pacific, 2008-2013.

    PubMed

    Le Hello, Simon; Maillard, Fiona; Mallet, Henri-Pierre; Daudens, Elise; Levy, Marc; Roy, Valérie; Branaa, Philippe; Bertrand, Sophie; Fabre, Laetitia; Weill, François-Xavier

    2015-06-01

    Outbreaks of Salmonella enterica serotype Enteritidis infections associated with eggs occurred in French Polynesia during 2008-2013. Molecular analysis of isolates by using clustered regularly interspaced short palindromic repeat polymorphisms and multilocus variable-number tandem-repeat analysis was performed. This subtyping made defining the epidemic strain, finding the source, and decontaminating affected poultry flocks possible.

  5. Alpha particle confinement in tandem mirrors

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Devoto, R.S.; Ohnishi, M.; Kerns, J.

    1980-10-10

    Mechanisms leading to loss of alpha particles from non-axisymmetric tandem mirrors are considered. Stochastic diffusion due to bounce-drift resonances, which can cause rapid radial losses of high-energy alpha particles, can be suppressed by imposing a 20% rise in axisymmetric fields before the quadrupole transition sections. Alpha particles should then be well-confined until thermal energies when they enter the resonant plateau require. A fast code for computation of drift behavior in reactors is described. Sample calculations are presented for resonant particles in a proposed coil set for the Tandem Mirror Next Step.

  6. 6-mercaptopurine influences TPMT gene transcription in a TPMT gene promoter variable number of tandem repeats-dependent manner.

    PubMed

    Kotur, Nikola; Stankovic, Biljana; Kassela, Katerina; Georgitsi, Marianthi; Vicha, Anna; Leontari, Iliana; Dokmanovic, Lidija; Janic, Dragana; Krstovski, Nada; Klaassen, Kristel; Radmilovic, Milena; Stojiljkovic, Maja; Nikcevic, Gordana; Simeonidis, Argiris; Sivolapenko, Gregory; Pavlovic, Sonja; Patrinos, George P; Zukic, Branka

    2012-02-01

    TPMT activity is characterized by a trimodal distribution, namely low, intermediate and high methylator. TPMT gene promoter contains a variable number of GC-rich tandem repeats (VNTRs), namely A, B and C, ranging from three to nine repeats in length in an A(n)B(m)C architecture. We have previously shown that the VNTR architecture in the TPMT gene promoter affects TPMT gene transcription. MATERIALS, METHODS & RESULTS: Here we demonstrate, using reporter assays, that 6-mercaptopurine (6-MP) treatment results in a VNTR architecture-dependent decrease of TPMT gene transcription, mediated by the binding of newly recruited protein complexes to the TPMT gene promoter, upon 6-MP treatment. We also show that acute lymphoblastic leukemia patients undergoing 6-MP treatment display a VNTR architecture-dependent response to 6-MP. These data suggest that the TPMT gene promoter VNTR architecture can be potentially used as a pharmacogenomic marker to predict toxicity due to 6-MP treatment in acute lymphoblastic leukemia patients.

  7. Microsatellite diversity of isolates of the parasitic nematode Haemonchus contortus.

    PubMed

    Otsen, M; Plas, M E; Lenstra, J A; Roos, M H; Hoekstra, R

    2000-09-01

    The alarming development of anthelmintic resistance in important gastrointestinal nematode parasites of man and live-stock is caused by selection for specific genotypes. In order to provide genetic tools to study the nematode populations and the consequences of anthelmintic treatment, we isolated and sequenced 59 microsatellites of the sheep and goat parasite Haemonchus contortus. These microsatellites consist typically of 2-10 tandems CA/GT repeats that are interrupted by sequences of 1-10 bp. A predominant cause of the imperfect structure of the microsatellites appeared mutations of G/C bp in the tandem repeat. About 44% of the microsatellites were associated with the HcREP1 direct repeat, and it was demonstrated that a generic HcREP1 primer could be used to amplify HcREP1-associated microsatellites. Thirty microsatellites could be typed by polymerase chain reaction (PCR) of which 27 were polymorphic. A number of these markers were used to detect genetic contamination of an experimental inbred population. The microsatellites may also contribute to the genetic mapping of drug resistance genes.

  8. Effective application of multiple locus variable number of tandem repeats analysis to tracing Staphylococcus aureus in food-processing environment.

    PubMed

    Rešková, Z; Koreňová, J; Kuchta, T

    2014-04-01

    A total of 256 isolates of Staphylococcus aureus were isolated from 98 samples (34 swabs and 64 food samples) obtained from small or medium meat- and cheese-processing plants in Slovakia. The strains were genotypically characterized by multiple locus variable number of tandem repeats analysis (MLVA), involving multiplex polymerase chain reaction (PCR) with subsequent separation of the amplified DNA fragments by an automated flow-through gel electrophoresis. With the panel of isolates, MLVA produced 31 profile types, which was a sufficient discrimination to facilitate the description of spatial and temporal aspects of contamination. Further data on MLVA discrimination were obtained by typing a subpanel of strains by multiple locus sequence typing (MLST). MLVA coupled to automated electrophoresis proved to be an effective, comparatively fast and inexpensive method for tracing S. aureus contamination of food-processing factories. Subspecies genotyping of microbial contaminants in food-processing factories may facilitate identification of spatial and temporal aspects of the contamination. This may help to properly manage the process hygiene. With S. aureus, multiple locus variable number of tandem repeats analysis (MLVA) proved to be an effective method for the purpose, being sufficiently discriminative, yet comparatively fast and inexpensive. The application of automated flow-through gel electrophoresis to separation of DNA fragments produced by multiplex PCR helped to improve the accuracy and speed of the method. © 2013 The Society for Applied Microbiology.

  9. Maximal oxygen uptake is associated with allele -202 A of insulin-like growth factor binding protein-3 (IGFBP3) promoter polymorphism and (CA)n tandem repeats of insulin-like growth factor IGF1 in Caucasians from Poland.

    PubMed

    Gronek, Piotr; Holdys, Joanna; Kryściak, Jakub; Wieliński, Dariusz; Słomski, Ryszard

    2014-01-01

    Physical fitness is a trait determined by multiple genes, and its genetic basis is modified by numerous environmental factors. The present study examines the effects of the (CA)n tandem repeats polymorphism in IGFI gene and SNP Alw21I restriction site -202 A>C polymorphism in IGF1BP3 on VO2max--a physiological index of aerobic capacity of high heritability. The study sample consisted of 239 (154 male and 85 female) students of the University School of Physical Education in Poznań and athletes practicing various sports, including members of the Polish national team. An association was found between -202 A/C polymorphism of IGFBP3 gene with VO2max in men. Higher VO2max values were attained by men with CC genotype, especially male athletes practicing endurance sports and sports featuring energy metabolism of aerobic/anaerobic character. A statistically significant influence of allele 188 and genotype 188/188 of tandem repeats (CA)n polymorphism of IGF1 gene on VO2max was found in women. Also, lower values of maximal oxygen uptake were noted in individuals with allele 186 or genotype 186/186, and higher VO2max values in athletes with allele 194.

  10. Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

    PubMed

    Prajapati, Surendra K; Joshi, Hema; Carlton, Jane M; Rizvi, M Alam

    2013-01-01

    The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75) from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years) and long-term population history (79,235 to 104,008) of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes) to that inferred from mitochondrial genome diversity.

  11. Within-Host Evolution of Burkholderia pseudomallei in Four Cases of Acute Melioidosis

    PubMed Central

    Limmathurotsakul, Direk; Max, Tamara L.; Sarovich, Derek S.; Vogler, Amy J.; Dale, Julia L.; Ginther, Jennifer L.; Leadem, Benjamin; Colman, Rebecca E.; Foster, Jeffrey T.; Tuanyok, Apichai; Wagner, David M.; Peacock, Sharon J.; Pearson, Talima; Keim, Paul

    2010-01-01

    Little is currently known about bacterial pathogen evolution and adaptation within the host during acute infection. Previous studies of Burkholderia pseudomallei, the etiologic agent of melioidosis, have shown that this opportunistic pathogen mutates rapidly both in vitro and in vivo at tandemly repeated loci, making this organism a relevant model for studying short-term evolution. In the current study, B. pseudomallei isolates cultured from multiple body sites from four Thai patients with disseminated melioidosis were subjected to fine-scale genotyping using multilocus variable-number tandem repeat analysis (MLVA). In order to understand and model the in vivo variable-number tandem repeat (VNTR) mutational process, we characterized the patterns and rates of mutations in vitro through parallel serial passage experiments of B. pseudomallei. Despite the short period of infection, substantial divergence from the putative founder genotype was observed in all four melioidosis cases. This study presents a paradigm for examining bacterial evolution over the short timescale of an acute infection. Further studies are required to determine whether the mutational process leads to phenotypic alterations that impact upon bacterial fitness in vivo. Our findings have important implications for future sampling strategies, since colonies in a single clinical sample may be genetically heterogeneous, and organisms in a culture taken late in the infective process may have undergone considerable genetic change compared with the founder inoculum. PMID:20090837

  12. Analysis of the genome-wide variations among multiple strains of the plant pathogenic bacterium Xylella fastidiosa

    PubMed Central

    Doddapaneni, Harshavardhan; Yao, Jiqiang; Lin, Hong; Walker, M Andrew; Civerolo, Edwin L

    2006-01-01

    Background The Gram-negative, xylem-limited phytopathogenic bacterium Xylella fastidiosa is responsible for causing economically important diseases in grapevine, citrus and many other plant species. Despite its economic impact, relatively little is known about the genomic variations among strains isolated from different hosts and their influence on the population genetics of this pathogen. With the availability of genome sequence information for four strains, it is now possible to perform genome-wide analyses to identify and categorize such DNA variations and to understand their influence on strain functional divergence. Results There are 1,579 genes and 194 non-coding homologous sequences present in the genomes of all four strains, representing a 76. 2% conservation of the sequenced genome. About 60% of the X. fastidiosa unique sequences exist as tandem gene clusters of 6 or more genes. Multiple alignments identified 12,754 SNPs and 14,449 INDELs in the 1528 common genes and 20,779 SNPs and 10,075 INDELs in the 194 non-coding sequences. The average SNP frequency was 1.08 × 10-2 per base pair of DNA and the average INDEL frequency was 2.06 × 10-2 per base pair of DNA. On an average, 60.33% of the SNPs were synonymous type while 39.67% were non-synonymous type. The mutation frequency, primarily in the form of external INDELs was the main type of sequence variation. The relative similarity between the strains was discussed according to the INDEL and SNP differences. The number of genes unique to each strain were 60 (9a5c), 54 (Dixon), 83 (Ann1) and 9 (Temecula-1). A sub-set of the strain specific genes showed significant differences in terms of their codon usage and GC composition from the native genes suggesting their xenologous origin. Tandem repeat analysis of the genomic sequences of the four strains identified associations of repeat sequences with hypothetical and phage related functions. Conclusion INDELs and strain specific genes have been identified as the main source of variations among strains, with individual strains showing different rates of genome evolution. Based on these genome comparisons, it appears that the Pierce's disease strain Temecula-1 genome represents the ancestral genome of the X. fastidiosa. Results of this analysis are publicly available in the form of a web database. PMID:16948851

  13. Efficient production of artificially designed gelatins with a Bacillus brevis system.

    PubMed

    Kajino, T; Takahashi, H; Hirai, M; Yamada, Y

    2000-01-01

    Artificially designed gelatins comprising tandemly repeated 30-amino-acid peptide units derived from human alphaI collagen were successfully produced with a Bacillus brevis system. The DNA encoding the peptide unit was synthesized by taking into consideration the codon usage of the host cells, but no clones having a tandemly repeated gene were obtained through the above-mentioned strategy. Minirepeat genes could be selected in vivo from a mixture of every possible sequence encoding an artificial gelatin by randomly ligating the mixed sequence unit and transforming it into Escherichia coli. Larger repeat genes constructed by connecting minirepeat genes obtained by in vivo selection were also stable in the expression host cells. Gelatins derived from the eight-unit and six-unit repeat genes were extracellularly produced at the level of 0.5 g/liter and easily purified by ammonium sulfate fractionation and anion-exchange chromatography. The purified artificial gelatins had the predicted N-terminal sequences and amino acid compositions and a solgel property similar to that of the native gelatin. These results suggest that the selection of a repeat unit sequence stable in an expression host is a shortcut for the efficient production of repetitive proteins and that it can conveniently be achieved by the in vivo selection method. This study revealed the possible industrial application of artificially designed repetitive proteins.

  14. [Convergent origin of repeats in genes coding for globular proteins. An analysis of the factors determining the presence of inverted and symmetrical repeats].

    PubMed

    Solov'ev, V V; Kel', A E; Kolchanov, N A

    1989-01-01

    The factors, determining the presence of inverted and symmetrical repeats in genes coding for globular proteins, have been analysed. An interesting property of genetical code has been revealed in the analysis of symmetrical repeats: the pairs of symmetrical codons corresponded to pairs of amino acids with mostly similar physical-chemical parameters. This property may explain the presence of symmetrical repeats and palindromes only in genes coding for beta-structural proteins-polypeptides, where amino acids with similar physical-chemical properties occupy symmetrical positions. A stochastic model of evolution of polynucleotide sequences has been used for analysis of inverted repeats. The modelling demonstrated that only limiting of sequences (uneven frequencies of used codons) is enough for arising of nonrandom inverted repeats in genes.

  15. Comparison of simple sequence repeats in 19 Archaea.

    PubMed

    Trivedi, S

    2006-12-05

    All organisms that have been studied until now have been found to have differential distribution of simple sequence repeats (SSRs), with more SSRs in intergenic than in coding sequences. SSR distribution was investigated in Archaea genomes where complete chromosome sequences of 19 Archaea were analyzed with the program SPUTNIK to find di- to penta-nucleotide repeats. The number of repeats was determined for the complete chromosome sequences and for the coding and non-coding sequences. Different from what has been found for other groups of organisms, there is an abundance of SSRs in coding regions of the genome of some Archaea. Dinucleotide repeats were rare and CG repeats were found in only two Archaea. In general, trinucleotide repeats are the most abundant SSR motifs; however, pentanucleotide repeats are abundant in some Archaea. Some of the tetranucleotide and pentanucleotide repeat motifs are organism specific. In general, repeats are short and CG-rich repeats are present in Archaea having a CG-rich genome. Among the 19 Archaea, SSR density was not correlated with genome size or with optimum growth temperature. Pentanucleotide density had an inverse correlation with the CG content of the genome.

  16. De novo Transcriptome Sequencing Reveals a Considerable Bias in the Incidence of Simple Sequence Repeats towards the Downstream of ‘Pre-miRNAs’ of Black Pepper

    PubMed Central

    Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

    2013-01-01

    Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of ‘43 pre-miRNA candidates bearing different types of SSR motifs’. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted ‘pre-miRNA candidates bearing SSRs’. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted ‘pre-miRNA candidates’. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of ‘tandem repeats’ in miRNAs. PMID:23469176

  17. Chloroplast Genome Differences between Asian and American Equisetum arvense (Equisetaceae) and the Origin of the Hypervariable trnY-trnE Intergenic Spacer

    PubMed Central

    Kim, Hyoung Tae; Kim, Ki-Joong

    2014-01-01

    Comparative analyses of complete chloroplast (cp) DNA sequences within a species may provide clues to understand the population dynamics and colonization histories of plant species. Equisetum arvense (Equisetaceae) is a widely distributed fern species in northeastern Asia, Europe, and North America. The complete cp DNA sequences from Asian and American E. arvense individuals were compared in this study. The Asian E. arvense cp genome was 583 bp shorter than that of the American E. arvense. In total, 159 indels were observed between two individuals, most of which were concentrated on the hypervariable trnY-trnE intergenic spacer (IGS) in the large single-copy (LSC) region of the cp genome. This IGS region held a series of 19 bp repeating units. The numbers of the 19 bp repeat unit were responsible for 78% of the total length difference between the two cp genomes. Furthermore, only other closely related species of Equisetum also show the hypervariable nature of the trnY-trnE IGS. By contrast, only a single indel was observed in the gene coding regions: the ycf1 gene showed 24 bp differences between the two continental individuals due to a single tandem-repeat indel. A total of 165 single-nucleotide polymorphisms (SNPs) were recorded between the two cp genomes. Of these, 52 SNPs (31.5%) were distributed in coding regions, 13 SNPs (7.9%) were in introns, and 100 SNPs (60.6%) were in intergenic spacers (IGS). The overall difference between the Asian and American E. arvense cp genomes was 0.12%. Despite the relatively high genetic diversity between Asian and American E. arvense, the two populations are recognized as a single species based on their high morphological similarity. This indicated that the two regional populations have been in morphological stasis. PMID:25157804

  18. Transferability of short tandem repeat markers for two wild Canid species inhabiting the Brazilian Cerrado.

    PubMed

    Rodrigues, F M; Telles, M P C; Resende, L V; Soares, T N; Diniz-Filho, J A F; Jácomo, A T A; Silveira, L

    2006-12-13

    The maned wolf (Chrysocyon brachyurus) and the crab-eating fox (Cerdocyon thous) are two wild-canid species found in the Brazilian Cerrado. We tested cross-amplification and transferability of 29 short tandem repeat primers originally developed for cattle and domestic dogs and cats on 38 individuals of each of these two species, collected in the Emas National Park, which is the largest national park in the Cerrado region. Six of these primers were successfully transferred (CSSM-038, PEZ-05, PEZ-12, LOCO-13, LOCO-15, and PEZ-20); five of which were found to be polymorphic. Genetic parameter values (number of alleles per locus, observed and expected heterozygosities, and fixation indices) were within the expected range reported for canid populations worldwide.

  19. Population genetic study for 24 STR loci and Y indel (GlobalFiler™ PCR Amplification kit and PowerPlex® Fusion system) in 1000 Korean individuals.

    PubMed

    Park, Hyun-Chul; Kim, Kicheol; Nam, Younhyoung; Park, Jihye; Lee, Jinmyung; Lee, Hyehyeon; Kwon, Hansol; Jin, Hanjun; Kim, Wook; Kim, Won; Lim, Sikeun

    2016-07-01

    Allele frequencies for 23 autosomal short tandem repeat loci (D3S1358, vWA, D16S539, CSF1PO, TPOX, D8S1179, D21S11, D18S51, TH01, FGA, D5S818, D13S317, D7S820, D2S441, D19S433, D22S1045, D10S1248, D1S1656, D12S391, D2S1338, SE33, Penta D, Penta E), 1 Y-chromosome short tandem repeat locus (DYS391) and Y indel were obtained from 1000 unrelated individuals of the Korean population. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  20. Novel variable number of tandem repeats of gibbon MAOA gene and its evolutionary significance.

    PubMed

    Choi, Yuri; Jung, Yi-Deun; Ayarpadikannan, Selvam; Koga, Akihiko; Imai, Hiroo; Hirai, Hirohisa; Roos, Christian; Kim, Heui-Soo

    2014-08-01

    Variable number of tandem repeats (VNTRs) are scattered throughout the primate genome, and genetic variation of these VNTRs have been accumulated during primate radiation. Here, we analyzed VNTRs upstream of the monoamine oxidase A (MAOA) gene in 11 different gibbon species. An abundance of truncated VNTR sequences and copy number differences were observed compared to those of human VNTR sequences. To better understand the biological role of these VNTRs, a luciferase activity assay was conducted and results indicated that selected VNTR sequences of the MAOA gene from human and three different gibbon species (Hylobates klossii, Hylobates lar, and Nomascus concolor) showed silencing ability. Together, these data could be useful for understanding the evolutionary history and functional significance of MAOA VNTR sequences in gibbon species.

  1. [Reticulate evolution of parthenogenetic species of the Lacertidae rock lizards: inheritance of CLsat tandem repeats and anonymous RAPD markers].

    PubMed

    Chobanu, D; Rudykh, I A; Riabinina, N L; Grechko, V V; Kramerov, D A; Darevskiĭ, I S

    2002-01-01

    The genetic relatedness of several bisexual and of four unisexual "Lacerta saxicola complex" lizards was studied, using monomer sequences of the complex-specific CLsat tandem repeats and anonymous RAPD markers. Genomes of parthenospecies were shown to include different satellite monomers. The structure of each such monomer is specific for a certain pair of bisexual species. This fact might be interpreted in favor of co-dominant inheritance of these markers in bisexual species hybridogenesis. This idea is supported by the results obtained with RAPD markers; i.e., unisexual species genomes include only the loci characteristic of certain bisexual species. At the same time, in neither case parthenospecies possess specific, autoapomorphic loci that were not present in this or that bisexual species.

  2. Length and sequence variability in mitochondrial control region of the milkfish, Chanos chanos.

    PubMed

    Ravago, Rachel G; Monje, Virginia D; Juinio-Meñez, Marie Antonette

    2002-01-01

    Extensive length variability was observed in the mitochondrial control region of the milkfish, Chanos chanos. The nucleotide sequence of the control region and flanking regions was determined. Length variability and heteroplasmy was due to the presence of varying numbers of a 41-bp tandemly repeated sequence and a 48-bp insertion/deletion (indel). The structure and organization of the milkfish control region is similar to that of other teleost fish and vertebrates. However, extensive variation in the copy number of tandem repeats (4-20 copies) and the presence of a relatively large (48-bp) indel, are apparently uncommon in teleost fish control region sequences reported to date. High sequence variability of control region peripheral domains indicates the potential utility of selected regions as markers for population-level studies.

  3. Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin.

    PubMed Central

    Ananiev, E V; Phillips, R L; Rines, H W

    1998-01-01

    The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of specific regions, such as knobs, of individual maize chromosomes. A DNA hybridization blot panel of eight individual maize chromosome addition lines revealed that 180-bp repeats found in knobs are present in each of these maize chromosomes, but the copy number varies from approximately 100 to 25, 000. Cosmid clones with knob DNA segments were isolated from a genomic library of an oat-maize chromosome 9 addition line with the help of the 180-bp knob-associated repeated DNA sequence used as a probe. Cloned knob DNA segments revealed a complex organization in which blocks of tandemly arranged 180-bp repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. Sequence microheterogeneity including point mutations and duplications was found in copies of 180-bp repeats. The 180-bp repeats within an array all had the same polarity. Restriction maps constructed for 23 cloned knob DNA fragments revealed the positions of polymorphic sites and sites of integration of insertion elements. Discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes. PMID:9691055

  4. Antibiotic Susceptibility and Molecular Diversity of Bacillus anthracis Strains in Chad: Detection of a New Phylogenetic Subgroup

    PubMed Central

    Maho, Angaya; Rossano, Alexandra; Hächler, Herbert; Holzer, Anita; Schelling, Esther; Zinsstag, Jakob; Hassane, Mahamat H.; Toguebaye, Bhen S.; Akakpo, Ayayi J.; Van Ert, Matthew; Keim, Paul; Kenefic, Leo; Frey, Joachim; Perreten, Vincent

    2006-01-01

    We genotyped 15 Bacillus anthracis isolates from Chad, Africa, using multiple-locus variable-number tandem repeat analysis and three additional direct-repeat markers. We identified two unique genotypes that represent a novel genetic lineage in the A cluster. Chadian isolates were susceptible to 11 antibiotics and free of 94 antibiotic resistance genes. PMID:16954291

  5. Salmonella enterica Serotype Enteritidis in French Polynesia, South Pacific, 2008–2013

    PubMed Central

    Maillard, Fiona; Mallet, Henri-Pierre; Daudens, Elise; Levy, Marc; Roy, Valérie; Branaa, Philippe; Bertrand, Sophie; Fabre, Laetitia; Weill, François-Xavier

    2015-01-01

    Outbreaks of Salmonella enterica serotype Enteritidis infections associated with eggs occurred in French Polynesia during 2008–2013. Molecular analysis of isolates by using clustered regularly interspaced short palindromic repeat polymorphisms and multilocus variable-number tandem-repeat analysis was performed. This subtyping made defining the epidemic strain, finding the source, and decontaminating affected poultry flocks possible. PMID:25988406

  6. Is mammalian chromosomal evolution driven by regions of genome fragility?

    PubMed Central

    Ruiz-Herrera, Aurora; Castresana, Jose; Robinson, Terence J

    2006-01-01

    Background A fundamental question in comparative genomics concerns the identification of mechanisms that underpin chromosomal change. In an attempt to shed light on the dynamics of mammalian genome evolution, we analyzed the distribution of syntenic blocks, evolutionary breakpoint regions, and evolutionary breakpoints taken from public databases available for seven eutherian species (mouse, rat, cattle, dog, pig, cat, and horse) and the chicken, and examined these for correspondence with human fragile sites and tandem repeats. Results Our results confirm previous investigations that showed the presence of chromosomal regions in the human genome that have been repeatedly used as illustrated by a high breakpoint accumulation in certain chromosomes and chromosomal bands. We show, however, that there is a striking correspondence between fragile site location, the positions of evolutionary breakpoints, and the distribution of tandem repeats throughout the human genome, which similarly reflect a non-uniform pattern of occurrence. Conclusion These observations provide further evidence that certain chromosomal regions in the human genome have been repeatedly used in the evolutionary process. As a consequence, the genome is a composite of fragile regions prone to reorganization that have been conserved in different lineages, and genomic tracts that do not exhibit the same levels of evolutionary plasticity. PMID:17156441

  7. Rapid and high resolution genotyping of all Escherichia coli serotypes using 10 genomic repeat-containing loci.

    PubMed

    Løbersli, Inger; Haugum, Kjersti; Lindstedt, Bjørn-Arne

    2012-01-01

    Our laboratory has previously published two multiple-locus variable-number tandem-repeats analysis (MLVA) methods for rapid genotyping of Escherichia coli (E. coli), which are now in routine use for surveillance and outbreak detection. The first assay developed was specific for E. coli O157:H7; however this assay was not suitable for genotyping other E. coli serotypes. A new generic MLVA-assay was then developed with the capability of genotyping all E. coli serotypes. This generic E. coli MLVA (GECM7) was based on polymorphism in seven variable number of tandem repeats (VNTR) loci. GECM7 worked well with the majority of E. coli serotypes; however we wanted to increase the resolution for this method based in part of comparison with PFGE typing of E. coli O26:H11, where PFGE appeared to display higher resolution. The GECM7 method was improved by adding three new repeat-loci to a total of ten (GECM10), and a considerable increase in resolution was observed (from 296 to 507 genotypes on the same set of strains). Copyright © 2011 Elsevier B.V. All rights reserved.

  8. Complete sequence and analysis of the mitochondrial genome of Hemiselmis andersenii CCMP644 (Cryptophyceae).

    PubMed

    Kim, Eunsoo; Lane, Christopher E; Curtis, Bruce A; Kozera, Catherine; Bowman, Sharen; Archibald, John M

    2008-05-12

    Cryptophytes are an enigmatic group of unicellular eukaryotes with plastids derived by secondary (i.e., eukaryote-eukaryote) endosymbiosis. Cryptophytes are unusual in that they possess four genomes-a host cell-derived nuclear and mitochondrial genome and an endosymbiont-derived plastid and 'nucleomorph' genome. The evolutionary origins of the host and endosymbiont components of cryptophyte algae are at present poorly understood. Thus far, a single complete mitochondrial genome sequence has been determined for the cryptophyte Rhodomonas salina. Here, the second complete mitochondrial genome of the cryptophyte alga Hemiselmis andersenii CCMP644 is presented. The H. andersenii mtDNA is 60,553 bp in size and encodes 30 structural RNAs and 36 protein-coding genes, all located on the same strand. A prominent feature of the genome is the presence of a approximately 20 Kbp long intergenic region comprised of numerous tandem and dispersed repeat units of between 22-336 bp. Adjacent to these repeats are 27 copies of palindromic sequences predicted to form stable DNA stem-loop structures. One such stem-loop is located near a GC-rich and GC-poor region and may have a regulatory function in replication or transcription. The H. andersenii mtDNA shares a number of features in common with the genome of the cryptophyte Rhodomonas salina, including general architecture, gene content, and the presence of a large repeat region. However, the H. andersenii mtDNA is devoid of inverted repeats and introns, which are present in R. salina. Comparative analyses of the suite of tRNAs encoded in the two genomes reveal that the H. andersenii mtDNA has lost or converted its original trnK(uuu) gene and possesses a trnS-derived 'trnK(uuu)', which appears unable to produce a functional tRNA. Mitochondrial protein coding gene phylogenies strongly support a variety of previously established eukaryotic groups, but fail to resolve the relationships among higher-order eukaryotic lineages. Comparison of the H. andersenii and R. salina mitochondrial genomes reveals a number of cryptophyte-specific genomic features, most notably the presence of a large repeat-rich intergenic region. However, unlike R. salina, the H. andersenii mtDNA does not possess introns and lacks a Lys-tRNA, which is presumably imported from the cytosol.

  9. Complete Sequence and Analysis of the Mitochondrial Genome of Hemiselmis andersenii CCMP644 (Cryptophyceae)

    PubMed Central

    Kim, Eunsoo; Lane, Christopher E; Curtis, Bruce A; Kozera, Catherine; Bowman, Sharen; Archibald, John M

    2008-01-01

    Background Cryptophytes are an enigmatic group of unicellular eukaryotes with plastids derived by secondary (i.e., eukaryote-eukaryote) endosymbiosis. Cryptophytes are unusual in that they possess four genomes–a host cell-derived nuclear and mitochondrial genome and an endosymbiont-derived plastid and 'nucleomorph' genome. The evolutionary origins of the host and endosymbiont components of cryptophyte algae are at present poorly understood. Thus far, a single complete mitochondrial genome sequence has been determined for the cryptophyte Rhodomonas salina. Here, the second complete mitochondrial genome of the cryptophyte alga Hemiselmis andersenii CCMP644 is presented. Results The H. andersenii mtDNA is 60,553 bp in size and encodes 30 structural RNAs and 36 protein-coding genes, all located on the same strand. A prominent feature of the genome is the presence of a ~20 Kbp long intergenic region comprised of numerous tandem and dispersed repeat units of between 22–336 bp. Adjacent to these repeats are 27 copies of palindromic sequences predicted to form stable DNA stem-loop structures. One such stem-loop is located near a GC-rich and GC-poor region and may have a regulatory function in replication or transcription. The H. andersenii mtDNA shares a number of features in common with the genome of the cryptophyte Rhodomonas salina, including general architecture, gene content, and the presence of a large repeat region. However, the H. andersenii mtDNA is devoid of inverted repeats and introns, which are present in R. salina. Comparative analyses of the suite of tRNAs encoded in the two genomes reveal that the H. andersenii mtDNA has lost or converted its original trnK(uuu) gene and possesses a trnS-derived 'trnK(uuu)', which appears unable to produce a functional tRNA. Mitochondrial protein coding gene phylogenies strongly support a variety of previously established eukaryotic groups, but fail to resolve the relationships among higher-order eukaryotic lineages. Conclusion Comparison of the H. andersenii and R. salina mitochondrial genomes reveals a number of cryptophyte-specific genomic features, most notably the presence of a large repeat-rich intergenic region. However, unlike R. salina, the H. andersenii mtDNA does not possess introns and lacks a Lys-tRNA, which is presumably imported from the cytosol. PMID:18474103

  10. Maximum likelihood decoding analysis of accumulate-repeat-accumulate codes

    NASA Technical Reports Server (NTRS)

    Abbasfar, A.; Divsalar, D.; Yao, K.

    2004-01-01

    In this paper, the performance of the repeat-accumulate codes with (ML) decoding are analyzed and compared to random codes by very tight bounds. Some simple codes are shown that perform very close to Shannon limit with maximum likelihood decoding.

  11. Transonic Axial Splittered Rotor Tandem Stator Stage

    DTIC Science & Technology

    2016-12-01

    CODE 13. ABSTRACT (maximum 200 words) Development of a procedure to model the hot shape of a rotor blade and a comparison analysis of the transonic...fluid-structure interaction. Rotational forces as well as gas loading forces were observed as an influence on blade deformation. Utilizing the...Turbomachinery, splittered rotor, tandem stator, transonic compressor, blade deformation, fluid-structure interaction 15. NUMBER OF PAGES 87 16. PRICE

  12. Identification of Variable-Number Tandem-Repeat (VNTR) Sequences in Acinetobacter baumannii and Interlaboratory Validation of an Optimized Multiple-Locus VNTR Analysis Typing Scheme▿†

    PubMed Central

    Pourcel, Christine; Minandri, Fabrizia; Hauck, Yolande; D'Arezzo, Silvia; Imperi, Francesco; Vergnaud, Gilles; Visca, Paolo

    2011-01-01

    Acinetobacter baumannii is an important opportunistic pathogen responsible for nosocomial outbreaks, mostly occurring in intensive care units. Due to the multiplicity of infection sources, reliable molecular fingerprinting techniques are needed to establish epidemiological correlations among A. baumannii isolates. Multiple-locus variable-number tandem-repeat analysis (MLVA) has proven to be a fast, reliable, and cost-effective typing method for several bacterial species. In this study, an MLVA assay compatible with simple PCR- and agarose gel-based electrophoresis steps as well as with high-throughput automated methods was developed for A. baumannii typing. Preliminarily, 10 potential polymorphic variable-number tandem repeats (VNTRs) were identified upon bioinformatic screening of six annotated genome sequences of A. baumannii. A collection of 7 reference strains plus 18 well-characterized isolates, including unique types and representatives of the three international A. baumannii lineages, was then evaluated in a two-center study aimed at validating the MLVA assay and comparing it with other genotyping assays, namely, macrorestriction analysis with pulsed-field gel electrophoresis (PFGE) and PCR-based sequence group (SG) profiling. The results showed that MLVA can discriminate between isolates with identical PFGE types and SG profiles. A panel of eight VNTR markers was selected, all showing the ability to be amplified and good amounts of polymorphism in the majority of strains. Independently generated MLVA profiles, composed of an ordered string of allele numbers corresponding to the number of repeats at each VNTR locus, were concordant between centers. Typeability, reproducibility, stability, discriminatory power, and epidemiological concordance were excellent. A database containing information and MLVA profiles for several A. baumannii strains is available from http://mlva.u-psud.fr/. PMID:21147956

  13. Chloroplast genome of Aconitum barbatum var. puberulum (Ranunculaceae) derived from CCS reads using the PacBio RS platform.

    PubMed

    Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping

    2015-01-01

    The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum.

  14. Chloroplast genome of Aconitum barbatum var. puberulum (Ranunculaceae) derived from CCS reads using the PacBio RS platform

    PubMed Central

    Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping

    2015-01-01

    The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum. PMID:25705213

  15. Polymorphisms in TS, MTHFR and ERCC1 genes as predictive markers in first-line platinum and pemetrexed therapy in NSCLC patients.

    PubMed

    Krawczyk, Paweł; Kucharczyk, Tomasz; Kowalski, Dariusz M; Powrózek, Tomasz; Ramlau, Rodryg; Kalinka-Warzocha, Ewa; Winiarczyk, Kinga; Knetki-Wróblewska, Magdalena; Wojas-Krawczyk, Kamila; Kałakucka, Katarzyna; Dyszkiewicz, Wojciech; Krzakowski, Maciej; Milanowski, Janusz

    2014-12-01

    We presented retrospective analysis of up to five polymorphisms in TS, MTHFR and ERCC1 genes as molecular predictive markers for homogeneous Caucasian, non-squamous NSCLC patients treated with pemetrexed and platinum front-line chemotherapy. The following polymorphisms in DNA isolated from 115 patients were analyzed: various number of 28-bp tandem repeats in 5'-UTR region of TS gene, single nucleotide polymorphism (SNP) within the second tandem repeat of TS gene (G>C); 6-bp deletion in 3'-UTR region of the TS (1494del6); 677C>T SNP in MTHFR; 19007C>T SNP in ERCC1. Molecular examinations' results were correlated with disease control rate, progression-free survival (PFS) and overall survival. Polymorphic tandem repeat sequence (2R, 3R) in the enhancer region of TS gene and G>C SNP within the second repeat of 3R allele seem to be important for the effectiveness of platinum and pemetrexed in first-line chemotherapy. The insignificant shortening of PFS in 3R/3R homozygotes as compared to 2R/2R and 2R/3R genotypes were observed, while it was significantly shorter in patients carrying synchronous 3R allele and G nucleotide. The combined analysis of TS VNTR and MTHFR 677C>T SNP revealed shortening of PFS in synchronous carriers of 3R allele in TS and two C alleles in MTHFR. The strongest factors increased the risk of progression were poor PS, weight loss, anemia and synchronous presence of 3R allele and G nucleotide in the second repeat of 3R allele in TS. Moreover, lack of application of second-line chemotherapy, weight loss and poor performance status and above-mentioned genotype of TS gene increased risk of early mortality. The examined polymorphisms should be accounted as molecular predictor factors for pemetrexed- and platinum-based front-line chemotherapy in non-squamous NSCLC patients.

  16. Accumulate repeat accumulate codes

    NASA Technical Reports Server (NTRS)

    Abbasfar, Aliazam; Divsalar, Dariush; Yao, Kung

    2004-01-01

    In this paper we propose an innovative channel coding scheme called 'Accumulate Repeat Accumulate codes' (ARA). This class of codes can be viewed as serial turbo-like codes, or as a subclass of Low Density Parity Check (LDPC) codes, thus belief propagation can be used for iterative decoding of ARA codes on a graph. The structure of encoder for this class can be viewed as precoded Repeat Accumulate (RA) code or as precoded Irregular Repeat Accumulate (IRA) code, where simply an accumulator is chosen as a precoder. Thus ARA codes have simple, and very fast encoder structure when they representing LDPC codes. Based on density evolution for LDPC codes through some examples for ARA codes, we show that for maximum variable node degree 5 a minimum bit SNR as low as 0.08 dB from channel capacity for rate 1/2 can be achieved as the block size goes to infinity. Thus based on fixed low maximum variable node degree, its threshold outperforms not only the RA and IRA codes but also the best known LDPC codes with the dame maximum node degree. Furthermore by puncturing the accumulators any desired high rate codes close to code rate 1 can be obtained with thresholds that stay close to the channel capacity thresholds uniformly. Iterative decoding simulation results are provided. The ARA codes also have projected graph or protograph representation that allows for high speed decoder implementation.

  17. Differentiation of “Candidatus Liberibacter asiaticus” Isolates by Variable-Number Tandem-Repeat Analysis ▿

    PubMed Central

    Katoh, Hiroshi; Subandiyah, Siti; Tomimura, Kenta; Okuda, Mitsuru; Su, Hong-Ji; Iwanami, Toru

    2011-01-01

    Four highly polymorphic simple sequence repeat (SSR) loci were selected and used to differentiate 84 Japanese isolates of “Candidatus Liberibacter asiaticus.” The Nei's measure of genetic diversity values for these four SSRs ranged from 0.60 to 0.86. The four SSR loci were also highly polymorphic in four isolates from Taiwan and 12 isolates from Indonesia. PMID:21239554

  18. Revisiting the Plastid Phylogenomics of Pinaceae with Two Complete Plastomes of Pseudolarix and Tsuga.

    PubMed

    Sudianto, Edi; Wu, Chung-Shien; Lin, Ching-Ping; Chaw, Shu-Miaw

    2016-06-27

    Phylogeny of the ten Pinaceous genera has long been contentious. Plastid genomes (plastomes) provide an opportunity to resolve this problem because they contain rich evolutionary information. To comprehend the plastid phylogenomics of all ten Pinaceous genera, we sequenced the plastomes of two previously unavailable genera, Pseudolarix amabilis (122,234 bp) and Tsuga chinensis (120,859 bp). Both plastomes share similar gene repertoire and order. Here for the first time we report a unique insertion of tandem repeats in accD of T. chinensis From the 65 plastid protein-coding genes common to all Pinaceous genera, we re-examined the phylogenetic relationship among all Pinaceous genera. Our two phylogenetic trees are congruent in an identical tree topology, with the five genera of the Abietoideae subfamily constituting a monophyletic clade separate from the other three subfamilies: Pinoideae, Piceoideae, and Laricoideae. The five genera of Abietoideae were grouped into two sister clades consisting of (1) Cedrus alone and (2) two sister subclades of Pseudolarix-Tsuga and Abies-Keteleeria, with the former uniquely losing the gene psaM and the latter specifically excluding the 3 psbA from the residual inverted repeat. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. aPPRove: An HMM-Based Method for Accurate Prediction of RNA-Pentatricopeptide Repeat Protein Binding Events

    PubMed Central

    Harrison, Thomas; Ruiz, Jaime; Sloan, Daniel B.; Ben-Hur, Asa; Boucher, Christina

    2016-01-01

    Pentatricopeptide repeat containing proteins (PPRs) bind to RNA transcripts originating from mitochondria and plastids. There are two classes of PPR proteins. The P class contains tandem P-type motif sequences, and the PLS class contains alternating P, L and S type sequences. In this paper, we describe a novel tool that predicts PPR-RNA interaction; specifically, our method, which we call aPPRove, determines where and how a PLS-class PPR protein will bind to RNA when given a PPR and one or more RNA transcripts by using a combinatorial binding code for site specificity proposed by Barkan et al. Our results demonstrate that aPPRove successfully locates how and where a PPR protein belonging to the PLS class can bind to RNA. For each binding event it outputs the binding site, the amino-acid-nucleotide interaction, and its statistical significance. Furthermore, we show that our method can be used to predict binding events for PLS-class proteins using a known edit site and the statistical significance of aligning the PPR protein to that site. In particular, we use our method to make a conjecture regarding an interaction between CLB19 and the second intronic region of ycf3. The aPPRove web server can be found at www.cs.colostate.edu/~approve. PMID:27560805

  20. Complete chloroplast genome and 45S nrDNA sequences of the medicinal plant species Glycyrrhiza glabra and Glycyrrhiza uralensis.

    PubMed

    Kang, Sang-Ho; Lee, Jeong-Hoon; Lee, Hyun Oh; Ahn, Byoung Ohg; Won, So Youn; Sohn, Seong-Han; Kim, Jung Sun

    2017-10-06

    Glycyrrhiza uralensis and G. glabra, members of the Fabaceae, are medicinally important species that are native to Asia and Europe. Extracts from these plants are widely used as natural sweeteners because of their much greater sweetness than sucrose. In this study, the three complete chloroplast genomes and five 45S nuclear ribosomal (nr)DNA sequences of these two licorice species and an interspecific hybrid are presented. The chloroplast genomes of G. glabra, G. uralensis and G. glabra × G. uralensis were 127,895 bp, 127,716 bp and 127,939 bp, respectively. The three chloroplast genomes harbored 110 annotated genes, including 76 protein-coding genes, 30 tRNA genes and 4 rRNA genes. The 45S nrDNA sequences were either 5,947 or 5,948 bp in length. Glycyrrhiza glabra and G. glabra × G. uralensis showed two types of nrDNA, while G. uralensis contained a single type. The complete 45S nrDNA sequence unit contains 18S rRNA, ITS1, 5.8S rRNA, ITS2 and 26S rRNA. We identified simple sequence repeat and tandem repeat sequences. We also developed four reliable markers for analysis of Glycyrrhiza diversity authentication.

  1. Crystal structure of tandem type III fibronectin domains from Drosophila neuroglian at 2.0 A.

    PubMed

    Huber, A H; Wang, Y M; Bieber, A J; Bjorkman, P J

    1994-04-01

    We report the crystal structure of two adjacent fibronectin type III repeats from the Drosophila neural cell adhesion molecule neuroglian. Each domain consists of two antiparallel beta sheets and is folded topologically identically to single fibronectin type III domains from the extracellular matrix proteins tenascin and fibronectin. beta bulges and left-handed polyproline II helices disrupt the regular beta sheet structure of both neuroglian domains. The hydrophobic interdomain interface includes a metal-binding site, presumably involved in stabilizing the relative orientation between domains and predicted by sequence comparision to be present in the vertebrate homolog molecule L1. The neuroglian domains are related by a near perfect 2-fold screw axis along the longest molecular dimension. Using this relationship, a model for arrays of tandem fibronectin type III repeats in neuroglian and other molecules is proposed.

  2. Rapid carrier screening using short tandem repeats in the phenylalanine hydroxylase gene.

    PubMed

    Shawky, R M; el-Aleem, K A; Rifaat, M M; el-Naggar, R L; Marzouk, G M

    2002-01-01

    Phenylketonuria (PKU) is an autosomal recessive genetic disorder caused by defects in the phenylalanine hydroxylase (PAH) system. Our work aimed to screen the PAH locus for the presence of potentially useful short tandem repeats (STR) as markers for carrier detection in PKU families in Egypt, and to determine the level of PAH heterozygosity within the Egyptian population. The system contains at least eight independent alleles in the Egyptian population, transmitted in a Mendelian fashion. Variations in the number of STR in the 16 families studied gave rise to polymorphisms that proved to be suitable markers for PKU carrier detection and prenatal diagnosis. The most frequent allelic fragment size in PKU patients was 246 bp (35.7%), which together with a fragment of 254 bp accounted for 60.7% of the mutant chromosomes.

  3. Phylogenic analysis and forensic genetic characterization of Chinese Uyghur group via autosomal multi STR markers.

    PubMed

    Jin, Xiaoye; Wei, Yuanyuan; Chen, Jiangang; Kong, Tingting; Mu, Yuling; Guo, Yuxin; Dong, Qian; Xie, Tong; Meng, Haotian; Zhang, Meng; Li, Jianfei; Li, Xiaopeng; Zhu, Bofeng

    2017-09-26

    We investigated the allelic frequencies and forensic descriptive parameters of 23 autosomal short tandem repeat loci in a randomly selected sample of 1218 unrelated healthy Uyghur individuals residing in the Xinjiang Uyghur Autonomous Region, northwest China. A total of 281 alleles at these loci were identified and their corresponding allelic frequencies ranged from 0.0004 to 0.5390. The combined match probability and combined probability of exclusion of all loci were 5.192 × 10 -29 and 0.9999999996594, respectively. The results of population genetic study manifested that Uyghur had close relationships with those contiguous populations, such as Xibe and Hui groups. In a word, these autosomal short tandem repeat loci were highly informative in Uyghur group and the multiplex PCR system could be used as a valuable tool for forensic caseworks and population genetic analysis.

  4. Protein arginine methyltransferase 7 has a novel homodimer-like structure formed by tandem repeats.

    PubMed

    Hasegawa, Morio; Toma-Fukai, Sachiko; Kim, Jun-Dal; Fukamizu, Akiyoshi; Shimizu, Toshiyuki

    2014-05-21

    Protein arginine methyltransferase 7 (PRMT7) is a member of a family of enzymes that catalyze the transfer of methyl groups from S-adenosyl-l-methionine to nitrogen atoms on arginine residues. Here, we describe the crystal structure of Caenorhabditis elegans PRMT7 in complex with its reaction product S-adenosyl-L-homocysteine. The structural data indicated that PRMT7 harbors two tandem repeated PRMT core domains that form a novel homodimer-like structure. S-adenosyl-L-homocysteine bound to the N-terminal catalytic site only; the C-terminal catalytic site is occupied by a loop that inhibits cofactor binding. Mutagenesis demonstrated that only the N-terminal catalytic site of PRMT7 is responsible for cofactor binding. Copyright © 2014 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  5. Performance of the SNPforID 52 SNP-plex assay in paternity testing.

    PubMed

    Børsting, Claus; Sanchez, Juan J; Hansen, Hanna E; Hansen, Anders J; Bruun, Hanne Q; Morling, Niels

    2008-09-01

    The performance of a multiplex assay with 52 autosomal single nucleotide polymorphisms (SNPs) developed for human identification was tested on 124 mother-child-father trios. The typical paternity indices (PIs) were 10(5)-10(6) for the trios and 10(3)-10(4) for the child-father duos. Using the SNP profiles from the randomly selected trios and 700 previously typed individuals, a total of 83,096 comparisons between mother, child and an unrelated man were performed. On average, 9-10 mismatches per comparison were detected. Four mismatches were genetic inconsistencies and 5-6 mismatches were opposite homozygosities. In only two of the 83,096 comparisons did an unrelated man match perfectly to a mother-child duo, and in both cases the PI of the true father was much higher than the PI of the unrelated man. The trios were also typed for 15 short tandem repeats (STRs) and seven variable number of tandem repeats (VNTRs). The typical PIs based on 15 STRs or seven VNTRs were 5-50 times higher than the typical PIs based on 52 SNPs. Six mutations in tandem repeats were detected among the randomly selected trios. In contrast, there was not found any mutations in the SNP loci. The results showed that the 52 SNP-plex assay is a very useful alternative to currently used methods in relationship testing. The usefulness of SNP markers with low mutation rates in paternity and immigration casework is discussed.

  6. New Multilocus Variable-Number Tandem-Repeat Analysis Tool for Surveillance and Local Epidemiology of Bacterial Leaf Blight and Bacterial Leaf Streak of Rice Caused by Xanthomonas oryzae

    PubMed Central

    Poulin, L.; Grygiel, P.; Magne, M.; Rodriguez-R, L. M.; Forero Serna, N.; Zhao, S.; El Rafii, M.; Dao, S.; Tekete, C.; Wonni, I.; Koita, O.; Pruvost, O.; Verdier, V.; Vernière, C.

    2014-01-01

    Multilocus variable-number tandem-repeat analysis (MLVA) is efficient for routine typing and for investigating the genetic structures of natural microbial populations. Two distinct pathovars of Xanthomonas oryzae can cause significant crop losses in tropical and temperate rice-growing countries. Bacterial leaf streak is caused by X. oryzae pv. oryzicola, and bacterial leaf blight is caused by X. oryzae pv. oryzae. For the latter, two genetic lineages have been described in the literature. We developed a universal MLVA typing tool both for the identification of the three X. oryzae genetic lineages and for epidemiological analyses. Sixteen candidate variable-number tandem-repeat (VNTR) loci were selected according to their presence and polymorphism in 10 draft or complete genome sequences of the three X. oryzae lineages and by VNTR sequencing of a subset of loci of interest in 20 strains per lineage. The MLVA-16 scheme was then applied to 338 strains of X. oryzae representing different pathovars and geographical locations. Linkage disequilibrium between MLVA loci was calculated by index association on different scales, and the 16 loci showed linear Mantel correlation with MLSA data on 56 X. oryzae strains, suggesting that they provide a good phylogenetic signal. Furthermore, analyses of sets of strains for different lineages indicated the possibility of using the scheme for deeper epidemiological investigation on small spatial scales. PMID:25398857

  7. Multilocus variable-number tandem repeat analysis distinguishes outbreak and sporadic Escherichia coli O157:H7 isolates.

    PubMed

    Noller, Anna C; McEllistrem, M Catherine; Pacheco, Antonio G F; Boxrud, David J; Harrison, Lee H

    2003-12-01

    Escherichia coli O157:H7 is a major cause of food-borne illness in the United States. Outbreak detection involves traditional epidemiological methods and routine molecular subtyping by pulsed-field gel electrophoresis (PFGE). PFGE is labor-intensive, and the results are difficult to analyze and not easily transferable between laboratories. Multilocus variable-number tandem repeat (VNTR) analysis (MLVA) is a fast, portable method that analyzes multiple VNTR loci, which are areas of the bacterial genome that evolve quickly. Eighty isolates, including 21 isolates from five epidemiologically well-characterized outbreaks from Pennsylvania and Minnesota, were analyzed by PFGE and MLVA. Strains in PFGE clusters were defined as strains that differed by less than or equal to one band by using XbaI and the confirmatory enzyme SpeI. MLVA was performed by comparing the number of tandem repeats at seven loci. From 6 to 30 alleles were found at the seven loci, resulting in 64 MLVA types among the 80 isolates. MLVA correctly identified the isolates from all five outbreaks if only a single-locus variant was allowed. MLVA differentiated strains with unique PFGE types. Additionally, MLVA discriminated strains within PFGE-defined clusters that were not known to be part of an outbreak. In addition to being a simple and validated method for E. coli O157:H7 outbreak detection, MLVA appears to have a sensitivity equal to that of PFGE and a specificity superior to that of PFGE.

  8. Skewing of the genetic architecture at the ZMYM3 human-specific 5' UTR short tandem repeat in schizophrenia.

    PubMed

    Alizadeh, F; Bozorgmehr, A; Tavakkoly-Bazzaz, J; Ohadi, M

    2018-06-01

    Differential expansion of a number of human short tandem repeats (STRs) at the critical core promoter and 5' untranslated region (UTR) support the hypothesis that at least some of these STRs may provide a selective advantage in human evolution. Following a genome-wide screen of all human protein-coding gene 5' UTRs based on the Ensembl database ( http://www.ensembl.org ), we previously reported that the longest STR in this interval is a (GA) 32 , which belongs to the X-linked zinc finger MYM-type containing 3 (ZMYM3) gene. In the present study, we analyzed the evolutionary implication of this region across evolution and examined the allele and genotype distribution of the "exceptionally long" STR by direct sequencing of 486 Iranian unrelated male subjects consisting of 196 cases of schizophrenia (SCZ) and 290 controls. We found that the ZMYM3 transcript containing the STR is human-specific (ENST00000373998.5). A significant allele variance difference was observed between the cases and controls (Levene's test for equality of variances F = 4.00, p < 0.03). In addition, six alleles were observed in the SCZ patients that were not detected in the control group ("disease-only" alleles) (mid p exact < 0.0003). Those alleles were at the extreme short and long ends of the allele distribution curve and composed 4% of the genotypes in the SCZ group. In conclusion, we found skewing of the genetic architecture at the ZMYM3 STR in SCZ. Further, we found a bell-shaped distribution of alleles and selection against alleles at the extreme ends of this STR. The ZMYM3 STR sets a prototype, the evolutionary course of which determines the range of alleles in a particular species. Extreme "disease-only" alleles and genotypes may change our perspective of adaptive evolution and complex disorders. The ZMYM3 gene "exceptionally long" STR should be sequenced in SCZ and other human-specific phenotypes/characteristics.

  9. Maximum likelihood decoding analysis of Accumulate-Repeat-Accumulate Codes

    NASA Technical Reports Server (NTRS)

    Abbasfar, Aliazam; Divsalar, Dariush; Yao, Kung

    2004-01-01

    Repeat-Accumulate (RA) codes are the simplest turbo-like codes that achieve good performance. However, they cannot compete with Turbo codes or low-density parity check codes (LDPC) as far as performance is concerned. The Accumulate Repeat Accumulate (ARA) codes, as a subclass of LDPC codes, are obtained by adding a pre-coder in front of RA codes with puncturing where an accumulator is chosen as a precoder. These codes not only are very simple, but also achieve excellent performance with iterative decoding. In this paper, the performance of these codes with (ML) decoding are analyzed and compared to random codes by very tight bounds. The weight distribution of some simple ARA codes is obtained, and through existing tightest bounds we have shown the ML SNR threshold of ARA codes approaches very closely to the performance of random codes. We have shown that the use of precoder improves the SNR threshold but interleaving gain remains unchanged with respect to RA code with puncturing.

  10. Joint numerical study of the 2011 Tohoku-Oki tsunami: comparative propagation simulations and high resolution coastal models

    NASA Astrophysics Data System (ADS)

    Loevenbruck, Anne; Arpaia, Luca; Ata, Riadh; Gailler, Audrey; Hayashi, Yutaka; Hébert, Hélène; Heinrich, Philippe; Le Gal, Marine; Lemoine, Anne; Le Roy, Sylvestre; Marcer, Richard; Pedreros, Rodrigo; Pons, Kevin; Ricchiuto, Mario; Violeau, Damien

    2017-04-01

    This study is part of the joint actions carried out within TANDEM (Tsunamis in northern AtlaNtic: Definition of Effects by Modeling). This French project, mainly dedicated to the appraisal of coastal effects due to tsunami waves on the French coastlines, was initiated after the catastrophic 2011 Tohoku-Oki tsunami. This event, which tragically struck Japan, drew the attention to the importance of tsunami risk assessment, in particular when nuclear facilities are involved. As a contribution to this challenging task, the TANDEM partners intend to provide guidance for the French Atlantic area based on numerical simulation. One of the identified objectives consists in designing, adapting and validating simulation codes for tsunami hazard assessment. Besides an integral benchmarking workpackage, the outstanding database of the 2011 event offers the TANDEM partners the opportunity to test their numerical tools with a real case. As a prerequisite, among the numerous published seismic source models arisen from the inversion of the various available records, a couple of coseismic slip distributions have been selected to provide common initial input parameters for the tsunami computations. After possible adaptations or specific developments, the different codes are employed to simulate the Tohoku-Oki tsunami from its source to the northeast Japanese coastline. The results are tested against the numerous tsunami measurements and, when relevant, comparisons of the different codes are carried out. First, the results related to the oceanic propagation phase are compared with the offshore records. Then, the modeled coastal impacts are tested against the onshore data. Flooding at a regional scale is considered, but high resolution simulations are also performed with some of the codes. They allow examining in detail the runup amplitudes and timing, as well as the complexity of the tsunami interaction with the coastal structures. The work is supported by the Tandem project in the frame of French PIA grant ANR-11-RSNR-00023.

  11. Discordant expression and variable numbers of neighboring GGA- and GAA-rich triplet repeats in the 3' untranslated regions of two groups of messenger RNAs encoded by the rat polymeric immunoglobulin receptor gene.

    PubMed Central

    Koch, K S; Gleiberman, A S; Aoki, T; Leffert, H L; Feren, A; Jones, A L; Fodor, E J

    1995-01-01

    An unusual S1-nuclease sensitive microsatellite (STMS) has been found in the single copy, rat polymeric immunoglobulin receptor gene (PIGR) terminal exon. In Fisher rats, elements within or beyond the STMS are expressed variably in the 3' untranslated regions (3'UTRs) of two 'Groups' of PIGR-encoded hepatic mRNAs (pIg-R) during liver regeneration. STMS elements include neighboring constant regions (a 60-bp d[GA]-rich tract with a chi-like octamer, followed by 15 tandem d[GGA] repeats) that merge directly with 36 or 39 tandem d[GAA] repeats (Fisher or Wistar strains, respectively) interrupted by d[AA] between their 5th-6th repeat units. The Wistar STMS is flanked upstream by two regions of nearly contiguous d[CA] or d[CT] repeats in the 3' end of intron 8; and downstream, by a 283 bp 'unit' containing several inversions at its 5' end, and two polyadenylation signals at its 3' end. The 283 nt unit is expressed in Group 1 pIg-R mRNAs; but it is absent in the Group 2 family so that their GAA repeats merge with their poly A tails. In contrast to genomic sequence, GGA triplet repeats are amplified (n > or = 24-26), whereas GAA triplet repeats are truncated variably (n < or = 9-37) and expressed uninterruptedly in both mRNA Groups. These results suggest that 3' end processing of the rat PIGR gene may involve misalignment, slippage and premature termination of RNA polymerase II. The function of this unusual processing and possible roles of chi-like octamers in quiescent or extrahepatic tissues are discussed. Images PMID:7739889

  12. Effect of repeat copy number on variable-number tandem repeat mutations in Escherichia coli O157:H7.

    PubMed

    Vogler, Amy J; Keys, Christine; Nemoto, Yoshimi; Colman, Rebecca E; Jay, Zack; Keim, Paul

    2006-06-01

    Variable-number tandem repeat (VNTR) loci have shown a remarkable ability to discriminate among isolates of the recently emerged clonal pathogen Escherichia coli O157:H7, making them a very useful molecular epidemiological tool. However, little is known about the rates at which these sequences mutate, the factors that affect mutation rates, or the mechanisms by which mutations occur at these loci. Here, we measure mutation rates for 28 VNTR loci and investigate the effects of repeat copy number and mismatch repair on mutation rate using in vitro-generated populations for 10 E. coli O157:H7 strains. We find single-locus rates as high as 7.0 x 10(-4) mutations/generation and a combined 28-locus rate of 6.4 x 10(-4) mutations/generation. We observed single- and multirepeat mutations that were consistent with a slipped-strand mispairing mutation model, as well as a smaller number of large repeat copy number mutations that were consistent with recombination-mediated events. Repeat copy number within an array was strongly correlated with mutation rate both at the most mutable locus, O157-10 (r2= 0.565, P = 0.0196), and across all mutating loci. The combined locus model was significant whether locus O157-10 was included (r2= 0.833, P < 0.0001) or excluded (r2= 0.452, P < 0.0001) from the analysis. Deficient mismatch repair did not affect mutation rate at any of the 28 VNTRs with repeat unit sizes of >5 bp, although a poly(G) homomeric tract was destabilized in the mutS strain. Finally, we describe a general model for VNTR mutations that encompasses insertions and deletions, single- and multiple-repeat mutations, and their relative frequencies based upon our empirical mutation rate data.

  13. Sequence repeats and protein structure

    NASA Astrophysics Data System (ADS)

    Hoang, Trinh X.; Trovato, Antonio; Seno, Flavio; Banavar, Jayanth R.; Maritan, Amos

    2012-11-01

    Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.

  14. Selective intra-dinucleotide interactions and periodicities of bases separated by K sites: a new vision and tool for phylogeny analyses.

    PubMed

    Valenzuela, Carlos Y

    2017-02-13

    Direct tests of the random or non-random distribution of nucleotides on genomes have been devised to test the hypothesis of neutral, nearly-neutral or selective evolution. These tests are based on the direct base distribution and are independent of the functional (coding or non-coding) or structural (repeated or unique sequences) properties of the DNA. The first approach described the longitudinal distribution of bases in tandem repeats under the Bose-Einstein statistics. A huge deviation from randomness was found. A second approach was the study of the base distribution within dinucleotides whose bases were separated by 0, 1, 2… K nucleotides. Again an enormous difference from the random distribution was found with significances out of tables and programs. These test values were periodical and included the 16 dinucleotides. For example a high "positive" (more observed than expected dinucleotides) value, found in dinucleotides whose bases were separated by (3K + 2) sites, was preceded by two smaller "negative" (less observed than expected dinucleotides) values, whose bases were separated by (3K) or (3K + 1) sites. We examined mtDNAs, prokaryote genomes and some eukaryote chromosomes and found that the significant non-random interactions and periodicities were present up to 1000 or more sites of base separation and in human chromosome 21 until separations of more than 10 millions sites. Each nucleotide has its own significant value of its distance to neutrality; this yields 16 hierarchical significances. A three dimensional table with the number of sites of separation between the bases and the 16 significances (the third dimension is the dinucleotide, individual or taxon involved) gives directly an evolutionary state of the analyzed genome that can be used to obtain phylogenies. An example is provided.

  15. Submegabase Clusters of Unstable Tandem Repeats Unique to the Tla Region of Mouse T Haplotypes

    PubMed Central

    Uehara, H.; Ebersole, T.; Bennett, D.; Artzt, K.

    1990-01-01

    We describe here the identification and genomic organization of mouse t haplotype-specific elements (TSEs) 7.8 and 5.8 kb in length. The TSEs exist as submegabase-long clusters of tandem repeats localized in the Tla region of the major histocompatibility complex of all t haplotype chromosomes examined. In contrast, no such clusters were detected among 12 inbred strains of Mus musculus and other Mus species; thus, clusters of TSEs represent the first absolutely qualitative difference between t haplotypes and wild-type chromosomes. Pulsed field gel electrophoresis shows that the number of clusters, and the number of repeats in each cluster are extremely variable. Dramatic quantitative differences of TSEs uniquely distinguish every independent t haplotype from any other. The complete nucleotide sequence of one 7.8-kb TSE reveals significant homology to the ETn (a major transcript in the early embryo of the mouse), and some homologies to intracisternal A-particles and the mammary tumor virus env gene. Apart from the diagnostic relevance to t haplotypes, evolutionary and functional significances are discussed with respect to chromosome structure and genetic recombination. PMID:2076812

  16. Optimization of Analytical Conditions for a Rapid Determination of Aniline in Environmental Water by Liquid Chromatography/Tandem Mass Spectrometry.

    PubMed

    Furukawa, Koji; Hashimoto, Makoto; Kaneco, Satoshi

    2017-01-01

    A rapid determination of aniline in environmental water was examined based on liquid chromatography/tandem mass spectrometry (LC/MS/MS). Environmental water samples were diluted 20-fold with Mill-Q water and measured by LC/MS/MS after adding a surrogate substance (aniline-d 5 ). In the results of the present study, the calibration curve of aniline showed good linearity in the range of 0.05 - 2.0 μg/L. Since the RSD (repeatability) by measuring repeatedly an aniline standard solution (0.05 μg/L, n = 7) was 3.2%, the repeatability of this work was very excellent. In addition, the recovery rate of aniline in environmental water was in the range of 99.0 - 102% with RSD 3.4 - 7.7%, and very good recovery test results were obtained. From these results, this analytical method was confirmed to be effective for aniline measurements of environmental water samples. Also, it is possible to conduct rapid analyses of aniline in environmental water without any solid-phase extraction process, compared to the solid-phase extraction-GC/MS method.

  17. Accumulate Repeat Accumulate Coded Modulation

    NASA Technical Reports Server (NTRS)

    Abbasfar, Aliazam; Divsalar, Dariush; Yao, Kung

    2004-01-01

    In this paper we propose an innovative coded modulation scheme called 'Accumulate Repeat Accumulate Coded Modulation' (ARA coded modulation). This class of codes can be viewed as serial turbo-like codes, or as a subclass of Low Density Parity Check (LDPC) codes that are combined with high level modulation. Thus at the decoder belief propagation can be used for iterative decoding of ARA coded modulation on a graph, provided a demapper transforms the received in-phase and quadrature samples to reliability of the bits.

  18. Complete nucleotide sequence of pig (Sus scrofa) mitochondrial genome and dating evolutionary divergence within Artiodactyla.

    PubMed

    Lin, C S; Sun, Y L; Liu, C Y; Yang, P C; Chang, L C; Cheng, I C; Mao, S J; Huang, M C

    1999-08-05

    The complete nucleotide sequence of the pig (Sus scrofa) mitochondrial genome, containing 16613bp, is presented in this report. The genome is not a specific length because of the presence of the variable numbers of tandem repeats, 5'-CGTGCGTACA in the displacement loop (D-loop). Genes responsible for 12S and 16S rRNAs, 22 tRNAs, and 13 protein-coding regions are found. The genome carries very few intergenic nucleotides with several instances of overlap between protein-coding or tRNA genes, except in the D-loop region. For evaluating the possible evolutionary relationships between Artiodactyla and Cetacea, the nucleotide substitutions and amino acid sequences of 13 protein-coding genes were aligned by pairwise comparisons of the pig, cow, and fin whale. By comparing these sequences, we suggest that there is a closer relationship between the pig and cow than that between either of these species and fin whale. In addition, the accumulation of transversions and gaps in pig 12S and 16S rRNA genes was compared with that in other eutherian species, including cow, fin whale, human, horse, and harbor seal. The results also reveal a close phylogenetic relationship between pig and cow, as compared to fin whale and others. Thus, according to the sequence differences of mitochondrial rRNA genes in eutherian species, the evolutionary separation of pig and cow occurred about 53-60 million years ago.

  19. [Analysis of mitochondrial SNPs in addition to conventional STR-typing in a case of aggravated theft].

    PubMed

    Röper, Andrea; Reichert, Walter; Mattern, Rainer

    2007-01-01

    In the field of forensic DNA typing, the analysis of Short Tandem Repeats (STRs) can fail in cases of degraded DNA. The typing of coding region Single Nucleotide Polymorphisms (SNPs) of the mitochondrial genome provides an approach to acquire additional information. In the examined case of aggravated theft, both suspects could be excluded of having left the analyzed hair on the crime scene by SNP typing. This conclusion was not possible subsequent to STR typing. SNP typing of the trace on the torch light left on the crime scene increased the likelihood for suspect no. 2 to be the origin of this trace. This finding was already indicated by STR analysis. Suspect no. 1 was excluded for being the origin of this trace by SNP typing which was also indicated by STR analysis. A limiting factor for the analysis of SNPs is the maternal inheritance of mitochondrial DNA. Individualisation is not possible. In conclusion, it can be said that in the case of traces which cause problems with conventional STR typing the supplementary analysis of coding region SNPs from the mitochondrial genome is very reasonable and greatly contributes to the refinement of analysis methods in the field of forensic genetics.

  20. Accumulate-Repeat-Accumulate-Accumulate-Codes

    NASA Technical Reports Server (NTRS)

    Divsalar, Dariush; Dolinar, Sam; Thorpe, Jeremy

    2004-01-01

    Inspired by recently proposed Accumulate-Repeat-Accumulate (ARA) codes [15], in this paper we propose a channel coding scheme called Accumulate-Repeat-Accumulate-Accumulate (ARAA) codes. These codes can be seen as serial turbo-like codes or as a subclass of Low Density Parity Check (LDPC) codes, and they have a projected graph or protograph representation; this allows for a high-speed iterative decoder implementation using belief propagation. An ARAA code can be viewed as a precoded Repeat-and-Accumulate (RA) code with puncturing in concatenation with another accumulator, where simply an accumulator is chosen as the precoder; thus ARAA codes have a very fast encoder structure. Using density evolution on their associated protographs, we find examples of rate-lJ2 ARAA codes with maximum variable node degree 4 for which a minimum bit-SNR as low as 0.21 dB from the channel capacity limit can be achieved as the block size goes to infinity. Such a low threshold cannot be achieved by RA or Irregular RA (IRA) or unstructured irregular LDPC codes with the same constraint on the maximum variable node degree. Furthermore by puncturing the accumulators we can construct families of higher rate ARAA codes with thresholds that stay close to their respective channel capacity thresholds uniformly. Iterative decoding simulation results show comparable performance with the best-known LDPC codes but with very low error floor even at moderate block sizes.

  1. Molecular analysis of two phytohemagglutinin genes and their expression in Phaseolus vulgaris cv. Pinto, a lectin-deficient cultivar of the bean

    PubMed Central

    Voelker, Toni A.; Staswick, Paul; Chrispeels, Maarten J.

    1986-01-01

    Phytohemagglutinin (PHA), the seed lectin of the common bean, Phaseolus vulgaris, is encoded by two highly homologous, tandemly linked genes, dlec1 and dlec2, which are coordinately expressed at high levels in developing cotyledons. Their respective transcripts translate into closely related polypeptides, PHA-E and PHA-L, constituents of the tetrameric lectin which accumulates at high levels in developing seeds. In the bean cultivar Pinto UI111, PHA-E is not detectable, and PHA-L accumulates at very reduced levels. To investigate the cause of the Pinto phenotype, we cloned and sequenced the two PHA genes of Pinto, called Pdlec1 and Pdlec2, and determined the abundance of their respective mRNAs in developing cotyledons. Both genes are more than 90% homologous to the normal PHA genes found in other cultivars. Pdlec1 carries a 1-bp frameshift mutation close to the 5' end of its coding sequence. Only very truncated polypeptides could be made from its mRNA. The gene Pdlec2 encodes a polypeptide, which resembles PHA-L and its predicted amino acid sequence agrees with the available Pinto PHA amino acid sequence data. Analysis of the mRNA of developing cotyledons revealed that the Pdlec1 message is reduced 600-fold, and Pdlec2 mRNA is reduced 20-fold with respect to mRNA levels in normal cultivars. A comparison of the sequences which are upstream from the coding sequence shows that Pdlec2 has a 100-bp deletion compared to the other genes (dlec1, dlec2 and Pdlec1). This deletion which contains a large tandem repeat may be responsible for the low level of expression of Pdlec2. The very low expression of Pdlec1 is as yet unexplained. ImagesFig. 5. PMID:16453730

  2. An annotated genetic map of loblolly pine based on microsatellite and cDNA markers

    Treesearch

    Craig S. Echt; Surya Saha; Konstantin V. Krutovsky; Kokulapalan Wimalanathan; John E. Erpelding; Chun Liang; C Dana Nelson

    2011-01-01

    Previous loblolly pine (Pinus taeda L.) genetic linkage maps have been based on a variety of DNA polymorphisms, such as AFLPs, RAPDs, RFLPs, and ESTPs, but only a few SSRs (simple sequence repeats), also known as simple tandem repeats or microsatellites, have been mapped in P. taeda. The objective of this study was to integrate a large set of SSR markers from a variety...

  3. Novel protein domains and repeats in Drosophila melanogaster: insights into structure, function, and evolution.

    PubMed

    Ponting, C P; Mott, R; Bork, P; Copley, R R

    2001-12-01

    Sequence database searching methods such as BLAST, are invaluable for predicting molecular function on the basis of sequence similarities among single regions of proteins. Searches of whole databases however, are not optimized to detect multiple homologous regions within a single polypeptide. Here we have used the prospero algorithm to perform self-comparisons of all predicted Drosophila melanogaster gene products. Predicted repeats, and their homologs from all species, were analyzed further to detect hitherto unappreciated evolutionary relationships. Results included the identification of novel tandem repeats in the human X-linked retinitis pigmentosa type-2 gene product, repeated segments in cystinosin, associated with a defect in cystine transport, and 'nested' homologous domains in dysferlin, whose gene is mutated in limb girdle muscular dystrophy. Novel signaling domain families were found that may regulate the microtubule-based cytoskeleton and ubiquitin-mediated proteolysis, respectively. Two families of glycosyl hydrolases were shown to contain internal repetitions that hint at their evolution via a piecemeal, modular approach. In addition, three examples of fruit fly genes were detected with tandem exons that appear to have arisen via internal duplication. These findings demonstrate how completely sequenced genomes can be exploited to further understand the relationships between molecular structure, function, and evolution.

  4. [Family-based association study of a variable number of tandem repeat polymorphism of DAT1 gene with Tourette syndrome in a Chinese Han population].

    PubMed

    Zheng, Lanlan; Han, Zhen-liang; Zhang, Xin-hua; Wang, Xue-qin; Jiang, Wei-hua; Yi, Ming-ji; Liu, Shi-guo

    2013-10-01

    To assess the association of a 40 bp variable number of tandem repeat (VNTR) polymorphism within 3 untranslated region of dopamine transporter gene (DAT1) with Tourette syndrome (TS) in a Chinese Han population. A total of 160 TS patients and their parents were recruited. The VNTR polymorphism was detected with polymerase chain reaction-VNTR analysis, and its association with TS and its subtypes were assessed through a family-based association study comprising transmission disequilibrium test (TDT) and haplotype relative risk (HRR) analysis. The repeat numbers at the DAT1 40 bp locus were 11, 10, 9, 7.5 and 7 among the patients and their parents, with the most common type being a 10-repeat allele. No significant association was detected between the polymorphism and TS (TDT: X ² = 0.472, df = 1, P = 0.583; HRR: X ² = 0.313, P = 0.576, OR = 0.855, 95%CI: 0.493-1.481). Our data suggested that the VNTR polymorphism of DAT1 gene is not associated with susceptibility to TS in Chinese Han population. However, our results are to be validated in larger sets of patients collected from other populations.

  5. Reanalysis and revision of the complete mitochondrial genome of Rachycentron canadum (Teleostei, Perciformes, Rachycentridae).

    PubMed

    Musika, Jidapa; Khongchatee, Adison; Phinchongsakuldit, Jaros

    2014-08-01

    The complete mitochondrial genome of cobia, Rachycentron canadum, was reanalyzed and revised. The genome is 18,008 bp in length, containing 13 protein-coding genes, 2 ribosomal RNA (rRNA) genes, 22 transfer RNA (tRNA) genes, and a control region or displacement loop (D-loop). The gene arrangement is identical to that observed in most vertebrates. Base composition on the heavy strand is 30.14% A, 25.22% C, 15.80% G and 28.84% T. The D-loop region exhibits an A + T rich pattern, containing short tandem repeats of TATATACATGG, TATATGCACAA and TATATGCACGG. The mitochondrial genome studied differs from the previously published genome in two segments; the control region to 12S and ND5 to tRNA(Glu). The 12S sequence also differs from those published in the databases. Phylogeny analyses revealed that the differences could be due to errors in sequence assembly and/or sample misidentification of the previous studies.

  6. Complete mitochondrial genome of Skylark, Alauda arvensis (Aves: Passeriformes): the first representative of the family Alaudidae with two extensive heteroplasmic control regions.

    PubMed

    Qian, Chaoju; Wang, Yuanxiu; Guo, Zhichun; Yang, Jianke; Kan, Xianzhao

    2013-06-01

    The circular mitochondrial genome of Alauda arvensis is 17,018 bp in length, containing 13 protein-coding genes (PCGs), 2 ribosomal RNA genes, 22 transfer RNA (tRNA) genes, and 2 extensive heteroplasmic control regions. All of the genes encoded on the H-strand, with the exceptions of one PCG (nad6) and eight tRNA genes (tRNA(Gln), tRNA(Ala), tRNA(Asn), tRNA(Cys), tRNA(Tyr), tRNA(Ser(UCN)), tRNA(Pro), and tRNA(Glu)), as found in other birds' mitochondrial genomes. All of these PCGs are initiated with ATG, while stopped by six types of stop codons. All tRNA genes have the potential to fold into typical clover-leaf structure. Two extensive heteroplasmic control regions were found, and more interestingly, a minisatellite of 37 nucleotides (5'-TCAATCCCATTGATTTCATTATATTAGTATAAAGAAA-3') with 6 tandem repeats was detected at the end of CR2.

  7. The complete mitochondrial genome of the green lizard Lacerta viridis viridis (Reptilia: Lacertidae) and its phylogenetic position within squamate reptiles.

    PubMed

    Böhme, M U; Fritzsch, G; Tippmann, A; Schlegel, M; Berendonk, T U

    2007-06-01

    For the first time the complete mitochondrial genome was sequenced for a member of Lacertidae. Lacerta viridis viridis was sequenced in order to compare the phylogenetic relationships of this family to other reptilian lineages. Using the long-polymerase chain reaction (long PCR) we characterized a mitochondrial genome, 17,156 bp long showing a typical vertebrate pattern with 13 protein coding genes, 22 transfer RNAs (tRNA), two ribosomal RNAs (rRNA) and one major noncoding region. The noncoding region of L. v. viridis was characterized by a conspicuous 35 bp tandem repeat at its 5' terminus. A phylogenetic study including all currently available squamate mitochondrial sequences demonstrates the position of Lacertidae within a monophyletic squamate group. We obtained a narrow relationship of Lacertidae to Scincidae, Iguanidae, Varanidae, Anguidae, and Cordylidae. Although, the internal relationships within this group yielded only a weak resolution and low bootstrap support, the revealed relationships were more congruent with morphological studies than with recent molecular analyses.

  8. Tyms double (2R) and triple repeat (3R) confers risk for human oral squamous cell carcinoma.

    PubMed

    Bezerra, Alexandre Medeiros; Sant'Ana, Thalita Araújo; Gomes, Adriana Vieira; de Lacerda Vidal, Aurora Karla; Muniz, Maria Tereza Cartaxo

    2014-12-01

    The oral cancer is responsible for approximately 3 % of cases of cancer in Brazil. Epidemiological studies have associated low folate intake with an increased risk of epithelial cancers, including oral cancer. Folic acid has a key role in DNA synthesis, repair, methylation and this is the basis of explanations for a putative role for folic acid in cancer prevention. The role of folic acid in carcinogenesis may be modulated by polymorphism C677T in MTHFR and tandem repeats 2R/3R in the promoter site of TYMS gene that are related to decreased enzymatic activity and quantity and availability of the enzyme, respectively. These events cause a decrease in the synthesis, repair and DNA methylation, which can lead to a disruption in the expression of tumor suppressor genes as TP53. The objective of this study was investigate the distribution of polymorphisms C677T and tandem repeats 2R/3R associated with the development of oral squamous cell carcinoma (OSCC). 53 paraffin-embedded samples from patients who underwent surgery but are no longer at the institution and 43 samples collected by method of oral exfoliation by cytobrush were selected. 132 healthy subjects were selected by specialists at the dental clinics of the Faculdade de Odontologia de Pernambuco-FOP. The MTHFR genotyping was performed by PCR-RFLP, and the TYMS genotyping was performed by conventional PCR. Fisher's Exact test at significant level of 5 %. Odds ratios (ORs) and 95 % confidence intervals (CIs) were used to measure the strength of association between genotype frequency and OSCC development. The results were statistically significant for the tandem repeats of the TYMS gene (p = 0.015). The TYMS 2R3R genotype was significantly associated with the development of OSCC (OR = 3.582; 95 % CI 1.240-10.348; p = 0.0262) and also the genotype 3R3R (OR = 3.553; 95 % CI 1.293-9.760; p = 0.0345). When analyzed together, the TYMS 2R3R + 3R3R genotypes also showed association (OR = 3.518; 95 % CI 11.188-10.348; p = 0.0177). No differences for the MTHFR C677T polymorphisms distribution were found between the oral cancer patients and controls subjects in our study (p = 0.499). Therefore, these data suggest that determination of TYMS tandem repeats could provide information on the comprehension of the risk factors and prevention of the OSCC.

  9. Phylogenic analysis and forensic genetic characterization of Chinese Uyghur group via autosomal multi STR markers

    PubMed Central

    Jin, Xiaoye; Wei, Yuanyuan; Chen, Jiangang; Kong, Tingting; Mu, Yuling; Guo, Yuxin; Dong, Qian; Xie, Tong; Meng, Haotian; Zhang, Meng; Li, Jianfei; Li, Xiaopeng; Zhu, Bofeng

    2017-01-01

    We investigated the allelic frequencies and forensic descriptive parameters of 23 autosomal short tandem repeat loci in a randomly selected sample of 1218 unrelated healthy Uyghur individuals residing in the Xinjiang Uyghur Autonomous Region, northwest China. A total of 281 alleles at these loci were identified and their corresponding allelic frequencies ranged from 0.0004 to 0.5390. The combined match probability and combined probability of exclusion of all loci were 5.192 × 10−29 and 0.9999999996594, respectively. The results of population genetic study manifested that Uyghur had close relationships with those contiguous populations, such as Xibe and Hui groups. In a word, these autosomal short tandem repeat loci were highly informative in Uyghur group and the multiplex PCR system could be used as a valuable tool for forensic caseworks and population genetic analysis. PMID:29088750

  10. Molecular Investigation of Tularemia Outbreaks, Spain, 1997–2008

    PubMed Central

    Ariza-Miguel, Jaime; Johansson, Anders; Fernández-Natal, María Isabel; Martínez-Nistal, Carmen; Orduña, Antonio; Rodríguez-Ferri, Elías F.; Hernández, Marta

    2014-01-01

    Tularemia outbreaks occurred in northwestern Spain in 1997–1998 and 2007–2008 and affected >1,000 persons. We assessed isolates involved in these outbreaks by using pulsed-field gel electrophoresis with 2 restriction enzymes and multilocus variable number tandem repeat analysis of 16 genomic loci of Francisella tularensis, the cause of this disease. Isolates were divided into 3 pulsotypes by pulsed-field gel electrophoresis and 8 allelic profiles by multilocus variable number tandem repeat analysis. Isolates obtained from the second tularemia outbreak had the same genotypes as isolates obtained from the first outbreak. Both outbreaks were caused by genotypes of genetic subclade B.Br:FTNF002–00, which is widely distributed in countries in central and western Europe. Thus, reemergence of tularemia in Spain was not caused by the reintroduction of exotic strains, but probably by persistence of local reservoirs of infection. PMID:24750848

  11. Interpreting short tandem repeat variations in humans using mutational constraint

    PubMed Central

    Gymrek, Melissa; Willems, Thomas; Reich, David; Erlich, Yaniv

    2017-01-01

    Identifying regions of the genome that are depleted of mutations can reveal potentially deleterious variants. Short tandem repeats (STRs), also known as microsatellites, are among the largest contributors of de novo mutations in humans. However, per-locus studies of STR mutations have been limited to highly ascertained panels of several dozen loci. Here, we harnessed bioinformatics tools and a novel analytical framework to estimate mutation parameters for each STR in the human genome by correlating STR genotypes with local sequence heterozygosity. We applied our method to obtain robust estimates of the impact of local sequence features on mutation parameters and used this to create a framework for measuring constraint at STRs by comparing observed vs. expected mutation rates. Constraint scores identified known pathogenic variants with early onset effects. Our metric will provide a valuable tool for prioritizing pathogenic STRs in medical genetics studies. PMID:28892063

  12. Rare Sequence Variation in the Genome Flanking a Short Tandem Repeat Locus Can Lead to a Question of “Nonmaternity”

    PubMed Central

    Deucher, Anne; Chiang, Tsoyu; Schrijver, Iris

    2010-01-01

    Typing of STR (short tandem repeat) alleles is used in a variety of applications in clinical molecular pathology, including evaluations for maternal cell contamination. Using a commercially available STR typing assay for maternal cell contamination performed in conjunction with prenatal diagnostic testing, we were posed with apparent nonmaternity when the two fetal samples did not demonstrate the expected maternal allele at one locus. By designing primers external to the region amplified by the primers from the commercial assay and by performing direct sequencing of the resulting amplicon, we were able to determine that a guanine to adenine sequence variation led to primer mismatch and allele dropout. This explained the apparent null allele shared between the maternal and fetal samples. Therefore, although rare, allele dropout must be considered whenever unexplained homozygosity at an STR locus is observed. PMID:20203001

  13. Single-Stranded Condensation Stochastically Blocks G-Quadruplex Assembly in Human Telomeric RNA.

    PubMed

    Gutiérrez, Irene; Garavís, Miguel; de Lorenzo, Sara; Villasante, Alfredo; González, Carlos; Arias-Gonzalez, J Ricardo

    2018-05-17

    TERRA is an RNA molecule transcribed from human subtelomeric regions toward chromosome ends potentially involved in regulation of heterochromatin stability, semiconservative replication, and telomerase inhibition, among others. TERRA contains tandem repeats of the sequence GGGUUA, with a strong tendency to fold into a four-stranded arrangement known as a parallel G-quadruplex. Here, we demonstrate by using single-molecule force spectroscopy that this potential is limited by the inherent capacity of RNA to self-associate randomly and further condense into entropically more favorable structures. We stretched RNA constructions with more than four and less than eight hexanucleotide repeats, thus unable to form several G-quadruplexes in tandem, flanked by non-G-rich overhangs of random sequence by optical tweezers on a one by one basis. We found that condensed RNA stochastically blocks G-quadruplex folding pathways with a near 20% probability, a behavior that is not found in DNA analogous molecules.

  14. Heteroduplex analysis can increase the informativeness of PCR-amplified VNTR markers: Application using a marker tightly linked to the COL2A1 gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wilkin, D.J.; Cohn, D.H.; Koprivnikar, K.E.

    1993-02-01

    Variable number of tandem repeat (VNTR) polymorphism provide a high degree of informativeness in linkage studies. Whether performed by standard methods or by polymerase chain reaction (PCR), analysis of these markers involves assessment of the length of each allele. VNTR alleles usually differ in the number of tandem repeats. During PCR amplification of a VNTR closely linked to the type II collagen gene (COL2A1), we identified allelic microheterogeneity through the analysis of unique heteroduplexes between amplified strands of the two alleles. In one large pedigree, heteroduplex analysis identified only three distinct alleles. The identification of these heteroduplexes allowed the determinationmore » of the COL2A1 inheritance pattern in the family, which otherwise would have been noninformative. 26 refs., 3 figs.« less

  15. Allele frequency distribution for the variable number of tandem repeat locus D10S28 in Tamil Nadu (south India) population.

    PubMed

    Pandian, S K; Kumar, S; Krishnan, M; Dharmalingam, K; Damodaran, C

    1995-09-01

    Allele frequencies were determined in unrelated individuals of Tamil speaking population from the Madras City (Tamil Nadu, South India) area for the polymorphic DNA locus D10S28 using the probe TBQ7. Membranes hybridized with the probe YNH24 were subjected to deprobing and were subsequently hybridized with random priming - labeled, purified inserts of TBQ7. The sizes of the fragments were grouped to 100 bp as well as to arbitrary fixed bins (Federal Bureau of Investigation / Royal Canadian Mounted Police). There were 14 bins in the latter with the most common bin being 11 (1789-1924 bp) with a frequency of 9.8%. We observed a heterozygosity of 92% comparable to Caucasian populations. The data presented here can be used as the basis for utilizing this variable number of tandem repeats (TNTR) DNA marker for paternity determinations and forensic investigations.

  16. Transcriptome-wide effects of inverted SINEs on gene expression and their impact on RNA polymerase II activity.

    PubMed

    Tajaddod, Mansoureh; Tanzer, Andrea; Licht, Konstantin; Wolfinger, Michael T; Badelt, Stefan; Huber, Florian; Pusch, Oliver; Schopoff, Sandy; Janisiw, Michael; Hofacker, Ivo; Jantsch, Michael F

    2016-10-25

    Short interspersed elements (SINEs) represent the most abundant group of non-long-terminal repeat transposable elements in mammalian genomes. In primates, Alu elements are the most prominent and homogenous representatives of SINEs. Due to their frequent insertion within or close to coding regions, SINEs have been suggested to play a crucial role during genome evolution. Moreover, Alu elements within mRNAs have also been reported to control gene expression at different levels. Here, we undertake a genome-wide analysis of insertion patterns of human Alus within transcribed portions of the genome. Multiple, nearby insertions of SINEs within one transcript are more abundant in tandem orientation than in inverted orientation. Indeed, analysis of transcriptome-wide expression levels of 15 ENCODE cell lines suggests a cis-repressive effect of inverted Alu elements on gene expression. Using reporter assays, we show that the negative effect of inverted SINEs on gene expression is independent of known sensors of double-stranded RNAs. Instead, transcriptional elongation seems impaired, leading to reduced mRNA levels. Our study suggests that there is a bias against multiple SINE insertions that can promote intramolecular base pairing within a transcript. Moreover, at a genome-wide level, mRNAs harboring inverted SINEs are less expressed than mRNAs harboring single or tandemly arranged SINEs. Finally, we demonstrate a novel mechanism by which inverted SINEs can impact on gene expression by interfering with RNA polymerase II.

  17. A novel amino acid analysis method using derivatization of multiple functional groups followed by liquid chromatography/tandem mass spectrometry.

    PubMed

    Sakaguchi, Yohei; Kinumi, Tomoya; Yamazaki, Taichi; Takatsu, Akiko

    2015-03-21

    We have developed a novel amino acid analysis method using derivatization of multiple functional groups (amino, carboxyl, and phenolic hydroxyl groups). The amino, carboxyl, and phenolic hydroxyl groups of the amino acids were derivatized with 1-bromobutane so that the hydrophobicities and basicities of the amino acids were improved. The derivatized amino acids, including amino group-modified amino acids, could be detected with high sensitivity using liquid chromatography/tandem mass spectrometry (LC-MS/MS). In this study, 17 amino acids obtained by hydrolyzing proteins and 4 amino group-modified amino acids found in the human body (N,N-dimethylglycine, N-formyl-L-methionine, L-pyroglutamic acid, and sarcosine) were selected as target compounds. The 21 derivatized amino acids could be separated using an octadecyl-silylated silica column within 20 min and simultaneously detected. The detection limits for the 21 amino acids were 5.4-91 fmol, and the calibration curves were linear over the range of 10-100 nmol L(-1) (r(2) > 0.9984) with good repeatability. A confirmatory experiment showed that our proposed method could be applied to the determination of a protein certified reference material using the analysis of 12 amino acids combined with isotope dilution mass spectrometry. Furthermore, the proposed method was successfully applied to a stable isotope-coded derivatization method using 1-bromobutane and 1-bromobutane-4,4,4-d3 for comparative analysis of amino acids in human serum.

  18. Microevolution of Pandemic Vibrio parahaemolyticus Assessed by the Number of Repeat Units in Short Sequence Tandem Repeat Regions

    PubMed Central

    García, Katherine; Gavilán, Ronnie G.; Höfle, Manfred G.; Martínez-Urtaza, Jaime; Espejo, Romilio T.

    2012-01-01

    The emergence of the pandemic strain Vibrio parahaemolyticus O3:K6 in 1996 caused a large increase of diarrhea outbreaks related to seafood consumption in Southeast Asia, and later worldwide. Isolates of this strain constitutes a clonal complex, and their effectual differentiation is possible by comparison of their variable number tandem repeats (VNTRs). The differentiation of the isolates by the differences in VNTRs will allow inferring the population dynamics and microevolution of this strain but this requires knowing the rate and mechanism of VNTRs' variation. Our study of mutants obtained after serial cultivation of clones showed that mutation rates of the six VNTRs examined are on the order of 10−4 mutant per generation and that difference increases by stepwise addition of single mutations. The single stepwise mutation (SSM) was deduced because mutants with 1, 2, 3, or more repeat unit deletions or insertions follow a geometric distribution. Plausible phylogenetic trees are obtained when, according to SSM, the genetic distance between clusters with different number of repeats is assessed by the absolute differences in repeats. Using this approach, mutants originated from different isolates of pandemic V. parahaemolyticus after serial cultivation are clustered with their parental isolates. Additionally, isolates of pandemic V. parahaemolyticus from Southeast Asia, Tokyo, and northern and southern Chile are clustered according their geographical origin. The deepest split in these four populations is observed between the Tokyo and southern Chile populations. We conclude that proper phylogenetic relations and successful tracing of pandemic V. parahaemolyticus requires measuring the differences between isolates by the absolute number of repeats in the VNTRs considered. PMID:22292049

  19. Comparative genomics and repetitive sequence divergence in the species of diploid Nicotiana section Alatae.

    PubMed

    Lim, K Yoong; Kovarik, Ales; Matyasek, Roman; Chase, Mark W; Knapp, Sandra; McCarthy, Elizabeth; Clarkson, James J; Leitch, Andrew R

    2006-12-01

    Combining phylogenetic reconstructions of species relationships with comparative genomic approaches is a powerful way to decipher evolutionary events associated with genome divergence. Here, we reconstruct the history of karyotype and tandem repeat evolution in species of diploid Nicotiana section Alatae. By analysis of plastid DNA, we resolved two clades with high bootstrap support, one containing N. alata, N. langsdorffii, N. forgetiana and N. bonariensis (called the n = 9 group) and another containing N. plumbaginifolia and N. longiflora (called the n = 10 group). Despite little plastid DNA sequence divergence, we observed, via fluorescent in situ hybridization, substantial chromosomal repatterning, including altered chromosome numbers, structure and distribution of repeats. Effort was focussed on 35S and 5S nuclear ribosomal DNA (rDNA) and the HRS60 satellite family of tandem repeats comprising the elements HRS60, NP3R and NP4R. We compared divergence of these repeats in diploids and polyploids of Nicotiana. There are dramatic shifts in the distribution of the satellite repeats and complete replacement of intergenic spacers (IGSs) of 35S rDNA associated with divergence of the species in section Alatae. We suggest that sequence homogenization has replaced HRS60 family repeats at sub-telomeric regions, but that this process may not occur, or occurs more slowly, when the repeats are found at intercalary locations. Sequence homogenization acts more rapidly (at least two orders of magnitude) on 35S rDNA than 5S rDNA and sub-telomeric satellite sequences. This rapid rate of divergence is analogous to that found in polyploid species, and is therefore, in plants, not only associated with polyploidy.

  20. Genomic Sequence around Butterfly Wing Development Genes: Annotation and Comparative Analysis

    PubMed Central

    Conceição, Inês C.; Long, Anthony D.; Gruber, Jonathan D.; Beldade, Patrícia

    2011-01-01

    Background Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. Methodology/Principal Findings We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations) and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes). Conclusions The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1) the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2) the high conservation of non-coding sequence around the genes wingless and Ecdysone receptor, both involved in multiple developmental processes including wing pattern formation. PMID:21909358

  1. Optimization of a ΔE - E detector for 41Ca AMS

    NASA Astrophysics Data System (ADS)

    Hosoya, Seiji; Sasa, Kimikazu; Matsunaka, Tetsuya; Takahashi, Tsutomu; Matsumura, Masumi; Matsumura, Hiroshi; Sundquist, Mark; Stodola, Mark; Sueki, Keisuke

    2017-09-01

    A series of nuclides (14C, 26Al, and 36Cl) was measured using the 12UD Pelletron tandem accelerator before replacement by the horizontal 6 MV tandem accelerator at the University of Tsukuba Tandem Accelerator Complex (UTTAC). This paper considers the modification of the accelerator mass spectrometry (AMS) measurement parameters to suit the current 6 MV tandem accelerator setup (e.g., terminal voltage, detected ion charge state, gas pressure, and entrance window material in detector). The Particle and Heavy Ion Transport code System (PHITS) was also used to simulate AMS measurement to determine the best conditions to suppress isobaric interference. The spectra of 41Ca and 41K were then successfully separated and their nuclear spectra were identified; the system achieved a background level of 41Ca/40Ca ∼ 6 ×10-14 .

  2. Molecular Dynamics Simulations of DNA-Free and DNA-Bound TAL Effectors

    PubMed Central

    Wan, Hua; Hu, Jian-ping; Li, Kang-shun; Tian, Xu-hong; Chang, Shan

    2013-01-01

    TAL (transcriptional activator-like) effectors (TALEs) are DNA-binding proteins, containing a modular central domain that recognizes specific DNA sequences. Recently, the crystallographic studies of TALEs revealed the structure of DNA-recognition domain. In this article, molecular dynamics (MD) simulations are employed to study two crystal structures of an 11.5-repeat TALE, in the presence and absence of DNA, respectively. The simulated results indicate that the specific binding of RVDs (repeat-variable diresidues) with DNA leads to the markedly reduced fluctuations of tandem repeats, especially at the two ends. In the DNA-bound TALE system, the base-specific interaction is formed mainly by the residue at position 13 within a TAL repeat. Tandem repeats with weak RVDs are unfavorable for the TALE-DNA binding. These observations are consistent with experimental studies. By using principal component analysis (PCA), the dominant motions are open-close movements between the two ends of the superhelical structure in both DNA-free and DNA-bound TALE systems. The open-close movements are found to be critical for the recognition and binding of TALE-DNA based on the analysis of free energy landscape (FEL). The conformational analysis of DNA indicates that the 5′ end of DNA target sequence has more remarkable structural deformability than the other sites. Meanwhile, the conformational change of DNA is likely associated with the specific interaction of TALE-DNA. We further suggest that the arrangement of N-terminal repeats with strong RVDs may help in the design of efficient TALEs. This study provides some new insights into the understanding of the TALE-DNA recognition mechanism. PMID:24130757

  3. Expanded complexity of unstable repeat diseases

    PubMed Central

    Polak, Urszula; McIvor, Elizabeth; Dent, Sharon Y.R.; Wells, Robert D.; Napierala, Marek

    2015-01-01

    Unstable Repeat Diseases (URDs) share a common mutational phenomenon of changes in the copy number of short, tandemly repeated DNA sequences. More than 20 human neurological diseases are caused by instability, predominantly expansion, of microsatellite sequences. Changes in the repeat size initiate a cascade of pathological processes, frequently characteristic of a unique disease or a small subgroup of the URDs. Understanding of both the mechanism of repeat instability and molecular consequences of the repeat expansions is critical to developing successful therapies for these diseases. Recent technological breakthroughs in whole genome, transcriptome and proteome analyses will almost certainly lead to new discoveries regarding the mechanisms of repeat instability, the pathogenesis of URDs, and will facilitate development of novel therapeutic approaches. The aim of this review is to give a general overview of unstable repeats diseases, highlight the complexities of these diseases, and feature the emerging discoveries in the field. PMID:23233240

  4. User's manual for the FLORA equilibrium and stability code

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Freis, R.P.; Cohen, B.I.

    1985-04-01

    This document provides a user's guide to the content and use of the two-dimensional axisymmetric equilibrium and stability code FLORA. FLORA addresses the low-frequency MHD stability of long-thin axisymmetric tandem mirror systems with finite pressure and finite-larmor-radius effects. FLORA solves an initial-value problem for interchange, rotational, and ballooning stability.

  5. Immunogenicity of a recombinant fusion protein of tandem repeat epitopes of foot-and-mouth disease virus type Asia 1 for guinea pigs.

    PubMed

    Zhang, Q; Yang, Y Q; Zhang, Z Y; Li, L; Yan, W Y; Jiang, W J; Xin, A G; Lei, C X; Zheng, Z X

    2002-01-01

    In this study, the sequences of capsid protein VPI regions of YNAs1.1 and YNAs1.2 isolates of foot-and-mouth disease virus (FMDV) were analyzed and a peptide containing amino acids (aa) 133-158 of VP1 and aa 20-34 of VP4 of FMDV type Asia I was assumed to contain B and T cell epitopes, because it is hypervariable and includes a cell attachment site RGD located in the G-H loop. The DNA fragments encoding aa 133-158 of VP1 and aa 20-34 of VP4 of FMDV type Asia 1 were chemically synthesized and ligated into a tandem repeat of aa 133-158-20 approximately 34-133-158. In order to enhance its immunogenicity, the tandem repeat was inserted downstream of the beta-galactosidase gene in the expression vector pWR590. This insertion yielded a recombinant expression vector pAS1 encoding the fusion protein. The latter reacted with sera from FMDV type Asia 1-infected animals in vitro and elicited high levels of neutralizing antibodies in guinea pigs. The T cell proliferation in immunized animals increased following stimulation with the fusion protein. It is reported for the first time that a recombinant fusion protein vaccine was produced using B and T cell epitopes of FMDV type Asia 1 and that this fusion protein was immunogenic. The fusion protein reported here can serve as a candidate of fusion epitopes for design of a vaccine against FMDV type Asia 1.

  6. Inbreeding drives maize centromere evolution.

    PubMed

    Schneider, Kevin L; Xie, Zidian; Wolfgruber, Thomas K; Presting, Gernot G

    2016-02-23

    Functional centromeres, the chromosomal sites of spindle attachment during cell division, are marked epigenetically by the centromere-specific histone H3 variant cenH3 and typically contain long stretches of centromere-specific tandem DNA repeats (∼1.8 Mb in maize). In 23 inbreds of domesticated maize chosen to represent the genetic diversity of maize germplasm, partial or nearly complete loss of the tandem DNA repeat CentC precedes 57 independent cenH3 relocation events that result in neocentromere formation. Chromosomal regions with newly acquired cenH3 are colonized by the centromere-specific retrotransposon CR2 at a rate that would result in centromere-sized CR2 clusters in 20,000-95,000 y. Three lines of evidence indicate that CentC loss is linked to inbreeding, including (i) CEN10 of temperate lineages, presumed to have experienced a genetic bottleneck, contain less CentC than their tropical relatives; (ii) strong selection for centromere-linked genes in domesticated maize reduced diversity at seven of the ten maize centromeres to only one or two postdomestication haplotypes; and (iii) the centromere with the largest number of haplotypes in domesticated maize (CEN7) has the highest CentC levels in nearly all domesticated lines. Rare recombinations introduced one (CEN2) or more (CEN5) alternate CEN haplotypes while retaining a single haplotype at domestication loci linked to these centromeres. Taken together, this evidence strongly suggests that inbreeding, favored by postdomestication selection for centromere-linked genes affecting key domestication or agricultural traits, drives replacement of the tandem centromere repeats in maize and other crop plants. Similar forces may act during speciation in natural systems.

  7. Sensitivity of immune response quality to influenza helix 190 antigen structure displayed on a modular virus-like particle.

    PubMed

    Anggraeni, Melisa R; Connors, Natalie K; Wu, Yang; Chuan, Yap P; Lua, Linda H L; Middelberg, Anton P J

    2013-09-13

    Biomolecular engineering enables synthesis of improved proteins through synergistic fusion of modules from unrelated biomolecules. Modularization of peptide antigen from an unrelated pathogen for presentation on a modular virus-like particle (VLP) represents a new and promising approach to synthesize safe and efficacious vaccines. Addressing a key knowledge gap in modular VLP engineering, this study investigates the underlying fundamentals affecting the ability of induced antibodies to recognize the native pathogen. Specifically, this quality of immune response is correlated to the peptide antigen module structure. We modularized a helical peptide antigen element, helix 190 (H190) from the influenza hemagglutinin (HA) receptor binding region, for presentation on murine polyomavirus VLP, using two strategies aimed to promote H190 helicity on the VLP. In the first strategy, H190 was flanked by GCN4 structure-promoting elements within the antigen module; in the second, dual H190 copies were arrayed as tandem repeats in the module. Molecular dynamics simulation predicted that tandem repeat arraying would minimize secondary structural deviation of modularized H190 from its native conformation. In vivo testing supported this finding, showing that although both modularization strategies conferred high H190-specific immunogenicity, tandem repeat arraying of H190 led to a strikingly higher immune response quality, as measured by ability to generate antibodies recognizing a recombinant HA domain and split influenza virion. These findings provide new insights into the rational engineering of VLP vaccines, and could ultimately enable safe and efficacious vaccine design as an alternative to conventional approaches necessitating pathogen cultivation. Copyright © 2013 Elsevier Ltd. All rights reserved.

  8. Production of novel recombinant single-domain antibodies against tandem repeat region of MUC1 mucin.

    PubMed

    Rahbarizadeh, F; Rasaee, M J; Forouzandeh Moghadam, M; Allameh, A A; Sadroddiny, E

    2004-06-01

    Recently, the existence of "heavy-chain" antibody in Camelidae has been described. However, as yet there is no data on the binding of this type of antibody to peptides. In addition, there was not any report of production of single-domain antibodies in two-humped camels (Camelus bactrianus). In the present study, these questions are addressed. We showed the feasibility of immunizing old world camels, cloning the repertoire of the variable domain of their heavy-chain antibodies, panning and selection, leading to the successful identification of minimum-sized antigen binders. Antigen-specific fragments of the heavy-chain IgGs (V(HH)) are of great interest in biotechnology because they are very stable, highly soluble, and react specifically and with high affinity to the antigens. In this study, we immunized two camels (Camelus dromedarius and Camelus bactrianus) with homogenized cancerous tissues, synthetic peptide, and human milk fat globule membrane (HMFG), and generated two V(HH) libraries displayed on phage particles. Some single-domain antibody fragments have been isolated that specifically recognize the tandem repeat region of MUC1. The camels' single-domain V(HH) harbor the original, intact antigen binding site and reacted specifically and with high affinity to the tandem repeat region of MUC1. Indeed soluble, specific antigen binders and good affinities (in the range of 0.2 x 10(9) M(-1) to 0.6 x 10(9) M(-1)) were identified from these libraries. This is the first example of the isolation of camel anti-peptide V(HH) domains.

  9. New PAH gene promoter KLF1 and 3'-region C/EBPalpha motifs influence transcription in vitro.

    PubMed

    Klaassen, Kristel; Stankovic, Biljana; Kotur, Nikola; Djordjevic, Maja; Zukic, Branka; Nikcevic, Gordana; Ugrin, Milena; Spasovski, Vesna; Srzentic, Sanja; Pavlovic, Sonja; Stojiljkovic, Maja

    2017-02-01

    Phenylketonuria (PKU) is a metabolic disease caused by mutations in the phenylalanine hydroxylase (PAH) gene. Although the PAH genotype remains the main determinant of PKU phenotype severity, genotype-phenotype inconsistencies have been reported. In this study, we focused on unanalysed sequences in non-coding PAH gene regions to assess their possible influence on the PKU phenotype. We transiently transfected HepG2 cells with various chloramphenicol acetyl transferase (CAT) reporter constructs which included PAH gene non-coding regions. Selected non-coding regions were indicated by in silico prediction to contain transcription factor binding sites. Furthermore, electrophoretic mobility shift assay (EMSA) and supershift assays were performed to identify which transcriptional factors were engaged in the interaction. We found novel KLF1 motif in the PAH promoter, which decreases CAT activity by 50 % in comparison to basal transcription in vitro. The cytosine at the c.-170 promoter position creates an additional binding site for the protein complex involving KLF1 transcription factor. Moreover, we assessed for the first time the role of a multivariant variable number tandem repeat (VNTR) region located in the 3'-region of the PAH gene. We found that the VNTR3, VNTR7 and VNTR8 constructs had approximately 60 % of CAT activity. The regulation is mediated by the C/EBPalpha transcription factor, present in protein complex binding to VNTR3. Our study highlighted two novel promoter KLF1 and 3'-region C/EBPalpha motifs in the PAH gene which decrease transcription in vitro and, thus, could be considered as PAH expression modifiers. New transcription motifs in non-coding regions will contribute to better understanding of the PKU phenotype complexity and may become important for the optimisation of PKU treatment.

  10. The complete chloroplast genome sequence of Epipremnum aureum and its comparative analysis among eight Araceae species

    PubMed Central

    Han, Limin; Chen, Chen; Wang, Zhezhi

    2018-01-01

    Epipremnum aureum is an important foliage plant in the Araceae family. In this study, we have sequenced the complete chloroplast genome of E. aureum by using Illumina Hiseq sequencing platforms. This genome is a double-stranded circular DNA sequence of 164,831 bp that contains 35.8% GC. The two inverted repeats (IRa and IRb; 26,606 bp) are spaced by a small single-copy region (22,868 bp) and a large single-copy region (88,751 bp). The chloroplast genome has 131 (113 unique) functional genes, including 86 (79 unique) protein-coding genes, 37 (30 unique) tRNA genes, and eight (four unique) rRNA genes. Tandem repeats comprise the majority of the 43 long repetitive sequences. In addition, 111 simple sequence repeats are present, with mononucleotides being the most common type and di- and tetranucleotides being infrequent events. Positive selection pressure on rps12 in the E. aureum chloroplast has been demonstrated via synonymous and nonsynonymous substitution rates and selection pressure sites analyses. Ycf15 and infA are pseudogenes in this species. We constructed a Maximum Likelihood phylogenetic tree based on the complete chloroplast genomes of 38 species from 13 families. Those results strongly indicated that E. aureum is positioned as the sister of Colocasia esculenta within the Araceae family. This work may provide information for further study of the molecular phylogenetic relationships within Araceae, as well as molecular markers and breeding novel varieties by chloroplast genetic-transformation of E. aureum in particular. PMID:29529038

  11. Host adaptation of Chlamydia pecorum towards low virulence evident in co-evolution of the ompA, incA, and ORF663 Loci.

    PubMed

    Mohamad, Khalil Yousef; Kaltenboeck, Bernhard; Rahman, Kh Shamsur; Magnino, Simone; Sachse, Konrad; Rodolakis, Annie

    2014-01-01

    Chlamydia (C.) pecorum, an obligate intracellular bacterium, may cause severe diseases in ruminants, swine and koalas, although asymptomatic infections are the norm. Recently, we identified genetic polymorphisms in the ompA, incA and ORF663 genes that potentially differentiate between high-virulence C. pecorum isolates from diseased animals and low-virulence isolates from asymptomatic animals. Here, we expand these findings by including additional ruminant, swine, and koala strains. Coding tandem repeats (CTRs) at the incA locus encoded a variable number of repeats of APA or AGA amino acid motifs. Addition of any non-APA/AGA repeat motif, such as APEVPA, APAVPA, APE, or APAPE, associated with low virulence (P<10-4), as did a high number of amino acids in all incA CTRs (P = 0.0028). In ORF663, high numbers of 15-mer CTRs correlated with low virulence (P = 0.0001). Correction for ompA phylogram position in ORF663 and incA abolished the correlation between genetic changes and virulence, demonstrating co-evolution of ompA, incA, and ORF663 towards low virulence. Pairwise divergence of ompA, incA, and ORF663 among isolates from healthy animals was significantly higher than among strains isolated from diseased animals (P≤10-5), confirming the longer evolutionary path traversed by low-virulence strains. All three markers combined identified 43 unique strains and 4 pairs of identical strains among all 57 isolates tested, demonstrating the suitability of these markers for epidemiological investigations.

  12. Anaplasma ovis genetic diversity detected by major surface protein 1a and its prevalence in small ruminants.

    PubMed

    Aktas, Munir; Özübek, Sezayi

    2018-04-01

    Anaplasma ovis is a widely distributed tick-borne rickettsial pathogen of sheep, goats, and wild ruminants. The aims of this study were to assess the prevalence, associations of Anaplasma ovis in sheep and goats, as well as its genetic diversity based on analysis of the msp1α gene. A total of 416 DNA samples from sheep (n = 236) and goats (n = 180) from four provinces in southeastern Turkey were analyzed by PCR. The overall A. ovis prevalence was 18% (CI 14.4-22.1). The infection rates of A. ovis varied from 15.9% to 21.8% in sampled provinces, and they were not significantly different. There was no difference between Anaplasma ovis infection in sheep (20.3%, CI 15.4-26.0) and goats (15.0%, CI 10.1-21.1) or in infection rate of animals <1 year (21.8%, CI 14.9-30.1) compared to >1 year (16.4%, CI 12.4-21.2). A significant association between A. ovis infection and the presence of Rhipicephalus bursa and Rhipicephalus turanicus was observed (P < 0.05). Prevalence of A. ovis-positive animals was higher in animals showing co-infection with Babesia and Theileria compared to those not co-infected (P < 0.05). The Msp1a amino acid repeats were identified and used for the characterization of A. ovis strains. Forty partial msp1a gene sequences containing the repeated sequences of A. ovis were obtained, and 14 previously undescribed tandem repeats with 33 to 43 amino acids were found. Thirteen A. ovis genotypes were identified based on the structure of Msp1a tandem repeats. The majority of A. ovis isolates exhibited one Msp1a tandem repeat, with a maximum of three. This study revealed the Msp1a could be used as a marker for genotyping A. ovis, and high genetic diversity of A. ovis were found in small ruminants in Turkey. Copyright © 2018 Elsevier B.V. All rights reserved.

  13. Featured Article: Nuclear export of opioid growth factor receptor is CRM1 dependent.

    PubMed

    Kren, Nancy P; Zagon, Ian S; McLaughlin, Patricia J

    2016-02-01

    Opioid growth factor receptor (OGFr) facilitates growth inhibition in the presence of its specific ligand opioid growth factor (OGF), chemically termed [Met(5)]-enkephalin. The function of the OGF-OGFr axis requires the receptor to translocate to the nucleus. However, the mechanism of nuclear export of OGFr is unknown. In this study, endogenous OGFr, as well as exogenously expressed OGFr-EGFP, demonstrated significant nuclear accumulation in response to leptomycin B (LMB), an inhibitor of CRM1-dependent nuclear export, suggesting that OGFr is exported in a CRM1-dependent manner. One consensus sequence for a nuclear export signal (NES) was identified. Mutation of the associated leucines, L217 L220 L223 and L225, to alanine resulted in decreased nuclear accumulation. NES-EGFP responded to LMB, indicating that this sequence is capable of functioning as an export signal in isolation. To determine why the sequence functions differently in isolation than as a full length protein, the localization of subNES was evaluated in the presence and absence of MG132, a potent inhibitor of proteosomal degradation. MG132 had no effect of subNES localization. The role of tandem repeats located at the C-terminus of OGFr was examined for their role in nuclear trafficking. Six of seven tandem repeats were removed to form deltaTR. DeltaTR localized exclusively to the nucleus indicating that the tandem repeats may contribute to the localization of the receptor. Similar to the loss of cellular proliferation activity (i.e. inhibition) recorded with subNES, deltaTR also demonstrated a significant loss of inhibitory activity indicating that the repeats may be integral to receptor function. These experiments reveal that OGFr contains one functional NES, L217 L220 L223 and L225 and can be exported from the nucleus in a CRM1-dependent manner. © 2015 by the Society for Experimental Biology and Medicine.

  14. Genetic polymorphisms in 5-Fluorouracil-related enzymes predict pathologic response after neoadjuvant chemoradiation for rectal cancer.

    PubMed

    Nelson, Bailey; Carter, Jane V; Eichenberger, Maurice R; Netz, Uri; Galandiuk, Susan

    2016-11-01

    Many patients with rectal cancer undergo preoperative neoadjuvant chemoradiation, with approximately 70% exhibiting pathologic downstaging in response to treatment. Currently, there is no accurate test to predict patients who are likely to be complete responders to therapy. 5-Fluorouracil is used regularly in the neoadjuvant treatment of rectal cancer. Genetic polymorphisms affect the activity of thymidylate synthase, an enzyme involved in 5-Fluorouracil metabolism, which may account for observed differences in response to neoadjuvant treatment between patients. Detection of genetic polymorphisms might identify patients who are likely to have a complete response to neoadjuvant therapy and perhaps allow them to avoid operation. DNA was isolated from whole blood taken from patients with newly diagnosed rectal cancer who received neoadjuvant therapy (n = 50). Response to therapy was calculated with a tumor regression score based on histology from the time of operation. Polymerase chain reaction was performed targeting the promoter region of thymidylate synthase. Polymerase chain reaction products were separated using electrophoresis to determine whether patients were homozygous for a double-tandem repeat (2R), a triple-tandem repeat (3R), or were heterozygous (2R/3R). A single nucleotide polymorphism, 3G or 3C, also may be present in the second repeat unit of the triple-tandem repeat allele. Restriction fragment length polymorphism assays were performed in patients with at least one 3R allele using HaeIII. Patients with at least 1 thymidylate synthase 3G allele were more likely to have a complete or partial pathologic response to 5-Fluorouracil neoadjuvant therapy (odds ratio 10.4; 95% confidence interval, 1.3-81.6; P = .01) than those without at least one 3G allele. Identification of rectal cancer patients with specific genetic polymorphisms in enzymes involved in 5-Fluorouracil metabolism seems to predict the likelihood of complete or partial pathologic response to preoperative neoadjuvant therapy. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. A strategy of gene overexpression based on tandem repetitive promoters in Escherichia coli.

    PubMed

    Li, Mingji; Wang, Junshu; Geng, Yanping; Li, Yikui; Wang, Qian; Liang, Quanfeng; Qi, Qingsheng

    2012-02-06

    For metabolic engineering, many rate-limiting steps may exist in the pathways of accumulating the target metabolites. Increasing copy number of the desired genes in these pathways is a general method to solve the problem, for example, the employment of the multi-copy plasmid-based expression system. However, this method may bring genetic instability, structural instability and metabolic burden to the host, while integrating of the desired gene into the chromosome may cause inadequate transcription or expression. In this study, we developed a strategy for obtaining gene overexpression by engineering promoter clusters consisted of multiple core-tac-promoters (MCPtacs) in tandem. Through a uniquely designed in vitro assembling process, a series of promoter clusters were constructed. The transcription strength of these promoter clusters showed a stepwise enhancement with the increase of tandem repeats number until it reached the critical value of five. Application of the MCPtacs promoter clusters in polyhydroxybutyrate (PHB) production proved that it was efficient. Integration of the phaCAB genes with the 5CPtacs promoter cluster resulted in an engineered E.coli that can accumulate 23.7% PHB of the cell dry weight in batch cultivation. The transcription strength of the MCPtacs promoter cluster can be greatly improved by increasing the tandem repeats number of the core-tac-promoter. By integrating the desired gene together with the MCPtacs promoter cluster into the chromosome of E. coli, we can achieve high and stale overexpression with only a small size. This strategy has an application potential in many fields and can be extended to other bacteria.

  16. Identification and characterization of the highly polymorphic locus D14S739 in the Han Chinese population

    PubMed Central

    Shao, Chengchen; Zhang, Yaqi; Zhou, Yueqin; Zhu, Wei; Xu, Hongmei; Liu, Zhiping; Tang, Qiqun; Shen, Yiwen; Xie, Jianhui

    2015-01-01

    Aim To systemically select and evaluate short tandem repeats (STRs) on the chromosome 14 and obtain new STR loci as expanded genotyping markers for forensic application. Methods STRs on the chromosome 14 were filtered from Tandem Repeats Database and further selected based on their positions on the chromosome, repeat patterns of the core sequences, sequence homology of the flanking regions, and suitability of flanking regions in primer design. The STR locus with the highest heterozygosity and polymorphism information content (PIC) was selected for further analysis of genetic polymorphism, forensic parameters, and the core sequence. Results Among 26 STR loci selected as candidates, D14S739 had the highest heterozygosity (0.8691) and PIC (0.8432), and showed no deviation from the Hardy-Weinberg equilibrium. 14 alleles were observed, ranging in size from 21 to 34 tetranucleotide units in the core region of (GATA)9-18 (GACA)7-12 GACG (GACA)2 GATA. Paternity testing showed no mutations. Conclusion D14S739 is a highly informative STR locus and could be a suitable genetic marker for forensic applications in the Han Chinese population. PMID:26526885

  17. A tandem mirror plasma source for hybrid plume plasma studies

    NASA Technical Reports Server (NTRS)

    Yang, T. F.; Chang, F. R.; Miller, R. H.; Wenzel, K. W.; Krueger, W. A.

    1985-01-01

    A tandem mirror device to be considered as a hot plasma source for the hybrid plume rocket concept is discussed. The hot plamsa from this device is injected into an exhaust duct, which will interact with an annular hypersonic layer of neutral gas. The device can be used to study the dynamics of the hybrid plume, and to verify the numerical predictions obtained with computer codes. The basic system design is also geared towards low weight and compactness, and high power density at the exhaust. The basic structure of the device consists of four major subsystems: (1) an electric power supply; (2) a low temperature, high density plasma gun, such as a stream gun, an MPD source or gas cell; (3) a power booster in the form of a tandem mirror machine; and (4) an exhaust nozzle arrangement. The configuration of the tandem mirror section is shown.

  18. Long distance quantum communication with quantum Reed-Solomon codes

    NASA Astrophysics Data System (ADS)

    Muralidharan, Sreraman; Zou, Chang-Ling; Li, Linshu; Jiang, Liang; Jianggroup Team

    We study the construction of quantum Reed Solomon codes from classical Reed Solomon codes and show that they achieve the capacity of quantum erasure channel for multi-level quantum systems. We extend the application of quantum Reed Solomon codes to long distance quantum communication, investigate the local resource overhead needed for the functioning of one-way quantum repeaters with these codes, and numerically identify the parameter regime where these codes perform better than the known quantum polynomial codes and quantum parity codes . Finally, we discuss the implementation of these codes into time-bin photonic states of qubits and qudits respectively, and optimize the performance for one-way quantum repeaters.

  19. APE1 incision activity at abasic sites in tandem repeat sequences.

    PubMed

    Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M

    2014-05-29

    Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.

  20. PERF: an exhaustive algorithm for ultra-fast and efficient identification of microsatellites from large DNA sequences.

    PubMed

    Avvaru, Akshay Kumar; Sowpati, Divya Tej; Mishra, Rakesh Kumar

    2018-03-15

    Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations. We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis. PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license. tej@ccmb.res.in. Supplementary data are available at Bioinformatics online.

  1. Fission yeast RNA triphosphatase reads an Spt5 CTD code

    DOE PAGES

    Doamekpor, Selom K.; Schwer, Beate; Sanchez, Ana M.; ...

    2014-11-20

    mRNA capping enzymes are directed to nascent RNA polymerase II (Pol2) transcripts via interactions with the carboxy-terminal domains (CTDs) of Pol2 and transcription elongation factor Spt5. Fission yeast RNA triphosphatase binds to the Spt5 CTD, comprising a tandem repeat of nonapeptide motif TPAWNSGSK. Here we report the crystal structure of a Pct1·Spt5-CTD complex, which revealed two CTD docking sites on the Pct1 homodimer that engage TPAWN segments of the motif. Each Spt5 CTD interface, composed of elements from both subunits of the homodimer, is dominated by van der Waals contacts from Pct1 to the tryptophan of the CTD. The boundmore » CTD adopts a distinctive conformation in which the peptide backbone makes a tight U-turn so that the proline stacks over the tryptophan. We show that Pct1 binding to Spt5 CTD is antagonized by threonine phosphorylation. Our results fortify an emerging concept of an “Spt5 CTD code” in which (i) the Spt5 CTD is structurally plastic and can adopt different conformations that are templated by particular cellular Spt5 CTD receptor proteins; and (ii) threonine phosphorylation of the Spt5 CTD repeat inscribes a binary on–off switch that is read by diverse CTD receptors, each in its own distinctive manner.« less

  2. Fission yeast RNA triphosphatase reads an Spt5 CTD code

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Doamekpor, Selom K.; Schwer, Beate; Sanchez, Ana M.

    mRNA capping enzymes are directed to nascent RNA polymerase II (Pol2) transcripts via interactions with the carboxy-terminal domains (CTDs) of Pol2 and transcription elongation factor Spt5. Fission yeast RNA triphosphatase binds to the Spt5 CTD, comprising a tandem repeat of nonapeptide motif TPAWNSGSK. Here we report the crystal structure of a Pct1·Spt5-CTD complex, which revealed two CTD docking sites on the Pct1 homodimer that engage TPAWN segments of the motif. Each Spt5 CTD interface, composed of elements from both subunits of the homodimer, is dominated by van der Waals contacts from Pct1 to the tryptophan of the CTD. The boundmore » CTD adopts a distinctive conformation in which the peptide backbone makes a tight U-turn so that the proline stacks over the tryptophan. We show that Pct1 binding to Spt5 CTD is antagonized by threonine phosphorylation. Our results fortify an emerging concept of an “Spt5 CTD code” in which (i) the Spt5 CTD is structurally plastic and can adopt different conformations that are templated by particular cellular Spt5 CTD receptor proteins; and (ii) threonine phosphorylation of the Spt5 CTD repeat inscribes a binary on–off switch that is read by diverse CTD receptors, each in its own distinctive manner.« less

  3. Two transcription factors TaPpm1 and TaPpb1 co-regulate anthocyanin biosynthesis in purple pericarps of wheat

    PubMed Central

    Jiang, Wenhui; Liu, Tianxiang; Nan, Wenzhi; Jeewani, Diddugodage Chamila; Niu, Yanlu; Li, Chunlian; Shi, Xue; Wang, Cong; Wang, Jiahuan; Li, Yang; Wang, Zhonghua

    2018-01-01

    Abstract Purple pericarps of bread wheat (Triticum aestivum L.) are a useful source of dietary anthocyanins. Previous mapping results indicated that the purple pericarp trait is controlled by two complementary genes located on chromosomes 7D and 2A. However, the identity of the genes and the mechanisms by which they regulate the trait are unknown. In this study, two transcription factors were characterised as anthocyanin activators in purple pericarps: TaPpm1 (purple pericarp-MYB 1) and TaPpb1 (purple pericarp-bHLH 1). Three non-functional variants were detected in the coding sequence of TaPpm1 from non-purple seed lines, in which the function of TaPpm1 was destroyed either by insertion-induced frame shifts or truncated peptides. There were six 261-bp tandem repeats in the promoter region of TaPpb1 in the purple-grained varieties, while there was only one repeat unit present in the non-purple varieties. Furthermore, using yeast two-hybrid, dual luciferase, yeast one-hybrid, and transient assays, we were able to demonstrate that the interaction of TaPpm1 and TaPpb1 co-regulates the synthesis of anthocyanin. Overall, our results provide a better understanding of the molecular basis of anthocyanin synthesis in the wheat pericarp and indicate the existence of an integrated regulatory mechanism that controls production. PMID:29562292

  4. Detection of Large Pathogenic Expansions in FRDA1, SCA10, and SCA12 Genes Using a Simple Fluorescent Repeat-Primed PCR Assay

    PubMed Central

    Cagnoli, Claudia; Michielotto, Chiara; Matsuura, Tohru; Ashizawa, Tetsuo; Margolis, Russell L.; Holmes, Susan E.; Gellera, Cinzia; Migone, Nicola; Brusco, Alfredo

    2004-01-01

    At least 18 human genetic diseases are caused by expansion of short tandem repeats. Here we describe a successful application of a fluorescent PCR method for the detection of expanded repeats in FRDA1, SCA10, and SCA12 genes. Although this test cannot give a precise estimate of the size of the expansion, it is robust, reliable, and inexpensive, and can be used to screen large series of patients. It proved useful for confirming the presence of large expansions in the Friedreich ataxia gene following an ambiguous result of long-range PCR, as well as rapid pre-screening for large repeat expansions associated with Friedreich ataxia and SCA10 and the shorter repeat expansions associated with SCA12. PMID:15096564

  5. Diversity and Plasticity of the Intracellular Plant Pathogen and Insect Symbiont “Candidatus Liberibacter asiaticus” as Revealed by Hypervariable Prophage Genes with Intragenic Tandem Repeats ▿ †

    PubMed Central

    Zhou, Lijuan; Powell, Charles A.; Hoffman, Michele T.; Li, Wenbin; Fan, Guocheng; Liu, Bo; Lin, Hong; Duan, Yongping

    2011-01-01

    “Candidatus Liberibacter asiaticus” is a psyllid-transmitted, phloem-limited alphaproteobacterium and the most prevalent species of “Ca. Liberibacter” associated with a devastating worldwide citrus disease known as huanglongbing (HLB). Two related and hypervariable genes (hyvI and hyvII) were identified in the prophage regions of the Psy62 “Ca. Liberibacter asiaticus” genome. Sequence analyses of the hyvI and hyvII genes in 35 “Ca. Liberibacter asiaticus” DNA isolates collected globally revealed that the hyvI gene contains up to 12 nearly identical tandem repeats (NITRs, 132 bp) and 4 partial repeats, while hyvII contains up to 2 NITRs and 4 partial repeats and shares homology with hyvI. Frequent deletions or insertions of these repeats within the hyvI and hyvII genes were observed, none of which disrupted the open reading frames. Sequence conservation within the individual repeats but an extensive variation in repeat numbers, rearrangement, and the sequences flanking the repeat region indicate the diversity and plasticity of “Ca. Liberibacter asiaticus” bacterial populations in the world. These differences were found not only in samples of distinct geographical origins but also in samples from a single origin and even from a single “Ca. Liberibacter asiaticus”-infected sample. This is the first evidence of different “Ca. Liberibacter asiaticus” populations coexisting in a single HLB-affected sample. The Florida “Ca. Liberibacter asiaticus” isolates contain both hyvI and hyvII, while all other global “Ca. Liberibacter asiaticus” isolates contain either one or the other. Interclade assignments of the putative HyvI and HyvII proteins from Florida isolates with other global isolates in phylogenetic trees imply multiple “Ca. Liberibacter asiaticus” populations in the world and a multisource introduction of the “Ca. Liberibacter asiaticus” bacterium into Florida. PMID:21784907

  6. A variable number of tandem repeats in the 3'-untranslated region of the dopamine transporter modulates striatal function during working memory updating across the adult age span.

    PubMed

    Sambataro, Fabio; Podell, Jamie E; Murty, Vishnu P; Das, Saumitra; Kolachana, Bhaskar; Goldberg, Terry E; Weinberger, Daniel R; Mattay, Venkata S

    2015-08-01

    Dopamine modulation of striatal function is critical for executive functions such as working memory (WM) updating. The dopamine transporter (DAT) regulates striatal dopamine signaling via synaptic reuptake. A variable number of tandem repeats in the 3'-untranslated region of SLC6A3 (DAT1-3'-UTR-VNTR) is associated with DAT expression, such that 9-repeat allele carriers tend to express lower levels (associated with higher extracellular dopamine concentrations) than 10-repeat homozygotes. Aging is also associated with decline of the dopamine system. The goal of the present study was to investigate the effects of aging and DAT1-3'-UTR-VNTR on the neural activity and functional connectivity of the striatum during WM updating. Our results showed both an age-related decrease in striatal activity and an effect of DAT1-3'-UTR-VNTR. Ten-repeat homozygotes showed reduced striatal activity and increased striatal-hippocampal connectivity during WM updating relative to the 9-repeat carriers. There was no age by DAT1-3'-UTR-VNTR interaction. These results suggest that, whereas striatal function during WM updating is modulated by both age and genetically determined DAT levels, the rate of the age-related decline in striatal function is similar across both DAT1-3'-UTR-VNTR genotype groups. They further suggest that, because of the baseline difference in striatal function based on DAT1-3'-UTR-VNTR polymorphism, 10-repeat homozygotes, who have lower levels of striatal function throughout the adult life span, may reach a threshold of decreased striatal function and manifest impairments in cognitive processes mediated by the striatum earlier in life than the 9-repeat carriers. Our data suggest that age and DAT1-3'-UTR-VNTR polymorphism independently modulate striatal function. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.

  7. Effect of Repeat Copy Number on Variable-Number Tandem Repeat Mutations in Escherichia coli O157:H7

    PubMed Central

    Vogler, Amy J.; Keys, Christine; Nemoto, Yoshimi; Colman, Rebecca E.; Jay, Zack; Keim, Paul

    2006-01-01

    Variable-number tandem repeat (VNTR) loci have shown a remarkable ability to discriminate among isolates of the recently emerged clonal pathogen Escherichia coli O157:H7, making them a very useful molecular epidemiological tool. However, little is known about the rates at which these sequences mutate, the factors that affect mutation rates, or the mechanisms by which mutations occur at these loci. Here, we measure mutation rates for 28 VNTR loci and investigate the effects of repeat copy number and mismatch repair on mutation rate using in vitro-generated populations for 10 E. coli O157:H7 strains. We find single-locus rates as high as 7.0 × 10−4 mutations/generation and a combined 28-locus rate of 6.4 × 10−4 mutations/generation. We observed single- and multirepeat mutations that were consistent with a slipped-strand mispairing mutation model, as well as a smaller number of large repeat copy number mutations that were consistent with recombination-mediated events. Repeat copy number within an array was strongly correlated with mutation rate both at the most mutable locus, O157-10 (r2 = 0.565, P = 0.0196), and across all mutating loci. The combined locus model was significant whether locus O157-10 was included (r2 = 0.833, P < 0.0001) or excluded (r2 = 0.452, P < 0.0001) from the analysis. Deficient mismatch repair did not affect mutation rate at any of the 28 VNTRs with repeat unit sizes of >5 bp, although a poly(G) homomeric tract was destabilized in the mutS strain. Finally, we describe a general model for VNTR mutations that encompasses insertions and deletions, single- and multiple-repeat mutations, and their relative frequencies based upon our empirical mutation rate data. PMID:16740932

  8. Optimization of sequence alignment for simple sequence repeat regions.

    PubMed

    Jighly, Abdulqader; Hamwieh, Aladdin; Ogbonnaya, Francis C

    2011-07-20

    Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs).SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type.When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.

  9. Using a Tandem Pelletron accelerator to produce a thermal neutron beam for detector testing purposes.

    PubMed

    Irazola, L; Praena, J; Fernández, B; Macías, M; Bedogni, R; Terrón, J A; Sánchez-Nieto, B; Arias de Saavedra, F; Porras, I; Sánchez-Doblado, F

    2016-01-01

    Active thermal neutron detectors are used in a wide range of measuring devices in medicine, industry and research. For many applications, the long-term stability of these devices is crucial, so that very well controlled neutron fields are needed to perform calibrations and repeatability tests. A way to achieve such reference neutron fields, relying on a 3 MV Tandem Pelletron accelerator available at the CNA (Seville, Spain), is reported here. This paper shows thermal neutron field production and reproducibility characteristics over few days. Copyright © 2015 Elsevier Ltd. All rights reserved.

  10. Inbreeding drives maize centromere evolution

    PubMed Central

    Schneider, Kevin L.; Xie, Zidian; Wolfgruber, Thomas K.; Presting, Gernot G.

    2016-01-01

    Functional centromeres, the chromosomal sites of spindle attachment during cell division, are marked epigenetically by the centromere-specific histone H3 variant cenH3 and typically contain long stretches of centromere-specific tandem DNA repeats (∼1.8 Mb in maize). In 23 inbreds of domesticated maize chosen to represent the genetic diversity of maize germplasm, partial or nearly complete loss of the tandem DNA repeat CentC precedes 57 independent cenH3 relocation events that result in neocentromere formation. Chromosomal regions with newly acquired cenH3 are colonized by the centromere-specific retrotransposon CR2 at a rate that would result in centromere-sized CR2 clusters in 20,000–95,000 y. Three lines of evidence indicate that CentC loss is linked to inbreeding, including (i) CEN10 of temperate lineages, presumed to have experienced a genetic bottleneck, contain less CentC than their tropical relatives; (ii) strong selection for centromere-linked genes in domesticated maize reduced diversity at seven of the ten maize centromeres to only one or two postdomestication haplotypes; and (iii) the centromere with the largest number of haplotypes in domesticated maize (CEN7) has the highest CentC levels in nearly all domesticated lines. Rare recombinations introduced one (CEN2) or more (CEN5) alternate CEN haplotypes while retaining a single haplotype at domestication loci linked to these centromeres. Taken together, this evidence strongly suggests that inbreeding, favored by postdomestication selection for centromere-linked genes affecting key domestication or agricultural traits, drives replacement of the tandem centromere repeats in maize and other crop plants. Similar forces may act during speciation in natural systems. PMID:26858403

  11. The cotton centromere contains a Ty3-gypsy-like LTR retroelement.

    PubMed

    Luo, Song; Mach, Jennifer; Abramson, Bradley; Ramirez, Rolando; Schurr, Robert; Barone, Pierluigi; Copenhaver, Gregory; Folkerts, Otto

    2012-01-01

    The centromere is a repeat-rich structure essential for chromosome segregation; with the long-term aim of understanding centromere structure and function, we set out to identify cotton centromere sequences. To isolate centromere-associated sequences from cotton, (Gossypium hirsutum) we surveyed tandem and dispersed repetitive DNA in the genus. Centromere-associated elements in other plants include tandem repeats and, in some cases, centromere-specific retroelements. Examination of cotton genomic survey sequences for tandem repeats yielded sequences that did not localize to the centromere. However, among the repetitive sequences we also identified a gypsy-like LTR retrotransposon (Centromere Retroelement Gossypium, CRG) that localizes to the centromere region of all chromosomes in domestic upland cotton, Gossypium hirsutum, the major commercially grown cotton. The location of the functional centromere was confirmed by immunostaining with antiserum to the centromere-specific histone CENH3, which co-localizes with CRG hybridization on metaphase mitotic chromosomes. G. hirsutum is an allotetraploid composed of A and D genomes and CRG is also present in the centromere regions of other AD cotton species. Furthermore, FISH and genomic dot blot hybridization revealed that CRG is found in D-genome diploid cotton species, but not in A-genome diploid species, indicating that this retroelement may have invaded the A-genome centromeres during allopolyploid formation and amplified during evolutionary history. CRG is also found in other diploid Gossypium species, including B and E2 genome species, but not in the C, E1, F, and G genome species tested. Isolation of this centromere-specific retrotransposon from Gossypium provides a probe for further understanding of centromere structure, and a tool for future engineering of centromere mini-chromosomes in this important crop species.

  12. Resurgence of Pertussis and Emergence of the Ptxp3 Toxin Promoter Allele in South Italy.

    PubMed

    Loconsole, Daniela; De Robertis, Anna Lisa; Morea, Anna; Metallo, Angela; Lopalco, Pier Luigi; Chironna, Maria

    2018-05-01

    Despite universal immunization programs, pertussis remains a major public health concern. This study aimed to describe the pertussis epidemiology in the Puglia region in 2006-2015 and to identify recent polymorphisms in Bordetella pertussis virulence-associated genes. The pertussis cases in 2006-2015 were identified from the National Hospital Discharge Database and the Information System of Infectious Diseases. Samples of pertussis cases in 2014-2016 that were confirmed by the Regional Reference Laboratory were subjected to ptxA, ptxP and prn gene sequencing and, in 10 cases, multiple-locus variable-number tandem repeat analysis. In Puglia in 2006-2015, the pertussis incidence rose from an average of 1.39/100,000 inhabitants in 2006-2013 to 2.56-2.54/100,000 in 2014-2015. In infants <1 year of age, the incidence rose from an average of 60.4/100,000 infants in 2006-2013 to 149.9/100,000 in 2015. Of the 661 cases recorded in 2006-2015, 80.3% required hospitalization; of these, 45.4% were <1 year of age. Of the 80 sequenced samples, the allelic profile ptxA1-ptxP3-prn2 was detected in 74. This variant was detected in both vaccinated and unvaccinated people. Six Bordetella pertussis samples were prn deficient. The multiple-locus variable-number tandem repeat analysis cases exhibited multiple-locus variable-number tandem repeat analysis-type 27. The pertussis incidence in Puglia has risen. The hypervirulent strain was also found in vaccinated people. This suggests bacterial adaptation to the vaccine and raises questions about acellular vaccine effectiveness. Prevention of infant pertussis cases is best achieved by immunizing the pregnant mother. Enhanced surveillance and systematic laboratory confirmation of pertussis should be improved in Italy.

  13. The Cotton Centromere Contains a Ty3-gypsy-like LTR Retroelement

    PubMed Central

    Luo, Song; Mach, Jennifer; Abramson, Bradley; Ramirez, Rolando; Schurr, Robert; Barone, Pierluigi; Copenhaver, Gregory; Folkerts, Otto

    2012-01-01

    The centromere is a repeat-rich structure essential for chromosome segregation; with the long-term aim of understanding centromere structure and function, we set out to identify cotton centromere sequences. To isolate centromere-associated sequences from cotton, (Gossypium hirsutum) we surveyed tandem and dispersed repetitive DNA in the genus. Centromere-associated elements in other plants include tandem repeats and, in some cases, centromere-specific retroelements. Examination of cotton genomic survey sequences for tandem repeats yielded sequences that did not localize to the centromere. However, among the repetitive sequences we also identified a gypsy-like LTR retrotransposon (Centromere Retroelement Gossypium, CRG) that localizes to the centromere region of all chromosomes in domestic upland cotton, Gossypium hirsutum, the major commercially grown cotton. The location of the functional centromere was confirmed by immunostaining with antiserum to the centromere-specific histone CENH3, which co-localizes with CRG hybridization on metaphase mitotic chromosomes. G. hirsutum is an allotetraploid composed of A and D genomes and CRG is also present in the centromere regions of other AD cotton species. Furthermore, FISH and genomic dot blot hybridization revealed that CRG is found in D-genome diploid cotton species, but not in A-genome diploid species, indicating that this retroelement may have invaded the A-genome centromeres during allopolyploid formation and amplified during evolutionary history. CRG is also found in other diploid Gossypium species, including B and E2 genome species, but not in the C, E1, F, and G genome species tested. Isolation of this centromere-specific retrotransposon from Gossypium provides a probe for further understanding of centromere structure, and a tool for future engineering of centromere mini-chromosomes in this important crop species. PMID:22536361

  14. Production of monoclonal antibodies recognising the peptide core of MUC2 intestinal mucin.

    PubMed

    Durrant, L G; Jacobs, E; Price, M R

    1994-01-01

    A peptide based on the tandem repeat sequence of MUC2 mucin was used to produce a series of monoclonal antibodies (MAb). The fine specificity of these antibodies and their implications for MUC2 expression are presented. Three of the MAbs, 996/1, 996/7 and 995/25, were specific to the MUC2p and failed to bind to peptides based on the MUC1,3,4 tandem repeat sequences whereas three others, 994/152, 994/91 and 996/36, cross reacted with the MUC2p and the MUC3 tandem repeat peptide but not the MUC1 and MUC4 peptides. An antigen, affinity purified from a colorectal tumour on one of the MUC2p-specific MAbs, 996/1, was shown to be a high molecular weight polydisperse, mucin-like antigen. Two of the MAbs, 996/1 and 994/152, recognised MUC2 in tissue sections, although the fine specificity varied between the two MAbs, with 994/152 strongly staining gastric, ileum and kidney epithelia, and MAb 996/1 intensely staining colon, liver and prostate tissues. These antibodies also stained a colorectal cell line, and MAb 994/152 also stained a gastric and an ovarian cell line. Six of the MAbs were used to stain colorectal tumour and adjacent 'normal' colonic mucosa sections. All six stained normal mucosa, but only two of the MAbs, 996/1 and 994/91, stained tumour tissue. The staining probably reflects exposure of cryptic epitopes due to varying levels of glycosylation in different tissues. These anti-MUC2p MAbs may help in determining the normal role of MUC2 mucin and how it is subverted in malignancy.

  15. Nbs1 ChIP-Seq Identifies Off-Target DNA Double-Strand Breaks Induced by AID in Activated Splenic B Cells

    PubMed Central

    Linehan, Erin K.; Schrader, Carol E.; Stavnezer, Janet

    2015-01-01

    Activation-induced cytidine deaminase (AID) is required for initiation of Ig class switch recombination (CSR) and somatic hypermutation (SHM) of antibody genes during immune responses. AID has also been shown to induce chromosomal translocations, mutations, and DNA double-strand breaks (DSBs) involving non-Ig genes in activated B cells. To determine what makes a DNA site a target for AID-induced DSBs, we identify off-target DSBs induced by AID by performing chromatin immunoprecipitation (ChIP) for Nbs1, a protein that binds DSBs, followed by deep sequencing (ChIP-Seq). We detect and characterize hundreds of off-target AID-dependent DSBs. Two types of tandem repeats are highly enriched within the Nbs1-binding sites: long CA repeats, which can form Z-DNA, and tandem pentamers containing the AID target hotspot WGCW. These tandem repeats are not nearly as enriched at AID-independent DSBs, which we also identified. Msh2, a component of the mismatch repair pathway and important for genome stability, increases off-target DSBs, similar to its effect on Ig switch region DSBs, which are required intermediates during CSR. Most of the off-target DSBs are two-ended, consistent with generation during G1 phase, similar to DSBs in Ig switch regions. However, a minority are one-ended, presumably due to conversion of single-strand breaks to DSBs during replication. One-ended DSBs are repaired by processes involving homologous recombination, including break-induced replication repair, which can lead to genome instability. Off-target DSBs, especially those present during S phase, can lead to chromosomal translocations, deletions and gene amplifications, resulting in the high frequency of B cell lymphomas derived from cells that express or have expressed AID. PMID:26263206

  16. [Standard algorithm of molecular typing of Yersinia pestis strains].

    PubMed

    Eroshenko, G A; Odinokov, G N; Kukleva, L M; Pavlova, A I; Krasnov, Ia M; Shavina, N Iu; Guseva, N P; Vinogradova, N A; Kutyrev, V V

    2012-01-01

    Development of the standard algorithm of molecular typing of Yersinia pestis that ensures establishing of subspecies, biovar and focus membership of the studied isolate. Determination of the characteristic strain genotypes of plague infectious agent of main and nonmain subspecies from various natural foci of plague of the Russian Federation and the near abroad. Genotyping of 192 natural Y. pestis strains of main and nonmain subspecies was performed by using PCR methods, multilocus sequencing and multilocus analysis of variable tandem repeat number. A standard algorithm of molecular typing of plague infectious agent including several stages of Yersinia pestis differentiation by membership: in main and nonmain subspecies, various biovars of the main subspecies, specific subspecies; natural foci and geographic territories was developed. The algorithm is based on 3 typing methods--PCR, multilocus sequence typing and multilocus analysis of variable tandem repeat number using standard DNA targets--life support genes (terC, ilvN, inv, glpD, napA, rhaS and araC) and 7 loci of variable tandem repeats (ms01, ms04, ms06, ms07, ms46, ms62, ms70). The effectiveness of the developed algorithm is shown on the large number of natural Y. pestis strains. Characteristic sequence types of Y. pestis strains of various subspecies and biovars as well as MLVA7 genotypes of strains from natural foci of plague of the Russian Federation and the near abroad were established. The application of the developed algorithm will increase the effectiveness of epidemiologic monitoring of plague infectious agent, and analysis of epidemics and outbreaks of plague with establishing the source of origin of the strain and routes of introduction of the infection.

  17. Tandem repeats analysis for the high resolution phylogenetic analysis of Yersinia pestis

    PubMed Central

    Pourcel, C; André-Mazeaud, F; Neubauer, H; Ramisse, F; Vergnaud, G

    2004-01-01

    Background Yersinia pestis, the agent of plague, is a young and highly monomorphic species. Three biovars, each one thought to be associated with the last three Y. pestis pandemics, have been defined based on biochemical assays. More recently, DNA based assays, including DNA sequencing, IS typing, DNA arrays, have significantly improved current knowledge on the origin and phylogenetic evolution of Y. pestis. However, these methods suffer either from a lack of resolution or from the difficulty to compare data. Variable number of tandem repeats (VNTRs) provides valuable polymorphic markers for genotyping and performing phylogenetic analyses in a growing number of pathogens and have given promising results for Y. pestis as well. Results In this study we have genotyped 180 Y. pestis isolates by multiple locus VNTR analysis (MLVA) using 25 markers. Sixty-one different genotypes were observed. The three biovars were distributed into three main branches, with some exceptions. In particular, the Medievalis phenotype is clearly heterogeneous, resulting from different mutation events in the napA gene. Antiqua strains from Asia appear to hold a central position compared to Antiqua strains from Africa. A subset of 7 markers is proposed for the quick comparison of a new strain with the collection typed here. This can be easily achieved using a Web-based facility, specifically set-up for running such identifications. Conclusion Tandem-repeat typing may prove to be a powerful complement to the existing phylogenetic tools for Y. pestis. Typing can be achieved quickly at a low cost in terms of consumables, technical expertise and equipment. The resulting data can be easily compared between different laboratories. The number and selection of markers will eventually depend upon the type and aim of investigations. PMID:15186506

  18. Subtyping of a Large Collection of Historical Listeria monocytogenes Strains from Ontario, Canada, by an Improved Multilocus Variable-Number Tandem-Repeat Analysis (MLVA)

    PubMed Central

    Saleh-Lakha, S.; Allen, V. G.; Li, J.; Pagotto, F.; Odumeru, J.; Taboada, E.; Lombos, M.; Tabing, K. C.; Blais, B.; Ogunremi, D.; Downing, G.; Lee, S.; Gao, A.; Nadon, C.

    2013-01-01

    Listeria monocytogenes is responsible for severe and often fatal food-borne infections in humans. A collection of 2,421 L. monocytogenes isolates originating from Ontario's food chain between 1993 and 2010, along with Ontario clinical isolates collected from 2004 to 2010, was characterized using an improved multilocus variable-number tandem-repeat analysis (MLVA). The MLVA method was established based on eight primer pairs targeting seven variable-number tandem-repeat (VNTR) loci in two 4-plex fluorescent PCRs. Diversity indices and amplification rates of the individual VNTR loci ranged from 0.38 to 0.92 and from 0.64 to 0.99, respectively. MLVA types and pulsed-field gel electrophoresis (PFGE) patterns were compared using Comparative Partitions analysis involving 336 clinical and 99 food and environmental isolates. The analysis yielded Simpson's diversity index values of 0.998 and 0.992 for MLVA and PFGE, respectively, and adjusted Wallace coefficients of 0.318 when MLVA was used as a primary subtyping method and 0.088 when PFGE was a primary typing method. Statistical data analysis using BioNumerics allowed for identification of at least 8 predominant and persistent L. monocytogenes MLVA types in Ontario's food chain. The MLVA method correctly clustered epidemiologically related outbreak strains and separated unrelated strains in a subset analysis. An MLVA database was established for the 2,421 L. monocytogenes isolates, which allows for comparison of data among historical and new isolates of different sources. The subtyping method coupled with the MLVA database will help in effective monitoring/prevention approaches to identify environmental contamination by pathogenic strains of L. monocytogenes and investigation of outbreaks. PMID:23956391

  19. Multiple-locus variable-number tandem repeat analysis for strain discrimination of non-O157 Shiga toxin-producing Escherichia coli.

    PubMed

    Timmons, Chris; Trees, Eija; Ribot, Efrain M; Gerner-Smidt, Peter; LaFon, Patti; Im, Sung; Ma, Li Maria

    2016-06-01

    Non-O157 Shiga toxin-producing Escherichia coli (STEC) are foodborne pathogens of growing concern worldwide that have been associated with several recent multistate and multinational outbreaks of foodborne illness. Rapid and sensitive molecular-based bacterial strain discrimination methods are critical for timely outbreak identification and contaminated food source traceback. One such method, multiple-locus variable-number tandem repeat analysis (MLVA), is being used with increasing frequency in foodborne illness outbreak investigations to augment the current gold standard bacterial subtyping technique, pulsed-field gel electrophoresis (PFGE). The objective of this study was to develop a MLVA assay for intra- and inter-serogroup discrimination of six major non-O157 STEC serogroups-O26, O111, O103, O121, O45, and O145-and perform a preliminary internal validation of the method on a limited number of clinical isolates. The resultant MLVA scheme consists of ten variable number tandem repeat (VNTR) loci amplified in three multiplex PCR reactions. Sixty-five unique MLVA types were obtained among 84 clinical non-O157 STEC strains comprised of geographically diverse sporadic and outbreak related isolates. Compared to PFGE, the developed MLVA scheme allowed similar discrimination among serogroups O26, O111, O103, and O121 but not among O145 and O45. To more fully compare the discriminatory power of this preliminary MLVA method to PFGE and to determine its epidemiological congruence, a thorough internal and external validation needs to be performed on a carefully selected large panel of strains, including multiple isolates from single outbreaks. Copyright © 2016. Published by Elsevier B.V.

  20. Barrier function and natural moisturizing factor levels after cumulative exposure to a fruit-derived organic acid and a detergent: different outcomes in atopic and healthy skin and relevance for occupational contact dermatitis in the food industry.

    PubMed

    Angelova-Fischer, Irena; Hoek, Anne-Karin; Dapic, Irena; Jakasa, Ivone; Kezic, Sanja; Fischer, Tobias W; Zillikens, Detlef

    2015-12-01

    Fruit-derived organic compounds and detergents are relevant exposure factors for occupational contact dermatitis in the food industry. Although individuals with atopic dermatitis (AD) are at risk for development of occupational contact dermatitis, there have been no controlled studies on the effects of repeated exposure to multiple irritants, relevant for the food industry, in atopic skin. The aim of the study was to investigate the outcomes of repeated exposure to a fruit-derived organic acid and a detergent in AD compared to healthy volunteers. The volunteers were exposed to 2.0% acetic acid (AcA) and/or 0.5% sodium lauryl sulfate (SLS) in controlled tandem repeated irritation test. The outcomes were assessed by measurements of erythema, transepidermal water loss (TEWL) and natural moisturizing factor (NMF) levels. In the AD volunteers, repeated AcA exposure led to barrier disruption and significant TEWL increase; no significant differences after the same exposure in the healthy controls were found. Repeated exposure to SLS and the irritant tandems enhanced the reactions and resulted in a significantly higher increase in TEWL in the AD compared to the control group. Cumulative irritant exposure reduced the NMF levels in both groups. Differences in the severity of irritant-induced barrier impairment in atopic individuals contribute to the risk for occupational contact dermatitis in result of multiple exposures to food-derived irritants and detergents. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  1. GenomeLandscaper: Landscape analysis of genome-fingerprints maps assessing chromosome architecture.

    PubMed

    Ai, Hannan; Ai, Yuncan; Meng, Fanmei

    2018-01-18

    Assessing correctness of an assembled chromosome architecture is a central challenge. We create a geometric analysis method (called GenomeLandscaper) to conduct landscape analysis of genome-fingerprints maps (GFM), trace large-scale repetitive regions, and assess their impacts on the global architectures of assembled chromosomes. We develop an alignment-free method for phylogenetics analysis. The human Y chromosomes (GRCh.chrY, HuRef.chrY and YH.chrY) are analysed as a proof-of-concept study. We construct a galaxy of genome-fingerprints maps (GGFM) for them, and a landscape compatibility among relatives is observed. But a long sharp straight line on the GGFM breaks such a landscape compatibility, distinguishing GRCh38p1.chrY (and throughout GRCh38p7.chrY) from GRCh37p13.chrY, HuRef.chrY and YH.chrY. We delete a 1.30-Mbp target segment to rescue the landscape compatibility, matching the antecedent GRCh37p13.chrY. We re-locate it into the modelled centromeric and pericentromeric region of GRCh38p10.chrY, matching a gap placeholder of GRCh37p13.chrY. We decompose it into sub-constituents (such as BACs, interspersed repeats, and tandem repeats) and trace their homologues by phylogenetics analysis. We elucidate that most examined tandem repeats are of reasonable quality, but the BAC-sized repeats, 173U1020C (176.46 Kbp) and 5U41068C (205.34 Kbp), are likely over-repeated. These results offer unique insights into the centromeric and pericentromeric regions of the human Y chromosomes.

  2. Plant centromeres.

    PubMed

    Lamb, J C; Yu, W; Han, F; Birchler, J A

    2008-01-01

    Plant centromeres are generally composed of tandem arrays of simple repeats that are typical of a particular species, but that evolve rapidly. Centromere specific retroelements are also present. These arrays associate with a centromere specific variant of histone H3 that anchors the site of the kinetochore. Although such DNA arrays are typical of the centromere, the specification of centromere activity has an epigenetic component as shown by the fact that centromeres are formed in the absence of such repeats and that centromeres in dicentric chromosomes regularly undergo inactivation.

  3. First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications

    PubMed Central

    Chen, Zhi-Teng; Du, Yu-Zhou

    2017-01-01

    The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer (AGN), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae. PMID:28475163

  4. First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications.

    PubMed

    Chen, Zhi-Teng; Du, Yu-Zhou

    2017-05-05

    The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer ( AGN ), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae.

  5. Is Tobacco Smoke a Germ-Cell Mutagen?

    EPA Science Inventory

    Although no international organization exists to declare whether an agent is a germ-cell mutagen, tobacco smoke may be a human germ-cell mutagen. In the mouse, tobacco smoke induces a significant increase in the mutation frequency at an expanded simple tandem repeat (ESTR) locus....

  6. A Large Population Genetic Study of 15 Autosomal Short Tandem Repeat Loci for Establishment of Korean DNA Profile Database

    PubMed Central

    Yoo, Seong Yeon; Cho, Nam Soo; Park, Myung Jin; Seong, Ki Min; Hwang, Jung Ho; Song, Seok Bean; Han, Myun Soo; Lee, Won Tae; Chung, Ki Wha

    2011-01-01

    Genotyping of highly polymorphic short tandem repeat (STR) markers is widely used for the genetic identification of individuals in forensic DNA analyses and in paternity disputes. The National DNA Profile Databank recently established by the DNA Identification Act in Korea contains the computerized STR DNA profiles of individuals convicted of crimes. For the establishment of a large autosomal STR loci population database, 1805 samples were obtained at random from Korean individuals and 15 autosomal STR markers were analyzed using the AmpFlSTR Identifiler PCR Amplification kit. For the 15 autosomal STR markers, no deviations from the Hardy-Weinberg equilibrium were observed. The most informative locus in our data set was the D2S1338 with a discrimination power of 0.9699. The combined matching probability was 1.521 × 10-17. This large STR profile dataset including atypical alleles will be important for the establishment of the Korean DNA database and for forensic applications. PMID:21597912

  7. A large-scale dataset of single and mixed-source short tandem repeat profiles to inform human identification strategies: PROVEDIt.

    PubMed

    Alfonse, Lauren E; Garrett, Amanda D; Lun, Desmond S; Duffy, Ken R; Grgicak, Catherine M

    2018-01-01

    DNA-based human identity testing is conducted by comparison of PCR-amplified polymorphic Short Tandem Repeat (STR) motifs from a known source with the STR profiles obtained from uncertain sources. Samples such as those found at crime scenes often result in signal that is a composite of incomplete STR profiles from an unknown number of unknown contributors, making interpretation an arduous task. To facilitate advancement in STR interpretation challenges we provide over 25,000 multiplex STR profiles produced from one to five known individuals at target levels ranging from one to 160 copies of DNA. The data, generated under 144 laboratory conditions, are classified by total copy number and contributor proportions. For the 70% of samples that were synthetically compromised, we report the level of DNA damage using quantitative and end-point PCR. In addition, we characterize the complexity of the signal by exploring the number of detected alleles in each profile. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. X-chromosome STR markers data in a Cabo Verde immigrant population of Lisboa.

    PubMed

    Afonso Costa, Heloísa; Morais, Paulo; Vieira da Silva, Cláudia; Matos, Sara; Marques Santos, Rodolfo; Espinheira, Rosa; Costa Santos, Jorge; Amorim, António

    2014-01-01

    Population genetic data of 12 X chromosomal short tandem repeats markers (DXS10074, DXS10079, DXS10101, DXS10103, DXS10134, DXS10135, DXS10146, DXS10148, DXS7132, DXS7423, DXS8378 and HPRTB) were analysed in 54 females and 95 males of an immigrant population from Cabo Verde living in Lisboa. The obtained results for forensic statistical parameters such as observed heterozigosity, polymorphism information content, power of discrimination and mean exclusion chance, based on single allele frequencies, reveal that this multiplex system is highly informative and can represent an important tool for genetic identification purposes in the immigrant population of Cabo Verde. Since the studied short tandem repeats genetic markers are distributed on four linkage groups, that can provide independent genotype information, we studied those groups as haploytes. The forensic efficiency parameters for the linked groups were all higher than 0.97, with linkage group I being the most polymorphic and linkage group III the less informative.

  9. MiniX-STR multiplex system population study in Japan and application to degraded DNA analysis.

    PubMed

    Asamura, H; Sakai, H; Kobayashi, K; Ota, M; Fukushima, H

    2006-05-01

    We sought to evaluate a more effective system for analyzing X-chromosomal short tandem repeats (X-STRs) in highly degraded DNA. To generate smaller amplicon lengths, we designed new polymerase chain reaction (PCR) primers for DXS7423, DXS6789, DXS101, GATA31E08, DXS8378, DXS7133, DXS7424, and GATA165B12 at X-linked short tandem repeat (STR) loci, devising two miniX-multiplex PCR systems. Among 333 Japanese individuals, these X-linked loci were detected in amplification products ranging in length from 76 to 169 bp, and statistical analyses of the eight loci indicated a high usefulness for the Japanese forensic practice. Results of tests on highly degraded DNA indicated the miniX-STR multiplex strategies to be an effective system for analyzing degraded DNA. We conclude that analysis by the current miniX-STR multiplex systems offers high effectiveness for personal identification from degraded DNA samples.

  10. Comparison of expression of monomeric and multimeric adenoregulin genes in Escherichia coli and Pichia pastorias.

    PubMed

    Zhou, Yuxun; Cao, Wei; Wang, Jinzhi; Ma, Yushu; Wei, Dongzhi

    2005-05-01

    Adenoregulin is a 33 amino acid antibiotic peptide who belongs to dermaseptin family which is the first vertebrate family to show lethal effects against filamentous fungi, as well as a broad spectrum of pathogenic microorganisms. Synthetic adenoregulin gene was cloned in 2, 4 and 6 tandem repeats and subcloned in pET32a and pET22b vectors. Recombinant plasmids were transformed into E. coli BL21(DE3), Fusion proteins of Trx-ADR1, Trx-ADR2 and Trx-ADR4 could be expressed after the hosts were induced by IPTG, but the expression level decreased dramatically with the number of tandem repeats increased. ADR1, ADR4 and ADR6 could not be expressed by E. coli without carrier proteins. But for Pichia pastoris GS115, ADR1 and ADR6 in the fermentation broth of the hosts could be detected by ELISA, and the bactericidal activities could also be observed.

  11. NIST mixed stain study 3: signal intensity balance in commercial short tandem repeat multiplexes.

    PubMed

    Duewer, David L; Kline, Margaret C; Redman, Janette W; Butler, John M

    2004-12-01

    Short-tandem repeat (STR) allelic intensities were collected from more than 60 forensic laboratories for a suite of seven samples as part of the National Institute of Standards and Technology-coordinated 2001 Mixed Stain Study 3 (MSS3). These interlaboratory challenge data illuminate the relative importance of intrinsic and user-determined factors affecting the locus-to-locus balance of signal intensities for currently used STR multiplexes. To varying degrees, seven of the eight commercially produced multiplexes used by MSS3 participants displayed very similar patterns of intensity differences among the different loci probed by the multiplexes for all samples, in the hands of multiple analysts, with a variety of supplies and instruments. These systematic differences reflect intrinsic properties of the individual multiplexes, not user-controllable measurement practices. To the extent that quality systems specify minimum and maximum absolute intensities for data acceptability and data interpretation schema require among-locus balance, these intrinsic intensity differences may decrease the utility of multiplex results and surely increase the cost of analysis.

  12. Highly Discriminatory Variable-Number Tandem-Repeat Markers for Genotyping of Trichophyton interdigitale Strains

    PubMed Central

    Drira, Ines; Hadrich, Ines; Neji, Sourour; Mahfouth, Nedia; Trabelsi, Houaida; Sellami, Hayet; Makni, Fattouma

    2014-01-01

    Trichophyton interdigitale is the second most frequent cause of superficial fungal infections of various parts of the human body. Studying the population structure and genotype differentiation of T. interdigitale strains may lead to significant improvements in clinical practice. The present study aimed to develop and select suitable variable-number tandem-repeat (VNTR) markers for 92 clinical strains of T. interdigitale. On the basis of an analysis of four VNTR markers, four to eight distinct alleles were detected for each marker. The marker with the highest discriminatory power had eight alleles and a D value of 0.802. The combination of all four markers yielded a D value of 0.969 with 29 distinct multilocus genotypes. VNTR typing revealed the genetic diversity of the strains, identifying three populations according to their colonization sites. A correlation between phenotypic characteristics and multilocus genotypes was observed. Seven patients harbored T. interdigitale strains with different genotypes. Typing of clinical T. interdigitale samples by VNTR markers displayed excellent discriminatory power and 100% reproducibility. PMID:24989614

  13. A large population genetic study of 15 autosomal short tandem repeat loci for establishment of Korean DNA profile database.

    PubMed

    Yoo, Seong Yeon; Cho, Nam Soo; Park, Myung Jin; Seong, Ki Min; Hwang, Jung Ho; Song, Seok Bean; Han, Myun Soo; Lee, Won Tae; Chung, Ki Wha

    2011-07-01

    Genotyping of highly polymorphic short tandem repeat (STR) markers is widely used for the genetic identification of individuals in forensic DNA analyses and in paternity disputes. The National DNA Profile Databank recently established by the DNA Identification Act in Korea contains the computerized STR DNA profiles of individuals convicted of crimes. For the establishment of a large autosomal STR loci population database, 1805 samples were obtained at random from Korean individuals and 15 autosomal STR markers were analyzed using the AmpFlSTR Identifiler PCR Amplification kit. For the 15 autosomal STR markers, no deviations from the Hardy-Weinberg equilibrium were observed. The most informative locus in our data set was the D2S1338 with a discrimination power of 0.9699. The combined matching probability was 1.521 × 10(-17). This large STR profile dataset including atypical alleles will be important for the establishment of the Korean DNA database and for forensic applications.

  14. Developmental validation of a Cannabis sativa STR multiplex system for forensic analysis.

    PubMed

    Howard, Christopher; Gilmore, Simon; Robertson, James; Peakall, Rod

    2008-09-01

    A developmental validation study based on recommendations of the Scientific Working Group on DNA Analysis Methods (SWGDAM) was conducted on a multiplex system of 10 Cannabis sativa short tandem repeat loci. Amplification of the loci in four multiplex reactions was tested across DNA from dried root, stem, and leaf sources, and DNA from fresh, frozen, and dried leaf tissue with a template DNA range of 10.0-0.01 ng. The loci were amplified and scored consistently for all DNA sources when DNA template was in the range of 10.0-1.0 ng. Some allelic dropout and PCR failure occurred in reactions with lower template DNA amounts. Overall, amplification was best using 10.0 ng of template DNA from dried leaf tissue indicating that this is the optimal source material. Cross species amplification was observed in Humulus lupulus for three loci but there was no allelic overlap. This is the first study following SWGDAM validation guidelines to validate short tandem repeat markers for forensic use in plants.

  15. The structure of the protein phosphatase 2A PR65/A subunit reveals the conformation of its 15 tandemly repeated HEAT motifs.

    PubMed

    Groves, M R; Hanlon, N; Turowski, P; Hemmings, B A; Barford, D

    1999-01-08

    The PR65/A subunit of protein phosphatase 2A serves as a scaffolding molecule to coordinate the assembly of the catalytic subunit and a variable regulatory B subunit, generating functionally diverse heterotrimers. Mutations of the beta isoform of PR65 are associated with lung and colon tumors. The crystal structure of the PR65/Aalpha subunit, at 2.3 A resolution, reveals the conformation of its 15 tandemly repeated HEAT sequences, degenerate motifs of approximately 39 amino acids present in a variety of proteins, including huntingtin and importin beta. Individual motifs are composed of a pair of antiparallel alpha helices that assemble in a mainly linear, repetitive fashion to form an elongated molecule characterized by a double layer of alpha helices. Left-handed rotations at three interrepeat interfaces generate a novel left-hand superhelical conformation. The protein interaction interface is formed from the intrarepeat turns that are aligned to form a continuous ridge.

  16. Analysis of an "off-ladder" allele at the Penta D short tandem repeat locus.

    PubMed

    Yang, Y L; Wang, J G; Wang, D X; Zhang, W Y; Liu, X J; Cao, J; Yang, S L

    2015-11-25

    Kinship testing of a father and his son from Guangxi, China, the location of the Zhuang minority people, was performed using the PowerPlex® 18D System with a short tandem repeat typing kit. The results indicated that both the father and his son had an off-ladder allele at the Penta D locus, with a genetic size larger than that of the maximal standard allelic ladder. To further identify this locus, monogenic amplification, gene cloning, and genetic sequencing were performed. Sequencing analysis demonstrated that the fragment size of the Penta D-OL locus was 469 bp and the core sequence was [AAAGA]21, also called Penta D-21. The rare Penta D-21 allele was found to be distributed among the Zhuang population from the Guangxi Zhuang Autonomous Region of China; therefore, this study improved the range of DNA data available for this locus and enhanced our ability for individual identification of gene loci.

  17. Sequence Effect on the Formation of DNA Minidumbbells.

    PubMed

    Liu, Yuan; Lam, Sik Lok

    2017-11-16

    The DNA minidumbbell (MDB) is a recently identified non-B structure. The reported MDBs contain two TTTA, CCTG, or CTTG type II loops. At present, the knowledge and understanding of the sequence criteria for MDB formation are still limited. In this study, we performed a systematic high-resolution nuclear magnetic resonance (NMR) and native gel study to investigate the effect of sequence variations in tandem repeats on the formation of MDBs. Our NMR results reveal the importance of hydrogen bonds, base-base stacking, and hydrophobic interactions from each of the participating residues. We conclude that in the MDBs formed by tandem repeats, C-G loop-closing base pairs are more stabilizing than T-A loop-closing base pairs, and thymine residues in both the second and third loop positions are more stabilizing than cytosine residues. The results from this study enrich our knowledge on the sequence criteria for the formation of MDBs, paving a path for better exploring their potential roles in biological systems and DNA nanotechnology.

  18. Developmental Validation of Short Tandem Repeat Reagent Kit for Forensic DNA Profiling of Canine Biological Materials

    PubMed Central

    Dayton, Melody; Koskinen, Mikko T; Tom, Bradley K; Mattila, Anna-Maria; Johnston, Eric; Halverson, Joy; Fantin, Dennis; DeNise, Sue; Budowle, Bruce; Smith, David Glenn; Kanthaswamy, Sree

    2009-01-01

    Aim To develop a reagent kit that enables multiplex polymerase chain reaction (PCR) amplification of 18 short tandem repeats (STR) and the canine sex-determining Zinc Finger marker. Methods Validation studies to determine the robustness and reliability in forensic DNA typing of this multiplex assay included sensitivity testing, reproducibility studies, intra- and inter-locus color balance studies, annealing temperature and cycle number studies, peak height ratio determination, characterization of artifacts such as stutter percentages and dye blobs, mixture analyses, species-specificity, case type samples analyses and population studies. Results The kit robustly amplified domesticated dog samples and consistently generated full 19-locus profiles from as little as 125 pg of dog DNA. In addition, wolf DNA samples could be analyzed with the kit. Conclusion The kit, which produces robust, reliable, and reproducible results, will be made available for the forensic research community after modifications based on this study’s evaluation to comply with the quality standards expected for forensic casework. PMID:19480022

  19. Genetic analysis of eight population groups living in Taiwan using a 13 X-chromosomal STR loci multiplex system.

    PubMed

    Hwa, Hsiao-Lin; Lee, James Chun-I; Chang, Yih-Yuan; Yin, Hsiang-Yi; Chen, Ya-Hui; Tseng, Li-Hui; Su, Yi-Ning; Ko, Tsang-Ming

    2011-01-01

    A 13 X-chromosomal short tandem repeat (STR) multiplex system (DXS6807, DXS8378, DSX9902, DXS7132, DXS9898, DXS6809, DXS6789, DXS7424, DXS101, GATA172D05, HPRTB, DXS8377, and DXS7423) was tested on 1,037 DNA samples from eight population groups currently living in Taiwan. Different distributions of the allelic frequencies in different populations were presented. DXS8377 and DXS101 were the two most polymorphic loci in these eight populations, whereas DXS7423 was the least informative marker in most of the populations studied. The genetic distances between the populations and the constructed phylogenetic tree revealed a long genetic distance between Asian and Caucasian populations as well as isolation of the Tao population. The phylogenetic tree grouped populations into clusters compatible with their ethnogeographic relationships. This 13 X-chromosomal short tandem repeat multiplex system offers a considerable number of polymorphic patterns in different populations. This system can be useful in forensic identification casework and ethnogeographic research.

  20. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bae, Song Yi; Kim, Seulgi; Hwang, Heejin

    Research highlights: {yields} Formation of the {alpha}-synuclein amyloid fibrils by [BIMbF{sub 3}Im]. {yields} Disaggregation of amyloid fibrils by epigallocatechin gallate (EGCG) and baicalein. {yields} Amyloid formation of {alpha}-synuclein tandem repeat ({alpha}-TR). -- Abstract: The aggregation of {alpha}-synuclein is clearly related to the pathogenesis of Parkinson's disease. Therefore, detailed understanding of the mechanism of fibril formation is highly valuable for the development of clinical treatment and also of the diagnostic tools. Here, we have investigated the interaction of {alpha}-synuclein with ionic liquids by using several biochemical techniques including Thioflavin T assays and transmission electron microscopy (TEM). Our data shows a rapidmore » formation of {alpha}-synuclein amyloid fibrils was stimulated by 1-butyl-3-methylimidazolium bis(trifluoromethylsulfonyl)imide [BIMbF{sub 3}Im], and these fibrils could be disaggregated by polyphenols such as epigallocatechin gallate (EGCG) and baicalein. Furthermore, the effect of [BIMbF{sub 3}Im] on the {alpha}-synuclein tandem repeat ({alpha}-TR) in the aggregation process was studied.« less

  1. Multiple-locus variable number of tandem repeat analysis (MLVA) of Irish verocytotoxigenic Escherichia coli O157 from feedlot cattle: uncovering strain dissemination routes.

    PubMed

    Murphy, Mary; Minihan, Donal; Buckley, James F; O'Mahony, Micheál; Whyte, Paul; Fanning, Séamus

    2008-01-24

    The identification of the routes of dissemination of Escherichia coli (E. coli) O157 through a cohort of cattle is a critical step to control this pathogen at farm level. The aim of this study was to identify potential routes of dissemination of E. coli O157 using Multiple-Locus Variable number of tandem repeat Analysis (MLVA). Thirty-eight environmental and sixteen cattle faecal isolates, which were detected in four adjacent pens over a four-month period were sub-typed. MLVA could separate these isolates into broadly defined clusters consisting of twelve MLVA types. Strain diversity was observed within pens, individual cattle and the environment. Application of MLVA is a broadly useful and convenient tool when applied to uncover the dissemination of E. coli O157 in the environment and in supporting improved on-farm management of this important pathogen. These data identified diverse strain types based on amplification of VNTR markers in each case.

  2. Characterization of Escherichia coli O157:H7 in New Zealand using multiple-locus variable-number tandem-repeat analysis.

    PubMed

    Dyet, K H; Robertson, I; Turbitt, E; Carter, P E

    2011-03-01

    Recently, multiple-locus variable-number tandem-repeat analysis (MLVA) has been proposed as an alternative to pulsed-field gel electrophoresis (PFGE) for characterization of Escherichia coli O157:H7. In this study we characterized 118 E. coli O157:H7 isolates from cases of gastrointestinal disease in New Zealand using XbaI PFGE profiles and a MLVA scheme that assessed variability in eight polymorphic loci. The 118 isolates characterized included all 80 E. coli O157:H7 referred to New Zealand's Enteric Reference Laboratory in 2006 and 29 phage-type 2 isolates from 2005. When applied to these isolates the discriminatory power of PFGE and MLVA was not significantly different. However, MLVA data may be more epidemiologically relevant as isolates from family clusters of disease had identical MLVA profiles, even when the XbaI PFGE profiles differed slightly. Furthermore, most isolates with indistinguishable XbaI PFGE profiles that did not appear to be epidemiologically related had distinct MLVA profiles.

  3. A multiple-locus variable-number tandem repeat analysis (MLVA) of Listeria monocytogenes isolated from Norwegian salmon-processing factories and from listeriosis patients.

    PubMed

    Lunestad, B T; Truong, T T T; Lindstedt, B-A

    2013-10-01

    The objective of this study was to characterize Listeria monocytogenes isolated from farmed Atlantic salmon (Salmo salar) and the processing environment in three different Norwegian factories, and compare these to clinical isolates by multiple-locus variable-number tandem repeat analysis (MLVA). The 65 L. monocytogenes isolates obtained gave 15 distinct MLVA profiles. There was great heterogeneity in the distribution of MLVA profiles in factories and within each factory. Nine of the 15 MLVA profiles found in the fish-associated isolates were found to match human profiles. The MLVA profile 07-07-09-10-06 was the most common strain in Norwegian listeriosis patients. L. monocytogenes with this profile has previously been associated with at least two known listeriosis outbreaks in Norway, neither determined to be due to fish consumption. However, since this profile was also found in fish and in the processing environment, fish should be considered as a possible food vehicle during sporadic cases and outbreaks of listeriosis.

  4. Dancing together and separate again: gymnosperms exhibit frequent changes of fundamental 5S and 35S rRNA gene (rDNA) organisation

    PubMed Central

    Garcia, S; Kovařík, A

    2013-01-01

    In higher eukaryotes, the 5S rRNA genes occur in tandem units and are arranged either separately (S-type arrangement) or linked to other repeated genes, in most cases to rDNA locus encoding 18S–5.8S–26S genes (L-type arrangement). Here we used Southern blot hybridisation, PCR and sequencing approaches to analyse genomic organisation of rRNA genes in all large gymnosperm groups, including Coniferales, Ginkgoales, Gnetales and Cycadales. The data are provided for 27 species (21 genera). The 5S units linked to the 35S rDNA units occur in some but not all Gnetales, Coniferales and in Ginkgo (∼30% of the species analysed), while the remaining exhibit separate organisation. The linked 5S rRNA genes may occur as single-copy insertions or as short tandems embedded in the 26S–18S rDNA intergenic spacer (IGS). The 5S transcript may be encoded by the same (Ginkgo, Ephedra) or opposite (Podocarpus) DNA strand as the 18S–5.8S–26S genes. In addition, pseudogenised 5S copies were also found in some IGS types. Both L- and S-type units have been largely homogenised across the genomes. Phylogenetic relationships based on the comparison of 5S coding sequences suggest that the 5S genes independently inserted IGS at least three times in the course of gymnosperm evolution. Frequent transpositions and rearrangements of basic units indicate relatively relaxed selection pressures imposed on genomic organisation of 5S genes in plants. PMID:23512008

  5. Dancing together and separate again: gymnosperms exhibit frequent changes of fundamental 5S and 35S rRNA gene (rDNA) organisation.

    PubMed

    Garcia, S; Kovařík, A

    2013-07-01

    In higher eukaryotes, the 5S rRNA genes occur in tandem units and are arranged either separately (S-type arrangement) or linked to other repeated genes, in most cases to rDNA locus encoding 18S-5.8S-26S genes (L-type arrangement). Here we used Southern blot hybridisation, PCR and sequencing approaches to analyse genomic organisation of rRNA genes in all large gymnosperm groups, including Coniferales, Ginkgoales, Gnetales and Cycadales. The data are provided for 27 species (21 genera). The 5S units linked to the 35S rDNA units occur in some but not all Gnetales, Coniferales and in Ginkgo (∼30% of the species analysed), while the remaining exhibit separate organisation. The linked 5S rRNA genes may occur as single-copy insertions or as short tandems embedded in the 26S-18S rDNA intergenic spacer (IGS). The 5S transcript may be encoded by the same (Ginkgo, Ephedra) or opposite (Podocarpus) DNA strand as the 18S-5.8S-26S genes. In addition, pseudogenised 5S copies were also found in some IGS types. Both L- and S-type units have been largely homogenised across the genomes. Phylogenetic relationships based on the comparison of 5S coding sequences suggest that the 5S genes independently inserted IGS at least three times in the course of gymnosperm evolution. Frequent transpositions and rearrangements of basic units indicate relatively relaxed selection pressures imposed on genomic organisation of 5S genes in plants.

  6. Solution structure of the tandem acyl carrier protein domains from a polyunsaturated fatty acid synthase reveals beads-on-a-string configuration.

    PubMed

    Trujillo, Uldaeliz; Vázquez-Rosa, Edwin; Oyola-Robles, Delise; Stagg, Loren J; Vassallo, David A; Vega, Irving E; Arold, Stefan T; Baerga-Ortiz, Abel

    2013-01-01

    The polyunsaturated fatty acid (PUFA) synthases from deep-sea bacteria invariably contain multiple acyl carrier protein (ACP) domains in tandem. This conserved tandem arrangement has been implicated in both amplification of fatty acid production (additive effect) and in structural stabilization of the multidomain protein (synergistic effect). While the more accepted model is one in which domains act independently, recent reports suggest that ACP domains may form higher oligomers. Elucidating the three-dimensional structure of tandem arrangements may therefore give important insights into the functional relevance of these structures, and hence guide bioengineering strategies. In an effort to elucidate the three-dimensional structure of tandem repeats from deep-sea anaerobic bacteria, we have expressed and purified a fragment consisting of five tandem ACP domains from the PUFA synthase from Photobacterium profundum. Analysis of the tandem ACP fragment by analytical gel filtration chromatography showed a retention time suggestive of a multimeric protein. However, small angle X-ray scattering (SAXS) revealed that the multi-ACP fragment is an elongated monomer which does not form a globular unit. Stokes radii calculated from atomic monomeric SAXS models were comparable to those measured by analytical gel filtration chromatography, showing that in the gel filtration experiment, the molecular weight was overestimated due to the elongated protein shape. Thermal denaturation monitored by circular dichroism showed that unfolding of the tandem construct was not cooperative, and that the tandem arrangement did not stabilize the protein. Taken together, these data are consistent with an elongated beads-on-a-string arrangement of the tandem ACP domains in PUFA synthases, and speak against synergistic biocatalytic effects promoted by quaternary structuring. Thus, it is possible to envision bioengineering strategies which simply involve the artificial linking of multiple ACP domains for increasing the yield of fatty acids in bacterial cultures.

  7. Solution Structure of the Tandem Acyl Carrier Protein Domains from a Polyunsaturated Fatty Acid Synthase Reveals Beads-on-a-String Configuration

    PubMed Central

    Trujillo, Uldaeliz; Vázquez-Rosa, Edwin; Oyola-Robles, Delise; Stagg, Loren J.; Vassallo, David A.; Vega, Irving E.; Arold, Stefan T.; Baerga-Ortiz, Abel

    2013-01-01

    The polyunsaturated fatty acid (PUFA) synthases from deep-sea bacteria invariably contain multiple acyl carrier protein (ACP) domains in tandem. This conserved tandem arrangement has been implicated in both amplification of fatty acid production (additive effect) and in structural stabilization of the multidomain protein (synergistic effect). While the more accepted model is one in which domains act independently, recent reports suggest that ACP domains may form higher oligomers. Elucidating the three-dimensional structure of tandem arrangements may therefore give important insights into the functional relevance of these structures, and hence guide bioengineering strategies. In an effort to elucidate the three-dimensional structure of tandem repeats from deep-sea anaerobic bacteria, we have expressed and purified a fragment consisting of five tandem ACP domains from the PUFA synthase from Photobacterium profundum. Analysis of the tandem ACP fragment by analytical gel filtration chromatography showed a retention time suggestive of a multimeric protein. However, small angle X-ray scattering (SAXS) revealed that the multi-ACP fragment is an elongated monomer which does not form a globular unit. Stokes radii calculated from atomic monomeric SAXS models were comparable to those measured by analytical gel filtration chromatography, showing that in the gel filtration experiment, the molecular weight was overestimated due to the elongated protein shape. Thermal denaturation monitored by circular dichroism showed that unfolding of the tandem construct was not cooperative, and that the tandem arrangement did not stabilize the protein. Taken together, these data are consistent with an elongated beads-on-a-string arrangement of the tandem ACP domains in PUFA synthases, and speak against synergistic biocatalytic effects promoted by quaternary structuring. Thus, it is possible to envision bioengineering strategies which simply involve the artificial linking of multiple ACP domains for increasing the yield of fatty acids in bacterial cultures. PMID:23469090

  8. Full mitochondrial genome sequences of two endemic Philippine hornbill species (Aves: Bucerotidae) provide evidence for pervasive mitochondrial DNA recombination.

    PubMed

    Sammler, Svenja; Bleidorn, Christoph; Tiedemann, Ralph

    2011-01-14

    Although nowaday it is broadly accepted that mitochondrial DNA (mtDNA) may undergo recombination, the frequency of such recombination remains controversial. Its estimation is not straightforward, as recombination under homoplasmy (i.e., among identical mt genomes) is likely to be overlooked. In species with tandem duplications of large mtDNA fragments the detection of recombination can be facilitated, as it can lead to gene conversion among duplicates. Although the mechanisms for concerted evolution in mtDNA are not fully understood yet, recombination rates have been estimated from "one per speciation event" down to 850 years or even "during every replication cycle". Here we present the first complete mt genome of the avian family Bucerotidae, i.e., that of two Philippine hornbills, Aceros waldeni and Penelopides panini. The mt genomes are characterized by a tandemly duplicated region encompassing part of cytochrome b, 3 tRNAs, NADH6, and the control region. The duplicated fragments are identical to each other except for a short section in domain I and for the length of repeat motifs in domain III of the control region. Due to the heteroplasmy with regard to the number of these repeat motifs, there is some size variation in both genomes; with around 21,657 bp (A. waldeni) and 22,737 bp (P. panini), they significantly exceed the hitherto longest known avian mt genomes, that of the albatrosses. We discovered concerted evolution between the duplicated fragments within individuals. The existence of differences between individuals in coding genes as well as in the control region, which are maintained between duplicates, indicates that recombination apparently occurs frequently, i.e., in every generation. The homogenised duplicates are interspersed by a short fragment which shows no sign of recombination. We hypothesize that this region corresponds to the so-called Replication Fork Barrier (RFB), which has been described from the chicken mitochondrial genome. As this RFB is supposed to halt replication, it offers a potential mechanistic explanation for frequent recombination in mitochondrial genomes.

  9. Full mitochondrial genome sequences of two endemic Philippine hornbill species (Aves: Bucerotidae) provide evidence for pervasive mitochondrial DNA recombination

    PubMed Central

    2011-01-01

    Background Although nowaday it is broadly accepted that mitochondrial DNA (mtDNA) may undergo recombination, the frequency of such recombination remains controversial. Its estimation is not straightforward, as recombination under homoplasmy (i.e., among identical mt genomes) is likely to be overlooked. In species with tandem duplications of large mtDNA fragments the detection of recombination can be facilitated, as it can lead to gene conversion among duplicates. Although the mechanisms for concerted evolution in mtDNA are not fully understood yet, recombination rates have been estimated from "one per speciation event" down to 850 years or even "during every replication cycle". Results Here we present the first complete mt genome of the avian family Bucerotidae, i.e., that of two Philippine hornbills, Aceros waldeni and Penelopides panini. The mt genomes are characterized by a tandemly duplicated region encompassing part of cytochrome b, 3 tRNAs, NADH6, and the control region. The duplicated fragments are identical to each other except for a short section in domain I and for the length of repeat motifs in domain III of the control region. Due to the heteroplasmy with regard to the number of these repeat motifs, there is some size variation in both genomes; with around 21,657 bp (A. waldeni) and 22,737 bp (P. panini), they significantly exceed the hitherto longest known avian mt genomes, that of the albatrosses. We discovered concerted evolution between the duplicated fragments within individuals. The existence of differences between individuals in coding genes as well as in the control region, which are maintained between duplicates, indicates that recombination apparently occurs frequently, i.e., in every generation. Conclusions The homogenised duplicates are interspersed by a short fragment which shows no sign of recombination. We hypothesize that this region corresponds to the so-called Replication Fork Barrier (RFB), which has been described from the chicken mitochondrial genome. As this RFB is supposed to halt replication, it offers a potential mechanistic explanation for frequent recombination in mitochondrial genomes. PMID:21235758

  10. Allelic association of a dopamine transporter gene polymorphism with antisocial behaviour in heroin-dependent patients.

    PubMed

    Gerra, Gilberto; Garofano, Luciano; Pellegrini, Caterina; Bosari, Silvano; Zaimovic, Amir; Moi, Gabriele; Avanzini, Paola; Talarico, Enrica; Gardini, Federica; Donnini, Claudia

    2005-09-01

    Polymorphism of a variable number of tandem repeats (VNTR) in the 3' untranslated region of exon 15 of the SLC6A3 gene, coding for the dopamine transporter (DAT), was analysed to test whether length variation contributes to differences in the individual susceptibility to aggressive - criminal behaviour and liability to heroin dependence. The repeat number of the DAT polymorphism was assessed in 125 healthy subjects and 104 heroin-dependent subjects (including 52 addicted individuals with violent behaviour and criminal records). There was no significant difference in the frequencies of genotypes and alleles between heroin-dependent subjects and control subjects. On the contrary, there was a significant difference between offenders and non-offenders, p = 0.004 and p = 0.002, respectively, among heroin-dependent subjects. No association was found between DAT polymorphism and history of suicide. Buss - Durkee Hostility Inventory (BDHI) mean total scores were significantly higher in heroin addicts than in controls (p < 0.001) and in antisocial - violent heroin addicts in comparison with addicted individuals without antisocial behaviour (p < 0.005). The regression analysis of BDHI subscales, performed to provide an estimate of the magnitude of any potential effect on the risk of aggressiveness associated with the variants in DAT VNTR, showed that the presence of the 9 - 9 genotype significantly increases the risk of irritability and direct aggressiveness more than six and 10 times with respect to the 9 - 10 genotype. Our findings suggest that the 9-repeat allele of the DAT polymorphism confers increased susceptibility to antisocial - violent behaviour and aggressiveness, rather than drug dependence per se in heroin-dependent males.

  11. Comparative Genomic and Transcriptomic Characterization of the Toxigenic Marine Dinoflagellate Alexandrium ostenfeldii

    PubMed Central

    Jaeckisch, Nina; Yang, Ines; Wohlrab, Sylke; Glöckner, Gernot; Kroymann, Juergen; Vogel, Heiko; Cembella, Allan; John, Uwe

    2011-01-01

    Many dinoflagellate species are notorious for the toxins they produce and ecological and human health consequences associated with harmful algal blooms (HABs). Dinoflagellates are particularly refractory to genomic analysis due to the enormous genome size, lack of knowledge about their DNA composition and structure, and peculiarities of gene regulation, such as spliced leader (SL) trans-splicing and mRNA transposition mechanisms. Alexandrium ostenfeldii is known to produce macrocyclic imine toxins, described as spirolides. We characterized the genome of A. ostenfeldii using a combination of transcriptomic data and random genomic clones for comparison with other dinoflagellates, particularly Alexandrium species. Examination of SL sequences revealed similar features as in other dinoflagellates, including Alexandrium species. SL sequences in decay indicate frequent retro-transposition of mRNA species. This probably contributes to overall genome complexity by generating additional gene copies. Sequencing of several thousand fosmid and bacterial artificial chromosome (BAC) ends yielded a wealth of simple repeats and tandemly repeated longer sequence stretches which we estimated to comprise more than half of the whole genome. Surprisingly, the repeats comprise a very limited set of 79–97 bp sequences; in part the genome is thus a relatively uniform sequence space interrupted by coding sequences. Our genomic sequence survey (GSS) represents the largest genomic data set of a dinoflagellate to date. Alexandrium ostenfeldii is a typical dinoflagellate with respect to its transcriptome and mRNA transposition but demonstrates Alexandrium-like stop codon usage. The large portion of repetitive sequences and the organization within the genome is in agreement with several other studies on dinoflagellates using different approaches. It remains to be determined whether this unusual composition is directly correlated to the exceptionally genome organization of dinoflagellates with a low amount of histones and histone-like proteins. PMID:22164224

  12. The organization and evolution of the Responder satellite in species of the Drosophila melanogaster group: dynamic evolution of a target of meiotic drive.

    PubMed

    Larracuente, Amanda M

    2014-11-25

    Satellite DNA can make up a substantial fraction of eukaryotic genomes and has roles in genome structure and chromosome segregation. The rapid evolution of satellite DNA can contribute to genomic instability and genetic incompatibilities between species. Despite its ubiquity and its contribution to genome evolution, we currently know little about the dynamics of satellite DNA evolution. The Responder (Rsp) satellite DNA family is found in the pericentric heterochromatin of chromosome 2 of Drosophila melanogaster. Rsp is well-known for being the target of Segregation Distorter (SD)- an autosomal meiotic drive system in D. melanogaster. I present an evolutionary genetic analysis of the Rsp family of repeats in D. melanogaster and its closely-related species in the melanogaster group (D. simulans, D. sechellia, D. mauritiana, D. erecta, and D. yakuba) using a combination of available BAC sequences, whole genome shotgun Sanger reads, Illumina short read deep sequencing, and fluorescence in situ hybridization. I show that Rsp repeats have euchromatic locations throughout the D. melanogaster genome, that Rsp arrays show evidence for concerted evolution, and that Rsp repeats exist outside of D. melanogaster, in the melanogaster group. The repeats in these species are considerably diverged at the sequence level compared to D. melanogaster, and have a strikingly different genomic distribution, even between closely-related sister taxa. The genomic organization of the Rsp repeat in the D. melanogaster genome is complex-it exists of large blocks of tandem repeats in the heterochromatin and small blocks of tandem repeats in the euchromatin. My discovery of heterochromatic Rsp-like sequences outside of D. melanogaster suggests that SD evolved after its target satellite and that the evolution of the Rsp satellite family is highly dynamic over a short evolutionary time scale (<240,000 years).

  13. Time code dissemination experiment via the SIRIO-1 VHF transponder

    NASA Technical Reports Server (NTRS)

    Detoma, E.; Gobbo, G.; Leschiutta, S.; Pettiti, V.

    1982-01-01

    An experiment to evaluate the possibility of disseminating a time code via the SIRIO-1 satellite, by using the onboard VHF repeater is described. The precision in the synchronization of remote clocks was expected to be of the order of 0.1 to 1 ms. The RF carrier was in the VHF band, so that low cost receivers could be used and then a broader class of users could be served. An already existing repeater, even if not designed specifically for communications could be utilized; the operation of this repeater was not intended to affect any other function of the spacecraft (both the SHF repeater and the VHF telemetry link were active during the time code dissemination via the VHF transponder).

  14. Comprehensive mutation analysis of 17 Y-chromosomal short tandem repeat polymorphisms included in the AmpFlSTR Yfiler PCR amplification kit.

    PubMed

    Goedbloed, Miriam; Vermeulen, Mark; Fang, Rixun N; Lembring, Maria; Wollstein, Andreas; Ballantyne, Kaye; Lao, Oscar; Brauer, Silke; Krüger, Carmen; Roewer, Lutz; Lessig, Rüdiger; Ploski, Rafal; Dobosz, Tadeusz; Henke, Lotte; Henke, Jürgen; Furtado, Manohar R; Kayser, Manfred

    2009-11-01

    The Y-chromosomal short tandem repeat (Y-STR) polymorphisms included in the AmpFlSTR Yfiler polymerase chain reaction amplification kit have become widely used for forensic and evolutionary applications where a reliable knowledge on mutation properties is necessary for correct data interpretation. Therefore, we investigated the 17 Yfiler Y-STRs in 1,730-1,764 DNA-confirmed father-son pairs per locus and found 84 sequence-confirmed mutations among the 29,792 meiotic transfers covered. Of the 84 mutations, 83 (98.8%) were single-repeat changes and one (1.2%) was a double-repeat change (ratio, 1:0.01), as well as 43 (51.2%) were repeat gains and 41 (48.8%) repeat losses (ratio, 1:0.95). Medians from Bayesian estimation of locus-specific mutation rates ranged from 0.0003 for DYS448 to 0.0074 for DYS458, with a median rate across all 17 Y-STRs of 0.0025. The mean age (at the time of son's birth) of fathers with mutations was with 34.40 (+/-11.63) years higher than that of fathers without ones at 30.32 (+/-10.22) years, a difference that is highly statistically significant (p < 0.001). A Poisson-based modeling revealed that the Y-STR mutation rate increased with increasing father's age on a statistically significant level (alpha = 0.0294, 2.5% quantile = 0.0001). From combining our data with those previously published, considering all together 135,212 meiotic events and 331 mutations, we conclude for the Yfiler Y-STRs that (1) none had a mutation rate of >1%, 12 had mutation rates of >0.1% and four of <0.1%, (2) single-repeat changes were strongly favored over multiple-repeat ones for all loci but 1 and (3) considerable variation existed among loci in the ratio of repeat gains versus losses. Our finding of three Y-STR mutations in one father-son pair (and two pairs with two mutations each) has consequences for determining the threshold of allelic differences to conclude exclusion constellations in future applications of Y-STRs in paternity testing and pedigree analyses.

  15. The Peculiar Landscape of Repetitive Sequences in the Olive (Olea europaea L.) Genome

    PubMed Central

    Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

    2014-01-01

    Analyzing genome structure in different species allows to gain an insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome. PMID:24671744

  16. The peculiar landscape of repetitive sequences in the olive (Olea europaea L.) genome.

    PubMed

    Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

    2014-04-01

    Analyzing genome structure in different species allows to gain an insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome.

  17. Distinct Copy Number, Coding Sequence, and Locus Methylation Patterns Underlie Rhg1-Mediated Soybean Resistance to Soybean Cyst Nematode1[W][OPEN

    PubMed Central

    Cook, David E.; Bayless, Adam M.; Wang, Kai; Guo, Xiaoli; Song, Qijian; Jiang, Jiming; Bent, Andrew F.

    2014-01-01

    Copy number variation of kilobase-scale genomic DNA segments, beyond presence/absence polymorphisms, can be an important driver of adaptive traits. Resistance to Heterodera glycines (Rhg1) is a widely utilized quantitative trait locus that makes the strongest known contribution to resistance against soybean cyst nematode (SCN), Heterodera glycines, the most damaging pathogen of soybean (Glycine max). Rhg1 was recently discovered to be a complex locus at which resistance-conferring haplotypes carry up to 10 tandem repeat copies of a 31-kb DNA segment, and three disparate genes present on each repeat contribute to SCN resistance. Here, we use whole-genome sequencing, fiber-FISH (fluorescence in situ hybridization), and other methods to discover the genetic variation at Rhg1 across 41 diverse soybean accessions. Based on copy number variation, transcript abundance, nucleic acid polymorphisms, and differentially methylated DNA regions, we find that SCN resistance is associated with multicopy Rhg1 haplotypes that form two distinct groups. The tested high-copy-number Rhg1 accessions, including plant introduction (PI) 88788, contain a flexible number of copies (seven to 10) of the 31-kb Rhg1 repeat. The identified low-copy-number Rhg1 group, including PI 548402 (Peking) and PI 437654, contains three copies of the Rhg1 repeat and a newly identified allele of Glyma18g02590 (a predicted α-SNAP [α-soluble N-ethylmaleimide–sensitive factor attachment protein]). There is strong evidence for a shared origin of the two resistance-conferring multicopy Rhg1 groups and subsequent independent evolution. Differentially methylated DNA regions also were identified within Rhg1 that correlate with SCN resistance. These data provide insights into copy number variation of multigene segments, using as the example a disease resistance trait of high economic importance. PMID:24733883

  18. The evolution and function of protein tandem repeats in plants.

    PubMed

    Schaper, Elke; Anisimova, Maria

    2015-04-01

    Sequence tandem repeats (TRs) are abundant in proteomes across all domains of life. For plants, little is known about their distribution or contribution to protein function. We exhaustively annotated TRs and studied the evolution of TR unit variations for all Ensembl plants. Using phylogenetic patterns of TR units, we detected conserved TRs with unit number and order preserved during evolution, and those TRs that have diverged via recent TR unit gains/losses. We correlated the mode of evolution of TRs to protein function. TR number was strongly correlated with proteome size, with about one-half of all TRs recognized as common protein domains. The majority of TRs have been highly conserved over long evolutionary distances, some since the separation of red algae and green plants c. 1.6 billion yr ago. Conversely, recurrent recent TR unit mutations were rare. Our results suggest that the first TRs by far predate the first plants, and that TR appearance is an ongoing process with similar rates across the plant kingdom. Interestingly, the few detected highly mutable TRs might provide a source of variation for rapid adaptation. In particular, such TRs are enriched in leucine-rich repeats (LRRs) commonly found in R genes, where TR unit gain/loss may facilitate resistance to emerging pathogens. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.

  19. Fully automated screening of veterinary drugs in milk by turbulent flow chromatography and tandem mass spectrometry

    PubMed Central

    Stolker, Alida A. M.; Peters, Ruud J. B.; Zuiderent, Richard; DiBussolo, Joseph M.

    2010-01-01

    There is an increasing interest in screening methods for quick and sensitive analysis of various classes of veterinary drugs with limited sample pre-treatment. Turbulent flow chromatography in combination with tandem mass spectrometry has been applied for the first time as an efficient screening method in routine analysis of milk samples. Eight veterinary drugs, belonging to seven different classes were selected for this study. After developing and optimising the method, parameters such as linearity, repeatability, matrix effects and carry-over were studied. The screening method was then tested in the routine analysis of 12 raw milk samples. Even without internal standards, the linearity of the method was found to be good in the concentration range of 50 to 500 µg/L. Regarding repeatability, RSDs below 12% were obtained for all analytes, with only a few exceptions. The limits of detection were between 0.1 and 5.2 µg/L, far below the maximum residue levels for milk set by the EU regulations. While matrix effects—ion suppression or enhancement—are obtained for all the analytes the method has proved to be useful for screening purposes because of its sensitivity, linearity and repeatability. Furthermore, when performing the routine analysis of the raw milk samples, no false positive or negative results were obtained. PMID:20379812

  20. Multi-laboratory validation study of multilocus variable-number tandem repeat analysis (MLVA) for Salmonella enterica serovar Enteritidis, 2015

    PubMed Central

    Peters, Tansy; Bertrand, Sophie; Björkman, Jonas T; Brandal, Lin T; Brown, Derek J; Erdõsi, Tímea; Heck, Max; Ibrahem, Salha; Johansson, Karin; Kornschober, Christian; Kotila, Saara M; Le Hello, Simon; Lienemann, Taru; Mattheus, Wesley; Nielsen, Eva Møller; Ragimbeau, Catherine; Rumore, Jillian; Sabol, Ashley; Torpdahl, Mia; Trees, Eija; Tuohy, Alma; de Pinna, Elizabeth

    2017-01-01

    Multilocus variable-number tandem repeat analysis (MLVA) is a rapid and reproducible typing method that is an important tool for investigation, as well as detection, of national and multinational outbreaks of a range of food-borne pathogens. Salmonella enterica serovar Enteritidis is the most common Salmonella serovar associated with human salmonellosis in the European Union/European Economic Area and North America. Fourteen laboratories from 13 countries in Europe and North America participated in a validation study for MLVA of S. Enteritidis targeting five loci. Following normalisation of fragment sizes using a set of reference strains, a blinded set of 24 strains with known allele sizes was analysed by each participant. The S. Enteritidis 5-loci MLVA protocol was shown to produce internationally comparable results as more than 90% of the participants reported less than 5% discrepant MLVA profiles. All 14 participating laboratories performed well, even those where experience with this typing method was limited. The raw fragment length data were consistent throughout, and the inter-laboratory validation helped to standardise the conversion of raw data to repeat numbers with at least two countries updating their internal procedures. However, differences in assigned MLVA profiles remain between well-established protocols and should be taken into account when exchanging data. PMID:28277220

  1. Mitochondrial DNA repairs double-strand breaks in yeast chromosomes.

    PubMed

    Ricchetti, M; Fairhead, C; Dujon, B

    1999-11-04

    The endosymbiotic theory for the origin of eukaryotic cells proposes that genetic information can be transferred from mitochondria to the nucleus of a cell, and genes that are probably of mitochondrial origin have been found in nuclear chromosomes. Occasionally, short or rearranged sequences homologous to mitochondrial DNA are seen in the chromosomes of different organisms including yeast, plants and humans. Here we report a mechanism by which fragments of mitochondrial DNA, in single or tandem array, are transferred to yeast chromosomes under natural conditions during the repair of double-strand breaks in haploid mitotic cells. These repair insertions originate from noncontiguous regions of the mitochondrial genome. Our analysis of the Saccharomyces cerevisiae mitochondrial genome indicates that the yeast nuclear genome does indeed contain several short sequences of mitochondrial origin which are similar in size and composition to those that repair double-strand breaks. These sequences are located predominantly in non-coding regions of the chromosomes, frequently in the vicinity of retrotransposon long terminal repeats, and appear as recent integration events. Thus, colonization of the yeast genome by mitochondrial DNA is an ongoing process.

  2. Human U2 snRNA Genes Exhibit a Persistently Open Transcriptional State and Promoter Disassembly at Metaphase▿

    PubMed Central

    Pavelitz, Thomas; Bailey, Arnold D.; Elco, Christopher P.; Weiner, Alan M.

    2008-01-01

    In mammals, small multigene families generate spliceosomal U snRNAs that are nearly as abundant as rRNA. Using the tandemly repeated human U2 genes as a model, we show by footprinting with DNase I and permanganate that nearly all sequences between the enhancer-like distal sequence element and the initiation site are protected during interphase whereas the upstream half of the U2 snRNA coding region is exposed. We also show by chromatin immunoprecipitation that the SNAPc complex, which binds the TATA-like proximal sequence element, is removed at metaphase but remains bound under conditions that induce locus-specific metaphase fragility of the U2 genes, such as loss of CSB, BRCA1, or BRCA2 function, treatment with actinomycin D, or overexpression of the tetrameric p53 C terminus. We propose that the U2 snRNA promoter establishes a persistently open state to facilitate rapid reinitiation and perhaps also to bypass TFIIH-dependent promoter melting; this open state would then be disassembled to allow metaphase chromatin condensation. PMID:18378697

  3. The genome of obligately intracellular Ehrlichia canis revealsthemes of complex membrane structure and immune evasion strategies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mavromatis, K.; Kuyler Doyle, C.; Lykidis, A.

    2005-09-01

    Ehrlichia canis, a small obligately intracellular, tick-transmitted, gram-negative, a-proteobacterium is the primary etiologic agent of globally distributed canine monocytic ehrlichiosis. Complete genome sequencing revealed that the E. canis genome consists of a single circular chromosome of 1,315,030 bp predicted to encode 925 proteins, 40 stable RNA species, and 17 putative pseudogenes, and a substantial proportion of non-coding sequence (27 percent). Interesting genome features include a large set of proteins with transmembrane helices and/or signal sequences, and a unique serine-threonine bias associated with the potential for O-glycosylation that was prominent in proteins associated with pathogen-host interactions. Furthermore, two paralogous protein familiesmore » associated with immune evasion were identified, one of which contains poly G:C tracts, suggesting that they may play a role in phase variation and facilitation of persistent infections. Proteins associated with pathogen-host interactions were identified including a small group of proteins (12) with tandem repeats and another with eukaryotic-like ankyrin domains (7).« less

  4. Characterization of the patterns of polymorphism in a [open quotes]cryptic repeat[close quotes] reveals a novel type of hypervariable sequence

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jacobson, D.P.; Schmeling, P.; Sommer, S.S.

    Alternating purine and pyrimidine repeats (RY(i)) are an abundant source of polymorphism. The subset with long tandem repeats of GT or AC (GT(i)) have been studied extensively, but cryptic RY(i) (i.e., no single tandem repeat predominates) have received little attention. The factor IX gene has a polymorphic cryptic RY(i) of 142-216 bp. Previously, there were four known polymorphic alleles, of the form AB, A[sub 2]B, A[sub 2]B[sub 2], and A[sub 3]B[sub 2], where A = (GT)(AC)[sub 3](AT)[sub 3](GT)(AT)[sub 4] and B = A with an additional 3' AT dinucleotide. To further characterize this locus, the authors examined more than 1,700more » additional human chromosomes and determined the sequences of the homologous sites in orangutans and chimpanzees. The novel alleles found in humans expand the repertoire of A/B alleles to A[sub 0-4]B[sub 1] and A[sub 1-3]B[sub 2]. The A[sub n]B[sub 2] series are abundant in Caucasians but are absent in blacks and Asians. Conversely, the A[sub 0]B[sub 1] allele is common in blacks but is not found in more than 1,700 Caucasian chromosomes. The data are compatible with a model in which recombination is more frequent than polymerase slippage at this locus. In orangutans, the RY(i) is present, but the sequence is markedly different. An A/B-type of pattern was discerned in which B differs from A by an additional six (AT) dinucleotides at the 3' end. In chimpanzees, the size of the RY(i) locus was greatly expanded, and the sequence showed a novel pattern of hypervariability in which there are many tandem repeats of the form (GT)[sub n](AC)[sub 0](AT)[sub p](GT)[sub q](AT)[sub s], where n, o, p, q, and s are different integers. The sequences of the factor IX intron 1 cryptic RY(i) in three primates provide perspective on the range of possible patterns of polymorphism. Analysis of the patterns suggests how the RY(i) can be conserved during evolution, while the precise sequence varies. 25 refs., 5 figs., 3 tabs.« less

  5. Natural Burkholderia mallei Infection in Dromedary, Bahrain

    PubMed Central

    Wernery, Ulrich; Wernery, Renate; Joseph, Marina; Al-Salloom, Fajer; Johnson, Bobby; Kinne, Joerg; Jose, Shanti; Jose, Sherry; Tappendorf, Britta; Hornstra, Heidie

    2011-01-01

    We confirm a natural infection of dromedaries with glanders. Multilocus variable number tandem repeat analysis of a Burkholderia mallei strain isolated from a diseased dromedary in Bahrain revealed close genetic proximity to strain Dubai 7, which caused an outbreak of glanders in horses in the United Arab Emirates in 2004. PMID:21762586

  6. Natural Burkholderia mallei infection in Dromedary, Bahrain.

    PubMed

    Wernery, Ulrich; Wernery, Renate; Joseph, Marina; Al-Salloom, Fajer; Johnson, Bobby; Kinne, Joerg; Jose, Shanti; Jose, Sherry; Tappendorf, Britta; Hornstra, Heidie; Scholz, Holger C

    2011-07-01

    We confirm a natural infection of dromedaries with glanders. Multilocus variable number tandem repeat analysis of a Burkholderia mallei strain isolated from a diseased dromedary in Bahrain revealed close genetic proximity to strain Dubai 7, which caused an outbreak of glanders in horses in the United Arab Emirates in 2004.

  7. Pichia stipitis genomics, transcriptomics, and gene clusters

    Treesearch

    Thomas W. Jeffries; Jennifer R. Headman Van Vleet

    2009-01-01

    Genome sequencing and subsequent global gene expression studies have advanced our understanding of the lignocellulose-fermenting yeast Pichia stipitis. These studies have provided an insight into its central carbon metabolism, and analysis of its genome has revealed numerous functional gene clusters and tandem repeats. Specialized physiological traits are often the...

  8. Essential RNA-Based Technologies and Their Applications in Plant Functional Genomics.

    PubMed

    Teotia, Sachin; Singh, Deepali; Tang, Xiaoqing; Tang, Guiliang

    2016-02-01

    Genome sequencing has not only extended our understanding of the blueprints of many plant species but has also revealed the secrets of coding and non-coding genes. We present here a brief introduction to and personal account of key RNA-based technologies, as well as their development and applications for functional genomics of plant coding and non-coding genes, with a focus on short tandem target mimics (STTMs), artificial microRNAs (amiRNAs), and CRISPR/Cas9. In addition, their use in multiplex technologies for the functional dissection of gene networks is discussed. Copyright © 2015 Elsevier Ltd. All rights reserved.

  9. Association of STin2 Variable Number of Tandem Repeat (VNTR) Polymorphism of Serotonin Transporter Gene with Lifelong Premature Ejaculation: A Case-Control Study in Han Chinese Subjects

    PubMed Central

    Huang, Yuanyuan; Zhang, Xiansheng; Gao, Jingjing; Tang, Dongdong; Gao, Pan; Peng, Dangwei; Liang, Chaozhao

    2016-01-01

    Background The STin2 VNTR polymorphism has a variable number of tandem repeats in intron 2 of the serotonin transporter gene. We aimed to explore the relationship between STin2 VNTR polymorphism and lifelong premature ejaculation (LPE). Material/Methods We recruited a total of 115 outpatients who complained of ejaculating prematurely and who were diagnosed as LPE, and 101 controls without PE complaint. Allelic variations of STin2 VNTR were genotyped using PCR-based technology. We evaluated the associations between STin2 VNTR allelic and genotypic frequencies and LPE, as well as the intravaginal ejaculation latency time (IELT) of different STin2 VNTR genotypes among LPE patients. Results The patients and controls did not differ significantly in terms of any characteristic except age. A significantly higher frequency of STin2.12/12 genotype was found among LPE patients versus controls (P=0.026). Frequency of patients carrying at least 1 copy of the 10-repeat allele was significantly lower compared to the control group (28.3% vs. 41.8%, OR=0.55; 95%CI=0.31–0.97, P=0.040). In the LPE group, the mean IELT showed significant difference in STin2.12/12 genotype when compared to those with STin2.12/10 and STin2.10/10 genotypes. The mean IELT in10-repeat allele carriers was 50% longer compared to homozygous carriers of the STin2.12 allele. Conclusions Our results indicate the presence of STin2.10 allele is a protective factor for LPE. Men carrying the higher expression genotype STin2. 12/12 have shorter IELT than 10-repeat allele carriers. PMID:27713390

  10. Evaluation of two new STR loci 9q2h2 and wg3f12 in a Japanese population.

    PubMed

    Mizutani, M; Huang, X L; Tamaki, K; Yoshimoto, T; Uchihi, R; Yamamoto, T; Katsumata, Y; Armour, J A

    1999-09-01

    Two short tandem repeat (STR) loci (9q2h2 and wg3f12) have been evaluated in a Japanese population. Ten and seven different alleles were observed in 9q2h2 and wg3f12 respectively. 9q2h2 displayed simple polymorphism in tetrameric repeat structure; by contrast, wg3f12 contained variable numbers of tetrameric repeats and a 30-bp deletion/insertion polymorphism. No "interalleles" were found. The expected heterozygosities of 9q2h2 and wg3fl2 were 0.749 and 0.574, respectively. No deviation from Hardy-Weinberg equilibrium was found.

  11. The Crystal Structure of TAL Effector PthXo1 Bound to Its DNA Target

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mak, Amanda Nga-Sze; Bradley, Philip; Cernadas, Raul A.

    2012-02-10

    DNA recognition by TAL effectors is mediated by tandem repeats, each 33 to 35 residues in length, that specify nucleotides via unique repeat-variable diresidues (RVDs). The crystal structure of PthXo1 bound to its DNA target was determined by high-throughput computational structure prediction and validated by heavy-atom derivatization. Each repeat forms a left-handed, two-helix bundle that presents an RVD-containing loop to the DNA. The repeats self-associate to form a right-handed superhelix wrapped around the DNA major groove. The first RVD residue forms a stabilizing contact with the protein backbone, while the second makes a base-specific contact to the DNA sense strand.more » Two degenerate amino-terminal repeats also interact with the DNA. Containing several RVDs and noncanonical associations, the structure illustrates the basis of TAL effector-DNA recognition.« less

  12. Sequences in the intergenic spacer influence RNA Pol I transcription from the human rRNA promoter

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, W.M.; Sylvester, J.E.

    1994-09-01

    In most eucaryotic species, ribosomal genes are tandemly repeated about 100-5000 times per haploid genome. The 43 Kb human rDNA repeat consists of a 13 Kb coding region for the 18S, 5.8S, 28S ribosomal RNAs (rRNAs) and transcribed spacers separated by a 30 Kb intergenic spacer. For species such as frog, mouse and rat, sequences in the intergenic spacer other than the gene promoter have been shown to modulate transcription of the ribosomal gene. These sequences are spacer promoters, enhancers and the terminator for spacer transcription. We are addressing whether the human ribosomal gene promoter is similarly influenced. In-vitro transcriptionmore » run-off assays have revealed that the 4.5 kb region (CBE), directly upstream of the gene promoter, has cis-stimulation and trans-competition properties. This suggests that the CBE fragment contains an enhancer(s) for ribosomal gene transcription. Further experiments have shown that a fragment ({approximately}1.6 kb) within the CBE fragment also has trans-competition function. Deletion subclones of this region are being tested to delineate the exact sequences responsible for these modulating activities. Previous sequence analysis and functional studies have revealed that CBE contains regions of DNA capable of adopting alternative structures such as bent DNA, Z-DNA, and triple-stranded DNA. Whether these structures are required for modulating transcription remains to be determined as does the specific DNA-protein interaction involved.« less

  13. Associations between dopamine D4 receptor gene variation with both infidelity and sexual promiscuity.

    PubMed

    Garcia, Justin R; MacKillop, James; Aller, Edward L; Merriwether, Ann M; Wilson, David Sloan; Lum, J Koji

    2010-11-30

    Human sexual behavior is highly variable both within and between populations. While sex-related characteristics and sexual behavior are central to evolutionary theory (sexual selection), little is known about the genetic bases of individual variation in sexual behavior. The variable number tandem repeats (VNTR) polymorphism in exon III of the human dopamine D4 receptor gene (DRD4) has been correlated with an array of behavioral phenotypes and may be predicatively responsible for variation in motivating some sexual behaviors, particularly promiscuity and infidelity. We administered an anonymous survey on personal history of sexual behavior and intimate relationships to 181 young adults. We also collected buccal wash samples and genotyped the DRD4 VNTR. Here we show that individuals with at least one 7-repeat allele (7R+) report a greater categorical rate of promiscuous sexual behavior (i.e., having ever had a "one-night stand") and report a more than 50% increase in instances of sexual infidelity. DRD4 VNTR genotype varies considerably within and among populations and has been subject to relatively recent, local selective pressures. Individual differences in sexual behavior are likely partially mediated by individual genetic variation in genes coding for motivation and reward in the brain. Conceptualizing these findings in terms of r/K selection theory suggests a mechanism for selective pressure for and against the 7R+ genotype that may explain the considerable global allelic variation for this polymorphism.

  14. Associations between Dopamine D4 Receptor Gene Variation with Both Infidelity and Sexual Promiscuity

    PubMed Central

    Garcia, Justin R.; MacKillop, James; Aller, Edward L.; Merriwether, Ann M.; Wilson, David Sloan; Lum, J. Koji

    2010-01-01

    Background Human sexual behavior is highly variable both within and between populations. While sex-related characteristics and sexual behavior are central to evolutionary theory (sexual selection), little is known about the genetic bases of individual variation in sexual behavior. The variable number tandem repeats (VNTR) polymorphism in exon III of the human dopamine D4 receptor gene (DRD4) has been correlated with an array of behavioral phenotypes and may be predicatively responsible for variation in motivating some sexual behaviors, particularly promiscuity and infidelity. Methodology/Principal Findings We administered an anonymous survey on personal history of sexual behavior and intimate relationships to 181 young adults. We also collected buccal wash samples and genotyped the DRD4 VNTR. Here we show that individuals with at least one 7-repeat allele (7R+) report a greater categorical rate of promiscuous sexual behavior (i.e., having ever had a “one-night stand”) and report a more than 50% increase in instances of sexual infidelity. Conclusions/Significance DRD4 VNTR genotype varies considerably within and among populations and has been subject to relatively recent, local selective pressures. Individual differences in sexual behavior are likely partially mediated by individual genetic variation in genes coding for motivation and reward in the brain. Conceptualizing these findings in terms of r/K selection theory suggests a mechanism for selective pressure for and against the 7R+ genotype that may explain the considerable global allelic variation for this polymorphism. PMID:21152404

  15. Organisation of the plant genome in chromosomes.

    PubMed

    Heslop-Harrison, J S Pat; Schwarzacher, Trude

    2011-04-01

    The plant genome is organized into chromosomes that provide the structure for the genetic linkage groups and allow faithful replication, transcription and transmission of the hereditary information. Genome sizes in plants are remarkably diverse, with a 2350-fold range from 63 to 149,000 Mb, divided into n=2 to n= approximately 600 chromosomes. Despite this huge range, structural features of chromosomes like centromeres, telomeres and chromatin packaging are well-conserved. The smallest genomes consist of mostly coding and regulatory DNA sequences present in low copy, along with highly repeated rDNA (rRNA genes and intergenic spacers), centromeric and telomeric repetitive DNA and some transposable elements. The larger genomes have similar numbers of genes, with abundant tandemly repeated sequence motifs, and transposable elements alone represent more than half the DNA present. Chromosomes evolve by fission, fusion, duplication and insertion events, allowing evolution of chromosome size and chromosome number. A combination of sequence analysis, genetic mapping and molecular cytogenetic methods with comparative analysis, all only becoming widely available in the 21st century, is elucidating the exact nature of the chromosome evolution events at all timescales, from the base of the plant kingdom, to intraspecific or hybridization events associated with recent plant breeding. As well as being of fundamental interest, understanding and exploiting evolutionary mechanisms in plant genomes is likely to be a key to crop development for food production. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.

  16. Mrp--a new auxiliary gene essential for optimal expression of methicillin resistance in Staphylococcus aureus.

    PubMed

    Wu, S W; De Lencastre, H

    1999-01-01

    Screening of a library of Tn551 insertional mutants selected for reduction in the methicillin resistance level of the parental Staphylococcus aureus strain COL resulted in the isolation of mutant RUSA266 in which the minimal inhibitory concentration (MIC) of the parent was reduced from 1,600 to 1.5 micrograms/mL. Cloning and sequencing of the vicinity of the insertion site omega 726 identified an open reading frame (orf1365) encoding a very large polypeptide of more than 1,365 amino acids. A unique feature of the deduced amino acid sequence was the presence of multiple tandem repeats of 75 amino acids in the polypeptide, reminiscent of the structure of high-molecular-weight cell-surface proteins EF* and Emb identified in some streptococcal strains. Mutant RUSA266 with the inactivated gene, which we shall provisionally refer to as mrp (for multiple repeat polypeptide), produced a peptidoglycan with altered muropeptide composition, and both the reduced antibiotic resistance and the altered cell wall composition were co-transduced in back-crosses into the parental strain COL. Additional sequencing upstream of mrp has revealed that this gene was part of a five-gene cluster occupying a 9.2-kb region of the staphylococcal chromosome and was composed of glmM (directly upstream of mrp), two open reading frames orf310 and orf269 coding for two hypothetical proteins, and the gene encoding the staphylococcal arginase (arg). Transcriptional analysis demonstrated that the five genes in the cluster were transcribed together.

  17. The complete mitochondrial genome of the pink stem borer, Sesamia inferens, in comparison with four other Noctuid moths.

    PubMed

    Chai, Huan-Na; Du, Yu-Zhou

    2012-01-01

    The complete 15,413-bp mitochondrial genome (mitogenome) of Sesamia inferens (Walker) (Lepidoptera: Noctuidae) was sequenced and compared with those of four other noctuid moths. All of the mitogenomes analyzed displayed similar characteristics with respect to gene content, genome organization, nucleotide comparison, and codon usages. Twelve-one protein-coding genes (PCGs) utilized the standard ATN, but the cox1 gene used CGA as the initiation codon; cox1, cox2, and nad4 genes had the truncated termination codon T in the S. inferens mitogenome. All of the tRNA genes had typical cloverleaf secondary structures except for trnS1(AGN), in which the dihydrouridine (DHU) arm did not form a stable stem-loop structure. Both the secondary structures of rrnL and rrnS genes inferred from the S. inferens mitogenome closely resembled those of other noctuid moths. In the A+T-rich region, the conserved motif "ATAGA" followed by a long T-stretch was observed in all noctuid moths, but other specific tandem-repeat elements were more variable. Additionally, the S. inferens mitogenome contained a potential stem-loop structure, a duplicated 17-bp repeat element, a decuplicated segment, and a microsatellite "(AT)(7)", without a poly-A element upstream of the trnM in the A+T-rich region. Finally, the phylogenetic relationships were reconstructed based on amino acid sequences of mitochondrial 13 PCGs, which support the traditional morphologically based view of relationships within the Noctuidae.

  18. The Complete Mitochondrial Genome of the Pink Stem Borer, Sesamia inferens, in Comparison with Four Other Noctuid Moths

    PubMed Central

    Chai, Huan-Na; Du, Yu-Zhou

    2012-01-01

    The complete 15,413-bp mitochondrial genome (mitogenome) of Sesamia inferens (Walker) (Lepidoptera: Noctuidae) was sequenced and compared with those of four other noctuid moths. All of the mitogenomes analyzed displayed similar characteristics with respect to gene content, genome organization, nucleotide comparison, and codon usages. Twelve-one protein-coding genes (PCGs) utilized the standard ATN, but the cox1 gene used CGA as the initiation codon; cox1, cox2, and nad4 genes had the truncated termination codon T in the S. inferens mitogenome. All of the tRNA genes had typical cloverleaf secondary structures except for trnS1(AGN), in which the dihydrouridine (DHU) arm did not form a stable stem-loop structure. Both the secondary structures of rrnL and rrnS genes inferred from the S. inferens mitogenome closely resembled those of other noctuid moths. In the A+T-rich region, the conserved motif “ATAGA” followed by a long T-stretch was observed in all noctuid moths, but other specific tandem-repeat elements were more variable. Additionally, the S. inferens mitogenome contained a potential stem-loop structure, a duplicated 17-bp repeat element, a decuplicated segment, and a microsatellite “(AT)7”, without a poly-A element upstream of the trnM in the A+T-rich region. Finally, the phylogenetic relationships were reconstructed based on amino acid sequences of mitochondrial 13 PCGs, which support the traditional morphologically based view of relationships within the Noctuidae. PMID:22949858

  19. Upstream mononucleotide A-repeats play a cis-regulatory role in mammals through the DICER1 and Ago proteins.

    PubMed

    Aporntewan, Chatchawit; Pin-on, Piyapat; Chaiyaratana, Nachol; Pongpanich, Monnat; Boonyaratanakornkit, Viroj; Mutirangura, Apiwat

    2013-10-01

    A-repeats are the simplest form of tandem repeats and are found ubiquitously throughout genomes. These mononucleotide repeats have been widely believed to be non-functional 'junk' DNA. However, studies in yeasts suggest that A-repeats play crucial biological functions, and their role in humans remains largely unknown. Here, we showed a non-random pattern of distribution of sense A- and T-repeats within 20 kb around transcription start sites (TSSs) in the human genome. Different distributions of these repeats are observed upstream and downstream of TSSs. Sense A-repeats are enriched upstream, whereas sense T-repeats are enriched downstream of TSSs. This enrichment directly correlates with repeat size. Genes with different functions contain different lengths of repeats. In humans, tissue-specific genes are enriched for short repeats of <10 bp, whereas housekeeping genes are enriched for long repeats of ≥10 bp. We demonstrated that DICER1 and Argonaute proteins are required for the cis-regulatory role of A-repeats. Moreover, in the presence of a synthetic polymer that mimics an A-repeat, protein binding to A-repeats was blocked, resulting in a dramatic change in the expression of genes containing upstream A-repeats. Our findings suggest a length-dependent cis-regulatory function of A-repeats and that Argonaute proteins serve as trans-acting factors, binding to A-repeats.

  20. Unitary circular code motifs in genomes of eukaryotes.

    PubMed

    El Soufi, Karim; Michel, Christian J

    A set X of 20 trinucleotides was identified in genes of bacteria, eukaryotes, plasmids and viruses, which has in average the highest occurrence in reading frame compared to its two shifted frames (Michel, 2015; Arquès and Michel, 1996). This set X has an interesting mathematical property as X is a circular code (Arquès and Michel, 1996). Thus, the motifs from this circular code X, called X motifs, have the property to always retrieve, synchronize and maintain the reading frame in genes. The origin of this circular code X in genes is an open problem since its discovery in 1996. Here, we first show that the unitary circular codes (UCC), i.e. sets of one word, allow to generate unitary circular code motifs (UCC motifs), i.e. a concatenation of the same motif (simple repeats) leading to low complexity DNA. Three classes of UCC motifs are studied here: repeated dinucleotides (D + motifs), repeated trinucleotides (T + motifs) and repeated tetranucleotides (T + motifs). Thus, the D + , T + and T + motifs allow to retrieve, synchronize and maintain a frame modulo 2, modulo 3 and modulo 4, respectively, and their shifted frames (1 modulo 2; 1 and 2 modulo 3; 1, 2 and 3 modulo 4 according to the C 2 , C 3 and C 4 properties, respectively) in the DNA sequences. The statistical distribution of the D + , T + and T + motifs is analyzed in the genomes of eukaryotes. A UCC motif and its comp lementary UCC motif have the same distribution in the eukaryotic genomes. Furthermore, a UCC motif and its complementary UCC motif have increasing occurrences contrary to their number of hydrogen bonds, very significant with the T + motifs. The longest D + , T + and T + motifs in the studied eukaryotic genomes are also given. Surprisingly, a scarcity of repeated trinucleotides (T + motifs) in the large eukaryotic genomes is observed compared to the D + and T + motifs. This result has been investigated and may be explained by two outcomes. Repeated trinucleotides (T + motifs) are identified in the X motifs of low composition (cardinality less than 10) in the genomes of eukaryotes. Furthermore, identical trinucleotide pairs of the circular code X are preferentially used in the gene sequences of eukaryotes. These two results suggest that the unitary circular codes of trinucleotides may have been involved in the formation of the trinucleotide circular code X. Indeed, repeated trinucleotides in the X motifs in the genomes of eukaryotes may represent an intermediary evolution from repeated trinucleotides of cardinality 1 (T + motifs) in the genomes of eukaryotes up to the X motifs of cardinality 20 in the gene sequences of eukaryotes. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. Tuberculosis in Alpacas (Lama pacos) Caused by Mycobacterium bovis▿

    PubMed Central

    García-Bocanegra, I.; Barranco, I.; Rodríguez-Gómez, I. M.; Pérez, B.; Gómez-Laguna, J.; Rodríguez, S.; Ruiz-Villamayor, E.; Perea, A.

    2010-01-01

    We report three cases of tuberculosis in alpacas from Spain caused by Mycobacterium bovis. The animals revealed two different lesional patterns. Mycobacterial culture and PCR assay yielded positive results for M. bovis. Molecular typing of the isolates identified spoligotype SB0295 and identical variable-number tandem repeat (VNTR) allele sizes. PMID:20237097

  2. Isolation and characterization of microsatellite markers in Fraser fir (Abies fraseri)

    Treesearch

    S.A. Josserand; K.M. Potter; G. Johnson; J.A. Bowen; J. Frampton; C.D. Nelson

    2006-01-01

    We describe the isolation and characterization of 14 microsatellite loci from Fraser fir (Abies fraseri). These markers originated from cloned inserts enriched for DNA sequences containing tandem di- and tri-nucleotide repeats. In total, 36 clones were selected, sequenced and evaluated. Polymerase chain reaction (PCR) primers for 14 of these...

  3. 76 FR 16609 - Proposed Information Collection; Comment Request; Identification of Human Cell Lines Project

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-03-24

    ... Collection; Comment Request; Identification of Human Cell Lines Project AGENCY: National Institute of... by short tandem repeat (STR) profiling up to 1500 human cell line samples as part of the Identification of Human Cell Lines Project. All data and corresponding information will be posted in a publically...

  4. Telomete length in peripheral blood mononuclear cells is associated with folate status in men

    USDA-ARS?s Scientific Manuscript database

    Human chromosomes are capped by tandem repeats of DNA and associated proteins termed telomeres. The length of the telomeres is reduced with increasing cell divisions except when the enzyme telomerase is active as seen in stem cells and germ cells. Telomere dysfunction has been associated with deve...

  5. Variable number of tandem repeat profiles and antimicrobial resistance patterns of Staphylococcus haemolyticus strains isolated from blood cultures in children.

    PubMed

    Hosseinkhani, Faride; Jabalameli, Fereshteh; Nodeh Farahani, Narges; Taherikalani, Morovat; van Leeuwen, Willem B; Emaneini, Mohammad

    2016-03-01

    Staphylococcus haemolyticus is a healthcare-associated pathogen and can cause a variety of lifethreatening infections. Additionally, multi-drug resistance (MDR), in particular methicillin-resistant S. haemolyticus (MRSH) isolates, have emerged. Dissemination of such strains can be of great concern in the hospital environment. A total number of 20S. haemolyticus isolates from blood cultures obtained from children were included in this study. A high prevalence of MDR-MRSH isolates with high MIC values to vancomycin was found and 35% of the isolates were intermediate resistant to vancomycin. Multilocus variable number of tandem repeats analysis (MLVF) revealed 5 MLVF types among 20 isolates of S. haemolyticus. Twelve isolates shared the same MLVF type and were isolated from different wards in a pediatric hospital in Iran. This is a serious alarm for infection control; i.e. in the absence of adequate infection diagnostics and infection control guidelines, these resistant strains can spread to other sectors of a hospital and possibly among the community. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Evaluation of a highly discriminating multiplex multi-locus variable-number of tandem-repeats (MLVA) analysis for Vibrio cholerae.

    PubMed

    Olsen, Jaran S; Aarskaug, Tone; Skogan, Gunnar; Fykse, Else Marie; Ellingsen, Anette Bauer; Blatny, Janet M

    2009-09-01

    Vibrio cholerae is the etiological agent of cholera and may be used in bioterror actions due to the easiness of its dissemination, and the public fear for acquiring the cholera disease. A simple and highly discriminating method for connecting clinical and environmental isolates of V. cholerae is needed in microbial forensics. Twelve different loci containing variable numbers of tandem-repeats (VNTRs) were evaluated in which six loci were polymorphic. Two multiplex reactions containing PCR primers targeting these six VNTRs resulted in successful DNA amplification of 142 various environmental and clinical V. cholerae isolates. The genetic distribution inside the V. cholerae strain collection was used to evaluate the discriminating power (Simpsons Diversity Index=0.99) of this new MLVA analysis, showing that the assay have a potential to differentiate between various strains, but also to identify those isolates which are collected from a common V. cholerae outbreak. This work has established a rapid and highly discriminating MLVA assay useful for track back analyses and/or forensic studies of V. cholerae infections.

  7. The RNase P RNA from cyanobacteria: short tandemly repeated repetitive (STRR) sequences are present within the RNase P RNA gene in heterocyst-forming cyanobacteria.

    PubMed Central

    Vioque, A

    1997-01-01

    The RNase P RNA gene (rnpB) from 10 cyanobacteria has been characterized. These new RNAs, together with the previously available ones, provide a comprehensive data set of RNase P RNA from diverse cyanobacterial lineages. All heterocystous cyanobacteria, but none of the non-heterocystous strains analyzed, contain short tandemly repeated repetitive (STRR) sequences that increase the length of helix P12. Site-directed mutagenesis experiments indicate that the STRR sequences are not required for catalytic activity in vitro. STRR sequences seem to have recently and independently invaded the RNase P RNA genes in heterocyst-forming cyanobacteria because closely related strains contain unrelated STRR sequences. Most cyanobacteria RNase P RNAs lack the sequence GGU in the loop connecting helices P15 and P16 that has been established to interact with the 3'-end CCA in precursor tRNA substrates in other bacteria. This character is shared with plastid RNase P RNA. Helix P6 is longer than usual in most cyanobacteria as well as in plastid RNase P RNA. PMID:9254706

  8. Multilocus Variable-Number Tandem Repeat Typing of Mycobacterium ulcerans

    PubMed Central

    Ablordey, Anthony; Swings, Jean; Hubans, Christine; Chemlal, Karim; Locht, Camille; Portaels, Françoise; Supply, Philip

    2005-01-01

    The apparent genetic homogeneity of Mycobacterium ulcerans contributes to the poorly understood epidemiology of M. ulcerans infection. Here, we report the identification of variable number tandem repeat (VNTR) sequences as novel polymorphic elements in the genome of this species. A total of 19 potential VNTR loci identified in the closely related M. marinum genome sequence were screened in a collection of 23 M. ulcerans isolates, one Mycobacterium species referred to here as an intermediate species, and five M. marinum strains. Nine of the 19 loci were polymorphic in the three species (including the intermediate species) and revealed eight M. ulcerans and five M. marinum genotypes. The results from the VNTR analysis corroborated the genetic relationships of M. ulcerans isolates from various geographical origins, as defined by independent molecular markers. Although these results further highlight the extremely high clonal homogeneity within certain geographic regions, we report for the first time the discrimination of the two South American strains from Surinam and French Guyana. These findings support the potential of a VNTR-based genotyping method for strain discrimination within M. ulcerans and M. marinum. PMID:15814964

  9. The association of 22 Y chromosome short tandem repeat loci with initiative-aggressive behavior.

    PubMed

    Yang, Chun; Ba, Huajie; Zhang, Wei; Zhang, Shuyou; Zhao, Hanqing; Yu, Haiying; Gao, Zhiqin; Wang, Binbin

    2018-05-15

    Aggressive behavior represents an important public concern and a clinical challenge to behaviorists and psychiatrists. Aggression in humans is known to have an important genetic basis, so to investigate the association of Y chromosome short tandem repeat (Y-STR) loci with initiative-aggressive behavior, we compared allelic and haplotypic distributions of 22 Y-STRs in a group of Chinese males convicted of premeditated extremely violent crimes (n = 271) with a normal control group (n = 492). Allelic distributions of DYS533 and DYS437 loci differed significantly between the two groups (P < 0.05). The case group had higher frequencies of DYS533 allele 14, DYS437 allele 14, and haplotypes 11-14 of DYS533-DYS437 compared with the control group. Additionally, the DYS437 allele 15 frequency was significantly lower in cases than controls. No frequency differences were observed in the other 20 Y-STR loci between these two groups. Our results indicate a genetic role for Y-STR loci in the development of initiative aggression in non-psychiatric subjects. Copyright © 2018 Elsevier B.V. All rights reserved.

  10. Lymphatic filarial species differentiation using evolutionarily modified tandem repeats: generation of new genetic markers.

    PubMed

    Sakthidevi, Moorthy; Murugan, Vadivel; Hoti, Sugeerappa Laxmanappa; Kaliraj, Perumal

    2010-05-01

    Polymerase chain reaction based methods are promising tools for the monitoring and evaluation of the Global Program for the Elimination of Lymphatic Filariasis. The currently available PCR methods do not differentiate the DNA of Wuchereria bancrofti or Brugia malayi by a single PCR and hence are cumbersome. Therefore, we designed a single step PCR strategy for differentiating Bancroftian infection from Brugian infection based on a newly identified gene from the W. bancrofti genome, abundant larval transcript-2 (alt-2), which is abundantly expressed. The difference in PCR product sizes generated from the presence or absence of evolutionarily altered tandem repeats in alt-2 intron-3 differentiated W. bancrofti from B. malayi. The analysis was performed on the genomic DNA of microfilariae from a number of patient blood samples or microfilariae positive slides from different Indian geographical regions. The assay gave consistent results, differentiating the two filarial parasite species accurately. This alt-2 intron-3 based PCR assay can be a potential tool for the diagnosis and differentiation of co-infections by lymphatic filarial parasites. Copyright (c) 2010 Elsevier B.V. All rights reserved.

  11. Molecular typing of Argentinian Mycobacterium avium subsp. paratuberculosis isolates by multiple-locus variable number-tandem repeat analysis

    PubMed Central

    Gioffré, Andrea; Correa Muñoz, Magnolia; Alvarado Pinedo, María F.; Vaca, Roberto; Morsella, Claudia; Fiorentino, María Andrea; Paolicchi, Fernando; Ruybal, Paula; Zumárraga, Martín; Travería, Gabriel E.; Romano, María Isabel

    2015-01-01

    Multiple-locus variable number-tandem repeat analysis (MLVA) of Mycobacterium avium subspecies paratuberculosis (MAP) isolates may contribute to the knowledge of strain diversity in Argentina. Although the diversity of MAP has been previously investigated in Argentina using IS900-RFLP, a small number of isolates were employed, and a low discriminative power was reached. The aim of the present study was to test the genetic diversity among MAP isolates using an MLVA approach based on 8 repetitive loci. We studied 97 isolates from cattle, goat and sheep and could describe 7 different patterns: INMV1, INMV2, INMV11, INMV13, INMV16, INMV33 and one incomplete pattern. INMV1 and INMV2 were the most frequent patterns, grouping 76.3% of the isolates. We were also able to demonstrate the coexistence of genotypes in herds and co-infection at the organism level. This study shows that all the patterns described are common to those described in Europe, suggesting an epidemiological link between the continents. PMID:26273274

  12. Short tandem repeat DNA typing provides an international reference standard for authentication of human cell lines.

    PubMed

    Dirks, Wilhelm Gerhard; Faehnrich, Silke; Estella, Isabelle Annick Janine; Drexler, Hans Guenter

    2005-01-01

    Cell lines have wide applications as model systems in the medical and pharmaceutical industry. Much drug and chemical testing is now first carried out exhaustively on in vitro systems, reducing the need for complicated and invasive animal experiments. The basis for any research, development or production program involving cell lines is the choice of an authentic cell line. Microsatellites in the human genome that harbour short tandem repeat (STR) DNA markers allow individualisation of established cell lines at the DNA level. Fluorescence polymerase chain reaction amplification of eight highly polymorphic microsatellite STR loci plus gender determination was found to be the best tool to screen the uniqueness of DNA profiles in a fingerprint database. Our results demonstrate that cross-contamination and misidentification remain chronic problems in the use of human continuous cell lines. The combination of rapidly generated DNA types based on single-locus STR and their authentication or individualisation by screening the fingerprint database constitutes a highly reliable and robust method for the identification and verification of cell lines.

  13. Epitope mapping of PR81 anti-MUC1 monoclonal antibody following PEPSCAN and phage display techniques.

    PubMed

    Mohammadi, Mohammad; Rasaee, Mohammad Javad; Rajabibazl, Masoumeh; Paknejad, Malihe; Zare, Mehrak; Mohammadzadeh, Sara

    2007-08-01

    PR81 is an anti-MUC1 monoclonal antibody (MAb) which was generated against human MUC1 mucin that reacted with breast cancerous tissue, MUC1 positive cell line (MCF-7, BT-20, and T-4 7 D), and synthetic peptide, including the tandem repeat sequence of MUC1. Here we characterized the binding properties of PR81 against the tandem repeat of MUC1 by two different epitope mapping techniques, namely, PEPSCAN and phage display. Epitope mapping of PR81 MAb by PEPSCAN revealed a minimal consensus binding sequence, PDTRP, which is found on MUC1 peptide as the most important epitope. Using the phage display peptide library, we identified the motif PD(T/S/G)RP as an epitope and the motif AVGLSPDGSRGV as a mimotope recognized by PR81. Results of these two methods showed that the two residues, arginine and aspartic acid, have important roles in antibody binding and threonine can be substituted by either glycine or serine. These results may be of importance in tailor making antigens used in immunoassay.

  14. Visualization of tandem repeat mutagenesis in Bacillus subtilis.

    PubMed

    Dormeyer, Miriam; Lentes, Sabine; Ballin, Patrick; Wilkens, Markus; Klumpp, Stefan; Kohlheyer, Dietrich; Stannek, Lorena; Grünberger, Alexander; Commichau, Fabian M

    2018-03-01

    Mutations are crucial for the emergence and evolution of proteins with novel functions, and thus for the diversity of life. Tandem repeats (TRs) are mutational hot spots that are present in the genomes of all organisms. Understanding the molecular mechanism underlying TR mutagenesis at the level of single cells requires the development of mutation reporter systems. Here, we present a mutation reporter system that is suitable to visualize mutagenesis of TRs occurring in single cells of the Gram-positive model bacterium Bacillus subtilis using microfluidic single-cell cultivation. The system allows measuring the elimination of TR units due to growth rate recovery. The cultivation of bacteria carrying the mutation reporter system in microfluidic chambers allowed us for the first time to visualize the emergence of a specific mutation at the level of single cells. The application of the mutation reporter system in combination with microfluidics might be helpful to elucidate the molecular mechanism underlying TR (in)stability in bacteria. Moreover, the mutation reporter system might be useful to assess whether mutations occur in response to nutrient starvation. Copyright © 2018 Elsevier B.V. All rights reserved.

  15. Phylogenomics of nonavian reptiles and the structure of the ancestral amniote genome

    PubMed Central

    Shedlock, Andrew M.; Botka, Christopher W.; Zhao, Shaying; Shetty, Jyoti; Zhang, Tingting; Liu, Jun S.; Deschavanne, Patrick J.; Edwards, Scott V.

    2007-01-01

    We report results of a megabase-scale phylogenomic analysis of the Reptilia, the sister group of mammals. Large-scale end-sequence scanning of genomic clones of a turtle, alligator, and lizard reveals diverse, mammal-like landscapes of retroelements and simple sequence repeats (SSRs) not found in the chicken. Several global genomic traits, including distinctive phylogenetic lineages of CR1-like long interspersed elements (LINEs) and a paucity of A-T rich SSRs, characterize turtles and archosaur genomes, whereas higher frequencies of tandem repeats and a lower global GC content reveal mammal-like features in Anolis. Nonavian reptile genomes also possess a high frequency of diverse and novel 50-bp unit tandem duplications not found in chicken or mammals. The frequency distributions of ≈65,000 8-mer oligonucleotides suggest that rates of DNA-word frequency change are an order of magnitude slower in reptiles than in mammals. These results suggest a diverse array of interspersed and SSRs in the common ancestor of amniotes and a genomic conservatism and gradual loss of retroelements in reptiles that culminated in the minimalist chicken genome. PMID:17307883

  16. A unique chromatin complex occupies young α-satellite arrays of human centromeres

    PubMed Central

    Henikoff, Jorja G.; Thakur, Jitendra; Kasinathan, Sivakanthan; Henikoff, Steven

    2015-01-01

    The intractability of homogeneous α-satellite arrays has impeded understanding of human centromeres. Artificial centromeres are produced from higher-order repeats (HORs) present at centromere edges, although the exact sequences and chromatin conformations of centromere cores remain unknown. We use high-resolution chromatin immunoprecipitation (ChIP) of centromere components followed by clustering of sequence data as an unbiased approach to identify functional centromere sequences. We find that specific dimeric α-satellite units shared by multiple individuals dominate functional human centromeres. We identify two recently homogenized α-satellite dimers that are occupied by precisely positioned CENP-A (cenH3) nucleosomes with two ~100–base pair (bp) DNA wraps in tandem separated by a CENP-B/CENP-C–containing linker, whereas pericentromeric HORs show diffuse positioning. Precise positioning is largely maintained, whereas abundance decreases exponentially with divergence, which suggests that young α-satellite dimers with paired ~100-bp particles mediate evolution of functional human centromeres. Our unbiased strategy for identifying functional centromeric sequences should be generally applicable to tandem repeat arrays that dominate the centromeres of most eukaryotes. PMID:25927077

  17. PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile.

    PubMed

    Pereira, Luísa; Alshamali, Farida; Andreassen, Rune; Ballard, Ruth; Chantratita, Wasun; Cho, Nam Soo; Coudray, Clotilde; Dugoujon, Jean-Michel; Espinoza, Marta; González-Andrade, Fabricio; Hadi, Sibte; Immel, Uta-Dorothee; Marian, Catalin; Gonzalez-Martin, Antonio; Mertens, Gerhard; Parson, Walther; Perone, Carlos; Prieto, Lourdes; Takeshita, Haruo; Rangel Villalobos, Héctor; Zeng, Zhaoshu; Zhivotovsky, Lev; Camacho, Rui; Fonseca, Nuno A

    2011-09-01

    Because of their sensitivity and high level of discrimination, short tandem repeat (STR) maker systems are currently the method of choice in routine forensic casework and data banking, usually in multiplexes up to 15-17 loci. Constraints related to sample amount and quality, frequently encountered in forensic casework, will not allow to change this picture in the near future, notwithstanding the technological developments. In this study, we present a free online calculator named PopAffiliator ( http://cracs.fc.up.pt/popaffiliator ) for individual population affiliation in the three main population groups, Eurasian, East Asian and sub-Saharan African, based on genotype profiles for the common set of STRs used in forensics. This calculator performs affiliation based on a model constructed using machine learning techniques. The model was constructed using a data set of approximately fifteen thousand individuals collected for this work. The accuracy of individual population affiliation is approximately 86%, showing that the common set of STRs routinely used in forensics provide a considerable amount of information for population assignment, in addition to being excellent for individual identification.

  18. Variable-number-of-tandem-repeats analysis of genetic diversity in Pasteuria ramosa.

    PubMed

    Mouton, L; Ebert, D

    2008-05-01

    Variable-number-of-tandem-repeats (VNTR) markers are increasingly being used in population genetic studies of bacteria. They were recently developed for Pasteuria ramosa, an endobacterium that infects Daphnia species. In the present study, we genotyped P. ramosa in 18 infected hosts from the United Kingdom, Belgium, and two lakes in the United States using seven VNTR markers. Two Daphnia species were collected: D. magna and D. dentifera. Six loci showed length polymorphism, with as many as five alleles identified for a single locus. Similarity coefficient calculations showed that the extent of genetic variation between pairs of isolates within populations differed according to the population, but it was always less than the genetic distances among populations. Analysis of the genetic distances performed using principal component analysis revealed strong clustering by location of origin, but not by host Daphnia species. Our study demonstrated that the VNTR markers available for P. ramosa are informative in revealing genetic differences within and among populations and may therefore become an important tool for providing detailed analysis of population genetics and epidemiology.

  19. Stress-induced rearrangement of Fusarium retrotransposon sequences.

    PubMed

    Anaya, N; Roncero, M I

    1996-11-27

    Rearrangement of fusarium oxysporum retrotransposon skippy was induced by growth in the presence of potassium chlorate. Three fungal strains, one sensitive to chlorate (Co60) and two resistant to chlorate and deficient for nitrate reductase (Co65 and Co94), were studied by Southern analysis of their genomic DNA. Polymorphism was detected in their hybridization banding pattern, relative to the wild type grown in the absence of chlorate, using various enzymes with or without restriction sites within the retrotransposon. Results were consistent with the assumption that three different events had occurred in strain Co60: genomic amplification of skippy yielding tandem arrays of the element, generation of new skippy sequences, and deletion of skippy sequences. Amplification of Co60 genomic DNA using the polymerase chain reaction and divergent primers derived from the retrotransposon generated a new band, corresponding to one long terminal repeat plus flanking sequences, that was not present in the wild-type strain. Molecular analysis of nitrate reductase-deficient mutants showed that generation and deletion of skippy sequences, but not genomic amplification in tandem repeats, had occurred in their genomes.

  20. PGLa-H tandem-repeat peptides active against multidrug resistant clinical bacterial isolates.

    PubMed

    Rončević, Tomislav; Gajski, Goran; Ilić, Nada; Goić-Barišić, Ivana; Tonkić, Marija; Zoranić, Larisa; Simunić, Juraj; Benincasa, Monica; Mijaković, Marijana; Tossi, Alessandro; Juretić, Davor

    2017-02-01

    Antimicrobial peptides (AMPs) are promising candidates for new antibiotic classes but often display an unacceptably high toxicity towards human cells. A naturally produced C-terminal fragment of PGLa, named PGLa-H, has been reported to have a very low haemolytic activity while maintaining a moderate antibacterial activity. A sequential tandem repeat of this fragment, diPGLa-H, was designed, as well as an analogue with a Val to Gly substitution at a key position. These peptides showed markedly improved in vitro bacteriostatic and bactericidal activity against both reference strains and multidrug resistant clinical isolates of Gram-negative and Gram-positive pathogens, with generally low toxicity for human cells as assessed by haemolysis, cell viability, and DNA damage assays. The glycine substitution analogue, kiadin, had a slightly better antibacterial activity and reduced haemolytic activity, which may correlate with an increased flexibility of its helical structure, as deduced using molecular dynamics simulations. These peptides may serve as useful lead compounds for developing anti-infective agents against resistant Gram-negative and Gram-positive species. Copyright © 2016 Elsevier B.V. All rights reserved.

  1. Molecular characterization of Shiga-toxigenic Escherichia coli isolated from diverse sources from India by multi-locus variable number tandem repeat analysis (MLVA).

    PubMed

    Kumar, A; Taneja, N; Sharma, R K; Sharma, H; Ramamurthy, T; Sharma, M

    2014-12-01

    In a first study from India, a diverse collection of 140 environmental and clinical non-O157 Shiga-toxigenic Escherichia coli strains from a large geographical area in north India was typed by multi-locus variable number tandem repeat analysis (MLVA). The distribution of major virulence genes stx1, stx2 and eae was found to be 78%, 70% and 10%, respectively; 15 isolates were enterohaemorrhagic E. coli (stx1 +/stx2 + and eae +). By MLVA analysis, 44 different alleles were obtained. Dendrogram analysis revealed 104 different genotypes and 19 MLVA-type complexes divided into two main lineages, i.e. mutton and animal stool. Human isolates presented a statistically significant greater odds ratio for clustering with mutton samples compared to animal stool isolates. Five human isolates clustered with animal stool strains suggesting that some of the human infections may be from cattle, perhaps through milk, contact or the environment. Further epidemiological studies are required to explore these sources in context with occurrence of human cases.

  2. DNA fingerprinting of Shiga-toxin producing Escherichia coli O157 based on Multiple-Locus Variable-Number Tandem-Repeats Analysis (MLVA)

    PubMed Central

    Lindstedt, Bjørn-Arne; Heir, Even; Gjernes, Elisabet; Vardund, Traute; Kapperud, Georg

    2003-01-01

    Background The ability to react early to possible outbreaks of Escherichia coli O157:H7 and to trace possible sources relies on the availability of highly discriminatory and reliable techniques. The development of methods that are fast and has the potential for complete automation is needed for this important pathogen. Methods In all 73 isolates of shiga-toxin producing E. coli O157 (STEC) were used in this study. The two available fully sequenced STEC genomes were scanned for tandem repeated stretches of DNA, which were evaluated as polymorphic markers for isolate identification. Results The 73 E. coli isolates displayed 47 distinct patterns and the MLVA assay was capable of high discrimination between the E. coli O157 strains. The assay was fast and all the steps can be automated. Conclusion The findings demonstrate a novel high discriminatory molecular typing method for the important pathogen E. coli O157 that is fast, robust and offers many advantages compared to current methods. PMID:14664722

  3. Application of a multilocus variable number of tandem repeats analysis to regional outbreak surveillance of Enterohemorrhagic Escherichia coli O157:H7 infections.

    PubMed

    Konno, Takayuki; Yatsuyanagi, Jun; Saito, Shioko

    2011-01-01

    A total of 18 strains of EHEC O157:H7 were isolated from distinct cases in Akita Prefecture, Japan from July to September 2007. The genetic relatedness of these isolates was investigated by performing a multilocus variable number of tandem repeats analysis (MLVA) and a pulsed-field gel electrophoresis (PFGE) analysis using XbaI. The PFGE analyses allowed us to group these 18 isolates into three major clusters. The MLVA results correlated closely with those obtained by PFGE, although some variants were found within the clusters obtained by PFGE, thus highlighting the utility of this technique for determining a precise classification when it is difficult to differentiate between isolates with indistinguishable or very similar PFGE patterns. In addition, MLVA is a much easier and more rapid method than PFGE for analysis of the genetic relatedness of strains. Thus, as a second molecular epidemiological subtyping method, MLVA is useful for the regional outbreak surveillance of EHEC O157:H7 infections.

  4. Crystal structures of ryanodine receptor SPRY1 and tandem-repeat domains reveal a critical FKBP12 binding determinant

    NASA Astrophysics Data System (ADS)

    Yuchi, Zhiguang; Yuen, Siobhan M. Wong King; Lau, Kelvin; Underhill, Ainsley Q.; Cornea, Razvan L.; Fessenden, James D.; van Petegem, Filip

    2015-08-01

    Ryanodine receptors (RyRs) form calcium release channels located in the membranes of the sarcoplasmic and endoplasmic reticulum. RyRs play a major role in excitation-contraction coupling and other Ca2+-dependent signalling events, and consist of several globular domains that together form a large assembly. Here we describe the crystal structures of the SPRY1 and tandem-repeat domains at 1.2-1.5 Å resolution, which reveal several structural elements not detected in recent cryo-EM reconstructions of RyRs. The cryo-EM studies disagree on the position of SPRY domains, which had been proposed based on homology modelling. Computational docking of the crystal structures, combined with FRET studies, show that the SPRY1 domain is located next to FK506-binding protein (FKBP). Molecular dynamics flexible fitting and mutagenesis experiments suggest a hydrophobic cluster within SPRY1 that is crucial for FKBP binding. A RyR1 disease mutation, N760D, appears to directly impact FKBP binding through interfering with SPRY1 folding.

  5. Detailed phenolic composition of Vidal grape pomace by ultrahigh-performance liquid chromatography-tandem mass spectrometry.

    PubMed

    Luo, Lanxin; Cui, Yan; Zhang, Shuting; Li, Lingxi; Suo, Hao; Sun, Baoshan

    2017-11-15

    Vidal Blanc grape (Vitis vinifera cv.) is the predominant white grape variety used for the production of icewine in China's Liaoning province. In this paper, the development and validation of the method by ultrahigh-performance liquid chromatography-tandem mass spectrometry has been performed for determination of the detailed phenolic composition in the skin, seed and stem of Vidal grapes. The validation of the method was realized by calculating the linearity, repeatability, precision, stability and the limits of detection (LOD) and quantification (LOQ) of standard solutions. All the curves exhibited good linearity (r 2 >0.9997) and the LOD and LOQ were in the range of 0.002-0.025 and 0.006-0.086μg/ml, respectively. Good repeatability (RSD<4.3%) and stability (RSD<3.7%) were also found. Results confirmed that the developed method was more effective and sensitive for simultaneous determination of the major phenolic compounds in Vidal grape pomace. The optimized and validated method of ultrahigh-performance liquid chromatography tandem two complementary techniques, fourier transform ion cyclotron resonance mass spectrometry and triple-quadrupole mass spectrometry, allowed to identify and quantify up to 35 phenolic compounds in Vidal grape pomace, which has, as far as we know, been reported this grapevine variety for the first time. Seeds, skins and stems exhibited different qualitative and quantitative phenolic profiles. These results provided useful information for recovery of phenolic antioxidants from different parts of icewine pomace. Copyright © 2017. Published by Elsevier B.V.

  6. Divergence, differential methylation and interspersion of melon satellite DNA sequences.

    PubMed Central

    Shmookler Reis, R; Timmis, J N; Ingle, J

    1981-01-01

    Melon (Cucumis melo) satellite DNA consists of two components, Q and S, each with a buoyant density in CsCl of 1.707 g/ml, but differing by 9 degrees C in "melting" temperature. These physical properties appear to be in contradiction, since both depend on G + C content. In order to resolve this anomaly, base compositions were directly determined for isolated fractions. the low-"melting" component S contains 41.8% G + C, with 6% of C present as 5-methylcytosine, whereas Q DNA contains 54% G + C, with 41% of C methylated. Analyses of restriction site loss agreed well with the direct determinations of methylation and divergence, and indicated some clustering of methylated sites in Q DNA. Analysis of restricted main-band DNA by hydridization with RNA complementary to Q satellite DNA ("Southern transfer") showed satellite Q tandem arrays interspersed in DNA of main-band density. Sequence divergence and extent of methylation did not appear to depend on whether a repeat array was present as satellite or interspersed in main-band DNA. Hydridization in situ indicated considerable heterogeneity in the genomic proportion of the Q-DNA sequences in melon fruit nuclei, implying over- and under-representation consistent with extensive unequal recombination in satellite Q tandem arrays. The cucumber, Cucumis sativus, contains less than 8% as much Q-homologous DNA per genome as the melon, suggesting rapid evolutionary gain or loss of these tandem repeat sequences. Images Fig. 2. PLATE 1 Fig. 4. Fig. 10. PMID:6172117

  7. Large pathogenic expansions in the SCA2 and SCA7 genes can be detected by fluorescent repeat-primed polymerase chain reaction assay.

    PubMed

    Cagnoli, Claudia; Stevanin, Giovanni; Michielotto, Chiara; Gerbino Promis, Giovanni; Brussino, Alessandro; Pappi, Patrizia; Durr, Alexandra; Dragone, Elisa; Viemont, Michelle; Gellera, Cinzia; Brice, Alexis; Migone, Nicola; Brusco, Alfredo

    2006-02-01

    Large expansions in the SCA2 and SCA7 genes (>100 CAG repeats) have been associated with juvenile and infantile forms of cerebellar ataxias that cannot be detected using standard polymerase chain reaction (PCR). Here, we describe a successful application of the fluorescent short tandem repeat-primed PCR method for accurate identification of these expanded repeats. The test is robust, reliable, and inexpensive and can be used to screen large series of patients, although it cannot give a precise evaluation of the size of the expansion. This test may be of practical value in prenatal diagnoses offered to affected or pre-symptomatic at-risk parents, in which a very large expansion inherited from one of the parents can be missed in the fetus by standard PCR.

  8. Large Pathogenic Expansions in the SCA2 and SCA7 Genes Can Be Detected by Fluorescent Repeat-Primed Polymerase Chain Reaction Assay

    PubMed Central

    Cagnoli, Claudia; Stevanin, Giovanni; Michielotto, Chiara; Gerbino Promis, Giovanni; Brussino, Alessandro; Pappi, Patrizia; Durr, Alexandra; Dragone, Elisa; Viemont, Michelle; Gellera, Cinzia; Brice, Alexis; Migone, Nicola; Brusco, Alfredo

    2006-01-01

    Large expansions in the SCA2 and SCA7 genes (>100 CAG repeats) have been associated with juvenile and infantile forms of cerebellar ataxias that cannot be detected using standard polymerase chain reaction (PCR). Here, we describe a successful application of the fluorescent short tandem repeat-primed PCR method for accurate identification of these expanded repeats. The test is robust, reliable, and inexpensive and can be used to screen large series of patients, although it cannot give a precise evaluation of the size of the expansion. This test may be of practical value in prenatal diagnoses offered to affected or pre-symptomatic at-risk parents, in which a very large expansion inherited from one of the parents can be missed in the fetus by standard PCR. PMID:16436644

  9. TU-AB-201-01: A Comprehensive Planning Comparison Study Between a Novel Direction Modulated Brachytherapy Tandem Applicator and Conventional T&R Applicator for Image Guided Cervical Cancer Brachytherapy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Han, D; Liu, Z; University of California, San Diego, La Jolla, CA

    2015-06-15

    Purpose: To demonstrate that utilization of a novel, intensity modulation capable, direction modulated brachytherapy (DMBT) tandem applicator can improve plan quality compared with conventional T&R applicator during an image guided cervical cancer brachytherapy. Methods: 45 cervical cancer patients treated with PDR brachytherapy were reviewed. Of them, a) 27 were treated using T&R only, b) 9 were treated using T&R with needles attached to the ring, and c) the remaining 9 were treated using T&R with needles attached to the ring (AN) as well as additional free-hand-loaded needles (FN). The DMBT tandem design has 6 peripheral holes of 1.3-mm diameter, groovedmore » along a nonmagnetic tungsten alloy rod, enclosed in a plastic sheath with total 6.0-mm diameter. An in-house-coded inverse planning system was used for planning DMBT and T&R cases. All typical clinical constraints including OAR dose limits, dwell times, and loading patterns were respected. For the DMBT and T&R applicators, the plans were optimized with the same conventional ring in place, but repeatedly planned with and without AN/FN needles. All generated plans were normalized to the same D90 of the clinically treated plans. Results: For the plans in category a), DMBT generally outperformed T&R with average reduction in D2cc of −2.39%, −5.21%, and −2.69% for bladder, rectum, and sigmoid, respectively. For the plans in category b) and c), DMBT generally outperformed T&R if the same needles in AN/FN were utilized in both cases with average reduction in D2cc of −1.82%, −3.40%, and −6.04%, respectively. For the cases where the needles were not utilized for both applicators, an average D2cc reduction of −7.45%, −7.61%, and 17.47% were observed, respectively. Conclusions: Under the same clinical conditions, with/without needles, the DMBT applicator tends to generate more favorable plans compared with the conventional T&R applicator, and hence, is a promising technology.« less

  10. Tetris Is a Foldback Transposon that Provided the Building Blocks for an Emerging Satellite DNA of Drosophila virilis

    PubMed Central

    Dias, Guilherme B.; Svartman, Marta; Delprat, Alejandra; Ruiz, Alfredo; Kuhn, Gustavo C.S.

    2014-01-01

    Transposable elements (TEs) and satellite DNAs (satDNAs) are abundant components of most eukaryotic genomes studied so far and their impact on evolution has been the focus of several studies. A number of studies linked TEs with satDNAs, but the nature of their evolutionary relationships remains unclear. During in silico analyses of the Drosophila virilis assembled genome, we found a novel DNA transposon we named Tetris based on its modular structure and diversity of rearranged forms. We aimed to characterize Tetris and investigate its role in generating satDNAs. Data mining and sequence analysis showed that Tetris is apparently nonautonomous, with a structure similar to foldback elements, and present in D. virilis and D. americana. Herein, we show that Tetris shares the final portions of its terminal inverted repeats (TIRs) with DAIBAM, a previously described miniature inverted transposable element implicated in the generation of chromosome inversions. Both elements are likely to be mobilized by the same autonomous TE. Tetris TIRs contain approximately 220-bp internal tandem repeats that we have named TIR-220. We also found TIR-220 repeats making up longer (kb-size) satDNA-like arrays. Using bioinformatic, phylogenetic and cytogenomic tools, we demonstrated that Tetris has contributed to shaping the genomes of D. virilis and D. americana, providing internal tandem repeats that served as building blocks for the amplification of satDNA arrays. The β-heterochromatic genomic environment seemed to have favored such amplification. Our results imply for the first time a role for foldback elements in generating satDNAs. PMID:24858539

  11. TRDistiller: a rapid filter for enrichment of sequence datasets with proteins containing tandem repeats.

    PubMed

    Richard, François D; Kajava, Andrey V

    2014-06-01

    The dramatic growth of sequencing data evokes an urgent need to improve bioinformatics tools for large-scale proteome analysis. Over the last two decades, the foremost efforts of computer scientists were devoted to proteins with aperiodic sequences having globular 3D structures. However, a large portion of proteins contain periodic sequences representing arrays of repeats that are directly adjacent to each other (so called tandem repeats or TRs). These proteins frequently fold into elongated fibrous structures carrying different fundamental functions. Algorithms specific to the analysis of these regions are urgently required since the conventional approaches developed for globular domains have had limited success when applied to the TR regions. The protein TRs are frequently not perfect, containing a number of mutations, and some of them cannot be easily identified. To detect such "hidden" repeats several algorithms have been developed. However, the most sensitive among them are time-consuming and, therefore, inappropriate for large scale proteome analysis. To speed up the TR detection we developed a rapid filter that is based on the comparison of composition and order of short strings in the adjacent sequence motifs. Tests show that our filter discards up to 22.5% of proteins which are known to be without TRs while keeping almost all (99.2%) TR-containing sequences. Thus, we are able to decrease the size of the initial sequence dataset enriching it with TR-containing proteins which allows a faster subsequent TR detection by other methods. The program is available upon request. Copyright © 2014 Elsevier Inc. All rights reserved.

  12. Centromere and telomere sequence alterations reflect the rapid genome evolution within the carnivorous plant genus Genlisea.

    PubMed

    Tran, Trung D; Cao, Hieu X; Jovtchev, Gabriele; Neumann, Pavel; Novák, Petr; Fojtová, Miloslava; Vu, Giang T H; Macas, Jiří; Fajkus, Jiří; Schubert, Ingo; Fuchs, Joerg

    2015-12-01

    Linear chromosomes of eukaryotic organisms invariably possess centromeres and telomeres to ensure proper chromosome segregation during nuclear divisions and to protect the chromosome ends from deterioration and fusion, respectively. While centromeric sequences may differ between species, with arrays of tandemly repeated sequences and retrotransposons being the most abundant sequence types in plant centromeres, telomeric sequences are usually highly conserved among plants and other organisms. The genome size of the carnivorous genus Genlisea (Lentibulariaceae) is highly variable. Here we study evolutionary sequence plasticity of these chromosomal domains at an intrageneric level. We show that Genlisea nigrocaulis (1C = 86 Mbp; 2n = 40) and G. hispidula (1C = 1550 Mbp; 2n = 40) differ as to their DNA composition at centromeres and telomeres. G. nigrocaulis and its close relative G. pygmaea revealed mainly 161 bp tandem repeats, while G. hispidula and its close relative G. subglabra displayed a combination of four retroelements at centromeric positions. G. nigrocaulis and G. pygmaea chromosome ends are characterized by the Arabidopsis-type telomeric repeats (TTTAGGG); G. hispidula and G. subglabra instead revealed two intermingled sequence variants (TTCAGG and TTTCAGG). These differences in centromeric and, surprisingly, also in telomeric DNA sequences, uncovered between groups with on average a > 9-fold genome size difference, emphasize the fast genome evolution within this genus. Such intrageneric evolutionary alteration of telomeric repeats with cytosine in the guanine-rich strand, not yet known for plants, might impact the epigenetic telomere chromatin modification. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.

  13. A Simple and Efficient Method for Assembling TALE Protein Based on Plasmid Library

    PubMed Central

    Xu, Huarong; Xin, Ying; Zhang, Tingting; Ma, Lixia; Wang, Xin; Chen, Zhilong; Zhang, Zhiying

    2013-01-01

    DNA binding domain of the transcription activator-like effectors (TALEs) from Xanthomonas sp. consists of tandem repeats that can be rearranged according to a simple cipher to target new DNA sequences with high DNA-binding specificity. This technology has been successfully applied in varieties of species for genome engineering. However, assembling long TALE tandem repeats remains a big challenge precluding wide use of this technology. Although several new methodologies for efficiently assembling TALE repeats have been recently reported, all of them require either sophisticated facilities or skilled technicians to carry them out. Here, we described a simple and efficient method for generating customized TALE nucleases (TALENs) and TALE transcription factors (TALE-TFs) based on TALE repeat tetramer library. A tetramer library consisting of 256 tetramers covers all possible combinations of 4 base pairs. A set of unique primers was designed for amplification of these tetramers. PCR products were assembled by one step of digestion/ligation reaction. 12 TALE constructs including 4 TALEN pairs targeted to mouse Gt(ROSA)26Sor gene and mouse Mstn gene sequences as well as 4 TALE-TF constructs targeted to mouse Oct4, c-Myc, Klf4 and Sox2 gene promoter sequences were generated by using our method. The construction routines took 3 days and parallel constructions were available. The rate of positive clones during colony PCR verification was 64% on average. Sequencing results suggested that all TALE constructs were performed with high successful rate. This is a rapid and cost-efficient method using the most common enzymes and facilities with a high success rate. PMID:23840477

  14. A simple and efficient method for assembling TALE protein based on plasmid library.

    PubMed

    Zhang, Zhiqiang; Li, Duo; Xu, Huarong; Xin, Ying; Zhang, Tingting; Ma, Lixia; Wang, Xin; Chen, Zhilong; Zhang, Zhiying

    2013-01-01

    DNA binding domain of the transcription activator-like effectors (TALEs) from Xanthomonas sp. consists of tandem repeats that can be rearranged according to a simple cipher to target new DNA sequences with high DNA-binding specificity. This technology has been successfully applied in varieties of species for genome engineering. However, assembling long TALE tandem repeats remains a big challenge precluding wide use of this technology. Although several new methodologies for efficiently assembling TALE repeats have been recently reported, all of them require either sophisticated facilities or skilled technicians to carry them out. Here, we described a simple and efficient method for generating customized TALE nucleases (TALENs) and TALE transcription factors (TALE-TFs) based on TALE repeat tetramer library. A tetramer library consisting of 256 tetramers covers all possible combinations of 4 base pairs. A set of unique primers was designed for amplification of these tetramers. PCR products were assembled by one step of digestion/ligation reaction. 12 TALE constructs including 4 TALEN pairs targeted to mouse Gt(ROSA)26Sor gene and mouse Mstn gene sequences as well as 4 TALE-TF constructs targeted to mouse Oct4, c-Myc, Klf4 and Sox2 gene promoter sequences were generated by using our method. The construction routines took 3 days and parallel constructions were available. The rate of positive clones during colony PCR verification was 64% on average. Sequencing results suggested that all TALE constructs were performed with high successful rate. This is a rapid and cost-efficient method using the most common enzymes and facilities with a high success rate.

  15. Interleukin-1 Receptor Antagonist and Interleukin-4 Genes Variable Number Tandem Repeats Are Associated with Adiposity in Malaysian Subjects

    PubMed Central

    Kok, Yung-Yean; Ong, Hing-Huat

    2017-01-01

    Interleukin-1 receptor antagonist (IL1RA) intron 2 86 bp repeat and interleukin-4 (IL4) intron 3 70 bp repeat are variable number tandem repeats (VNTRs) that have been associated with various diseases, but their role in obesity is elusive. The objective of this study was to investigate the association of IL1RA and IL4 VNTRs with obesity and adiposity in 315 Malaysian subjects (128 M/187 F; 23 Malays/251 ethnic Chinese/41 ethnic Indians). The allelic distributions of IL1RA and IL4 were significantly different among ethnicities, and the alleles were associated with total body fat (TBF) classes. Individuals with IL1RA I/II genotype or allele II had greater risk of having higher overall adiposity, relative to those having the I/I genotype or I allele, respectively, even after controlling for ethnicity [Odds Ratio (OR) of I/II genotype = 12.21 (CI = 2.54, 58.79; p = 0.002); II allele = 5.78 (CI = 1.73, 19.29; p = 0.004)]. However, IL4 VNTR B2 allele was only significantly associated with overall adiposity status before adjusting for ethnicity [OR = 1.53 (CI = 1.04, 2.23; p = 0.03)]. Individuals with IL1RA II allele had significantly higher TBF than those with I allele (31.79 ± 2.52 versus 23.51 ± 0.40; p = 0.005). Taken together, IL1RA intron 2 VNTR seems to be a genetic marker for overall adiposity status in Malaysian subjects. PMID:28293435

  16. Interleukin-1 Receptor Antagonist and Interleukin-4 Genes Variable Number Tandem Repeats Are Associated with Adiposity in Malaysian Subjects.

    PubMed

    Kok, Yung-Yean; Ong, Hing-Huat; Say, Yee-How

    2017-01-01

    Interleukin-1 receptor antagonist ( IL1RA ) intron 2 86 bp repeat and interleukin-4 ( IL4 ) intron 3 70 bp repeat are variable number tandem repeats (VNTRs) that have been associated with various diseases, but their role in obesity is elusive. The objective of this study was to investigate the association of IL1RA and IL4 VNTRs with obesity and adiposity in 315 Malaysian subjects (128 M/187 F; 23 Malays/251 ethnic Chinese/41 ethnic Indians). The allelic distributions of IL1RA and IL4 were significantly different among ethnicities, and the alleles were associated with total body fat (TBF) classes. Individuals with IL1RA I/II genotype or allele II had greater risk of having higher overall adiposity, relative to those having the I/I genotype or I allele, respectively, even after controlling for ethnicity [Odds Ratio (OR) of I/II genotype = 12.21 (CI = 2.54, 58.79; p = 0.002); II allele = 5.78 (CI = 1.73, 19.29; p = 0.004)]. However, IL4 VNTR B2 allele was only significantly associated with overall adiposity status before adjusting for ethnicity [OR = 1.53 (CI = 1.04, 2.23; p = 0.03)]. Individuals with IL1RA II allele had significantly higher TBF than those with I allele (31.79 ± 2.52 versus 23.51 ± 0.40; p = 0.005). Taken together, IL1RA intron 2 VNTR seems to be a genetic marker for overall adiposity status in Malaysian subjects.

  17. The mitochondrial genome of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae).

    PubMed

    Yang, Ming Ru; Zhou, Zhi Jun; Chang, Yan Lin; Zhao, Le Hong

    2012-08-01

    To help determine whether the typical arthropod arrangement was a synapomorphy for the whole Tettigoniidae, we sequenced the mitochondrial genome (mitogenome) of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae). The 16,166-bp nucleotide sequences of X. fascipes mitogenome contains the typical gene content, gene order, base composition, and codon usage found in arthropod mitogenomes. As a whole, the X. fascipes mitogenome contains a lower A+T content (70.2%) found in the complete orthopteran mitogenomes determined to date. All protein-coding genes started with a typical ATN codon. Ten of the 13 protein-coding genes have a complete termination codon, but the remaining three genes (COIII, ND5 and ND4) terminate with incomplete T. All tRNAs have the typical clover-leaf structure of mitogenome tRNA, except for tRNA(Ser(AGN)), in which lengthened anticodon stem (9 bp) with a bulged nuleotide in the middle, an unusual T-stem (6 bp in constrast to the normal 5 bp), a mini DHU arm (2 bp) and no connector nucleotides. In the A+T-rich region, two (TA)n conserved blocks that were previously described in Ensifera and two 150-bp tandem repeats plus a partial copy of the composed at 61 bp of the beginning were present. Phylogenetic analysis found: i) the monophyly of Conocephalinae was interrupted by Elimaea cheni from Phaneropterinae; and ii) Meconematinae was the most basal group among these five subfamilies.

  18. The complete mitochondrial genome of the styloperlid stonefly species Styloperla spinicercia Wu (Insecta: Plecoptera) with family-level phylogenetic analyses of the Pteronarcyoidea.

    PubMed

    Wang, Ying; Cao, Jinjun; Li, Weihai

    2017-03-13

    We present the complete mitochondrial (mt) genome sequence of the stonefly, Styloperla spinicercia Wu, 1935 (Plecoptera: Styloperlidae), the type species of the genus Styloperla and the first complete mt genome for the family Styloperlidae. The genome is circular, 16,129 base pairs long, has an A+T content of 70.7%, and contains 37 genes including the large and small ribosomal RNA (rRNA) subunits, 13 protein coding genes (PCGs), 22 tRNA genes and a large non-coding region (CR). All of the PCGs use the standard initiation codon ATN except ND1 and ND5, which start with TTG and GTG. Twelve of the PCGs stop with conventional terminal codons TAA and TAG, except ND5 which shows an incomplete terminator signal T. All tRNAs have the classic clover-leaf structures with the dihydrouridine (DHU) arm of tRNASer(AGN) forming a simple loop. Secondary structures of the two ribosomal RNAs are presented with reference to previous models. The structural elements and the variable numbers of tandem repeats are described within the control region. Phylogenetic analyses using both Bayesian (BI) and Maximum Likelihood (ML) methods support the previous hypotheses regarding family level relationships within the Pteronarcyoidea. The genetic distance calculated based on 13 PCGs and two rRNAs between Styloperla sp. and S. spinicercia is provided and interspecific divergence is discussed.

  19. Evolution in the block: common elements of 5S rDNA organization and evolutionary patterns in distant fish genera.

    PubMed

    Campo, Daniel; García-Vázquez, Eva

    2012-01-01

    The 5S rDNA is organized in the genome as tandemly repeated copies of a structural unit composed of a coding sequence plus a nontranscribed spacer (NTS). The coding region is highly conserved in the evolution, whereas the NTS vary in both length and sequence. It has been proposed that 5S rRNA genes are members of a gene family that have arisen through concerted evolution. In this study, we describe the molecular organization and evolution of the 5S rDNA in the genera Lepidorhombus and Scophthalmus (Scophthalmidae) and compared it with already known 5S rDNA of the very different genera Merluccius (Merluccidae) and Salmo (Salmoninae), to identify common structural elements or patterns for understanding 5S rDNA evolution in fish. High intra- and interspecific diversity within the 5S rDNA family in all the genera can be explained by a combination of duplications, deletions, and transposition events. Sequence blocks with high similarity in all the 5S rDNA members across species were identified for the four studied genera, with evidences of intense gene conversion within noncoding regions. We propose a model to explain the evolution of the 5S rDNA, in which the evolutionary units are blocks of nucleotides rather than the entire sequences or single nucleotides. This model implies a "two-speed" evolution: slow within blocks (homogenized by recombination) and fast within the gene family (diversified by duplications and deletions).

  20. Copy Number Heterogeneity, Large Origin Tandem Repeats, and Interspecies Recombination in Human Herpesvirus 6A (HHV-6A) and HHV-6B Reference Strains

    PubMed Central

    Roychoudhury, Pavitra; Makhsous, Negar; Hanson, Derek; Chase, Jill; Krueger, Gerhard; Xie, Hong; Huang, Meei-Li; Saunders, Lindsay; Ablashi, Dharam; Koelle, David M.; Cook, Linda; Jerome, Keith R.

    2018-01-01

    ABSTRACT Quantitative PCR is a diagnostic pillar for clinical virology testing, and reference materials are necessary for accurate, comparable quantitation between clinical laboratories. Accurate quantitation of human herpesvirus 6A/B (HHV-6A/B) is important for detection of viral reactivation and inherited chromosomally integrated HHV-6A/B in immunocompromised patients. Reference materials in clinical virology commonly consist of laboratory-adapted viral strains that may be affected by the culture process. We performed next-generation sequencing to make relative copy number measurements at single nucleotide resolution of eight candidate HHV-6A and seven HHV-6B reference strains and DNA materials from the HHV-6 Foundation and Advanced Biotechnologies Inc. Eleven of 17 (65%) HHV-6A/B candidate reference materials showed multiple copies of the origin of replication upstream of the U41 gene by next-generation sequencing. These large tandem repeats arose independently in culture-adapted HHV-6A and HHV-6B strains, measuring 1,254 bp and 983 bp, respectively. The average copy number measured was between 5 and 10 times the number of copies of the rest of the genome. We also report the first interspecies recombinant HHV-6A/B strain with a HHV-6A backbone and a >5.5-kb region from HHV-6B, from U41 to U43, that covered the origin tandem repeat. Specific HHV-6A reference strains demonstrated duplication of regions at U1/U2, U87, and U89, as well as deletion in the U12-to-U24 region and the U94/U95 genes. HHV-6A/B strains derived from cord blood mononuclear cells from different laboratories on different continents with fewer passages revealed no copy number differences throughout the viral genome. These data indicate that large origin tandem duplications are an adaptation of both HHV-6A and HHV-6B in culture and show interspecies recombination is possible within the Betaherpesvirinae. IMPORTANCE Anything in science that needs to be quantitated requires a standard unit of measurement. This includes viruses, for which quantitation increasingly determines definitions of pathology and guidelines for treatment. However, the act of making standard or reference material in virology can alter its very accuracy through genomic duplications, insertions, and rearrangements. We used deep sequencing to examine candidate reference strains for HHV-6, a ubiquitous human virus that can reactivate in the immunocompromised population and is integrated into the human genome in every cell of the body for 1% of people worldwide. We found large tandem repeats in the origin of replication for both HHV-6A and HHV-6B that are selected for in culture. We also found the first interspecies recombinant between HHV-6A and HHV-6B, a phenomenon that is well known in alphaherpesviruses but to date has not been seen in betaherpesviruses. These data critically inform HHV-6A/B biology and the standard selection process. PMID:29491155

  1. MAOA, DBH and SLC6A4 variants in CHARGE: A case control study of autism spectrum disorders

    PubMed Central

    Tassone, Flora; Qi, Lihong; Zhang, Wenting; Hansen, Robin L; Pessah, Isaac N; Hertz-Picciotto, Irva

    2011-01-01

    Background Genetic factors are established to contribute to the development of autism. We examined three loci, serotonin transporter (SLC6A4), dopamine hydroxylase (DBH) and the variable number of tandem repeat promoter of the monoamine oxidase A (MAOA) for association with autism in participants from the CHARGE (CHildhood Autism Risks from Genetics and the Environment) Study, the first large-scale population-based case-control investigation of both environmental and genetic contributions to autism risk. Methods Among male children enrolled in the CHARGE study we tested associations between each of the three polymorphisms and autism (AU) (n=119), or a combined group of autism and other autism spectrum disorders (AU+ASD, which includes an additional n=53) as compared with typically developing controls (TD, n=137). Results The case-control association analysis showed neither SLC6A4 nor DBH to be statistically significantly associated with AU or ASD. However, the male children carrying 4 tandem repeats in the promoter region of the MAOA gene showed a 2-fold higher risk of AU (or AU+ASD) than those carrying allele 3, adjusted for confounders (OR = 2.02, 95% CI = 1.12, 3.65, p = 0.02 for AU vs. TD, and OR = 2.05, 95% CI = 1.19, 3.53, p = 0.01 for ASD vs. TD). In addition, mothers homozygous for the 4 tandem repeat allele showed at least a 3-fold higher risk of AU (or AU+ASD) than mothers homozygous for allele 3 (OR = 3.07, 95% CI = 1.19, 7.91, p = 0.02 for AU vs. TD, and OR = 3.26, 95% CI = 1.35, 7.89, p = 0.009 for AU+ASD vs. TD). Conclusions These results suggest a potential role of the functional MAOA promoter alleles in the male child, the mother, or both in autism spectrum disorders. PMID:21538940

  2. Characteristic mutations found in the ML0411 gene of Mycobacterium leprae isolated in Northeast Asian countries.

    PubMed

    Kai, M; Nakata, N; Matsuoka, M; Sekizuka, T; Kuroda, M; Makino, M

    2013-10-01

    Genome analysis of Mycobacterium leprae strain Kyoto-2 in this study revealed characteristic nucleotide substitutions in gene ML0411, compared to the reference genome M. leprae strain TN. The ML0411 gene of Kyoto-2 had six SNPs compared to that of TN. All SNPs in ML0411 were non-synonymous mutations that result in amino acid replacements. In addition, a seventh SNP was found 41 bp upstream of the start codon in the regulatory region. The seven SNP sites in the ML0411 region were investigated by sequencing in 36 M. leprae isolates from the Leprosy Research Center in Japan. The SNP pattern in 14 of the 36 isolates showed similarity to that of Kyoto-2. Determination of the standard SNP types within the 36 stocked isolates revealed that almost all of the Japanese strains belonged to SNP type III, with nucleotide substitutions at position 14676, 164275, and 2935685 of the M. leprae TN genome. The geographical distribution pattern of east Asian M. leprae isolates by discrimination of ML0411 SNPs was investigated and interestingly turned out to be similar to that of tandem repeat numbers of GACATC in the rpoT gene (3 copies or 4 copies), which has been established as a tool for M. leprae genotyping. All seven Korean M. leprae isolates examined in this study, as well as those derived from Honshu Island of Japan, showed 4 copies of the 6-base tandem repeat plus the ML0411 SNPs observed in M. leprae Kyoto-2. They are termed Northeast Asian (NA) strain of M. leprae. On the other hand, many of isolates derived from the Okinawa Islands of Japan and from the Philippines showed 3 copies of the 6-base tandem repeat in addition to the M. leprae TN ML0411 type of SNPs. These results demonstrate the existence of M. leprae strains in Northeast Asian region having characteristic SNP patterns. Copyright © 2013 Elsevier B.V. All rights reserved.

  3. Tandem repeat variation near the HIC1 (hypermethylated in cancer 1) promoter predicts outcome of oxaliplatin-based chemotherapy in patients with metastatic colorectal cancer.

    PubMed

    Okazaki, Satoshi; Schirripa, Marta; Loupakis, Fotios; Cao, Shu; Zhang, Wu; Yang, Dongyun; Ning, Yan; Berger, Martin D; Miyamoto, Yuji; Suenaga, Mitsukuni; Iqubal, Syma; Barzi, Afsaneh; Cremolini, Chiara; Falcone, Alfredo; Battaglin, Francesca; Salvatore, Lisa; Borelli, Beatrice; Helentjaris, Timothy G; Lenz, Heinz-Josef

    2017-11-15

    The hypermethylated in cancer 1/sirtuin 1 (HIC1/SIRT1) axis plays an important role in regulating the nucleotide excision repair pathway, which is the main oxaliplatin-induced damage-repair system. On the basis of prior evidence that the variable number of tandem repeat (VNTR) sequence located near the promoter lesion of HIC1 is associated with HIC1 gene expression, the authors tested the hypothesis that this VNTR is associated with clinical outcome in patients with metastatic colorectal cancer who receive oxaliplatin-based chemotherapy. Four independent cohorts were tested. Patients who received oxaliplatin-based chemotherapy served as the training cohort (n = 218), and those who received treatment without oxaliplatin served as the control cohort (n = 215). Two cohorts of patients who received oxaliplatin-based chemotherapy were used for validation studies (n = 176 and n = 73). The VNTR sequence near HIC1 was analyzed by polymerase chain reaction analysis and gel electrophoresis and was tested for associations with the response rate, progression-free survival, and overall survival. In the training cohort, patients who harbored at least 5 tandem repeats (TRs) in both alleles had a significantly shorter PFS compared with those who had fewer than 4 TRs in at least 1 allele (9.5 vs 11.6 months; hazard ratio, 1.93; P = .012), and these findings remained statistically significant after multivariate analysis (hazard ratio, 2.00; 95% confidence interval, 1.13-3.54; P = .018). This preliminary association was confirmed in the validation cohort, and patients who had at least 5 TRs in both alleles had a worse PFS compared with the other cohort (7.9 vs 9.8 months; hazard ratio, 1.85; P = .044). The current findings suggest that the VNTR sequence near HIC1 could be a predictive marker for oxaliplatin-based chemotherapy in patients with metastatic colorectal cancer. Cancer 2017;123:4506-14. © 2017 American Cancer Society. © 2017 American Cancer Society.

  4. Second generation subtyping: a proposed PulseNet protocol for multiple-locus variable-number tandem repeat analysis of Shiga toxin-producing Escherichia coli O157 (STEC O157).

    PubMed

    Hyytiä-Trees, Eija; Smole, Sandra C; Fields, Patricia A; Swaminathan, Bala; Ribot, Efrain M

    2006-01-01

    Most bacterial genomes contain tandem duplications of short DNA sequences, termed "variable-number tandem repeats" (VNTR). A subtyping method targeting these repeats, multiple-locus VNTR analysis (MLVA), has emerged as a powerful tool for characterization of clonal organisms such as Shiga toxin-producing Escherichia coli O157 (STEC O157). We modified and optimized a recently published MLVA scheme targeting 29 polymorphic VNTR regions of STEC O157 to render it suitable for routine use by public health laboratories that participate in PulseNet, the national and international molecular subtyping network for foodborne disease surveillance. Nine VNTR loci were included in the final protocol. They were amplified in three PCR reactions, after which the PCR products were sized using capillary electrophoresis. Two hundred geographically diverse, sporadic and outbreak- related STEC O157 isolates were characterized by MLVA and the results were compared with data obtained by pulsed-field gel electrophoresis (PFGE) using XbaI macrorestriction of genomic DNA. A total of 139 unique XbaI PFGE patterns and 162 MLVA types were identified. A subset of 100 isolates characterized by both XbaI and BlnI macrorestriction had 62 unique PFGE and MLVA types. Although the clustering of isolates by the two subtyping systems was generally in agreement, some discrepancies were observed. Importantly, MLVA was able to discriminate among some epidemiologically unrelated isolates which were indistinguishable by PFGE. However, among strains from three of the eight outbreaks included in the study, two single locus MLVA variants and one double locus variant were detected among epidemiologically implicated isolates that were indistinguishable by PFGE. Conversely, in three other outbreaks, isolates that were indistinguishable by MLVA displayed multiple PFGE types. An additional more extensive multi-laboratory validation of the MLVA protocol is in progress in order to address critical issues such as establishing epidemiologically relevant interpretation guidelines for the MLVA data.

  5. Dopamine D4 receptor (DRD4) gene polymorphism in the first psychotic episode.

    PubMed

    Rinetti, G; Camarena, B; Cruz, C; Apiquián, R; Fresán, A; Páez, F; Nicolini, H

    2001-01-01

    Dopamine D4 receptor (DRD4) has shown some interesting properties at genetic and possibly functional levels. It has been suggested that some molecular variants of the DRD4 gene (e.g., four and seven alleles) could be implicated in the pathogenesis of psychotic disorders. Additionally, the VNTR polymorphism could be implicated in part of the response to treatment with neuroleptics. This study tested the possible association between the 48-bp tandem repeats in exon 3 of the DRD4 gene and patients experiencing their first psychotic episode. Patients with a first psychotic episode (FPE, n = 37) were diagnosed and compared with a matched control group (n = 37). The FPE group was subdivided into two categories: those with nonaffective and those with affective psychoses. The variable number of tandem repeats (VNTR) region of the DRD4 gene was amplified by PCR procedures. Chi-square statistics and appropriate corrections and adjustments were used for data analysis. A significantly lower frequency of the four repeat (4-R) carriers in the FPE group was observed. This association was sustained mainly by the affective psychotic group (chi2 = 9.99 df = 2, p = 0.0073). Although these results require testing with stringent methods, it is suggested that the DRD4-4R allele may confer some protection against psychosis, mainly of the affective subtype.

  6. Characterization of Spindle Checkpoint Kinase Mps1 Reveals Domain with Functional and Structural Similarities to Tetratricopeptide Repeat Motifs of Bub1 and BubR1 Checkpoint Kinases*

    PubMed Central

    Lee, Semin; Thebault, Philippe; Freschi, Luca; Beaufils, Sylvie; Blundell, Tom L.; Landry, Christian R.; Bolanos-Garcia, Victor M.; Elowe, Sabine

    2012-01-01

    Kinetochore targeting of the mitotic kinases Bub1, BubR1, and Mps1 has been implicated in efficient execution of their functions in the spindle checkpoint, the self-monitoring system of the eukaryotic cell cycle that ensures chromosome segregation occurs with high fidelity. In all three kinases, kinetochore docking is mediated by the N-terminal region of the protein. Deletions within this region result in checkpoint failure and chromosome segregation defects. Here, we use an interdisciplinary approach that includes biophysical, biochemical, cell biological, and bioinformatics methods to study the N-terminal region of human Mps1. We report the identification of a tandem repeat of the tetratricopeptide repeat (TPR) motif in the N-terminal kinetochore binding region of Mps1, with close homology to the tandem TPR motif of Bub1 and BubR1. Phylogenetic analysis indicates that TPR Mps1 was acquired after the split between deutorostomes and protostomes, as it is distinguishable in chordates and echinoderms. Overexpression of TPR Mps1 resulted in decreased efficiency of both chromosome alignment and mitotic arrest, likely through displacement of endogenous Mps1 from the kinetochore and decreased Mps1 catalytic activity. Taken together, our multidisciplinary strategy provides new insights into the evolution, structural organization, and function of Mps1 N-terminal region. PMID:22187426

  7. Characterization of spindle checkpoint kinase Mps1 reveals domain with functional and structural similarities to tetratricopeptide repeat motifs of Bub1 and BubR1 checkpoint kinases.

    PubMed

    Lee, Semin; Thebault, Philippe; Freschi, Luca; Beaufils, Sylvie; Blundell, Tom L; Landry, Christian R; Bolanos-Garcia, Victor M; Elowe, Sabine

    2012-02-17

    Kinetochore targeting of the mitotic kinases Bub1, BubR1, and Mps1 has been implicated in efficient execution of their functions in the spindle checkpoint, the self-monitoring system of the eukaryotic cell cycle that ensures chromosome segregation occurs with high fidelity. In all three kinases, kinetochore docking is mediated by the N-terminal region of the protein. Deletions within this region result in checkpoint failure and chromosome segregation defects. Here, we use an interdisciplinary approach that includes biophysical, biochemical, cell biological, and bioinformatics methods to study the N-terminal region of human Mps1. We report the identification of a tandem repeat of the tetratricopeptide repeat (TPR) motif in the N-terminal kinetochore binding region of Mps1, with close homology to the tandem TPR motif of Bub1 and BubR1. Phylogenetic analysis indicates that TPR Mps1 was acquired after the split between deutorostomes and protostomes, as it is distinguishable in chordates and echinoderms. Overexpression of TPR Mps1 resulted in decreased efficiency of both chromosome alignment and mitotic arrest, likely through displacement of endogenous Mps1 from the kinetochore and decreased Mps1 catalytic activity. Taken together, our multidisciplinary strategy provides new insights into the evolution, structural organization, and function of Mps1 N-terminal region.

  8. Multi-laboratory validation study of multilocus variable-number tandem repeat analysis (MLVA) for Salmonella enterica serovar Enteritidis, 2015.

    PubMed

    Peters, Tansy; Bertrand, Sophie; Björkman, Jonas T; Brandal, Lin T; Brown, Derek J; Erdõsi, Tímea; Heck, Max; Ibrahem, Salha; Johansson, Karin; Kornschober, Christian; Kotila, Saara M; Le Hello, Simon; Lienemann, Taru; Mattheus, Wesley; Nielsen, Eva Møller; Ragimbeau, Catherine; Rumore, Jillian; Sabol, Ashley; Torpdahl, Mia; Trees, Eija; Tuohy, Alma; de Pinna, Elizabeth

    2017-03-02

    Multilocus variable-number tandem repeat analysis (MLVA) is a rapid and reproducible typing method that is an important tool for investigation, as well as detection, of national and multinational outbreaks of a range of food-borne pathogens. Salmonella enterica serovar Enteritidis is the most common Salmonella serovar associated with human salmonellosis in the European Union/European Economic Area and North America. Fourteen laboratories from 13 countries in Europe and North America participated in a validation study for MLVA of S. Enteritidis targeting five loci. Following normalisation of fragment sizes using a set of reference strains, a blinded set of 24 strains with known allele sizes was analysed by each participant. The S. Enteritidis 5-loci MLVA protocol was shown to produce internationally comparable results as more than 90% of the participants reported less than 5% discrepant MLVA profiles. All 14 participating laboratories performed well, even those where experience with this typing method was limited. The raw fragment length data were consistent throughout, and the inter-laboratory validation helped to standardise the conversion of raw data to repeat numbers with at least two countries updating their internal procedures. However, differences in assigned MLVA profiles remain between well-established protocols and should be taken into account when exchanging data. This article is copyright of The Authors, 2017.

  9. Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures.

    PubMed

    Hayes, Michael L; Giang, Karolyn; Mulligan, R Michael

    2012-05-14

    Pentatricopeptide repeat (PPR) proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82) and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3' UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative selection even in the absence of an editing site target suggests that unrecognized function(s) might exist for this PPR protein. PPR gene sequences that encode helix A are under strong selection, and could be involved in RNA substrate recognition.

  10. The application of new software tools to quantitative protein profiling via isotope-coded affinity tag (ICAT) and tandem mass spectrometry: I. Statistically annotated datasets for peptide sequences and proteins identified via the application of ICAT and tandem mass spectrometry to proteins copurifying with T cell lipid rafts.

    PubMed

    von Haller, Priska D; Yi, Eugene; Donohoe, Samuel; Vaughn, Kelly; Keller, Andrew; Nesvizhskii, Alexey I; Eng, Jimmy; Li, Xiao-jun; Goodlett, David R; Aebersold, Ruedi; Watts, Julian D

    2003-07-01

    Lipid rafts were prepared according to standard protocols from Jurkat T cells stimulated via T cell receptor/CD28 cross-linking and from control (unstimulated) cells. Co-isolating proteins from the control and stimulated cell preparations were labeled with isotopically normal (d0) and heavy (d8) versions of the same isotope-coded affinity tag (ICAT) reagent, respectively. Samples were combined, proteolyzed, and resultant peptides fractionated via cation exchange chromatography. Cysteine-containing (ICAT-labeled) peptides were recovered via the biotin tag component of the ICAT reagents by avidin-affinity chromatography. On-line micro-capillary liquid chromatography tandem mass spectrometry was performed on both avidin-affinity (ICAT-labeled) and flow-through (unlabeled) fractions. Initial peptide sequence identification was by searching recorded tandem mass spectrometry spectra against a human sequence data base using SEQUEST software. New statistical data modeling algorithms were then applied to the SEQUEST search results. These allowed for discrimination between likely "correct" and "incorrect" peptide assignments, and from these the inferred proteins that they collectively represented, by calculating estimated probabilities that each peptide assignment and subsequent protein identification was a member of the "correct" population. For convenience, the resultant lists of peptide sequences assigned and the proteins to which they corresponded were filtered at an arbitrarily set cut-off of 0.5 (i.e. 50% likely to be "correct") and above and compiled into two separate datasets. In total, these data sets contained 7667 individual peptide identifications, which represented 2669 unique peptide sequences, corresponding to 685 proteins and related protein groups.

  11. ScanRanker: Quality Assessment of Tandem Mass Spectra via Sequence Tagging

    PubMed Central

    Ma, Ze-Qiang; Chambers, Matthew C.; Ham, Amy-Joan L.; Cheek, Kristin L.; Whitwell, Corbin W.; Aerni, Hans-Rudolf; Schilling, Birgit; Miller, Aaron W.; Caprioli, Richard M.; Tabb, David L.

    2011-01-01

    In shotgun proteomics, protein identification by tandem mass spectrometry relies on bioinformatics tools. Despite recent improvements in identification algorithms, a significant number of high quality spectra remain unidentified for various reasons. Here we present ScanRanker, an open-source tool that evaluates the quality of tandem mass spectra via sequence tagging with reliable performance in data from different instruments. The superior performance of ScanRanker enables it not only to find unassigned high quality spectra that evade identification through database search, but also to select spectra for de novo sequencing and cross-linking analysis. In addition, we demonstrate that the distribution of ScanRanker scores predicts the richness of identifiable spectra among multiple LC-MS/MS runs in an experiment, and ScanRanker scores assist the process of peptide assignment validation to increase confident spectrum identifications. The source code and executable versions of ScanRanker are available from http://fenchurch.mc.vanderbilt.edu. PMID:21520941

  12. Clonal origins of Vibrio cholerae O1 El Tor strains, Papua New Guinea, 2009-2011.

    PubMed

    Horwood, Paul F; Collins, Deirdre; Jonduo, Marinjho H; Rosewell, Alexander; Dutta, Samir R; Dagina, Rosheila; Ropa, Berry; Siba, Peter M; Greenhill, Andrew R

    2011-11-01

    We used multilocus sequence typing and variable number tandem repeat analysis to determine the clonal origins of Vibrio cholerae O1 El Tor strains from an outbreak of cholera that began in 2009 in Papua New Guinea. The epidemic is ongoing, and transmission risk is elevated within the Pacific region.

  13. Brief Report: Identical Male Twins Concordant for Asperger's Disorder

    ERIC Educational Resources Information Center

    Ishijima, Michiko; Kurita, Hiroshi

    2007-01-01

    The first case study of identical male twins concordant for DSM-IV Asperger's disorder (ASD) was presented. Their monozygocity was confirmed by short tandem repeat analyses with a probability of 99.999963%. Despite sharing the same DNA and environment, the twins are different in comorbidity (i.e., major depressive disorder in the elder and absence…

  14. Interaction of Dopamine Transporter Gene and Observed Parenting Behaviors on Attention-Deficit/Hyperactivity Disorder: A Structural Equation Modeling Approach

    ERIC Educational Resources Information Center

    Li, James J.; Lee, Steve S.

    2013-01-01

    Emerging evidence suggests that some individuals may be simultaneously more responsive to the effects from environmental adversity "and" enrichment (i.e., differential susceptibility). Given that parenting behavior and a variable number tandem repeat polymorphism in the 3'untranslated region of the dopamine transporter (DAT1) gene are…

  15. Association of ADHD, Tics, and Anxiety with Dopamine Transporter ("DAT1") Genotype in Autism Spectrum Disorder

    ERIC Educational Resources Information Center

    Gadow, Kenneth D.; Roohi, Jasmin; DeVincent, Carla J.; Hatchwell, Eli

    2008-01-01

    Background: Autism spectrum disorder (ASD) is associated with high rates of psychiatric disturbance to include attention-deficit/hyperactivity disorder (ADHD), tic disorder, and anxiety disorders. The aim of the present study was to examine the association between a variable number tandem repeat (VNTR) functional polymorphism located in the…

  16. Pharmacokinetic-Pharmacodynamic Assessment of Faropenem in a Lethal Murine Bacillus anthracis Inhalation Postexposure Prophylaxis Model

    DTIC Science & Technology

    2010-05-01

    Inhalation Postexposure Prophylaxis Model Stanley C. Gill,1* Christopher M. Rubino,2 Jennifer Bassett,3 Lynda Miller,3 Paul G. Ambrose,2 Sujata M. Bhavnani,2...tandem repeat analysis reveals genetic relationships within Bacillus anthracis. J. Bacteriol. 182:2928–2936. 14. Mohammed, M. J., C. K. Marston , T

  17. Optimization of absorption bands of dye-sensitized and perovskite tandem solar cells based on loss-in-potential values.

    PubMed

    Sobuś, Jan; Ziółek, Marcin

    2014-07-21

    A numerical study of optimal bandgaps of light absorbers in tandem solar cell configurations is presented with the main focus on dye-sensitized solar cells (DSSCs) and perovskite solar cells (PSCs). The limits in efficiency and the expected improvements of tandem structures are investigated as a function of total loss-in-potential (V(L)), incident photon to current efficiency (IPCE) and fill factor (FF) of individual components. It is shown that the optimal absorption onsets are significantly smaller than those derived for multi-junction devices. For example, for double-cell devices the onsets are at around 660 nm and 930 nm for DSSCs with iodide based electrolytes and at around 720 nm and 1100 nm for both DSSCs with cobalt based electrolytes and PSCs. Such configurations can increase the total sunlight conversion efficiency by about 35% in comparison to single-cell devices of the same VL, IPCE and FF. The relevance of such studies for tandem n-p DSSCs and for a proposed new configuration for PSCs is discussed. In particular, it is shown that maximum total losses of 1.7 V for DSSCs and 1.4 V for tandem PSCs are necessary to give any efficiency improvement with respect to the single bandgap device. This means, for example, a tandem n-p DSSC with TiO2 and NiO porous electrodes will hardly work better than the champion single DSSC. A source code of the program used for calculations is also provided.

  18. A new family of dispersed repeats from Brassica nigra: characterization and localization.

    PubMed

    Kapila, R; Negi, M S; This, P; Delseny, M; Srivastava, P S; Lakshmikumaran, M

    1996-11-01

    The 459-bp HindIII (pBN-4) and the 1732-bp Eco RI (pBNE8) fragments from the Brassica nigra genome were cloned and shown to be members of a dispersed repeat family. Of the three major diploid Brassica species, the repeat pBN-4 was found to be highly specific for the B. nigra genome. The family also hybridized to Sinapis arvensis showing that B. nigra had a closer relationship with the S. arvensis genome than with B. oleracea or B. campestris. The clone pBNE8 showed homology to a number of tRNA species indicating that this family of repeats may have originated from a tRNA sequence. The species-specific 459-bp repeat pBN-4 was localized on the B. nigra chromosomes using monosomic addition lines. In addition to the localization of pBN-4, the chromosomal distribution of two other species-specific repeats, pBN34 and pBNBH35 (reported earlier), was studied. The dispersed repeats pBN-4 and pBNBH35 were found to be present on all of the chromosomes, whereas the tandem repeat pBN34 was localized on two chromosomes.

  19. CRISPRcompar: a website to compare clustered regularly interspaced short palindromic repeats.

    PubMed

    Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine

    2008-07-01

    Clustered regularly interspaced short palindromic repeat (CRISPR) elements are a particular family of tandem repeats present in prokaryotic genomes, in almost all archaea and in about half of bacteria, and which participate in a mechanism of acquired resistance against phages. They consist in a succession of direct repeats (DR) of 24-47 bp separated by similar sized unique sequences (spacers). In the large majority of cases, the direct repeats are highly conserved, while the number and nature of the spacers are often quite diverse, even among strains of a same species. Furthermore, the acquisition of new units (DR + spacer) was shown to happen almost exclusively on one side of the locus. Therefore, the CRISPR presents an interesting genetic marker for comparative and evolutionary analysis of closely related bacterial strains. CRISPRcompar is a web service created to assist biologists in the CRISPR typing process. Two tools facilitates the in silico investigation: CRISPRcomparison and CRISPRtionary. This website is freely accessible at http://crispr.u-psud.fr/CRISPRcompar/.

  20. Non-B DB v2.0: a database of predicted non-B DNA-forming motifs and its associated tools.

    PubMed

    Cer, Regina Z; Donohue, Duncan E; Mudunuri, Uma S; Temiz, Nuri A; Loss, Michael A; Starner, Nathan J; Halusa, Goran N; Volfovsky, Natalia; Yi, Ming; Luke, Brian T; Bacolla, Albino; Collins, Jack R; Stephens, Robert M

    2013-01-01

    The non-B DB, available at http://nonb.abcc.ncifcrf.gov, catalogs predicted non-B DNA-forming sequence motifs, including Z-DNA, G-quadruplex, A-phased repeats, inverted repeats, mirror repeats, direct repeats and their corresponding subsets: cruciforms, triplexes and slipped structures, in several genomes. Version 2.0 of the database revises and re-implements the motif discovery algorithms to better align with accepted definitions and thresholds for motifs, expands the non-B DNA-forming motifs coverage by including short tandem repeats and adds key visualization tools to compare motif locations relative to other genomic annotations. Non-B DB v2.0 extends the ability for comparative genomics by including re-annotation of the five organisms reported in non-B DB v1.0, human, chimpanzee, dog, macaque and mouse, and adds seven additional organisms: orangutan, rat, cow, pig, horse, platypus and Arabidopsis thaliana. Additionally, the non-B DB v2.0 provides an overall improved graphical user interface and faster query performance.

  1. Development of high intensity ion sources for a Tandem-Electrostatic-Quadrupole facility for Accelerator-Based Boron Neutron Capture Therapy.

    PubMed

    Bergueiro, J; Igarzabal, M; Sandin, J C Suarez; Somacal, H R; Vento, V Thatar; Huck, H; Valda, A A; Repetto, M; Kreiner, A J

    2011-12-01

    Several ion sources have been developed and an ion source test stand has been mounted for the first stage of a Tandem-Electrostatic-Quadrupole facility For Accelerator-Based Boron Neutron Capture Therapy. A first source, designed, fabricated and tested is a dual chamber, filament driven and magnetically compressed volume plasma proton ion source. A 4 mA beam has been accelerated and transported into the suppressed Faraday cup. Extensive simulations of the sources have been performed using both 2D and 3D self-consistent codes. Copyright © 2011 Elsevier Ltd. All rights reserved.

  2. Tandem betatron

    DOEpatents

    Keinigs, Rhonald K.

    1992-01-01

    Two betatrons are provided in tandem for alternately accelerating an electron beam to avoid the single flux swing limitation of conventional betatrons and to accelerate the electron beam to high energies. The electron beam is accelerated in a first betatron during a period of increasing magnetic flux. The eletron beam is extracted from the first betatron as a peak magnetic flux is reached and then injected into a second betatron at a time of minimum magnetic flux in the second betatron. The cycle may be repeated until the desired electron beam energy is obtained. In one embodiment, the second betatron is axially offset from the first betatron to provide for electron beam injection directly at the axial location of the beam orbit in the second betatron.

  3. Dietary Intervention by Phytochemicals and Their Role in Modulating Coding and Non-Coding Genes in Cancer

    PubMed Central

    Budisan, Liviuta; Gulei, Diana; Zanoaga, Oana Mihaela; Irimie, Alexandra Iulia; Chira, Sergiu; Braicu, Cornelia; Gherman, Claudia Diana; Berindan-Neagoe, Ioana

    2017-01-01

    Phytochemicals are natural compounds synthesized as secondary metabolites in plants, representing an important source of molecules with a wide range of therapeutic applications. These natural agents are important regulators of key pathological processes/conditions, including cancer, as they are able to modulate the expression of coding and non-coding transcripts with an oncogenic or tumour suppressor role. These natural agents are currently exploited for the development of therapeutic strategies alone or in tandem with conventional treatments for cancer. The aim of this paper is to review the recent studies regarding the role of these natural phytochemicals in different processes related to cancer inhibition, including apoptosis activation, angiogenesis and metastasis suppression. From the large palette of phytochemicals we selected epigallocatechin gallate (EGCG), caffeic acid phenethyl ester (CAPE), genistein, morin and kaempferol, due to their increased activity in modulating multiple coding and non-coding genes, targeting the main hallmarks of cancer. PMID:28587155

  4. Dietary Intervention by Phytochemicals and Their Role in Modulating Coding and Non-Coding Genes in Cancer.

    PubMed

    Budisan, Liviuta; Gulei, Diana; Zanoaga, Oana Mihaela; Irimie, Alexandra Iulia; Sergiu, Chira; Braicu, Cornelia; Gherman, Claudia Diana; Berindan-Neagoe, Ioana

    2017-06-01

    Phytochemicals are natural compounds synthesized as secondary metabolites in plants, representing an important source of molecules with a wide range of therapeutic applications. These natural agents are important regulators of key pathological processes/conditions, including cancer, as they are able to modulate the expression of coding and non-coding transcripts with an oncogenic or tumour suppressor role. These natural agents are currently exploited for the development of therapeutic strategies alone or in tandem with conventional treatments for cancer. The aim of this paper is to review the recent studies regarding the role of these natural phytochemicals in different processes related to cancer inhibition, including apoptosis activation, angiogenesis and metastasis suppression. From the large palette of phytochemicals we selected epigallocatechin gallate (EGCG), caffeic acid phenethyl ester (CAPE), genistein, morin and kaempferol, due to their increased activity in modulating multiple coding and non-coding genes, targeting the main hallmarks of cancer.

  5. Tetris is a foldback transposon that provided the building blocks for an emerging satellite DNA of Drosophila virilis.

    PubMed

    Dias, Guilherme B; Svartman, Marta; Delprat, Alejandra; Ruiz, Alfredo; Kuhn, Gustavo C S

    2014-05-24

    Transposable elements (TEs) and satellite DNAs (satDNAs) are abundant components of most eukaryotic genomes studied so far and their impact on evolution has been the focus of several studies. A number of studies linked TEs with satDNAs, but the nature of their evolutionary relationships remains unclear. During in silico analyses of the Drosophila virilis assembled genome, we found a novel DNA transposon we named Tetris based on its modular structure and diversity of rearranged forms. We aimed to characterize Tetris and investigate its role in generating satDNAs. Data mining and sequence analysis showed that Tetris is apparently nonautonomous, with a structure similar to foldback elements, and present in D. virilis and D. americana. Herein, we show that Tetris shares the final portions of its terminal inverted repeats (TIRs) with DAIBAM, a previously described miniature inverted transposable element implicated in the generation of chromosome inversions. Both elements are likely to be mobilized by the same autonomous TE. Tetris TIRs contain approximately 220-bp internal tandem repeats that we have named TIR-220. We also found TIR-220 repeats making up longer (kb-size) satDNA-like arrays. Using bioinformatic, phylogenetic and cytogenomic tools, we demonstrated that Tetris has contributed to shaping the genomes of D. virilis and D. americana, providing internal tandem repeats that served as building blocks for the amplification of satDNA arrays. The β-heterochromatic genomic environment seemed to have favored such amplification. Our results imply for the first time a role for foldback elements in generating satDNAs. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  6. TRStalker: an efficient heuristic for finding fuzzy tandem repeats.

    PubMed

    Pellegrini, Marco; Renda, M Elena; Vecchio, Alessio

    2010-06-15

    Genomes in higher eukaryotic organisms contain a substantial amount of repeated sequences. Tandem Repeats (TRs) constitute a large class of repetitive sequences that are originated via phenomena such as replication slippage and are characterized by close spatial contiguity. They play an important role in several molecular regulatory mechanisms, and also in several diseases (e.g. in the group of trinucleotide repeat disorders). While for TRs with a low or medium level of divergence the current methods are rather effective, the problem of detecting TRs with higher divergence (fuzzy TRs) is still open. The detection of fuzzy TRs is propaedeutic to enriching our view of their role in regulatory mechanisms and diseases. Fuzzy TRs are also important as tools to shed light on the evolutionary history of the genome, where higher divergence correlates with more remote duplication events. We have developed an algorithm (christened TRStalker) with the aim of detecting efficiently TRs that are hard to detect because of their inherent fuzziness, due to high levels of base substitutions, insertions and deletions. To attain this goal, we developed heuristics to solve a Steiner version of the problem for which the fuzziness is measured with respect to a motif string not necessarily present in the input string. This problem is akin to the 'generalized median string' that is known to be an NP-hard problem. Experiments with both synthetic and biological sequences demonstrate that our method performs better than current state of the art for fuzzy TRs and that the fuzzy TRs of the type we detect are indeed present in important biological sequences. TRStalker will be integrated in the web-based TRs Discovery Service (TReaDS) at bioalgo.iit.cnr.it. Supplementary data are available at Bioinformatics online.

  7. Rapid evolutionary change of common bean (Phaseolus vulgaris L) plastome, and the genomic diversification of legume chloroplasts

    PubMed Central

    Guo, Xianwu; Castillo-Ramírez, Santiago; González, Víctor; Bustos, Patricia; Luís Fernández-Vázquez, José; Santamaría, Rosa Isela; Arellano, Jesús; Cevallos, Miguel A; Dávila, Guillermo

    2007-01-01

    Background Fabaceae (legumes) is one of the largest families of flowering plants, and some members are important crops. In contrast to what we know about their great diversity or economic importance, our knowledge at the genomic level of chloroplast genomes (cpDNAs or plastomes) for these crops is limited. Results We sequenced the complete genome of the common bean (Phaseolus vulgaris cv. Negro Jamapa) chloroplast. The plastome of P. vulgaris is a 150,285 bp circular molecule. It has gene content similar to that of other legume plastomes, but contains two pseudogenes, rpl33 and rps16. A distinct inversion occurred at the junction points of trnH-GUG/rpl14 and rps19/rps8, as in adzuki bean [1]. These two pseudogenes and the inversion were confirmed in 10 varieties representing the two domestication centers of the bean. Genomic comparative analysis indicated that inversions generally occur in legume plastomes and the magnitude and localization of insertions/deletions (indels) also vary. The analysis of repeat sequences demonstrated that patterns and sequences of tandem repeats had an important impact on sequence diversification between legume plastomes and tandem repeats did not belong to dispersed repeats. Interestingly, P. vulgaris plastome had higher evolutionary rates of change on both genomic and gene levels than G. max, which could be the consequence of pressure from both mutation and natural selection. Conclusion Legume chloroplast genomes are widely diversified in gene content, gene order, indel structure, abundance and localization of repetitive sequences, intracellular sequence exchange and evolutionary rates. The P. vulgaris plastome is a rapidly evolving genome. PMID:17623083

  8. Structural analysis of the rDNA intergenic spacer of Brassica nigra: evolutionary divergence of the spacers of the three diploid Brassica species.

    PubMed

    Bhatia, S; Singh Negi, M; Lakshmikumaran, M

    1996-11-01

    EcoRI restriction of the B. nigra rDNA recombinants, isolated from a lambda genomic library, showed that the 3.9-kb fragment corresponded to the Intergenic Spacer (IGS), which was sequenced and found to be 3,928 bp in size. Sequence and dot-matrix analyses showed that the organization of the B. nigra rDNA IGS was typical of most rDNA spacers, consisting of a central repetitive region and flanking unique sequences on either side. The repetitive region was composed of two repeat families-RF 'A' and RF 'B.' The B. nigra RF 'A' consisted of a tandem array of three full-length copies of a 106-bp sequence element. RF 'B' was composed of 66 tandemly repeated elements. Each 'B' element was only 21-bp in size and this is the smallest repeat unit identified in plant rDNA to date. The putative transcription initiation site (TIS) was identified as nucleotide position 3,110. Based on the sequence analysis it was suggested that the present organization of the repeat families was generated by successive cycles of deletions and amplifications and was being maintained by homogenization processes such as gene conversion and crossing-over.A detailed comparison of the rDNA IGS sequences of the three diploid Brassica species-namely, B. nigra, B. campestris, and B. oleracea-was carried out. First, comparisons revealed that B. campestris and B. oleracea were close to each other as the repeat families in both showed high sequence homology between each other. Second, the repeat elements in both the species were organized in an interspersed manner. Third, a 52-bp sequence, present just downstream of the repeats in B. campestris, was found to be identical to the B. oleracea repeats, thereby suggesting a common progenitor. On the other hand, in B. nigra no interspersion pattern of organization of repeats was observed. Further, the B. nigra RF 'A' was identified as distinct from the repeat families of B. campestris and B. oleracea. Based on this analysis, it was suggested that during speciation B. campestris and B. oleracea evolved in one lineage whereas B. nigra diverged into a separate lineage. The comparative analysis of the IGS helped in identifying not only conserved ancestral sequence motifs of possible functional significance such as promoters and enhancers, but also sequences which showed variation between the three diploid species and were therefore identified as species-specific sequences.

  9. Effects of Transposable Elements on the Expression of the Forked Gene of Drosophila Melanogaster

    PubMed Central

    Hoover, K. K.; Chien, A. J.; Corces, V. G.

    1993-01-01

    The products of the forked gene are involved in the formation and/or maintenance of a temporary fibrillar structure within the developing bristle rudiment of Drosophila melanogaster. Mutations in the forked locus alter this structure and result in aberrant development of macrochaetae, microchaetae and trichomes. The locus has been characterized at the molecular level by walking, mutant characterization and transcript analysis. Expression of the six forked transcripts is temporally restricted to midlate pupal development. At this time, RNAs of 6.4, 5.6, 5.4, 2.5, 1.9 and 1.1 kilobases (kb) are detected by Northern analysis. The coding region of these RNAs has been found to be within a 21-kb stretch of genomic DNA. The amino terminus of the proteins encoded by the 5.4- and 5.6-kb forked transcripts contain tandem copies of ankyrin-like repeats that may play an important role in the function of forked-encoded products. The profile of forked RNA expression is altered in seven spontaneous mutations characterized during this study. Three forked mutations induced by the insertion of the gypsy retrotransposon contain a copy of this element inserted into an intron of the gene. In these mutants, the 5.6-, 5.4- and 2.5-kb forked mRNAs are truncated via recognition of the polyadenylation site in the 5' long terminal repeat of the gypsy retrotransposon. These results help explain the role of the forked gene in fly development and further our understanding of the role of transposable elements in mutagenesis. PMID:8244011

  10. Complete chloroplast genome sequence and comparative analysis of loblolly pine (Pinus taeda L.) with related species

    PubMed Central

    Khan, Abdul Latif; Khan, Muhammad Aaqil; Shahzad, Raheem; Lubna; Kang, Sang Mo; Al-Harrasi, Ahmed; Al-Rawahi, Ahmed; Lee, In-Jung

    2018-01-01

    Pinaceae, the largest family of conifers, has a diversified organization of chloroplast (cp) genomes with two typical highly reduced inverted repeats (IRs). In the current study, we determined the complete sequence of the cp genome of an economically and ecologically important conifer tree, the loblolly pine (Pinus taeda L.), using Illumina paired-end sequencing and compared the sequence with those of other pine species. The results revealed a genome size of 121,531 base pairs (bp) containing a pair of 830-bp IR regions, distinguished by a small single copy (42,258 bp) and large single copy (77,614 bp) region. The chloroplast genome of P. taeda encodes 120 genes, comprising 81 protein-coding genes, four ribosomal RNA genes, and 35 tRNA genes, with 151 randomly distributed microsatellites. Approximately 6 palindromic, 34 forward, and 22 tandem repeats were found in the P. taeda cp genome. Whole cp genome comparison with those of other Pinus species exhibited an overall high degree of sequence similarity, with some divergence in intergenic spacers. Higher and lower numbers of indels and single-nucleotide polymorphism substitutions were observed relative to P. contorta and P. monophylla, respectively. Phylogenomic analyses based on the complete genome sequence revealed that 60 shared genes generated trees with the same topologies, and P. taeda was closely related to P. contorta in the subgenus Pinus. Thus, the complete P. taeda genome provided valuable resources for population and evolutionary studies of gymnosperms and can be used to identify related species. PMID:29596414

  11. Automatic-repeat-request error control schemes

    NASA Technical Reports Server (NTRS)

    Lin, S.; Costello, D. J., Jr.; Miller, M. J.

    1983-01-01

    Error detection incorporated with automatic-repeat-request (ARQ) is widely used for error control in data communication systems. This method of error control is simple and provides high system reliability. If a properly chosen code is used for error detection, virtually error-free data transmission can be attained. Various types of ARQ and hybrid ARQ schemes, and error detection using linear block codes are surveyed.

  12. Identification and characterization of short tandem repeats in the Tibetan macaque genome based on resequencing data.

    PubMed

    Liu, San-Xu; Hou, Wei; Zhang, Xue-Yan; Peng, Chang-Jun; Yue, Bi-Song; Fan, Zhen-Xin; Li, Jing

    2018-07-18

    The Tibetan macaque, which is endemic to China, is currently listed as a Near Endangered primate species by the International Union for Conservation of Nature (IUCN). Short tandem repeats (STRs) refer to repetitive elements of genome sequence that range in length from 1-6 bp. They are found in many organisms and are widely applied in population genetic studies. To clarify the distribution characteristics of genome-wide STRs and understand their variation among Tibetan macaques, we conducted a genome-wide survey of STRs with next-generation sequencing of five macaque samples. A total of 1 077 790 perfect STRs were mined from our assembly, with an N50 of 4 966 bp. Mono-nucleotide repeats were the most abundant, followed by tetra- and di-nucleotide repeats. Analysis of GC content and repeats showed consistent results with other macaques. Furthermore, using STR analysis software (lobSTR), we found that the proportion of base pair deletions in the STRs was greater than that of insertions in the five Tibetan macaque individuals (P<0.05, t-test). We also found a greater number of homozygous STRs than heterozygous STRs (P<0.05, t-test), with the Emei and Jianyang Tibetan macaques showing more heterozygous loci than Huangshan Tibetan macaques. The proportion of insertions and mean variation of alleles in the Emei and Jianyang individuals were slightly higher than those in the Huangshan individuals, thus revealing differences in STR allele size between the two populations. The polymorphic STR loci identified based on the reference genome showed good amplification efficiency and could be used to study population genetics in Tibetan macaques. The neighbor-joining tree classified the five macaques into two different branches according to their geographical origin, indicating high genetic differentiation between the Huangshan and Sichuan populations. We elucidated the distribution characteristics of STRs in the Tibetan macaque genome and provided an effective method for screening polymorphic STRs. Our results also lay a foundation for future genetic variation studies of macaques.

  13. Antihypertensive activity of transgenic rice seed containing an 18-repeat novokinin peptide localized in the nucleolus of endosperm cells.

    PubMed

    Wakasa, Yuhya; Zhao, Hui; Hirose, Sakiko; Yamauchi, Daiki; Yamada, Yuko; Yang, Lijun; Ohinata, Kousaku; Yoshikawa, Masaaki; Takaiwa, Fumio

    2011-09-01

    Novokinin (Arg-Pro-Leu-Lys-Pro-Trp, RPLKPW) is a new potent antihypertensive peptide based on the sequence of ovokinin (2-7) derived from ovalbumin. We previously generated transgenic rice seeds in which eight novokinin were fused to storage protein glutelins (GluA2 and GluC) for expression. Oral administration of these seeds to spontaneously hypertensive rats (SHRs) reduced systolic blood pressures at a dose of 1 g seed/kg of SHR. Here, 10- or 18-tandem repeats of novokinin with an endoplasmic reticulum (ER) retention signal (Lys-Asp-Glu-Leu, KDEL) at the C terminus were directly expressed in rice under the control of the glutelin promoter containing its signal peptide. Only small amounts of the 18-repeat novokinin accumulated, and it was unexpectedly deposited in the nucleolus. This abnormal intracellular localization was explained by an endogenous signal for nuclear localization. The GFP reporter protein fused to this sequence targeted to nuclei by a transient assay using onion epidermal cells. Transgenic seed expressing the 18-repeat novokinin exhibited significantly higher antihypertensive activity after a single oral dose to SHR even at one-quarter the amount (0.25 g/kg) of the transgenic rice seed expressing the fusion construct; though, its novokinin content was much lower (1/5). Furthermore, in a long-term administration for 5 weeks, even a smaller dose (0.0625 g/kg) of transgenic seeds could confer antihypertensive activity. This high antihypertensive activity may be attributed to differences in digestibility of expressed products by gastrointestinal enzymes and the unique intracellular localization. These results indicate that accumulation of novokinin as a tandemly repeated structure in transgenic rice is more effective than as a fusion-type structure. © 2010 The Authors. Plant Biotechnology Journal © 2010 Society for Experimental Biology and Blackwell Publishing Ltd.

  14. Molecular Typing of Pneumococci for Investigation of Linked Cases of Invasive Pneumococcal Disease ▿

    PubMed Central

    Pichon, Bruno; Moyce, Laura; Sheppard, Carmen; Slack, Mary; Turbitt, Deborah; Pebody, Richard; Spencer, David A.; Edwards, Justin; Krahé, Daniel; George, Robert

    2010-01-01

    In winter 2007-2008, an outbreak of pediatric pneumonia caused by serotype 5 pneumococci was identified in a northeast London suburb. Variable number of tandem repeat analyses clustered these pneumococci from the other serotype 5 pneumococci in the United Kingdom, highlighting the importance of this discriminative typing method in supporting epidemiological investigations. PMID:20164267

  15. Clonal Origins of Vibrio cholerae O1 El Tor Strains, Papua New Guinea, 2009–2011

    PubMed Central

    Collins, Deirdre; Jonduo, Marinjho H.; Rosewell, Alexander; Dutta, Samir R.; Dagina, Rosheila; Ropa, Berry; Siba, Peter M.; Greenhill, Andrew R.

    2011-01-01

    We used multilocus sequence typing and variable number tandem repeat analysis to determine the clonal origins of Vibrio cholerae O1 El Tor strains from an outbreak of cholera that began in 2009 in Papua New Guinea. The epidemic is ongoing, and transmission risk is elevated within the Pacific region. PMID:22099099

  16. Variable Number of Tandem Repeat Markers in the Genome Sequence of Mycosphaerella Fijiensis, the Causal Agent of Black Leaf Streak Disease of Banana (Musa spp.)

    USDA-ARS?s Scientific Manuscript database

    Mycosphaerella fijiensis, the causal agent of banana leaf streak disease (commonly known as black Sigatoka), is the most devastating pathogen attacking bananas (Musa spp). Recently the whole genome sequence of M. fijiensis became available. This sequence was screened for the presence of Variable Num...

  17. Molecular Strain Typing of Mycobacterium tuberculosis: a Review of Frequently Used Methods

    PubMed Central

    2016-01-01

    Tuberculosis, caused by the bacterium Mycobacterium tuberculosis, remains one of the most serious global health problems. Molecular typing of M. tuberculosis has been used for various epidemiologic purposes as well as for clinical management. Currently, many techniques are available to type M. tuberculosis. Choosing the most appropriate technique in accordance with the existing laboratory conditions and the specific features of the geographic region is important. Insertion sequence IS6110-based restriction fragment length polymorphism (RFLP) analysis is considered the gold standard for the molecular epidemiologic investigations of tuberculosis. However, other polymerase chain reaction-based methods such as spacer oligonucleotide typing (spoligotyping), which detects 43 spacer sequence-interspersing direct repeats (DRs) in the genomic DR region; mycobacterial interspersed repetitive units–variable number tandem repeats, (MIRU-VNTR), which determines the number and size of tandem repetitive DNA sequences; repetitive-sequence-based PCR (rep-PCR), which provides high-throughput genotypic fingerprinting of multiple Mycobacterium species; and the recently developed genome-based whole genome sequencing methods demonstrate similar discriminatory power and greater convenience. This review focuses on techniques frequently used for the molecular typing of M. tuberculosis and discusses their general aspects and applications. PMID:27709842

  18. Structural basis of DNA sequence recognition by the response regulator PhoP in Mycobacterium tuberculosis.

    PubMed

    He, Xiaoyuan; Wang, Liqin; Wang, Shuishu

    2016-04-15

    The transcriptional regulator PhoP is an essential virulence factor in Mycobacterium tuberculosis, and it presents a target for the development of new anti-tuberculosis drugs and attenuated tuberculosis vaccine strains. PhoP binds to DNA as a highly cooperative dimer by recognizing direct repeats of 7-bp motifs with a 4-bp spacer. To elucidate the PhoP-DNA binding mechanism, we determined the crystal structure of the PhoP-DNA complex. The structure revealed a tandem PhoP dimer that bound to the direct repeat. The surprising tandem arrangement of the receiver domains allowed the four domains of the PhoP dimer to form a compact structure, accounting for the strict requirement of a 4-bp spacer and the highly cooperative binding of the dimer. The PhoP-DNA interactions exclusively involved the effector domain. The sequence-recognition helix made contact with the bases of the 7-bp motif in the major groove, and the wing interacted with the adjacent minor groove. The structure provides a starting point for the elucidation of the mechanism by which PhoP regulates the virulence of M. tuberculosis and guides the design of screening platforms for PhoP inhibitors.

  19. The structure of the regulatory region of the rat L1 (L1Rn, long interspersed repeated) DNA family of transposable elements.

    PubMed Central

    Furano, A V; Robb, S M; Robb, F T

    1988-01-01

    Here we report the DNA structure of the left 1.5 kb of two newly isolated full length members of the rat L1 DNA family (L1Rn, long interspersed repeated DNA). In contrast to earlier isolated rat L1 members, both of these contain promoter-like regions that are most likely full length. In addition, the promoter-like region of both members has undergone a partial tandem duplication. A second internal region of the left end of one of the reported members is also tandemly duplicated. The propensity of the left end of rat L1 elements to undergo this form of genetic rearrangement, as well as other structural features revealed by the present work, is discussed in light of the fact that during evolution the otherwise conserved mammalian L1 DNA families have each acquired completely different promoter-like regions. In an accompanying paper [Nur, I., Pascale, E., and Furano, A. V. (1988) Nucleic Acids Res. 16, submitted], we report that one of the rat promoter-like regions can function as a promoter in rat cells when fused to the Escherichia coli chloramphenicol acyltransferase gene. PMID:2845369

  20. Centromeres: long intergenic spaces with adaptive features.

    PubMed

    Kanizay, Lisa; Dawe, R Kelly

    2009-08-01

    Centromeres are composed of inner kinetochore proteins, which are largely conserved across species, and repetitive DNA, which shows comparatively little sequence conservation. Due to this fundamental paradox the formation and maintenance of centromeres remains largely a mystery. However, it has become increasingly clear that a long-standing balance between epigenetic and genetic control governs the interactions of centromeric DNA and inner kinetochore proteins. The comparison of classical neocentromeres in plants, which are entirely genetic in their mode of operation, and clinical neocentromeres, which are sequence-independent, illustrates the conflict between genetics and epigenetics in regions that control their own transmission to progeny. Tandem repeat arrays present in centromeres may have an origin in meiotic drive or other selfish patterns of evolution, as is the case for the CENP-B box and CENP-B protein in human. In grasses retrotransposons have invaded centromeres to the point of complete domination, consequently breaking genetic regulation at these centromeres. The accumulation of tandem repeats and transposons causes centromeres to expand in size, effectively pushing genes to the sides and opening the centromere to ever fewer constraints on the DNA sequence. On genetic maps centromeres appear as long intergenic spaces that evolve rapidly and apparently without regard to host fitness.

  1. Variant Alleles, Triallelic Patterns, and Point Mutations Observed in Nuclear Short Tandem Repeat Typing of Populations in Bosnia and Serbia

    PubMed Central

    Huel, René L. M.; Bašić, Lara; Madacki-Todorović, Kamelija; Smajlović, Lejla; Eminović, Izet; Berbić, Irfan; Miloš, Ana; Parsons, Thomas J.

    2007-01-01

    Aim To present a compendium of off-ladder alleles and other genotyping irregularities relating to rare/unexpected population genetic variation, observed in a large short tandem repeat (STR) database from Bosnia and Serbia. Methods DNA was extracted from blood stain cards relating to reference samples from a population of 32 800 individuals from Bosnia and Serbia, and typed using Promega’s PowerPlex®16 STR kit. Results There were 31 distinct off-ladder alleles were observed in 10 of the 15 STR loci amplified from the PowerPlex®16 STR kit. Of these 31 alleles, 3 have not been previously reported. Furthermore, 16 instances of triallelic patterns were observed in 9 of the 15 loci. Primer binding site mismatches that affected amplification were observed in two loci, D5S818 and D8S1179. Conclusion Instances of deviations from manufacturer’s allelic ladders should be expected and caution taken to properly designate the correct alleles in large DNA databases. Particular care should be taken in kinship matching or paternity cases as incorrect designation of any of these deviations from allelic ladders could lead to false exclusions. PMID:17696304

  2. Elevated mutation rates in the germ line of first- and second-generation offspring of irradiated male mice

    PubMed Central

    Barber, Ruth; Plumb, Mark A.; Boulton, Emma; Roux, Isabelle; Dubrova, Yuri E.

    2002-01-01

    Mutation rates at two expanded simple tandem repeat loci were studied in the germ line of first- and second-generation offspring of inbred male CBA/H, C57BL/6, and BALB/c mice exposed to either high linear energy transfer fission neutrons or low linear energy transfer x-rays. Paternal CBA/H exposure to either x-rays or fission neutrons resulted in increased mutation rates in the germ line of two subsequent generations. Comparable transgenerational effects were observed also in neutron-irradiated C57BL/6 and x-irradiated BALB/c mice. The levels of spontaneous mutation rates and radiation-induced transgenerational instability varied between strains (BALB/c>CBA/H>C57BL/6). Pre- and postmeiotic paternal exposure resulted in similar increases in mutation rate in the germ line of both generations of CBA/H mice, which together with our previous results suggests that radiation-induced expanded simple tandem repeat instability is manifested in diploid cells after fertilization. The remarkable finding that radiation-induced germ-line instability persists for at least two generations raises important issues of risk evaluation in humans. PMID:11997464

  3. Release of kinesin from vesicles by hsc70 and regulation of fast axonal transport

    NASA Technical Reports Server (NTRS)

    Tsai, M. Y.; Morfini, G.; Szebenyi, G.; Brady, S. T.

    2000-01-01

    The nature of kinesin interactions with membrane-bound organelles and mechanisms for regulation of kinesin-based motility have both been surprisingly difficult to define. Most kinesin is recovered in supernatants with standard protocols for purification of motor proteins, but kinesin recovered on membrane-bound organelles is tightly bound. Partitioning of kinesin between vesicle and cytosolic fractions is highly sensitive to buffer composition. Addition of either N-ethylmaleimide or EDTA to homogenization buffers significantly increased the fraction of kinesin bound to organelles. Given that an antibody against kinesin light chain tandem repeats also releases kinesin from vesicles, these observations indicated that specific cytoplasmic factors may regulate kinesin release from membranes. Kinesin light tandem repeats contain DnaJ-like motifs, so the effects of hsp70 chaperones were evaluated. Hsc70 released kinesin from vesicles in an MgATP-dependent and N-ethylmaleimide-sensitive manner. Recombinant kinesin light chains inhibited kinesin release by hsc70 and stimulated the hsc70 ATPase. Hsc70 actions may provide a mechanism to regulate kinesin function by releasing kinesin from cargo in specific subcellular domains, thereby effecting delivery of axonally transported materials.

  4. Molecular Strain Typing of Mycobacterium tuberculosis: a Review of Frequently Used Methods.

    PubMed

    Ei, Phyu Win; Aung, Wah Wah; Lee, Jong Seok; Choi, Go Eun; Chang, Chulhun L

    2016-11-01

    Tuberculosis, caused by the bacterium Mycobacterium tuberculosis, remains one of the most serious global health problems. Molecular typing of M. tuberculosis has been used for various epidemiologic purposes as well as for clinical management. Currently, many techniques are available to type M. tuberculosis. Choosing the most appropriate technique in accordance with the existing laboratory conditions and the specific features of the geographic region is important. Insertion sequence IS6110-based restriction fragment length polymorphism (RFLP) analysis is considered the gold standard for the molecular epidemiologic investigations of tuberculosis. However, other polymerase chain reaction-based methods such as spacer oligonucleotide typing (spoligotyping), which detects 43 spacer sequence-interspersing direct repeats (DRs) in the genomic DR region; mycobacterial interspersed repetitive units-variable number tandem repeats, (MIRU-VNTR), which determines the number and size of tandem repetitive DNA sequences; repetitive-sequence-based PCR (rep-PCR), which provides high-throughput genotypic fingerprinting of multiple Mycobacterium species; and the recently developed genome-based whole genome sequencing methods demonstrate similar discriminatory power and greater convenience. This review focuses on techniques frequently used for the molecular typing of M. tuberculosis and discusses their general aspects and applications.

  5. Multiple-Locus Variable-Number Tandem-Repeats Analysis of Escherichia coli O157 using PCR multiplexing and multi-colored capillary electrophoresis.

    PubMed

    Lindstedt, Bjørn-Arne; Vardund, Traute; Kapperud, Georg

    2004-08-01

    The Multiple-Locus Variable-Number Tandem-Repeats Analysis (MLVA) method is currently being used as the primary typing tool for Shiga-toxin-producing Escherichia coli (STEC) O157 isolates in our laboratory. The initial assay was performed using a single fluorescent dye and the different patterns were assigned using a gel image. Here, we present a significantly improved assay using multiple dye colors and enhanced PCR multiplexing to increase speed, and ease the interpretation of the results. The different MLVA patterns are now based on allele sizes entered as character values, thus removing the uncertainties introduced when analyzing band patterns from the gel image. We additionally propose an easy numbering scheme for the identification of separate isolates that will facilitate exchange of typing data. Seventy-two human and animal strains of Shiga-toxin-producing E. coli O157 were used for the development of the improved MLVA assay. The method is based on capillary separation of multiplexed PCR products of VNTR loci in the E. coli O157 genome labeled with multiple fluorescent dyes. The different alleles at each locus were then assigned to allele numbers, which were used for strain comparison.

  6. Multiple-locus variable-number tandem-repeats analysis of Listeria monocytogenes using multicolour capillary electrophoresis and comparison with pulsed-field gel electrophoresis typing.

    PubMed

    Lindstedt, Bjørn-Arne; Tham, Wilhelm; Danielsson-Tham, Marie-Louise; Vardund, Traute; Helmersson, Seved; Kapperud, Georg

    2008-02-01

    The multiple-locus variable-number tandem-repeats analysis (MLVA) method for genotyping has proven to be a fast and reliable typing tool in several bacterial species. MLVA is in our laboratory the routine typing method for Salmonella enterica subsp. enterica serovar Typhimurium and Escherichia coli O157. The gram-positive bacteria Listeria monocytogenes, while not isolated as frequent as S. Typhimurium and E. coli, causes severe illness with an overall mortality rate of 30%. Thus, it is important that any outbreak of this pathogen is detected early and a fast trace to the source can be performed. In view of this, we have used the information provided by two fully sequenced L. monocytogenes strains to develop a MLVA assay coupled with high-resolution capillary electrophoresis and compared it to pulsed-field gel electrophoresis (PFGE) in two sets of isolates, one Norwegian (79 isolates) and one Swedish (61 isolates) set. The MLVA assay could resolve all of the L. monocytogenes serotypes tested, and was slightly more discriminatory than PFGE for the Norwegian isolates (28 MLVA profiles and 24 PFGE profiles) and opposite for the Swedish isolates (42 MLVA profiles and 43 PFGE profiles).

  7. Clinical significance of mycobacterial genotyping in Mycobacterium avium lung disease in Korea.

    PubMed

    Kim, S-Y; Lee, S-T; Jeong, B-H; Jeon, K; Kim, J-W; Shin, S J; Koh, W-J

    2012-10-01

    A recent study in Japan found that mycobacterial genotyping was associated with disease progression and susceptibility to certain drugs in Mycobacterium avium lung disease. However, it is not known whether this association is true in other populations. To investigate the association between mycobacterial genotype, clinical characteristics and the progression of M. avium lung disease in Korean patients. A total of 102 M. avium clinical isolates were genotyped using M. avium tandem repeats-variable number of tandem repeats (MATR-VNTR). MATR-VNTR typing demonstrated a high discriminatory power and genetic diversity for molecular epidemiological studies of M. avium. In the phylogenetic tree, the M. avium clinical isolates were divided into three major clusters: A, B and C. Cluster A was observed most frequently (64/102, 63%), whereas cluster C was found in a minor proportion of the isolates (8/102, 8%). However, there was no association between the clinical characteristics, disease progression and drug susceptibility and the phylogenetic tree based on VNTR genotyping. MATR-VNTR genotyping may be useful for epidemiological studies of M. avium lung disease; however, no association was found between the specific VNTR genotypes of M. avium and the clinical characteristics of Korean patients.

  8. Software for peak finding and elemental composition assignment for glycosaminoglycan tandem mass spectra.

    PubMed

    Hogan, John D; Klein, Joshua A; Wu, Jiandong; Chopra, Pradeep; Boons, Geert-Jan; Carvalho, Luis; Lin, Cheng; Zaia, Joseph

    2018-04-03

    Glycosaminoglycans (GAGs) covalently linked to proteoglycans (PGs) are characterized by repeating disaccharide units and variable sulfation patterns along the chain. GAG length and sulfation patterns impact disease etiology, cellular signaling, and structural support for cells. We and others have demonstrated the usefulness of tandem mass spectrometry (MS2) for assigning the structures of GAG saccharides; however, manual interpretation of tandem mass spectra is time-consuming, so computational methods must be employed. In the proteomics domain, the identification of monoisotopic peaks and charge states relies on algorithms that use averagine, or the average building block of the compound class being analyzed. While these methods perform well for protein and peptide spectra, they perform poorly on GAG tandem mass spectra, due to the fact that a single average building block does not characterize the variable sulfation of GAG disaccharide units. In addition, it is necessary to assign product ion isotope patterns in order to interpret the tandem mass spectra of GAG saccharides. To address these problems, we developed GAGfinder, the first tandem mass spectrum peak finding algorithm developed specifically for GAGs. We define peak finding as assigning experimental isotopic peaks directly to a given product ion composition, as opposed to deconvolution or peak picking, which are terms more accurately describing the existing methods previously mentioned. GAGfinder is a targeted, brute force approach to spectrum analysis that utilizes precursor composition information to generate all theoretical fragments. GAGfinder also performs peak isotope composition annotation, which is typically a subsequent step for averagine-based methods. Data are available via ProteomeXchange with identifier PXD009101. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.

  9. High Genetic Diversity Revealed by Variable-Number Tandem Repeat Genotyping and Analysis of hsp65 Gene Polymorphism in a Large Collection of “Mycobacterium canettii” Strains Indicates that the M. tuberculosis Complex Is a Recently Emerged Clone of “M. canettii”

    PubMed Central

    Fabre, Michel; Koeck, Jean-Louis; Le Flèche, Philippe; Simon, Fabrice; Hervé, Vincent; Vergnaud, Gilles; Pourcel, Christine

    2004-01-01

    We have analyzed, using complementary molecular methods, the diversity of 43 strains of “Mycobacterium canettii” originating from the Republic of Djibouti, on the Horn of Africa, from 1998 to 2003. Genotyping by multiple-locus variable-number tandem repeat analysis shows that all the strains belong to a single but very distant group when compared to strains of the Mycobacterium tuberculosis complex (MTBC). Thirty-one strains cluster into one large group with little variability and five strains form another group, whereas the other seven are more diverged. In total, 14 genotypes are observed. The DR locus analysis reveals additional variability, some strains being devoid of a direct repeat locus and others having unique spacers. The hsp65 gene polymorphism was investigated by restriction enzyme analysis and sequencing of PCR amplicons. Four new single nucleotide polymorphisms were discovered. One strain was characterized by three nucleotide changes in 441 bp, creating new restriction enzyme polymorphisms. As no sequence variability was found for hsp65 in the whole MTBC, and as a single point mutation separates M. tuberculosis from the closest “M. canettii” strains, this diversity within “M. canettii” subspecies strongly suggests that it is the most probable source species of the MTBC rather than just another branch of the MTBC. PMID:15243089

  10. Cloning and Molecular Characterization of an Immunogenic LigA Protein of Leptospira interrogans

    PubMed Central

    Palaniappan, Raghavan U. M.; Chang, Yung-Fu; Jusuf, S. S. D.; Artiushin, S.; Timoney, John F.; McDonough, Sean P.; Barr, Steve C.; Divers, Thomas J.; Simpson, Kenneth W.; McDonough, Patrick L.; Mohammed, Hussni O.

    2002-01-01

    A clone expressing a novel immunoreactive leptospiral immunoglobulin-like protein A of 130 kDa (LigA) from Leptospira interrogans serovar pomona type kennewicki was isolated by screening a genomic DNA library with serum from a mare that had recently aborted due to leptospiral infection. LigA is encoded by an open reading frame of 3,675 bp, and the deduced amino acid sequence consists of a series of 90-amino-acid tandem repeats. A search of the NCBI database found that homology of the LigA repeat region was limited to an immunoglobulin-like domain of the bacterial intimin binding protein of Escherichia coli, the cell adhesion domain of Clostridium acetobutylicum, and the invasin of Yersinia pestis. Secondary structure prediction analysis indicates that LigA consists mostly of beta sheets with a few alpha-helical regions. No LigA was detectable by immunoblot analysis of lysates of the leptospires grown in vitro at 30°C or when cultures were shifted to 37°C. Strikingly, immunohistochemistry on kidney from leptospira-infected hamsters demonstrated LigA expression. These findings suggest that LigA is specifically induced only in vivo. Sera from horses, which aborted as a result of natural Leptospira infection, strongly recognize LigA. LigA is the first leptospiral protein described to have 12 tandem repeats and is also the first to be expressed only during infection. Thus, LigA may have value in serodiagnosis or as a protective immunogen in novel vaccines. PMID:12379666

  11. Highly diverse variable number tandem repeat loci in the E. coli O157:H7 and O55:H7 genomes for high-resolution molecular typing.

    PubMed

    Keys, C; Kemper, S; Keim, P

    2005-01-01

    Evaluation of the Escherichia coli genome for variable number tandem repeat (VNTR) loci in order to provide a subtyping tool with greater discrimination and more efficient capacity. Twenty-nine putative VNTR loci were identified from the E. coli genomic sequence. Their variability was validated by characterizing the number of repeats at each locus in a set of 56 E. coli O157:H7/HN and O55:H7 isolates. An optimized multiplex assay system was developed to facility high capacity analysis. Locus diversity values ranged from 0.23 to 0.95 while the number of alleles ranged from two to 29. This multiple-locus VNTR analysis (MLVA) data was used to describe genetic relationships among these isolates and was compared with PFGE (pulse field gel electrophoresis) data from a subset of the same strains. Genetic similarity values were highly correlated between the two approaches, through MLVA was capable of discrimination amongst closely related isolates when PFGE similar values were equal to 1.0. Highly variable VNTR loci exist in the E. coli O157:H7 genome and are excellent estimators of genetic relationships, in particular for closely related isolates. Escherichia coli O157:H7 MLVA offers a complimentary analysis to the more traditional PFGE approach. Application of MLVA to an outbreak cluster could generate superior molecular epidemiology and result in a more effective public health response.

  12. Molecular cloning and sequence analysis of the gene coding for the 57kDa soluble antigen of the salmonid fish pathogen Renibacterium salmoninarum

    USGS Publications Warehouse

    Chien, Maw-Sheng; Gilbert , Teresa L.; Huang, Chienjin; Landolt, Marsha L.; O'Hara, Patrick J.; Winton, James R.

    1992-01-01

    The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an opening reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated Mr value of 57190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27–61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein in synthesized as a 557-amino acid precursor and processed to produce a mature protein of Mr 54505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.

  13. Chromosome fragility at FRAXA in human cleavage stage embryos at risk for fragile X syndrome.

    PubMed

    Verdyck, Pieter; Berckmoes, Veerle; De Vos, Anick; Verpoest, Willem; Liebaers, Inge; Bonduelle, Maryse; De Rycke, Martine

    2015-10-01

    Fragile X syndrome (FXS), the most common inherited intellectual disability syndrome, is caused by expansion and hypermethylation of the CGG repeat in the 5' UTR of the FMR1 gene. This expanded repeat, also known as the rare fragile site FRAXA, causes X chromosome fragility in cultured cells from patients but only when induced by perturbing pyrimidine synthesis. We performed preimplantation genetic diagnosis (PGD) on 595 blastomeres biopsied from 442 cleavage stage embryos at risk for FXS using short tandem repeat (STR) markers. In six blastomeres, from five embryos an incomplete haplotype was observed with loss of all alleles telomeric to the CGG repeat. In all five embryos, the incomplete haplotype corresponded to the haplotype carrying the CGG repeat expansion. Subsequent analysis of additional blastomeres from three embryos by array comparative genomic hybridization (aCGH) confirmed the presence of a terminal deletion with a breakpoint close to the CGG repeat in two blastomeres from one embryo. A blastomere from another embryo showed the complementary duplication. We conclude that a CGG repeat expansion at FRAXA causes X chromosome fragility in early human IVF embryos at risk for FXS. © 2015 Wiley Periodicals, Inc.

  14. Repeat-mediated epigenetic dysregulation of the FMR1 gene in the fragile X-related disorders.

    PubMed

    Usdin, Karen; Kumari, Daman

    2015-01-01

    The fragile X-related disorders are members of the Repeat Expansion Diseases, a group of genetic conditions resulting from an expansion in the size of a tandem repeat tract at a specific genetic locus. The repeat responsible for disease pathology in the fragile X-related disorders is CGG/CCG and the repeat tract is located in the 5' UTR of the FMR1 gene, whose protein product FMRP, is important for the proper translation of dendritic mRNAs in response to synaptic activation. There are two different pathological FMR1 allele classes that are distinguished only by the number of repeats. Premutation alleles have 55-200 repeats and confer risk of fragile X-associated tremor/ataxia syndrome and fragile X-associated primary ovarian insufficiency. Full mutation alleles on the other hand have >200 repeats and result in fragile X syndrome, a disorder that affects learning and behavior. Different symptoms are seen in carriers of premutation and full mutation alleles because the repeat number has paradoxical effects on gene expression: Epigenetic changes increase transcription from premutation alleles and decrease transcription from full mutation alleles. This review will cover what is currently known about the mechanisms responsible for these changes in FMR1 expression and how they may relate to other Repeat Expansion Diseases that also show repeat-mediated changes in gene expression.

  15. Non-RVD mutations that enhance the dynamics of the TAL repeat array along the superhelical axis improve TALEN genome editing efficacy

    PubMed Central

    Tochio, Naoya; Umehara, Kohei; Uewaki, Jun-ichi; Flechsig, Holger; Kondo, Masaharu; Dewa, Takehisa; Sakuma, Tetsushi; Yamamoto, Takashi; Saitoh, Takashi; Togashi, Yuichi; Tate, Shin-ichi

    2016-01-01

    Transcription activator-like effector (TALE) nuclease (TALEN) is widely used as a tool in genome editing. The DNA binding part of TALEN consists of a tandem array of TAL-repeats that form a right-handed superhelix. Each TAL-repeat recognises a specific base by the repeat variable diresidue (RVD) at positions 12 and 13. TALEN comprising the TAL-repeats with periodic mutations to residues at positions 4 and 32 (non-RVD sites) in each repeat (VT-TALE) exhibits increased efficacy in genome editing compared with a counterpart without the mutations (CT-TALE). The molecular basis for the elevated efficacy is unknown. In this report, comparison of the physicochemical properties between CT- and VT-TALEs revealed that VT-TALE has a larger amplitude motion along the superhelical axis (superhelical motion) compared with CT-TALE. The greater superhelical motion in VT-TALE enabled more TAL-repeats to engage in the target sequence recognition compared with CT-TALE. The extended sequence recognition by the TAL-repeats improves site specificity with limiting the spatial distribution of FokI domains to facilitate their dimerization at the desired site. Molecular dynamics simulations revealed that the non-RVD mutations alter inter-repeat hydrogen bonding to amplify the superhelical motion of VT-TALE. The TALEN activity is associated with the inter-repeat hydrogen bonding among the TAL repeats. PMID:27883072

  16. Repeats of base oligomers as the primordial coding sequences of the primeval earth and their vestiges in modern genes.

    PubMed

    Ohno, S

    1984-01-01

    Three outstanding properties uniquely qualify repeats of base oligomers as the primordial coding sequences of all polypeptide chains. First, when compared with randomly generated base sequences in general, they are more likely to have long open reading frames. Second, periodical polypeptide chains specified by such repeats are more likely to assume either alpha-helical or beta-sheet secondary structures than are polypeptide chains of random sequence. Third, provided that the number of bases in the oligomeric unit is not a multiple of 3, these internally repetitious coding sequences are impervious to randomly sustained base substitutions, deletions, and insertions. This is because the recurring periodicity of their polypeptide chains is given by three consecutive copies of the oligomeric unit translated in three different reading frames. Accordingly, when one reading frame is open, the other two are automatically open as well, all three being capable of coding for polypeptide chains of identical periodicity. Under this circumstance, a frame shift due to the deletion or insertion of a number of bases that is not a multiple of 3 fails to alter the down-stream amino acid sequence, and even a base change causing premature chain-termination can silence only one of the three potential coding units. Newly arisen coding sequences in modern organisms are oligomeric repeats, and most of the older genes retain various vestiges of their original internal repetitions. Some of the genes (e.g., oncogenes) have even inherited the property of being impervious to randomly sustained base changes.

  17. Unraveling the Molecular Basis of Temperature-Dependent Genetic Regulation in Penicillium marneffei

    PubMed Central

    Yang, Ence; Wang, Gang; Woo, Patrick C. Y.; Lau, Susanna K. P.; Chow, Wang-Ngai; Chong, Ken T. K.; Tse, Herman; Kao, Richard Y. T.; Chan, Che-Man; Che, Xiaoyan; Yuen, Kwok-Yung

    2013-01-01

    Penicillium marneffei is an opportunistic fungal pathogen endemic in Southeast Asia, causing lethal systemic infections in immunocompromised patients. P. marneffei grows in a mycelial form at the ambient temperature of 25°C and transitions to a yeast form at 37°C. The ability to alternate between the mycelial and yeast forms at different temperatures, namely, thermal dimorphism, has long been considered critical for the pathogenicity of P. marneffei, yet the underlying genetic mechanisms remain elusive. Here we employed high-throughput sequencing to unravel global transcriptional profiles of P. marneffei PM1 grown at 25 and 37°C. Among ∼11,000 protein-coding genes, 1,447 were overexpressed and 1,414 were underexpressed at 37°C. Counterintuitively, heat-responsive genes, predicted in P. marneffei through sequence comparison, did not tend to be overexpressed at 37°C. These results suggest that P. marneffei may take a distinct strategy of genetic regulation at the elevated temperature; the current knowledge concerning fungal heat response, based on studies of model fungal organisms, may not be applicable to P. marneffei. Our results further showed that the tandem repeat sequences (TRSs) are overrepresented in coding regions of P. marneffei genes, and TRS-containing genes tend to be overexpressed at 37°C. Furthermore, genomic sequences and expression data were integrated to characterize gene clusters, multigene families, and species-specific genes of P. marneffei. In sum, we present an integrated analysis and a comprehensive resource toward a better understanding of temperature-dependent genetic regulation in P. marneffei. PMID:23851338

  18. DeNovoGUI: An Open Source Graphical User Interface for de Novo Sequencing of Tandem Mass Spectra

    PubMed Central

    2013-01-01

    De novo sequencing is a popular technique in proteomics for identifying peptides from tandem mass spectra without having to rely on a protein sequence database. Despite the strong potential of de novo sequencing algorithms, their adoption threshold remains quite high. We here present a user-friendly and lightweight graphical user interface called DeNovoGUI for running parallelized versions of the freely available de novo sequencing software PepNovo+, greatly simplifying the use of de novo sequencing in proteomics. Our platform-independent software is freely available under the permissible Apache2 open source license. Source code, binaries, and additional documentation are available at http://denovogui.googlecode.com. PMID:24295440

  19. DeNovoGUI: an open source graphical user interface for de novo sequencing of tandem mass spectra.

    PubMed

    Muth, Thilo; Weilnböck, Lisa; Rapp, Erdmann; Huber, Christian G; Martens, Lennart; Vaudel, Marc; Barsnes, Harald

    2014-02-07

    De novo sequencing is a popular technique in proteomics for identifying peptides from tandem mass spectra without having to rely on a protein sequence database. Despite the strong potential of de novo sequencing algorithms, their adoption threshold remains quite high. We here present a user-friendly and lightweight graphical user interface called DeNovoGUI for running parallelized versions of the freely available de novo sequencing software PepNovo+, greatly simplifying the use of de novo sequencing in proteomics. Our platform-independent software is freely available under the permissible Apache2 open source license. Source code, binaries, and additional documentation are available at http://denovogui.googlecode.com .

  20. Constructing LDPC Codes from Loop-Free Encoding Modules

    NASA Technical Reports Server (NTRS)

    Divsalar, Dariush; Dolinar, Samuel; Jones, Christopher; Thorpe, Jeremy; Andrews, Kenneth

    2009-01-01

    A method of constructing certain low-density parity-check (LDPC) codes by use of relatively simple loop-free coding modules has been developed. The subclasses of LDPC codes to which the method applies includes accumulate-repeat-accumulate (ARA) codes, accumulate-repeat-check-accumulate codes, and the codes described in Accumulate-Repeat-Accumulate-Accumulate Codes (NPO-41305), NASA Tech Briefs, Vol. 31, No. 9 (September 2007), page 90. All of the affected codes can be characterized as serial/parallel (hybrid) concatenations of such relatively simple modules as accumulators, repetition codes, differentiators, and punctured single-parity check codes. These are error-correcting codes suitable for use in a variety of wireless data-communication systems that include noisy channels. These codes can also be characterized as hybrid turbolike codes that have projected graph or protograph representations (for example see figure); these characteristics make it possible to design high-speed iterative decoders that utilize belief-propagation algorithms. The present method comprises two related submethods for constructing LDPC codes from simple loop-free modules with circulant permutations. The first submethod is an iterative encoding method based on the erasure-decoding algorithm. The computations required by this method are well organized because they involve a parity-check matrix having a block-circulant structure. The second submethod involves the use of block-circulant generator matrices. The encoders of this method are very similar to those of recursive convolutional codes. Some encoders according to this second submethod have been implemented in a small field-programmable gate array that operates at a speed of 100 megasymbols per second. By use of density evolution (a computational- simulation technique for analyzing performances of LDPC codes), it has been shown through some examples that as the block size goes to infinity, low iterative decoding thresholds close to channel capacity limits can be achieved for the codes of the type in question having low maximum variable node degrees. The decoding thresholds in these examples are lower than those of the best-known unstructured irregular LDPC codes constrained to have the same maximum node degrees. Furthermore, the present method enables the construction of codes of any desired rate with thresholds that stay uniformly close to their respective channel capacity thresholds.

Top