tandem repeat proteins: Topics by Science.gov

Sample records for tandem repeat proteins

Tandem-repeat protein domains across the tree of life.

PubMed

Jernigan, Kristin K; Bordenstein, Seth R

2015-01-01

Tandem-repeat protein domains, composed of repeated units of conserved stretches of 20-40 amino acids, are required for a wide array of biological functions. Despite their diverse and fundamental functions, there has been no comprehensive assessment of their taxonomic distribution, incidence, and associations with organismal lifestyle and phylogeny. In this study, we assess for the first time the abundance of armadillo (ARM) and tetratricopeptide (TPR) repeat domains across all three domains in the tree of life and compare the results to our previous analysis on ankyrin (ANK) repeat domains in this journal. All eukaryotes and a majority of the bacterial and archaeal genomes analyzed have a minimum of one TPR and ARM repeat. In eukaryotes, the fraction of ARM-containing proteins is approximately double that of TPR and ANK-containing proteins, whereas bacteria and archaea are enriched in TPR-containing proteins relative to ARM- and ANK-containing proteins. We show in bacteria that phylogenetic history, rather than lifestyle or pathogenicity, is a predictor of TPR repeat domain abundance, while neither phylogenetic history nor lifestyle predicts ARM repeat domain abundance. Surprisingly, pathogenic bacteria were not enriched in TPR-containing proteins, which have been associated within virulence factors in certain species. Taken together, this comparative analysis provides a newly appreciated view of the prevalence and diversity of multiple types of tandem-repeat protein domains across the tree of life. A central finding of this analysis is that tandem repeat domain-containing proteins are prevalent not just in eukaryotes, but also in bacterial and archaeal species.
Tandem-repeat protein domains across the tree of life

PubMed Central

Jernigan, Kristin K.

2015-01-01

Tandem-repeat protein domains, composed of repeated units of conserved stretches of 20–40 amino acids, are required for a wide array of biological functions. Despite their diverse and fundamental functions, there has been no comprehensive assessment of their taxonomic distribution, incidence, and associations with organismal lifestyle and phylogeny. In this study, we assess for the first time the abundance of armadillo (ARM) and tetratricopeptide (TPR) repeat domains across all three domains in the tree of life and compare the results to our previous analysis on ankyrin (ANK) repeat domains in this journal. All eukaryotes and a majority of the bacterial and archaeal genomes analyzed have a minimum of one TPR and ARM repeat. In eukaryotes, the fraction of ARM-containing proteins is approximately double that of TPR and ANK-containing proteins, whereas bacteria and archaea are enriched in TPR-containing proteins relative to ARM- and ANK-containing proteins. We show in bacteria that phylogenetic history, rather than lifestyle or pathogenicity, is a predictor of TPR repeat domain abundance, while neither phylogenetic history nor lifestyle predicts ARM repeat domain abundance. Surprisingly, pathogenic bacteria were not enriched in TPR-containing proteins, which have been associated within virulence factors in certain species. Taken together, this comparative analysis provides a newly appreciated view of the prevalence and diversity of multiple types of tandem-repeat protein domains across the tree of life. A central finding of this analysis is that tandem repeat domain-containing proteins are prevalent not just in eukaryotes, but also in bacterial and archaeal species. PMID:25653910
Rational design of alpha-helical tandem repeat proteins with closed architectures

PubMed Central

Doyle, Lindsey; Hallinan, Jazmine; Bolduc, Jill; Parmeggiani, Fabio; Baker, David; Stoddard, Barry L.; Bradley, Philip

2015-01-01

Tandem repeat proteins, which are formed by repetition of modular units of protein sequence and structure, play important biological roles as macromolecular binding and scaffolding domains, enzymes, and building blocks for the assembly of fibrous materials1,2. The modular nature of repeat proteins enables the rapid construction and diversification of extended binding surfaces by duplication and recombination of simple building blocks3,4. The overall architecture of tandem repeat protein structures – which is dictated by the internal geometry and local packing of the repeat building blocks – is highly diverse, ranging from extended, super-helical folds that bind peptide, DNA, and RNA partners5–9, to closed and compact conformations with internal cavities suitable for small molecule binding and catalysis10. Here we report the development and validation of computational methods for de novo design of tandem repeat protein architectures driven purely by geometric criteria defining the inter-repeat geometry, without reference to the sequences and structures of existing repeat protein families. We have applied these methods to design a series of closed alpha-solenoid11 repeat structures (alpha-toroids) in which the inter-repeat packing geometry is constrained so as to juxtapose the N- and C-termini; several of these designed structures have been validated by X-ray crystallography. Unlike previous approaches to tandem repeat protein engineering12–20, our design procedure does not rely on template sequence or structural information taken from natural repeat proteins and hence can produce structures unlike those seen in nature. As an example, we have successfully designed and validated closed alpha-solenoid repeats with a left-handed helical architecture that – to our knowledge – is not yet present in the protein structure database21. PMID:26675735
Tandem Repeat Proteins Inspired By Squid Ring Teeth

NASA Astrophysics Data System (ADS)

Pena-Francesch, Abdon

Proteins are large biomolecules consisting of long chains of amino acids that hierarchically assemble into complex structures, and provide a variety of building blocks for biological materials. The repetition of structural building blocks is a natural evolutionary strategy for increasing the complexity and stability of protein structures. However, the relationship between amino acid sequence, structure, and material properties of protein systems remains unclear due to the lack of control over the protein sequence and the intricacies of the assembly process. In order to investigate the repetition of protein building blocks, a recently discovered protein from squids is examined as an ideal protein system. Squid ring teeth are predatory appendages located inside the suction cups that provide a strong grasp of prey, and are solely composed of a group of proteins with tandem repetition of building blocks. The objective of this thesis is the understanding of sequence, structure and property relationship in repetitive protein materials inspired in squid ring teeth for the first time. Specifically, this work focuses on squid-inspired structural proteins with tandem repeat units in their sequence (i.e., repetition of alternating building blocks) that are physically cross-linked via beta-sheet structures. The research work presented here tests the hypothesis that, in these systems, increasing the number of building blocks in the polypeptide chain decreases the protein network defects and improves the material properties. Hence, the sequence, nanostructure, and properties (thermal, mechanical, and conducting) of tandem repeat squid-inspired protein materials are examined. Spectroscopic structural analysis, advanced materials characterization, and entropic elasticity theory are combined to elucidate the structure and material properties of these repetitive proteins. This approach is applied not only to native squid proteins but also to squid-inspired synthetic polypeptides
A TALE-inspired computational screen for proteins that contain approximate tandem repeats.

PubMed

Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias

2017-01-01

TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen.
ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae.

PubMed

Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta

2012-11-07

Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40
ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae

PubMed Central

2012-01-01

Background Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. Results ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. Conclusions We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the
A TALE-inspired computational screen for proteins that contain approximate tandem repeats

PubMed Central

Krwawicz, Joanna

2017-01-01

TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen. PMID:28617832
RepeatsDB-lite: a web server for unit annotation of tandem repeat proteins.

PubMed

Hirsh, Layla; Paladin, Lisanna; Piovesan, Damiano; Tosatto, Silvio C E

2018-05-09

RepeatsDB-lite (http://protein.bio.unipd.it/repeatsdb-lite) is a web server for the prediction of repetitive structural elements and units in tandem repeat (TR) proteins. TRs are a widespread but poorly annotated class of non-globular proteins carrying heterogeneous functions. RepeatsDB-lite extends the prediction to all TR types and strongly improves the performance both in terms of computational time and accuracy over previous methods, with precision above 95% for solenoid structures. The algorithm exploits an improved TR unit library derived from the RepeatsDB database to perform an iterative structural search and assignment. The web interface provides tools for analyzing the evolutionary relationships between units and manually refine the prediction by changing unit positions and protein classification. An all-against-all structure-based sequence similarity matrix is calculated and visualized in real-time for every user edit. Reviewed predictions can be submitted to RepeatsDB for review and inclusion.
Molecular tandem repeat strategy for elucidating mechanical properties of high-strength proteins

PubMed Central

Jung, Huihun; Pena-Francesch, Abdon; Saadat, Alham; Sebastian, Aswathy; Kim, Dong Hwan; Hamilton, Reginald F.; Albert, Istvan; Allen, Benjamin D.; Demirel, Melik C.

2016-01-01

Many globular and structural proteins have repetitions in their sequences or structures. However, a clear relationship between these repeats and their contribution to the mechanical properties remains elusive. We propose a new approach for the design and production of synthetic polypeptides that comprise one or more tandem copies of a single unit with distinct amorphous and ordered regions. Our designed sequences are based on a structural protein produced in squid suction cups that has a segmented copolymer structure with amorphous and crystalline domains. We produced segmented polypeptides with varying repeat number, while keeping the lengths and compositions of the amorphous and crystalline regions fixed. We showed that mechanical properties of these synthetic proteins could be tuned by modulating their molecular weights. Specifically, the toughness and extensibility of synthetic polypeptides increase as a function of the number of tandem repeats. This result suggests that the repetitions in native squid proteins could have a genetic advantage for increased toughness and flexibility. PMID:27222581
Tandem Repeats in Proteins: Prediction Algorithms and Biological Role.

PubMed

Pellegrini, Marco

2015-01-01

Tandem repetitions in protein sequence and structure is a fascinating subject of research which has been a focus of study since the late 1990s. In this survey, we give an overview on the multi-faceted aspects of research on protein tandem repeats (PTR for short), including prediction algorithms, databases, early classification efforts, mechanisms of PTR formation and evolution, and synthetic PTR design. We also touch on the rather open issue of the relationship between PTR and flexibility (or disorder) in proteins. Detection of PTR either from protein sequence or structure data is challenging due to inherent high (biological) signal-to-noise ratio that is a key feature of this problem. As early in silico analytic tools have been key enablers for starting this field of study, we expect that current and future algorithmic and statistical breakthroughs will have a high impact on the investigations of the biological role of PTR.
[Molecular cloning and characterization of a novel Clonorchis sinensis antigenic protein containing tandem repeat sequences].

PubMed

Liu, Qian; Xu, Xue-Nian; Zhou, Yan; Cheng, Na; Dong, Yu-Ting; Zheng, Hua-Jun; Zhu, Yong-Qiang; Zhu, Yong-Qiang

2013-08-01

To find and clone new antigen genes from the lambda-ZAP cDNA expression library of adult Clonorchis sinensis, and determine the immunological characteristics of the recombinant proteins. The cDNA expression library of adult C. sinensis was screened by pooled sera of clonorchiasis patients. The sequences of the positive phage clones were compared with the sequences in EST database, and the full-length sequence of the gene (Cs22 gene) was obtained by RT-PCR. cDNA fragments containing 2 and 3 times tandem repeat sequences were generated by jumping PCR. The sequence encoding the mature peptide or the tandem repeat sequence was respectively cloned into the prokaryotic expression vector pET28a (+), and then transformed into E. coli Rosetta DE3 cells for expression. The recombinant proteins (rCs22-2r, rCs22-3r, rCs22M-2r, and rCs22M-3r) were purified by His-bind-resin (Ni-NTA) affinity chromatography. The immunogenicity of rCs22-2r and rCs22-3r was identified by ELISA. To evaluate the immunological diagnostic value of rCs22-2r and rCs22-3r, serum samples from 35 clonorchiasis patients, 31 healthy individuals, 15 schistosomiasis patients, 15 paragonimiasis westermani patients and 13 cysticercosis patients were examined by ELISA. To locate antigenic determinants, the pooled sera of clonorchiasis patients and healthy persons were analyzed for specific antibodies by ELISA with recombinant protein rCs22M-2r and rCs22M-3r containing the tandem repeat sequences. The full-length sequence of Cs22 antigen gene of C. sinensis was obtained. It contained 13 times tandem repeat sequences of EQQDGDEEGMGGDGGRGKEKGKVEGEDGAGEQKEQA. Bioinformatics analysis indicated that the protein (Cs22) belonged to GPI-anchored proteins family. The recombinant proteins rCs22-2r and rCs22-3r showed a certain level of immunogenicity. The positive rate by ELISA coated with the purified PrCs22-2r and PrCs22-3r for sera of clonorchiasis patients both were 45.7% (16/35), and 3.2% (1/31) for those of healthy
Versatile communication strategies among tandem WW domain repeats

PubMed Central

Dodson, Emma Joy; Fishbain-Yoskovitz, Vered; Rotem-Bamberger, Shahar

2015-01-01

Interactions mediated by short linear motifs in proteins play major roles in regulation of cellular homeostasis since their transient nature allows for easy modulation. We are still far from a full understanding and appreciation of the complex regulation patterns that can be, and are, achieved by this type of interaction. The fact that many linear-motif-binding domains occur in tandem repeats in proteins indicates that their mutual communication is used extensively to obtain complex integration of information toward regulatory decisions. This review is an attempt to overview, and classify, different ways by which two and more tandem repeats cooperate in binding to their targets, in the well-characterized family of WW domains and their corresponding polyproline ligands. PMID:25710931
[Polymorphic loci and polymorphism analysis of short tandem repeats within XNP gene].

PubMed

Liu, Qi-Ji; Gong, Yao-Qin; Guo, Chen-Hong; Chen, Bing-Xi; Li, Jiang-Xia; Guo, Yi-Shou

2002-01-01

To select polymorphic short tandem repeat markers within X-linked nuclear protein (XNP) gene, genomic clones which contain XNP gene were recognized by homologous analysis with XNP cDNA. By comparing the cDNA with genomic DNA, non-exonic sequences were identified, and short tandem repeats were selected from non-exonic sequences by using BCM search Launcher. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five short tandem repeats were identified from XNP gene, two of which were polymorphic. Four and 11 alleles were observed in Chinese population for XNPSTR1 and XNPSTR4, respectively. Heterozygosities were 47% for XNPSTR1 and 70% for XNPSTR4. XNPSTR1 and XNPSTR4 localized within 3' end and intron 10, respectively. Two polymorphic short tandem repeats have been identified within XNP gene and will be useful for linkage analysis and gene diagnosis of XNP gene.
Topological characteristics of helical repeat proteins.

PubMed

Groves, M R; Barford, D

1999-06-01

The recent elucidation of protein structures based upon repeating amino acid motifs, including the armadillo motif, the HEAT motif and tetratricopeptide repeats, reveals that they belong to the class of helical repeat proteins. These proteins share the common property of being assembled from tandem repeats of an alpha-helical structural unit, creating extended superhelical structures that are ideally suited to create a protein recognition interface.
Short Tandem Repeat DNA Internet Database

National Institute of Standards and Technology Data Gateway

SRD 130 Short Tandem Repeat DNA Internet Database (Web, free access) Short Tandem Repeat DNA Internet Database is intended to benefit research and application of short tandem repeat DNA markers for human identity testing. Facts and sequence information on each STR system, population data, commonly used multiplex STR systems, PCR primers and conditions, and a review of various technologies for analysis of STR alleles have been included.
The evolution and function of protein tandem repeats in plants.

PubMed

Schaper, Elke; Anisimova, Maria

2015-04-01

Sequence tandem repeats (TRs) are abundant in proteomes across all domains of life. For plants, little is known about their distribution or contribution to protein function. We exhaustively annotated TRs and studied the evolution of TR unit variations for all Ensembl plants. Using phylogenetic patterns of TR units, we detected conserved TRs with unit number and order preserved during evolution, and those TRs that have diverged via recent TR unit gains/losses. We correlated the mode of evolution of TRs to protein function. TR number was strongly correlated with proteome size, with about one-half of all TRs recognized as common protein domains. The majority of TRs have been highly conserved over long evolutionary distances, some since the separation of red algae and green plants c. 1.6 billion yr ago. Conversely, recurrent recent TR unit mutations were rare. Our results suggest that the first TRs by far predate the first plants, and that TR appearance is an ongoing process with similar rates across the plant kingdom. Interestingly, the few detected highly mutable TRs might provide a source of variation for rapid adaptation. In particular, such TRs are enriched in leucine-rich repeats (LRRs) commonly found in R genes, where TR unit gain/loss may facilitate resistance to emerging pathogens. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.
Exploring the repeat protein universe through computational protein design

DOE PAGES

Brunette, TJ; Parmeggiani, Fabio; Huang, Po-Ssu; ...

2015-12-16

A central question in protein evolution is the extent to which naturally occurring proteins sample the space of folded structures accessible to the polypeptide chain. Repeat proteins composed of multiple tandem copies of a modular structure unit are widespread in nature and have critical roles in molecular recognition, signalling, and other essential biological processes. Naturally occurring repeat proteins have been re-engineered for molecular recognition and modular scaffolding applications. In this paper, we use computational protein design to investigate the space of folded structures that can be generated by tandem repeating a simple helix–loop–helix–loop structural motif. Eighty-three designs with sequences unrelatedmore » to known repeat proteins were experimentally characterized. Of these, 53 are monomeric and stable at 95 °C, and 43 have solution X-ray scattering spectra consistent with the design models. Crystal structures of 15 designs spanning a broad range of curvatures are in close agreement with the design models with root mean square deviations ranging from 0.7 to 2.5 Å. Finally, our results show that existing repeat proteins occupy only a small fraction of the possible repeat protein sequence and structure space and that it is possible to design novel repeat proteins with precisely specified geometries, opening up a wide array of new possibilities for biomolecular engineering.« less
Ehrlichia chaffeensis Tandem Repeat Proteins and Ank200 are Type 1 Secretion System Substrates Related to the Repeats-in-Toxin Exoprotein Family

PubMed Central

Wakeel, Abdul; den Dulk-Ras, Amke; Hooykaas, Paul J. J.; McBride, Jere W.

2011-01-01

Ehrlichia chaffeensis has type 1 and 4 secretion systems (T1SS and T4SS), but the substrates have not been identified. Potential substrates include secreted tandem repeat protein (TRP) 47, TRP120, and TRP32, and the ankyrin repeat protein, Ank200, that are involved in molecular host–pathogen interactions including DNA binding and a network of protein–protein interactions with host targets associated with signaling, transcriptional regulation, vesicle trafficking, and apoptosis. In this study we report that E. chaffeensis TRP47, TRP32, TRP120, and Ank200 were not secreted in the Agrobacterium tumefaciens Cre recombinase reporter assay routinely used to identify T4SS substrates. In contrast, all TRPs and the Ank200 proteins were secreted by the Escherichia coli complemented with the hemolysin secretion system (T1SS), and secretion was reduced in a T1SS mutant (ΔTolC), demonstrating that these proteins are T1SS substrates. Moreover, T1SS secretion signals were identified in the C-terminal domains of the TRPs and Ank200, and a detailed bioinformatic analysis of E. chaffeensis TRPs and Ank200 revealed features consistent with those described in the repeats-in-toxins (RTX) family of exoproteins, including glycine- and aspartate-rich tandem repeats, homology with ATP-transporters, a non-cleavable C-terminal T1SS signal, acidic pIs, and functions consistent with other T1SS substrates. Using a heterologous E. coli T1SS, this investigation has identified the first Ehrlichia T1SS substrates supporting the conclusion that the T1SS and corresponding substrates are involved in molecular host–pathogen interactions that contribute to Ehrlichia pathobiology. Further investigation of the relationship between Ehrlichia TRPs, Ank200, and the RTX exoprotein family may lead to a greater understanding of the importance of T1SS substrates and specific functions of T1SS in the pathobiology of obligately intracellular bacteria. PMID:22919588
New paradigm in ankyrin repeats: Beyond protein-protein interaction module.

PubMed

Islam, Zeyaul; Nagampalli, Raghavendra Sashi Krishna; Fatima, Munazza Tamkeen; Ashraf, Ghulam Md

2018-04-01

Classically, ankyrin repeat (ANK) proteins are built from tandems of two or more repeats and form curved solenoid structures that are associated with protein-protein interactions. These are short, widespread structural motif of around 33 amino acids repeats in tandem, having a canonical helix-loop-helix fold, found individually or in combination with other domains. The multiplicity of structural pattern enables it to form assemblies of diverse sizes, required for their abilities to confer multiple binding and structural roles of proteins. Three-dimensional structures of these repeats determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. Recent work on the ANK has proposed novel structural information, especially protein-lipid, protein-sugar and protein-protein interaction. Self-assembly of these repeats was also shown to prevent the associated protein in forming filaments. In this review, we summarize the latest findings and how the new structural information has increased our understanding of the structural determinants of ANK proteins. We discussed latest findings on how these proteins participate in various interactions to diversify the ANK roles in numerous biological processes, and explored the emerging and evolving field of designer ankyrins and its framework for protein engineering emphasizing on biotechnological applications. Copyright © 2017 Elsevier B.V. All rights reserved.

The evolution of filamin-a protein domain repeat perspective.

PubMed

Light, Sara; Sagit, Rauan; Ithychanda, Sujay S; Qin, Jun; Elofsson, Arne

2012-09-01

Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates. Copyright © 2012 Elsevier Inc. All rights reserved.
Sequence repeats and protein structure

NASA Astrophysics Data System (ADS)

Hoang, Trinh X.; Trovato, Antonio; Seno, Flavio; Banavar, Jayanth R.; Maritan, Amos

2012-11-01

Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.
TRDistiller: a rapid filter for enrichment of sequence datasets with proteins containing tandem repeats.

PubMed

Richard, François D; Kajava, Andrey V

2014-06-01

The dramatic growth of sequencing data evokes an urgent need to improve bioinformatics tools for large-scale proteome analysis. Over the last two decades, the foremost efforts of computer scientists were devoted to proteins with aperiodic sequences having globular 3D structures. However, a large portion of proteins contain periodic sequences representing arrays of repeats that are directly adjacent to each other (so called tandem repeats or TRs). These proteins frequently fold into elongated fibrous structures carrying different fundamental functions. Algorithms specific to the analysis of these regions are urgently required since the conventional approaches developed for globular domains have had limited success when applied to the TR regions. The protein TRs are frequently not perfect, containing a number of mutations, and some of them cannot be easily identified. To detect such "hidden" repeats several algorithms have been developed. However, the most sensitive among them are time-consuming and, therefore, inappropriate for large scale proteome analysis. To speed up the TR detection we developed a rapid filter that is based on the comparison of composition and order of short strings in the adjacent sequence motifs. Tests show that our filter discards up to 22.5% of proteins which are known to be without TRs while keeping almost all (99.2%) TR-containing sequences. Thus, we are able to decrease the size of the initial sequence dataset enriching it with TR-containing proteins which allows a faster subsequent TR detection by other methods. The program is available upon request. Copyright © 2014 Elsevier Inc. All rights reserved.
The evolution of filamin – A protein domain repeat perspective

PubMed Central

Light, Sara; Sagit, Rauan; Ithychanda, Sujay S.; Qin, Jun; Elofsson, Arne

2013-01-01

Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates. PMID:22414427
Ten tandem repeats of {beta}-hCG 109-118 enhance immunogenicity and anti-tumor effects of {beta}-hCG C-terminal peptide carried by mycobacterial heat-shock protein HSP65

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang Yankai; Yan Rong; He Yi

2006-07-14

The {beta}-subunit of human chorionic gonadotropin ({beta}-hCG) is secreted by many kinds of tumors and it has been used as an ideal target antigen to develop vaccines against tumors. In view of the low immunogenicity of this self-peptide,we designed a method based on isocaudamer technique to repeat tandemly the 10-residue sequence X of {beta}-hCG (109-118), then 10 tandemly repeated copies of the 10-residue sequence combined with {beta}-hCG C-terminal 37 peptides were fused to mycobacterial heat-shock protein 65 to construct a fusion protein HSP65-X10-{beta}hCGCTP37 as an immunogen. In this study, we examined the effect of the tandem repeats of this 10-residuemore » sequence in eliciting an immune by comparing the immunogenicity and anti-tumor effects of the two immunogens, HSP65-X10-{beta}hCGCTP37 and HSP65-{beta}hCGCTP37 (without the 10 tandem repeats). Immunization of mice with the fusion protein HSP65-X10-{beta}hCGCTP37 elicited much higher levels of specific anti-{beta}-hCG antibodies and more effectively inhibited the growth of Lewis lung carcinoma (LLC) in vivo than with HSP65-{beta}hCGCTP37, which should suggest that HSP65-X10-{beta}hCGCTP37 may be an effective protein vaccine for the treatment of {beta}-hCG-dependent tumors and multiple tandem repeats of a certain epitope are an efficient method to overcome the low immunogenicity of self-peptide antigens.« less
Stabilization of perfect and imperfect tandem repeats by single-strand DNA exonucleases

PubMed Central

Feschenko, Vladimir V.; Rajman, Luis A.; Lovett, Susan T.

2003-01-01

Rearrangements between tandemly repeated DNA sequences are a common source of genetic instability. Such rearrangements underlie several human genetic diseases. In many organisms, the mismatch-repair (MMR) system functions to stabilize repeats when the repeat unit is short or when sequence imperfections are present between the repeats. We show here that the action of single-stranded DNA (ssDNA) exonucleases plays an additional, important role in stabilizing tandem repeats, independent of their role in MMR. For perfect repeats of ≈100 bp in Escherichia coli that are not susceptible to MMR, exonuclease (Exo)-I, ExoX, and RecJ exonuclease redundantly inhibit deletion. Our data suggest that >90% of potential deletion events are avoided by the combined action of these three exonucleases. Imperfect tandem repeats, less prone to rearrangements, are stabilized by both the MMR-pathway and ssDNA-specific exonucleases. For 100-bp repeats containing four mispairs, ExoI alone aborts most deletion events, even in the presence of a functional MMR system. By genetic analysis, we show that the inhibitory effect of ssDNA exonucleases on deletion formation is independent of the MutS and UvrD proteins. Exonuclease degradation of DNA displaced during the deletion process may abort slipped misalignment. Exonuclease action is therefore a significant force in genetic stabilization of many forms of repetitive DNA. PMID:12538867
TRedD—A database for tandem repeats over the edit distance

PubMed Central

Sokol, Dina; Atagun, Firat

2010-01-01

A ‘tandem repeat’ in DNA is a sequence of two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats are common in the genomes of both eukaryotic and prokaryotic organisms. They are significant markers for human identity testing, disease diagnosis, sequence homology and population studies. In this article, we describe a new database, TRedD, which contains the tandem repeats found in the human genome. The database is publicly available online, and the software for locating the repeats is also freely available. The definition of tandem repeats used by TRedD is a new and innovative definition based upon the concept of ‘evolutive tandem repeats’. In addition, we have developed a tool, called TandemGraph, to graphically depict the repeats occurring in a sequence. This tool can be coupled with any repeat finding software, and it should greatly facilitate analysis of results. Database URL: http://tandem.sci.brooklyn.cuny.edu/ PMID:20624712
TRAP: automated classification, quantification and annotation of tandemly repeated sequences.

PubMed

Sobreira, Tiago José P; Durham, Alan M; Gruber, Arthur

2006-02-01

TRAP, the Tandem Repeats Analysis Program, is a Perl program that provides a unified set of analyses for the selection, classification, quantification and automated annotation of tandemly repeated sequences. TRAP uses the results of the Tandem Repeats Finder program to perform a global analysis of the satellite content of DNA sequences, permitting researchers to easily assess the tandem repeat content for both individual sequences and whole genomes. The results can be generated in convenient formats such as HTML and comma-separated values. TRAP can also be used to automatically generate annotation data in the format of feature table and GFF files.
Ligand binding by repeat proteins: natural and designed

PubMed Central

Grove, Tijana Z; Cortajarena, Aitziber L; Regan, Lynne

2012-01-01

Repeat proteins contain tandem arrays of small structural motifs. As a consequence of this architecture, they adopt non-globular, extended structures that present large, highly specific surfaces for ligand binding. Here we discuss recent advances toward understanding the functional role of this unique modular architecture. We showcase specific examples of natural repeat proteins interacting with diverse ligands and also present examples of designed repeat protein–ligand interactions. PMID:18602006
Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.

PubMed

Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru

2015-01-01

The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.
Typing Clostridium difficile strains based on tandem repeat sequences

PubMed Central

2009-01-01

Background Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124
Identification and characterization of tandem repeats in exon III of dopamine receptor D4 (DRD4) genes from different mammalian species.

PubMed

Larsen, Svend Arild; Mogensen, Line; Dietz, Rune; Baagøe, Hans Jørgen; Andersen, Mogens; Werge, Thomas; Rasmussen, Henrik Berg

2005-12-01

In this study we have identified and characterized dopamine receptor D4 (DRD4) exon III tandem repeats in 33 public available nucleotide sequences from different mammalian species. We found that the tandem repeat in canids could be described in a novel and simple way, namely, as a structure composed of 15- and 12- bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, and donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp tandem repeat, while a tandem repeat consisting of 27-bp modules was identified in a sequence from European badger. Both these tandem repeats were composed of 9-bp basic units, which were closely related with the 9-bp repeat modules identified in the mink and ferret. Tandem repeats could not be identified in sequences from rodents. All tandem repeats possessed a high GC content with a strong bias for C. On phylogenetic analysis of the tandem repeats evolutionary related species were clustered into the same groups. The degree of conservation of the tandem repeats varied significantly between species. The deduced amino acid sequences of most of the tandem repeats exhibited a high propensity for disorder. This was also the case with an amino acid sequence of the human DRD4 exon III tandem repeat, which was included in the study for comparative purposes. We identified proline-containing motifs for SH3 and WW domain binding proteins, potential phosphorylation sites, PDZ domain binding motifs, and FHA domain binding motifs in the amino acid sequences of the tandem repeats. The numbers of
Repeat-containing protein effectors of plant-associated organisms

PubMed Central

Mesarich, Carl H.; Bowen, Joanna K.; Hamiaux, Cyril; Templeton, Matthew D.

2015-01-01

Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs) that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regards to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms. PMID:26557126
Repeat-containing protein effectors of plant-associated organisms.

PubMed

Mesarich, Carl H; Bowen, Joanna K; Hamiaux, Cyril; Templeton, Matthew D

2015-01-01

Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs) that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regards to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms.
Protein arginine methyltransferase 7 has a novel homodimer-like structure formed by tandem repeats.

PubMed

Hasegawa, Morio; Toma-Fukai, Sachiko; Kim, Jun-Dal; Fukamizu, Akiyoshi; Shimizu, Toshiyuki

2014-05-21

Protein arginine methyltransferase 7 (PRMT7) is a member of a family of enzymes that catalyze the transfer of methyl groups from S-adenosyl-l-methionine to nitrogen atoms on arginine residues. Here, we describe the crystal structure of Caenorhabditis elegans PRMT7 in complex with its reaction product S-adenosyl-L-homocysteine. The structural data indicated that PRMT7 harbors two tandem repeated PRMT core domains that form a novel homodimer-like structure. S-adenosyl-L-homocysteine bound to the N-terminal catalytic site only; the C-terminal catalytic site is occupied by a loop that inhibits cofactor binding. Mutagenesis demonstrated that only the N-terminal catalytic site of PRMT7 is responsible for cofactor binding. Copyright © 2014 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Mechanical unfolding of an ankyrin repeat protein.

PubMed

Serquera, David; Lee, Whasil; Settanni, Giovanni; Marszalek, Piotr E; Paci, Emanuele; Itzhaki, Laura S

2010-04-07

Ankryin repeat proteins comprise tandem arrays of a 33-residue, predominantly alpha-helical motif that stacks roughly linearly to produce elongated and superhelical structures. They function as scaffolds mediating a diverse range of protein-protein interactions, and some have been proposed to play a role in mechanical signal transduction processes in the cell. Here we use atomic force microscopy and molecular-dynamics simulations to investigate the natural 7-ankyrin repeat protein gankyrin. We find that gankyrin unfolds under force via multiple distinct pathways. The reactions do not proceed in a cooperative manner, nor do they always involve fully stepwise unfolding of one repeat at a time. The peeling away of half an ankyrin repeat, or one or more ankyrin repeats, occurs at low forces; however, intermediate species are formed that are resistant to high forces, and the simulations indicate that in some instances they are stabilized by nonnative interactions. The unfolding of individual ankyrin repeats generates a refolding force, a feature that may be more easily detected in these proteins than in globular proteins because the refolding of a repeat involves a short contraction distance and incurs a low entropic cost. We discuss the origins of the differences between the force- and chemical-induced unfolding pathways of ankyrin repeat proteins, as well as the differences between the mechanics of natural occurring ankyrin repeat proteins and those of designed consensus ankyin repeat and globular proteins. Copyright (c) 2010 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Immunogenicity of a recombinant fusion protein of tandem repeat epitopes of foot-and-mouth disease virus type Asia 1 for guinea pigs.

PubMed

Zhang, Q; Yang, Y Q; Zhang, Z Y; Li, L; Yan, W Y; Jiang, W J; Xin, A G; Lei, C X; Zheng, Z X

2002-01-01

In this study, the sequences of capsid protein VPI regions of YNAs1.1 and YNAs1.2 isolates of foot-and-mouth disease virus (FMDV) were analyzed and a peptide containing amino acids (aa) 133-158 of VP1 and aa 20-34 of VP4 of FMDV type Asia I was assumed to contain B and T cell epitopes, because it is hypervariable and includes a cell attachment site RGD located in the G-H loop. The DNA fragments encoding aa 133-158 of VP1 and aa 20-34 of VP4 of FMDV type Asia 1 were chemically synthesized and ligated into a tandem repeat of aa 133-158-20 approximately 34-133-158. In order to enhance its immunogenicity, the tandem repeat was inserted downstream of the beta-galactosidase gene in the expression vector pWR590. This insertion yielded a recombinant expression vector pAS1 encoding the fusion protein. The latter reacted with sera from FMDV type Asia 1-infected animals in vitro and elicited high levels of neutralizing antibodies in guinea pigs. The T cell proliferation in immunized animals increased following stimulation with the fusion protein. It is reported for the first time that a recombinant fusion protein vaccine was produced using B and T cell epitopes of FMDV type Asia 1 and that this fusion protein was immunogenic. The fusion protein reported here can serve as a candidate of fusion epitopes for design of a vaccine against FMDV type Asia 1.
De novo generation of plant centromeres at tandem repeats.

PubMed

Teo, Chee How; Lermontova, Inna; Houben, Andreas; Mette, Michael Florian; Schubert, Ingo

2013-06-01

Artificial minichromosomes are highly desirable tools for basic research, breeding, and biotechnology purposes. We present an option to generate plant artificial minichromosomes via de novo engineering of plant centromeres in Arabidopsis thaliana by targeting kinetochore proteins to tandem repeat arrays at non-centromeric positions. We employed the bacterial lactose repressor/lactose operator system to guide derivatives of the centromeric histone H3 variant cenH3 to LacO operator sequences. Tethering of cenH3 to non-centromeric loci led to de novo assembly of kinetochore proteins and to dicentric carrier chromosomes which potentially form anaphase bridges. This approach will be further developed and may contribute to generating minichromosomes from preselected genomic regions, potentially even in a diploid background.
Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution.

PubMed

Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L

2013-01-30

Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.
Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

PubMed Central

2013-01-01

Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705

A Legionella pneumophila collagen-like protein encoded by a gene with a variable number of tandem repeats is involved in the adherence and invasion of host cells.

PubMed

Vandersmissen, Liesbeth; De Buck, Emmy; Saels, Veerle; Coil, David A; Anné, Jozef

2010-05-01

Legionella pneumophila is a Gram-negative, facultative intracellular pathogen and the causative agent of Legionnaires' disease, a severe pneumonia in humans. Analysis of the Legionella sequenced genomes revealed a gene with a variable number of tandem repeats (VNTRs), whose number varies between strains. We examined the strain distribution of this gene among a collection of 108 clinical, environmental and hot spring serotype I strains. Twelve variants were identified, but no correlation was observed between the number of repeat units and clinical and environmental strains. The encoded protein contains the C-terminal consensus motif of outer membrane proteins and has a large region of collagen-like repeats that is encoded by the VNTR region. We have therefore annotated this protein Lcl for Legionella collagen-like protein. Lcl was shown to contribute to the adherence and invasion of host cells and it was demonstrated that the number of repeat units present in lcl had an influence on these adhesion characteristics.
The proliferation marker pKi-67 becomes masked to MIB-1 staining after expression of its tandem repeats.

PubMed

Schmidt, Mirko H H; Broll, Rainer; Bruch, Hans-Peter; Duchrow, Michael

2002-11-01

The Ki-67 antigen, pKi-67, is one of the most commonly used markers of proliferating cells. The protein can only be detected in dividing cells (G(1)-, S-, G(2)-, and M-phase) but not in quiescent cells (G(0)). The standard antibody to detect pKi-67 is MIB-1, which detects the so-called 'Ki-67 motif' FKELF in 9 of the protein's 16 tandem repeats. To investigate the function of these repeats we expressed three of them in an inducible gene expression system in HeLa cells. Surprisingly, addition of a nuclear localization sequence led to a complete absence of signal in the nuclei of MIB-1-stained cells. At the same time antibodies directed against different epitopes of pKi-67 did not fail to detect the protein. We conclude that the overexpression of the 'Ki-67 motif', which is present in the repeats, can lead to inability of MIB-1 to detect its antigen as demonstrated in adenocarcinoma tissue samples. Thereafter, in order to prevent the underestimation of Ki-67 proliferation indices in MIB-1-labeled preparations, additional antibodies (for example, MIB-21) should be used. Additionally, we could show in a mammalian two-hybrid assay that recombinant pKi-67 repeats are capable of self-associating with endogenous pKi-67. Speculating that the tandem repeats are intimately involved in its protein-protein interactions, this offers new insights in how access to these repeats is regulated by pKi-67 itself.
Characterization of the variable-number tandem repeats in vrrA from different Bacillus anthracis isolates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jackson, P.J.; Walthers, E.A.; Richmond, K.L.

1997-04-01

PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats aremore » generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.« less
Molecular characterization and distribution of a 145-bp tandem repeat family in the genus Populus.

PubMed

Rajagopal, J; Das, S; Khurana, D K; Srivastava, P S; Lakshmikumaran, M

1999-10-01

This report aims to describe the identification and molecular characterization of a 145-bp tandem repeat family that accounts for nearly 1.5% of the Populus genome. Three members of this repeat family were cloned and sequenced from Populus deltoides and P. ciliata. The dimers of the repeat were sequenced in order to confirm the head-to-tail organization of the repeat. Hybridization-based analysis using the 145-bp tandem repeat as a probe on genomic DNA gave rise to ladder patterns which were identified to be a result of methylation and (or) sequence heterogeneity. Analysis of the methylation pattern of the repeat family using methylation-sensitive isoschizomers revealed variable methylation of the C residues and lack of methylation of the A residues. Sequence comparisons between the monomers revealed a high degree of sequence divergence that ranged between 6% and 11% in P. deltoides and between 4.2% and 8.3% in P. ciliata. This indicated the presence of sub-families within the 145-bp tandem family of repeats. Divergence was mainly due to the accumulation of point mutations and was concentrated in the central region of the repeat. The 145-bp tandem repeat family did not show significant homology to known tandem repeats from plants. A short stretch of 36 bp was found to show homology of 66.7% to a centromeric repeat from Chironomus plumosus. Dot-blot analysis and Southern hybridization data revealed the presence of the repeat family in 13 of the 14 Populus species examined. The absence of the 145-bp repeat from P. euphratica suggested that this species is relatively distant from other members of the genus, which correlates with taxonomic classifications. The widespread occurrence of the tandem family in the genus indicated that this family may be of ancient origin.
Two tandemly repeated telomere-associated sequences in Nicotiana plumbaginifolia.

PubMed

Chen, C M; Wang, C T; Wang, C J; Ho, C H; Kao, Y Y; Chen, C C

1997-12-01

Two tandemly repeated telomere-associated sequences, NP3R and NP4R, have been isolated from Nicotiana plumbaginifolia. The length of a repeating unit for NP3R and NP4R is 165 and 180 nucleotides respectively. The abundance of NP3R, NP4R and telomeric repeats is, respectively, 8.4 x 10(4), 6 x 10(3) and 1.5 x 10(6) copies per haploid genome of N. plumbaginifolia. Fluorescence in situ hybridization revealed that NP3R is located at the ends and/or in interstitial regions of all 10 chromosomes and NP4R on the terminal regions of three chromosomes in the haploid genome of N. plumbaginifolia. Sequence homology search revealed that not only are NP3R and NP4R homologous to HRS60 and GRS, respectively, two tandem repeats isolated from N. tabacum, but that NP3R and NP4R are also related to each other, suggesting that they originated from a common ancestral sequence. The role of these repeated sequences in chromosome healing is discussed based on the observation that two to three copies of a telomere-similar sequence were present in each repeating unit of NP3R and NP4R.
Variable Number Of Tandem Repeats (VNTR) and its application in bacterial epidemiology.

PubMed

Ramazanzadeh, Rashid; McNerney, Ruth

2007-08-15

Molecular epidemiology is the using of molecular techniques to study bacterial distribution in human populations. Recently molecular epidemiologist benefit from several techniques such as Variable Number Tandem Repeat (VNTR) typing method to typing bacterial strains. Variable Number Tandem Repeat (VNTR) typing is a tool for genotyping and provides data in a simple and numeric format based on the number of repetitive sequences. VNTR for first time identified in M. tuberculosis as Mycobacterial Interspersed Repeat Units (MIRUs). General terms of VNTR have now been reported in Bacillus anthracis, Legionella pneumophila, Pseudomonas aeruginosa, Salmonella enterica and Escherichia coli O157.
Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

USDA-ARS?s Scientific Manuscript database

Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres comprise of megabase-scale arrays of tandem repeats. The true prevalence of centromere tandem repeats, and whether they exhibit conserved seque...
Tandemly repeated sequences in mtDNA control region of whitefish, Coregonus lavaretus.

PubMed

Brzuzan, P

2000-06-01

Length variation of the mitochondrial DNA control region was observed with PCR amplification of a sample of 138 whitefish (Coregonus lavaretus). Nucleotide sequences of representative PCR products showed that the variation was due to the presence of an approximately 100-bp motif tandemly repeated two, three, or five times in the region between the conserved sequence block-3 (CSB-3) and the gene for phenylalanine tRNA. This is the first report on the tandem array composed of long repeat units in mitochondrial DNA of salmonids.
Functional centromeres in Astragalus sinicus include a compact centromere-specific histone H3 and a 20-bp tandem repeat.

PubMed

Tek, Ahmet L; Kashihara, Kazunari; Murata, Minoru; Nagaki, Kiyotaka

2011-11-01

The centromere plays an essential role for proper chromosome segregation during cell division and usually harbors long arrays of tandem repeated satellite DNA sequences. Although this function is conserved among eukaryotes, the sequences of centromeric DNA repeats are variable. Most of our understanding of functional centromeres, which are defined by localization of a centromere-specific histone H3 (CENH3) protein, comes from model organisms. The components of the functional centromere in legumes are poorly known. The genus Astragalus is a member of the legumes and bears the largest numbers of species among angiosperms. Therefore, we studied the components of centromeres in Astragalus sinicus. We identified the CenH3 homolog of A. sinicus, AsCenH3 that is the most compact in size among higher eukaryotes. A CENH3-based assay revealed the functional centromeric DNA sequences from A. sinicus, called CentAs. The CentAs repeat is localized in A. sinicus centromeres, and comprises an AT-rich tandem repeat with a monomer size of 20 nucleotides.
The structure of the protein phosphatase 2A PR65/A subunit reveals the conformation of its 15 tandemly repeated HEAT motifs.

PubMed

Groves, M R; Hanlon, N; Turowski, P; Hemmings, B A; Barford, D

1999-01-08

The PR65/A subunit of protein phosphatase 2A serves as a scaffolding molecule to coordinate the assembly of the catalytic subunit and a variable regulatory B subunit, generating functionally diverse heterotrimers. Mutations of the beta isoform of PR65 are associated with lung and colon tumors. The crystal structure of the PR65/Aalpha subunit, at 2.3 A resolution, reveals the conformation of its 15 tandemly repeated HEAT sequences, degenerate motifs of approximately 39 amino acids present in a variety of proteins, including huntingtin and importin beta. Individual motifs are composed of a pair of antiparallel alpha helices that assemble in a mainly linear, repetitive fashion to form an elongated molecule characterized by a double layer of alpha helices. Left-handed rotations at three interrepeat interfaces generate a novel left-hand superhelical conformation. The protein interaction interface is formed from the intrarepeat turns that are aligned to form a continuous ridge.
Tandem Repeated Irritation Test (TRIT) Studies and Clinical Relevance: Post 2006.

PubMed

Reddy, Rasika; Maibach, Howard

2018-06-11

Single or multiple applications of irritants can lead to occupational contact dermatitis, and most commonly irritant contact dermatitis (ICD). Tandem irritation, the sequential application of two irritants to a target skin area, has been studied using the Tandem Repeated Irritation Test (TRIT) to provide a more accurate representation of skin irritation. Here we present an update to Kartono's review on tandem irritation studies since 2006 [1]. We surveyed the literature available on PubMed, Embase, Google Scholar, and the UCSF Dermatology library databases since 2006. The studies included discuss the tandem effects of common chemical irritants, organic solvents, occlusion as well as clinical relevance - and enlarge our ability to discern whether multiple chemical exposures are more or less likely to enhance irritation.
A novel species-specific tandem repeat DNA family from Sinapis arvensis: detection of telomere-like sequences.

PubMed

Kapila, R; Das, S; Srivastava, P S; Lakshmikumaran, M

1996-08-01

DNA sequences representing a tandemly repeated DNA family of the Sinapis arvensis genome were cloned and characterized. The 700-bp tandem repeat family is represented by two clones, pSA35 and pSA52, which are 697 and 709 bp in length, respectively. Dot matrix analysis of the sequences indicates the presence of repeated elements within each monomeric unit. Sequence analysis of the repetitive region of clones pSA35 and pSA52 shows that there are several copies of a 7-bp repeat element organized in tandem. The consensus sequence of this repeat element is 5'-TTTAGGG-3'. These elements are highly mutated and the difference in length between the two clones is due to different copy numbers of these elements. The repetitive region of clone pSA35 has 26 copies of the element TTTAGGG, whereas clone pSA52 has 28 copies. The repetitive region in both clones is flanked on either side by inverted repeats that may be footprints of a transposition event. Sequence comparison indicates that the element TTTAGGG is identical to telomeric repeats present in Arabidopsis, maize, tomato, and other plants. However, Bal31 digestion kinetics indicates non-telomeric localization of the 700-bp tandem repeats. The clones represent a novel repeat family as (i) they contain telomere-like motifs as subrepeats within each unit; and (ii) they do not hybridize to related crucifers and are species-specific in nature.
Tandem repeat regions within the Burkholderia pseudomallei genome and their application for high resolution genotyping.

PubMed

U'Ren, Jana M; Schupp, James M; Pearson, Talima; Hornstra, Heidie; Friedman, Christine L Clark; Smith, Kimothy L; Daugherty, Rebecca R Leadem; Rhoton, Shane D; Leadem, Ben; Georgia, Shalamar; Cardon, Michelle; Huynh, Lynn Y; DeShazer, David; Harvey, Steven P; Robison, Richard; Gal, Daniel; Mayo, Mark J; Wagner, David; Currie, Bart J; Keim, Paul

2007-03-30

The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays were subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominately located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B.mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 and 0.94. Mutation rates for these loci are comparable (>10-5 per locus per generation) to that of the most diverse tandemly repeated regions found in other less diverse bacteria. The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were identical using previous typing methods. Given the health
Exceptionally long 5' UTR short tandem repeats specifically linked to primates.

PubMed

Namdar-Aligoodarzi, P; Mohammadparast, S; Zaker-Kandjani, B; Talebi Kakroodi, S; Jafari Vesiehsari, M; Ohadi, M

2015-09-10

We have previously reported genome-scale short tandem repeats (STRs) in the core promoter interval (i.e. -120 to +1 to the transcription start site) of protein-coding genes that have evolved identically in primates vs. non-primates. Those STRs may function as evolutionary switch codes for primate speciation. In the current study, we used the Ensembl database to analyze the 5' untranslated region (5' UTR) between +1 and +60 of the transcription start site of the entire human protein-coding genes annotated in the GeneCards database, in order to identify "exceptionally long" STRs (≥5-repeats), which may be of selective/adaptive advantage. The importance of this critical interval is its function as core promoter, and its effect on transcription and translation. In order to minimize ascertainment bias, we analyzed the evolutionary status of the human 5' UTR STRs of ≥5-repeats in several species encompassing six major orders and superorders across mammals, including primates, rodents, Scandentia, Laurasiatheria, Afrotheria, and Xenarthra. We introduce primate-specific STRs, and STRs which have expanded from mouse to primates. Identical co-occurrence of the identified STRs of rare average frequency between 0.006 and 0.0001 in primates supports a role for those motifs in processes that diverged primates from other mammals, such as neuronal differentiation (e.g. APOD and FGF4), and craniofacial development (e.g. FILIP1L). A number of the identified STRs of ≥5-repeats may be human-specific (e.g. ZMYM3 and DAZAP1). Future work is warranted to examine the importance of the listed genes in primate/human evolution, development, and disease. Copyright © 2015 Elsevier B.V. All rights reserved.
Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.

PubMed Central

Benslimane, A A; Dron, M; Hartmann, C; Rode, A

1986-01-01

Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammalians. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. Images PMID:3774553
Comparative and functional characterization of intragenic tandem repeats in 10 Aspergillus genomes.

PubMed

Gibbons, John G; Rokas, Antonis

2009-03-01

Intragenic tandem repeats (ITRs) are consecutive repeats of three or more nucleotides found in coding regions. ITRs are the underlying cause of several human genetic diseases and have been associated with phenotypic variation, including pathogenesis, in several clades of the tree of life. We have examined the evolution and functional role of ITRs in 10 genomes spanning the fungal genus Aspergillus, a clade of relevance to medicine, agriculture, and industry. We identified several hundred ITRs in each of the species examined. ITR content varied extensively between species, with an average 79% of ITRs unique to a given species. For the fraction of conserved ITR regions, sequence comparisons within species and between close relatives revealed that they were highly variable. ITR-containing proteins were evolutionarily less conserved, compositionally distinct, and overrepresented for domains associated with cell-surface localization and function relative to the rest of the proteome. Furthermore, ITRs were preferentially found in proteins involved in transcription, cellular communication, and cell-type differentiation but were underrepresented in proteins involved in metabolism and energy. Importantly, although ITRs were evolutionarily labile, their functional associations appeared. To be remarkably conserved across eukaryotes. Fungal ITRs likely participate in a variety of developmental processes and cell-surface-associated functions, suggesting that their contribution to fungal lifestyle and evolution may be more general than previously assumed.
Variable-Number Tandem Repeats That Are Useful in Genotyping Isolates of Salmonella enterica subsp. enterica Serovars Typhimurium and Newport▿

PubMed Central

Witonski, D. ; Stefanova, R.; Ranganathan, A.; Schutze, G. E.; Eisenach, K. D.; Cave, M. D.

2006-01-01

The genome of Salmonella enterica subsp. enterica serovar Typhimurium strain LT2 was analyzed for direct repeats, and 54 sequences containing variable-number tandem repeat loci were identified. Ten primer pairs that anneal upstream and downstream of each selected locus were designed and used to amplify PCR targets in isolates of S. enterica serovars Typhimurium and Newport. Four of the 10 loci did not show polymorphism in the length of products. Six loci were selected for analysis. Isolates of S. enterica serovars Typhimurium and Newport that were related to specific outbreaks and showed identical pulsed-field gel electrophoresis patterns were indistinguishable by the length of the six variable-number tandem repeats. Isolates that differed in their pulsed-field gel electrophoresis patterns showed polymorphism in variable-number tandem repeat profiles. Length of the products was confirmed by DNA sequence analysis. Only 2 of the 10 loci contained exact integers of the direct repeat. Eight loci contained partial copies. The partial copies were maintained at the ends of the variable-number tandem repeat loci in all isolates. In spite of having partial copies that were maintained in all isolates, the number of direct repeats at a locus was polymorphic. Six variable-number tandem repeat loci were useful in distinguishing isolates of S. enterica serovars Typhimurium and Newport that had different pulsed-field gel electrophoresis patterns and in identifying outbreak-associated cases that shared a common pulsed-field gel pattern. PMID:16943354
Chicken microsatellite markers isolated from libraries enriched for simple tandem repeats.

PubMed

Gibbs, M; Dawson, D A; McCamley, C; Wardle, A F; Armour, J A; Burke, T

1997-12-01

The total number of microsatellite loci is considered to be at least 10-fold lower in avian species than in mammalian species. Therefore, efficient large-scale cloning of chicken microsatellites, as required for the construction of a high-resolution linkage map, is facilitated by the construction of libraries using an enrichment strategy. In this study, a plasmid library enriched for tandem repeats was constructed from chicken genomic DNA by hybridization selection. Using this technique the proportion of recombinant clones that cross-hybridized to probes containing simple tandem repeats was raised to 16%, compared with < 0.1% in a non-enriched library. Primers were designed from 121 different sequences. Polymerase chain reaction (PCR) analysis of two chicken reference pedigrees enabled 72 loci to be localized within the collaborative chicken genetic map, and at least 30 of the remaining loci have been shown to be informative in these or other crosses.
Visualization of tandem repeat mutagenesis in Bacillus subtilis.

PubMed

Dormeyer, Miriam; Lentes, Sabine; Ballin, Patrick; Wilkens, Markus; Klumpp, Stefan; Kohlheyer, Dietrich; Stannek, Lorena; Grünberger, Alexander; Commichau, Fabian M

2018-03-01

Mutations are crucial for the emergence and evolution of proteins with novel functions, and thus for the diversity of life. Tandem repeats (TRs) are mutational hot spots that are present in the genomes of all organisms. Understanding the molecular mechanism underlying TR mutagenesis at the level of single cells requires the development of mutation reporter systems. Here, we present a mutation reporter system that is suitable to visualize mutagenesis of TRs occurring in single cells of the Gram-positive model bacterium Bacillus subtilis using microfluidic single-cell cultivation. The system allows measuring the elimination of TR units due to growth rate recovery. The cultivation of bacteria carrying the mutation reporter system in microfluidic chambers allowed us for the first time to visualize the emergence of a specific mutation at the level of single cells. The application of the mutation reporter system in combination with microfluidics might be helpful to elucidate the molecular mechanism underlying TR (in)stability in bacteria. Moreover, the mutation reporter system might be useful to assess whether mutations occur in response to nutrient starvation. Copyright © 2018 Elsevier B.V. All rights reserved.
APE1 incision activity at abasic sites in tandem repeat sequences.

PubMed

Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M

2014-05-29

Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.

5meCpG epigenetic marks neighboring a primate-conserved core promoter short tandem repeat indicate X-chromosome inactivation.

PubMed

Machado, Filipe Brum; Machado, Fabricio Brum; Faria, Milena Amendro; Lovatel, Viviane Lamim; Alves da Silva, Antonio Francisco; Radic, Claudia Pamela; De Brasi, Carlos Daniel; Rios, Álvaro Fabricio Lopes; de Sousa Lopes, Susana Marina Chuva; da Silveira, Leonardo Serafim; Ruiz-Miranda, Carlos Ramon; Ramos, Ester Silveira; Medina-Acosta, Enrique

2014-01-01

X-chromosome inactivation (XCI) is the epigenetic transcriptional silencing of an X-chromosome during the early stages of embryonic development in female eutherian mammals. XCI assures monoallelic expression in each cell and compensation for dosage-sensitive X-linked genes between females (XX) and males (XY). DNA methylation at the carbon-5 position of the cytosine pyrimidine ring in the context of a CpG dinucleotide sequence (5meCpG) in promoter regions is a key epigenetic marker for transcriptional gene silencing. Using computational analysis, we revealed an extragenic tandem GAAA repeat 230-bp from the landmark CpG island of the human X-linked retinitis pigmentosa 2 RP2 promoter whose 5meCpG status correlates with XCI. We used this RP2 onshore tandem GAAA repeat to develop an allele-specific 5meCpG-based PCR assay that is highly concordant with the human androgen receptor (AR) exonic tandem CAG repeat-based standard HUMARA assay in discriminating active (Xa) from inactive (Xi) X-chromosomes. The RP2 onshore tandem GAAA repeat contains neutral features that are lacking in the AR disease-linked tandem CAG repeat, is highly polymorphic (heterozygosity rates approximately 0.8) and shows minimal variation in the Xa/Xi ratio. The combined informativeness of RP2/AR is approximately 0.97, and this assay excels at determining the 5meCpG status of alleles at the Xp (RP2) and Xq (AR) chromosome arms in a single reaction. These findings are relevant and directly translatable to nonhuman primate models of XCI in which the AR CAG-repeat is monomorphic. We conducted the RP2 onshore tandem GAAA repeat assay in the naturally occurring chimeric New World monkey marmoset (Callitrichidae) and found it to be informative. The RP2 onshore tandem GAAA repeat will facilitate studies on the variable phenotypic expression of dominant and recessive X-linked diseases, epigenetic changes in twins, the physiology of aging hematopoiesis, the pathogenesis of age-related hematopoietic
5meCpG Epigenetic Marks Neighboring a Primate-Conserved Core Promoter Short Tandem Repeat Indicate X-Chromosome Inactivation

PubMed Central

Machado, Filipe Brum; Machado, Fabricio Brum; Faria, Milena Amendro; Lovatel, Viviane Lamim; Alves da Silva, Antonio Francisco; Radic, Claudia Pamela; De Brasi, Carlos Daniel; Rios, Álvaro Fabricio Lopes; de Sousa Lopes, Susana Marina Chuva; da Silveira, Leonardo Serafim; Ruiz-Miranda, Carlos Ramon; Ramos, Ester Silveira; Medina-Acosta, Enrique

2014-01-01

X-chromosome inactivation (XCI) is the epigenetic transcriptional silencing of an X-chromosome during the early stages of embryonic development in female eutherian mammals. XCI assures monoallelic expression in each cell and compensation for dosage-sensitive X-linked genes between females (XX) and males (XY). DNA methylation at the carbon-5 position of the cytosine pyrimidine ring in the context of a CpG dinucleotide sequence (5meCpG) in promoter regions is a key epigenetic marker for transcriptional gene silencing. Using computational analysis, we revealed an extragenic tandem GAAA repeat 230-bp from the landmark CpG island of the human X-linked retinitis pigmentosa 2 RP2 promoter whose 5meCpG status correlates with XCI. We used this RP2 onshore tandem GAAA repeat to develop an allele-specific 5meCpG-based PCR assay that is highly concordant with the human androgen receptor (AR) exonic tandem CAG repeat-based standard HUMARA assay in discriminating active (Xa) from inactive (Xi) X-chromosomes. The RP2 onshore tandem GAAA repeat contains neutral features that are lacking in the AR disease-linked tandem CAG repeat, is highly polymorphic (heterozygosity rates approximately 0.8) and shows minimal variation in the Xa/Xi ratio. The combined informativeness of RP2/AR is approximately 0.97, and this assay excels at determining the 5meCpG status of alleles at the Xp (RP2) and Xq (AR) chromosome arms in a single reaction. These findings are relevant and directly translatable to nonhuman primate models of XCI in which the AR CAG-repeat is monomorphic. We conducted the RP2 onshore tandem GAAA repeat assay in the naturally occurring chimeric New World monkey marmoset (Callitrichidae) and found it to be informative. The RP2 onshore tandem GAAA repeat will facilitate studies on the variable phenotypic expression of dominant and recessive X-linked diseases, epigenetic changes in twins, the physiology of aging hematopoiesis, the pathogenesis of age-related hematopoietic
The central domain of bovine submaxillary mucin consists of over 50 tandem repeats of 329 amino acids. Chromosomal localization of the BSM1 gene and relations to ovine and porcine counterparts.

PubMed

Jiang, W; Gupta, D; Gallagher, D; Davis, S; Bhavanandan, V P

2000-04-01

We previously elucidated five distinct protein domains (I-V) for bovine submaxillary mucin, which is encoded by two genes, BSM1 and BSM2. Using Southern blot analysis, genomic cloning and sequencing of the BSM1 gene, we now show that the central domain (V) consists of approximately 55 tandem repeats of 329 amino acids and that domains III-V are encoded by a 58.4-kb exon, the largest exon known for all genes to date. The BSM1 gene was mapped by fluorescence in situ hybridization to the proximal half of chromosome 5 at bands q2. 2-q2.3. The amino-acid sequence of six tandem repeats (two full and four partial) were found to have only 92-94% identities. We propose that the variability in the amino-acid sequences of the mucin tandem repeat is important for generating the combinatorial library of saccharides that are necessary for the protective function of mucins. The deduced peptide sequences of the central domain match those determined from the purified bovine submaxillary mucin and also show 68-94% identity to published peptide sequences of ovine submaxillary mucin. This indicates that the core protein of ovine submaxillary mucin is closely related to that of bovine submaxillary mucin and contains similar tandem repeats in the central domain. In contrast, the central domain of porcine submaxillary mucin is reported to consist of 81-amino-acid tandem repeats. However, both bovine submaxillary mucin and porcine submaxillary mucin contain similar N-terminal and C-terminal domains and the corresponding genes are in the conserved linkage regions of the respective genomes.
TANDEM: matching proteins with tandem mass spectra.

PubMed

Craig, Robertson; Beavis, Ronald C

2004-06-12

Tandem mass spectra obtained from fragmenting peptide ions contain some peptide sequence specific information, but often there is not enough information to sequence the original peptide completely. Several proprietary software applications have been developed to attempt to match the spectra with a list of protein sequences that may contain the sequence of the peptide. The application TANDEM was written to provide the proteomics research community with a set of components that can be used to test new methods and algorithms for performing this type of sequence-to-data matching. The source code and binaries for this software are available at http://www.proteome.ca/opensource.html, for Windows, Linux and Macintosh OSX. The source code is made available under the Artistic License, from the authors.
The production and characterization of novel heavy-chain antibodies against the tandem repeat region of MUC1 mucin.

PubMed

Rahbarizadeh, Fatemeh; Rasaee, Mohammad J; Forouzandeh, Mehdi; Allameh, Abdolamir; Sarrami, Ramin; Nasiry, Habib; Sadeghizadeh, Majid

2005-01-01

Camelidae are known to produce immunoglobulins (Igs) devoid of light chains and constant heavy-chain domains (CH1). Antigen-specific fragments of these heavy-chain IgGs (VHH) are of great interest in biotechnology applications. This paper describes the first example of successfully raised heavy-chain antibodies in Camelus dromedarius (single-humped camel) and Camelus bactrianus (two-humped camel) against a MUC1 related peptide that is found to be an important epitope expressed in cancerous tissue. Camels were immunized against a synthetic peptide corresponding to the tandem repeat region of MUC1 mucin and cancerous tissue preparation obtained from patients suffering from breast carcinoma. Three IgG subclasses with different binding properties to protein A and G were purified by affinity chromatography. Both conventional and heavy-chain IgG antibodies were produced in response to MUC1-related peptide. The elicited antibodies could react specifically with the tandem repeat region of MUC1 mucin in an enzyme linked immunosorbant assay (ELISA). Anti-peptide antibodies were purified after passing antiserum over two affinity chromatography columns. Using ELISA, immunocytochemistry and Western blotting, the interaction of purified antibodies with different antigens was evaluated. The antibodies were observed to be selectively bound to antigens namely: MUC1 peptide (tandem repeat region), human milk fat globule membrane (HMFG), deglycosylated human milk fat globule membrane (D-HMFG), homogenized cancerous breast tissue and a native MUC1 purified from ascitic fluid. Ka values of specific polyclonal antipeptide antibodies were estimated in C. dromedarius and C. bactrianus, as 7 x 10(10) M(-1) and 1.4 x 10(10) M(-1) respectively.
Genome-wide analysis of tandem repeats in plants and green algae

Treesearch

Zhixin Zhao; Cheng Guo; Sreeskandarajan Sutharzan; Pei Li; Craig Echt; Jie Zhang; Chun Liang

2014-01-01

Tandem repeats (TRs) extensively exist in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs in a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among...
Multiple functions of the leucine-rich repeat protein LrrA of Treponema denticola.

PubMed

Ikegami, Akihiko; Honma, Kiyonobu; Sharma, Ashu; Kuramitsu, Howard K

2004-08-01

The gene lrrA, encoding a leucine-rich repeat protein, LrrA, that contains eight consensus tandem repeats of 23 amino acid residues, has been identified in Treponema denticola ATCC 35405. A leucine-rich repeat is a generally useful protein-binding motif, and proteins containing this repeat are typically involved in protein-protein interactions. Southern blot analysis demonstrated that T. denticola ATCC 35405 expresses the lrrA gene, but the gene was not identified in T. denticola ATCC 33520. In order to analyze the functions of LrrA in T. denticola, an lrrA-inactivated mutant of strain ATCC 35405 and an lrrA gene expression transformant of strain ATCC 33520 were constructed. Characterization of the mutant and transformant demonstrated that LrrA is associated with the extracytoplasmic fraction of T. denticola and expresses multifunctional properties. It was demonstrated that the attachment of strain ATCC 35405 to HEp-2 cell cultures and coaggregation with Tannerella forsythensis were attenuated by the lrrA mutation. In addition, an in vitro binding assay demonstrated specific binding of LrrA to a portion of the Tannerella forsythensis leucine-rich repeat protein, BspA, which is mediated by the N-terminal region of LrrA. It was also observed that the lrrA mutation caused a reduction of swarming in T. denticola ATCC 35405 and consequently attenuated tissue penetration. These results suggest that the leucine-rich repeat protein LrrA plays a role in the attachment and penetration of human epithelial cells and coaggregation with Tannerella forsythensis. These properties may play important roles in the virulence of T. denticola.
Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain

PubMed Central

de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

2014-01-01

The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163
Medium-sized tandem repeats represent an abundant component of the Drosophila virilis genome.

PubMed

Abdurashitov, Murat A; Gonchar, Danila A; Chernukhin, Valery A; Tomilov, Victor N; Tomilova, Julia E; Schostak, Natalia G; Zatsepina, Olga G; Zelentsova, Elena S; Evgen'ev, Michael B; Degtyarev, Sergey K H

2013-11-09

Previously, we developed a simple method for carrying out a restriction enzyme analysis of eukaryotic DNA in silico, based on the known DNA sequences of the genomes. This method allows the user to calculate lengths of all DNA fragments that are formed after a whole genome is digested at the theoretical recognition sites of a given restriction enzyme. A comparison of the observed peaks in distribution diagrams with the results from DNA cleavage using several restriction enzymes performed in vitro have shown good correspondence between the theoretical and experimental data in several cases. Here, we applied this approach to the annotated genome of Drosophila virilis which is extremely rich in various repeats. Here we explored the combined approach to perform the restriction analysis of D. virilis DNA. This approach enabled to reveal three abundant medium-sized tandem repeats within the D. virilis genome. While the 225 bp repeats were revealed previously in intergenic non-transcribed spacers between ribosomal genes of D. virilis, two other families comprised of 154 bp and 172 bp repeats were not described. Tandem Repeats Finder search demonstrated that 154 bp and 172 bp units are organized in multiple clusters in the genome of D. virilis. Characteristically, only 154 bp repeats derived from Helitron transposon are transcribed. Using in silico digestion in combination with conventional restriction analysis and sequencing of repeated DNA fragments enabled us to isolate and characterize three highly abundant families of medium-sized repeats present in the D. virilis genome. These repeats comprise a significant portion of the genome and may have important roles in genome function and structural integrity. Therefore, we demonstrated an approach which makes possible to investigate in detail the gross arrangement and expression of medium-sized repeats basing on sequencing data even in the case of incompletely assembled and/or annotated genomes.
A naturally occurring, noncanonical GTP aptamer made of simple tandem repeats

PubMed Central

Curtis, Edward A; Liu, David R

2014-01-01

Recently, we used in vitro selection to identify a new class of naturally occurring GTP aptamer called the G motif. Here we report the discovery and characterization of a second class of naturally occurring GTP aptamer, the “CA motif.” The primary sequence of this aptamer is unusual in that it consists entirely of tandem repeats of CA-rich motifs as short as three nucleotides. Several active variants of the CA motif aptamer lack the ability to form consecutive Watson-Crick base pairs in any register, while others consist of repeats containing only cytidine and adenosine residues, indicating that noncanonical interactions play important roles in its structure. The circular dichroism spectrum of the CA motif aptamer is distinct from that of A-form RNA and other major classes of nucleic acid structures. Bioinformatic searches indicate that the CA motif is absent from most archaeal and bacterial genomes, but occurs in at least 70 percent of approximately 400 eukaryotic genomes examined. These searches also uncovered several phylogenetically conserved examples of the CA motif in rodent (mouse and rat) genomes. Together, these results reveal the existence of a second class of naturally occurring GTP aptamer whose sequence requirements, like that of the G motif, are not consistent with those of a canonical secondary structure. They also indicate a new and unexpected potential biochemical activity of certain naturally occurring tandem repeats. PMID:24824832
Multivalent binding of formin-binding protein 21 (FBP21)-tandem-WW domains fosters protein recognition in the pre-spliceosome.

PubMed

Klippel, Stefan; Wieczorek, Marek; Schümann, Michael; Krause, Eberhard; Marg, Berenice; Seidel, Thorsten; Meyer, Tim; Knapp, Ernst-Walter; Freund, Christian

2011-11-04

The high abundance of repetitive but nonidentical proline-rich sequences in spliceosomal proteins raises the question of how these known interaction motifs recruit their interacting protein domains. Whereas complex formation of these adaptors with individual motifs has been studied in great detail, little is known about the binding mode of domains arranged in tandem repeats and long proline-rich sequences including multiple motifs. Here we studied the interaction of the two adjacent WW domains of spliceosomal protein FBP21 with several ligands of different lengths and composition to elucidate the hallmarks of multivalent binding for this class of recognition domains. First, we show that many of the proteins that define the cellular proteome interacting with FBP21-WW1-WW2 contain multiple proline-rich motifs. Among these is the newly identified binding partner SF3B4. Fluorescence resonance energy transfer (FRET) analysis reveals the tandem-WW domains of FBP21 to interact with splicing factor 3B4 (SF3B4) in nuclear speckles where splicing takes place. Isothermal titration calorimetry and NMR shows that the tandem arrangement of WW domains and the multivalency of the proline-rich ligands both contribute to affinity enhancement. However, ligand exchange remains fast compared with the NMR time scale. Surprisingly, a N-terminal spin label attached to a bivalent ligand induces NMR line broadening of signals corresponding to both WW domains of the FBP21-WW1-WW2 protein. This suggests that distinct orientations of the ligand contribute to a delocalized and semispecific binding mode that should facilitate search processes within the spliceosome.
A novel tandem repeat sequence located on human chromosome 4p: isolation and characterization.

PubMed

Kogi, M; Fukushige, S; Lefevre, C; Hadano, S; Ikeda, J E

1997-06-01

In an effort to analyze the genomic region of the distal half of human chromosome 4p, to where Huntington disease and other diseases have been mapped, we have isolated the cosmid clone (CRS447) that was likely to contain a region with specific repeat sequences. Clone CRS447 was subjected to detailed analysis, including chromosome mapping, restriction mapping, and DNA sequencing. Chromosome mapping by both a human-CHO hybrid cell panel and FISH revealed that CRS447 was predominantly located in the 4p15.1-15.3 region. CRS447 was shown to consist of tandem repeats of 4.7-kb units present on chromosome 4p. A single EcoRI unit was subcloned (pRS447), and the complete sequence was determined as 4752 nucleotides. When pRS447 was used as a probe, the number of copies of this repeat per haploid genome was estimated to be 50-70. Sequence analysis revealed that it contained two internal CA repeats and one putative ORF. Database search established that this sequence was unreported. However, two homologous STS markers were found in the database. We concluded that CRS447/pRS447 is a novel tandem repeat sequence that is mainly specific to human chromosome 4p.
Solution structure of the tandem acyl carrier protein domains from a polyunsaturated fatty acid synthase reveals beads-on-a-string configuration.

PubMed

Trujillo, Uldaeliz; Vázquez-Rosa, Edwin; Oyola-Robles, Delise; Stagg, Loren J; Vassallo, David A; Vega, Irving E; Arold, Stefan T; Baerga-Ortiz, Abel

2013-01-01

The polyunsaturated fatty acid (PUFA) synthases from deep-sea bacteria invariably contain multiple acyl carrier protein (ACP) domains in tandem. This conserved tandem arrangement has been implicated in both amplification of fatty acid production (additive effect) and in structural stabilization of the multidomain protein (synergistic effect). While the more accepted model is one in which domains act independently, recent reports suggest that ACP domains may form higher oligomers. Elucidating the three-dimensional structure of tandem arrangements may therefore give important insights into the functional relevance of these structures, and hence guide bioengineering strategies. In an effort to elucidate the three-dimensional structure of tandem repeats from deep-sea anaerobic bacteria, we have expressed and purified a fragment consisting of five tandem ACP domains from the PUFA synthase from Photobacterium profundum. Analysis of the tandem ACP fragment by analytical gel filtration chromatography showed a retention time suggestive of a multimeric protein. However, small angle X-ray scattering (SAXS) revealed that the multi-ACP fragment is an elongated monomer which does not form a globular unit. Stokes radii calculated from atomic monomeric SAXS models were comparable to those measured by analytical gel filtration chromatography, showing that in the gel filtration experiment, the molecular weight was overestimated due to the elongated protein shape. Thermal denaturation monitored by circular dichroism showed that unfolding of the tandem construct was not cooperative, and that the tandem arrangement did not stabilize the protein. Taken together, these data are consistent with an elongated beads-on-a-string arrangement of the tandem ACP domains in PUFA synthases, and speak against synergistic biocatalytic effects promoted by quaternary structuring. Thus, it is possible to envision bioengineering strategies which simply involve the artificial linking of multiple ACP
Solution Structure of the Tandem Acyl Carrier Protein Domains from a Polyunsaturated Fatty Acid Synthase Reveals Beads-on-a-String Configuration

PubMed Central

Trujillo, Uldaeliz; Vázquez-Rosa, Edwin; Oyola-Robles, Delise; Stagg, Loren J.; Vassallo, David A.; Vega, Irving E.; Arold, Stefan T.; Baerga-Ortiz, Abel

2013-01-01

The polyunsaturated fatty acid (PUFA) synthases from deep-sea bacteria invariably contain multiple acyl carrier protein (ACP) domains in tandem. This conserved tandem arrangement has been implicated in both amplification of fatty acid production (additive effect) and in structural stabilization of the multidomain protein (synergistic effect). While the more accepted model is one in which domains act independently, recent reports suggest that ACP domains may form higher oligomers. Elucidating the three-dimensional structure of tandem arrangements may therefore give important insights into the functional relevance of these structures, and hence guide bioengineering strategies. In an effort to elucidate the three-dimensional structure of tandem repeats from deep-sea anaerobic bacteria, we have expressed and purified a fragment consisting of five tandem ACP domains from the PUFA synthase from Photobacterium profundum. Analysis of the tandem ACP fragment by analytical gel filtration chromatography showed a retention time suggestive of a multimeric protein. However, small angle X-ray scattering (SAXS) revealed that the multi-ACP fragment is an elongated monomer which does not form a globular unit. Stokes radii calculated from atomic monomeric SAXS models were comparable to those measured by analytical gel filtration chromatography, showing that in the gel filtration experiment, the molecular weight was overestimated due to the elongated protein shape. Thermal denaturation monitored by circular dichroism showed that unfolding of the tandem construct was not cooperative, and that the tandem arrangement did not stabilize the protein. Taken together, these data are consistent with an elongated beads-on-a-string arrangement of the tandem ACP domains in PUFA synthases, and speak against synergistic biocatalytic effects promoted by quaternary structuring. Thus, it is possible to envision bioengineering strategies which simply involve the artificial linking of multiple ACP
Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain.

PubMed

de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

2014-06-01

The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
MULTIPLE-LOCUS VARIABLE-NUMBER TANDEM REPEAT ANALYSIS OF BRUCELLA ISOLATES FROM THAILAND.

PubMed

Kumkrong, Khurawan; Chankate, Phanita; Tonyoung, Wittawat; Intarapuk, Apiradee; Kerdsin, Anusak; Kalambaheti, Thareerat

2017-01-01

Brucellosis-induced abortion can result in significant economic loss to farm animals. Brucellosis can be transmitted to humans during slaughter of infected animals or via consumption of contaminated food products. Strain identification of Brucella isolates can reveal the route of transmission. Brucella strains were isolated from vaginal swabs of farm animal, cow milk and from human blood cultures. Multiplex PCR was used to identify Brucella species, and owing to high DNA homology among Brucella isolates, multiple-locus variable-number tandem repeat analysis (MLVA) based on the number of tandem repeats at 16 different genomic loci was used for strain identification. Multiplex PCR categorized the isolates into B. abortus (n = 7), B. melitensis (n = 37), B. suis (n = 3), and 5 of unknown Brucella spp. MLVA-16 clustering analysis differentiated the strains into various genotypes, with Brucella isolates from the same geographic region being closely related, and revealed that the Thai isolates were phylogenetically distinct from those in other countries, including within the Southeast Asian region. Thus, MLVA-16 typing has utility in epidemiological studies.
Multivalent Binding of Formin-binding Protein 21 (FBP21)-Tandem-WW Domains Fosters Protein Recognition in the Pre-spliceosome*

PubMed Central

Klippel, Stefan; Wieczorek, Marek; Schümann, Michael; Krause, Eberhard; Marg, Berenice; Seidel, Thorsten; Meyer, Tim; Knapp, Ernst-Walter; Freund, Christian

2011-01-01

The high abundance of repetitive but nonidentical proline-rich sequences in spliceosomal proteins raises the question of how these known interaction motifs recruit their interacting protein domains. Whereas complex formation of these adaptors with individual motifs has been studied in great detail, little is known about the binding mode of domains arranged in tandem repeats and long proline-rich sequences including multiple motifs. Here we studied the interaction of the two adjacent WW domains of spliceosomal protein FBP21 with several ligands of different lengths and composition to elucidate the hallmarks of multivalent binding for this class of recognition domains. First, we show that many of the proteins that define the cellular proteome interacting with FBP21-WW1-WW2 contain multiple proline-rich motifs. Among these is the newly identified binding partner SF3B4. Fluorescence resonance energy transfer (FRET) analysis reveals the tandem-WW domains of FBP21 to interact with splicing factor 3B4 (SF3B4) in nuclear speckles where splicing takes place. Isothermal titration calorimetry and NMR shows that the tandem arrangement of WW domains and the multivalency of the proline-rich ligands both contribute to affinity enhancement. However, ligand exchange remains fast compared with the NMR time scale. Surprisingly, a N-terminal spin label attached to a bivalent ligand induces NMR line broadening of signals corresponding to both WW domains of the FBP21-WW1-WW2 protein. This suggests that distinct orientations of the ligand contribute to a delocalized and semispecific binding mode that should facilitate search processes within the spliceosome. PMID:21917930
STRBase: a short tandem repeat DNA database for the human identity testing community

PubMed Central

Ruitberg, Christian M.; Reeder, Dennis J.; Butler, John M.

2001-01-01

The National Institute of Standards and Technology (NIST) has compiled and maintained a Short Tandem Repeat DNA Internet Database (http://www.cstl.nist.gov/biotech/strbase/) since 1997 commonly referred to as STRBase. This database is an information resource for the forensic DNA typing community with details on commonly used short tandem repeat (STR) DNA markers. STRBase consolidates and organizes the abundant literature on this subject to facilitate on-going efforts in DNA typing. Observed alleles and annotated sequence for each STR locus are described along with a review of STR analysis technologies. Additionally, commercially available STR multiplex kits are described, published polymerase chain reaction (PCR) primer sequences are reported, and validation studies conducted by a number of forensic laboratories are listed. To supplement the technical information, addresses for scientists and hyperlinks to organizations working in this area are available, along with the comprehensive reference list of over 1300 publications on STRs used for DNA typing purposes. PMID:11125125
Upstream mononucleotide A-repeats play a cis-regulatory role in mammals through the DICER1 and Ago proteins.

PubMed

Aporntewan, Chatchawit; Pin-on, Piyapat; Chaiyaratana, Nachol; Pongpanich, Monnat; Boonyaratanakornkit, Viroj; Mutirangura, Apiwat

2013-10-01

A-repeats are the simplest form of tandem repeats and are found ubiquitously throughout genomes. These mononucleotide repeats have been widely believed to be non-functional 'junk' DNA. However, studies in yeasts suggest that A-repeats play crucial biological functions, and their role in humans remains largely unknown. Here, we showed a non-random pattern of distribution of sense A- and T-repeats within 20 kb around transcription start sites (TSSs) in the human genome. Different distributions of these repeats are observed upstream and downstream of TSSs. Sense A-repeats are enriched upstream, whereas sense T-repeats are enriched downstream of TSSs. This enrichment directly correlates with repeat size. Genes with different functions contain different lengths of repeats. In humans, tissue-specific genes are enriched for short repeats of <10 bp, whereas housekeeping genes are enriched for long repeats of ≥10 bp. We demonstrated that DICER1 and Argonaute proteins are required for the cis-regulatory role of A-repeats. Moreover, in the presence of a synthetic polymer that mimics an A-repeat, protein binding to A-repeats was blocked, resulting in a dramatic change in the expression of genes containing upstream A-repeats. Our findings suggest a length-dependent cis-regulatory function of A-repeats and that Argonaute proteins serve as trans-acting factors, binding to A-repeats.
An examination of the origin and evolution of additional tandem repeats in the mitochondrial DNA control region of Japanese sika deer (Cervus Nippon).

PubMed

Ba, Hengxing; Wu, Lang; Liu, Zongyue; Li, Chunyi

2016-01-01

Tandem repeat units are only detected in the left domain of the mitochondrial DNA control region in sika deer. Previous studies showed that Japanese sika deer have more tandem repeat units than its cousins from the Asian continent and Taiwan, which often have only three repeat units. To determine the origin and evolution of these additional repeat units in Japanese sika deer, we obtained the sequence of repeat units from an expanded dataset of the control region from all sika deer lineages. The functional constraint is inferred to act on the first repeat unit because this repeat has the least sequence divergence in comparison to the other units. Based on slipped-strand mispairing mechanisms, the illegitimate elongation model could account for the addition or deletion of these additional repeat units in the Japanese sika deer population. We also report that these additional repeat units could be occurring in the internal positions of tandem repeat regions, possibly via coupling with a homogenization mechanism within and among these lineages. Moreover, the increased number of repeat units in the Japanese sika deer population could reflect a balance between mutation and selection, as well as genetic drift.

Novel protein domains and repeats in Drosophila melanogaster: insights into structure, function, and evolution.

PubMed

Ponting, C P; Mott, R; Bork, P; Copley, R R

2001-12-01

Sequence database searching methods such as BLAST, are invaluable for predicting molecular function on the basis of sequence similarities among single regions of proteins. Searches of whole databases however, are not optimized to detect multiple homologous regions within a single polypeptide. Here we have used the prospero algorithm to perform self-comparisons of all predicted Drosophila melanogaster gene products. Predicted repeats, and their homologs from all species, were analyzed further to detect hitherto unappreciated evolutionary relationships. Results included the identification of novel tandem repeats in the human X-linked retinitis pigmentosa type-2 gene product, repeated segments in cystinosin, associated with a defect in cystine transport, and 'nested' homologous domains in dysferlin, whose gene is mutated in limb girdle muscular dystrophy. Novel signaling domain families were found that may regulate the microtubule-based cytoskeleton and ubiquitin-mediated proteolysis, respectively. Two families of glycosyl hydrolases were shown to contain internal repetitions that hint at their evolution via a piecemeal, modular approach. In addition, three examples of fruit fly genes were detected with tandem exons that appear to have arisen via internal duplication. These findings demonstrate how completely sequenced genomes can be exploited to further understand the relationships between molecular structure, function, and evolution.
Tandem repeated application of organic solvents and sodium lauryl sulphate enhances cumulative skin irritation.

PubMed

Schliemann, Sibylle; Schmidt, Christina; Elsner, Peter

2014-01-01

The objective of our study was to investigate the tandem irritation potential of two organic solvents with concurrent exposure to the hydrophilic detergent irritant sodium lauryl sulphate (SLS). A tandem repeated irritation test was performed with two undiluted organic solvents, cumene (C) and octane (O), with either alternating application with SLS 0.5% or twice daily application of each irritant alone in 27 volunteers on the skin of the back. The cumulative irritation induced over 4 days was quantified using visual scoring and non-invasive bioengineering measurements (skin colour reflectance, skin hydration and transepidermal water loss). Repeated application of C/SLS and O/SLS induced more decline of stratum corneum hydration and higher degrees of clinical irritation and erythema compared to each irritant alone. Our results demonstrate a further example of additive harmful skin effects induced by particular skin irritants and indicate that exposure to organic solvents together with detergents may increase the risk of acquiring occupational contact dermatitis. © 2014 S. Karger AG, Basel.
De novo protein sequencing by combining top-down and bottom-up tandem mass spectra.

PubMed

Liu, Xiaowen; Dekker, Lennard J M; Wu, Si; Vanduijn, Martijn M; Luider, Theo M; Tolić, Nikola; Kou, Qiang; Dvorkin, Mikhail; Alexandrova, Sonya; Vyatkina, Kira; Paša-Tolić, Ljiljana; Pevzner, Pavel A

2014-07-03

There are two approaches for de novo protein sequencing: Edman degradation and mass spectrometry (MS). Existing MS-based methods characterize a novel protein by assembling tandem mass spectra of overlapping peptides generated from multiple proteolytic digestions of the protein. Because each tandem mass spectrum covers only a short peptide of the target protein, the key to high coverage protein sequencing is to find spectral pairs from overlapping peptides in order to assemble tandem mass spectra to long ones. However, overlapping regions of peptides may be too short to be confidently identified. High-resolution mass spectrometers have become accessible to many laboratories. These mass spectrometers are capable of analyzing molecules of large mass values, boosting the development of top-down MS. Top-down tandem mass spectra cover whole proteins. However, top-down tandem mass spectra, even combined, rarely provide full ion fragmentation coverage of a protein. We propose an algorithm, TBNovo, for de novo protein sequencing by combining top-down and bottom-up MS. In TBNovo, a top-down tandem mass spectrum is utilized as a scaffold, and bottom-up tandem mass spectra are aligned to the scaffold to increase sequence coverage. Experiments on data sets of two proteins showed that TBNovo achieved high sequence coverage and high sequence accuracy.
Multi-locus variable number tandem repeat analysis for Escherichia coli causing extraintestinal infections.

PubMed

Manges, Amee R; Tellis, Patricia A; Vincent, Caroline; Lifeso, Kimberley; Geneau, Geneviève; Reid-Smith, Richard J; Boerlin, Patrick

2009-11-01

Discriminatory genotyping methods for the analysis of Escherichia coli other than O157:H7 are necessary for public health-related activities. A new multi-locus variable number tandem repeat analysis protocol is presented; this method achieves an index of discrimination of 99.5% and is reproducible and valid when tested on a collection of 836 diverse E. coli.
Intratypic variability of a tandem repeat locus within the DNA polymerase gene of human herpes simplex virus type 2.

PubMed

Sun, Yongjiang; Chan, Roy Kum Wah; Tan, Suat Hoon

2004-01-01

In this study, the irntratypic variability of a tandem repeat locus within the DNA polymerase (pol) gene of human herpes simplex virus type 2 (HSV2) was uncovered. The locus contained variable numbers of tandem dodecanucleotide (5'-GAC GAG GAC GGG-3') repetitive units. Our result showed that approximately 95% of analyzed HSV2 clinical isolates and the current GenBank HSV2 strains contained two copies of the repetitive units. From genital herpes specimens, three new HSV2 strains, which respectively contained 1, 3, and 4 copies of the repetitive units, were identified. This variable number of tandem repeat (VNTR) locus is absent in HSV1, and thus it also contributes to the intertypic variability of HSV1 and HSV2. The intratypic variability of the locus may be useful for HSV2 strain genotyping and this application is discussed.
Altered Methylation in Tandem Repeat Element and Elemental Component Levels in Inhalable Air Particles

PubMed Central

Hou, Lifang; Zhang, Xiao; Zheng, Yinan; Wang, Sheng; Dou, Chang; Guo, Liqiong; Byun, Hyang-Min; Motta, Valeria; McCracken, John; Díaz, Anaité; Kang, Choong-Min; Koutrakis, Petros; Bertazzi, Pier Alberto; Li, Jingyun; Schwartz, Joel; Baccarelli, Andrea A.

2014-01-01

Exposure to particulate matter (PM) has been associated with lung cancer risk in epidemiology investigations. Elemental components of PM have been suggested to have critical roles in PM toxicity, but the molecular mechanisms underlying their association with cancer risks remain poorly understood. DNA methylation has emerged as a promising biomarker for environmental-related diseases, including lung cancer. In this study, we evaluated the effects of PM elemental components on methylation of three tandem repeats in a highly-exposed population in Beijing, China. The Beijing Truck Driver Air Pollution Study was conducted shortly before the 2008 Beijing Olympic Games (June 15-July 27, 2008) and included 60 truck drivers and 60 office workers. On two days separated by 1-2 weeks, we measured blood DNA methylation of SATα, NBL2, D4Z4, and personal exposure to eight elemental components in PM2.5, including aluminum (Al), silicon (Si), sulfur (S), potassium (K), calcium (Ca) titanium (Ti), iron (Fe), and zinc (Zn). We estimated the associations of individual elemental component with each tandem repeat methylation in generalized estimating equations (GEE) models adjusted for PM2.5 mass and other covariates. Out of the eight examined elements, NBL2 methylation was positively associated with concentrations of Si (0.121, 95%CI: 0.030; 0.212, FDR=0.047) and Ca (0.065, 95%CI: 0.014; 0.115, FDR=0.047) in truck drivers. In office workers, SATα methylation was positively associated with concentrations of S (0.115, 95%CI: 0.034; 0.196, FDR=0.042). PM-associated differences in blood tandem-repeat methylation may help detect biological effects of the exposure and identify individuals who may eventually experience higher lung cancer risk. PMID:24273195
6-mercaptopurine influences TPMT gene transcription in a TPMT gene promoter variable number of tandem repeats-dependent manner.

PubMed

Kotur, Nikola; Stankovic, Biljana; Kassela, Katerina; Georgitsi, Marianthi; Vicha, Anna; Leontari, Iliana; Dokmanovic, Lidija; Janic, Dragana; Krstovski, Nada; Klaassen, Kristel; Radmilovic, Milena; Stojiljkovic, Maja; Nikcevic, Gordana; Simeonidis, Argiris; Sivolapenko, Gregory; Pavlovic, Sonja; Patrinos, George P; Zukic, Branka

2012-02-01

TPMT activity is characterized by a trimodal distribution, namely low, intermediate and high methylator. TPMT gene promoter contains a variable number of GC-rich tandem repeats (VNTRs), namely A, B and C, ranging from three to nine repeats in length in an A(n)B(m)C architecture. We have previously shown that the VNTR architecture in the TPMT gene promoter affects TPMT gene transcription. MATERIALS, METHODS & RESULTS: Here we demonstrate, using reporter assays, that 6-mercaptopurine (6-MP) treatment results in a VNTR architecture-dependent decrease of TPMT gene transcription, mediated by the binding of newly recruited protein complexes to the TPMT gene promoter, upon 6-MP treatment. We also show that acute lymphoblastic leukemia patients undergoing 6-MP treatment display a VNTR architecture-dependent response to 6-MP. These data suggest that the TPMT gene promoter VNTR architecture can be potentially used as a pharmacogenomic marker to predict toxicity due to 6-MP treatment in acute lymphoblastic leukemia patients.
Variable-number tandem repeats as molecular markers for biotypes of Pasteuria ramosa in Daphnia spp.

PubMed

Mouton, Laurence; Nong, Guang; Preston, James F; Ebert, Dieter

2007-06-01

Variable-number tandem repeats (VNTRs) have been identified in populations of Pasteuria ramosa, a castrating endobacterium of Daphnia species. The allelic polymorphisms at 14 loci in laboratory and geographically diverse soil samples showed that VNTRs may serve as biomarkers for the genetic characterization of P. ramosa isolates.
Repeatability and Reproducibility in Proteomic Identifications by Liquid Chromatography—Tandem Mass Spectrometry

PubMed Central

Tabb, David L.; Vega-Montoto, Lorenzo; Rudnick, Paul A.; Variyath, Asokan Mulayath; Ham, Amy-Joan L.; Bunk, David M.; Kilpatrick, Lisa E.; Billheimer, Dean D.; Blackman, Ronald K.; Cardasis, Helene L.; Carr, Steven A.; Clauser, Karl R.; Jaffe, Jacob D.; Kowalski, Kevin A.; Neubert, Thomas A.; Regnier, Fred E.; Schilling, Birgit; Tegeler, Tony J.; Wang, Mu; Wang, Pei; Whiteaker, Jeffrey R.; Zimmerman, Lisa J.; Fisher, Susan J.; Gibson, Bradford W.; Kinsinger, Christopher R.; Mesri, Mehdi; Rodriguez, Henry; Stein, Steven E.; Tempst, Paul; Paulovich, Amanda G.; Liebler, Daniel C.; Spiegelman, Cliff

2009-01-01

The complexity of proteomic instrumentation for LC-MS/MS introduces many possible sources of variability. Data-dependent sampling of peptides constitutes a stochastic element at the heart of discovery proteomics. Although this variation impacts the identification of peptides, proteomic identifications are far from completely random. In this study, we analyzed interlaboratory data sets from the NCI Clinical Proteomic Technology Assessment for Cancer to examine repeatability and reproducibility in peptide and protein identifications. Included data spanned 144 LC-MS/MS experiments on four Thermo LTQ and four Orbitrap instruments. Samples included yeast lysate, the NCI-20 defined dynamic range protein mix, and the Sigma UPS 1 defined equimolar protein mix. Some of our findings reinforced conventional wisdom, such as repeatability and reproducibility being higher for proteins than for peptides. Most lessons from the data, however, were more subtle. Orbitraps proved capable of higher repeatability and reproducibility, but aberrant performance occasionally erased these gains. Even the simplest protein digestions yielded more peptide ions than LC-MS/MS could identify during a single experiment. We observed that peptide lists from pairs of technical replicates overlapped by 35–60%, giving a range for peptide-level repeatability in these experiments. Sample complexity did not appear to affect peptide identification repeatability, even as numbers of identified spectra changed by an order of magnitude. Statistical analysis of protein spectral counts revealed greater stability across technical replicates for Orbitraps, making them superior to LTQ instruments for biomarker candidate discovery. The most repeatable peptides were those corresponding to conventional tryptic cleavage sites, those that produced intense MS signals, and those that resulted from proteins generating many distinct peptides. Reproducibility among different instruments of the same type lagged behind
Stability of Tandem Repeats in the Drosophila Melanogaster HSR-Omega Nuclear RNA

PubMed Central

Hogan, N. C.; Slot, F.; Traverse, K. L.; Garbe, J. C.; Bendena, W. G.; Pardue, M. L.

1995-01-01

The Drosophila melanogaster Hsr-omega locus produces a nuclear RNA containing >5 kb of tandem repeat sequences. These repeats are unique to Hsr-omega and show concerted evolution similar to that seen with classical satellite DNAs. In D. melanogaster the monomer is ~280 bp. Sequences of 191/2 monomers differ by 8 +/- 5% (mean +/- SD), when all pairwise comparisons are considered. Differences are single nucleotide substitutions and 1-3 nucleotide deletions/insertions. Changes appear to be randomly distributed over the repeat unit. Outer repeats do not show the decrease in monomer homogeneity that might be expected if homogeneity is maintained by recombination. However, just outside the last complete repeat at each end, there are a few fragments of sequence similar to the monomer. The sequences in these flanking regions are not those predicted for sequences decaying in the absence of recombination. Instead, the fragmentation of the sequence homology suggests that flanking regions have undergone more severe disruptions, possibly during an insertion or amplification event. Hsr-omega alleles differing in the number of repeats are detected and appear to be stable over a few thousand generations; however, both increases and decreases in repeat numbers have been observed. The new alleles appear to be as stable as their predecessors. No alleles of less than ~5 kb nor more than ~16 kb of repeats were seen in any stocks examined. The evidence that there is a limit on the minimum number of repeats is consistent with the suggestion that these repeats are important in the function of the unusual Hsr-omega nuclear RNA. PMID:7540581
Evolution of Protein Domain Repeats in Metazoa

PubMed Central

Schüler, Andreas; Bornberg-Bauer, Erich

2016-01-01

Repeats are ubiquitous elements of proteins and they play important roles for cellular function and during evolution. Repeats are, however, also notoriously difficult to capture computationally and large scale studies so far had difficulties in linking genetic causes, structural properties and evolutionary trajectories of protein repeats. Here we apply recently developed methods for repeat detection and analysis to a large dataset comprising over hundred metazoan genomes. We find that repeats in larger protein families experience generally very few insertions or deletions (indels) of repeat units but there is also a significant fraction of noteworthy volatile outliers with very high indel rates. Analysis of structural data indicates that repeats with an open structure and independently folding units are more volatile and more likely to be intrinsically disordered. Such disordered repeats are also significantly enriched in sites with a high functional potential such as linear motifs. Furthermore, the most volatile repeats have a high sequence similarity between their units. Since many volatile repeats also show signs of recombination, we conclude they are often shaped by concerted evolution. Intriguingly, many of these conserved yet volatile repeats are involved in host-pathogen interactions where they might foster fast but subtle adaptation in biological arms races. Key Words: protein evolution, domain rearrangements, protein repeats, concerted evolution. PMID:27671125
ACCA phosphopeptide recognition by the BRCT repeats of BRCA1.

PubMed

Ray, Hind; Moreau, Karen; Dizin, Eva; Callebaut, Isabelle; Venezia, Nicole Dalla

2006-06-16

The tumour suppressor gene BRCA1 encodes a 220 kDa protein that participates in multiple cellular processes. The BRCA1 protein contains a tandem of two BRCT repeats at its carboxy-terminal region. The majority of disease-associated BRCA1 mutations affect this region and provide to the BRCT repeats a central role in the BRCA1 tumour suppressor function. The BRCT repeats have been shown to mediate phospho-dependant protein-protein interactions. They recognize phosphorylated peptides using a recognition groove that spans both BRCT repeats. We previously identified an interaction between the tandem of BRCA1 BRCT repeats and ACCA, which was disrupted by germ line BRCA1 mutations that affect the BRCT repeats. We recently showed that BRCA1 modulates ACCA activity through its phospho-dependent binding to ACCA. To delineate the region of ACCA that is crucial for the regulation of its activity by BRCA1, we searched for potential phosphorylation sites in the ACCA sequence that might be recognized by the BRCA1 BRCT repeats. Using sequence analysis and structure modelling, we proposed the Ser1263 residue as the most favourable candidate among six residues, for recognition by the BRCA1 BRCT repeats. Using experimental approaches, such as GST pull-down assay with Bosc cells, we clearly showed that phosphorylation of only Ser1263 was essential for the interaction of ACCA with the BRCT repeats. We finally demonstrated by immunoprecipitation of ACCA in cells, that the whole BRCA1 protein interacts with ACCA when phosphorylated on Ser1263.
Development of Multiple-Locus Variable-Number Tandem-Repeat Analysis for Molecular Subtyping of Campylobacter jejuni by Using Capillary Electrophoresis

PubMed Central

Techaruvichit, Punnida; Vesaratchavest, Mongkol; Keeratipibul, Suwimon; Kuda, Takashi; Kimura, Bon

2015-01-01

Campylobacter jejuni is a common cause of the frequently reported food-borne diseases in developed and developing nations. This study describes the development of multiple-locus variable-number tandem-repeat (VNTR) analysis (MLVA) using capillary electrophoresis as a novel typing method for microbial source tracking and epidemiological investigation of C. jejuni. Among 36 tandem repeat loci detected by the Tandem Repeat Finder program, 7 VNTR loci were selected and used for characterizing 60 isolates recovered from chicken meat samples from retail shops, samples from chicken meat processing factory, and stool samples. The discrimination ability of MLVA was compared with that of multilocus sequence typing (MLST). MLVA (diversity index of 0.97 with 31 MLVA types) provided slightly higher discrimination than MLST (diversity index of 0.95 with 25 MLST types). The overall concordance between MLVA and MLST was estimated at 63% by adjusted Rand coefficient. MLVA predicted MLST type better than MLST predicted MLVA type, as reflected by Wallace coefficient (Wallace coefficient for MLVA to MLST versus MLST to MLVA, 86% versus 51%). MLVA is a useful tool and can be used for effective monitoring of C. jejuni and investigation of epidemics caused by C. jejuni. PMID:26025899
The solution structure of the pentatricopeptide repeat protein PPR10 upon binding atpH RNA

PubMed Central

Gully, Benjamin S.; Cowieson, Nathan; Stanley, Will A.; Shearston, Kate; Small, Ian D.; Barkan, Alice; Bond, Charles S.

2015-01-01

The pentatricopeptide repeat (PPR) protein family is a large family of RNA-binding proteins that is characterized by tandem arrays of a degenerate 35-amino-acid motif which form an α-solenoid structure. PPR proteins influence the editing, splicing, translation and stability of specific RNAs in mitochondria and chloroplasts. Zea mays PPR10 is amongst the best studied PPR proteins, where sequence-specific binding to two RNA transcripts, atpH and psaJ, has been demonstrated to follow a recognition code where the identity of two amino acids per repeat determines the base-specificity. A recently solved ZmPPR10:psaJ complex crystal structure suggested a homodimeric complex with considerably fewer sequence-specific protein–RNA contacts than inferred previously. Here we describe the solution structure of the ZmPPR10:atpH complex using size-exclusion chromatography-coupled synchrotron small-angle X-ray scattering (SEC-SY-SAXS). Our results support prior evidence that PPR10 binds RNA as a monomer, and that it does so in a manner that is commensurate with a canonical and predictable RNA-binding mode across much of the RNA–protein interface. PMID:25609698
Structural Studies of the Tandem Tudor Domains of Fragile X Mental Retardation Related Proteins FXR1 and FXR2

DOE Office of Scientific and Technical Information (OSTI.GOV)

Adams-Cioaba, Melanie A.; Guo, Yahong; Bian, ChuanBing

Expansion of the CGG trinucleotide repeat in the 5'-untranslated region of the FMR1, fragile X mental retardation 1, gene results in suppression of protein expression for this gene and is the underlying cause of Fragile X syndrome. In unaffected individuals, the FMRP protein, together with two additional paralogues (Fragile X Mental Retardation Syndrome-related Protein 1 and 2), associates with mRNA to form a ribonucleoprotein complex in the nucleus that is transported to dendrites and spines of neuronal cells. It is thought that the fragile X family of proteins contributes to the regulation of protein synthesis at sites where mRNAs aremore » locally translated in response to stimuli. Here, we report the X-ray crystal structures of the non-canonical nuclear localization signals of the FXR1 and FXR2 autosomal paralogues of FMRP, which were determined at 2.50 and 1.92 {angstrom}, respectively. The nuclear localization signals of the FXR1 and FXR2 comprise tandem Tudor domain architectures, closely resembling that of UHRF1, which is proposed to bind methylated histone H3K9. The FMRP, FXR1 and FXR2 proteins comprise a small family of highly conserved proteins that appear to be important in translational regulation, particularly in neuronal cells. The crystal structures of the N-terminal tandem Tudor domains of FXR1 and FXR2 revealed a conserved architecture with that of FMRP. Biochemical analysis of the tandem Tudor doamins reveals their ability to preferentially recognize trimethylated peptides in a sequence-specific manner.« less
GENETIC VARIATION IN RED RASPBERRIES (RUBUS IDAEUS L.; ROSACEAE) FROM SITES DIFFERING IN ORGANIC POLLUTANTS COMPARED WITH SYNTHETIC TANDEM REPEAT DNA PROBES

EPA Science Inventory

Two synthetic tandem repetitive DNA probes were used to compare genetic variation at variable-number-tandem-repeat (VNTR) loci among Rubus idaeus L. var. strigosus (Michx.) Maxim. (Rosaceae) individuals sampled at eight sites contaminated by pollutants (N = 39) and eight adjacent...
Fingerprinting of Cyanobacteria Based on PCR with Primers Derived from Short and Long Tandemly Repeated Repetitive Sequences

PubMed Central

Rasmussen, Ulla; Svenning, Mette M.

1998-01-01

The presence of repeated DNA (short tandemly repeated repetitive [STRR] and long tandemly repeated repetitive [LTRR]) sequences in the genome of cyanobacteria was used to generate a fingerprint method for symbiotic and free-living isolates. Primers corresponding to the STRR and LTRR sequences were used in the PCR, resulting in a method which generate specific fingerprints for individual isolates. The method was useful both with purified DNA and with intact cyanobacterial filaments or cells as templates for the PCR. Twenty-three Nostoc isolates from a total of 35 were symbiotic isolates from the angiosperm Gunnera species, including isolates from the same Gunnera species as well as from different species. The results show a genetic similarity among isolates from different Gunnera species as well as a genetic heterogeneity among isolates from the same Gunnera species. Isolates which have been postulated to be closely related or identical revealed similar results by the PCR method, indicating that the technique is useful for clustering of even closely related strains. The method was applied to nonheterocystus cyanobacteria from which a fingerprint pattern was obtained. PMID:16349487
Multiple-Locus Variable-Number Tandem-Repeat Analysis in Genotyping Yersinia enterocolitica Strains from Human and Porcine Origins

PubMed Central

Laukkanen-Ninios, R.; Ortiz Martínez, P.; Siitonen, A.; Fredriksson-Ahomaa, M.; Korkeala, H.

2013-01-01

Sporadic and epidemiologically linked Yersinia enterocolitica strains (n = 379) isolated from fecal samples from human patients, tonsil or fecal samples from pigs collected at slaughterhouses, and pork samples collected at meat stores were genotyped using multiple-locus variable-number tandem-repeat analysis (MLVA) with six loci, i.e., V2A, V4, V5, V6, V7, and V9. In total, 312 different MLVA types were found. Similar types were detected (i) in fecal samples collected from human patients over 2 to 3 consecutive years, (ii) in samples from humans and pigs, and (iii) in samples from pigs that originated from the same farms. Among porcine strains, we found farm-specific MLVA profiles. Variations in the numbers of tandem repeats from one to four for variable-number tandem-repeat (VNTR) loci V2A, V5, V6, and V7 were observed within a farm. MLVA was applicable for serotypes O:3, O:5,27, and O:9 and appeared to be a highly discriminating tool for distinguishing sporadic and outbreak-related strains. With long-term use, interpretation of the results became more challenging due to variations in more-discriminating loci, as was observed for strains originating from pig farms. Additionally, we encountered unexpectedly short V2A VNTR fragments and sequenced them. According to the sequencing results, updated guidelines for interpreting V2A VNTR results were prepared. PMID:23637293
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.

PubMed

Fungtammasan, Arkarachai; Ananda, Guruprasad; Hile, Suzanne E; Su, Marcia Shu-Wei; Sun, Chen; Harris, Robert; Medvedev, Paul; Eckert, Kristin; Makova, Kateryna D

2015-05-01

Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using flank-based mapping, a computational pipeline that can detect the full spectrum of STR alleles from short-read data, can adapt to emerging read-mapping algorithms, and can be applied to heterogeneous genetic samples (e.g., tumors, viruses, and genomes of organelles). We used STR-FM to study STR error rates and patterns in publicly available human and in-house generated ultradeep plasmid sequencing data sets. We discovered that STRs sequenced with a PCR-free protocol have up to ninefold fewer errors than those sequenced with a PCR-containing protocol. We constructed an error correction model for genotyping STRs that can distinguish heterozygous alleles containing STRs with consecutive repeat numbers. Applying our model and pipeline to Illumina sequencing data with 100-bp reads, we could confidently genotype several disease-related long trinucleotide STRs. Utilizing this pipeline, for the first time we determined the genome-wide STR germline mutation rate from a deeply sequenced human pedigree. Additionally, we built a tool that recommends minimal sequencing depth for accurate STR genotyping, depending on repeat length and sequencing read length. The required read depth increases with STR length and is lower for a PCR-free protocol. This suite of tools addresses the pressing challenges surrounding STR genotyping, and thus is of wide interest to researchers investigating disease-related STRs and STR evolution. © 2015 Fungtammasan et al.; Published by Cold Spring Harbor Laboratory Press.
Thermal denaturation of the BRCT tandem repeat region of human tumour suppressor gene product BRCA1.

PubMed

Pyrpassopoulos, Serapion; Ladopoulou, Angela; Vlassi, Metaxia; Papanikolau, Yannis; Vorgias, Constantinos E; Yannoukakos, Drakoulis; Nounesis, George

2005-04-01

Reduced stability of the tandem BRCT domains of human BReast CAncer 1 (BRCA1) due to missense mutations may be critical for loss of function in DNA repair and damage-induced checkpoint control. In the present thermal denaturation study of the BRCA1 BRCT region, high-precision differential scanning calorimetry (DSC) and circular dichroism (CD) spectroscopy provide evidence for the existence of a denatured state that is structurally very similar to the native. Consistency between theoretical structure-based estimates of the enthalpy (DeltaH) and heat capacity change (DeltaCp) and the calorimetric results is obtained when considering partial thermal unfolding contained in the region of the conserved hydrophobic pocket formed at the interface of the two BRCT repeats. The structural integrity of this region has been shown to be crucial for the interaction of BRCA1 with phosphorylated peptides. In addition, cancer-causing missense mutations located at the inter-BRCT-repeat interface have been linked to the destabilization of the tandem BRCT structure.

Analysis of sequence repeats of proteins in the PDB.

PubMed

Mary Rajathei, David; Selvaraj, Samuel

2013-12-01

Internal repeats in protein sequences play a significant role in the evolution of protein structure and function. Applications of different bioinformatics tools help in the identification and characterization of these repeats. In the present study, we analyzed sequence repeats in a non-redundant set of proteins available in the Protein Data Bank (PDB). We used RADAR for detecting internal repeats in a protein, PDBeFOLD for assessing structural similarity, PDBsum for finding functional involvement and Pfam for domain assignment of the repeats in a protein. Through the analysis of sequence repeats, we found that identity of the sequence repeats falls in the range of 20-40% and, the superimposed structures of the most of the sequence repeats maintain similar overall folding. Analysis sequence repeats at the functional level reveals that most of the sequence repeats are involved in the function of the protein through functionally involved residues in the repeat regions. We also found that sequence repeats in single and two domain proteins often contained conserved sequence motifs for the function of the domain. Copyright © 2013 Elsevier Ltd. All rights reserved.
The protein network surrounding the human telomere repeat binding factors TRF1, TRF2, and POT1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Giannone, Richard J; McDonald, W Hayes; Hurst, Gregory

Telomere integrity (including telomere length and capping) is critical in overall genomic stability. Telomere repeat binding factors and their associated proteins play vital roles in telomere length regulation and end protection. In this study, we explore the protein network surrounding telomere repeat binding factors, TRF1, TRF2, and POT1 using dual-tag affinity purification in combination with multidimensional protein identification technology liquid chromatography - tandem mass spectrometry (MudPIT LC-MS/MS). After control subtraction and data filtering, we found that TRF2 and POT1 co-purified all six members of the telomere protein complex, while TRF1 identified five of six components at frequencies that lend evidencemore » towards the currently accepted telomere architecture. Many of the known TRF1 or TRF2 interacting proteins were also identified. Moreover, putative associating partners identified for each of the three core components fell into functional categories such as DNA damage repair, ubiquitination, chromosome cohesion, chromatin modification/remodeling, DNA replication, cell cycle and transcription regulation, nucleotide metabolism, RNA processing, and nuclear transport. These putative protein-protein associations may participate in different biological processes at telomeres or, intriguingly, outside telomeres.« less
Intergenic Variable-Number Tandem-Repeat Polymorphism Upstream of rocA Alters Toxin Production and Enhances Virulence in Streptococcus pyogenes.

PubMed

Zhu, Luchang; Olsen, Randall J; Horstmann, Nicola; Shelburne, Samuel A; Fan, Jia; Hu, Ye; Musser, James M

2016-07-01

Variable-number tandem-repeat (VNTR) polymorphisms are ubiquitous in bacteria. However, only a small fraction of them has been functionally studied. Here, we report an intergenic VNTR polymorphism that confers an altered level of toxin production and increased virulence in Streptococcus pyogenes The nature of the polymorphism is a one-unit deletion in a three-tandem-repeat locus upstream of the rocA gene encoding a sensor kinase. S. pyogenes strains with this type of polymorphism cause human infection and produce significantly larger amounts of the secreted cytotoxins S. pyogenes NADase (SPN) and streptolysin O (SLO). Using isogenic mutant strains, we demonstrate that deleting one or more units of the tandem repeats abolished RocA production, reduced CovR phosphorylation, derepressed multiple CovR-regulated virulence factors (such as SPN and SLO), and increased virulence in a mouse model of necrotizing fasciitis. The phenotypic effect of the VNTR polymorphism was nearly the same as that of inactivating the rocA gene. In summary, we identified and characterized an intergenic VNTR polymorphism in S. pyogenes that affects toxin production and virulence. These new findings enhance understanding of rocA biology and the function of VNTR polymorphisms in S. pyogenes. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Protein Sequencing with Tandem Mass Spectrometry

NASA Astrophysics Data System (ADS)

Ziady, Assem G.; Kinter, Michael

The recent introduction of electrospray ionization techniques that are suitable for peptides and whole proteins has allowed for the design of mass spectrometric protocols that provide accurate sequence information for proteins. The advantages gained by these approaches over traditional Edman Degradation sequencing include faster analysis and femtomole, sometimes attomole, sensitivity. The ability to efficiently identify proteins has allowed investigators to conduct studies on their differential expression or modification in response to various treatments or disease states. In this chapter, we discuss the use of electrospray tandem mass spectrometry, a technique whereby protein-derived peptides are subjected to fragmentation in the gas phase, revealing sequence information for the protein. This powerful technique has been instrumental for the study of proteins and markers associated with various disorders, including heart disease, cancer, and cystic fibrosis. We use the study of protein expression in cystic fibrosis as an example.
A Dynamic Tandem Repeat in Monocotyledons Inferred from a Comparative Analysis of Chloroplast Genomes in Melanthiaceae.

PubMed

Do, Hoang Dang Khoa; Kim, Joo-Hwan

2017-01-01

Chloroplast genomes (cpDNA) are highly valuable resources for evolutionary studies of angiosperms, since they are highly conserved, are small in size, and play critical roles in plants. Slipped-strand mispairing (SSM) was assumed to be a mechanism for generating repeat units in cpDNA. However, research on the employment of different small repeated sequences through SSM events, which may induce the accumulation of distinct types of repeats within the same region in cpDNA, has not been documented. Here, we sequenced two chloroplast genomes from the endemic species Heloniopsis tubiflora (Korea) and Xerophyllum tenax (USA) to cover the gap between molecular data and explore "hot spots" for genomic events in Melanthiaceae. Comparative analysis of 23 complete cpDNA sequences revealed that there were different stages of deletion in the rps16 region across the Melanthiaceae. Based on the partial or complete loss of rps16 gene in cpDNA, we have firstly reported potential molecular markers for recognizing two sections ( Veratrum and Fuscoveratrum ) of Veratrum . Melathiaceae exhibits a significant change in the junction between large single copy and inverted repeat regions, ranging from trnH_GUG to a part of rps3 . Our results show an accumulation of tandem repeats in the rpl23-ycf2 regions of cpDNAs. Small conserved sequences exist and flank tandem repeats in further observation of this region across most of the examined taxa of Liliales. Therefore, we propose three scenarios in which different small repeated sequences were used during SSM events to generate newly distinct types of repeats. Occasionally, prior to the SSM process, point mutation event and double strand break repair occurred and induced the formation of initial repeat units which are indispensable in the SSM process. SSM may have likely occurred more frequently for short repeats than for long repeat sequences in tribe Parideae (Melanthiaceae, Liliales). Collectively, these findings add new evidence of dynamic
Genetic diversity of Y-short tandem repeats in Chinese native cattle breeds.

PubMed

Xin, Y P; Zan, L S; Liu, Y F; Tian, W Q; Wang, H B; Cheng, G; Li, A N; Yang, W C

2014-11-14

The aim of this study is to use Y-chromosome gene polymorphism method to investigate regional differences in genetic variation and population evolution history of the Chinese native cattle breeds. Six Y-chromosome short tandem repeat (Y-STR) loci (UMN0929, UMN0108, UMN0920, INRA124, UMN2404, and UMN0103) were analyzed using 1016 healthy and heterogenetic males and 90 females of 9 native cattle breeds (Qinchuan, Jinnan, Zaosheng, Luxi, Nanyang, Jiaxian, Dabieshan, Yanbian, and Menggu) in China. Allele frequency and gene diversity were calculated for the various populations. The results indicated that Y-STRs in the 6 loci have polymorphisms and genetic diversity in Chinese cattle populations. The genetic diversity analysis revealed that the Chinese cattle populations have a close genetic relationship. The analysis of INRA124, UMN2404, and UMN0103 loci revealed the original history of Chinese cattle because of which cattle belonging to Bos taurus or Bos indicus could be determined. Interestingly, a declining zebu introgression was displayed from South to North and from East to West in the Chinese geographical distribution, which implied that cattle population from various regions of China had been subjected to somewhat different evolutionary history. This conclusion supported other evidences such as earlier archaeological, historical research, and blood protein polymorphism analysis.
Tandem neopentyl glycol maltosides (TNMs) for membrane protein stabilisation†

PubMed Central

Bae, Hyoung Eun; Mortensen, Jonas S.; Ribeiro, Orquidea; Du, Yang; Ehsan, Muhammad; Kobilka, Brian K.; Loland, Claus J.; Byrne, Bernadette

2017-01-01

A novel class of detergents, designated tandem neopentyl glycol maltosides (TNMs), were evaluated with four target membrane proteins. The best detergent varied depending on the target, but TNM-C12L and TNM-C11S were notable for their ability to confer increased membrane protein stability compared to DDM. These agents have potential for use in membrane protein research. PMID:27711401
Tandem neopentyl glycol maltosides (TNMs) for membrane protein stabilisation.

PubMed

Bae, Hyoung Eun; Mortensen, Jonas S; Ribeiro, Orquidea; Du, Yang; Ehsan, Muhammad; Kobilka, Brian K; Loland, Claus J; Byrne, Bernadette; Chae, Pil Seok

2016-10-04

A novel class of detergents, designated tandem neopentyl glycol maltosides (TNMs), were evaluated with four target membrane proteins. The best detergent varied depending on the target, but TNM-C12L and TNM-C11S were notable for their ability to confer increased membrane protein stability compared to DDM. These agents have potential for use in membrane protein research.
Effect of Repeat Copy Number on Variable-Number Tandem Repeat Mutations in Escherichia coli O157:H7

PubMed Central

Vogler, Amy J.; Keys, Christine; Nemoto, Yoshimi; Colman, Rebecca E.; Jay, Zack; Keim, Paul

2006-01-01

Variable-number tandem repeat (VNTR) loci have shown a remarkable ability to discriminate among isolates of the recently emerged clonal pathogen Escherichia coli O157:H7, making them a very useful molecular epidemiological tool. However, little is known about the rates at which these sequences mutate, the factors that affect mutation rates, or the mechanisms by which mutations occur at these loci. Here, we measure mutation rates for 28 VNTR loci and investigate the effects of repeat copy number and mismatch repair on mutation rate using in vitro-generated populations for 10 E. coli O157:H7 strains. We find single-locus rates as high as 7.0 × 10−4 mutations/generation and a combined 28-locus rate of 6.4 × 10−4 mutations/generation. We observed single- and multirepeat mutations that were consistent with a slipped-strand mispairing mutation model, as well as a smaller number of large repeat copy number mutations that were consistent with recombination-mediated events. Repeat copy number within an array was strongly correlated with mutation rate both at the most mutable locus, O157-10 (r2 = 0.565, P = 0.0196), and across all mutating loci. The combined locus model was significant whether locus O157-10 was included (r2 = 0.833, P < 0.0001) or excluded (r2 = 0.452, P < 0.0001) from the analysis. Deficient mismatch repair did not affect mutation rate at any of the 28 VNTRs with repeat unit sizes of >5 bp, although a poly(G) homomeric tract was destabilized in the mutS strain. Finally, we describe a general model for VNTR mutations that encompasses insertions and deletions, single- and multiple-repeat mutations, and their relative frequencies based upon our empirical mutation rate data. PMID:16740932
Effect of repeat copy number on variable-number tandem repeat mutations in Escherichia coli O157:H7.

PubMed

Vogler, Amy J; Keys, Christine; Nemoto, Yoshimi; Colman, Rebecca E; Jay, Zack; Keim, Paul

2006-06-01

Variable-number tandem repeat (VNTR) loci have shown a remarkable ability to discriminate among isolates of the recently emerged clonal pathogen Escherichia coli O157:H7, making them a very useful molecular epidemiological tool. However, little is known about the rates at which these sequences mutate, the factors that affect mutation rates, or the mechanisms by which mutations occur at these loci. Here, we measure mutation rates for 28 VNTR loci and investigate the effects of repeat copy number and mismatch repair on mutation rate using in vitro-generated populations for 10 E. coli O157:H7 strains. We find single-locus rates as high as 7.0 x 10(-4) mutations/generation and a combined 28-locus rate of 6.4 x 10(-4) mutations/generation. We observed single- and multirepeat mutations that were consistent with a slipped-strand mispairing mutation model, as well as a smaller number of large repeat copy number mutations that were consistent with recombination-mediated events. Repeat copy number within an array was strongly correlated with mutation rate both at the most mutable locus, O157-10 (r2= 0.565, P = 0.0196), and across all mutating loci. The combined locus model was significant whether locus O157-10 was included (r2= 0.833, P < 0.0001) or excluded (r2= 0.452, P < 0.0001) from the analysis. Deficient mismatch repair did not affect mutation rate at any of the 28 VNTRs with repeat unit sizes of >5 bp, although a poly(G) homomeric tract was destabilized in the mutS strain. Finally, we describe a general model for VNTR mutations that encompasses insertions and deletions, single- and multiple-repeat mutations, and their relative frequencies based upon our empirical mutation rate data.
TRStalker: an efficient heuristic for finding fuzzy tandem repeats.

PubMed

Pellegrini, Marco; Renda, M Elena; Vecchio, Alessio

2010-06-15

Genomes in higher eukaryotic organisms contain a substantial amount of repeated sequences. Tandem Repeats (TRs) constitute a large class of repetitive sequences that are originated via phenomena such as replication slippage and are characterized by close spatial contiguity. They play an important role in several molecular regulatory mechanisms, and also in several diseases (e.g. in the group of trinucleotide repeat disorders). While for TRs with a low or medium level of divergence the current methods are rather effective, the problem of detecting TRs with higher divergence (fuzzy TRs) is still open. The detection of fuzzy TRs is propaedeutic to enriching our view of their role in regulatory mechanisms and diseases. Fuzzy TRs are also important as tools to shed light on the evolutionary history of the genome, where higher divergence correlates with more remote duplication events. We have developed an algorithm (christened TRStalker) with the aim of detecting efficiently TRs that are hard to detect because of their inherent fuzziness, due to high levels of base substitutions, insertions and deletions. To attain this goal, we developed heuristics to solve a Steiner version of the problem for which the fuzziness is measured with respect to a motif string not necessarily present in the input string. This problem is akin to the 'generalized median string' that is known to be an NP-hard problem. Experiments with both synthetic and biological sequences demonstrate that our method performs better than current state of the art for fuzzy TRs and that the fuzzy TRs of the type we detect are indeed present in important biological sequences. TRStalker will be integrated in the web-based TRs Discovery Service (TReaDS) at bioalgo.iit.cnr.it. Supplementary data are available at Bioinformatics online.
Microevolution of Pandemic Vibrio parahaemolyticus Assessed by the Number of Repeat Units in Short Sequence Tandem Repeat Regions

PubMed Central

García, Katherine; Gavilán, Ronnie G.; Höfle, Manfred G.; Martínez-Urtaza, Jaime; Espejo, Romilio T.

2012-01-01

The emergence of the pandemic strain Vibrio parahaemolyticus O3:K6 in 1996 caused a large increase of diarrhea outbreaks related to seafood consumption in Southeast Asia, and later worldwide. Isolates of this strain constitutes a clonal complex, and their effectual differentiation is possible by comparison of their variable number tandem repeats (VNTRs). The differentiation of the isolates by the differences in VNTRs will allow inferring the population dynamics and microevolution of this strain but this requires knowing the rate and mechanism of VNTRs' variation. Our study of mutants obtained after serial cultivation of clones showed that mutation rates of the six VNTRs examined are on the order of 10−4 mutant per generation and that difference increases by stepwise addition of single mutations. The single stepwise mutation (SSM) was deduced because mutants with 1, 2, 3, or more repeat unit deletions or insertions follow a geometric distribution. Plausible phylogenetic trees are obtained when, according to SSM, the genetic distance between clusters with different number of repeats is assessed by the absolute differences in repeats. Using this approach, mutants originated from different isolates of pandemic V. parahaemolyticus after serial cultivation are clustered with their parental isolates. Additionally, isolates of pandemic V. parahaemolyticus from Southeast Asia, Tokyo, and northern and southern Chile are clustered according their geographical origin. The deepest split in these four populations is observed between the Tokyo and southern Chile populations. We conclude that proper phylogenetic relations and successful tracing of pandemic V. parahaemolyticus requires measuring the differences between isolates by the absolute number of repeats in the VNTRs considered. PMID:22292049
Production of monoclonal antibody, PR81, recognizing the tandem repeat region of MUC1 mucin.

PubMed

Paknejad, M; Rasaee, M J; Tehrani, F Karami; Kashanian, S; Mohagheghi, M A; Omidfar, K; Bazl, M Rajabi

2003-06-01

A monoclonal antibody (MAb) was generated by immunizing BALB/c mice with homogenized breast cancerous tissues. This antibody (PR81) was found to be of IgG(1) class and subclass, containing kappa light chain. PR81 reacted with either the membrane extracts of several breast cancerous tissues or the cell surface of some MUC1 positive cell lines (MCF-7, BT-20 and T-47D) tested by enzyme immunoassay and for MCF-7 by immunofluorescence method. PR81 also reacted with two synthetic 27 and 16-amino acid peptides, TSA-P1-24 and A-P1-15, respectively, which included the core tandem repeat sequence of MUC1. However, this antibody did not react with a synthetic 14 amino acid peptide that has no similarity with tandem repeat found in MUC1. The generated antibody had good and similar affinities (2.19 x 10(8) M(-1)) toward TSA-P1-24 and A-P1-15, which are mainly shared in the hydrophilic sequence of PDTRPAP. Through Western blot analysis of homogenized breast tissues, PR81 recognized only a major band of 250 kDa. This band is stronger in malignant tissue than benign and normal tissues.
DNA Fingerprint Analysis of Three Short Tandem Repeat (STR) Loci for Biochemistry and Forensic Science Laboratory Courses

ERIC Educational Resources Information Center

McNamara-Schroeder, Kathleen; Olonan, Cheryl; Chu, Simon; Montoya, Maria C.; Alviri, Mahta; Ginty, Shannon; Love, John J.

2006-01-01

We have devised and implemented a DNA fingerprinting module for an upper division undergraduate laboratory based on the amplification and analysis of three of the 13 short tandem repeat loci that are required by the Federal Bureau of Investigation Combined DNA Index System (FBI CODIS) data base. Students first collect human epithelial (cheek)…
Crux: Rapid Open Source Protein Tandem Mass Spectrometry Analysis

PubMed Central

2015-01-01

Efficiently and accurately analyzing big protein tandem mass spectrometry data sets requires robust software that incorporates state-of-the-art computational, machine learning, and statistical methods. The Crux mass spectrometry analysis software toolkit (http://cruxtoolkit.sourceforge.net) is an open source project that aims to provide users with a cross-platform suite of analysis tools for interpreting protein mass spectrometry data. PMID:25182276
Surface display of monkey metallothionein α tandem repeats and EGFP fusion protein on Pseudomonas putida X4 for biosorption and detection of cadmium.

PubMed

He, Xiaochuan; Chen, Wenli; Huang, Qiaoyun

2012-09-01

Monkey metallothionein α domain tandem repeats (4mMTα), which exhibit high cadmium affinity, have been displayed for the first time on the surface of a bacterium using ice nucleation protein N-domain (inaXN) protein from the Xanthomonas campestris pv (ACCC-10049) as an anchoring motif. The shuttle vector pIME, which codes for INAXN-4mMTα-EGFP fusion, was constructed and used to target 4mMTα and EGFP on the surface of Pseudomonas putida X4 (CCTCC-209319). The surface location of the INAXN-4mMTα-EGFP fusion was further verified by western blot analysis and immunofluorescence microscopy. The growth of X4 showed resistance to cadmium presence. The presence of surface-exposed 4mMTα on the engineered strains was four times higher than that of the wild-type X4. The Cd²⁺ accumulation by X4/pIME was not only four times greater than that of the original host bacterial cells but was also remarkably unaffected by the presence of Cu²⁺ and Zn²⁺. Moreover, the surface-engineered strains could effectively bind Cd²⁺ under a wide range of pH levels, from 4 to 7. P. putida X4/pIME with surface-expressed 4mMTα-EGFP had twice the cadmium binding capacity as well as 1.4 times the fluorescence as the cytoplasmic 4mMTa-EGFP. These results suggest that P. putida X4 expressing 4mMTα-EGFP with the INAXN anchor motif on the surface would be a useful tool for the remediation and biodetection of environmental cadmium contaminants.
Isolation of human simple repeat loci by hybridization selection.

PubMed

Armour, J A; Neumann, R; Gobert, S; Jeffreys, A J

1994-04-01

We have isolated short tandem repeat arrays from the human genome, using a rapid method involving filter hybridization to enrich for tri- or tetranucleotide tandem repeats. About 30% of clones from the enriched library cross-hybridize with probes containing trimeric or tetrameric tandem arrays, facilitating the rapid isolation of large numbers of clones. In an initial analysis of 54 clones, 46 different tandem arrays were identified. Analysis of these tandem repeat loci by PCR showed that 24 were polymorphic in length; substantially higher levels of polymorphism were displayed by the tetrameric repeat loci isolated than by the trimeric repeats. Primary mapping of these loci by linkage analysis showed that they derive from 17 chromosomes, including the X chromosome. We anticipate the use of this strategy for the efficient isolation of tandem repeats from other sources of genomic DNA, including DNA from flow-sorted chromosomes, and from other species.
Tandem repeats analysis for the high resolution phylogenetic analysis of Yersinia pestis

PubMed Central

Pourcel, C; André-Mazeaud, F; Neubauer, H; Ramisse, F; Vergnaud, G

2004-01-01

Background Yersinia pestis, the agent of plague, is a young and highly monomorphic species. Three biovars, each one thought to be associated with the last three Y. pestis pandemics, have been defined based on biochemical assays. More recently, DNA based assays, including DNA sequencing, IS typing, DNA arrays, have significantly improved current knowledge on the origin and phylogenetic evolution of Y. pestis. However, these methods suffer either from a lack of resolution or from the difficulty to compare data. Variable number of tandem repeats (VNTRs) provides valuable polymorphic markers for genotyping and performing phylogenetic analyses in a growing number of pathogens and have given promising results for Y. pestis as well. Results In this study we have genotyped 180 Y. pestis isolates by multiple locus VNTR analysis (MLVA) using 25 markers. Sixty-one different genotypes were observed. The three biovars were distributed into three main branches, with some exceptions. In particular, the Medievalis phenotype is clearly heterogeneous, resulting from different mutation events in the napA gene. Antiqua strains from Asia appear to hold a central position compared to Antiqua strains from Africa. A subset of 7 markers is proposed for the quick comparison of a new strain with the collection typed here. This can be easily achieved using a Web-based facility, specifically set-up for running such identifications. Conclusion Tandem-repeat typing may prove to be a powerful complement to the existing phylogenetic tools for Y. pestis. Typing can be achieved quickly at a low cost in terms of consumables, technical expertise and equipment. The resulting data can be easily compared between different laboratories. The number and selection of markers will eventually depend upon the type and aim of investigations. PMID:15186506
Interpreting short tandem repeat variations in humans using mutational constraint

PubMed Central

Gymrek, Melissa; Willems, Thomas; Reich, David; Erlich, Yaniv

2017-01-01

Identifying regions of the genome that are depleted of mutations can reveal potentially deleterious variants. Short tandem repeats (STRs), also known as microsatellites, are among the largest contributors of de novo mutations in humans. However, per-locus studies of STR mutations have been limited to highly ascertained panels of several dozen loci. Here, we harnessed bioinformatics tools and a novel analytical framework to estimate mutation parameters for each STR in the human genome by correlating STR genotypes with local sequence heterozygosity. We applied our method to obtain robust estimates of the impact of local sequence features on mutation parameters and used this to create a framework for measuring constraint at STRs by comparing observed vs. expected mutation rates. Constraint scores identified known pathogenic variants with early onset effects. Our metric will provide a valuable tool for prioritizing pathogenic STRs in medical genetics studies. PMID:28892063
rTANDEM, an R/Bioconductor package for MS/MS protein identification.

PubMed

Fournier, Frédéric; Joly Beauparlant, Charles; Paradis, René; Droit, Arnaud

2014-08-01

rTANDEM is an R/Bioconductor package that interfaces the X!Tandem protein identification algorithm. The package can run the multi-threaded algorithm on proteomic data files directly from R. It also provides functions to convert search parameters and results to/from R as well as functions to manipulate parameters and automate searches. An associated R package, shinyTANDEM, provides a web-based graphical interface to visualize and interpret the results. Together, those two packages form an entry point for a general MS/MS-based proteomic pipeline in R/Bioconductor. rTANDEM and shinyTANDEM are distributed in R/Bioconductor, http://bioconductor.org/packages/release/bioc/. The packages are under open licenses (GPL-3 and Artistice-1.0). frederic.fournier@crchuq.ulaval.ca or arnaud.droit@crchuq.ulaval.ca Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Analysis of tandem repeat units of the promoter of capsanthin/capsorubin synthase (Ccs) gene in pepper fruit.

PubMed

Tian, Shi-Lin; Li, Zheng; Li, Li; Shah, S N M; Gong, Zhen-Hui

2017-07-01

Capsanthin/capsorubin synthase ( Ccs ) gene is a key gene that regulates the synthesis of capsanthin and the development of red coloration in pepper fruits. There are three tandem repeat units in the promoter region of Ccs , but the potential effects of the number of repetitive units on the transcriptional regulation of Ccs has been unclear. In the present study, expression vectors carrying different numbers of repeat units of the Ccs promoter were constructed, and the transient expression of the β-glucuronidase ( GUS ) gene was used to detect differences in expression levels associated with the promoter fragments. These repeat fragments and the plant expression vector PBI121 containing the 35s CaMV promoter were ligated to form recombinant vectors that were transfected into Agrobacterium tumefaciens GV3101. A fluorescence spectrophotometer was used to analyze the expression associated with the various repeat units. It was concluded that the constructs containing at least one repeat were associated with GUS expression, though they did not differ from one another. This repeating unit likely plays a role in transcription and regulation of Ccs expression.
Protein Inference from the Integration of Tandem MS Data and Interactome Networks.

PubMed

Zhong, Jiancheng; Wang, Jianxing; Ding, Xiaojun; Zhang, Zhen; Li, Min; Wu, Fang-Xiang; Pan, Yi

2017-01-01

Since proteins are digested into a mixture of peptides in the preprocessing step of tandem mass spectrometry (MS), it is difficult to determine which specific protein a shared peptide belongs to. In recent studies, besides tandem MS data and peptide identification information, some other information is exploited to infer proteins. Different from the methods which first use only tandem MS data to infer proteins and then use network information to refine them, this study proposes a protein inference method named TMSIN, which uses interactome networks directly. As two interacting proteins should co-exist, it is reasonable to assume that if one of the interacting proteins is confidently inferred in a sample, its interacting partners should have a high probability in the same sample, too. Therefore, we can use the neighborhood information of a protein in an interactome network to adjust the probability that the shared peptide belongs to the protein. In TMSIN, a multi-weighted graph is constructed by incorporating the bipartite graph with interactome network information, where the bipartite graph is built with the peptide identification information. Based on multi-weighted graphs, TMSIN adopts an iterative workflow to infer proteins. At each iterative step, the probability that a shared peptide belongs to a specific protein is calculated by using the Bayes' law based on the neighbor protein support scores of each protein which are mapped by the shared peptides. We carried out experiments on yeast data and human data to evaluate the performance of TMSIN in terms of ROC, q-value, and accuracy. The experimental results show that AUC scores yielded by TMSIN are 0.742 and 0.874 in yeast dataset and human dataset, respectively, and TMSIN yields the maximum number of true positives when q-value less than or equal to 0.05. The overlap analysis shows that TMSIN is an effective complementary approach for protein inference.
Characterization of toxin-producing cyanobacteria by using an oligonucleotide probe containing a tandemly repeated heptamer.

PubMed Central

Rouhiainen, L; Sivonen, K; Buikema, W J; Haselkorn, R

1995-01-01

Cyanobacteria produce toxins that kill animals. The two main classes of cyanobacterial toxins are cyclic peptides that cause liver damage and alkaloids that block nerve transmission. Many toxin-producing strains from Finnish lakes were brought into axenic culture, and their toxins were characterized. Restriction fragment length polymorphism analysis, probing with a short tandemly repeated DNA sequence found at many locations in the chromosome of Anabaena sp. strain PCC 7120, distinguishes hepatotoxic Anabaena isolates from neurotoxin-producing strains and from Nostoc spp. PMID:7592362
Population-scale whole genome sequencing identifies 271 highly polymorphic short tandem repeats from Japanese population.

PubMed

Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao

2018-05-01

Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We have analyzed 843,473 STR loci with two to six basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to either the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we have also purified the suitable 218 STR loci with four basepair repeat units and 53 loci with five basepair repeat units both for short read sequencing and PCR based technologies, which would be candidates to the actual forensic DNA typing in Japanese population.
Short tandem repeat analysis in Japanese population.

PubMed

Hashiyada, M

2000-01-01

Short tandem repeats (STRs), known as microsatellites, are one of the most informative genetic markers for characterizing biological materials. Because of the relatively small size of STR alleles (generally 100-350 nucleotides), amplification by polymerase chain reaction (PCR) is relatively easy, affording a high sensitivity of detection. In addition, STR loci can be amplified simultaneously in a multiplex PCR. Thus, substantial information can be obtained in a single analysis with the benefits of using less template DNA, reducing labor, and reducing the contamination. We investigated 14 STR loci in a Japanese population living in Sendai by three multiplex PCR kits, GenePrint PowerPlex 1.1 and 2.2. Fluorescent STR System (Promega, Madison, WI, USA) and AmpF/STR Profiler (Perkin-Elmer, Norwalk, CT, USA). Genomic DNA was extracted using sodium dodecyl sulfate (SDS) proteinase K or Chelex 100 treatment followed by the phenol/chloroform extraction. PCR was performed according to the manufacturer's protocols. Electrophoresis was carried out on an ABI 377 sequencer and the alleles were determined by GeneScan 2.0.2 software (Perkin-Elmer). In 14 STRs loci, statistical parameters indicated a relatively high rate, and no significant deviation from Hardy-Weinberg equilibrium was detected. We apply this STR system to paternity testing and forensic casework, e.g., personal identification in rape cases. This system is an effective tool in the forensic sciences to obtain information on individual identification.
StaRProtein, A Web Server for Prediction of the Stability of Repeat Proteins

PubMed Central

Xu, Yongtao; Zhou, Xu; Huang, Meilan

2015-01-01

Repeat proteins have become increasingly important due to their capability to bind to almost any proteins and the potential as alternative therapy to monoclonal antibodies. In the past decade repeat proteins have been designed to mediate specific protein-protein interactions. The tetratricopeptide and ankyrin repeat proteins are two classes of helical repeat proteins that form different binding pockets to accommodate various partners. It is important to understand the factors that define folding and stability of repeat proteins in order to prioritize the most stable designed repeat proteins to further explore their potential binding affinities. Here we developed distance-dependant statistical potentials using two classes of alpha-helical repeat proteins, tetratricopeptide and ankyrin repeat proteins respectively, and evaluated their efficiency in predicting the stability of repeat proteins. We demonstrated that the repeat-specific statistical potentials based on these two classes of repeat proteins showed paramount accuracy compared with non-specific statistical potentials in: 1) discriminate correct vs. incorrect models 2) rank the stability of designed repeat proteins. In particular, the statistical scores correlate closely with the equilibrium unfolding free energies of repeat proteins and therefore would serve as a novel tool in quickly prioritizing the designed repeat proteins with high stability. StaRProtein web server was developed for predicting the stability of repeat proteins. PMID:25807112
Multilocus Variable-Number Tandem Repeat Typing of Mycobacterium ulcerans

PubMed Central

Ablordey, Anthony; Swings, Jean; Hubans, Christine; Chemlal, Karim; Locht, Camille; Portaels, Françoise; Supply, Philip

2005-01-01

The apparent genetic homogeneity of Mycobacterium ulcerans contributes to the poorly understood epidemiology of M. ulcerans infection. Here, we report the identification of variable number tandem repeat (VNTR) sequences as novel polymorphic elements in the genome of this species. A total of 19 potential VNTR loci identified in the closely related M. marinum genome sequence were screened in a collection of 23 M. ulcerans isolates, one Mycobacterium species referred to here as an intermediate species, and five M. marinum strains. Nine of the 19 loci were polymorphic in the three species (including the intermediate species) and revealed eight M. ulcerans and five M. marinum genotypes. The results from the VNTR analysis corroborated the genetic relationships of M. ulcerans isolates from various geographical origins, as defined by independent molecular markers. Although these results further highlight the extremely high clonal homogeneity within certain geographic regions, we report for the first time the discrimination of the two South American strains from Surinam and French Guyana. These findings support the potential of a VNTR-based genotyping method for strain discrimination within M. ulcerans and M. marinum. PMID:15814964
Revisiting the TALE repeat.

PubMed

Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng

2014-04-01

Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.
Rapid carrier screening using short tandem repeats in the phenylalanine hydroxylase gene.

PubMed

Shawky, R M; el-Aleem, K A; Rifaat, M M; el-Naggar, R L; Marzouk, G M

2002-01-01

Phenylketonuria (PKU) is an autosomal recessive genetic disorder caused by defects in the phenylalanine hydroxylase (PAH) system. Our work aimed to screen the PAH locus for the presence of potentially useful short tandem repeats (STR) as markers for carrier detection in PKU families in Egypt, and to determine the level of PAH heterozygosity within the Egyptian population. The system contains at least eight independent alleles in the Egyptian population, transmitted in a Mendelian fashion. Variations in the number of STR in the 16 families studied gave rise to polymorphisms that proved to be suitable markers for PKU carrier detection and prenatal diagnosis. The most frequent allelic fragment size in PKU patients was 246 bp (35.7%), which together with a fragment of 254 bp accounted for 60.7% of the mutant chromosomes.
Structural and biophysical properties of h-FANCI ARM repeat protein.

PubMed

Siddiqui, Mohd Quadir; Choudhary, Rajan Kumar; Thapa, Pankaj; Kulkarni, Neha; Rajpurohit, Yogendra S; Misra, Hari S; Gadewal, Nikhil; Kumar, Satish; Hasan, Syed K; Varma, Ashok K

2017-11-01

Fanconi anemia complementation groups - I (FANCI) protein facilitates DNA ICL (Inter-Cross-link) repair and plays a crucial role in genomic integrity. FANCI is a 1328 amino acids protein which contains armadillo (ARM) repeats and EDGE motif at the C-terminus. ARM repeats are functionally diverse and evolutionarily conserved domain that plays a pivotal role in protein-protein and protein-DNA interactions. Considering the importance of ARM repeats, we have explored comprehensive in silico and in vitro approach to examine folding pattern. Size exclusion chromatography, dynamic light scattering (DLS) and glutaraldehyde crosslinking studies suggest that FANCI ARM repeat exist as monomer as well as in oligomeric forms. Circular dichroism (CD) and fluorescence spectroscopy results demonstrate that protein has predominantly α- helices and well-folded tertiary structure. DNA binding was analysed using electrophoretic mobility shift assay by autoradiography. Temperature-dependent CD, Fluorescence spectroscopy and DLS studies concluded that protein unfolds and start forming oligomer from 30°C. The existence of stable portion within FANCI ARM repeat was examined using limited proteolysis and mass spectrometry. The normal mode analysis, molecular dynamics and principal component analysis demonstrated that helix-turn-helix (HTH) motif present in ARM repeat is highly dynamic and has anti-correlated motion. Furthermore, FANCI ARM repeat has HTH structural motif which binds to double-stranded DNA.
REPPER—repeats and their periodicities in fibrous proteins

PubMed Central

Gruber, Markus; Söding, Johannes; Lupas, Andrei N.

2005-01-01

REPPER (REPeats and their PERiodicities) is an integrated server that detects and analyzes regions with short gapless repeats in protein sequences or alignments. It finds periodicities by Fourier Transform (FTwin) and internal similarity analysis (REPwin). FTwin assigns numerical values to amino acids that reflect certain properties, for instance hydrophobicity, and gives information on corresponding periodicities. REPwin uses self-alignments and displays repeats that reveal significant internal similarities. Both programs use a sliding window to ensure that different periodic regions within the same protein are detected independently. FTwin and REPwin are complemented by secondary structure prediction (PSIPRED) and coiled coil prediction (COILS), making the server a versatile analysis tool for sequences of fibrous proteins. REPPER is available at . PMID:15980460
Algorithm to find distant repeats in a single protein sequence

PubMed Central

Banerjee, Nirjhar; Sarani, Rangarajan; Ranjani, Chellamuthu Vasuki; Sowmiya, Govindaraj; Michael, Daliah; Balakrishnan, Narayanasamy; Sekar, Kanagaraj

2008-01-01

Distant repeats in protein sequence play an important role in various aspects of protein analysis. A keen analysis of the distant repeats would enable to establish a firm relation of the repeats with respect to their function and three-dimensional structure during the evolutionary process. Further, it enlightens the diversity of duplication during the evolution. To this end, an algorithm has been developed to find all distant repeats in a protein sequence. The scores from Point Accepted Mutation (PAM) matrix has been deployed for the identification of amino acid substitutions while detecting the distant repeats. Due to the biological importance of distant repeats, the proposed algorithm will be of importance to structural biologists, molecular biologists, biochemists and researchers involved in phylogenetic and evolutionary studies. PMID:19052663
A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

PubMed Central

2010-01-01

Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the repeat may be disseminated by
Production of novel recombinant single-domain antibodies against tandem repeat region of MUC1 mucin.

PubMed

Rahbarizadeh, F; Rasaee, M J; Forouzandeh Moghadam, M; Allameh, A A; Sadroddiny, E

2004-06-01

Recently, the existence of "heavy-chain" antibody in Camelidae has been described. However, as yet there is no data on the binding of this type of antibody to peptides. In addition, there was not any report of production of single-domain antibodies in two-humped camels (Camelus bactrianus). In the present study, these questions are addressed. We showed the feasibility of immunizing old world camels, cloning the repertoire of the variable domain of their heavy-chain antibodies, panning and selection, leading to the successful identification of minimum-sized antigen binders. Antigen-specific fragments of the heavy-chain IgGs (V(HH)) are of great interest in biotechnology because they are very stable, highly soluble, and react specifically and with high affinity to the antigens. In this study, we immunized two camels (Camelus dromedarius and Camelus bactrianus) with homogenized cancerous tissues, synthetic peptide, and human milk fat globule membrane (HMFG), and generated two V(HH) libraries displayed on phage particles. Some single-domain antibody fragments have been isolated that specifically recognize the tandem repeat region of MUC1. The camels' single-domain V(HH) harbor the original, intact antigen binding site and reacted specifically and with high affinity to the tandem repeat region of MUC1. Indeed soluble, specific antigen binders and good affinities (in the range of 0.2 x 10(9) M(-1) to 0.6 x 10(9) M(-1)) were identified from these libraries. This is the first example of the isolation of camel anti-peptide V(HH) domains.
Research Update: Programmable tandem repeat proteins inspired by squid ring teeth

NASA Astrophysics Data System (ADS)

Pena-Francesch, Abdon; Domeradzka, Natalia E.; Jung, Huihun; Barbu, Benjamin; Vural, Mert; Kikuchi, Yusuke; Allen, Benjamin D.; Demirel, Melik C.

2018-01-01

Cephalopods have evolved many interesting features that can serve as inspiration. Repetitive squid ring teeth (SRT) proteins from cephalopods exhibit properties such as strength, self-healing, and biocompatibility. These proteins have been engineered to design novel adhesives, self-healing textiles, and the assembly of 2d-layered materials. Compared to conventional polymers, repetitive proteins are easy to modify and can assemble in various morphologies and molecular architectures. This research update discusses the molecular biology and materials science of polypeptides inspired by SRT proteins, their properties, and perspectives for future applications.
Crystal structures of ryanodine receptor SPRY1 and tandem-repeat domains reveal a critical FKBP12 binding determinant

NASA Astrophysics Data System (ADS)

Yuchi, Zhiguang; Yuen, Siobhan M. Wong King; Lau, Kelvin; Underhill, Ainsley Q.; Cornea, Razvan L.; Fessenden, James D.; van Petegem, Filip

2015-08-01

Ryanodine receptors (RyRs) form calcium release channels located in the membranes of the sarcoplasmic and endoplasmic reticulum. RyRs play a major role in excitation-contraction coupling and other Ca2+-dependent signalling events, and consist of several globular domains that together form a large assembly. Here we describe the crystal structures of the SPRY1 and tandem-repeat domains at 1.2-1.5 Å resolution, which reveal several structural elements not detected in recent cryo-EM reconstructions of RyRs. The cryo-EM studies disagree on the position of SPRY domains, which had been proposed based on homology modelling. Computational docking of the crystal structures, combined with FRET studies, show that the SPRY1 domain is located next to FK506-binding protein (FKBP). Molecular dynamics flexible fitting and mutagenesis experiments suggest a hydrophobic cluster within SPRY1 that is crucial for FKBP binding. A RyR1 disease mutation, N760D, appears to directly impact FKBP binding through interfering with SPRY1 folding.
Multiple-locus variable-number tandem repeat analysis of Salmonella Enteritidis isolates from human and non-human sources using a single multiplex PCR

PubMed Central

Cho, Seongbeom; Boxrud, David J; Bartkus, Joanne M; Whittam, Thomas S; Saeed, Mahdi

2007-01-01

Simplified multiple-locus variable-number tandem repeat analysis (MLVA) was developed using one-shot multiplex PCR for seven variable-number tandem repeats (VNTR) markers with high diversity capacity. MLVA, phage typing, and PFGE methods were applied on 34 diverse Salmonella Enteritidis isolates from human and non-human sources. MLVA detected allelic variations that helped to classify the S. Enteritidis isolates into more evenly distributed subtypes than other methods. MLVA-based S. Enteritidis clonal groups were largely associated with sources of the isolates. Nei's diversity indices for polymorphism ranged from 0.25 to 0.70 for seven VNTR loci markers. Based on Simpson's and Shannon's diversity indices, MLVA had a higher discriminatory power than pulsed field gel electrophoresis (PFGE), phage typing, or multilocus enzyme electrophoresis. Therefore, MLVA may be used along with PFGE to enhance the effectiveness of the molecular epidemiologic investigation of S. Enteritidis infections. PMID:17692097
The elastic free energy of a tandem modular protein under force.

PubMed

Valle-Orero, Jessica; Eckels, Edward C; Stirnemann, Guillaume; Popa, Ionel; Berkovich, Ronen; Fernandez, Julio M

2015-05-01

Recent studies have provided a theoretical framework for including entropic elasticity in the free energy landscape of proteins under mechanical force. Accounting for entropic elasticity using polymer physics models has helped explain the hopping behavior seen in single molecule experiments in the low force regime. Here, we expand on the construction of the free energy of a single protein domain under force proposed by Berkovich et al. to provide a free energy landscape for N tandem domains along a continuous polypeptide. Calculation of the free energy of individual domains followed by their concatenation provides a continuous free energy landscape whose curvature is dominated by the worm-like chain at forces below 20 pN. We have validated our free energy model using Brownian dynamics and reproduce key features of protein folding. This free energy model can predict the effects of changes in the elastic properties of a multidomain protein as a consequence of biological modifications such as phosphorylation or the formation of disulfide bonds. This work lays the foundations for the modeling of tissue elasticity, which is largely determined by the properties of tandem polyproteins. Copyright © 2015. Published by Elsevier Inc.
Variability of CAG tandem repeats in exon 1 of the androgen receptor gene is not related with dog intersexuality.

PubMed

Nowacka-Woszuk, J; Switonski, M

2010-02-01

Numerous mutations of the human androgen receptor (AR) gene cause an intersexual phenotype, called the androgen insensitivity syndrome. The intersexual phenotype is also quite often diagnosed in dogs. The aim of this study was to conduct a comparative analysis of the entire coding sequence (eight exons) of the AR gene in healthy and four intersex dogs, as well as in three other canids (the red fox, arctic fox and Chinese raccoon dog). The coding sequence of the studied species appeared to be conserved (similarity above 97%) and polymorphism was found in exon 1 only. Altogether, 2 SNPs were identified in healthy dogs, 14 in red foxes, 16 in arctic foxes and 6 were found in Chinese raccoon dogs, respectively. Moreover, a variable number of tandem repeats (CAG and CAA), encoding an array of glutamines, was also observed in this exon. The CAA codon numbers were invariable within species, but the CAG repeats were polymorphic. The highest number of the CAG and CAA repeats was found in dogs (from 40 to 42) and the observed variability was similar in intersex and healthy dogs. In the other canids the variability fell within the following ranges: 29-37 (red fox), 37-39 (arctic fox) and 29-32 (Chinese raccoon dog). In addition, a polymorphic microsatellite marker in intron 2 was found in the dog, red fox and Chinese raccoon dog. It was concluded that the polymorphism level of the AR gene in the dog was lower than in the other canids and none of the detected polymorphisms, including variability of the CAG tandem repeats, could be related with the intersexual phenotype of the studied dogs.
Deletion of internal structured repeats increases the stability of a leucine-rich repeat protein, YopM

PubMed Central

Barrick, Doug

2011-01-01

Mapping the stability distributions of proteins in their native folded states provides a critical link between structure, thermodynamics, and function. Linear repeat proteins have proven more amenable to this kind of mapping than globular proteins. C-terminal deletion studies of YopM, a large, linear leucine-rich repeat (LRR) protein, show that stability is distributed quite heterogeneously, yet a high level of cooperativity is maintained [1]. Key components of this distribution are three interfaces that strongly stabilize adjacent sequences, thereby maintaining structural integrity and promoting cooperativity. To better understand the distribution of interaction energy around these critical interfaces, we studied internal (rather than terminal) deletions of three LRRs in this region, including one of these stabilizing interfaces. Contrary to our expectation that deletion of structured repeats should be destabilizing, we find that internal deletion of folded repeats can actually stabilize the native state, suggesting that these repeats are destabilizing, although paradoxically, they are folded in the native state. We identified two residues within this destabilizing segment that deviate from the consensus sequence at a position that normally forms a stacked leucine ladder in the hydrophobic core. Replacement of these nonconsensus residues with leucine is stabilizing. This stability enhancement can be reproduced in the context of nonnative interfaces, but it requires an extended hydrophobic core. Our results demonstrate that different LRRs vary widely in their contribution to stability, and that this variation is context-dependent. These two factors are likely to determine the types of rearrangements that lead to folded, functional proteins, and in turn, are likely to restrict the pathways available for the evolution of linear repeat proteins. PMID:21764506

Distribution and Evolution of Yersinia Leucine-Rich Repeat Proteins

PubMed Central

Hu, Yueming; Huang, He; Hui, Xinjie; Cheng, Xi; White, Aaron P.

2016-01-01

Leucine-rich repeat (LRR) proteins are widely distributed in bacteria, playing important roles in various protein-protein interaction processes. In Yersinia, the well-characterized type III secreted effector YopM also belongs to the LRR protein family and is encoded by virulence plasmids. However, little has been known about other LRR members encoded by Yersinia genomes or their evolution. In this study, the Yersinia LRR proteins were comprehensively screened, categorized, and compared. The LRR proteins encoded by chromosomes (LRR1 proteins) appeared to be more similar to each other and different from those encoded by plasmids (LRR2 proteins) with regard to repeat-unit length, amino acid composition profile, and gene expression regulation circuits. LRR1 proteins were also different from LRR2 proteins in that the LRR1 proteins contained an E3 ligase domain (NEL domain) in the C-terminal region or an NEL domain-encoding nucleotide relic in flanking genomic sequences. The LRR1 protein-encoding genes (LRR1 genes) varied dramatically and were categorized into 4 subgroups (a to d), with the LRR1a to -c genes evolving from the same ancestor and LRR1d genes evolving from another ancestor. The consensus and ancestor repeat-unit sequences were inferred for different LRR1 protein subgroups by use of a maximum parsimony modeling strategy. Structural modeling disclosed very similar repeat-unit structures between LRR1 and LRR2 proteins despite the different unit lengths and amino acid compositions. Structural constraints may serve as the driving force to explain the observed mutations in the LRR regions. This study suggests that there may be functional variation and lays the foundation for future experiments investigating the functions of the chromosomally encoded LRR proteins of Yersinia. PMID:27217422
aPPRove: An HMM-Based Method for Accurate Prediction of RNA-Pentatricopeptide Repeat Protein Binding Events

PubMed Central

Harrison, Thomas; Ruiz, Jaime; Sloan, Daniel B.; Ben-Hur, Asa; Boucher, Christina

2016-01-01

Pentatricopeptide repeat containing proteins (PPRs) bind to RNA transcripts originating from mitochondria and plastids. There are two classes of PPR proteins. The P class contains tandem P-type motif sequences, and the PLS class contains alternating P, L and S type sequences. In this paper, we describe a novel tool that predicts PPR-RNA interaction; specifically, our method, which we call aPPRove, determines where and how a PLS-class PPR protein will bind to RNA when given a PPR and one or more RNA transcripts by using a combinatorial binding code for site specificity proposed by Barkan et al. Our results demonstrate that aPPRove successfully locates how and where a PPR protein belonging to the PLS class can bind to RNA. For each binding event it outputs the binding site, the amino-acid-nucleotide interaction, and its statistical significance. Furthermore, we show that our method can be used to predict binding events for PLS-class proteins using a known edit site and the statistical significance of aligning the PPR protein to that site. In particular, we use our method to make a conjecture regarding an interaction between CLB19 and the second intronic region of ycf3. The aPPRove web server can be found at www.cs.colostate.edu/~approve. PMID:27560805
Histone and ribosomal RNA repetitive gene clusters of the boll weevil are linked in a tandem array.

PubMed

Roehrdanz, R; Heilmann, L; Senechal, P; Sears, S; Evenson, P

2010-08-01

Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and the clusters are tandemly repeated. Ribosomal DNA contains a cluster of the rRNA sequences 18S, 5.8S and 28S. The rRNA genes are separated by the spacers ITS1, ITS2 and IGS. This cluster is also tandemly repeated. We found that the ribosomal RNA repeat unit of at least two species of Anthonomine weevils, Anthonomus grandis and Anthonomus texanus (Coleoptera: Curculionidae), is interspersed with a block containing the histone gene quintet. The histone genes are situated between the rRNA 18S and 28S genes in what is known as the intergenic spacer region (IGS). The complete reiterated Anthonomus grandis histone-ribosomal sequence is 16,248 bp.
Whole genome evaluation of tandem repeat polymorphisms between two pathogenically similar strains of Xylella fastidiosa isolated from almond and grape in California

USDA-ARS?s Scientific Manuscript database

Whole genome tandem repeat polymorphisms were evaluated between two closely related Xylella fastidiosa strains, M23 and Temecula1, both cause almond leaf scorch disease (ALSD) and grape Pierce’s disease (PD) in California. Strain M23 was isolated from almond and the genome was sequenced in this stu...
De novo identification of highly diverged protein repeats by probabilistic consistency.

PubMed

Biegert, A; Söding, J

2008-03-15

An estimated 25% of all eukaryotic proteins contain repeats, which underlines the importance of duplication for evolving new protein functions. Internal repeats often correspond to structural or functional units in proteins. Methods capable of identifying diverged repeated segments or domains at the sequence level can therefore assist in predicting domain structures, inferring hypotheses about function and mechanism, and investigating the evolution of proteins from smaller fragments. We present HHrepID, a method for the de novo identification of repeats in protein sequences. It is able to detect the sequence signature of structural repeats in many proteins that have not yet been known to possess internal sequence symmetry, such as outer membrane beta-barrels. HHrepID uses HMM-HMM comparison to exploit evolutionary information in the form of multiple sequence alignments of homologs. In contrast to a previous method, the new method (1) generates a multiple alignment of repeats; (2) utilizes the transitive nature of homology through a novel merging procedure with fully probabilistic treatment of alignments; (3) improves alignment quality through an algorithm that maximizes the expected accuracy; (4) is able to identify different kinds of repeats within complex architectures by a probabilistic domain boundary detection method and (5) improves sensitivity through a new approach to assess statistical significance. Server: http://toolkit.tuebingen.mpg.de/hhrepid; Executables: ftp://ftp.tuebingen.mpg.de/pub/protevo/HHrepID
Multilocus variable-number tandem repeat analysis distinguishes outbreak and sporadic Escherichia coli O157:H7 isolates.

PubMed

Noller, Anna C; McEllistrem, M Catherine; Pacheco, Antonio G F; Boxrud, David J; Harrison, Lee H

2003-12-01

Escherichia coli O157:H7 is a major cause of food-borne illness in the United States. Outbreak detection involves traditional epidemiological methods and routine molecular subtyping by pulsed-field gel electrophoresis (PFGE). PFGE is labor-intensive, and the results are difficult to analyze and not easily transferable between laboratories. Multilocus variable-number tandem repeat (VNTR) analysis (MLVA) is a fast, portable method that analyzes multiple VNTR loci, which are areas of the bacterial genome that evolve quickly. Eighty isolates, including 21 isolates from five epidemiologically well-characterized outbreaks from Pennsylvania and Minnesota, were analyzed by PFGE and MLVA. Strains in PFGE clusters were defined as strains that differed by less than or equal to one band by using XbaI and the confirmatory enzyme SpeI. MLVA was performed by comparing the number of tandem repeats at seven loci. From 6 to 30 alleles were found at the seven loci, resulting in 64 MLVA types among the 80 isolates. MLVA correctly identified the isolates from all five outbreaks if only a single-locus variant was allowed. MLVA differentiated strains with unique PFGE types. Additionally, MLVA discriminated strains within PFGE-defined clusters that were not known to be part of an outbreak. In addition to being a simple and validated method for E. coli O157:H7 outbreak detection, MLVA appears to have a sensitivity equal to that of PFGE and a specificity superior to that of PFGE.
Novel variable number of tandem repeats of gibbon MAOA gene and its evolutionary significance.

PubMed

Choi, Yuri; Jung, Yi-Deun; Ayarpadikannan, Selvam; Koga, Akihiko; Imai, Hiroo; Hirai, Hirohisa; Roos, Christian; Kim, Heui-Soo

2014-08-01

Variable number of tandem repeats (VNTRs) are scattered throughout the primate genome, and genetic variation of these VNTRs have been accumulated during primate radiation. Here, we analyzed VNTRs upstream of the monoamine oxidase A (MAOA) gene in 11 different gibbon species. An abundance of truncated VNTR sequences and copy number differences were observed compared to those of human VNTR sequences. To better understand the biological role of these VNTRs, a luciferase activity assay was conducted and results indicated that selected VNTR sequences of the MAOA gene from human and three different gibbon species (Hylobates klossii, Hylobates lar, and Nomascus concolor) showed silencing ability. Together, these data could be useful for understanding the evolutionary history and functional significance of MAOA VNTR sequences in gibbon species.
Tandem Affinity Purification of Protein Complexes from Eukaryotic Cells.

PubMed

Ma, Zheng; Fung, Victor; D'Orso, Iván

2017-01-26

The purification of active protein-protein and protein-nucleic acid complexes is crucial for the characterization of enzymatic activities and de novo identification of novel subunits and post-translational modifications. Bacterial systems allow for the expression and purification of a wide variety of single polypeptides and protein complexes. However, this system does not enable the purification of protein subunits that contain post-translational modifications (e.g., phosphorylation and acetylation), and the identification of novel regulatory subunits that are only present/expressed in the eukaryotic system. Here, we provide a detailed description of a novel, robust, and efficient tandem affinity purification (TAP) method using STREP- and FLAG-tagged proteins that facilitates the purification of protein complexes with transiently or stably expressed epitope-tagged proteins from eukaryotic cells. This protocol can be applied to characterize protein complex functionality, to discover post-translational modifications on complex subunits, and to identify novel regulatory complex components by mass spectrometry. Notably, this TAP method can be applied to study protein complexes formed by eukaryotic or pathogenic (viral and bacterial) components, thus yielding a wide array of downstream experimental opportunities. We propose that researchers working with protein complexes could utilize this approach in many different ways.
Submegabase Clusters of Unstable Tandem Repeats Unique to the Tla Region of Mouse T Haplotypes

PubMed Central

Uehara, H.; Ebersole, T.; Bennett, D.; Artzt, K.

1990-01-01

We describe here the identification and genomic organization of mouse t haplotype-specific elements (TSEs) 7.8 and 5.8 kb in length. The TSEs exist as submegabase-long clusters of tandem repeats localized in the Tla region of the major histocompatibility complex of all t haplotype chromosomes examined. In contrast, no such clusters were detected among 12 inbred strains of Mus musculus and other Mus species; thus, clusters of TSEs represent the first absolutely qualitative difference between t haplotypes and wild-type chromosomes. Pulsed field gel electrophoresis shows that the number of clusters, and the number of repeats in each cluster are extremely variable. Dramatic quantitative differences of TSEs uniquely distinguish every independent t haplotype from any other. The complete nucleotide sequence of one 7.8-kb TSE reveals significant homology to the ETn (a major transcript in the early embryo of the mouse), and some homologies to intracisternal A-particles and the mammary tumor virus env gene. Apart from the diagnostic relevance to t haplotypes, evolutionary and functional significances are discussed with respect to chromosome structure and genetic recombination. PMID:2076812
Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights

PubMed Central

Kumar, Amitha Sampath; Sowpati, Divya Tej; Mishra, Rakesh K.

2016-01-01

Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility, stability and act as linker elements between domains. Comparison of SAAR containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. PMID:27893794
Mimosoid legume plastome evolution: IR expansion, tandem repeat expansions, and accelerated rate of evolution in clpP.

PubMed

Dugas, Diana V; Hernandez, David; Koenen, Erik J M; Schwarz, Erika; Straub, Shannon; Hughes, Colin E; Jansen, Robert K; Nageswara-Rao, Madhugiri; Staats, Martijn; Trujillo, Joshua T; Hajrah, Nahid H; Alharbi, Njud S; Al-Malki, Abdulrahman L; Sabir, Jamal S M; Bailey, C Donovan

2015-11-23

The Leguminosae has emerged as a model for studying angiosperm plastome evolution because of its striking diversity of structural rearrangements and sequence variation. However, most of what is known about legume plastomes comes from few genera representing a subset of lineages in subfamily Papilionoideae. We investigate plastome evolution in subfamily Mimosoideae based on two newly sequenced plastomes (Inga and Leucaena) and two recently published plastomes (Acacia and Prosopis), and discuss the results in the context of other legume and rosid plastid genomes. Mimosoid plastomes have a typical angiosperm gene content and general organization as well as a generally slow rate of protein coding gene evolution, but they are the largest known among legumes. The increased length results from tandem repeat expansions and an unusual 13 kb IR-SSC boundary shift in Acacia and Inga. Mimosoid plastomes harbor additional interesting features, including loss of clpP intron1 in Inga, accelerated rates of evolution in clpP for Acacia and Inga, and dN/dS ratios consistent with neutral and positive selection for several genes. These new plastomes and results provide important resources for legume comparative genomics, plant breeding, and plastid genetic engineering, while shedding further light on the complexity of plastome evolution in legumes and angiosperms.
A designed repeat protein as an affinity capture reagent

PubMed Central

Speltz, Elizabeth B.; Brown, Rebecca S.H.; Hajare, Holly S.; Schlieker, Christian; Regan, Lynne

2017-01-01

Repeat proteins are an attractive target for protein engineering and design. We have focused our attention on the design and engineering of one particular class - tetratricopeptide repeat (TPR) proteins. In previous work we have shown that the structure and stability of TPR proteins can be manipulated in a rational fashion [Cortajarena 2011; Main 2003]. Building on those studies, we have designed and characterized a number of different peptide-binding TPR modules and we have also assembled these modules into supramolecular arrays [Cortajarena 2009; Cortajarena 2008; Jackrel 2009; Kajander 2007]. Here we focus on the development of one such TPR-peptide interaction for a practical application – affinity purification. We illustrate the general utility of our designed protein interaction. Furthermore, this example highlights how basic research on protein-peptide interactions can lead to the development of novel reagents with important practical applications. PMID:26517897
Effective application of multiple locus variable number of tandem repeats analysis to tracing Staphylococcus aureus in food-processing environment.

PubMed

Rešková, Z; Koreňová, J; Kuchta, T

2014-04-01

A total of 256 isolates of Staphylococcus aureus were isolated from 98 samples (34 swabs and 64 food samples) obtained from small or medium meat- and cheese-processing plants in Slovakia. The strains were genotypically characterized by multiple locus variable number of tandem repeats analysis (MLVA), involving multiplex polymerase chain reaction (PCR) with subsequent separation of the amplified DNA fragments by an automated flow-through gel electrophoresis. With the panel of isolates, MLVA produced 31 profile types, which was a sufficient discrimination to facilitate the description of spatial and temporal aspects of contamination. Further data on MLVA discrimination were obtained by typing a subpanel of strains by multiple locus sequence typing (MLST). MLVA coupled to automated electrophoresis proved to be an effective, comparatively fast and inexpensive method for tracing S. aureus contamination of food-processing factories. Subspecies genotyping of microbial contaminants in food-processing factories may facilitate identification of spatial and temporal aspects of the contamination. This may help to properly manage the process hygiene. With S. aureus, multiple locus variable number of tandem repeats analysis (MLVA) proved to be an effective method for the purpose, being sufficiently discriminative, yet comparatively fast and inexpensive. The application of automated flow-through gel electrophoresis to separation of DNA fragments produced by multiplex PCR helped to improve the accuracy and speed of the method. © 2013 The Society for Applied Microbiology.
Stable isotope labeling tandem mass spectrometry (SILT) to quantify protein production and clearance rates

PubMed Central

Bateman, Randall J.; Munsell, Ling Y.; Chen, Xianghong; Holtzman, David M.; Yarasheski, Kevin E.

2007-01-01

In all biological systems, protein amount is a function of the rate of production and clearance. The speed of a response to a disturbance in protein homeostasis is determined by turnover rate. Quantifying alterations in protein synthesis and clearance rates is vital to understanding disease pathogenesis (e.g., aging, inflammation). No methods exist for quantifying production and clearance rates of low abundance (femtomole) proteins in vivo. We describe a novel, mass spectrometry-based method for quantitating low abundance protein synthesis and clearance rates in vitro and in vivo in animals and humans. The utility of this method is demonstrated with amyloid-beta (Aß), an important low abundance protein involved in Alzheimer's disease pathogenesis. We used in vivo stable isotope labeling, immunoprecipitation of Aß from cerebrospinal fluid, and quantitative liquid chromatography electrospray-ionization tandem mass spectrometry (LC-ESI-tandem MS) to quantify human Aß protein production and clearance rates. The method is sensitive and specific for stable isotope labeled amino acid incorporation into CNS (± 1% accuracy). This in vivo method can be used to identify pathophysiologic changes in protein metabolism; and may serve as a biomarker for monitoring disease risk, progression, or response to novel therapeutic agents. The technique is adaptable to other macromolecules, such as carbohydrates or lipids. PMID:17383190
Four Amino Acids within a Tandem QxVx Repeat in a Predicted Extended α-Helix of the Smad-Binding Domain of Sip1 Are Necessary for Binding to Activated Smad Proteins

PubMed Central

Conidi, Andrea; van den Berghe, Veronique; Leslie, Kris; Stryjewska, Agata; Xue, Hua; Chen, Ye-Guang; Seuntjens, Eve; Huylebroeck, Danny

2013-01-01

The zinc finger transcription factor Smad-interacting protein-1 (Sip1; Zeb2, Zfhx1b) plays an important role during vertebrate embryogenesis in various tissues and differentiating cell types, and during tumorigenesis. Previous biochemical analysis suggests that interactions with several partner proteins, including TGFβ family receptor-activated Smads, regulate the activities of Sip1 in the nucleus both as a DNA-binding transcriptional repressor and activator. Using a peptide aptamer approach we mapped in Sip1 its Smad-binding domain (SBD), initially defined as a segment of 51 amino acids, to a shorter stretch of 14 amino acids within this SBD. Modelling suggests that this short SBD stretch is part of an extended α-helix that may fit the binding to a hydrophobic corridor within the MH2 domain of activated Smads. Four amino acids (two polar Q residues and two non-polar V residues) that form the tandem repeat (QxVx)2 in this 14-residue stretch were found to be crucial for binding to both TGFβ/Nodal/Activin-Smads and BMP-Smads. A full-length Sip1 with collective mutation of these Q and V residues (to A) no longer binds to Smads, while it retains its binding activity to its cognate bipartite target DNA sequence. This missense mutant Sip1(AxAx)2 provides a new molecular tool to identify SBD (in)dependent target genes in Sip1-controlled TGFβ and/or BMP (de)regulated cellular, developmental and pathological processes. PMID:24146916
Transferability of short tandem repeat markers for two wild Canid species inhabiting the Brazilian Cerrado.

PubMed

Rodrigues, F M; Telles, M P C; Resende, L V; Soares, T N; Diniz-Filho, J A F; Jácomo, A T A; Silveira, L

2006-12-13

The maned wolf (Chrysocyon brachyurus) and the crab-eating fox (Cerdocyon thous) are two wild-canid species found in the Brazilian Cerrado. We tested cross-amplification and transferability of 29 short tandem repeat primers originally developed for cattle and domestic dogs and cats on 38 individuals of each of these two species, collected in the Emas National Park, which is the largest national park in the Cerrado region. Six of these primers were successfully transferred (CSSM-038, PEZ-05, PEZ-12, LOCO-13, LOCO-15, and PEZ-20); five of which were found to be polymorphic. Genetic parameter values (number of alleles per locus, observed and expected heterozygosities, and fixation indices) were within the expected range reported for canid populations worldwide.
Variable-number-of-tandem-repeats analysis of genetic diversity in Pasteuria ramosa.

PubMed

Mouton, L; Ebert, D

2008-05-01

Variable-number-of-tandem-repeats (VNTR) markers are increasingly being used in population genetic studies of bacteria. They were recently developed for Pasteuria ramosa, an endobacterium that infects Daphnia species. In the present study, we genotyped P. ramosa in 18 infected hosts from the United Kingdom, Belgium, and two lakes in the United States using seven VNTR markers. Two Daphnia species were collected: D. magna and D. dentifera. Six loci showed length polymorphism, with as many as five alleles identified for a single locus. Similarity coefficient calculations showed that the extent of genetic variation between pairs of isolates within populations differed according to the population, but it was always less than the genetic distances among populations. Analysis of the genetic distances performed using principal component analysis revealed strong clustering by location of origin, but not by host Daphnia species. Our study demonstrated that the VNTR markers available for P. ramosa are informative in revealing genetic differences within and among populations and may therefore become an important tool for providing detailed analysis of population genetics and epidemiology.
PGLa-H tandem-repeat peptides active against multidrug resistant clinical bacterial isolates.

PubMed

Rončević, Tomislav; Gajski, Goran; Ilić, Nada; Goić-Barišić, Ivana; Tonkić, Marija; Zoranić, Larisa; Simunić, Juraj; Benincasa, Monica; Mijaković, Marijana; Tossi, Alessandro; Juretić, Davor

2017-02-01

Antimicrobial peptides (AMPs) are promising candidates for new antibiotic classes but often display an unacceptably high toxicity towards human cells. A naturally produced C-terminal fragment of PGLa, named PGLa-H, has been reported to have a very low haemolytic activity while maintaining a moderate antibacterial activity. A sequential tandem repeat of this fragment, diPGLa-H, was designed, as well as an analogue with a Val to Gly substitution at a key position. These peptides showed markedly improved in vitro bacteriostatic and bactericidal activity against both reference strains and multidrug resistant clinical isolates of Gram-negative and Gram-positive pathogens, with generally low toxicity for human cells as assessed by haemolysis, cell viability, and DNA damage assays. The glycine substitution analogue, kiadin, had a slightly better antibacterial activity and reduced haemolytic activity, which may correlate with an increased flexibility of its helical structure, as deduced using molecular dynamics simulations. These peptides may serve as useful lead compounds for developing anti-infective agents against resistant Gram-negative and Gram-positive species. Copyright © 2016 Elsevier B.V. All rights reserved.
Clustering of Tuberculosis Cases Based on Variable-Number Tandem-Repeat Typing in Relation to the Population Structure of Mycobacterium tuberculosis in the Netherlands

PubMed Central

Sloot, Rosa; Borgdorff, Martien W.; de Beer, Jessica L.; van Ingen, Jakko; Supply, Philip

2013-01-01

The population structure of 3,776 Mycobacterium tuberculosis isolates was determined using variable-number tandem-repeat (VNTR) typing. The degree of clonality was so high that a more relaxed definition of clustering cannot be applied. Among recent immigrants with non-Euro-American isolates, transmission is overestimated if based on identical VNTR patterns. PMID:23658260
Design, production and molecular structure of a new family of artificial alpha-helicoidal repeat proteins (αRep) based on thermostable HEAT-like repeats.

PubMed

Urvoas, Agathe; Guellouz, Asma; Valerio-Lepiniec, Marie; Graille, Marc; Durand, Dominique; Desravines, Danielle C; van Tilbeurgh, Herman; Desmadril, Michel; Minard, Philippe

2010-11-26

Repeat proteins have a modular organization and a regular architecture that make them attractive models for design and directed evolution experiments. HEAT repeat proteins, although very common, have not been used as a scaffold for artificial proteins, probably because they are made of long and irregular repeats. Here, we present and validate a consensus sequence for artificial HEAT repeat proteins. The sequence was defined from the structure-based sequence analysis of a thermostable HEAT-like repeat protein. Appropriate sequences were identified for the N- and C-caps. A library of genes coding for artificial proteins based on this sequence design, named αRep, was assembled using new and versatile methodology based on circular amplification. Proteins picked randomly from this library are expressed as soluble proteins. The biophysical properties of proteins with different numbers of repeats and different combinations of side chains in hypervariable positions were characterized. Circular dichroism and differential scanning calorimetry experiments showed that all these proteins are folded cooperatively and are very stable (T(m) >70 °C). Stability of these proteins increases with the number of repeats. Detailed gel filtration and small-angle X-ray scattering studies showed that the purified proteins form either monomers or dimers. The X-ray structure of a stable dimeric variant structure was solved. The protein is folded with a highly regular topology and the repeat structure is organized, as expected, as pairs of alpha helices. In this protein variant, the dimerization interface results directly from the variable surface enriched in aromatic residues located in the randomized positions of the repeats. The dimer was crystallized both in an apo and in a PEG-bound form, revealing a very well defined binding crevice and some structure flexibility at the interface. This fortuitous binding site could later prove to be a useful binding site for other low molecular mass

Identifying Protein-protein Interaction in Drosophila Adult Heads by Tandem Affinity Purification (TAP)

PubMed Central

Tian, Xiaolin; Zhu, Mingwei; Li, Long; Wu, Chunlai

2013-01-01

Genetic screens conducted using Drosophila melanogaster (fruit fly) have made numerous milestone discoveries in the advance of biological sciences. However, the use of biochemical screens aimed at extending the knowledge gained from genetic analysis was explored only recently. Here we describe a method to purify the protein complex that associates with any protein of interest from adult fly heads. This method takes advantage of the Drosophila GAL4/UAS system to express a bait protein fused with a Tandem Affinity Purification (TAP) tag in fly neurons in vivo, and then implements two rounds of purification using a TAP procedure similar to the one originally established in yeast1 to purify the interacting protein complex. At the end of this procedure, a mixture of multiple protein complexes is obtained whose molecular identities can be determined by mass spectrometry. Validation of the candidate proteins will benefit from the resource and ease of performing loss-of-function studies in flies. Similar approaches can be applied to other fly tissues. We believe that the combination of genetic manipulations and this proteomic approach in the fly model system holds tremendous potential for tackling fundamental problems in the field of neurobiology and beyond. PMID:24335807
Single-cell forensic short tandem repeat typing within microfluidic droplets.

PubMed

Geng, Tao; Novak, Richard; Mathies, Richard A

2014-01-07

A short tandem repeat (STR) typing method is developed for forensic identification of individual cells. In our strategy, monodisperse 1.5 nL agarose-in-oil droplets are produced with a high frequency using a microfluidic droplet generator. Statistically dilute single cells, along with primer-functionalized microbeads, are randomly compartmentalized in the droplets. Massively parallel single-cell droplet polymerase chain reaction (PCR) is performed to transfer replicas of desired STR targets from the single-cell genomic DNA onto the coencapsulated microbeads. These DNA-conjugated beads are subsequently harvested and reamplified under statistically dilute conditions for conventional capillary electrophoresis (CE) STR fragment size analysis. The 9-plex STR profiles of single cells from both pure and mixed populations of GM09947 and GM09948 human lymphoid cells show that all alleles are correctly called and allelic drop-in/drop-out is not observed. The cell mixture study exhibits a good linear relationship between the observed and input cell ratios in the range of 1:1 to 10:1. Additionally, the STR profile of GM09947 cells could be deduced even in the presence of a high concentration of cell-free contaminating 9948 genomic DNA. Our method will be valuable for the STR analysis of samples containing mixtures of cells/DNA from multiple contributors and for low-concentration samples.
Genome-Wide Analyses and Functional Classification of Proline Repeat-Rich Proteins: Potential Role of eIF5A in Eukaryotic Evolution

PubMed Central

Mandal, Ajeet; Mandal, Swati; Park, Myung Hee

2014-01-01

The eukaryotic translation factor, eIF5A has been recently reported as a sequence-specific elongation factor that facilitates peptide bond formation at consecutive prolines in Saccharomyces cerevisiae, as its ortholog elongation factor P (EF-P) does in bacteria. We have searched the genome databases of 35 representative organisms from six kingdoms of life for PPP (Pro-Pro-Pro) and/or PPG (Pro-Pro-Gly)-encoding genes whose expression is expected to depend on eIF5A. We have made detailed analyses of proteome data of 5 selected species, Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster, Mus musculus and Homo sapiens. The PPP and PPG motifs are low in the prokaryotic proteomes. However, their frequencies markedly increase with the biological complexity of eukaryotic organisms, and are higher in newly derived proteins than in those orthologous proteins commonly shared in all species. Ontology classifications of S. cerevisiae and human genes encoding the highest level of polyprolines reveal their strong association with several specific biological processes, including actin/cytoskeletal associated functions, RNA splicing/turnover, DNA binding/transcription and cell signaling. Previously reported phenotypic defects in actin polarity and mRNA decay of eIF5A mutant strains are consistent with the proposed role for eIF5A in the translation of the polyproline-containing proteins. Of all the amino acid tandem repeats (≥3 amino acids), only the proline repeat frequency correlates with functional complexity of the five organisms examined. Taken together, these findings suggest the importance of proline repeat-rich proteins and a potential role for eIF5A and its hypusine modification pathway in the course of eukaryotic evolution. PMID:25364902
The Effective Mutation Rate at Y Chromosome Short Tandem Repeats, with Application to Human Population-Divergence Time

PubMed Central

Zhivotovsky, Lev A.; Underhill, Peter A.; Cinnioğlu, Cengiz; Kayser, Manfred; Morar, Bharti; Kivisild, Toomas; Scozzari, Rosaria; Cruciani, Fulvio; Destro-Bisol, Giovanni; Spedini, Gabriella; Chambers, Geoffrey K.; Herrera, Rene J.; Yong, Kiau Kiun; Gresham, David; Tournev, Ivailo; Feldman, Marcus W.; Kalaydjieva, Luba

2004-01-01

We estimate an effective mutation rate at an average Y chromosome short-tandem repeat locus as 6.9×10-4 per 25 years, with a standard deviation across loci of 5.7×10-4, using data on microsatellite variation within Y chromosome haplogroups defined by unique-event polymorphisms in populations with documented short-term histories, as well as comparative data on worldwide populations at both the Y chromosome and various autosomal loci. This value is used to estimate the times of the African Bantu expansion, the divergence of Polynesian populations (the Maoris, Cook Islanders, and Samoans), and the origin of Gypsy populations from Bulgaria. PMID:14691732
Direct Observation of Parallel Folding Pathways Revealed Using a Symmetric Repeat Protein System

PubMed Central

Aksel, Tural; Barrick, Doug

2014-01-01

Although progress has been made to determine the native fold of a polypeptide from its primary structure, the diversity of pathways that connect the unfolded and folded states has not been adequately explored. Theoretical and computational studies predict that proteins fold through parallel pathways on funneled energy landscapes, although experimental detection of pathway diversity has been challenging. Here, we exploit the high translational symmetry and the direct length variation afforded by linear repeat proteins to directly detect folding through parallel pathways. By comparing folding rates of consensus ankyrin repeat proteins (CARPs), we find a clear increase in folding rates with increasing size and repeat number, although the size of the transition states (estimated from denaturant sensitivity) remains unchanged. The increase in folding rate with chain length, as opposed to a decrease expected from typical models for globular proteins, is a clear demonstration of parallel pathways. This conclusion is not dependent on extensive curve-fitting or structural perturbation of protein structure. By globally fitting a simple parallel-Ising pathway model, we have directly measured nucleation and propagation rates in protein folding, and have quantified the fluxes along each path, providing a detailed energy landscape for folding. This finding of parallel pathways differs from results from kinetic studies of repeat-proteins composed of sequence-variable repeats, where modest repeat-to-repeat energy variation coalesces folding into a single, dominant channel. Thus, for globular proteins, which have much higher variation in local structure and topology, parallel pathways are expected to be the exception rather than the rule. PMID:24988356
Solution properties of the archaeal CRISPR DNA repeat-binding homeodomain protein Cbp2

PubMed Central

Kenchappa, Chandra S.; Heidarsson, Pétur O.; Kragelund, Birthe B.; Garrett, Roger A.; Poulsen, Flemming M.

2013-01-01

Clustered regularly interspaced short palindromic repeats (CRISPR) form the basis of diverse adaptive immune systems directed primarily against invading genetic elements of archaea and bacteria. Cbp1 of the crenarchaeal thermoacidophilic order Sulfolobales, carrying three imperfect repeats, binds specifically to CRISPR DNA repeats and has been implicated in facilitating production of long transcripts from CRISPR loci. Here, a second related class of CRISPR DNA repeat-binding protein, denoted Cbp2, is characterized that contains two imperfect repeats and is found amongst members of the crenarchaeal thermoneutrophilic order Desulfurococcales. DNA repeat-binding properties of the Hyperthermus butylicus protein Cbp2Hb were characterized and its three-dimensional structure was determined by NMR spectroscopy. The two repeats generate helix-turn-helix structures separated by a basic linker that is implicated in facilitating high affinity DNA binding of Cbp2 by tethering the two domains. Structural studies on mutant proteins provide support for Cys7 and Cys28 enhancing high thermal stability of Cbp2Hb through disulphide bridge formation. Consistent with their proposed CRISPR transcriptional regulatory role, Cbp2Hb and, by inference, other Cbp1 and Cbp2 proteins are closely related in structure to homeodomain proteins with linked helix-turn-helix (HTH) domains, in particular the paired domain Pax and Myb family proteins that are involved in eukaryal transcriptional regulation. PMID:23325851
Origin of a folded repeat protein from an intrinsically disordered ancestor

PubMed Central

Zhu, Hongbo; Sepulveda, Edgardo; Hartmann, Marcus D; Kogenaru, Manjunatha; Ursinus, Astrid; Sulz, Eva; Albrecht, Reinhard; Coles, Murray; Martin, Jörg; Lupas, Andrei N

2016-01-01

Repetitive proteins are thought to have arisen through the amplification of subdomain-sized peptides. Many of these originated in a non-repetitive context as cofactors of RNA-based replication and catalysis, and required the RNA to assume their active conformation. In search of the origins of one of the most widespread repeat protein families, the tetratricopeptide repeat (TPR), we identified several potential homologs of its repeated helical hairpin in non-repetitive proteins, including the putatively ancient ribosomal protein S20 (RPS20), which only becomes structured in the context of the ribosome. We evaluated the ability of the RPS20 hairpin to form a TPR fold by amplification and obtained structures identical to natural TPRs for variants with 2–5 point mutations per repeat. The mutations were neutral in the parent organism, suggesting that they could have been sampled in the course of evolution. TPRs could thus have plausibly arisen by amplification from an ancestral helical hairpin. DOI: http://dx.doi.org/10.7554/eLife.16761.001 PMID:27623012
Diversity and Plasticity of the Intracellular Plant Pathogen and Insect Symbiont “Candidatus Liberibacter asiaticus” as Revealed by Hypervariable Prophage Genes with Intragenic Tandem Repeats ▿ †

PubMed Central

Zhou, Lijuan; Powell, Charles A.; Hoffman, Michele T.; Li, Wenbin; Fan, Guocheng; Liu, Bo; Lin, Hong; Duan, Yongping

2011-01-01

“Candidatus Liberibacter asiaticus” is a psyllid-transmitted, phloem-limited alphaproteobacterium and the most prevalent species of “Ca. Liberibacter” associated with a devastating worldwide citrus disease known as huanglongbing (HLB). Two related and hypervariable genes (hyvI and hyvII) were identified in the prophage regions of the Psy62 “Ca. Liberibacter asiaticus” genome. Sequence analyses of the hyvI and hyvII genes in 35 “Ca. Liberibacter asiaticus” DNA isolates collected globally revealed that the hyvI gene contains up to 12 nearly identical tandem repeats (NITRs, 132 bp) and 4 partial repeats, while hyvII contains up to 2 NITRs and 4 partial repeats and shares homology with hyvI. Frequent deletions or insertions of these repeats within the hyvI and hyvII genes were observed, none of which disrupted the open reading frames. Sequence conservation within the individual repeats but an extensive variation in repeat numbers, rearrangement, and the sequences flanking the repeat region indicate the diversity and plasticity of “Ca. Liberibacter asiaticus” bacterial populations in the world. These differences were found not only in samples of distinct geographical origins but also in samples from a single origin and even from a single “Ca. Liberibacter asiaticus”-infected sample. This is the first evidence of different “Ca. Liberibacter asiaticus” populations coexisting in a single HLB-affected sample. The Florida “Ca. Liberibacter asiaticus” isolates contain both hyvI and hyvII, while all other global “Ca. Liberibacter asiaticus” isolates contain either one or the other. Interclade assignments of the putative HyvI and HyvII proteins from Florida isolates with other global isolates in phylogenetic trees imply multiple “Ca. Liberibacter asiaticus” populations in the world and a multisource introduction of the “Ca. Liberibacter asiaticus” bacterium into Florida. PMID:21784907
Highly Discriminatory Variable-Number Tandem-Repeat Markers for Genotyping of Trichophyton interdigitale Strains

PubMed Central

Drira, Ines; Hadrich, Ines; Neji, Sourour; Mahfouth, Nedia; Trabelsi, Houaida; Sellami, Hayet; Makni, Fattouma

2014-01-01

Trichophyton interdigitale is the second most frequent cause of superficial fungal infections of various parts of the human body. Studying the population structure and genotype differentiation of T. interdigitale strains may lead to significant improvements in clinical practice. The present study aimed to develop and select suitable variable-number tandem-repeat (VNTR) markers for 92 clinical strains of T. interdigitale. On the basis of an analysis of four VNTR markers, four to eight distinct alleles were detected for each marker. The marker with the highest discriminatory power had eight alleles and a D value of 0.802. The combination of all four markers yielded a D value of 0.969 with 29 distinct multilocus genotypes. VNTR typing revealed the genetic diversity of the strains, identifying three populations according to their colonization sites. A correlation between phenotypic characteristics and multilocus genotypes was observed. Seven patients harbored T. interdigitale strains with different genotypes. Typing of clinical T. interdigitale samples by VNTR markers displayed excellent discriminatory power and 100% reproducibility. PMID:24989614
Tandem SUMO fusion vectors for improving soluble protein expression and purification.

PubMed

Guerrero, Fernando; Ciragan, Annika; Iwaï, Hideo

2015-12-01

Availability of highly purified proteins in quantity is crucial for detailed biochemical and structural investigations. Fusion tags are versatile tools to facilitate efficient protein purification and to improve soluble overexpression of proteins. Various purification and fusion tags have been widely used for overexpression in Escherichia coli. However, these tags might interfere with biological functions and/or structural investigations of the protein of interest. Therefore, an additional purification step to remove fusion tags by proteolytic digestion might be required. Here, we describe a set of new vectors in which yeast SUMO (SMT3) was used as the highly specific recognition sequence of ubiquitin-like protease 1, together with other commonly used solubility enhancing proteins, such as glutathione S-transferase, maltose binding protein, thioredoxin and trigger factor for optimizing soluble expression of protein of interest. This tandem SUMO (T-SUMO) fusion system was tested for soluble expression of the C-terminal domain of TonB from different organisms and for the antiviral protein scytovirin. Copyright © 2015 Elsevier Inc. All rights reserved.
Multiple intermediates on the energy landscape of a 15-HEAT-repeat protein

PubMed Central

Tsytlonok, Maksym; Craig, Patricio O.; Sivertsson, Elin; Serquera, David; Perrett, Sarah; Best, Robert B.; Wolynes, Peter G.; Itzhaki, Laura S.

2014-01-01

Repeat proteins are a special class of modular, non-globular proteins composed of small structural motifs arrayed to form elongated architectures and stabilised solely by short-range contacts. We find a remarkable complexity in the unfolding of the large HEAT repeat protein PR65/A. In contrast to what has been seen for small repeat proteins in which unfolding propagates from one end, the HEAT array of PR65/A ruptures at multiple distant sites, leading to intermediate states with non-contiguous folded subdomains. Kinetic analysis allows us to define a network of intermediates and to delineate the pathways that connect them. There is a dominant sequence of unfolding, reflecting a non-uniform distribution of stability across the repeat array; however the unfolding of certain intermediates is competitive, leading to parallel pathways. Theoretical models accounting for the heterogeneous contact density in the folded structure are able to rationalize the variation in stability across the array. This variation in stability also suggests how folding may direct function in a large repeat protein: The stability distribution enables certain regions to present rigid motifs for molecular recognition while affording others flexibility to broaden the search area as in a fly-casting mechanism. Thus PR65/A uses the two ends of the repeat array to bind diverse partners and thereby coordinate the dephosphorylation of many different substrates and of multiple sites within hyperphosphorylated substrates. PMID:24120762
Analysis of an "off-ladder" allele at the Penta D short tandem repeat locus.

PubMed

Yang, Y L; Wang, J G; Wang, D X; Zhang, W Y; Liu, X J; Cao, J; Yang, S L

2015-11-25

Kinship testing of a father and his son from Guangxi, China, the location of the Zhuang minority people, was performed using the PowerPlex® 18D System with a short tandem repeat typing kit. The results indicated that both the father and his son had an off-ladder allele at the Penta D locus, with a genetic size larger than that of the maximal standard allelic ladder. To further identify this locus, monogenic amplification, gene cloning, and genetic sequencing were performed. Sequencing analysis demonstrated that the fragment size of the Penta D-OL locus was 469 bp and the core sequence was [AAAGA]21, also called Penta D-21. The rare Penta D-21 allele was found to be distributed among the Zhuang population from the Guangxi Zhuang Autonomous Region of China; therefore, this study improved the range of DNA data available for this locus and enhanced our ability for individual identification of gene loci.
Functional insights from the distribution and role of homopeptide repeat-containing proteins

PubMed Central

Faux, Noel G.; Bottomley, Stephen P.; Lesk, Arthur M.; Irving, James A.; Morrison, John R.; de la Banda, Maria Garcia; Whisstock, James C.

2005-01-01

Expansion of “low complex” repeats of amino acids such as glutamine (Poly-Q) is associated with protein misfolding and the development of degenerative diseases such as Huntington's disease. The mechanism by which such regions promote misfolding remains controversial, the function of many repeat-containing proteins (RCPs) remains obscure, and the role (if any) of repeat regions remains to be determined. Here, a Web-accessible database of RCPs is presented. The distribution and evolution of RCPs that contain homopeptide repeats tracts are considered, and the existence of functional patterns investigated. Generally, it is found that while polyamino acid repeats are extremely rare in prokaryotes, several eukaryote putative homologs of prokaryote RCP—involved in important housekeeping processes—retain the repetitive region, suggesting an ancient origin for certain repeats. Within eukarya, the most common uninterrupted amino acid repeats are glutamine, asparagines, and alanine. Interestingly, while poly-Q repeats are found in vertebrates and nonvertebrates, poly-N repeats are only common in more primitive nonvertebrate organisms, such as insects and nematodes. We have assigned function to eukaryote RCPs using Online Mendelian Inheritance in Man (OMIM), the Human Reference Protein Database (HRPD), FlyBase, and Wormpep. Prokaryote RCPs were annotated using BLASTp searches and Gene Ontology. These data reveal that the majority of RCPs are involved in processes that require the assembly of large, multiprotein complexes, such as transcription and signaling. PMID:15805494
Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

PubMed

Prajapati, Surendra K; Joshi, Hema; Carlton, Jane M; Rizvi, M Alam

2013-01-01

The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75) from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years) and long-term population history (79,235 to 104,008) of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes) to that inferred from mitochondrial genome diversity.
Determination of Sources of Escherichia coli on Beef by Multiple-Locus Variable-Number Tandem Repeat Analysis.

PubMed

Yang, Xianqin; Tran, Frances; Youssef, Mohamed K; Gill, Colin O

2015-07-01

The possible origin of Escherichia coli found on cuts and trimmings in the breaking facility of a beef packing plant was examined using multiple-locus variable-number tandem repeat analysis. Coliforms and E. coli were enumerated in samples obtained from 160 carcasses that would enter the breaking facility when work commenced and after each of the three production breaks throughout the day, from the conveyor belt before work and after each break, and from cuts and trimmings when work commenced and after each break. Most samples yielded no E. coli, irrespective of the surface types. E. coli was recovered from 7 (<5%) carcasses, at numbers mostly ≤1.0 log CFU/160,000 cm(2). The log total numbers of E. coli recovered from the conveyor belt, cuts, and trimmings were mostly between 1 and 2 log CFU/80,000 cm(2). A total of 554 E. coli isolates were recovered. Multiple-locus variable-number tandem repeat analysis of 327 selected isolates identified 80 distinct genotypes, with 37 (46%) each containing one isolate. However, 28% of the isolates were of genotypes that were recovered from more than one sampling day. Of the 80 genotypes, 65 and 2% were found in one or all four sampling periods throughout the day. However, they represented 23 and 14% of the isolates, respectively. Of the genotypes identified for each surface type, at least one contained ≥9 isolates. No unique genotypes were associated with carcasses, but 10, 17, and 19 were uniquely associated with cuts, trimmings, and the belt, respectively. Of the isolates recovered from cuts, 49, 3, and 19% were of genotypes that were found among isolates recovered from the belt, carcasses, or both the belt and carcasses, respectively. A similar composition was found for isolates recovered from trimmings. These findings show that the E. coli found on cuts and trimmings at this beef packing plant mainly originated from the conveyor belt and that small number of E. coli strains survived the daily cleaning and sanitation
Tandem repeats of the 5' non-transcribed spacer of Tetrahymena rDNA function as high copy number autonomous replicons in the macronucleus but do not prevent rRNA gene dosage regulation.

PubMed Central

Pan, W J; Blackburn, E H

1995-01-01

The rRNA genes in the somatic macronucleus of Tetrahymena thermophila are normally on 21 kb linear palindromic molecules (rDNA). We examined the effect on rRNA gene dosage of transforming T.thermophila macronuclei with plasmid constructs containing a pair of tandemly repeated rDNA replication origin regions unlinked to the rRNA gene. A significant proportion of the plasmid sequences were maintained as high copy circular molecules, eventually consisting solely of tandem arrays of origin regions. As reported previously for cells transformed by a construct in which the same tandem rDNA origins were linked to the rRNA gene [Yu, G.-L. and Blackburn, E. H. (1990) Mol. Cell. Biol., 10, 2070-2080], origin sequences recombined to form linear molecules bearing several tandem repeats of the origin region, as well as rRNA genes. The total number of rDNA origin sequences eventually exceeded rRNA gene copies by approximately 20- to 40-fold and the number of circular replicons carrying only rDNA origin sequences exceeded rRNA gene copies by 2- to 3-fold. However, the rRNA gene dosage was unchanged. Hence, simply monitoring the total number of rDNA origin regions is not sufficient to regulate rRNA gene copy number. Images PMID:7784211
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats

PubMed Central

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-01-01

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363
Application of multilocus variable number tandem repeat analysis to monitor Verocytotoxin-producing Escherichia coli O157 phage type 8 in England and Wales: emergence of a profile associated with a national outbreak.

PubMed

Perry, N; Cheasty, T; Dallman, T; Launders, N; Willshaw, G

2013-10-01

Evaluation of multilocus variable number tandem repeat analysis (MLVA) to subtype all isolates of Vero cytotoxin-producing Escherichia coli O157 phage type 8 in England and Wales. Over a 13 month period from December 2010, 483 isolates of VTEC O157 PT8 were tested by MLVA; 39% were received in the first 4 months of 2011, when infections are generally low. One profile, or single locus variants of it, was present in 249 (52%) isolates but was not common previously. These cases represented a national increase in PT8, associated epidemiologically with soil-contaminated vegetables. Most of the 177 other MLVA profiles were unique to a single isolate. Profiles shared by >1 isolate included cases from two small community, food-borne outbreaks and 11 households. Several shared profiles were found among 23 isolates without known links. Apart from one group, isolates linked to travel abroad had very diverse profiles. Multilocus variable number tandem repeat analysis discriminated apparent sporadic isolates of the same PT and assisted in detection of cases in an emerging national outbreak. Multilocus variable number tandem repeat analysis is an epidemiologically valid complement to surveillance and applicable as a rapid, practical test for large numbers of isolates. © 2013 The Society for Applied Microbiology.
Tandem Fusion of Hepatitis B Core Antigen Allows Assembly of Virus-Like Particles in Bacteria and Plants with Enhanced Capacity to Accommodate Foreign Proteins

PubMed Central

Peyret, Hadrien; Gehin, Annick; Thuenemann, Eva C.; Blond, Donatienne; El Turabi, Aadil; Beales, Lucy; Clarke, Dean; Gilbert, Robert J. C.; Fry, Elizabeth E.; Stuart, David I.; Holmes, Kris; Stonehouse, Nicola J.; Whelan, Mike; Rosenberg, William; Lomonossoff, George P.; Rowlands, David J.

2015-01-01

The core protein of the hepatitis B virus, HBcAg, assembles into highly immunogenic virus-like particles (HBc VLPs) when expressed in a variety of heterologous systems. Specifically, the major insertion region (MIR) on the HBcAg protein allows the insertion of foreign sequences, which are then exposed on the tips of surface spike structures on the outside of the assembled particle. Here, we present a novel strategy which aids the display of whole proteins on the surface of HBc particles. This strategy, named tandem core, is based on the production of the HBcAg dimer as a single polypeptide chain by tandem fusion of two HBcAg open reading frames. This allows the insertion of large heterologous sequences in only one of the two MIRs in each spike, without compromising VLP formation. We present the use of tandem core technology in both plant and bacterial expression systems. The results show that tandem core particles can be produced with unmodified MIRs, or with one MIR in each tandem dimer modified to contain the entire sequence of GFP or of a camelid nanobody. Both inserted proteins are correctly folded and the nanobody fused to the surface of the tandem core particle (which we name tandibody) retains the ability to bind to its cognate antigen. This technology paves the way for the display of natively folded proteins on the surface of HBc particles either through direct fusion or through non-covalent attachment via a nanobody. PMID:25830365
Concerted evolution of the tandemly repeated genes encoding primate U2 small nuclear RNA (the RNU2 locus) does not prevent rapid diversification of the (CT){sub n} {center_dot} (GA){sub n} microsatellite embedded within the U2 repeat unit

DOE Office of Scientific and Technical Information (OSTI.GOV)

Liao, D.; Weiner, A.M.

1995-12-10

The RNU2 locus encoding human U2 small nuclear RNA (snRNA) is organized as a nearly perfect tandem array containing 5 to 22 copies of a 5.8-kb repeat unit. Just downstream of the U2 snRNA gene in each 5.8-kb repeat unit lies a large (CT){sub n}{center_dot}(GA){sub n} dinucleotide repeat (n {approx} 70). This form of genomic organization, in which one repeat is embedded within another, provides an unusual opportunity to study the balance of forces maintaining the homogeneity of both kinds of repeats. Using a combination of field inversion gel electrophoresis and polymerase chain reaction, we have been able to studymore » the CT microsatellites within individual U2 tandem arrays. We find that the CT microsatellites within an RNU2 allele exhibit significant length polymorphism, despite the remarkable homogeneity of the surrounding U2 repeat units. Length polymorphism is due primarily to loss or gain of CT dinucleotide repeats, but other types of deletions, insertions, and substitutions are also frequent. Polymorphism is greatly reduced in regions where pure (CT){sub n} tracts are interrupted by occasional G residues, suggesting that irregularities stabilize both the length and the sequence of the dinucleotide repeat. We further show that the RNU2 loci of other catarrhine primates (gorilla, chimpanzee, ogangutan, and baboon) contain orthologous CT microsatellites; these also exhibit length polymorphism, but are highly divergent from each other. Thus, although the CT microsatellite is evolving far more rapidly than the rest of the U2 repeat unit, it has persisted through multiple speciation events spanning >35 Myr. The persistence of the CT microsatellite, despite polymorphism and rapid evolution, suggests that it might play a functional role in concerted evolution of the RNU2 loci, perhaps as an initiation site for recombination and/or gene conversion. 70 refs., 5 figs.« less

Slipped-strand mispairing at noncontiguous repeats in Poecilia reticulata: a model for minisatellite birth.

PubMed Central

Taylor, J S; Breden, F

2000-01-01

The standard slipped-strand mispairing (SSM) model for the formation of variable number tandem repeats (VNTRs) proposes that a few tandem repeats, produced by chance mutations, provide the "raw material" for VNTR expansion. However, this model is unlikely to explain the formation of VNTRs with long motifs (e.g., minisatellites), because the likelihood of a tandem repeat forming by chance decreases rapidly as the length of the repeat motif increases. Phylogenetic reconstruction of the birth of a mitochondrial (mt) DNA minisatellite in guppies suggests that VNTRs with long motifs can form as a consequence of SSM at noncontiguous repeats. VNTRs formed in this manner have motifs longer than the noncontiguous repeat originally formed by chance and are flanked by one unit of the original, noncontiguous repeat. SSM at noncontiguous repeats can therefore explain the birth of VNTRs with long motifs and the "imperfect" or "short direct" repeats frequently observed adjacent to both mtDNA and nuclear VNTRs. PMID:10880490
Maximal oxygen uptake is associated with allele -202 A of insulin-like growth factor binding protein-3 (IGFBP3) promoter polymorphism and (CA)n tandem repeats of insulin-like growth factor IGF1 in Caucasians from Poland.

PubMed

Gronek, Piotr; Holdys, Joanna; Kryściak, Jakub; Wieliński, Dariusz; Słomski, Ryszard

2014-01-01

Physical fitness is a trait determined by multiple genes, and its genetic basis is modified by numerous environmental factors. The present study examines the effects of the (CA)n tandem repeats polymorphism in IGFI gene and SNP Alw21I restriction site -202 A>C polymorphism in IGF1BP3 on VO2max--a physiological index of aerobic capacity of high heritability. The study sample consisted of 239 (154 male and 85 female) students of the University School of Physical Education in Poznań and athletes practicing various sports, including members of the Polish national team. An association was found between -202 A/C polymorphism of IGFBP3 gene with VO2max in men. Higher VO2max values were attained by men with CC genotype, especially male athletes practicing endurance sports and sports featuring energy metabolism of aerobic/anaerobic character. A statistically significant influence of allele 188 and genotype 188/188 of tandem repeats (CA)n polymorphism of IGF1 gene on VO2max was found in women. Also, lower values of maximal oxygen uptake were noted in individuals with allele 186 or genotype 186/186, and higher VO2max values in athletes with allele 194.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

PubMed

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-11-16

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
A novel signal transduction protein: Combination of solute binding and tandem PAS-like sensor domains in one polypeptide chain: Periplasmic Ligand Binding Protein Dret_0059

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wu, R.; Wilton, R.; Cuff, M. E.

We report the structural and biochemical characterization of a novel periplasmic ligand-binding protein, Dret_0059, from Desulfohalobium retbaense DSM 5692, an organism isolated from the Salt Lake Retba in Senegal. The structure of the protein consists of a unique combination of a periplasmic solute binding protein (SBP) domain at the N-terminal and a tandem PAS-like sensor domain at the C-terminal region. SBP domains are found ubiquitously and their best known function is in solute transport across membranes. PAS-like sensor domains are commonly found in signal transduction proteins. These domains are widely observed as parts of many protein architectures and complexes butmore » have not been observed previously within the same polypeptide chain. In the structure of Dret_0059, a ketoleucine moiety is bound to the SBP, whereas a cytosine molecule is bound in the distal PAS-like domain of the tandem PAS-like domain. Differential scanning flourimetry support the binding of ligands observed in the crystal structure. There is significant interaction between the SBP and tandem PAS-like domains, and it is possible that the binding of one ligand could have an effect on the binding of the other. We uncovered three other proteins with this structural architecture in the non-redundant sequence data base, and predict that they too bind the same substrates. The genomic context of this protein did not offer any clues for its function. We did not find any biological process in which the two observed ligands are coupled. The protein Dret_0059 could be involved in either signal transduction or solute transport.« less
Evaluation of advanced multiplex short tandem repeat systems in pairwise kinship analysis.

PubMed

Tamura, Tomonori; Osawa, Motoki; Ochiai, Eriko; Suzuki, Takanori; Nakamura, Takashi

2015-09-01

The AmpFLSTR Identifiler Kit, comprising 15 autosomal short tandem repeat (STR) loci, is commonly employed in forensic practice for calculating match probabilities and parentage testing. The conventional system exhibits insufficient estimation for kinship analysis such as sibship testing because of shortness of examined loci. This study evaluated the power of the PowerPlex Fusion System, GlobalFiler Kit, and PowerPlex 21 System, which comprise more than 20 autosomal STR loci, to estimate pairwise blood relatedness (i.e., parent-child, full siblings, second-degree relatives, and first cousins). The genotypes of all 24 STR loci in 10,000 putative pedigrees were constructed by simulation. The likelihood ratio for each locus was calculated from joint probabilities for relatives and non-relatives. The combined likelihood ratio was calculated according to the product rule. The addition of STR loci improved separation between relatives and non-relatives. However, these systems were less effectively extended to the inference for first cousins. In conclusion, these advanced systems will be useful in forensic personal identification, especially in the evaluation of full siblings and second-degree relatives. Moreover, the additional loci may give rise to two major issues of more frequent mutational events and several pairs of linked loci on the same chromosome. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Multiple-locus variable-number tandem repeat analysis for strain discrimination of non-O157 Shiga toxin-producing Escherichia coli.

PubMed

Timmons, Chris; Trees, Eija; Ribot, Efrain M; Gerner-Smidt, Peter; LaFon, Patti; Im, Sung; Ma, Li Maria

2016-06-01

Non-O157 Shiga toxin-producing Escherichia coli (STEC) are foodborne pathogens of growing concern worldwide that have been associated with several recent multistate and multinational outbreaks of foodborne illness. Rapid and sensitive molecular-based bacterial strain discrimination methods are critical for timely outbreak identification and contaminated food source traceback. One such method, multiple-locus variable-number tandem repeat analysis (MLVA), is being used with increasing frequency in foodborne illness outbreak investigations to augment the current gold standard bacterial subtyping technique, pulsed-field gel electrophoresis (PFGE). The objective of this study was to develop a MLVA assay for intra- and inter-serogroup discrimination of six major non-O157 STEC serogroups-O26, O111, O103, O121, O45, and O145-and perform a preliminary internal validation of the method on a limited number of clinical isolates. The resultant MLVA scheme consists of ten variable number tandem repeat (VNTR) loci amplified in three multiplex PCR reactions. Sixty-five unique MLVA types were obtained among 84 clinical non-O157 STEC strains comprised of geographically diverse sporadic and outbreak related isolates. Compared to PFGE, the developed MLVA scheme allowed similar discrimination among serogroups O26, O111, O103, and O121 but not among O145 and O45. To more fully compare the discriminatory power of this preliminary MLVA method to PFGE and to determine its epidemiological congruence, a thorough internal and external validation needs to be performed on a carefully selected large panel of strains, including multiple isolates from single outbreaks. Copyright © 2016. Published by Elsevier B.V.
Sel1-like repeat proteins in signal transduction.

PubMed

Mittl, Peer R E; Schneider-Brachert, Wulf

2007-01-01

Solenoid proteins, which are distinguished from general globular proteins by their modular architectures, are frequently involved in signal transduction pathways. Proteins from the tetratricopeptide repeat (TPR) and Sel1-like repeat (SLR) families share similar alpha-helical conformations but different consensus sequence lengths and superhelical topologies. Both families are characterized by low sequence similarity levels, rendering the identification of functional homologous difficult. Therefore current knowledge of the molecular and cellular functions of the SLR proteins Sel1, Hrd3, Chs4, Nif1, PodJ, ExoR, AlgK, HcpA, Hsp12, EnhC, LpnE, MotX, and MerG has been reviewed. Although SLR proteins possess different cellular functions they all seem to serve as adaptor proteins for the assembly of macromolecular complexes. Sel1, Hrd3, Hsp12 and LpnE are activated under cellular stress. The eukaryotic Sel1 and Hrd3 proteins are involved in the ER-associated protein degradation, whereas the bacterial LpnE, EnhC, HcpA, ExoR, and AlgK proteins mediate the interactions between bacterial and eukaryotic host cells. LpnE and EnhC are responsible for the entry of L. pneumophila into epithelial cells and macrophages. ExoR from the symbiotic microorganism S. melioti and AlgK from the pathogen P. aeruginosa regulate exopolysaccaride synthesis. Nif1 and Chs4 from yeast are responsible for the regulation of mitosis and septum formation during cell division, respectively, and PodJ guides the cellular differentiation during the cell cycle of the bacterium C. crescentus. Taken together the SLR motif establishes a link between signal transduction pathways from eukaryotes and bacteria. The SLR motif is so far absent from archaea. Therefore the SLR could have developed in the last common ancestor between eukaryotes and bacteria.
Incomplete proteasomal degradation of green fluorescent proteins in the context of tandem fluorescent protein timers

PubMed Central

Khmelinskii, Anton; Meurer, Matthias; Ho, Chi-Ting; Besenbeck, Birgit; Füller, Julia; Lemberg, Marius K.; Bukau, Bernd; Mogk, Axel; Knop, Michael

2016-01-01

Tandem fluorescent protein timers (tFTs) report on protein age through time-dependent change in color, which can be exploited to study protein turnover and trafficking. Each tFT, composed of two fluorescent proteins (FPs) that differ in maturation kinetics, is suited to follow protein dynamics within a specific time range determined by the maturation rates of both FPs. So far, tFTs have been constructed by combining slower-maturing red fluorescent proteins (redFPs) with the faster-maturing superfolder green fluorescent protein (sfGFP). Toward a comprehensive characterization of tFTs, we compare here tFTs composed of different faster-maturing green fluorescent proteins (greenFPs) while keeping the slower-maturing redFP constant (mCherry). Our results indicate that the greenFP maturation kinetics influences the time range of a tFT. Moreover, we observe that commonly used greenFPs can partially withstand proteasomal degradation due to the stability of the FP fold, which results in accumulation of tFT fragments in the cell. Depending on the order of FPs in the timer, incomplete proteasomal degradation either shifts the time range of the tFT toward slower time scales or precludes its use for measurements of protein turnover. We identify greenFPs that are efficiently degraded by the proteasome and provide simple guidelines for the design of new tFTs. PMID:26609072
NIST mixed stain study 3: signal intensity balance in commercial short tandem repeat multiplexes.

PubMed

Duewer, David L; Kline, Margaret C; Redman, Janette W; Butler, John M

2004-12-01

Short-tandem repeat (STR) allelic intensities were collected from more than 60 forensic laboratories for a suite of seven samples as part of the National Institute of Standards and Technology-coordinated 2001 Mixed Stain Study 3 (MSS3). These interlaboratory challenge data illuminate the relative importance of intrinsic and user-determined factors affecting the locus-to-locus balance of signal intensities for currently used STR multiplexes. To varying degrees, seven of the eight commercially produced multiplexes used by MSS3 participants displayed very similar patterns of intensity differences among the different loci probed by the multiplexes for all samples, in the hands of multiple analysts, with a variety of supplies and instruments. These systematic differences reflect intrinsic properties of the individual multiplexes, not user-controllable measurement practices. To the extent that quality systems specify minimum and maximum absolute intensities for data acceptability and data interpretation schema require among-locus balance, these intrinsic intensity differences may decrease the utility of multiplex results and surely increase the cost of analysis.
Lymphatic filarial species differentiation using evolutionarily modified tandem repeats: generation of new genetic markers.

PubMed

Sakthidevi, Moorthy; Murugan, Vadivel; Hoti, Sugeerappa Laxmanappa; Kaliraj, Perumal

2010-05-01

Polymerase chain reaction based methods are promising tools for the monitoring and evaluation of the Global Program for the Elimination of Lymphatic Filariasis. The currently available PCR methods do not differentiate the DNA of Wuchereria bancrofti or Brugia malayi by a single PCR and hence are cumbersome. Therefore, we designed a single step PCR strategy for differentiating Bancroftian infection from Brugian infection based on a newly identified gene from the W. bancrofti genome, abundant larval transcript-2 (alt-2), which is abundantly expressed. The difference in PCR product sizes generated from the presence or absence of evolutionarily altered tandem repeats in alt-2 intron-3 differentiated W. bancrofti from B. malayi. The analysis was performed on the genomic DNA of microfilariae from a number of patient blood samples or microfilariae positive slides from different Indian geographical regions. The assay gave consistent results, differentiating the two filarial parasite species accurately. This alt-2 intron-3 based PCR assay can be a potential tool for the diagnosis and differentiation of co-infections by lymphatic filarial parasites. Copyright (c) 2010 Elsevier B.V. All rights reserved.
MSH3 polymorphisms and protein levels affect CAG repeat instability in Huntington's disease mice.

PubMed

Tomé, Stéphanie; Manley, Kevin; Simard, Jodie P; Clark, Greg W; Slean, Meghan M; Swami, Meera; Shelbourne, Peggy F; Tillier, Elisabeth R M; Monckton, Darren G; Messer, Anne; Pearson, Christopher E

2013-01-01

Expansions of trinucleotide CAG/CTG repeats in somatic tissues are thought to contribute to ongoing disease progression through an affected individual's life with Huntington's disease or myotonic dystrophy. Broad ranges of repeat instability arise between individuals with expanded repeats, suggesting the existence of modifiers of repeat instability. Mice with expanded CAG/CTG repeats show variable levels of instability depending upon mouse strain. However, to date the genetic modifiers underlying these differences have not been identified. We show that in liver and striatum the R6/1 Huntington's disease (HD) (CAG)∼100 transgene, when present in a congenic C57BL/6J (B6) background, incurred expansion-biased repeat mutations, whereas the repeat was stable in a congenic BALB/cByJ (CBy) background. Reciprocal congenic mice revealed the Msh3 gene as the determinant for the differences in repeat instability. Expansion bias was observed in congenic mice homozygous for the B6 Msh3 gene on a CBy background, while the CAG tract was stabilized in congenics homozygous for the CBy Msh3 gene on a B6 background. The CAG stabilization was as dramatic as genetic deficiency of Msh2. The B6 and CBy Msh3 genes had identical promoters but differed in coding regions and showed strikingly different protein levels. B6 MSH3 variant protein is highly expressed and associated with CAG expansions, while the CBy MSH3 variant protein is expressed at barely detectable levels, associating with CAG stability. The DHFR protein, which is divergently transcribed from a promoter shared by the Msh3 gene, did not show varied levels between mouse strains. Thus, naturally occurring MSH3 protein polymorphisms are modifiers of CAG repeat instability, likely through variable MSH3 protein stability. Since evidence supports that somatic CAG instability is a modifier and predictor of disease, our data are consistent with the hypothesis that variable levels of CAG instability associated with polymorphisms of
MSH3 Polymorphisms and Protein Levels Affect CAG Repeat Instability in Huntington's Disease Mice

PubMed Central

Simard, Jodie P.; Clark, Greg W.; Slean, Meghan M.; Swami, Meera; Shelbourne, Peggy F.; Tillier, Elisabeth R. M.; Monckton, Darren G.; Messer, Anne; Pearson, Christopher E.

2013-01-01

Expansions of trinucleotide CAG/CTG repeats in somatic tissues are thought to contribute to ongoing disease progression through an affected individual's life with Huntington's disease or myotonic dystrophy. Broad ranges of repeat instability arise between individuals with expanded repeats, suggesting the existence of modifiers of repeat instability. Mice with expanded CAG/CTG repeats show variable levels of instability depending upon mouse strain. However, to date the genetic modifiers underlying these differences have not been identified. We show that in liver and striatum the R6/1 Huntington's disease (HD) (CAG)∼100 transgene, when present in a congenic C57BL/6J (B6) background, incurred expansion-biased repeat mutations, whereas the repeat was stable in a congenic BALB/cByJ (CBy) background. Reciprocal congenic mice revealed the Msh3 gene as the determinant for the differences in repeat instability. Expansion bias was observed in congenic mice homozygous for the B6 Msh3 gene on a CBy background, while the CAG tract was stabilized in congenics homozygous for the CBy Msh3 gene on a B6 background. The CAG stabilization was as dramatic as genetic deficiency of Msh2. The B6 and CBy Msh3 genes had identical promoters but differed in coding regions and showed strikingly different protein levels. B6 MSH3 variant protein is highly expressed and associated with CAG expansions, while the CBy MSH3 variant protein is expressed at barely detectable levels, associating with CAG stability. The DHFR protein, which is divergently transcribed from a promoter shared by the Msh3 gene, did not show varied levels between mouse strains. Thus, naturally occurring MSH3 protein polymorphisms are modifiers of CAG repeat instability, likely through variable MSH3 protein stability. Since evidence supports that somatic CAG instability is a modifier and predictor of disease, our data are consistent with the hypothesis that variable levels of CAG instability associated with polymorphisms of
Toward Male Individualization with Rapidly Mutating Y-Chromosomal Short Tandem Repeats

PubMed Central

Ballantyne, Kaye N; Ralf, Arwin; Aboukhalid, Rachid; Achakzai, Niaz M; Anjos, Maria J; Ayub, Qasim; Balažic, Jože; Ballantyne, Jack; Ballard, David J; Berger, Burkhard; Bobillo, Cecilia; Bouabdellah, Mehdi; Burri, Helen; Capal, Tomas; Caratti, Stefano; Cárdenas, Jorge; Cartault, François; Carvalho, Elizeu F; Carvalho, Monica; Cheng, Baowen; Coble, Michael D; Comas, David; Corach, Daniel; D'Amato, Maria E; Davison, Sean; de Knijff, Peter; De Ungria, Maria Corazon A; Decorte, Ronny; Dobosz, Tadeusz; Dupuy, Berit M; Elmrghni, Samir; Gliwiński, Mateusz; Gomes, Sara C; Grol, Laurens; Haas, Cordula; Hanson, Erin; Henke, Jürgen; Henke, Lotte; Herrera-Rodríguez, Fabiola; Hill, Carolyn R; Holmlund, Gunilla; Honda, Katsuya; Immel, Uta-Dorothee; Inokuchi, Shota; Jobling, Mark A; Kaddura, Mahmoud; Kim, Jong S; Kim, Soon H; Kim, Wook; King, Turi E; Klausriegler, Eva; Kling, Daniel; Kovačević, Lejla; Kovatsi, Leda; Krajewski, Paweł; Kravchenko, Sergey; Larmuseau, Maarten H D; Lee, Eun Young; Lessig, Ruediger; Livshits, Ludmila A; Marjanović, Damir; Minarik, Marek; Mizuno, Natsuko; Moreira, Helena; Morling, Niels; Mukherjee, Meeta; Munier, Patrick; Nagaraju, Javaregowda; Neuhuber, Franz; Nie, Shengjie; Nilasitsataporn, Premlaphat; Nishi, Takeki; Oh, Hye H; Olofsson, Jill; Onofri, Valerio; Palo, Jukka U; Pamjav, Horolma; Parson, Walther; Petlach, Michal; Phillips, Christopher; Ploski, Rafal; Prasad, Samayamantri P R; Primorac, Dragan; Purnomo, Gludhug A; Purps, Josephine; Rangel-Villalobos, Hector; Rębała, Krzysztof; Rerkamnuaychoke, Budsaba; Gonzalez, Danel Rey; Robino, Carlo; Roewer, Lutz; Rosa, Alexandra; Sajantila, Antti; Sala, Andrea; Salvador, Jazelyn M; Sanz, Paula; Schmitt, Cornelia; Sharma, Anil K; Silva, Dayse A; Shin, Kyoung-Jin; Sijen, Titia; Sirker, Miriam; Siváková, Daniela; Škaro, Vedrana; Solano-Matamoros, Carlos; Souto, Luis; Stenzl, Vlastimil; Sudoyo, Herawati; Syndercombe-Court, Denise; Tagliabracci, Adriano; Taylor, Duncan; Tillmar, Andreas; Tsybovsky, Iosif S; Tyler-Smith, Chris; van der Gaag, Kristiaan J; Vanek, Daniel; Völgyi, Antónia; Ward, Denise; Willemse, Patricia; Yap, Eric PH; Yong, Rita YY; Pajnič, Irena Zupanič; Kayser, Manfred

2014-01-01

Relevant for various areas of human genetics, Y-chromosomal short tandem repeats (Y-STRs) are commonly used for testing close paternal relationships among individuals and populations, and for male lineage identification. However, even the widely used 17-loci Yfiler set cannot resolve individuals and populations completely. Here, 52 centers generated quality-controlled data of 13 rapidly mutating (RM) Y-STRs in 14,644 related and unrelated males from 111 worldwide populations. Strikingly, >99% of the 12,272 unrelated males were completely individualized. Haplotype diversity was extremely high (global: 0.9999985, regional: 0.99836–0.9999988). Haplotype sharing between populations was almost absent except for six (0.05%) of the 12,156 haplotypes. Haplotype sharing within populations was generally rare (0.8% nonunique haplotypes), significantly lower in urban (0.9%) than rural (2.1%) and highest in endogamous groups (14.3%). Analysis of molecular variance revealed 99.98% of variation within populations, 0.018% among populations within groups, and 0.002% among groups. Of the 2,372 newly and 156 previously typed male relative pairs, 29% were differentiated including 27% of the 2,378 father–son pairs. Relative to Yfiler, haplotype diversity was increased in 86% of the populations tested and overall male relative differentiation was raised by 23.5%. Our study demonstrates the value of RM Y-STRs in identifying and separating unrelated and related males and provides a reference database. PMID:24917567
GENETIC DIVERSITY OF TYPHA LATIFOLIA (TYPHACEAE) AND THE IMPACT OF POLLUTANTS EXAMINED WITH TANDEM-REPETITIVE DNA PROBES

EPA Science Inventory

Genetic diversity at variable-number-tandem-repeat (VNTR) loci was examined in the common cattail, Typha latifolia (Typhaceae), using three synthetic DNA probes composed of tandemly repeated "core" sequences (GACA, GATA, and GCAC). The principal objectives of this investigation w...
Detecting long tandem duplications in genomic sequences.

PubMed

Audemard, Eric; Schiex, Thomas; Faraut, Thomas

2012-05-08

Detecting duplication segments within completely sequenced genomes provides valuable information to address genome evolution and in particular the important question of the emergence of novel functions. The usual approach to gene duplication detection, based on all-pairs protein gene comparisons, provides only a restricted view of duplication. In this paper, we introduce ReD Tandem, a software using a flow based chaining algorithm targeted at detecting tandem duplication arrays of moderate to longer length regions, with possibly locally weak similarities, directly at the DNA level. On the A. thaliana genome, using a reference set of tandem duplicated genes built using TAIR,(a) we show that ReD Tandem is able to predict a large fraction of recently duplicated genes (dS < 1) and that it is also able to predict tandem duplications involving non coding elements such as pseudo-genes or RNA genes. ReD Tandem allows to identify large tandem duplications without any annotation, leading to agnostic identification of tandem duplications. This approach nicely complements the usual protein gene based which ignores duplications involving non coding regions. It is however inherently restricted to relatively recent duplications. By recovering otherwise ignored events, ReD Tandem gives a more comprehensive view of existing evolutionary processes and may also allow to improve existing annotations.
A novel signal transduction protein: Combination of solute binding and tandem PAS-like sensor domains in one polypeptide chain.

PubMed

Wu, R; Wilton, R; Cuff, M E; Endres, M; Babnigg, G; Edirisinghe, J N; Henry, C S; Joachimiak, A; Schiffer, M; Pokkuluri, P R

2017-04-01

We report the structural and biochemical characterization of a novel periplasmic ligand-binding protein, Dret_0059, from Desulfohalobium retbaense DSM 5692, an organism isolated from Lake Retba, in Senegal. The structure of the protein consists of a unique combination of a periplasmic solute binding protein (SBP) domain at the N-terminal and a tandem PAS-like sensor domain at the C-terminal region. SBP domains are found ubiquitously, and their best known function is in solute transport across membranes. PAS-like sensor domains are commonly found in signal transduction proteins. These domains are widely observed as parts of many protein architectures and complexes but have not been observed previously within the same polypeptide chain. In the structure of Dret_0059, a ketoleucine moiety is bound to the SBP, whereas a cytosine molecule is bound in the distal PAS-like domain of the tandem PAS-like domain. Differential scanning flourimetry support the binding of ligands observed in the crystal structure. There is significant interaction between the SBP and tandem PAS-like domains, and it is possible that the binding of one ligand could have an effect on the binding of the other. We uncovered three other proteins with this structural architecture in the non-redundant sequence data base, and predict that they too bind the same substrates. The genomic context of this protein did not offer any clues for its function. We did not find any biological process in which the two observed ligands are coupled. The protein Dret_0059 could be involved in either signal transduction or solute transport. © 2017 The Protein Society.
A divergent Pumilio repeat protein family for pre-rRNA processing and mRNA localization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Qiu, Chen; McCann, Kathleen L.; Wine, Robert N.

Pumilio/feminization of XX and XO animals (fem)-3 mRNA-binding factor (PUF) proteins bind sequence specifically to mRNA targets using a single-stranded RNA-binding domain comprising eight Pumilio (PUM) repeats. PUM repeats have now been identified in proteins that function in pre-rRNA processing, including human Puf-A and yeast Puf6. This is a role not previously ascribed to PUF proteins. In this paper we present crystal structures of human Puf-A that reveal a class of nucleic acid-binding proteins with 11 PUM repeats arranged in an “L”-like shape. In contrast to classical PUF proteins, Puf-A forms sequence-independent interactions with DNA or RNA, mediated by conservedmore » basic residues. We demonstrate that equivalent basic residues in yeast Puf6 are important for RNA binding, pre-rRNA processing, and mRNA localization. Finally, PUM repeats can be assembled into alternative folds that bind to structured nucleic acids in addition to forming canonical eight-repeat crescent-shaped RNA-binding domains found in classical PUF proteins.« less
A divergent Pumilio repeat protein family for pre-rRNA processing and mRNA localization

DOE PAGES

Qiu, Chen; McCann, Kathleen L.; Wine, Robert N.; ...

2014-12-15

Pumilio/feminization of XX and XO animals (fem)-3 mRNA-binding factor (PUF) proteins bind sequence specifically to mRNA targets using a single-stranded RNA-binding domain comprising eight Pumilio (PUM) repeats. PUM repeats have now been identified in proteins that function in pre-rRNA processing, including human Puf-A and yeast Puf6. This is a role not previously ascribed to PUF proteins. In this paper we present crystal structures of human Puf-A that reveal a class of nucleic acid-binding proteins with 11 PUM repeats arranged in an “L”-like shape. In contrast to classical PUF proteins, Puf-A forms sequence-independent interactions with DNA or RNA, mediated by conservedmore » basic residues. We demonstrate that equivalent basic residues in yeast Puf6 are important for RNA binding, pre-rRNA processing, and mRNA localization. Finally, PUM repeats can be assembled into alternative folds that bind to structured nucleic acids in addition to forming canonical eight-repeat crescent-shaped RNA-binding domains found in classical PUF proteins.« less
Identification and Analysis of Novel Amino-Acid Sequence Repeats in Bacillus anthracis str. Ames Proteome Using Computational Tools

PubMed Central

Hemalatha, G. R.; Rao, D. Satyanarayana; Guruprasad, L.

2007-01-01

We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure. PMID:17538688
Structure of thrombospondin type 3 repeats in bacterial outer membrane protein A reveals its intra-repeat disulfide bond-dependent calcium-binding capability

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dai, Shuyan; Sun, Cancan; Tan, Kemin

Eukaryotic thrombospondin type 3 repeat (TT3R) is an efficient calcium ion (Ca2+) binding motif only found in mammalian thrombospondin family. TT3R has also been found in prokaryotic cellulase Cel5G, which was thought to forfeit the Ca2+-binding capability due to the formation of intra-repeat disulfide bonds, instead of the inter-repeat ones possessed by eukaryotic TT3Rs. In this study, we have identified an enormous number of prokaryotic TT3R-containing proteins belonging to several different protein families, including outer membrane protein A (OmpA), an important structural protein connecting the outer membrane and the periplasmic peptidoglycan layer in gram-negative bacteria. Here, we report the crystalmore » structure of the periplasmic region of OmpA from Capnocytophaga gingivalis, which contains a linker region comprising five consecutive TT3Rs. The structure of OmpA-TT3R exhibits a well-ordered architecture organized around two tightly-coordinated Ca2+ and confirms the presence of abnormal intra-repeat disulfide bonds. Further mutagenesis studies showed that the Ca2+-binding capability of OmpA-TT3R is indeed dependent on the proper formation of intra-repeat disulfide bonds, which help to fix a conserved glycine residue at its proper position for Ca2+ coordination. Additionally, despite lacking inter repeat disulfide bonds, the interfaces between adjacent OmpA-TT3Rs are enhanced by both hydrophobic and conserved aromatic-proline interactions.« less

Development of a Multiple-Locus Variable number of tandem repeat Analysis (MLVA) for Leptospira interrogans and its application to Leptospira interrogans serovar Australis isolates from Far North Queensland, Australia

PubMed Central

Slack, Andrew T; Dohnt, Michael F; Symonds, Meegan L; Smythe, Lee D

2005-01-01

Background Leptospirosis is a zoonotic disease caused by the genus, Leptospira. Leptospira interrogans is the most common genomospecies implicated in the disease. Epidemiological investigations are needed to distinguish outbreak situations or to trace reservoirs of the organisms. Current methodologies used for typing Leptospira have significant drawbacks. The development of an easy to perform yet high resolution method is needed for this organism. Methods In this study we have searched the available genomic sequence of L. interrogans serovar Copenhageni strain Fiocruz L1-130 for the presence of tandem repeats [1]. These repeats were evaluated against reference strains for diversity. Six loci were selected to create a Multiple Locus Variable Number of Tandem Repeats (VNTR) Analysis (MLVA) to explore the genetic diversity within L. interrogans serovar Australis clinical isolates from Far North Queensland. Results The 39 reference strains used for the development of the method displayed 39 distinct patterns. Diversity Indexes for the loci varied between 0.80 and 0.93 and the number of repeat units at each locus varied between less than one to 52 repeats. When the MLVA was applied to serovar Australis isolates three large clusters were distinguishable, each comprising various hosts including Rattus species, human and canines. Conclusion The MLVA described in this report, was easy to perform, analyse and was reproducible. The loci selected had high diversity allowing discrimination between serovars and also between strains within a serovar. This method provides a starting point on which improvements to the method and comparisons to other techniques can be made. PMID:15987533
Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese

PubMed Central

Ebstein, Richard P.; Monakhov, Mikhail V.; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

2015-01-01

Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal–conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. PMID:26246555
Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese.

PubMed

Ebstein, Richard P; Monakhov, Mikhail V; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

2015-08-22

Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal-conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. © 2015 The Author(s).
Characterization of muscle ankyrin repeat proteins in human skeletal muscle.

PubMed

Wette, Stefan G; Smith, Heather K; Lamb, Graham D; Murphy, Robyn M

2017-09-01

Muscle ankyrin repeat proteins (MARPs) are a family of titin-associated, stress-response molecules and putative transducers of stretch-induced signaling in skeletal muscle. In cardiac muscle, cardiac ankyrin repeat protein (CARP) and diabetes-related ankyrin repeat protein (DARP) reportedly redistribute from binding sites on titin to the nucleus following a prolonged stretch. However, it is unclear whether ankyrin repeat domain protein 2 (Ankrd 2) shows comparable stretch-induced redistribution to the nucleus. We measured the following in rested human skeletal muscle: 1 ) the absolute amount of MARPs and 2 ) the distribution of Ankrd 2 and DARP in both single fibers and whole muscle preparations. In absolute amounts, Ankrd 2 is the most abundant MARP in human skeletal muscle, there being ~3.1 µmol/kg, much greater than DARP and CARP (~0.11 and ~0.02 µmol/kg, respectively). All DARP was found to be tightly bound at cytoskeletal (or possibly nuclear) sites. In contrast, ~70% of the total Ankrd 2 is freely diffusible in the cytosol [including virtually all of the phosphorylated (p)Ankrd 2-Ser99 form], ~15% is bound to non-nuclear membranes, and ~15% is bound at cytoskeletal sites, likely at the N2A region of titin. These data are not consistent with the proposal that Ankrd 2, per se, or pAnkrd 2-Ser99 mediates stretch-induced signaling in skeletal muscle, dissociating from titin and translocating to the nucleus, because the majority of these forms of Ankrd 2 are already free in the cytosol. It will be necessary to show that the titin-associated Ankrd 2 is modified by stretch in some as-yet-unidentified way, distinct from the diffusible pool, if it is to act as a stretch-sensitive signaling molecule. Copyright © 2017 the American Physiological Society.
Identification and characterization of short tandem repeats in the Tibetan macaque genome based on resequencing data.

PubMed

Liu, San-Xu; Hou, Wei; Zhang, Xue-Yan; Peng, Chang-Jun; Yue, Bi-Song; Fan, Zhen-Xin; Li, Jing

2018-07-18

The Tibetan macaque, which is endemic to China, is currently listed as a Near Endangered primate species by the International Union for Conservation of Nature (IUCN). Short tandem repeats (STRs) refer to repetitive elements of genome sequence that range in length from 1-6 bp. They are found in many organisms and are widely applied in population genetic studies. To clarify the distribution characteristics of genome-wide STRs and understand their variation among Tibetan macaques, we conducted a genome-wide survey of STRs with next-generation sequencing of five macaque samples. A total of 1 077 790 perfect STRs were mined from our assembly, with an N50 of 4 966 bp. Mono-nucleotide repeats were the most abundant, followed by tetra- and di-nucleotide repeats. Analysis of GC content and repeats showed consistent results with other macaques. Furthermore, using STR analysis software (lobSTR), we found that the proportion of base pair deletions in the STRs was greater than that of insertions in the five Tibetan macaque individuals (P<0.05, t-test). We also found a greater number of homozygous STRs than heterozygous STRs (P<0.05, t-test), with the Emei and Jianyang Tibetan macaques showing more heterozygous loci than Huangshan Tibetan macaques. The proportion of insertions and mean variation of alleles in the Emei and Jianyang individuals were slightly higher than those in the Huangshan individuals, thus revealing differences in STR allele size between the two populations. The polymorphic STR loci identified based on the reference genome showed good amplification efficiency and could be used to study population genetics in Tibetan macaques. The neighbor-joining tree classified the five macaques into two different branches according to their geographical origin, indicating high genetic differentiation between the Huangshan and Sichuan populations. We elucidated the distribution characteristics of STRs in the Tibetan macaque genome and provided an effective method for
[Convergent origin of repeats in genes coding for globular proteins. An analysis of the factors determining the presence of inverted and symmetrical repeats].

PubMed

Solov'ev, V V; Kel', A E; Kolchanov, N A

1989-01-01

The factors, determining the presence of inverted and symmetrical repeats in genes coding for globular proteins, have been analysed. An interesting property of genetical code has been revealed in the analysis of symmetrical repeats: the pairs of symmetrical codons corresponded to pairs of amino acids with mostly similar physical-chemical parameters. This property may explain the presence of symmetrical repeats and palindromes only in genes coding for beta-structural proteins-polypeptides, where amino acids with similar physical-chemical properties occupy symmetrical positions. A stochastic model of evolution of polynucleotide sequences has been used for analysis of inverted repeats. The modelling demonstrated that only limiting of sequences (uneven frequencies of used codons) is enough for arising of nonrandom inverted repeats in genes.
An improved genome assembly uncovers prolific tandem repeats in Atlantic cod.

PubMed

Tørresen, Ole K; Star, Bastiaan; Jentoft, Sissel; Reinar, William B; Grove, Harald; Miller, Jason R; Walenz, Brian P; Knight, James; Ekholm, Jenny M; Peluso, Paul; Edvardsen, Rolf B; Tooming-Klunderud, Ave; Skage, Morten; Lien, Sigbjørn; Jakobsen, Kjetill S; Nederbragt, Alexander J

2017-01-18

The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance.
[Association of aggressive behaviors of schizophrenia with short tandem repeats loci].

PubMed

Yang, Chun; Ba, Huajie; Tan, Xingqi; Zhao, Hanqing; Zhang, Shuyou; Yu, Haiying

2017-12-10

To assess the association of short tandem repeats (STRs) loci with aggressive behaviors of schizophrenia. Blood samples from 123 schizophrenic patients with aggressive behaviors and 489 schizophrenic patients without aggressive behaviors were collected. DNA from all samples was amplified with a PowerPlex 21 system and separated by electrophoresis to determine the genotypes and allelic frequencies of 20 STR loci including D3S1368, D1S1656, D6S1043, D13S317, Penta E, D16S639, D18S51, D2S1338, CSF1PO, Penta D, TH01, vWA, D21S11, D7S820, D5S818, TPOX, D8S1179, D12S391, D19S433, and FGA. All of the 20 STR loci have reached Hardy-Weinberg equilibrium in both groups. A significant difference was found in allelic and genotypic frequencies of loci Penta D between the two groups (alleles: P=0.042; genotypes: P=0.014) but not for the remaining 19 loci (P> 0.05). Univariate analysis also showed a significant difference for allele 10 and genotypes 10-12 of Penta D between the two groups (P=0.0027, P=0.0001), with the OR being 1.81 (95%CI: 1.22-2.67) and 4.33 (95%CI: 1.95-9.59), respectively. Penta D may be associated with aggressive behaviors of schizophrenia. Allele 10 and genotypes 10-12 of Penta D may confer a risk for the disease.
A carrot leucine-rich-repeat protein that inhibits ice recrystallization.

PubMed

Worrall, D; Elias, L; Ashford, D; Smallwood, M; Sidebottom, C; Lillford, P; Telford, J; Holt, C; Bowles, D

1998-10-02

Many organisms adapted to live at subzero temperatures express antifreeze proteins that improve their tolerance to freezing. Although structurally diverse, all antifreeze proteins interact with ice surfaces, depress the freezing temperature of aqueous solutions, and inhibit ice crystal growth. A protein purified from carrot shares these functional features with antifreeze proteins of fish. Expression of the carrot complementary DNA in tobacco resulted in the accumulation of antifreeze activity in the apoplast of plants grown at greenhouse temperatures. The sequence of carrot antifreeze protein is similar to that of polygalacturonase inhibitor proteins and contains leucine-rich repeats.
Development of liquid chromatography-tandem mass spectrometry methods for the quantitation of Anisakis simplex proteins in fish.

PubMed

Fæste, Christiane Kruse; Moen, Anders; Schniedewind, Björn; Haug Anonsen, Jan; Klawitter, Jelena; Christians, Uwe

2016-02-05

The parasite Anisakis simplex is present in many marine fish species that are directly used as food or in processed products. The anisakid larvae infect mostly the gut and inner organs of fish but have also been shown to penetrate into the fillet. Thus, human health can be at risk, either by contracting anisakiasis through the consumption of raw or under-cooked fish, or by sensitisation to anisakid proteins in processed food. A number of different methods for the detection of A. simplex in fish and products thereof have been developed, including visual techniques and PCR for larvae tracing, and immunological assays for the determination of proteins. The recent identification of a number of anisakid proteins by mass spectrometry-based proteomics has laid the groundwork for the development of two quantitative liquid chromatography-tandem mass spectrometry methods for the detection of A. simplex in fish that are described in the present study. Both, the label-free semi-quantitative nLC-nESI-Orbitrap-MS/MS (MS1) and the heavy peptide-applying absolute-quantitative (AQUA) LC-TripleQ-MS/MS (MS2) use unique reporter peptides derived from anisakid hemoglobin and SXP/RAL-2 protein as analytes. Standard curves in buffer and in salmon matrix showed limits of detection at 1μg/mL and 10μg/mL for MS1 and 0.1μg/mL and 2μg/mL for MS2. Preliminary method validation included the assessment of sensitivity, repeatability, reproducibility, and applicability to incurred and naturally-contaminated samples for both assays. By further optimization and full validation in accordance with current recommendations the LC-MS/MS methods could be standardized and used generally as confirmative techniques for the detection of A. simplex protein in fish. Copyright © 2016 Elsevier B.V. All rights reserved.
Developmental Validation of Short Tandem Repeat Reagent Kit for Forensic DNA Profiling of Canine Biological Materials

PubMed Central

Dayton, Melody; Koskinen, Mikko T; Tom, Bradley K; Mattila, Anna-Maria; Johnston, Eric; Halverson, Joy; Fantin, Dennis; DeNise, Sue; Budowle, Bruce; Smith, David Glenn; Kanthaswamy, Sree

2009-01-01

Aim To develop a reagent kit that enables multiplex polymerase chain reaction (PCR) amplification of 18 short tandem repeats (STR) and the canine sex-determining Zinc Finger marker. Methods Validation studies to determine the robustness and reliability in forensic DNA typing of this multiplex assay included sensitivity testing, reproducibility studies, intra- and inter-locus color balance studies, annealing temperature and cycle number studies, peak height ratio determination, characterization of artifacts such as stutter percentages and dye blobs, mixture analyses, species-specificity, case type samples analyses and population studies. Results The kit robustly amplified domesticated dog samples and consistently generated full 19-locus profiles from as little as 125 pg of dog DNA. In addition, wolf DNA samples could be analyzed with the kit. Conclusion The kit, which produces robust, reliable, and reproducible results, will be made available for the forensic research community after modifications based on this study’s evaluation to comply with the quality standards expected for forensic casework. PMID:19480022
The association of 22 Y chromosome short tandem repeat loci with initiative-aggressive behavior.

PubMed

Yang, Chun; Ba, Huajie; Zhang, Wei; Zhang, Shuyou; Zhao, Hanqing; Yu, Haiying; Gao, Zhiqin; Wang, Binbin

2018-05-15

Aggressive behavior represents an important public concern and a clinical challenge to behaviorists and psychiatrists. Aggression in humans is known to have an important genetic basis, so to investigate the association of Y chromosome short tandem repeat (Y-STR) loci with initiative-aggressive behavior, we compared allelic and haplotypic distributions of 22 Y-STRs in a group of Chinese males convicted of premeditated extremely violent crimes (n = 271) with a normal control group (n = 492). Allelic distributions of DYS533 and DYS437 loci differed significantly between the two groups (P < 0.05). The case group had higher frequencies of DYS533 allele 14, DYS437 allele 14, and haplotypes 11-14 of DYS533-DYS437 compared with the control group. Additionally, the DYS437 allele 15 frequency was significantly lower in cases than controls. No frequency differences were observed in the other 20 Y-STR loci between these two groups. Our results indicate a genetic role for Y-STR loci in the development of initiative aggression in non-psychiatric subjects. Copyright © 2018 Elsevier B.V. All rights reserved.
Identification of Variable-Number Tandem-Repeat (VNTR) Sequences in Acinetobacter baumannii and Interlaboratory Validation of an Optimized Multiple-Locus VNTR Analysis Typing Scheme▿†

PubMed Central

Pourcel, Christine; Minandri, Fabrizia; Hauck, Yolande; D'Arezzo, Silvia; Imperi, Francesco; Vergnaud, Gilles; Visca, Paolo

2011-01-01

Acinetobacter baumannii is an important opportunistic pathogen responsible for nosocomial outbreaks, mostly occurring in intensive care units. Due to the multiplicity of infection sources, reliable molecular fingerprinting techniques are needed to establish epidemiological correlations among A. baumannii isolates. Multiple-locus variable-number tandem-repeat analysis (MLVA) has proven to be a fast, reliable, and cost-effective typing method for several bacterial species. In this study, an MLVA assay compatible with simple PCR- and agarose gel-based electrophoresis steps as well as with high-throughput automated methods was developed for A. baumannii typing. Preliminarily, 10 potential polymorphic variable-number tandem repeats (VNTRs) were identified upon bioinformatic screening of six annotated genome sequences of A. baumannii. A collection of 7 reference strains plus 18 well-characterized isolates, including unique types and representatives of the three international A. baumannii lineages, was then evaluated in a two-center study aimed at validating the MLVA assay and comparing it with other genotyping assays, namely, macrorestriction analysis with pulsed-field gel electrophoresis (PFGE) and PCR-based sequence group (SG) profiling. The results showed that MLVA can discriminate between isolates with identical PFGE types and SG profiles. A panel of eight VNTR markers was selected, all showing the ability to be amplified and good amounts of polymorphism in the majority of strains. Independently generated MLVA profiles, composed of an ordered string of allele numbers corresponding to the number of repeats at each VNTR locus, were concordant between centers. Typeability, reproducibility, stability, discriminatory power, and epidemiological concordance were excellent. A database containing information and MLVA profiles for several A. baumannii strains is available from http://mlva.u-psud.fr/. PMID:21147956
Electrospray-assisted laser desorption/ionization and tandem mass spectrometry of peptides and proteins.

PubMed

Peng, Ivory X; Shiea, Jentaie; Ogorzalek Loo, Rachel R; Loo, Joseph A

2007-01-01

We have constructed an electrospray-assisted laser desorption/ionization (ELDI) source which utilizes a nitrogen laser pulse to desorb intact molecules from matrix-containing sample solution droplets, followed by electrospray ionization (ESI) post-ionization. The ELDI source is coupled to a quadrupole ion trap mass spectrometer and allows sampling under ambient conditions. Preliminary data showed that ELDI produces ESI-like multiply charged peptides and proteins up to 29 kDa carbonic anhydrase and 66 kDa bovine albumin from single-protein solutions, as well as from complex digest mixtures. The generated multiply charged polypeptides enable efficient tandem mass spectrometric (MS/MS)-based peptide sequencing. ELDI-MS/MS of protein digests and small intact proteins was performed both by collisionally activated dissociation (CAD) and by nozzle-skimmer dissociation (NSD). ELDI-MS/MS may be a useful tool for protein sequencing analysis and top-down proteomics study, and may complement matrix-assisted laser desorption/ionization (MALDI)-based measurements. Copyright (c) 2007 John Wiley & Sons, Ltd.
[Reticulate evolution of parthenogenetic species of the Lacertidae rock lizards: inheritance of CLsat tandem repeats and anonymous RAPD markers].

PubMed

Chobanu, D; Rudykh, I A; Riabinina, N L; Grechko, V V; Kramerov, D A; Darevskiĭ, I S

2002-01-01

The genetic relatedness of several bisexual and of four unisexual "Lacerta saxicola complex" lizards was studied, using monomer sequences of the complex-specific CLsat tandem repeats and anonymous RAPD markers. Genomes of parthenospecies were shown to include different satellite monomers. The structure of each such monomer is specific for a certain pair of bisexual species. This fact might be interpreted in favor of co-dominant inheritance of these markers in bisexual species hybridogenesis. This idea is supported by the results obtained with RAPD markers; i.e., unisexual species genomes include only the loci characteristic of certain bisexual species. At the same time, in neither case parthenospecies possess specific, autoapomorphic loci that were not present in this or that bisexual species.
Binomial probability distribution model-based protein identification algorithm for tandem mass spectrometry utilizing peak intensity information.

PubMed

Xiao, Chuan-Le; Chen, Xiao-Zhou; Du, Yang-Li; Sun, Xuesong; Zhang, Gong; He, Qing-Yu

2013-01-04

Mass spectrometry has become one of the most important technologies in proteomic analysis. Tandem mass spectrometry (LC-MS/MS) is a major tool for the analysis of peptide mixtures from protein samples. The key step of MS data processing is the identification of peptides from experimental spectra by searching public sequence databases. Although a number of algorithms to identify peptides from MS/MS data have been already proposed, e.g. Sequest, OMSSA, X!Tandem, Mascot, etc., they are mainly based on statistical models considering only peak-matches between experimental and theoretical spectra, but not peak intensity information. Moreover, different algorithms gave different results from the same MS data, implying their probable incompleteness and questionable reproducibility. We developed a novel peptide identification algorithm, ProVerB, based on a binomial probability distribution model of protein tandem mass spectrometry combined with a new scoring function, making full use of peak intensity information and, thus, enhancing the ability of identification. Compared with Mascot, Sequest, and SQID, ProVerB identified significantly more peptides from LC-MS/MS data sets than the current algorithms at 1% False Discovery Rate (FDR) and provided more confident peptide identifications. ProVerB is also compatible with various platforms and experimental data sets, showing its robustness and versatility. The open-source program ProVerB is available at http://bioinformatics.jnu.edu.cn/software/proverb/ .
REPETITA: detection and discrimination of the periodicity of protein solenoid repeats by discrete Fourier transform

PubMed Central

Marsella, Luca; Sirocco, Francesco; Trovato, Antonio; Seno, Flavio; Tosatto, Silvio C.E.

2009-01-01

Motivation: Proteins with solenoid repeats evolve more quickly than non-repetitive ones and their periodicity may be rapidly hidden at sequence level, while still evident in structure. In order to identify these repeats, we propose here a novel method based on a metric characterizing amino-acid properties (polarity, secondary structure, molecular volume, codon diversity, electric charge) using five previously derived numerical functions. Results: The five spectra of the candidate sequences coding for structural repeats, obtained by Discrete Fourier Transform (DFT), show common features allowing determination of repeat periodicity with excellent results. Moreover it is possible to introduce a phase space parameterized by two quantities related to the Fourier spectra which allow for a clear distinction between a non-homologous set of globular proteins and proteins with solenoid repeats. The DFT method is shown to be competitive with other state of the art methods in the detection of solenoid structures, while improving its performance especially in the identification of periodicities, since it is able to recognize the actual repeat length in most cases. Moreover it highlights the relevance of local structural propensities in determining solenoid repeats. Availability: A web tool implementing the algorithm presented in the article (REPETITA) is available with additional details on the data sets at the URL: http://protein.bio.unipd.it/repetita/. Contact: silvio.tosatto@unipd.it PMID:19478001
Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm

PubMed Central

Glunčić, Matko; Paar, Vladimir

2013-01-01

The main feature of global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram. In this way, we obtain very fast, efficient and highly automatized repeat finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of ‘magnifying glass’ effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes). PMID:22977183
Myotonin protein-kinase [AGC]n trinucleotide repeat in seven nonhuman primates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Novelli, G.; Sineo, L.; Pontieri, E.

Myotonic dystrophy (DM) is due to a genomic instability of a trinucleotide [AGC]n motif, located at the 3{prime} UTR region of a protein-kinase gene (myotonin protein kinase, MT-PK). The [AGC] repeat is meiotically and mitotically unstable, and it is directly related to the manifestations of the disorder. Although a gene dosage effect of the MT-PK has been demonstrated n DM muscle, the mechanism(s) by which the intragenic repeat expansion leads to disease is largely unknown. This non-standard mutational event could reflect an evolutionary mechanism widespread among animal genomes. We have isolated and sequenced the complete 3{prime}UTR region of the MT-PKmore » gene in seven primates (macaque, orangutan, gorilla, chimpanzee, gibbon, owl monkey, saimiri), and examined by comparative sequence nucleotide analysis the [AGC]n intragenic repeat and the surrounding nucleotides. The genomic organization, including the [AGC]n repeat structure, was conserved in all examined species, excluding the gibbon (Hylobates agilis), in which the [AGC]n upstream sequence (GGAA) is replaced by a GA dinucleotide. The number of [AGC]n in the examined species ranged between 7 (gorilla) and 13 repeats (owl monkeys), with a polymorphism informative content (PIC) similar to that observed in humans. These results indicate that the 3{prime}UTR [AGC] repeat within the MT-PK gene is evolutionarily conserved, supporting that this region has important regulatory functions.« less
Constraints and consequences of the emergence of amino acid repeats in eukaryotic proteins.

PubMed

Chavali, Sreenivas; Chavali, Pavithra L; Chalancon, Guilhem; de Groot, Natalia Sanchez; Gemayel, Rita; Latysheva, Natasha S; Ing-Simmons, Elizabeth; Verstrepen, Kevin J; Balaji, Santhanam; Babu, M Madan

2017-09-01

Proteins with amino acid homorepeats have the potential to be detrimental to cells and are often associated with human diseases. Why, then, are homorepeats prevalent in eukaryotic proteomes? In yeast, homorepeats are enriched in proteins that are essential and pleiotropic and that buffer environmental insults. The presence of homorepeats increases the functional versatility of proteins by mediating protein interactions and facilitating spatial organization in a repeat-dependent manner. During evolution, homorepeats are preferentially retained in proteins with stringent proteostasis, which might minimize repeat-associated detrimental effects such as unregulated phase separation and protein aggregation. Their presence facilitates rapid protein divergence through accumulation of amino acid substitutions, which often affect linear motifs and post-translational-modification sites. These substitutions may result in rewiring protein interaction and signaling networks. Thus, homorepeats are distinct modules that are often retained in stringently regulated proteins. Their presence facilitates rapid exploration of the genotype-phenotype landscape of a population, thereby contributing to adaptation and fitness.

Reverse Transcription Errors and RNA-DNA Differences at Short Tandem Repeats.

PubMed

Fungtammasan, Arkarachai; Tomaszkiewicz, Marta; Campos-Sánchez, Rebeca; Eckert, Kristin A; DeGiorgio, Michael; Makova, Kateryna D

2016-10-01

Transcript variation has important implications for organismal function in health and disease. Most transcriptome studies focus on assessing variation in gene expression levels and isoform representation. Variation at the level of transcript sequence is caused by RNA editing and transcription errors, and leads to nongenetically encoded transcript variants, or RNA-DNA differences (RDDs). Such variation has been understudied, in part because its detection is obscured by reverse transcription (RT) and sequencing errors. It has only been evaluated for intertranscript base substitution differences. Here, we investigated transcript sequence variation for short tandem repeats (STRs). We developed the first maximum-likelihood estimator (MLE) to infer RT error and RDD rates, taking next generation sequencing error rates into account. Using the MLE, we empirically evaluated RT error and RDD rates for STRs in a large-scale DNA and RNA replicated sequencing experiment conducted in a primate species. The RT error rates increased exponentially with STR length and were biased toward expansions. The RDD rates were approximately 1 order of magnitude lower than the RT error rates. The RT error rates estimated with the MLE from a primate data set were concordant with those estimated with an independent method, barcoded RNA sequencing, from a Caenorhabditis elegans data set. Our results have important implications for medical genomics, as STR allelic variation is associated with >40 diseases. STR nonallelic transcript variation can also contribute to disease phenotype. The MLE and empirical rates presented here can be used to evaluate the probability of disease-associated transcripts arising due to RDD. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Evolution and function of CAG/polyglutamine repeats in protein–protein interaction networks

PubMed Central

Schaefer, Martin H.; Wanker, Erich E.; Andrade-Navarro, Miguel A.

2012-01-01

Expanded runs of consecutive trinucleotide CAG repeats encoding polyglutamine (polyQ) stretches are observed in the genes of a large number of patients with different genetic diseases such as Huntington's and several Ataxias. Protein aggregation, which is a key feature of most of these diseases, is thought to be triggered by these expanded polyQ sequences in disease-related proteins. However, polyQ tracts are a normal feature of many human proteins, suggesting that they have an important cellular function. To clarify the potential function of polyQ repeats in biological systems, we systematically analyzed available information stored in sequence and protein interaction databases. By integrating genomic, phylogenetic, protein interaction network and functional information, we obtained evidence that polyQ tracts in proteins stabilize protein interactions. This happens most likely through structural changes whereby the polyQ sequence extends a neighboring coiled-coil region to facilitate its interaction with a coiled-coil region in another protein. Alteration of this important biological function due to polyQ expansion results in gain of abnormal interactions, leading to pathological effects like protein aggregation. Our analyses suggest that research on polyQ proteins should shift focus from expanded polyQ proteins into the characterization of the influence of the wild-type polyQ on protein interactions. PMID:22287626
Multi-laboratory validation study of multilocus variable-number tandem repeat analysis (MLVA) for Salmonella enterica serovar Enteritidis, 2015

PubMed Central

Peters, Tansy; Bertrand, Sophie; Björkman, Jonas T; Brandal, Lin T; Brown, Derek J; Erdõsi, Tímea; Heck, Max; Ibrahem, Salha; Johansson, Karin; Kornschober, Christian; Kotila, Saara M; Le Hello, Simon; Lienemann, Taru; Mattheus, Wesley; Nielsen, Eva Møller; Ragimbeau, Catherine; Rumore, Jillian; Sabol, Ashley; Torpdahl, Mia; Trees, Eija; Tuohy, Alma; de Pinna, Elizabeth

2017-01-01

Multilocus variable-number tandem repeat analysis (MLVA) is a rapid and reproducible typing method that is an important tool for investigation, as well as detection, of national and multinational outbreaks of a range of food-borne pathogens. Salmonella enterica serovar Enteritidis is the most common Salmonella serovar associated with human salmonellosis in the European Union/European Economic Area and North America. Fourteen laboratories from 13 countries in Europe and North America participated in a validation study for MLVA of S. Enteritidis targeting five loci. Following normalisation of fragment sizes using a set of reference strains, a blinded set of 24 strains with known allele sizes was analysed by each participant. The S. Enteritidis 5-loci MLVA protocol was shown to produce internationally comparable results as more than 90% of the participants reported less than 5% discrepant MLVA profiles. All 14 participating laboratories performed well, even those where experience with this typing method was limited. The raw fragment length data were consistent throughout, and the inter-laboratory validation helped to standardise the conversion of raw data to repeat numbers with at least two countries updating their internal procedures. However, differences in assigned MLVA profiles remain between well-established protocols and should be taken into account when exchanging data. PMID:28277220
Multi-laboratory validation study of multilocus variable-number tandem repeat analysis (MLVA) for Salmonella enterica serovar Enteritidis, 2015.

PubMed

Peters, Tansy; Bertrand, Sophie; Björkman, Jonas T; Brandal, Lin T; Brown, Derek J; Erdõsi, Tímea; Heck, Max; Ibrahem, Salha; Johansson, Karin; Kornschober, Christian; Kotila, Saara M; Le Hello, Simon; Lienemann, Taru; Mattheus, Wesley; Nielsen, Eva Møller; Ragimbeau, Catherine; Rumore, Jillian; Sabol, Ashley; Torpdahl, Mia; Trees, Eija; Tuohy, Alma; de Pinna, Elizabeth

2017-03-02

Multilocus variable-number tandem repeat analysis (MLVA) is a rapid and reproducible typing method that is an important tool for investigation, as well as detection, of national and multinational outbreaks of a range of food-borne pathogens. Salmonella enterica serovar Enteritidis is the most common Salmonella serovar associated with human salmonellosis in the European Union/European Economic Area and North America. Fourteen laboratories from 13 countries in Europe and North America participated in a validation study for MLVA of S. Enteritidis targeting five loci. Following normalisation of fragment sizes using a set of reference strains, a blinded set of 24 strains with known allele sizes was analysed by each participant. The S. Enteritidis 5-loci MLVA protocol was shown to produce internationally comparable results as more than 90% of the participants reported less than 5% discrepant MLVA profiles. All 14 participating laboratories performed well, even those where experience with this typing method was limited. The raw fragment length data were consistent throughout, and the inter-laboratory validation helped to standardise the conversion of raw data to repeat numbers with at least two countries updating their internal procedures. However, differences in assigned MLVA profiles remain between well-established protocols and should be taken into account when exchanging data. This article is copyright of The Authors, 2017.
Symmetry based assembly of a 2 dimensional protein lattice

DOE Office of Scientific and Technical Information (OSTI.GOV)

Poulos, Sandra; Agah, Sayeh; Jallah, Nikardi

2017-04-18

The design of proteins that self-assemble into higher order architectures is of great interest due to their potential application in nanotechnology. Specifically, the self-assembly of proteins into ordered lattices is of special interest to the field of structural biology. Here we designed a 2 dimensional (2D) protein lattice using a fusion of a tandem repeat of three TelSAM domains (TTT) to the Ferric uptake regulator (FUR) domain. We determined the structure of the designed (TTT-FUR) fusion protein to 2.3 Å by X-ray crystallographic methods. In agreement with the design, a 2D lattice composed of TelSAM fibers interdigitated by the FURmore » domain was observed. As expected, the fusion of a tandem repeat of three TelSAM domains formed 21 screw axis, and the self-assembly of the ordered oligomer was under pH control. We demonstrated that the fusion of TTT to a domain having a 2-fold symmetry, such as the FUR domain, can produce an ordered 2D lattice. The TTT-FUR system combines features from the rotational symmetry matching approach with the oligomer driven crystallization method. This TTT-FUR fusion was amenable to X-ray crystallographic methods, and is a promising crystallization chaperone.« less
Multi-locus variable number tandem repeat analysis of 7th pandemic Vibrio cholerae

PubMed Central

2012-01-01

Background Seven pandemics of cholera have been recorded since 1817, with the current and ongoing pandemic affecting almost every continent. Cholera remains endemic in developing countries and is still a significant public health issue. In this study we use multilocus variable number of tandem repeats (VNTRs) analysis (MLVA) to discriminate between isolates of the 7th pandemic clone of Vibrio cholerae. Results MLVA of six VNTRs selected from previously published data distinguished 66 V. cholerae isolates collected between 1961–1999 into 60 unique MLVA profiles. Only 4 MLVA profiles consisted of more than 2 isolates. The discriminatory power was 0.995. Phylogenetic analysis showed that, except for the closely related profiles, the relationships derived from MLVA profiles were in conflict with that inferred from Single Nucleotide Polymorphism (SNP) typing. The six SNP groups share consensus VNTR patterns and two SNP groups contained isolates which differed by only one VNTR locus. Conclusions MLVA is highly discriminatory in differentiating 7th pandemic V. cholerae isolates and MLVA data was most useful in resolving the genetic relationships among isolates within groups previously defined by SNPs. Thus MLVA is best used in conjunction with SNP typing in order to best determine the evolutionary relationships among the 7th pandemic V. cholerae isolates and for longer term epidemiological typing. PMID:22624829
Use of Variable-Number Tandem Repeats To Examine Genetic Diversity of Neisseria meningitidis

PubMed Central

Yazdankhah, Siamak P.; Lindstedt, Bjørn-Arne; Caugant, Dominique A.

2005-01-01

Repetitive DNA motifs with potential variable-number tandem repeats (VNTR) were identified in the genome of Neisseria meningitidis and used to develop a typing method. A total of 146 meningococcal isolates recovered from carriers and patients were studied. These included 82 of the 107 N. meningitidis isolates previously used in the development of multilocus sequence typing (MLST), 45 isolates recovered from different counties in Norway in connection with local outbreaks, and 19 serogroup W135 isolates of sequence type 11 (ST-11), which were recovered in several parts of the world. The latter group comprised isolates related to the Hajj outbreak of 2000 and isolates recovered from outbreaks in Burkina Faso in 2001 and 2002. All isolates had been characterized previously by MLST or multilocus enzyme electrophoresis (MLEE). VNTR analysis showed that meningococcal isolates with similar MLST or MLEE types recovered from epidemiologically linked cases in a defined geographical area often presented similar VNTR patterns while isolates of the same MLST or MLEE types without an obvious epidemiological link showed variable VNTR patterns. Thus, VNTR analysis may be used for fine typing of meningococcal isolates after MLST or MLEE typing. The method might be especially valuable for differentiating among ST-11 strains, as shown by the VNTR analyses of serogroup W135 ST-11 meningococcal isolates recovered since the mid-1990s. PMID:15814988
Selection of specific protein binders for pre-defined targets from an optimized library of artificial helicoidal repeat proteins (alphaRep).

PubMed

Guellouz, Asma; Valerio-Lepiniec, Marie; Urvoas, Agathe; Chevrel, Anne; Graille, Marc; Fourati-Kammoun, Zaineb; Desmadril, Michel; van Tilbeurgh, Herman; Minard, Philippe

2013-01-01

We previously designed a new family of artificial proteins named αRep based on a subgroup of thermostable helicoidal HEAT-like repeats. We have now assembled a large optimized αRep library. In this library, the side chains at each variable position are not fully randomized but instead encoded by a distribution of codons based on the natural frequency of side chains of the natural repeats family. The library construction is based on a polymerization of micro-genes and therefore results in a distribution of proteins with a variable number of repeats. We improved the library construction process using a "filtration" procedure to retain only fully coding modules that were recombined to recreate sequence diversity. The final library named Lib2.1 contains 1.7×10(9) independent clones. Here, we used phage display to select, from the previously described library or from the new library, new specific αRep proteins binding to four different non-related predefined protein targets. Specific binders were selected in each case. The results show that binders with various sizes are selected including relatively long sequences, with up to 7 repeats. ITC-measured affinities vary with Kd values ranging from micromolar to nanomolar ranges. The formation of complexes is associated with a significant thermal stabilization of the bound target protein. The crystal structures of two complexes between αRep and their cognate targets were solved and show that the new interfaces are established by the variable surfaces of the repeated modules, as well by the variable N-cap residues. These results suggest that αRep library is a new and versatile source of tight and specific binding proteins with favorable biophysical properties.
Disease-associated repeat instability and mismatch repair.

PubMed

Schmidt, Monika H M; Pearson, Christopher E

2016-02-01

Expanded tandem repeat sequences in DNA are associated with at least 40 human genetic neurological, neurodegenerative, and neuromuscular diseases. Repeat expansion can occur during parent-to-offspring transmission, and arise at variable rates in specific tissues throughout the life of an affected individual. Since the ongoing somatic repeat expansions can affect disease age-of-onset, severity, and progression, targeting somatic expansion holds potential as a therapeutic target. Thus, understanding the factors that regulate this mutation is crucial. DNA repair, in particular mismatch repair (MMR), is the major driving force of disease-associated repeat expansions. In contrast to its anti-mutagenic roles, mammalian MMR curiously drives the expansion mutations of disease-associated (CAG)·(CTG) repeats. Recent advances have broadened our knowledge of both the MMR proteins involved in disease repeat expansions, including: MSH2, MSH3, MSH6, MLH1, PMS2, and MLH3, as well as the types of repeats affected by MMR, now including: (CAG)·(CTG), (CGG)·(CCG), and (GAA)·(TTC) repeats. Mutagenic slipped-DNA structures have been detected in patient tissues, and the size of the slip-out and their junction conformation can determine the involvement of MMR. Furthermore, the formation of other unusual DNA and R-loop structures is proposed to play a key role in MMR-mediated instability. A complex correlation is emerging between tissues showing varying amounts of repeat instability and MMR expression levels. Notably, naturally occurring polymorphic variants of DNA repair genes can have dramatic effects upon the levels of repeat instability, which may explain the variation in disease age-of-onset, progression and severity. An increasing grasp of these factors holds prognostic and therapeutic potential. Copyright © 2015 Elsevier B.V. All rights reserved.
Multiplex detection of protein toxins using MALDI-TOF-TOF tandem mass spectrometry: application in unambiguous toxin detection from bioaerosol.

PubMed

Alam, Syed Imteyaz; Kumar, Bhoj; Kamboj, Dev Vrat

2012-12-04

Protein toxins, such as botulinum neurotoxins (BoNTs), Clostridium perfringens epsilon toxin (ETX), staphylococcal enterotoxin B (SEB), shiga toxin (STX), and plant toxin ricin, are involved in a number of diseases and are considered as potential agents for bioterrorism and warfare. From a bioterrorism and warfare perspective, these agents are likely to cause maximum damage to a civilian or military population through an inhalational route of exposure and aerosol is considered the envisaged mode of delivery. Unambiguous detection of toxin from aerosol is of paramount importance, both for bringing mitigation protocols into operation and for implementation of effective medical countermeasures, in case a "biological cloud" is seen over a population. A multiplex, unambiguous, and qualitative detection of protein toxins is reported here using tandem mass spectrometry with MALDI-TOF-TOF. The methodology involving simple sample processing steps was demonstrated to identify toxins (ETX, Clostridium perfringes phospholipase C, and SEB) from blind spiked samples. The novel directed search approach using a list of unique peptides was used to identify toxins from a complex protein mixture. The bioinformatic analysis of seven protein toxins for elucidation of unique peptides with conservation status across all known sequences provides a high confidence for detecting toxins originating from any geographical location and source organism. Use of tandem MS data with peptide sequence information increases the specificity of the method. A prototype for generation of aerosol using a nebulizer and collection using a cyclone collector was used to provide a proof of concept for unambiguous detection of toxin from aerosol using precursor directed tandem mass spectrometry combined with protein database searching. ETX prototoxin could be detected from aerosol at 0.2 ppb concentration in aerosol.
A large complement of the predicted Arabidopsis ARM repeat proteins are members of the U-box E3 ubiquitin ligase family.

PubMed

Mudgil, Yashwanti; Shiu, Shin-Han; Stone, Sophia L; Salt, Jennifer N; Goring, Daphne R

2004-01-01

The Arabidopsis genome was searched to identify predicted proteins containing armadillo (ARM) repeats, a motif known to mediate protein-protein interactions in a number of different animal proteins. Using domain database predictions and models generated in this study, 108 Arabidopsis proteins were identified that contained a minimum of two ARM repeats with the majority of proteins containing four to eight ARM repeats. Clustering analysis showed that the 108 predicted Arabidopsis ARM repeat proteins could be divided into multiple groups with wide differences in their domain compositions and organizations. Interestingly, 41 of the 108 Arabidopsis ARM repeat proteins contained a U-box, a motif present in a family of E3 ligases, and these proteins represented the largest class of Arabidopsis ARM repeat proteins. In 14 of these U-box/ARM repeat proteins, there was also a novel conserved domain identified in the N-terminal region. Based on the phylogenetic tree, representative U-box/ARM repeat proteins were selected for further study. RNA-blot analyses revealed that these U-box/ARM proteins are expressed in a variety of tissues in Arabidopsis. In addition, the selected U-box/ARM proteins were found to be functional E3 ubiquitin ligases. Thus, these U-box/ARM proteins represent a new family of E3 ligases in Arabidopsis.
Improved Tandem Affinity Purification Tag and Methods for Isolation of Proteins and Protein Complexes from Schizosaccharomyces pombe.

PubMed

Zilio, Nicola; Boddy, Michael N

2017-03-01

The tandem affinity purification (TAP) method uses an epitope that contains two different affinity purification tags separated by a site-specific protease site to isolate a protein rapidly and easily. Proteins purified via the TAP tag are eluted under mild conditions, allowing them to be used for structural and biochemical analyses. The original TAP tag contains a calmodulin-binding peptide and the IgG-binding domain from protein A separated by a tobacco etch virus (TEV) protease cleavage site. After capturing the Protein A epitope on an IgG resin, bound proteins are released by incubation with the TEV protease and then isolated on a calmodulin matrix in the presence of calcium; elution from this resin is achieved by chelating calcium with EGTA. However, because the robustness of the calmodulin-binding step in this procedure is highly variable, we replaced the calmodulin-binding peptide with three copies of the FLAG epitope, (3× FLAG)-TEV-Protein A, which can be isolated using an anti-FLAG resin. Elution from this matrix is achieved in the presence of an excess of a 3× FLAG peptide. In addition to allowing proteins to be released under mild conditions, elution by the 3× FLAG peptide adds an extra layer of specificity to the TAP procedure, because it liberates only FLAG-tagged proteins. © 2017 Cold Spring Harbor Laboratory Press.
Gibbs motif sampling: detection of bacterial outer membrane protein repeats.

PubMed Central

Neuwald, A. F.; Liu, J. S.; Lawrence, C. E.

1995-01-01

The detection and alignment of locally conserved regions (motifs) in multiple sequences can provide insight into protein structure, function, and evolution. A new Gibbs sampling algorithm is described that detects motif-encoding regions in sequences and optimally partitions them into distinct motif models; this is illustrated using a set of immunoglobulin fold proteins. When applied to sequences sharing a single motif, the sampler can be used to classify motif regions into related submodels, as is illustrated using helix-turn-helix DNA-binding proteins. Other statistically based procedures are described for searching a database for sequences matching motifs found by the sampler. When applied to a set of 32 very distantly related bacterial integral outer membrane proteins, the sampler revealed that they share a subtle, repetitive motif. Although BLAST (Altschul SF et al., 1990, J Mol Biol 215:403-410) fails to detect significant pairwise similarity between any of the sequences, the repeats present in these outer membrane proteins, taken as a whole, are highly significant (based on a generally applicable statistical test for motifs described here). Analysis of bacterial porins with known trimeric beta-barrel structure and related proteins reveals a similar repetitive motif corresponding to alternating membrane-spanning beta-strands. These beta-strands occur on the membrane interface (as opposed to the trimeric interface) of the beta-barrel. The broad conservation and structural location of these repeats suggests that they play important functional roles. PMID:8520488
Inter-laboratory comparison of multi-locus variable-number tandem repeat analysis (MLVA) for verocytotoxin-producing Escherichia coli O157 to facilitate data sharing.

PubMed

Holmes, A; Perry, N; Willshaw, G; Hanson, M; Allison, L

2015-01-01

Multi-locus variable number tandem repeat analysis (MLVA) is used in clinical and reference laboratories for subtyping verocytotoxin-producing Escherichia coli O157 (VTEC O157). However, as yet there is no common allelic or profile nomenclature to enable laboratories to easily compare data. In this study, we carried out an inter-laboratory comparison of an eight-loci MLVA scheme using a set of 67 isolates of VTEC O157. We found all but two isolates were identical in profile in the two laboratories, and repeat units were homogeneous in size but some were incomplete. A subset of the isolates (n = 17) were sequenced to determine the actual copy number of representative alleles, thereby enabling alleles to be named according to international consensus guidelines. This work has enabled us to realize the potential of MLVA as a portable, highly discriminatory and convenient subtyping method.
An Ankyrin Repeat-Containing Protein, Characterized as a Ubiquitin Ligase, Is Closely Associated with Membrane-Enclosed Organelles and Required for Pollen Germination and Pollen Tube Growth in Lily1[W

PubMed Central

Huang, Jian; Chen, Feng; Del Casino, Cecilia; Autino, Antonella; Shen, Mouhua; Yuan, Shuai; Peng, Jia; Shi, Hexin; Wang, Chen; Cresti, Mauro; Li, Yiqin

2006-01-01

Exhibiting rapid polarized growth, the pollen tube delivers the male gametes into the ovule for fertilization in higher plants. To get an overall picture of gene expression during pollen germination and pollen tube growth, we profiled the transcription patterns of 1,536 pollen cDNAs from lily (Lilium longiflorum) by microarray. Among those that exhibited significant differential expression, a cDNA named lily ankyrin repeat-containing protein (LlANK) was thoroughly studied. The full-length LlANK cDNA sequence predicts a protein containing five tandem ankyrin repeats and a RING zinc-finger domain. The LlANK protein possesses ubiquitin ligase activity in vitro. RNA blots demonstrated that LlANK transcript is present in mature pollen and its level, interestingly contrary to most pollen mRNAs, up-regulated significantly during pollen germination and pollen tube growth. When fused with green fluorescent protein and transiently expressed in pollen, LlANK was found dominantly associated with membrane-enclosed organelles as well as the generative cell. Overexpression of LlANK, however, led to abnormal growth of the pollen tube. On the other hand, transient silencing of LlANK impaired pollen germination and tube growth. Taken together, these results showed that LlANK is a ubiquitin ligase associated with membrane-enclosed organelles and required for polarized pollen tube growth. PMID:16461387
Short tandem repeat DNA typing provides an international reference standard for authentication of human cell lines.

PubMed

Dirks, Wilhelm Gerhard; Faehnrich, Silke; Estella, Isabelle Annick Janine; Drexler, Hans Guenter

2005-01-01

Cell lines have wide applications as model systems in the medical and pharmaceutical industry. Much drug and chemical testing is now first carried out exhaustively on in vitro systems, reducing the need for complicated and invasive animal experiments. The basis for any research, development or production program involving cell lines is the choice of an authentic cell line. Microsatellites in the human genome that harbour short tandem repeat (STR) DNA markers allow individualisation of established cell lines at the DNA level. Fluorescence polymerase chain reaction amplification of eight highly polymorphic microsatellite STR loci plus gender determination was found to be the best tool to screen the uniqueness of DNA profiles in a fingerprint database. Our results demonstrate that cross-contamination and misidentification remain chronic problems in the use of human continuous cell lines. The combination of rapidly generated DNA types based on single-locus STR and their authentication or individualisation by screening the fingerprint database constitutes a highly reliable and robust method for the identification and verification of cell lines.
Selection of Specific Protein Binders for Pre-Defined Targets from an Optimized Library of Artificial Helicoidal Repeat Proteins (alphaRep)

PubMed Central

Chevrel, Anne; Graille, Marc; Fourati-Kammoun, Zaineb; Desmadril, Michel; van Tilbeurgh, Herman; Minard, Philippe

2013-01-01

We previously designed a new family of artificial proteins named αRep based on a subgroup of thermostable helicoidal HEAT-like repeats. We have now assembled a large optimized αRep library. In this library, the side chains at each variable position are not fully randomized but instead encoded by a distribution of codons based on the natural frequency of side chains of the natural repeats family. The library construction is based on a polymerization of micro-genes and therefore results in a distribution of proteins with a variable number of repeats. We improved the library construction process using a “filtration” procedure to retain only fully coding modules that were recombined to recreate sequence diversity. The final library named Lib2.1 contains 1.7×109 independent clones. Here, we used phage display to select, from the previously described library or from the new library, new specific αRep proteins binding to four different non-related predefined protein targets. Specific binders were selected in each case. The results show that binders with various sizes are selected including relatively long sequences, with up to 7 repeats. ITC-measured affinities vary with Kd values ranging from micromolar to nanomolar ranges. The formation of complexes is associated with a significant thermal stabilization of the bound target protein. The crystal structures of two complexes between αRep and their cognate targets were solved and show that the new interfaces are established by the variable surfaces of the repeated modules, as well by the variable N-cap residues. These results suggest that αRep library is a new and versatile source of tight and specific binding proteins with favorable biophysical properties. PMID:24014183
The profile of repeat-associated histone lysine methylation states in the mouse epigenome

PubMed Central

Martens, Joost H A; O'Sullivan, Roderick J; Braunschweig, Ulrich; Opravil, Susanne; Radolf, Martin; Steinlein, Peter; Jenuwein, Thomas

2005-01-01

Histone lysine methylation has been shown to index silenced chromatin regions at, for example, pericentric heterochromatin or of the inactive X chromosome. Here, we examined the distribution of repressive histone lysine methylation states over the entire family of DNA repeats in the mouse genome. Using chromatin immunoprecipitation in a cluster analysis representing repetitive elements, our data demonstrate the selective enrichment of distinct H3-K9, H3-K27 and H4-K20 methylation marks across tandem repeats (e.g. major and minor satellites), DNA transposons, retrotransposons, long interspersed nucleotide elements and short interspersed nucleotide elements. Tandem repeats, but not the other repetitive elements, give rise to double-stranded (ds) RNAs that are further elevated in embryonic stem (ES) cells lacking the H3-K9-specific Suv39h histone methyltransferases. Importantly, although H3-K9 tri- and H4-K20 trimethylation appear stable at the satellite repeats, many of the other repeat-associated repressive marks vary in chromatin of differentiated ES cells or of embryonic trophoblasts and fibroblasts. Our data define a profile of repressive histone lysine methylation states for the repetitive complement of four distinct mouse epigenomes and suggest tandem repeats and dsRNA as primary triggers for more stable chromatin imprints. PMID:15678104
Identification of marker proteins for the adulteration of meat products with soybean proteins by multidimensional liquid chromatography-tandem mass spectrometry.

PubMed

Leitner, Alexander; Castro-Rubio, Florentina; Marina, Maria Luisa; Lindner, Wolfgang

2006-09-01

Soybean proteins are frequently added to processed meat products for economic reasons and to improve their functional properties. Monitoring of the addition of soybean protein to meat products is of high interest due to the existence of regulations forbidding or limiting the amount of soybean proteins that can be added during the processing of meat products. We have used chromatographic prefractionation on the protein level by perfusion liquid chromatography to isolate peaks of interest from extracts of soybean protein isolate (SPI) and of meat products containing SPI. After enzymatic digestion using trypsin, the collected fractions were analyzed by nanoflow liquid chromatography-tandem mass spectrometry. Several variants and subunits of the major seed proteins, glycinin and beta-conglycinin, were identified in SPI, along with two other proteins. In soybean-protein-containing meat samples, different glycinin A subunits could be identified from the peak discriminating between samples with and without soybean proteins added. Among those, glycinin G4 subunit A4 was consistently found in all samples. Consequently, this protein (subunit) can be used as a target for new analytical techniques in the course of identifying the addition of soybean protein to meat products.
Filipino DNA variation at 12 X-chromosome short tandem repeat markers.

PubMed

Salvador, Jazelyn M; Apaga, Dame Loveliness T; Delfin, Frederick C; Calacal, Gayvelline C; Dennis, Sheila Estacio; De Ungria, Maria Corazon A

2018-06-08

Demands for solving complex kinship scenarios where only distant relatives are available for testing have risen in the past years. In these instances, other genetic markers such as X-chromosome short tandem repeat (X-STR) markers are employed to supplement autosomal and Y-chromosomal STR DNA typing. However, prior to use, the degree of STR polymorphism in the population requires evaluation through generation of an allele or haplotype frequency population database. This population database is also used for statistical evaluation of DNA typing results. Here, we report X-STR data from 143 unrelated Filipino male individuals who were genotyped via conventional polymerase chain reaction-capillary electrophoresis (PCR-CE) using the 12 X-STR loci included in the Investigator ® Argus X-12 kit (Qiagen) and via massively parallel sequencing (MPS) of seven X-STR loci included in the ForenSeq ™ DNA Signature Prep kit of the MiSeq ® FGx ™ Forensic Genomics System (Illumina). Allele calls between PCR-CE and MPS systems were consistent (100% concordance) across seven overlapping X-STRs. Allele and haplotype frequencies and other parameters of forensic interest were calculated based on length (PCR-CE, 12 X-STRs) and sequence (MPS, seven X-STRs) variations observed in the population. Results of our study indicate that the 12 X-STRs in the PCR-CE system are highly informative for the Filipino population. MPS of seven X-STR loci identified 73 X-STR alleles compared with 55 X-STR alleles that were identified solely by length via PCR-CE. Of the 73 sequence-based alleles observed, six alleles have not been reported in the literature. The population data presented here may serve as a reference Philippine frequency database of X-STRs for forensic casework applications. Copyright © 2018 Elsevier B.V. All rights reserved.

Copy Number Heterogeneity, Large Origin Tandem Repeats, and Interspecies Recombination in Human Herpesvirus 6A (HHV-6A) and HHV-6B Reference Strains

PubMed Central

Roychoudhury, Pavitra; Makhsous, Negar; Hanson, Derek; Chase, Jill; Krueger, Gerhard; Xie, Hong; Huang, Meei-Li; Saunders, Lindsay; Ablashi, Dharam; Koelle, David M.; Cook, Linda; Jerome, Keith R.

2018-01-01

ABSTRACT Quantitative PCR is a diagnostic pillar for clinical virology testing, and reference materials are necessary for accurate, comparable quantitation between clinical laboratories. Accurate quantitation of human herpesvirus 6A/B (HHV-6A/B) is important for detection of viral reactivation and inherited chromosomally integrated HHV-6A/B in immunocompromised patients. Reference materials in clinical virology commonly consist of laboratory-adapted viral strains that may be affected by the culture process. We performed next-generation sequencing to make relative copy number measurements at single nucleotide resolution of eight candidate HHV-6A and seven HHV-6B reference strains and DNA materials from the HHV-6 Foundation and Advanced Biotechnologies Inc. Eleven of 17 (65%) HHV-6A/B candidate reference materials showed multiple copies of the origin of replication upstream of the U41 gene by next-generation sequencing. These large tandem repeats arose independently in culture-adapted HHV-6A and HHV-6B strains, measuring 1,254 bp and 983 bp, respectively. The average copy number measured was between 5 and 10 times the number of copies of the rest of the genome. We also report the first interspecies recombinant HHV-6A/B strain with a HHV-6A backbone and a >5.5-kb region from HHV-6B, from U41 to U43, that covered the origin tandem repeat. Specific HHV-6A reference strains demonstrated duplication of regions at U1/U2, U87, and U89, as well as deletion in the U12-to-U24 region and the U94/U95 genes. HHV-6A/B strains derived from cord blood mononuclear cells from different laboratories on different continents with fewer passages revealed no copy number differences throughout the viral genome. These data indicate that large origin tandem duplications are an adaptation of both HHV-6A and HHV-6B in culture and show interspecies recombination is possible within the Betaherpesvirinae. IMPORTANCE Anything in science that needs to be quantitated requires a standard unit of
Tandem assays of protein and glucose with functionalized core/shell particles based on magnetic separation and surface-enhanced Raman scattering.

PubMed

Kong, Xianming; Yu, Qian; Lv, Zhongpeng; Du, Xuezhong

2013-10-11

Tandem assays of protein and glucose in combination with mannose-functionalized Fe3 O4 @SiO2 and Ag@SiO2 tag particles have promising potential in effective magnetic separation and highly sensitive and selective SERS assays of biomaterials. It is for the first time that tandem assay of glucose is developed using SERS based on the Con A-sandwiched microstructures between the functionalized magnetic and tag particles. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The impact of CRISPR repeat sequence on structures of a Cas6 protein-RNA complex

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wang, Ruiying; Zheng, Han; Preamplume, Gan

The repeat-associated mysterious proteins (RAMPs) comprise the most abundant family of proteins involved in prokaryotic immunity against invading genetic elements conferred by the clustered regularly interspaced short palindromic repeat (CRISPR) system. Cas6 is one of the first characterized RAMP proteins and is a key enzyme required for CRISPR RNA maturation. Despite a strong structural homology with other RAMP proteins that bind hairpin RNA, Cas6 distinctly recognizes single-stranded RNA. Previous structural and biochemical studies show that Cas6 captures the 5' end while cleaving the 3' end of the CRISPR RNA. Here, we describe three structures and complementary biochemical analysis of amore » noncatalytic Cas6 homolog from Pyrococcus horikoshii bound to CRISPR repeat RNA of different sequences. Our study confirms the specificity of the Cas6 protein for single-stranded RNA and further reveals the importance of the bases at Positions 5-7 in Cas6-RNA interactions. Substitutions of these bases result in structural changes in the protein-RNA complex including its oligomerization state.« less
The application of new software tools to quantitative protein profiling via isotope-coded affinity tag (ICAT) and tandem mass spectrometry: I. Statistically annotated datasets for peptide sequences and proteins identified via the application of ICAT and tandem mass spectrometry to proteins copurifying with T cell lipid rafts.

PubMed

von Haller, Priska D; Yi, Eugene; Donohoe, Samuel; Vaughn, Kelly; Keller, Andrew; Nesvizhskii, Alexey I; Eng, Jimmy; Li, Xiao-jun; Goodlett, David R; Aebersold, Ruedi; Watts, Julian D

2003-07-01

Lipid rafts were prepared according to standard protocols from Jurkat T cells stimulated via T cell receptor/CD28 cross-linking and from control (unstimulated) cells. Co-isolating proteins from the control and stimulated cell preparations were labeled with isotopically normal (d0) and heavy (d8) versions of the same isotope-coded affinity tag (ICAT) reagent, respectively. Samples were combined, proteolyzed, and resultant peptides fractionated via cation exchange chromatography. Cysteine-containing (ICAT-labeled) peptides were recovered via the biotin tag component of the ICAT reagents by avidin-affinity chromatography. On-line micro-capillary liquid chromatography tandem mass spectrometry was performed on both avidin-affinity (ICAT-labeled) and flow-through (unlabeled) fractions. Initial peptide sequence identification was by searching recorded tandem mass spectrometry spectra against a human sequence data base using SEQUEST software. New statistical data modeling algorithms were then applied to the SEQUEST search results. These allowed for discrimination between likely "correct" and "incorrect" peptide assignments, and from these the inferred proteins that they collectively represented, by calculating estimated probabilities that each peptide assignment and subsequent protein identification was a member of the "correct" population. For convenience, the resultant lists of peptide sequences assigned and the proteins to which they corresponded were filtered at an arbitrarily set cut-off of 0.5 (i.e. 50% likely to be "correct") and above and compiled into two separate datasets. In total, these data sets contained 7667 individual peptide identifications, which represented 2669 unique peptide sequences, corresponding to 685 proteins and related protein groups.
Interleukin-1 Receptor Antagonist and Interleukin-4 Genes Variable Number Tandem Repeats Are Associated with Adiposity in Malaysian Subjects

PubMed Central

Kok, Yung-Yean; Ong, Hing-Huat

2017-01-01

Interleukin-1 receptor antagonist (IL1RA) intron 2 86 bp repeat and interleukin-4 (IL4) intron 3 70 bp repeat are variable number tandem repeats (VNTRs) that have been associated with various diseases, but their role in obesity is elusive. The objective of this study was to investigate the association of IL1RA and IL4 VNTRs with obesity and adiposity in 315 Malaysian subjects (128 M/187 F; 23 Malays/251 ethnic Chinese/41 ethnic Indians). The allelic distributions of IL1RA and IL4 were significantly different among ethnicities, and the alleles were associated with total body fat (TBF) classes. Individuals with IL1RA I/II genotype or allele II had greater risk of having higher overall adiposity, relative to those having the I/I genotype or I allele, respectively, even after controlling for ethnicity [Odds Ratio (OR) of I/II genotype = 12.21 (CI = 2.54, 58.79; p = 0.002); II allele = 5.78 (CI = 1.73, 19.29; p = 0.004)]. However, IL4 VNTR B2 allele was only significantly associated with overall adiposity status before adjusting for ethnicity [OR = 1.53 (CI = 1.04, 2.23; p = 0.03)]. Individuals with IL1RA II allele had significantly higher TBF than those with I allele (31.79 ± 2.52 versus 23.51 ± 0.40; p = 0.005). Taken together, IL1RA intron 2 VNTR seems to be a genetic marker for overall adiposity status in Malaysian subjects. PMID:28293435
Interleukin-1 Receptor Antagonist and Interleukin-4 Genes Variable Number Tandem Repeats Are Associated with Adiposity in Malaysian Subjects.

PubMed

Kok, Yung-Yean; Ong, Hing-Huat; Say, Yee-How

2017-01-01

Interleukin-1 receptor antagonist ( IL1RA ) intron 2 86 bp repeat and interleukin-4 ( IL4 ) intron 3 70 bp repeat are variable number tandem repeats (VNTRs) that have been associated with various diseases, but their role in obesity is elusive. The objective of this study was to investigate the association of IL1RA and IL4 VNTRs with obesity and adiposity in 315 Malaysian subjects (128 M/187 F; 23 Malays/251 ethnic Chinese/41 ethnic Indians). The allelic distributions of IL1RA and IL4 were significantly different among ethnicities, and the alleles were associated with total body fat (TBF) classes. Individuals with IL1RA I/II genotype or allele II had greater risk of having higher overall adiposity, relative to those having the I/I genotype or I allele, respectively, even after controlling for ethnicity [Odds Ratio (OR) of I/II genotype = 12.21 (CI = 2.54, 58.79; p = 0.002); II allele = 5.78 (CI = 1.73, 19.29; p = 0.004)]. However, IL4 VNTR B2 allele was only significantly associated with overall adiposity status before adjusting for ethnicity [OR = 1.53 (CI = 1.04, 2.23; p = 0.03)]. Individuals with IL1RA II allele had significantly higher TBF than those with I allele (31.79 ± 2.52 versus 23.51 ± 0.40; p = 0.005). Taken together, IL1RA intron 2 VNTR seems to be a genetic marker for overall adiposity status in Malaysian subjects.
Repeated immobilization stress increases uncoupling protein 1 expression and activity in Wistar rats.

PubMed

Gao, Bihu; Kikuchi-Utsumi, Kazue; Ohinata, Hiroshi; Hashimoto, Masaaki; Kuroshima, Akihiro

2003-06-01

Repeat immobilization-stressed rats are leaner and have improved cold tolerance due to enhancement of brown adipose tissue (BAT) thermogenesis. This process likely involves stress-induced sympathetic nervous system activation and adrenocortical hormone release, which dynamically enhances and suppresses uncoupling protein 1 (UCP1) function, respectively. To investigate whether repeated immobilization influences UCP1 thermogenic properties, we assessed UCP1 mRNA, protein expression, and activity (GDP binding) in BAT from immobilization-naive or repeatedly immobilized rats (3 h daily for 4 weeks) and sham operated or adrenalectomized (ADX) rats. UCP1 properties were assessed before (basal) and after exposure to 3 h of acute immobilization. Basal levels of GDP binding and UCP1 expression was significantly increased (140 and 140%) in the repeated immobilized group. Acute immobilization increased GDP binding in both naive (180%) and repeated immobilized groups (220%) without changing UCP1 expression. In ADX rats, basal GDP binding and UCP1 gene expression significantly increased (140 and 110%), and acute immobilization induced further increase. These data demonstrate that repeated immobilization resulted in enhanced UCP1 function, suggesting that enhanced BAT thermogenesis contributes to lower body weight gain through excess energy loss and an improved ability to maintain body temperature during cold exposure.
MicroRNAs in CAG trinucleotide repeat expansion disorders: an integrated review of the literature.

PubMed

Dumitrescu, Laura; Popescu, Bogdan O

2015-01-01

MicroRNAs are small RNAs involved in gene silencing. They play important roles in transcriptional regulation and are selectively and abundantly expressed in the central nervous system. A considerable amount of the human genome is comprised of tandem repeating nucleotide streams. Several diseases are caused by above-threshold expansion of certain trinucleotide repeats occurring in a protein-coding or non-coding region. Though monogenic, CAG trinucleotide repeat expansion disorders have a complex pathogenesis, various combinations of multiple coexisting pathways resulting in one common final consequence: selective neurodegeneration. Mutant protein and mutant transcript gain of toxic function are considered to be the core pathogenic mechanisms. The profile of microRNAs in CAG trinucleotide repeat disorders is scarcely described, however microRNA dysregulation has been identified in these diseases and microRNA-related intereference with gene expression is considered to be involved in their pathogenesis. Better understanding of microRNAs functions and means of manipulation promises to offer further insights into the pathogenic pathways of CAG repeat expansion disorders, to point out new potential targets for drug intervention and to provide some of the much needed etiopathogenic therapeutic agents. A number of disease-modifying microRNA silencing strategies are under development, but several implementation impediments still have to be resolved. CAG targeting seems feasible and efficient in animal models and is an appealing approach for clinical practice. Preliminary human trials are just beginning.
Multi-locus variable-number tandem repeat analysis for outbreak studies of Salmonella enterica serotype Enteritidis

PubMed Central

Malorny, Burkhard; Junker, Ernst; Helmuth, Reiner

2008-01-01

Background Salmonella enterica subsp. enterica serotype Enteritidis is known as an important and pathogenic clonal group which continues to cause worldwide sporadic cases and outbreaks in humans. Here a new multiple-locus variable-number tandem repeat analysis (MLVA) method is reported for highly-discriminative subtyping of Salmonella Enteritidis. Emphasis was given on the most predominant phage types PT4 and PT8. The method comprises multiplex PCR specifically amplifying repeated sequences from nine different loci followed by an automatic fragment size analysis using a multicolor capillary electrophoresis instrument. A total of 240 human, animal, food and environmental isolates of S. Enteritidis including 23 definite phage types were used for development and validation. Furthermore, the MLVA types were compared to the phage types of several isolates from two recent outbreaks to determine the concordance between both methods and to estimate their in vivo stability. The in vitro stability of the two MLVA types specifically for PT4 and PT8 strains were determined by multiple freeze-thaw cycles. Results Seventy-nine different MLVA types were identified in 240 S. Enteritidis strains. The Simpson's diversity index for the MLVA method was 0.919 and Nei diversity values for the nine VNTR loci ranged from 0.07 to 0.65. Twenty-four MLVA types could be assigned to 62 PT4 strains and 21 types to 81 PT8 strains. All outbreak isolates had an indistinguishable outbreak specific MLVA type. The in vitro stability experiments showed no changes of the MLVA type compared to the original isolate. Conclusion This MLVA method is useful to discriminate S. Enteritidis strains even within a single phage type. It is easy in use, fast, and cheap compared to other high-resolution molecular methods and therefore an important tool for surveillance and outbreak studies for S. Enteritidis. PMID:18513386
The Leishmania infantum PUF proteins are targets of the humoral response during visceral leishmaniasis

PubMed Central

2010-01-01

Background RNA-binding proteins of the PUF family share a conserved domain consisting of tandemly repeated 36-40 amino acid motifs (typically eight) known as Puf repeats. Proteins containing tandem repeats are often dominant targets of humoral responses during infectious diseases. Thus, we considered of interest to analyze whether Leishmania PUF proteins result antigenic during visceral leishmaniasis (VL). Findings Here, employing whole-genome databases, we report the composition, and structural features, of the PUF family in Leishmania infantum. Additionally, the 10 genes of the L. infantum PUF family were cloned and used to express the Leishmania PUFs in bacteria as recombinant proteins. Finally, the antigenicity of these PUF proteins was evaluated by determining levels of specific antibodies in sera from experimentally infected hamsters. The Leishmania PUFs were all recognized by the sera, even though with different degree of reactivity and/or frequency of recognition. The reactivity of hamster sera against recombinant LiPUF1 and LiPUF2 was particularly prominent, and these proteins were subsequently assayed against sera from human patients. High antibody responses against rLiPUF1 and rLiPUF2 were found in sera from VL patients, but these proteins resulted also recognized by sera from Chagas' disease patients. Conclusion Our results suggest that Leishmania PUFs are targets of the humoral response during L. infantum infection and may represent candidates for serodiagnosis and/or vaccine reagents; however, it should be kept in mind the cross-reactivity of LiPUFs with antibodies induced against other trypanosomatids such as Trypanosoma cruzi. PMID:20180988
Quantitative Protein Topography Analysis and High-Resolution Structure Prediction Using Hydroxyl Radical Labeling and Tandem-Ion Mass Spectrometry (MS)*

PubMed Central

Kaur, Parminder; Kiselar, Janna; Yang, Sichun; Chance, Mark R.

2015-01-01

Hydroxyl radical footprinting based MS for protein structure assessment has the goal of understanding ligand induced conformational changes and macromolecular interactions, for example, protein tertiary and quaternary structure, but the structural resolution provided by typical peptide-level quantification is limiting. In this work, we present experimental strategies using tandem-MS fragmentation to increase the spatial resolution of the technique to the single residue level to provide a high precision tool for molecular biophysics research. Overall, in this study we demonstrated an eightfold increase in structural resolution compared with peptide level assessments. In addition, to provide a quantitative analysis of residue based solvent accessibility and protein topography as a basis for high-resolution structure prediction; we illustrate strategies of data transformation using the relative reactivity of side chains as a normalization strategy and predict side-chain surface area from the footprinting data. We tested the methods by examination of Ca+2-calmodulin showing highly significant correlations between surface area and side-chain contact predictions for individual side chains and the crystal structure. Tandem ion based hydroxyl radical footprinting-MS provides quantitative high-resolution protein topology information in solution that can fill existing gaps in structure determination for large proteins and macromolecular complexes. PMID:25687570
Taxonomic distribution, repeats, and functions of the S1 domain-containing proteins as members of the OB-fold family.

PubMed

Deryusheva, Evgeniia I; Machulin, Andrey V; Selivanova, Olga M; Galzitskaya, Oxana V

2017-04-01

Proteins of the nucleic acid-binding proteins superfamily perform such functions as processing, transport, storage, stretching, translation, and degradation of RNA. It is one of the 16 superfamilies containing the OB-fold in protein structures. Here, we have analyzed the superfamily of nucleic acid-binding proteins (the number of sequences exceeds 200,000) and obtained that this superfamily prevalently consists of proteins containing the cold shock DNA-binding domain (ca. 131,000 protein sequences). Proteins containing the S1 domain compose 57% from the cold shock DNA-binding domain family. Furthermore, we have found that the S1 domain was identified mainly in the bacterial proteins (ca. 83%) compared to the eukaryotic and archaeal proteins, which are available in the UniProt database. We have found that the number of multiple repeats of S1 domain in the S1 domain-containing proteins depends on the taxonomic affiliation. All archaeal proteins contain one copy of the S1 domain, while the number of repeats in the eukaryotic proteins varies between 1 and 15 and correlates with the protein size. In the bacterial proteins, the number of repeats is no more than 6, regardless of the protein size. The large variation of the repeat number of S1 domain as one of the structural variants of the OB-fold is a distinctive feature of S1 domain-containing proteins. Proteins from the other families and superfamilies have either one OB-fold or change slightly the repeat numbers. On the whole, it can be supposed that the repeat number is a vital for multifunctional activity of the S1 domain-containing proteins. Proteins 2017; 85:602-613. © 2016 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Comprehensive mutation analysis of 17 Y-chromosomal short tandem repeat polymorphisms included in the AmpFlSTR Yfiler PCR amplification kit.

PubMed

Goedbloed, Miriam; Vermeulen, Mark; Fang, Rixun N; Lembring, Maria; Wollstein, Andreas; Ballantyne, Kaye; Lao, Oscar; Brauer, Silke; Krüger, Carmen; Roewer, Lutz; Lessig, Rüdiger; Ploski, Rafal; Dobosz, Tadeusz; Henke, Lotte; Henke, Jürgen; Furtado, Manohar R; Kayser, Manfred

2009-11-01

The Y-chromosomal short tandem repeat (Y-STR) polymorphisms included in the AmpFlSTR Yfiler polymerase chain reaction amplification kit have become widely used for forensic and evolutionary applications where a reliable knowledge on mutation properties is necessary for correct data interpretation. Therefore, we investigated the 17 Yfiler Y-STRs in 1,730-1,764 DNA-confirmed father-son pairs per locus and found 84 sequence-confirmed mutations among the 29,792 meiotic transfers covered. Of the 84 mutations, 83 (98.8%) were single-repeat changes and one (1.2%) was a double-repeat change (ratio, 1:0.01), as well as 43 (51.2%) were repeat gains and 41 (48.8%) repeat losses (ratio, 1:0.95). Medians from Bayesian estimation of locus-specific mutation rates ranged from 0.0003 for DYS448 to 0.0074 for DYS458, with a median rate across all 17 Y-STRs of 0.0025. The mean age (at the time of son's birth) of fathers with mutations was with 34.40 (+/-11.63) years higher than that of fathers without ones at 30.32 (+/-10.22) years, a difference that is highly statistically significant (p < 0.001). A Poisson-based modeling revealed that the Y-STR mutation rate increased with increasing father's age on a statistically significant level (alpha = 0.0294, 2.5% quantile = 0.0001). From combining our data with those previously published, considering all together 135,212 meiotic events and 331 mutations, we conclude for the Yfiler Y-STRs that (1) none had a mutation rate of >1%, 12 had mutation rates of >0.1% and four of <0.1%, (2) single-repeat changes were strongly favored over multiple-repeat ones for all loci but 1 and (3) considerable variation existed among loci in the ratio of repeat gains versus losses. Our finding of three Y-STR mutations in one father-son pair (and two pairs with two mutations each) has consequences for determining the threshold of allelic differences to conclude exclusion constellations in future applications of Y-STRs in paternity testing and pedigree analyses.
Subtyping of a Large Collection of Historical Listeria monocytogenes Strains from Ontario, Canada, by an Improved Multilocus Variable-Number Tandem-Repeat Analysis (MLVA)

PubMed Central

Saleh-Lakha, S.; Allen, V. G.; Li, J.; Pagotto, F.; Odumeru, J.; Taboada, E.; Lombos, M.; Tabing, K. C.; Blais, B.; Ogunremi, D.; Downing, G.; Lee, S.; Gao, A.; Nadon, C.

2013-01-01

Listeria monocytogenes is responsible for severe and often fatal food-borne infections in humans. A collection of 2,421 L. monocytogenes isolates originating from Ontario's food chain between 1993 and 2010, along with Ontario clinical isolates collected from 2004 to 2010, was characterized using an improved multilocus variable-number tandem-repeat analysis (MLVA). The MLVA method was established based on eight primer pairs targeting seven variable-number tandem-repeat (VNTR) loci in two 4-plex fluorescent PCRs. Diversity indices and amplification rates of the individual VNTR loci ranged from 0.38 to 0.92 and from 0.64 to 0.99, respectively. MLVA types and pulsed-field gel electrophoresis (PFGE) patterns were compared using Comparative Partitions analysis involving 336 clinical and 99 food and environmental isolates. The analysis yielded Simpson's diversity index values of 0.998 and 0.992 for MLVA and PFGE, respectively, and adjusted Wallace coefficients of 0.318 when MLVA was used as a primary subtyping method and 0.088 when PFGE was a primary typing method. Statistical data analysis using BioNumerics allowed for identification of at least 8 predominant and persistent L. monocytogenes MLVA types in Ontario's food chain. The MLVA method correctly clustered epidemiologically related outbreak strains and separated unrelated strains in a subset analysis. An MLVA database was established for the 2,421 L. monocytogenes isolates, which allows for comparison of data among historical and new isolates of different sources. The subtyping method coupled with the MLVA database will help in effective monitoring/prevention approaches to identify environmental contamination by pathogenic strains of L. monocytogenes and investigation of outbreaks. PMID:23956391
Diversity and evolution of centromere repeats in the maize genome.

PubMed

Bilinski, Paul; Distor, Kevin; Gutierrez-Lopez, Jose; Mendoza, Gabriela Mendoza; Shi, Jinghua; Dawe, R Kelly; Ross-Ibarra, Jeffrey

2015-03-01

Centromere repeats are found in most eukaryotes and play a critical role in kinetochore formation. Though centromere repeats exhibit considerable diversity both within and among species, little is understood about the mechanisms that drive centromere repeat evolution. Here, we use maize as a model to investigate how a complex history involving polyploidy, fractionation, and recent domestication has impacted the diversity of the maize centromeric repeat CentC. We first validate the existence of long tandem arrays of repeats in maize and other taxa in the genus Zea. Although we find considerable sequence diversity among CentC copies genome-wide, genetic similarity among repeats is highest within these arrays, suggesting that tandem duplications are the primary mechanism for the generation of new copies. Nonetheless, clustering analyses identify similar sequences among distant repeats, and simulations suggest that this pattern may be due to homoplasious mutation. Although the two ancestral subgenomes of maize have contributed nearly equal numbers of centromeres, our analysis shows that the majority of all CentC repeats derive from one of the parental genomes, with an even stronger bias when examining the largest assembled contiguous clusters. Finally, by comparing maize with its wild progenitor teosinte, we find that the abundance of CentC likely decreased after domestication, while the pericentromeric repeat Cent4 has drastically increased.
Proteomic profiling of tandem affinity purified 14-3-3 protein complexes in Arabidopsis thaliana

PubMed Central

Chang, Ing-Feng; Curran, Amy; Woolsey, Rebekah; Quilici, David; Cushman, John; Mittler, Ron; Harmon, Alice; Harper, Jeffrey

2014-01-01

In eukaryotes, 14-3-3 dimers regulate hundreds of functionally diverse proteins (clients), typically in phosphorylation-dependent interactions. To uncover new clients, a 14-3-3 omega (At1g78300) from Arabidopsis was engineered with a “tandem affinity purification” (TAP) tag and expressed in transgenic plants. Purified complexes were analyzed by tandem mass spectrometry. Results indicate that 14-3-3 omega can dimerize with at least 10 of the 12 14-3-3 isoforms expressed in Arabidopsis. The identification here of 121 putative clients provides support for in vivo 14-3-3 interactions with a diverse array of proteins, including those involved in: (1) Ion transport, such as a K+ channel (GORK), a Cl− channel (CLCg), Ca2+ channels belonging to the glutamate receptor family (GLRs 1.2, 2.1, 2.9, 3.4, 3.7); (2) hormone signaling, such as ACC synthase (isoforms ACS-6, 7 and 8 involved in ethylene synthesis) and the brassinolide receptors BRI1 and BAK1; (3) transcription, such as 7 WRKY family transcription factors; (4) metabolism, such as phosphoenol pyruvate (PEP) carboxylase; and (5) lipid signaling, such as phospholipase D (β, and γ). More than 80% (101) of these putative clients represent previously unidentified 14-3-3 interactors. These results raise the number of putative 14-3-3 clients identified in plants to over 300. PMID:19452453
Proteomic profiling of tandem affinity purified 14-3-3 protein complexes in Arabidopsis thaliana.

PubMed

Chang, Ing-Feng; Curran, Amy; Woolsey, Rebekah; Quilici, David; Cushman, John C; Mittler, Ron; Harmon, Alice; Harper, Jeffrey F

2009-06-01

In eukaryotes, 14-3-3 dimers regulate hundreds of functionally diverse proteins (clients), typically in phosphorylation-dependent interactions. To uncover new clients, 14-3-3 omega (At1g78300) from Arabidopsis was engineered with a "tandem affinity purification" tag and expressed in transgenic plants. Purified complexes were analyzed by tandem MS. Results indicate that 14-3-3 omega can dimerize with at least 10 of the 12 14-3-3 isoforms expressed in Arabidopsis. The identification here of 121 putative clients provides support for in vivo 14-3-3 interactions with a diverse array of proteins, including those involved in: (i) Ion transport, such as a K(+) channel (GORK), a Cl(-) channel (CLCg), Ca(2+) channels belonging to the glutamate receptor family (1.2, 2.1, 2.9, 3.4, 3.7); (ii) hormone signaling, such as ACC synthase (isoforms ACS-6, -7 and -8 involved in ethylene synthesis) and the brassinolide receptors BRI1 and BAK1; (iii) transcription, such as 7 WRKY family transcription factors; (iv) metabolism, such as phosphoenol pyruvate carboxylase; and (v) lipid signaling, such as phospholipase D (beta and gamma). More than 80% (101) of these putative clients represent previously unidentified 14-3-3 interactors. These results raise the number of putative 14-3-3 clients identified in plants to over 300.
Allele Frequencies for 15 Short Tandem Repeat Loci in Representative Sample of Croatian Population

PubMed Central

Projić, Petar; Škaro, Vedrana; Šamija, Ivana; Pojskić, Naris; Durmić-Pašić, Adaleta; Kovačević, Lejla; Bakal, Narcisa; Primorac, Dragan; Marjanović, Damir

2007-01-01

Aim To study the distribution of allele frequencies of 15 short tandem repeat (STR) loci in a representative sample of the Croatian population. Methods A total of 195 unrelated Caucasian individuals born in Croatia, from 14 counties and the City of Zagreb, were sampled for the analysis. All the tested individuals were voluntary donors. Buccal swab was used as the DNA source. AmpFlSTR® Identifiler® was applied to simultaneously amplify 15 STR loci. Total reaction volume was 12.5 μL. The polymerase chain reaction (PCR) amplification was carried out in PE Gene Amp PCR System Thermal Cycler. Electrophoresis of the amplification products was preformed on an ABI PRISM 3130 Genetic Analyzer. After PCR amplification and separation by electrophoresis, raw data were compiled, analyzed, and numerical allele designations of the profiles were obtained. Deviation from Hardy-Weinberg equilibrium, observed and expected heterozygosity, power of discrimination, and power of exclusion were calculated. Bonferroni’s correction was used before each comparative analysis. Results We compared Croatian data with those obtained from geographically neighboring European populations. The significant difference (at P<0.01) in allele frequencies was recorded only between the Croatian and Slovenian populations for vWA locus. There was no significant deviation from Hardy-Weinberg equilibrium for all the observed loci. Conclusion Obtained population data concurred with the expected “STR data frame” for this part of Europe. PMID:17696301
Expanded complexity of unstable repeat diseases

PubMed Central

Polak, Urszula; McIvor, Elizabeth; Dent, Sharon Y.R.; Wells, Robert D.; Napierala, Marek

2015-01-01

Unstable Repeat Diseases (URDs) share a common mutational phenomenon of changes in the copy number of short, tandemly repeated DNA sequences. More than 20 human neurological diseases are caused by instability, predominantly expansion, of microsatellite sequences. Changes in the repeat size initiate a cascade of pathological processes, frequently characteristic of a unique disease or a small subgroup of the URDs. Understanding of both the mechanism of repeat instability and molecular consequences of the repeat expansions is critical to developing successful therapies for these diseases. Recent technological breakthroughs in whole genome, transcriptome and proteome analyses will almost certainly lead to new discoveries regarding the mechanisms of repeat instability, the pathogenesis of URDs, and will facilitate development of novel therapeutic approaches. The aim of this review is to give a general overview of unstable repeats diseases, highlight the complexities of these diseases, and feature the emerging discoveries in the field. PMID:23233240
Deep landscape update of dispersed and tandem repeats in the genome model of the red jungle fowl, Gallus gallus, using a series of de novo investigating tools.

PubMed

Guizard, Sébastien; Piégu, Benoît; Arensburger, Peter; Guillou, Florian; Bigot, Yves

2016-08-19

The program RepeatMasker and the database Repbase-ISB are part of the most widely used strategy for annotating repeats in animal genomes. They have been used to show that avian genomes have a lower repeat content (8-12 %) than the sequenced genomes of many vertebrate species (30-55 %). However, the efficiency of such a library-based strategies is dependent on the quality and completeness of the sequences in the database that is used. An alternative to these library based methods are methods that identify repeats de novo. These alternative methods have existed for a least a decade and may be more powerful than the library based methods. We have used an annotation strategy involving several complementary de novo tools to determine the repeat content of the model genome galGal4 (1.04 Gbp), including identifying simple sequence repeats (SSRs), tandem repeats and transposable elements (TEs). We annotated over one Gbp. of the galGal4 genome and showed that it is composed of approximately 19 % SSRs and TEs repeats. Furthermore, we estimate that the actual genome of the red jungle fowl contains about 31-35 % repeats. We find that library-based methods tend to overestimate TE diversity. These results have a major impact on the current understanding of repeats distributions throughout chromosomes in the red jungle fowl. Our results are a proof of concept of the reliability of using de novo tools to annotate repeats in large animal genomes. They have also revealed issues that will need to be resolved in order to develop gold-standard methodologies for annotating repeats in eukaryote genomes.

New Multilocus Variable-Number Tandem-Repeat Analysis Tool for Surveillance and Local Epidemiology of Bacterial Leaf Blight and Bacterial Leaf Streak of Rice Caused by Xanthomonas oryzae

PubMed Central

Poulin, L.; Grygiel, P.; Magne, M.; Rodriguez-R, L. M.; Forero Serna, N.; Zhao, S.; El Rafii, M.; Dao, S.; Tekete, C.; Wonni, I.; Koita, O.; Pruvost, O.; Verdier, V.; Vernière, C.

2014-01-01

Multilocus variable-number tandem-repeat analysis (MLVA) is efficient for routine typing and for investigating the genetic structures of natural microbial populations. Two distinct pathovars of Xanthomonas oryzae can cause significant crop losses in tropical and temperate rice-growing countries. Bacterial leaf streak is caused by X. oryzae pv. oryzicola, and bacterial leaf blight is caused by X. oryzae pv. oryzae. For the latter, two genetic lineages have been described in the literature. We developed a universal MLVA typing tool both for the identification of the three X. oryzae genetic lineages and for epidemiological analyses. Sixteen candidate variable-number tandem-repeat (VNTR) loci were selected according to their presence and polymorphism in 10 draft or complete genome sequences of the three X. oryzae lineages and by VNTR sequencing of a subset of loci of interest in 20 strains per lineage. The MLVA-16 scheme was then applied to 338 strains of X. oryzae representing different pathovars and geographical locations. Linkage disequilibrium between MLVA loci was calculated by index association on different scales, and the 16 loci showed linear Mantel correlation with MLSA data on 56 X. oryzae strains, suggesting that they provide a good phylogenetic signal. Furthermore, analyses of sets of strains for different lineages indicated the possibility of using the scheme for deeper epidemiological investigation on small spatial scales. PMID:25398857
[Family-based association study of a variable number of tandem repeat polymorphism of DAT1 gene with Tourette syndrome in a Chinese Han population].

PubMed

Zheng, Lanlan; Han, Zhen-liang; Zhang, Xin-hua; Wang, Xue-qin; Jiang, Wei-hua; Yi, Ming-ji; Liu, Shi-guo

2013-10-01

To assess the association of a 40 bp variable number of tandem repeat (VNTR) polymorphism within 3 untranslated region of dopamine transporter gene (DAT1) with Tourette syndrome (TS) in a Chinese Han population. A total of 160 TS patients and their parents were recruited. The VNTR polymorphism was detected with polymerase chain reaction-VNTR analysis, and its association with TS and its subtypes were assessed through a family-based association study comprising transmission disequilibrium test (TDT) and haplotype relative risk (HRR) analysis. The repeat numbers at the DAT1 40 bp locus were 11, 10, 9, 7.5 and 7 among the patients and their parents, with the most common type being a 10-repeat allele. No significant association was detected between the polymorphism and TS (TDT: X ² = 0.472, df = 1, P = 0.583; HRR: X ² = 0.313, P = 0.576, OR = 0.855, 95%CI: 0.493-1.481). Our data suggested that the VNTR polymorphism of DAT1 gene is not associated with susceptibility to TS in Chinese Han population. However, our results are to be validated in larger sets of patients collected from other populations.
Differential Occurrence of Interactions and Interaction Domains in Proteins Containing Homopolymeric Amino Acid Repeats

PubMed Central

Pelassa, Ilaria; Fiumara, Ferdinando

2015-01-01

Homopolymeric amino acids repeats (AARs), which are widespread in proteomes, have often been viewed simply as spacers between protein domains, or even as “junk” sequences with no obvious function but with a potential to cause harm upon expansion as in genetic diseases associated with polyglutamine or polyalanine expansions, including Huntington disease and cleidocranial dysplasia. A growing body of evidence indicates however that at least some AARs can form organized, functional protein structures, and can regulate protein function. In particular, certain AARs can mediate protein-protein interactions, either through homotypic AAR-AAR contacts or through heterotypic contacts with other protein domains. It is still unclear however, whether AARs may have a generalized, proteome-wide role in shaping protein-protein interaction networks. Therefore, we have undertaken here a bioinformatics screening of the human proteome and interactome in search of quantitative evidence of such a role. We first identified the sets of proteins that contain repeats of any one of the 20 amino acids, as well as control sets of proteins chosen at random in the proteome. We then analyzed the connectivity between the proteins of the AAR-containing protein sets and we compared it with that observed in the corresponding control networks. We find evidence for different degrees of connectivity in the different AAR-containing protein networks. Indeed, networks of proteins containing polyglutamine, polyglutamate, polyproline, and other AARs show significantly increased levels of connectivity, whereas networks containing polyleucine and other hydrophobic repeats show lower degrees of connectivity. Furthermore, we observed that numerous protein-protein, -nucleic acid, and -lipid interaction domains are significantly enriched in specific AAR protein groups. These findings support the notion of a generalized, combinatorial role of AARs, together with conventional protein interaction domains, in
Substructure of a Tunisian Berber population as inferred from 15 autosomal short tandem repeat loci.

PubMed

Khodjet-El-Khil, Houssein; Fadhlaoui-Zid, Karima; Gusmão, Leonor; Alves, Cíntia; Benammar-Elgaaied, Amel; Amorim, Antonio

2008-08-01

Currently, language and cultural practices are the only criteria to distinguish between Berber autochthonous Tunisian populations. To evaluate these populations' possible genetic structure and differentiation, we have analyzed 15 autosomal short tandem repeat loci (CSF1PO, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, D21S11, FGA, TH01, TPOX, VWA, D2S1338, and D19S433) in three southern Tunisian Berber groups: Sened, Matmata, and Chenini-Douiret. The exact test of population differentiation based on allele frequencies at the 15 loci shows significant P values at 7 loci between Chenini-Douiret and both Sened and Matmata, whereas just 5 loci show significant P values between Sened and Matmata. Comparative analyses between the three Berber groups based on genetic distances show that P values for F(ST) distances are significant between the three Berber groups. Population analysis performed using Structure shows a clear differentiation between these Berber groups, with strong genetic isolation of Chenini-Douiret. These results confirm at the autosomal level the high degree of heterogeneity of Tunisian Berber populations that had been previously reported for uniparental markers.
Molecular typing of Argentinian Mycobacterium avium subsp. paratuberculosis isolates by multiple-locus variable number-tandem repeat analysis

PubMed Central

Gioffré, Andrea; Correa Muñoz, Magnolia; Alvarado Pinedo, María F.; Vaca, Roberto; Morsella, Claudia; Fiorentino, María Andrea; Paolicchi, Fernando; Ruybal, Paula; Zumárraga, Martín; Travería, Gabriel E.; Romano, María Isabel

2015-01-01

Multiple-locus variable number-tandem repeat analysis (MLVA) of Mycobacterium avium subspecies paratuberculosis (MAP) isolates may contribute to the knowledge of strain diversity in Argentina. Although the diversity of MAP has been previously investigated in Argentina using IS900-RFLP, a small number of isolates were employed, and a low discriminative power was reached. The aim of the present study was to test the genetic diversity among MAP isolates using an MLVA approach based on 8 repetitive loci. We studied 97 isolates from cattle, goat and sheep and could describe 7 different patterns: INMV1, INMV2, INMV11, INMV13, INMV16, INMV33 and one incomplete pattern. INMV1 and INMV2 were the most frequent patterns, grouping 76.3% of the isolates. We were also able to demonstrate the coexistence of genotypes in herds and co-infection at the organism level. This study shows that all the patterns described are common to those described in Europe, suggesting an epidemiological link between the continents. PMID:26273274
Ruminant Rhombencephalitis-Associated Listeria monocytogenes Alleles Linked to a Multilocus Variable-Number Tandem-Repeat Analysis Complex ▿ †

PubMed Central

Balandyté, Lina; Brodard, Isabelle; Frey, Joachim; Oevermann, Anna; Abril, Carlos

2011-01-01

Listeria monocytogenes is among the most important food-borne pathogens and is well adapted to persist in the environment. To gain insight into the genetic relatedness and potential virulence of L. monocytogenes strains causing central nervous system (CNS) infections, we used multilocus variable-number tandem-repeat analysis (MLVA) to subtype 183 L. monocytogenes isolates, most from ruminant rhombencephalitis and some from human patients, food, and the environment. Allelic-profile-based comparisons grouped L. monocytogenes strains mainly into three clonal complexes and linked single-locus variants (SLVs). Clonal complex A essentially consisted of isolates from human and ruminant brain samples. All but one rhombencephalitis isolate from cattle were located in clonal complex A. In contrast, food and environmental isolates mainly clustered into clonal complex C, and none was classified as clonal complex A. Isolates of the two main clonal complexes (A and C) obtained by MLVA were analyzed by PCR for the presence of 11 virulence-associated genes (prfA, actA, inlA, inlB, inlC, inlD, inlE, inlF, inlG, inlJ, and inlC2H). Virulence gene analysis revealed significant differences in the actA, inlF, inlG, and inlJ allelic profiles between clinical isolates (complex A) and nonclinical isolates (complex C). The association of particular alleles of actA, inlF, and newly described alleles of inlJ with isolates from CNS infections (particularly rhombencephalitis) suggests that these virulence genes participate in neurovirulence of L. monocytogenes. The overall absence of inlG in clinical complex A and its presence in complex C isolates suggests that the InlG protein is more relevant for the survival of L. monocytogenes in the environment. PMID:21984240
A Large Complement of the Predicted Arabidopsis ARM Repeat Proteins Are Members of the U-Box E3 Ubiquitin Ligase Family1[w

PubMed Central

Mudgil, Yashwanti; Shiu, Shin-Han; Stone, Sophia L.; Salt, Jennifer N.; Goring, Daphne R.

2004-01-01

The Arabidopsis genome was searched to identify predicted proteins containing armadillo (ARM) repeats, a motif known to mediate protein-protein interactions in a number of different animal proteins. Using domain database predictions and models generated in this study, 108 Arabidopsis proteins were identified that contained a minimum of two ARM repeats with the majority of proteins containing four to eight ARM repeats. Clustering analysis showed that the 108 predicted Arabidopsis ARM repeat proteins could be divided into multiple groups with wide differences in their domain compositions and organizations. Interestingly, 41 of the 108 Arabidopsis ARM repeat proteins contained a U-box, a motif present in a family of E3 ligases, and these proteins represented the largest class of Arabidopsis ARM repeat proteins. In 14 of these U-box/ARM repeat proteins, there was also a novel conserved domain identified in the N-terminal region. Based on the phylogenetic tree, representative U-box/ARM repeat proteins were selected for further study. RNA-blot analyses revealed that these U-box/ARM proteins are expressed in a variety of tissues in Arabidopsis. In addition, the selected U-box/ARM proteins were found to be functional E3 ubiquitin ligases. Thus, these U-box/ARM proteins represent a new family of E3 ligases in Arabidopsis. PMID:14657406
A novel typing method for Listeria monocytogenes using high-resolution melting analysis (HRMA) of tandem repeat regions.

PubMed

Ohshima, Chihiro; Takahashi, Hajime; Iwakawa, Ai; Kuda, Takashi; Kimura, Bon

2017-07-17

Listeria monocytogenes, which is responsible for causing food poisoning known as listeriosis, infects humans and animals. Widely distributed in the environment, this bacterium is known to contaminate food products after being transmitted to factories via raw materials. To minimize the contamination of products by food pathogens, it is critical to identify and eliminate factory entry routes and pathways for the causative bacteria. High resolution melting analysis (HRMA) is a method that takes advantage of differences in DNA sequences and PCR product lengths that are reflected by the disassociation temperature. Through our research, we have developed a multiple locus variable-number tandem repeat analysis (MLVA) using HRMA as a simple and rapid method to differentiate L. monocytogenes isolates. While evaluating our developed method, the ability of MLVA-HRMA, MLVA using capillary electrophoresis, and multilocus sequence typing (MLST) was compared for their ability to discriminate between strains. The MLVA-HRMA method displayed greater discriminatory ability than MLST and MLVA using capillary electrophoresis, suggesting that the variation in the number of repeat units, along with mutations within the DNA sequence, was accurately reflected by the melting curve of HRMA. Rather than relying on DNA sequence analysis or high-resolution electrophoresis, the MLVA-HRMA method employs the same process as PCR until the analysis step, suggesting a combination of speed and simplicity. The result of MLVA-HRMA method is able to be shared between different laboratories. There are high expectations that this method will be adopted for regular inspections at food processing facilities in the near future. Copyright © 2017. Published by Elsevier B.V.
Software for peak finding and elemental composition assignment for glycosaminoglycan tandem mass spectra.

PubMed

Hogan, John D; Klein, Joshua A; Wu, Jiandong; Chopra, Pradeep; Boons, Geert-Jan; Carvalho, Luis; Lin, Cheng; Zaia, Joseph

2018-04-03

Glycosaminoglycans (GAGs) covalently linked to proteoglycans (PGs) are characterized by repeating disaccharide units and variable sulfation patterns along the chain. GAG length and sulfation patterns impact disease etiology, cellular signaling, and structural support for cells. We and others have demonstrated the usefulness of tandem mass spectrometry (MS2) for assigning the structures of GAG saccharides; however, manual interpretation of tandem mass spectra is time-consuming, so computational methods must be employed. In the proteomics domain, the identification of monoisotopic peaks and charge states relies on algorithms that use averagine, or the average building block of the compound class being analyzed. While these methods perform well for protein and peptide spectra, they perform poorly on GAG tandem mass spectra, due to the fact that a single average building block does not characterize the variable sulfation of GAG disaccharide units. In addition, it is necessary to assign product ion isotope patterns in order to interpret the tandem mass spectra of GAG saccharides. To address these problems, we developed GAGfinder, the first tandem mass spectrum peak finding algorithm developed specifically for GAGs. We define peak finding as assigning experimental isotopic peaks directly to a given product ion composition, as opposed to deconvolution or peak picking, which are terms more accurately describing the existing methods previously mentioned. GAGfinder is a targeted, brute force approach to spectrum analysis that utilizes precursor composition information to generate all theoretical fragments. GAGfinder also performs peak isotope composition annotation, which is typically a subsequent step for averagine-based methods. Data are available via ProteomeXchange with identifier PXD009101. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.
Identification of TTAGGG-binding proteins in Neurospora crassa, a fungus with vertebrate-like telomere repeats.

PubMed

Casas-Vila, Núria; Scheibe, Marion; Freiwald, Anja; Kappei, Dennis; Butter, Falk

2015-11-17

To date, telomere research in fungi has mainly focused on Saccharomyces cerevisiae and Schizosaccharomyces pombe, despite the fact that both yeasts have degenerated telomeric repeats in contrast to the canonical TTAGGG motif found in vertebrates and also several other fungi. Using label-free quantitative proteomics, we here investigate the telosome of Neurospora crassa, a fungus with canonical telomeric repeats. We show that at least six of the candidates detected in our screen are direct TTAGGG-repeat binding proteins. While three of the direct interactors (NCU03416 [ncTbf1], NCU01991 [ncTbf2] and NCU02182 [ncTay1]) feature the known myb/homeobox DNA interaction domain also found in the vertebrate telomeric factors, we additionally show that a zinc-finger protein (NCU07846) and two proteins without any annotated DNA-binding domain (NCU02644 and NCU05718) are also direct double-strand TTAGGG binders. We further find two single-strand binders (NCU02404 [ncGbp2] and NCU07735 [ncTcg1]). By quantitative label-free interactomics we identify TTAGGG-binding proteins in Neurospora crassa, suggesting candidates for telomeric factors that are supported by phylogenomic comparison with yeast species. Intriguingly, homologs in yeast species with degenerated telomeric repeats are also TTAGGG-binding proteins, e.g. in S. cerevisiae Tbf1 recognizes the TTAGGG motif found in its subtelomeres. However, there is also a subset of proteins that is not conserved. While a rudimentary core TTAGGG-recognition machinery may be conserved across yeast species, our data suggests Neurospora as an emerging model organism with unique features.
Rare Sequence Variation in the Genome Flanking a Short Tandem Repeat Locus Can Lead to a Question of “Nonmaternity”

PubMed Central

Deucher, Anne; Chiang, Tsoyu; Schrijver, Iris

2010-01-01

Typing of STR (short tandem repeat) alleles is used in a variety of applications in clinical molecular pathology, including evaluations for maternal cell contamination. Using a commercially available STR typing assay for maternal cell contamination performed in conjunction with prenatal diagnostic testing, we were posed with apparent nonmaternity when the two fetal samples did not demonstrate the expected maternal allele at one locus. By designing primers external to the region amplified by the primers from the commercial assay and by performing direct sequencing of the resulting amplicon, we were able to determine that a guanine to adenine sequence variation led to primer mismatch and allele dropout. This explained the apparent null allele shared between the maternal and fetal samples. Therefore, although rare, allele dropout must be considered whenever unexplained homozygosity at an STR locus is observed. PMID:20203001
Characterization of protein N-glycosylation by tandem mass spectrometry using complementary fragmentation techniques

DOE PAGES

Ford, Kristina L.; Zeng, Wei; Heazlewood, Joshua L.; ...

2015-08-28

The analysis of post-translational modifications (PTMs) by proteomics is regarded as a technically challenging undertaking. While in recent years approaches to examine and quantify protein phosphorylation have greatly improved, the analysis of many protein modifications, such as glycosylation, are still regarded as problematic. Limitations in the standard proteomics workflow, such as use of suboptimal peptide fragmentation methods, can significantly prevent the identification of glycopeptides. The current generation of tandem mass spectrometers has made available a variety of fragmentation options, many of which are becoming standard features on these instruments. Lastly, we have used three common fragmentation techniques, namely CID, HCD,more » and ETD, to analyze a glycopeptide and highlight how an integrated fragmentation approach can be used to identify the modified residue and characterize the N-glycan on a peptide.« less
Crystal structure of tandem type III fibronectin domains from Drosophila neuroglian at 2.0 A.

PubMed

Huber, A H; Wang, Y M; Bieber, A J; Bjorkman, P J

1994-04-01

We report the crystal structure of two adjacent fibronectin type III repeats from the Drosophila neural cell adhesion molecule neuroglian. Each domain consists of two antiparallel beta sheets and is folded topologically identically to single fibronectin type III domains from the extracellular matrix proteins tenascin and fibronectin. beta bulges and left-handed polyproline II helices disrupt the regular beta sheet structure of both neuroglian domains. The hydrophobic interdomain interface includes a metal-binding site, presumably involved in stabilizing the relative orientation between domains and predicted by sequence comparision to be present in the vertebrate homolog molecule L1. The neuroglian domains are related by a near perfect 2-fold screw axis along the longest molecular dimension. Using this relationship, a model for arrays of tandem fibronectin type III repeats in neuroglian and other molecules is proposed.
Optimizing Algorithm Choice for Metaproteomics: Comparing X!Tandem and Proteome Discoverer for Soil Proteomes

NASA Astrophysics Data System (ADS)

Diaz, K. S.; Kim, E. H.; Jones, R. M.; de Leon, K. C.; Woodcroft, B. J.; Tyson, G. W.; Rich, V. I.

2014-12-01

The growing field of metaproteomics links microbial communities to their expressed functions by using mass spectrometry methods to characterize community proteins. Comparison of mass spectrometry protein search algorithms and their biases is crucial for maximizing the quality and amount of protein identifications in mass spectral data. Available algorithms employ different approaches when mapping mass spectra to peptides against a database. We compared mass spectra from four microbial proteomes derived from high-organic content soils searched with two search algorithms: 1) Sequest HT as packaged within Proteome Discoverer (v.1.4) and 2) X!Tandem as packaged in TransProteomicPipeline (v.4.7.1). Searches used matched metagenomes, and results were filtered to allow identification of high probability proteins. There was little overlap in proteins identified by both algorithms, on average just ~24% of the total. However, when adjusted for spectral abundance, the overlap improved to ~70%. Proteome Discoverer generally outperformed X!Tandem, identifying an average of 12.5% more proteins than X!Tandem, with X!Tandem identifying more proteins only in the first two proteomes. For spectrally-adjusted results, the algorithms were similar, with X!Tandem marginally outperforming Proteome Discoverer by an average of ~4%. We then assessed differences in heat shock proteins (HSP) identification by the two algorithms by BLASTing identified proteins against the Heat Shock Protein Information Resource, because HSP hits typically account for the majority signal in proteomes, due to extraction protocols. Total HSP identifications for each of the 4 proteomes were approximately ~15%, ~11%, ~17%, and ~19%, with ~14% for total HSPs with redundancies removed. Of the ~15% average of proteins from the 4 proteomes identified as HSPs, ~10% of proteins and spectra were identified by both algorithms. On average, Proteome Discoverer identified ~9% more HSPs than X!Tandem.
PUF Proteins: Cellular Functions and Potential Applications.

PubMed

Kiani, Seyed Jalal; Taheri, Tahereh; Rafati, Sima; Samimi-Rad, Katayoun

2017-01-01

RNA-binding proteins play critical roles in the regulation of gene expression. Among several families of RNA-binding proteins, PUF (Pumilio and FBF) proteins have been the subject of extensive investigations, as they can bind RNA in a sequence-specific manner and they are evolutionarily conserved among a wide range of organisms. The outstanding feature of these proteins is a highly conserved RNA-binding domain, which is known as the Pumilio-homology domain (PUM-HD) that mostly consists of eight tandem repeats. Each repeat recognizes an RNA base with a simple three-letter code that can be programmed in order to change the sequence-specificity of the protein. Using this tailored architecture, researchers have been able to change the specificity of the PUM-HD and target desired transcripts in the cell, even in subcellular compartments. The potential applications of this versatile tool in molecular cell biology seem unbounded and the use of these factors in pharmaceutics might be an interesting field of study in near future. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Chlorovirus Skp1-binding ankyrin repeat protein interplay and mimicry of cellular ubiquitin ligase machinery.

PubMed

Noel, Eric A; Kang, Ming; Adamec, Jiri; Van Etten, James L; Oyler, George A

2014-12-01

The ubiquitin-proteasome system is targeted by many viruses that have evolved strategies to redirect host ubiquitination machinery. Members of the genus Chlorovirus are proposed to share an ancestral lineage with a broader group of related viruses, nucleo-cytoplasmic large DNA viruses (NCLDV). Chloroviruses encode an Skp1 homolog and ankyrin repeat (ANK) proteins. Several chlorovirus-encoded ANK repeats contain C-terminal domains characteristic of cellular F-boxes or related NCLDV chordopox PRANC (pox protein repeats of ankyrin at C-terminal) domains. These observations suggested that this unique combination of Skp1 and ANK repeat proteins might form complexes analogous to the cellular Skp1-Cul1-F-box (SCF) ubiquitin ligase complex. We identified two ANK proteins from the prototypic chlorovirus Paramecium bursaria chlorella virus-1 (PBCV-1) that functioned as binding partners for the virus-encoded Skp1, proteins A682L and A607R. These ANK proteins had a C-terminal Skp1 interactional motif that functioned similarly to cellular F-box domains. A C-terminal motif of ANK protein A682L binds Skp1 proteins from widely divergent species. Yeast two-hybrid analyses using serial domain deletion constructs confirmed the C-terminal localization of the Skp1 interactional motif in PBCV-1 A682L. ANK protein A607R represents an ANK family with one member present in all 41 sequenced chloroviruses. A comprehensive phylogenetic analysis of these related ANK and viral Skp1 proteins suggested partnered function tailored to the host alga or common ancestral heritage. Here, we show protein-protein interaction between corresponding family clusters of virus-encoded ANK and Skp1 proteins from three chlorovirus types. Collectively, our results indicate that chloroviruses have evolved complementing Skp1 and ANK proteins that mimic cellular SCF-associated proteins. Viruses have evolved ways to direct ubiquitination events in order to create environments conducive to their replication. As
Hierarchical modeling of genome-wide Short Tandem Repeat (STR) markers infers native American prehistory.

PubMed

Lewis, Cecil M

2010-02-01

This study examines a genome-wide dataset of 678 Short Tandem Repeat loci characterized in 444 individuals representing 29 Native American populations as well as the Tundra Netsi and Yakut populations from Siberia. Using these data, the study tests four current hypotheses regarding the hierarchical distribution of neutral genetic variation in native South American populations: (1) the western region of South America harbors more variation than the eastern region of South America, (2) Central American and western South American populations cluster exclusively, (3) populations speaking the Chibchan-Paezan and Equatorial-Tucanoan language stock emerge as a group within an otherwise South American clade, (4) Chibchan-Paezan populations in Central America emerge together at the tips of the Chibchan-Paezan cluster. This study finds that hierarchical models with the best fit place Central American populations, and populations speaking the Chibchan-Paezan language stock, at a basal position or separated from the South American group, which is more consistent with a serial founder effect into South America than that previously described. Western (Andean) South America is found to harbor similar levels of variation as eastern (Equatorial-Tucanoan and Ge-Pano-Carib) South America, which is inconsistent with an initial west coast migration into South America. Moreover, in all relevant models, the estimates of genetic diversity within geographic regions suggest a major bottleneck or founder effect occurring within the North American subcontinent, before the peopling of Central and South America. 2009 Wiley-Liss, Inc.
Analysis of short tandem repeat polymorphisms using infrared fluorescence with M18 tailed primers

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oetting, W.S.; Wiesner, G.; Laken, S.

The use of short tandem repeat polymorphisms (STRPs) are becoming increasingly important as markers for linkage analysis due to their large numbers of the human genome and their high degree of polymorphism. Fluorescence based detection of the STRP pattern using the LI-COR model 4000S automated DNA sequencer eliminates the need for radioactivity and produces a digitized image that can be used for the analysis of the polymorphisms. In an effort to reduce the cost of STRP analysis, we have synthesized primers with a 19 bp extension complementary to the sequence of the M13 primer on the 5{prime} end of onemore » of the two primers used in the amplification of the STRP instead of using primers with direct conjugation of the infrared fluorescent dye. Up to 5 primer pairs can be multiplexed together with the M13 primer-dye conjugate as the sole primer conjugated to the fluorescent dye. Comparisons between primers that have been directly conjugated to the fluor with those having the M13 sequence extension show no difference in the ability to determine the STRP pattern. At present, the entire Weber 4A set of STRP markers is available with the M13 5{prime} extension. We are currently using this technique for linkage analysis of familial breast cancer and asthma. The combination of STRP analysis using fluorescence detection will allow this technique to be fully automated for allele scoring and linkage analysis.« less
Antihypertensive activity of transgenic rice seed containing an 18-repeat novokinin peptide localized in the nucleolus of endosperm cells.

PubMed

Wakasa, Yuhya; Zhao, Hui; Hirose, Sakiko; Yamauchi, Daiki; Yamada, Yuko; Yang, Lijun; Ohinata, Kousaku; Yoshikawa, Masaaki; Takaiwa, Fumio

2011-09-01

Novokinin (Arg-Pro-Leu-Lys-Pro-Trp, RPLKPW) is a new potent antihypertensive peptide based on the sequence of ovokinin (2-7) derived from ovalbumin. We previously generated transgenic rice seeds in which eight novokinin were fused to storage protein glutelins (GluA2 and GluC) for expression. Oral administration of these seeds to spontaneously hypertensive rats (SHRs) reduced systolic blood pressures at a dose of 1 g seed/kg of SHR. Here, 10- or 18-tandem repeats of novokinin with an endoplasmic reticulum (ER) retention signal (Lys-Asp-Glu-Leu, KDEL) at the C terminus were directly expressed in rice under the control of the glutelin promoter containing its signal peptide. Only small amounts of the 18-repeat novokinin accumulated, and it was unexpectedly deposited in the nucleolus. This abnormal intracellular localization was explained by an endogenous signal for nuclear localization. The GFP reporter protein fused to this sequence targeted to nuclei by a transient assay using onion epidermal cells. Transgenic seed expressing the 18-repeat novokinin exhibited significantly higher antihypertensive activity after a single oral dose to SHR even at one-quarter the amount (0.25 g/kg) of the transgenic rice seed expressing the fusion construct; though, its novokinin content was much lower (1/5). Furthermore, in a long-term administration for 5 weeks, even a smaller dose (0.0625 g/kg) of transgenic seeds could confer antihypertensive activity. This high antihypertensive activity may be attributed to differences in digestibility of expressed products by gastrointestinal enzymes and the unique intracellular localization. These results indicate that accumulation of novokinin as a tandemly repeated structure in transgenic rice is more effective than as a fusion-type structure. © 2010 The Authors. Plant Biotechnology Journal © 2010 Society for Experimental Biology and Blackwell Publishing Ltd.
A Large Population Genetic Study of 15 Autosomal Short Tandem Repeat Loci for Establishment of Korean DNA Profile Database

PubMed Central

Yoo, Seong Yeon; Cho, Nam Soo; Park, Myung Jin; Seong, Ki Min; Hwang, Jung Ho; Song, Seok Bean; Han, Myun Soo; Lee, Won Tae; Chung, Ki Wha

2011-01-01

Genotyping of highly polymorphic short tandem repeat (STR) markers is widely used for the genetic identification of individuals in forensic DNA analyses and in paternity disputes. The National DNA Profile Databank recently established by the DNA Identification Act in Korea contains the computerized STR DNA profiles of individuals convicted of crimes. For the establishment of a large autosomal STR loci population database, 1805 samples were obtained at random from Korean individuals and 15 autosomal STR markers were analyzed using the AmpFlSTR Identifiler PCR Amplification kit. For the 15 autosomal STR markers, no deviations from the Hardy-Weinberg equilibrium were observed. The most informative locus in our data set was the D2S1338 with a discrimination power of 0.9699. The combined matching probability was 1.521 × 10-17. This large STR profile dataset including atypical alleles will be important for the establishment of the Korean DNA database and for forensic applications. PMID:21597912

A large-scale dataset of single and mixed-source short tandem repeat profiles to inform human identification strategies: PROVEDIt.

PubMed

Alfonse, Lauren E; Garrett, Amanda D; Lun, Desmond S; Duffy, Ken R; Grgicak, Catherine M

2018-01-01

DNA-based human identity testing is conducted by comparison of PCR-amplified polymorphic Short Tandem Repeat (STR) motifs from a known source with the STR profiles obtained from uncertain sources. Samples such as those found at crime scenes often result in signal that is a composite of incomplete STR profiles from an unknown number of unknown contributors, making interpretation an arduous task. To facilitate advancement in STR interpretation challenges we provide over 25,000 multiplex STR profiles produced from one to five known individuals at target levels ranging from one to 160 copies of DNA. The data, generated under 144 laboratory conditions, are classified by total copy number and contributor proportions. For the 70% of samples that were synthetically compromised, we report the level of DNA damage using quantitative and end-point PCR. In addition, we characterize the complexity of the signal by exploring the number of detected alleles in each profile. Copyright © 2017 Elsevier B.V. All rights reserved.
A large population genetic study of 15 autosomal short tandem repeat loci for establishment of Korean DNA profile database.

PubMed

Yoo, Seong Yeon; Cho, Nam Soo; Park, Myung Jin; Seong, Ki Min; Hwang, Jung Ho; Song, Seok Bean; Han, Myun Soo; Lee, Won Tae; Chung, Ki Wha

2011-07-01

Genotyping of highly polymorphic short tandem repeat (STR) markers is widely used for the genetic identification of individuals in forensic DNA analyses and in paternity disputes. The National DNA Profile Databank recently established by the DNA Identification Act in Korea contains the computerized STR DNA profiles of individuals convicted of crimes. For the establishment of a large autosomal STR loci population database, 1805 samples were obtained at random from Korean individuals and 15 autosomal STR markers were analyzed using the AmpFlSTR Identifiler PCR Amplification kit. For the 15 autosomal STR markers, no deviations from the Hardy-Weinberg equilibrium were observed. The most informative locus in our data set was the D2S1338 with a discrimination power of 0.9699. The combined matching probability was 1.521 × 10(-17). This large STR profile dataset including atypical alleles will be important for the establishment of the Korean DNA database and for forensic applications.
Characterization of Escherichia coli O157:H7 in New Zealand using multiple-locus variable-number tandem-repeat analysis.

PubMed

Dyet, K H; Robertson, I; Turbitt, E; Carter, P E

2011-03-01

Recently, multiple-locus variable-number tandem-repeat analysis (MLVA) has been proposed as an alternative to pulsed-field gel electrophoresis (PFGE) for characterization of Escherichia coli O157:H7. In this study we characterized 118 E. coli O157:H7 isolates from cases of gastrointestinal disease in New Zealand using XbaI PFGE profiles and a MLVA scheme that assessed variability in eight polymorphic loci. The 118 isolates characterized included all 80 E. coli O157:H7 referred to New Zealand's Enteric Reference Laboratory in 2006 and 29 phage-type 2 isolates from 2005. When applied to these isolates the discriminatory power of PFGE and MLVA was not significantly different. However, MLVA data may be more epidemiologically relevant as isolates from family clusters of disease had identical MLVA profiles, even when the XbaI PFGE profiles differed slightly. Furthermore, most isolates with indistinguishable XbaI PFGE profiles that did not appear to be epidemiologically related had distinct MLVA profiles.
The Energy Landscapes of Repeat-Containing Proteins: Topology, Cooperativity, and the Folding Funnels of One-Dimensional Architectures

PubMed Central

Komives, Elizabeth A.; Wolynes, Peter G.

2008-01-01

Repeat-proteins are made up of near repetitions of 20– to 40–amino acid stretches. These polypeptides usually fold up into non-globular, elongated architectures that are stabilized by the interactions within each repeat and those between adjacent repeats, but that lack contacts between residues distant in sequence. The inherent symmetries both in primary sequence and three-dimensional structure are reflected in a folding landscape that may be analyzed as a quasi–one-dimensional problem. We present a general description of repeat-protein energy landscapes based on a formal Ising-like treatment of the elementary interaction energetics in and between foldons, whose collective ensemble are treated as spin variables. The overall folding properties of a complete “domain” (the stability and cooperativity of the repeating array) can be derived from this microscopic description. The one-dimensional nature of the model implies there are simple relations for the experimental observables: folding free-energy (ΔGwater) and the cooperativity of denaturation (m-value), which do not ordinarily apply for globular proteins. We show how the parameters for the “coarse-grained” description in terms of foldon spin variables can be extracted from more detailed folding simulations on perfectly funneled landscapes. To illustrate the ideas, we present a case-study of a family of tetratricopeptide (TPR) repeat proteins and quantitatively relate the results to the experimentally observed folding transitions. Based on the dramatic effect that single point mutations exert on the experimentally observed folding behavior, we speculate that natural repeat proteins are “poised” at particular ratios of inter- and intra-element interaction energetics that allow them to readily undergo structural transitions in physiologically relevant conditions, which may be intrinsically related to their biological functions. PMID:18483553
Repeat-mediated epigenetic dysregulation of the FMR1 gene in the fragile X-related disorders.

PubMed

Usdin, Karen; Kumari, Daman

2015-01-01

The fragile X-related disorders are members of the Repeat Expansion Diseases, a group of genetic conditions resulting from an expansion in the size of a tandem repeat tract at a specific genetic locus. The repeat responsible for disease pathology in the fragile X-related disorders is CGG/CCG and the repeat tract is located in the 5' UTR of the FMR1 gene, whose protein product FMRP, is important for the proper translation of dendritic mRNAs in response to synaptic activation. There are two different pathological FMR1 allele classes that are distinguished only by the number of repeats. Premutation alleles have 55-200 repeats and confer risk of fragile X-associated tremor/ataxia syndrome and fragile X-associated primary ovarian insufficiency. Full mutation alleles on the other hand have >200 repeats and result in fragile X syndrome, a disorder that affects learning and behavior. Different symptoms are seen in carriers of premutation and full mutation alleles because the repeat number has paradoxical effects on gene expression: Epigenetic changes increase transcription from premutation alleles and decrease transcription from full mutation alleles. This review will cover what is currently known about the mechanisms responsible for these changes in FMR1 expression and how they may relate to other Repeat Expansion Diseases that also show repeat-mediated changes in gene expression.
Multicolor-based discrimination of 21 short tandem repeats and amelogenin using four fluorescent universal primers.

PubMed

Asari, Masaru; Okuda, Katsuhiro; Hoshina, Chisato; Omura, Tomohiro; Tasaki, Yoshikazu; Shiono, Hiroshi; Matsubara, Kazuo; Shimizu, Keiko

2016-02-01

The aim of this study was to develop a cost-effective genotyping method using high-quality DNA for human identification. A total of 21 short tandem repeats (STRs) and amelogenin were selected, and fluorescent fragments at 22 loci were simultaneously amplified in a single-tube reaction using locus-specific primers with 24-base universal tails and four fluorescent universal primers. Several nucleotide substitutions in universal tails and fluorescent universal primers enabled the detection of specific fluorescent fragments from the 22 loci. Multiplex polymerase chain reaction (PCR) produced intense FAM-, VIC-, NED-, and PET-labeled fragments ranging from 90 to 400 bp, and these fragments were discriminated using standard capillary electrophoretic analysis. The selected 22 loci were also analyzed using two commercial kits (the AmpFLSTR Identifiler Kit and the PowerPlex ESX 17 System), and results for two loci (D19S433 and D16S539) were discordant between these kits due to mutations at the primer binding sites. All genotypes from the 100 samples were determined using 2.5 ng of DNA by our method, and the expected alleles were completely recovered. Multiplex 22-locus genotyping using four fluorescent universal primers effectively reduces the costs to less than 20% of genotyping using commercial kits, and our method would be useful to detect silent alleles from commercial kit analysis. Copyright © 2015 Elsevier Inc. All rights reserved.
Tissue identity testing of cancer by short tandem repeat polymorphism: pitfalls of interpretation in the presence of microsatellite instability.

PubMed

Much, Melissa; Buza, Natalia; Hui, Pei

2014-03-01

Tissue identity testing by short tandem repeat (STR) polymorphism offers discriminating power in resolving tissue mix-up or contamination. However, one caveat is the presence of microsatellite unstable tumors, in which genetic alterations may drastically change the STR wild-type polymorphism leading to unexpected allelic discordance. We examined how tissue identity testing results can be altered by the presence of microsatellite instability (MSI). Eleven cases of MSI-unstable (9 intestinal and 2 endometrial adenocarcinomas) and 10 cases of MSI-stable tumors (all colorectal adenocarcinomas) were included. All had been previously tested by polymerase chain reaction testing at 5 National Cancer Institute (NCI) recommended MSI loci and/or immunohistochemistry for DNA mismatch repair proteins (MLH1, MSH2, MSH6, and PMS2). Tissue identity testing targeting 15 STR loci was performed using AmpF/STR Identifiler Amplification. Ten of 11 MSI-unstable tumors demonstrated novel alleles at 5 to 12 STR loci per case and frequently with 3 or more allelic peaks. However, all affected loci showed identifiable germline allele(s) in MSI-high tumors. A wild-type allelic profile was seen in 7 of 10 MSI-stable tumors. In the remaining 3 cases, isolated novel alleles were present at a unique single locus in addition to germline alleles. Loss of heterozygosity was observed frequently in both MSI-stable (6/11 cases) and MSI-unstable tumors (8/10 cases). In conclusion, MSI may significantly alter the wild-type allelic polymorphism, leading to potential interpretation errors of STR genotyping. Careful examination of the STR allelic pattern, high index of suspicion, and follow-up MSI testing are crucial to avoid erroneous conclusions and subsequent clinical and legal consequences. Copyright © 2014 Elsevier Inc. All rights reserved.
Enhancing Accuracy in Molecular Weight Determination of Highly Heterogeneously Glycosylated Proteins by Native Tandem Mass Spectrometry.

PubMed

Wang, Guanbo; de Jong, Rob N; van den Bremer, Ewald T J; Parren, Paul W H I; Heck, Albert J R

2017-05-02

The determination of molecular weights (MWs) of heavily glycosylated proteins is seriously hampered by the physicochemical characteristics and heterogeneity of the attached carbohydrates. Glycosylation impacts protein migration during sodium dodecyl sulfate (SDS)-polyacrylamide gel electrophoresis (PAGE) and size-exclusion chromatography (SEC) analysis. Standard electrospray ionization (ESI)-mass spectrometry does not provide a direct solution as this approach is hindered by extensive interference of ion signals caused by closely spaced charge states of broadly distributed glycoforms. Here, we introduce a native tandem MS-based approach, enabling charge-state resolution and charge assignment of protein ions including those that escape mass analysis under standard MS conditions. Using this method, we determined the MW of two model glycoproteins, the extra-cellular domains of the highly and heterogeneously glycosylated proteins CD38 and epidermal growth factor receptor (EGFR), as well as the overall MW and binding stoichiometries of these proteins in complex with a specific antibody.
Evaluation of a highly discriminating multiplex multi-locus variable-number of tandem-repeats (MLVA) analysis for Vibrio cholerae.

PubMed

Olsen, Jaran S; Aarskaug, Tone; Skogan, Gunnar; Fykse, Else Marie; Ellingsen, Anette Bauer; Blatny, Janet M

2009-09-01

Vibrio cholerae is the etiological agent of cholera and may be used in bioterror actions due to the easiness of its dissemination, and the public fear for acquiring the cholera disease. A simple and highly discriminating method for connecting clinical and environmental isolates of V. cholerae is needed in microbial forensics. Twelve different loci containing variable numbers of tandem-repeats (VNTRs) were evaluated in which six loci were polymorphic. Two multiplex reactions containing PCR primers targeting these six VNTRs resulted in successful DNA amplification of 142 various environmental and clinical V. cholerae isolates. The genetic distribution inside the V. cholerae strain collection was used to evaluate the discriminating power (Simpsons Diversity Index=0.99) of this new MLVA analysis, showing that the assay have a potential to differentiate between various strains, but also to identify those isolates which are collected from a common V. cholerae outbreak. This work has established a rapid and highly discriminating MLVA assay useful for track back analyses and/or forensic studies of V. cholerae infections.
Variable Number of Tandem Repeats in Salmonella enterica subsp. enterica for Typing Purposes

PubMed Central

Ramisse, Vincent; Houssu, Perrine; Hernandez, Eric; Denoeud, France; Hilaire, Valérie; Lisanti, Olivier; Ramisse, Françoise; Cavallo, Jean-Didier; Vergnaud, Gilles

2004-01-01

The genomic sequences of Salmonella enterica subsp. enterica strains CT18, Ty2 (serovar Typhi), and LT2 (serovar Typhimurium) were analyzed for potential variable number tandem repeats (VNTRs). A multiple-locus VNTR analysis (MLVA) of 99 strains of S. enterica supsp. enterica based on 10 VNTRs distinguished 52 genotypes and placed them into four groups. All strains tested were independent human isolates from France and did not reflect isolates from outbreak episodes. Of these 10 VNTRs, 7 showed variability within serovar Typhi, whereas 1 showed variability within serovar Typhimurium. Four VNTRs showed high Nei's diversity indices (DIs) of 0.81 to 0.87 within serovar Typhi (n = 27). Additionally, three of these more variable VNTRs showed DIs of 0.18 to 0.58 within serovar Paratyphi A (n = 10). The VNTR polymorphic site within multidrug-resistant (MDR) serovar Typhimurium isolates (n = 39; resistance to ampicillin, chloramphenicol, spectinomycin, sulfonamides, and tetracycline) showed a DI of 0.81. Cluster analysis not only identified three genetically distinct groups consistent with the present serovar classification of salmonellae (serovars Typhi, Paratyphi A, and Typhimurium) but also discriminated 25 subtypes (93%) within serovar Typhi isolates. The analysis discriminated only eight subtypes within serovar Typhimurium isolates resistant to ampicillin, chloramphenicol, spectinomycin, sulfonamides, and tetracycline, possibly reflecting the emergence in the mid-1990s of the DT104 phage type, which often displays such an MDR spectrum. Coupled with the ongoing improvements in automated procedures offered by capillary electrophoresis, use of these markers is proposed in further investigations of the potential of MLVA in outbreaks of salmonellosis, especially outbreaks of typhoid fever. PMID:15583305
Partners in crime: The role of tandem modules in gene transcription.

PubMed

Sharma, Rajal; Zhou, Ming-Ming

2015-09-01

Histones and their modifications play an important role in the regulation of gene transcription. Numerous modifications, such as acetylation, phosphorylation, methylation, ubiquitination, and SUMOylation, have been described. These modifications almost always co-occur and thereby increase the combinatorial complexity of post-translational modification detection. The domains that recognize these histone modifications often occur in tandem in the context of larger proteins and complexes. The presence of multiple modifications can positively or negatively regulate the binding of these tandem domains, influencing downstream cellular function. Alternatively, these tandem domains can have novel functions from their independent parts. Here we summarize structural and functional information known about major tandem domains and their histone binding properties. An understanding of these interactions is key for the development of epigenetic therapy. © 2015 The Protein Society.
Supplemented vaccination with tandem repeat M2e virus-like particles enhances protection against homologous and heterologous HPAI H5 viruses in chickens.

PubMed

Song, Byung-Min; Kang, Hyun-Mi; Lee, Eun-Kyoung; Jung, Suk Chan; Kim, Min-Chul; Lee, Yu-Na; Kang, Sang-Moo; Lee, Youn-Jeong

2016-01-27

Highly pathogenic avian influenza (HPAI) H5 viruses derived from A/Goose/Guangdong/1/96 have been continuously circulating globally, severely affecting the public health and poultry industries. The matrix 2 protein ectodomain (M2e) is considered a promising candidate for a universal cross-protective influenza vaccine that provides more effective control over HPAI H5 viruses harboring variant hemagglutinin (HA)-antigens. Here, we evaluated the protective efficacy of a tandem repeat construct of heterologous M2e presented on virus-like particles (M2e5x VLPs) either alone or as a supplement against HPAI H5 viruses in a chicken model. Chickens immunized with M2e5x VLPs alone induced M2e-specific antibodies but were not protected against HPAI H5. The homo- and cross-protective efficacy of M2e5x VLP-supplemented vaccination of chickens was also examined. Importantly, supplementation with M2e5x VLPs induced significantly higher levels of antibodies specific for M2e and different viruses as well as provided improved protection against homologous and heterologous HPAI H5 viruses. Considering the limited efficacy of inactivated vaccines, supplement vaccination with M2e5x VLPs may be an effective measure for preventing outbreaks of HPAI viruses that have the ability to constantly change their antigenic properties in poultry. Copyright © 2015 Elsevier Ltd. All rights reserved.
Allele frequency distribution for the variable number of tandem repeat locus D10S28 in Tamil Nadu (south India) population.

PubMed

Pandian, S K; Kumar, S; Krishnan, M; Dharmalingam, K; Damodaran, C

1995-09-01

Allele frequencies were determined in unrelated individuals of Tamil speaking population from the Madras City (Tamil Nadu, South India) area for the polymorphic DNA locus D10S28 using the probe TBQ7. Membranes hybridized with the probe YNH24 were subjected to deprobing and were subsequently hybridized with random priming - labeled, purified inserts of TBQ7. The sizes of the fragments were grouped to 100 bp as well as to arbitrary fixed bins (Federal Bureau of Investigation / Royal Canadian Mounted Police). There were 14 bins in the latter with the most common bin being 11 (1789-1924 bp) with a frequency of 9.8%. We observed a heterozygosity of 92% comparable to Caucasian populations. The data presented here can be used as the basis for utilizing this variable number of tandem repeats (TNTR) DNA marker for paternity determinations and forensic investigations.
Ankyrin-repeat containing proteins of microbes: a conserved structure with functional diversity

PubMed Central

Al-Khodor, Souhaila; Price, Christopher T.; Kalia, Awdhesh; Kwaik, Yousef Abu

2009-01-01

Summary The ankyrin repeat (ANK) is the most common protein-protein interaction motif in nature and predominantly found in eukaryotic proteins. The genome sequencing of various pathogenic or symbiotic bacteria and eukaryotic viruses identified numerous genes encoding ANK-containing proteins that were proposed to have been acquired from eukaryotes by horizontal gene transfer. However, the recent discovery of additional ANK-containing proteins encoded in the genomes of archaea and free-living bacteria suggests either a more ancient origin of the ANK motif or multiple convergent evolution events. Many bacterial pathogens employ various types of secretion systems to deliver ANK-containing proteins into eukaryotic cells where they mimic or manipulate various host functions. Understanding the molecular and biochemical functions of this family of proteins will enhance our understanding of important host-microbe interactions. PMID:19962898
Use of a tandem affinity purification assay to detect interactions between West Nile and dengue viral proteins and proteins of the mosquito vector

PubMed Central

Colpitts, Tonya M.; Cox, Jonathan; Nguyen, Annie; Feitosa, Fabiana; Krishnan, Manoj N.; Fikrig, Erol

2011-01-01

West Nile and dengue viruses are (re)emerging mosquito-borne flaviviruses that cause significant morbidity and mortality in man. The identification of mosquito proteins that associate with flaviviruses may provide novel targets to inhibit infection of the vector or block transmission to humans. Here, a tandem affinity purification (TAP) assay was used to identify 18 mosquito proteins that interact with dengue and West Nile capsid, envelope, NS2A or NS2B proteins. We further analyzed the interaction of mosquito cadherin with dengue and West Nile virus envelope protein using co-immunoprecipitation and immunofluorescence. Blocking the function of select mosquito factors, including actin, myosin, PI3-kinase and myosin light chain kinase, reduced both dengue and West Nile virus infection in mosquito cells. We show that the TAP method may be used in insect cells to accurately identify flaviviral-host protein interactions. Our data also provides several targets for interrupting flavivirus infection in mosquito vectors. PMID:21700306
Structure and Function of the Two Tandem WW Domains of the Pre-mRNA Splicing Factor FBP21 (Formin-binding Protein 21)*

PubMed Central

Huang, Xiaojuan; Beullens, Monique; Zhang, Jiahai; Zhou, Yi; Nicolaescu, Emilia; Lesage, Bart; Hu, Qi; Wu, Jihui; Bollen, Mathieu; Shi, Yunyu

2009-01-01

Human FBP21 (formin-binding protein 21) contains a matrin-type zinc finger and two tandem WW domains. It is a component of the spliceosomes and interacts with several established splicing factors. Here we demonstrate for the first time that FBP21 is an activator of pre-mRNA splicing in vivo and that its splicing activation function and interaction with the splicing factor SIPP1 (splicing factor that interacts with PQBP1 and PP1) are both mediated by the two tandem WW domains of group III. We determined the solution structure of the tandem WW domains of FBP21 and found that the WW domains recognize peptide ligands containing either group II (PPLP) or group III (PPR) motifs. The binding interfaces involve both the XP and XP2 grooves of the two WW domains. Significantly, the tandem WW domains of FBP21 are connected by a highly flexible region, enabling their simultaneous interaction with two proline-rich motifs of SIPP1. The strong interaction between SIPP1 and FBP21 can be explained by the conjugation of two low affinity interactions with the tandem WW domains. Our study provides a structural basis for understanding the molecular mechanism underlying the functional implication of FBP21 and the biological specificity of tandem WW domains. PMID:19592703
Multiple-locus variable-number tandem-repeat analysis of the swine dysentery pathogen, Brachyspira hyodysenteriae.

PubMed

Hidalgo, Alvaro; Carvajal, Ana; La, Tom; Naharro, Germán; Rubio, Pedro; Phillips, Nyree D; Hampson, David J

2010-08-01

The spirochete Brachyspira hyodysenteriae is the causative agent of swine dysentery, a severe colonic infection of pigs that has a considerable economic impact in many swine-producing countries. In spite of its importance, knowledge about the global epidemiology and population structure of B. hyodysenteriae is limited. Progress in this area has been hampered by the lack of a low-cost, portable, and discriminatory method for strain typing. The aim of the current study was to develop and test a multiple-locus variable-number tandem-repeat analysis (MLVA) method that could be used in basic veterinary diagnostic microbiology laboratories equipped with PCR technology or in more advanced laboratories with access to capillary electrophoresis. Based on eight loci, and when performed on isolates from different farms in different countries, as well as type and reference strains, the MLVA technique developed was highly discriminatory (Hunter and Gaston discriminatory index, 0.938 [95% confidence interval, 0.9175 to 0.9584]) while retaining a high phylogenetic value. Using the technique, the species was shown to be diverse (44 MLVA types from 172 isolates and strains), although isolates were stable in herds over time. The population structure appeared to be clonal. The finding of B. hyodysenteriae MLVA type 3 in piggeries in three European countries, as well as other, related, strains in different countries, suggests that spreading of the pathogen via carrier pigs is likely. MLVA overcame drawbacks associated with previous typing techniques for B. hyodysenteriae and was a powerful method for epidemiologic and population structure studies on this important pathogenic spirochete.
Systematic analyses of the ultraviolet radiation resistance-associated gene product (UVRAG) protein interactome by tandem affinity purification.

PubMed

Son, Ji-Hye; Hwang, Eurim C; Kim, Joungmok

2016-03-01

Ultraviolet radiation resistance-associated gene product (UVRAG) was originally identified as a protein involved in cellular responses to UV irradiation. Subsequent studies have demonstrated that UVRAG plays as an important role in autophagy, a lysosome-dependent catabolic program, as a part of a pro-autophagy PIK3C3/VPS34 lipid kinase complex. Several recent studies have shown that UVRAG is also involved in autophagy-independent cellular functions, such as DNA repair/stability and vesicular trafficking/fusion. Here, we examined the UVRAG protein interactome to obtain information about its functional network. To this end, we screened UVRAG-interacting proteins using a tandem affinity purification method coupled with MALDI-TOF/MS analysis. Our results demonstrate that UVRAG interacts with various proteins involved in a wide spectrum of cellular functions, including genome stability, protein translational elongation, protein localization (trafficking), vacuole organization, transmembrane transport as well as autophagy. Notably, the interactome list of high-confidence UVRAG-interacting proteins is enriched for proteins involved in the regulation of genome stability. Our systematic UVRAG interactome analysis should provide important clues for understanding a variety of UVRAG functions.
Tandem betatron

DOEpatents

Keinigs, Rhonald K.

1992-01-01

Two betatrons are provided in tandem for alternately accelerating an electron beam to avoid the single flux swing limitation of conventional betatrons and to accelerate the electron beam to high energies. The electron beam is accelerated in a first betatron during a period of increasing magnetic flux. The eletron beam is extracted from the first betatron as a peak magnetic flux is reached and then injected into a second betatron at a time of minimum magnetic flux in the second betatron. The cycle may be repeated until the desired electron beam energy is obtained. In one embodiment, the second betatron is axially offset from the first betatron to provide for electron beam injection directly at the axial location of the beam orbit in the second betatron.
Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts.

PubMed

Trofimova, Irina; Krasikova, Alla

2016-12-01

Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription.

Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts

PubMed Central

Krasikova, Alla

2016-01-01

ABSTRACT Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription. PMID:27763817
DNA fingerprinting of Shiga-toxin producing Escherichia coli O157 based on Multiple-Locus Variable-Number Tandem-Repeats Analysis (MLVA)

PubMed Central

Lindstedt, Bjørn-Arne; Heir, Even; Gjernes, Elisabet; Vardund, Traute; Kapperud, Georg

2003-01-01

Background The ability to react early to possible outbreaks of Escherichia coli O157:H7 and to trace possible sources relies on the availability of highly discriminatory and reliable techniques. The development of methods that are fast and has the potential for complete automation is needed for this important pathogen. Methods In all 73 isolates of shiga-toxin producing E. coli O157 (STEC) were used in this study. The two available fully sequenced STEC genomes were scanned for tandem repeated stretches of DNA, which were evaluated as polymorphic markers for isolate identification. Results The 73 E. coli isolates displayed 47 distinct patterns and the MLVA assay was capable of high discrimination between the E. coli O157 strains. The assay was fast and all the steps can be automated. Conclusion The findings demonstrate a novel high discriminatory molecular typing method for the important pathogen E. coli O157 that is fast, robust and offers many advantages compared to current methods. PMID:14664722
Advances in the Application of Designed Ankyrin Repeat Proteins (DARPins) as Research Tools and Protein Therapeutics.

PubMed

Boersma, Ykelien L

2018-01-01

Nonimmunoglobulin scaffolds have been developed to overcome the limitations of monoclonal antibodies with regard to stability and size. Of these scaffolds, the class of designed ankyrin repeat proteins (DARPins) has advanced the most in biochemical and biomedical applications. This review focuses on the recent progress in DARPin technology, highlighting the scaffold's potential and possibilities.
Modeling protein homopolymeric repeats: possible polyglutamine structural motifs for Huntington's disease.

PubMed

Lathrop, R H; Casale, M; Tobias, D J; Marsh, J L; Thompson, L M

1998-01-01

We describe a prototype system (Poly-X) for assisting an expert user in modeling protein repeats. Poly-X reduces the large number of degrees of freedom required to specify a protein motif in complete atomic detail. The result is a small number of parameters that are easily understood by, and under the direct control of, a domain expert. The system was applied to the polyglutamine (poly-Q) repeat in the first exon of huntingtin, the gene implicated in Huntington's disease. We present four poly-Q structural motifs: two poly-Q beta-sheet motifs (parallel and antiparallel) that constitute plausible alternatives to a similar previously published poly-Q beta-sheet motif, and two novel poly-Q helix motifs (alpha-helix and pi-helix). To our knowledge, helical forms of polyglutamine have not been proposed before. The motifs suggest that there may be several plausible aggregation structures for the intranuclear inclusion bodies which have been found in diseased neurons, and may help in the effort to understand the structural basis for Huntington's disease.
A novel signal transduction protein: Combination of solute binding and tandem PAS-like sensor domains in one polypeptide chain

DOE PAGES

Wu, R.; Wilton, R.; Cuff, M. E.; ...

2017-02-07

The tandem Per-Arnt-Sim (PAS) like sensors are commonly found in signal transduction proteins. The periplasmic solute binding protein (SBP) domains are found ubiquitously and are generally involved in solute transport. These domains are widely observed as parts of separate proteins but not within the same polypeptide chain. We report the structural and biochemical characterization of the extracellular ligand-binding receptor, Dret_0059 from Desulfohalobium retbaense DSM 5692, an organism isolated from the Retba salt lake in Senegal. The structure of Dret_0059 consists of a novel combination of SBP and TPAS sensor domains. The N-terminal region forms an SBP domain and the C-terminalmore » region folds into a tandem PAS-like domain structure. A ketoleucine moiety is bound to the SBP, whereas a cytosine molecule is bound in the distal PAS domain of the TPAS. The differential scanning flourimetry studies in solution support the ligands observed in the crystal structure. There are only two other proteins with this structural architecture in the non-redundant sequence data base and we predict that they too bind the same substrates. There is significant interaction between the SBP and TPAS domains, and it is quite conceivable that the binding of one ligand will have an effect on the binding of the other. Our attempts to remove the ligands bound to the protein during expression were not successful, therefore, it is not clear what the relative affects are. The genomic context of this receptor does not contain any protein components expected for transport function, hence, we suggest that Dret_0059 is likely involved in signal transduction and not in solute transport.« less
Skewing of the genetic architecture at the ZMYM3 human-specific 5' UTR short tandem repeat in schizophrenia.

PubMed

Alizadeh, F; Bozorgmehr, A; Tavakkoly-Bazzaz, J; Ohadi, M

2018-06-01

Differential expansion of a number of human short tandem repeats (STRs) at the critical core promoter and 5' untranslated region (UTR) support the hypothesis that at least some of these STRs may provide a selective advantage in human evolution. Following a genome-wide screen of all human protein-coding gene 5' UTRs based on the Ensembl database ( http://www.ensembl.org ), we previously reported that the longest STR in this interval is a (GA) 32 , which belongs to the X-linked zinc finger MYM-type containing 3 (ZMYM3) gene. In the present study, we analyzed the evolutionary implication of this region across evolution and examined the allele and genotype distribution of the "exceptionally long" STR by direct sequencing of 486 Iranian unrelated male subjects consisting of 196 cases of schizophrenia (SCZ) and 290 controls. We found that the ZMYM3 transcript containing the STR is human-specific (ENST00000373998.5). A significant allele variance difference was observed between the cases and controls (Levene's test for equality of variances F = 4.00, p < 0.03). In addition, six alleles were observed in the SCZ patients that were not detected in the control group ("disease-only" alleles) (mid p exact < 0.0003). Those alleles were at the extreme short and long ends of the allele distribution curve and composed 4% of the genotypes in the SCZ group. In conclusion, we found skewing of the genetic architecture at the ZMYM3 STR in SCZ. Further, we found a bell-shaped distribution of alleles and selection against alleles at the extreme ends of this STR. The ZMYM3 STR sets a prototype, the evolutionary course of which determines the range of alleles in a particular species. Extreme "disease-only" alleles and genotypes may change our perspective of adaptive evolution and complex disorders. The ZMYM3 gene "exceptionally long" STR should be sequenced in SCZ and other human-specific phenotypes/characteristics.
[Use of multiple locus variable number tandem repeats analysis for the Brucella systematization].

PubMed

Kulakov, Iu K; Kovalev, D A; Misetova, E N; Golovneva, S I; Liapustina, L V; Zheludkov, M M

2012-01-01

The methods of molecular-genetic differentiation to strain level acquire increasing significance in the current system of struggle with brucellosis. MLVA (multiple locus variable number tandem repeats analysis) was selected for molecular-genetic differentiation to strain level and simultaneous establishment of the genetic relationship of investigated Brucella strains. The goal of this work was MLVA typing of three pathogenic Brucella species strains with the analysis of stability of chosen loci, discrimination power and concordance to conventional phenotypic methods of the Brucella differentiation for use in systematization of brucellosis causing agents. Twenty six Brucella strains representing reference (n = 15), vaccine (n = 2) and field strains of three pathogenic Brucella species were tested: B. melitensis (n = 3), B. abortus (n = 2), B. suis (n = 2), and isolates (n = 2) with unidentified taxonomic position using MLVA with 9 pairs primers on known variable loci of Brucella genome. The analysis of the stability of chosen loci, discrimination power on Hunter-Gaston discrimination index (HGDI) and consistency to phenotypic methods of identification was performed. MLVA was confirmed for the results of phenotypic methods of identification, stability of the chosen loci in majority reference, and vaccine strains with a high index of variability HGDI 0.9969 for all loci. A dendrogram was plotted on the basis of MLVA data on distributed Brucella strains in related clusters according to its taxonomic species and biovar positions and construction of 25 genotypes. B. melitensis strains formed cluster related to the reference strain of B. melitensis 63/9 biovar 2. Australian isolates of Brucella 83-4 and Brucella 83-6 isolated from rodents formed a cluster distant from other strains of Brucella. MLVA is a promising method for differentiation of Brucella strains with known and unresolved taxonomic status for their systematization and creation of MLVA genotype catalogue that
The upstream Variable Number Tandem Repeat polymorphism of the monoamine oxidase type A gene influences trigeminal pain-related evoked responses.

PubMed

Di Lorenzo, Cherubino; Daverio, Andrea; Pasqualetti, Patrizio; Coppola, Gianluca; Giannoudas, Ioannis; Barone, Ylenia; Grieco, Gaetano S; Niolu, Cinzia; Pascale, Esterina; Santorelli, Filippo M; Nicoletti, Ferdinando; Pierelli, Francesco; Siracusano, Alberto; Seri, Stefano; Di Lorenzo, Giorgio

2014-02-01

Monoamines have an important role in neural plasticity, a key factor in cortical pain processing that promotes changes in neuronal network connectivity. Monoamine oxidase type A (MAOA) is an enzyme that, due to its modulating role in monoaminergic activity, could play a role in cortical pain processing. The X-linked MAOA gene is characterized by an allelic variant of length, the MAOA upstream Variable Number Tandem Repeat (MAOA-uVNTR) region polymorphism. Two allelic variants of this gene are known, the high-activity MAOA (HAM) and low-activity MAOA (LAM). We investigated the role of MAOA-uVNTR in cortical pain processing in a group of healthy individuals measured by the trigeminal electric pain-related evoked potential (tPREP) elicited by repeated painful stimulation. A group of healthy volunteers was genotyped to detect MAOA-uVNTR polymorphism. Electrical tPREPs were recorded by stimulating the right supraorbital nerve with a concentric electrode. The N2 and P2 component amplitude and latency as well as the N2-P2 inter-peak amplitude were measured. The recording was divided into three blocks, each containing 10 consecutive stimuli and the N2-P2 amplitude was compared between blocks. Of the 67 volunteers, 37 were HAM and 30 were LAM. HAM subjects differed from LAM subjects in terms of amplitude of the grand-averaged and first-block N2-P2 responses (HAM>LAM). The N2-P2 amplitude decreased between the first and third block in HAM subjects but not LAM subjects. The MAOA-uVNTR polymorphism seemed to influence the brain response in a repeated tPREP paradigm and suggested a role of the MAOA as a modulator of neural plasticity related to cortical pain processing. © 2014 Federation of European Neuroscience Societies and John Wiley & Sons Ltd.
First Worldwide Proficiency Study on Variable-Number Tandem-Repeat Typing of Mycobacterium tuberculosis Complex Strains

PubMed Central

de Beer, Jessica L.; Kremer, Kristin; Ködmön, Csaba; Supply, Philip

2012-01-01

Although variable-number tandem-repeat (VNTR) typing has gained recognition as the new standard for the DNA fingerprinting of Mycobacterium tuberculosis complex (MTBC) isolates, external quality control programs have not yet been developed. Therefore, we organized the first multicenter proficiency study on 24-locus VNTR typing. Sets of 30 DNAs of MTBC strains, including 10 duplicate DNA samples, were distributed among 37 participating laboratories in 30 different countries worldwide. Twenty-four laboratories used an in-house-adapted method with fragment sizing by gel electrophoresis or an automated DNA analyzer, nine laboratories used a commercially available kit, and four laboratories used other methods. The intra- and interlaboratory reproducibilities of VNTR typing varied from 0% to 100%, with averages of 72% and 60%, respectively. Twenty of the 37 laboratories failed to amplify particular VNTR loci; if these missing results were ignored, the number of laboratories with 100% interlaboratory reproducibility increased from 1 to 5. The average interlaboratory reproducibility of VNTR typing using a commercial kit was better (88%) than that of in-house-adapted methods using a DNA analyzer (70%) or gel electrophoresis (50%). Eleven laboratories using in-house-adapted manual typing or automated typing scored inter- and intralaboratory reproducibilities of 80% or higher, which suggests that these approaches can be used in a reliable way. In conclusion, this first multicenter study has documented the worldwide quality of VNTR typing of MTBC strains and highlights the importance of international quality control to improve genotyping in the future. PMID:22170917
Use of specific peptide biomarkers for quantitative confirmation of hidden allergenic peanut proteins Ara h 2 and Ara h 3/4 for food control by liquid chromatography-tandem mass spectrometry.

PubMed

Careri, M; Costa, A; Elviri, L; Lagos, J-B; Mangia, A; Terenghi, M; Cereti, A; Garoffo, L Perono

2007-11-01

A liquid chromatography-electrospray-tandem mass spectrometry (LC-ESI-MS-MS) method based on the detection of biomarker peptides from allergenic proteins was devised for confirming and quantifying peanut allergens in foods. Peptides obtained from tryptic digestion of Ara h 2 and Ara h 3/4 proteins were identified and characterized by LC-MS and LC-MS-MS with a quadrupole-time of flight mass analyzer. Four peptides were chosen and investigated as biomarkers taking into account their selectivity, the absence of missed cleavages, the uniform distribution in the Ara h 2 and Ara h 3/4 protein isoforms together with their spectral features under ESI-MS-MS conditions, and good repeatability of LC retention time. Because of the different expression levels, the selection of two different allergenic proteins was proved to be useful in the identification and univocal confirmation of the presence of peanuts in foodstuffs. Using rice crisp and chocolate-based snacks as model food matrix, an LC-MS-MS method with triple quadrupole mass analyzer allowed good detection limits to be obtained for Ara h 2 (5 microg protein g(-1) matrix) and Ara h 3/4 (1 microg protein g(-1) matrix). Linearity of the method was established in the 10-200 microg g(-1) range of peanut proteins in the food matrix investigated. Method selectivity was demonstrated by analyzing tree nuts (almonds, pecan nuts, hazelnuts, walnuts) and food ingredients such as milk, soy beans, chocolate, cornflakes, and rice crisp.
Independent movement, dimerization and stability of tandem repeats of chicken brain alpha-spectrin

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kusunoki, H.; Minasov, G.; Macdonald, R.I.

Previous X-ray crystal structures have shown that linkers of five amino acid residues connecting pairs of chicken brain {alpha}-spectrin and human erythroid {beta}-spectrin repeats can undergo bending without losing their {alpha}-helical structure. To test whether bending at one linker can influence bending at an adjacent linker, the structures of two and three repeat fragments of chicken brain {alpha}-spectrin have been determined by X-ray crystallography. The structure of the three-repeat fragment clearly shows that bending at one linker can occur independently of bending at an adjacent linker. This observation increases the possible trajectories of modeled chains of spectrin repeats. Furthermore, themore » three-repeat molecule crystallized as an antiparallel dimer with a significantly smaller buried interfacial area than that of {alpha}-actinin, a spectrin-related molecule, but large enough and of a type indicating biological specificity. Comparison of the structures of the spectrin and {alpha}-actinin dimers supports weak association of the former, which could not be detected by analytical ultracentrifugation, versus strong association of the latter, which has been observed by others. To correlate features of the structure with solution properties and to test a previous model of stable spectrin and dystrophin repeats, the number of inter-helical interactions in each repeat of several spectrin structures were counted and compared to their thermal stabilities. Inter-helical interactions, but not all interactions, increased in parallel with measured thermal stabilities of each repeat and in agreement with the thermal stabilities of two and three repeats and also partial repeats of spectrin.« less
Identification of ubiquitin/ubiquitin-like protein modification from tandem mass spectra with various PTMs

PubMed Central

2011-01-01

Background Various solutions have been introduced for the identification of post-translational modification (PTM) from tandem mass spectrometry (MS/MS) in proteomics field but the identification of peptide modifiers, such as Ubiquitin (Ub) and ubiquitin-like proteins (Ubls), is still a challenge. The fragmentation of peptide modifier produce complex shifted ion mass patterns in combination with other PTMs, which makes it difficult to identify and locate the PTMs on a protein sequence. Currently, most PTM identification methods do not consider the complex fragmentation of peptide modifier or deals it separately from the other PTMs. Results We developed an advanced PTM identification method that inspects possible ion patterns of the most known peptide modifiers as well as other known biological and chemical PTMs to make more comprehensive and accurate conclusion. The proposed method searches all detectable mass differences of measured peaks from their theoretical values and the mass differences within mass tolerance range are grouped as mass shift classes. The most possible locations of multiple PTMs including peptide modifiers can be determined by evaluating all possible scenarios generated by the combination of the qualified mass shift classes.The proposed method showed excellent performance in the test with simulated spectra having various PTMs including peptide modifiers and in the comparison with recently developed methods such as QuickMod and SUMmOn. In the analysis of HUPO Brain Proteome Project (BPP) datasets, the proposed method could find the ubiquitin modification sites that were not identified by other conventional methods. Conclusions This work presents a novel method for identifying bothpeptide modifiers that generate complex fragmentation patternsand PTMs that are not fragmented during fragmentation processfrom tandem mass spectra. PMID:22373085
A Dual Origin of the Xist Gene from a Protein-Coding Gene and a Set of Transposable Elements

PubMed Central

Elisaphenko, Eugeny A.; Kolesnikov, Nikolay N.; Shevchenko, Alexander I.; Rogozin, Igor B.; Nesterova, Tatyana B.; Brockdorff, Neil; Zakian, Suren M.

2008-01-01

X-chromosome inactivation, which occurs in female eutherian mammals is controlled by a complex X-linked locus termed the X-inactivation center (XIC). Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons including those with simple tandem repeats detectable in their structure have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA. PMID:18575625
Screening of repetitive motifs inside the genome of the flat oyster (Ostrea edulis): Transposable elements and short tandem repeats.

PubMed

Vera, Manuel; Bello, Xabier; Álvarez-Dios, Jose-Antonio; Pardo, Belen G; Sánchez, Laura; Carlsson, Jens; Carlsson, Jeanette E L; Bartolomé, Carolina; Maside, Xulio; Martinez, Paulino

2015-12-01

The flat oyster (Ostrea edulis) is one of the most appreciated molluscs in Europe, but its production has been greatly reduced by the parasite Bonamia ostreae. Here, new generation genomic resources were used to analyse the repetitive fraction of the oyster genome, with the aim of developing molecular markers to face this main oyster production challenge. The resulting oyster database, consists of two sets of 10,318 and 7159 unique contigs (4.8 Mbp and 6.8 Mbp in total length) representing the oyster's genome (WG) and haemocyte transcriptome (HT), respectively. A total of 1083 sequences were identified as TE-derived, which corresponded to 4.0% of WG and 1.1% of HT. They were clustered into 142 homology groups, most of which were assigned to the Penelope order of retrotransposons, and to the Helitron and TIR DNA-transposons. Simple repeats and rRNA pseudogenes, also made a significant contribution to the oyster's genome (0.5% and 0.3% of WG and HT, respectively).The most frequent short tandem repeats identified in WG were tetranucleotide motifs while trinucleotide motifs were in HT. Forty identified microsatellite loci, 20 from each database, were selected for technical validation. Success was much lower among WG than HT microsatellites (15% vs 55%), which could reflect higher variation in anonymous regions interfering with primer annealing. All microsatellites developed adjusted to Hardy-Weinberg proportions and represent a useful tool to support future breeding programmes and to manage genetic resources of natural flat oyster beds. Copyright © 2015 Elsevier B.V. All rights reserved.
Concerted evolution of the tandem array encoding primate U2 snRNA occurs in situ, without changing the cytological context of the RNU2 locus.

PubMed Central

Pavelitz, T; Rusché, L; Matera, A G; Scharf, J M; Weiner, A M

1995-01-01

In primates, the tandemly repeated genes encoding U2 small nuclear RNA evolve concertedly, i.e. the sequence of the U2 repeat unit is essentially homogeneous within each species but differs somewhat between species. Using chromosome painting and the NGFR gene as an outside marker, we show that the U2 tandem array (RNU2) has remained at the same chromosomal locus (equivalent to human 17q21) through multiple speciation events over > 35 million years leading to the Old World monkey and hominoid lineages. The data suggest that the U2 tandem repeat, once established in the primate lineage, contained sequence elements favoring perpetuation and concerted evolution of the array in situ, despite a pericentric inversion in chimpanzee, a reciprocal translocation in gorilla and a paracentric inversion in orang utan. Comparison of the 11 kb U2 repeat unit found in baboon and other Old World monkeys with the 6 kb U2 repeat unit in humans and other hominids revealed that an ancestral U2 repeat unit was expanded by insertion of a 5 kb retrovirus bearing 1 kb long terminal repeats (LTRs). Subsequent excision of the provirus by homologous recombination between the LTRs generated a 6 kb U2 repeat unit containing a solo LTR. Remarkably, both junctions between the human U2 tandem array and flanking chromosomal DNA at 17q21 fall within the solo LTR sequence, suggesting a role for the LTR in the origin or maintenance of the primate U2 array. Images PMID:7828589
Rapid Identification of Laboratory Contamination with Mycobacterium tuberculosis Using Variable Number Tandem Repeat Analysis

PubMed Central

Gascoyne-Binzi, Deborah M.; Barlow, Rachael E. L.; Frothingham, Richard; Robinson, Grant; Collyns, Timothy A.; Gelletlie, Ruth; Hawkey, Peter M.

2001-01-01

Compared with solid media, broth-based mycobacterial culture systems have increased sensitivity but also have higher false-positive rates due to cross-contamination. Systematic strain typing is rarely undertaken because the techniques are technically demanding and the data are difficult to organize. Variable number tandem repeat (VNTR) analysis by PCR is rapid and reproducible. The digital profile is easily manipulated in a database. We undertook a retrospective study of Mycobacterium tuberculosis isolates collected over an 18-month period following the introduction of the BACTEC MGIT 960 system. VNTR allele profiles were determined with early positive broth cultures and entered into a database with the specimen processing date and other specimen data. We found 36 distinct VNTR profiles in cultures from 144 patients. Three common VNTR profiles accounted for 45% of true-positive cases. By combining VNTR results with specimen data, we identified nine cross-contamination incidents, six of which were previously unsuspected. These nine incidents resulted in 34 false-positive cultures for 29 patients. False-positive cultures were identified for three patients who had previously been culture positive for tuberculosis and were receiving treatment. Identification of cross-contamination incidents requires careful documentation of specimen data and good communication between clinical and laboratory staff. Automated broth culture systems should be supplemented with molecular analysis to identify cross-contamination events. VNTR analysis is reproducible and provides timely results when applied to early positive broth cultures. This method should ensure that patients are not placed on unnecessary tuberculosis therapy or that cases are not falsely identified as treatment failures. In addition, areas where existing procedures may be improved can be identified. PMID:11136751
Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies.

PubMed Central

Wincker, P; Jubier-Maurin, V; Roizès, G

1987-01-01

Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies. Images PMID:3684566
Cloning and Molecular Characterization of an Immunogenic LigA Protein of Leptospira interrogans

PubMed Central

Palaniappan, Raghavan U. M.; Chang, Yung-Fu; Jusuf, S. S. D.; Artiushin, S.; Timoney, John F.; McDonough, Sean P.; Barr, Steve C.; Divers, Thomas J.; Simpson, Kenneth W.; McDonough, Patrick L.; Mohammed, Hussni O.

2002-01-01

A clone expressing a novel immunoreactive leptospiral immunoglobulin-like protein A of 130 kDa (LigA) from Leptospira interrogans serovar pomona type kennewicki was isolated by screening a genomic DNA library with serum from a mare that had recently aborted due to leptospiral infection. LigA is encoded by an open reading frame of 3,675 bp, and the deduced amino acid sequence consists of a series of 90-amino-acid tandem repeats. A search of the NCBI database found that homology of the LigA repeat region was limited to an immunoglobulin-like domain of the bacterial intimin binding protein of Escherichia coli, the cell adhesion domain of Clostridium acetobutylicum, and the invasin of Yersinia pestis. Secondary structure prediction analysis indicates that LigA consists mostly of beta sheets with a few alpha-helical regions. No LigA was detectable by immunoblot analysis of lysates of the leptospires grown in vitro at 30°C or when cultures were shifted to 37°C. Strikingly, immunohistochemistry on kidney from leptospira-infected hamsters demonstrated LigA expression. These findings suggest that LigA is specifically induced only in vivo. Sera from horses, which aborted as a result of natural Leptospira infection, strongly recognize LigA. LigA is the first leptospiral protein described to have 12 tandem repeats and is also the first to be expressed only during infection. Thus, LigA may have value in serodiagnosis or as a protective immunogen in novel vaccines. PMID:12379666
An Unusual Hydrophobic Core Confers Extreme Flexibility to HEAT Repeat Proteins

PubMed Central

Kappel, Christian; Zachariae, Ulrich; Dölker, Nicole; Grubmüller, Helmut

2010-01-01

Alpha-solenoid proteins are suggested to constitute highly flexible macromolecules, whose structural variability and large surface area is instrumental in many important protein-protein binding processes. By equilibrium and nonequilibrium molecular dynamics simulations, we show that importin-β, an archetypical α-solenoid, displays unprecedentedly large and fully reversible elasticity. Our stretching molecular dynamics simulations reveal full elasticity over up to twofold end-to-end extensions compared to its bound state. Despite the absence of any long-range intramolecular contacts, the protein can return to its equilibrium structure to within 3 Å backbone RMSD after the release of mechanical stress. We find that this extreme degree of flexibility is based on an unusually flexible hydrophobic core that differs substantially from that of structurally similar but more rigid globular proteins. In that respect, the core of importin-β resembles molten globules. The elastic behavior is dominated by nonpolar interactions between HEAT repeats, combined with conformational entropic effects. Our results suggest that α-solenoid structures such as importin-β may bridge the molecular gap between completely structured and intrinsically disordered proteins. PMID:20816072
Design of tryptophan-containing mutants of the symmetrical Pizza protein for biophysical studies.

PubMed

Noguchi, Hiroki; Mylemans, Bram; De Zitter, Elke; Van Meervelt, Luc; Tame, Jeremy R H; Voet, Arnout

2018-03-18

β-propeller proteins are highly symmetrical, being composed of a repeated motif with four anti-parallel β-sheets arranged around a central axis. Recently we designed the first completely symmetrical β-propeller protein, Pizza6, consisting of six identical tandem repeats. Pizza6 is expected to prove a useful building block for bionanotechnology, and also a tool to investigate the folding and evolution of β-propeller proteins. Folding studies are made difficult by the high stability and the lack of buried Trp residues to act as monitor fluorophores, so we have designed and characterized several Trp-containing Pizza6 derivatives. In total four proteins were designed, of which three could be purified and characterized. Crystal structures confirm these mutant proteins maintain the expected structure, and a clear redshift of Trp fluorescence emission could be observed upon denaturation. Among the derivative proteins, Pizza6-AYW appears to be the most suitable model protein for future folding/unfolding kinetics studies as it has a comparable stability as natural β-propeller proteins. Copyright © 2018 Elsevier Inc. All rights reserved.

Evolution of short inverted repeat in cupressophytes, transfer of accD to nucleus in Sciadopitys verticillata and phylogenetic position of Sciadopityaceae.

PubMed

Li, Jia; Gao, Lei; Chen, Shanshan; Tao, Ke; Su, Yingjuan; Wang, Ting

2016-02-11

Sciadopitys verticillata is an evergreen conifer and an economically valuable tree used in construction, which is the only member of the family Sciadopityaceae. Acquisition of the S. verticillata chloroplast (cp) genome will be useful for understanding the evolutionary mechanism of conifers and phylogenetic relationships among gymnosperm. In this study, we have first reported the complete chloroplast genome of S. verticillata. The total genome is 138,284 bp in length, consisting of 118 unique genes. The S. verticillata cp genome has lost one copy of the canonical inverted repeats and shown distinctive genomic structure comparing with other cupressophytes. Fifty-three simple sequence repeat loci and 18 forward tandem repeats were identified in the S. verticillata cp genome. According to the rearrangement of cupressophyte cp genome, we proposed one mechanism for the formation of inverted repeat: tandem repeat occured first, then rearrangement divided the tandem repeat into inverted repeats located at different regions. Phylogenetic estimates inferred from 59-gene sequences and cpDNA organizations have both shown that S. verticillata was sister to the clade consisting of Cupressaceae, Taxaceae, and Cephalotaxaceae. Moreover, accD gene was found to be lost in the S. verticillata cp genome, and a nucleus copy was identified from two transcriptome data.
A variable number of tandem repeats in the 3'-untranslated region of the dopamine transporter modulates striatal function during working memory updating across the adult age span.

PubMed

Sambataro, Fabio; Podell, Jamie E; Murty, Vishnu P; Das, Saumitra; Kolachana, Bhaskar; Goldberg, Terry E; Weinberger, Daniel R; Mattay, Venkata S

2015-08-01

Dopamine modulation of striatal function is critical for executive functions such as working memory (WM) updating. The dopamine transporter (DAT) regulates striatal dopamine signaling via synaptic reuptake. A variable number of tandem repeats in the 3'-untranslated region of SLC6A3 (DAT1-3'-UTR-VNTR) is associated with DAT expression, such that 9-repeat allele carriers tend to express lower levels (associated with higher extracellular dopamine concentrations) than 10-repeat homozygotes. Aging is also associated with decline of the dopamine system. The goal of the present study was to investigate the effects of aging and DAT1-3'-UTR-VNTR on the neural activity and functional connectivity of the striatum during WM updating. Our results showed both an age-related decrease in striatal activity and an effect of DAT1-3'-UTR-VNTR. Ten-repeat homozygotes showed reduced striatal activity and increased striatal-hippocampal connectivity during WM updating relative to the 9-repeat carriers. There was no age by DAT1-3'-UTR-VNTR interaction. These results suggest that, whereas striatal function during WM updating is modulated by both age and genetically determined DAT levels, the rate of the age-related decline in striatal function is similar across both DAT1-3'-UTR-VNTR genotype groups. They further suggest that, because of the baseline difference in striatal function based on DAT1-3'-UTR-VNTR polymorphism, 10-repeat homozygotes, who have lower levels of striatal function throughout the adult life span, may reach a threshold of decreased striatal function and manifest impairments in cognitive processes mediated by the striatum earlier in life than the 9-repeat carriers. Our data suggest that age and DAT1-3'-UTR-VNTR polymorphism independently modulate striatal function. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.
Evidence that a proposed repeated segment of glutamine residues is expressed in the Huntington disease protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jou, Y.S.; Myers, R.M.

1994-09-01

Huntington disease (HD) appears to be caused by a mutation that results in an expanded number of CAG repeats at the 5{prime} end of the gene. The nucleotide sequence of the gene and cDNA clones predicts a 347 kd protein that contains a stretch of polyglutamine, encoded by the CAG repeat, located 17 amino acids downstream from the proposed translation initiation site. Because understanding the mechanisms of the pathology of HD depends on whether the CAG-repeat is expressed in the protein, we used antibodies directed against portions of the predicted HD gene product to probe the structure of the proteinmore » in tissue culture cells. Two peptides, one located amino-terminal to the proposed polyglutamine stretch (hd1 peptide FESLKSFQQ from amino acids 11-19) and one located in the carboxy-terminal half of the predicted protein (hd2 peptide QQPRNKPLK from amino acids 2531-2539), were used to elicit polyclonal antibodies in NZW rabbits. We affinity-purified the antibodies and used them to analyze the HD protein. Both antisera specifically recognize the peptides used to elicit them, as well as the appropriate portions of the HD protein expressed in E. coli. Western blot analysis showed that both antisera recognize a protein with an apparent molecular weight of approximately 350,000 in human, monkey, rat and mouse cell lines, including two neutronal cell lines. These results, in combination with immunoprecipitation experiments, suggest strongly that the proposed polyglutamine stretch is indeed translated in the HD protein and is evolutionarily conserved in various mammalian species.« less
Expression of Plasmodium falciparum Circumsporozoite Proteins in Escherichia coli for Potential Use in a Human Malaria Vaccine

NASA Astrophysics Data System (ADS)

Young, James F.; Hockmeyer, Wayne T.; Gross, Mitchell; Ripley Ballou, W.; Wirtz, Robert A.; Trosper, James H.; Beaudoin, Richard L.; Hollingdale, Michael R.; Miller, Louis H.; Diggs, Carter L.; Rosenberg, Martin

1985-05-01

The circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum may be the most promising target for the development of a malaria vaccine. In this study, proteins composed of 16, 32, or 48 tandem copies of a tetrapeptide repeating sequence found in the CS protein were efficiently expressed in the bacterium Escherichia coli. When injected into mice, these recombinant products resulted in the production of high titers of antibodies that reacted with the authentic CS protein on live sporozoites and blocked sporozoite invasion of human hepatoma cells in vitro. These CS protein derivatives are therefore candidates for a human malaria vaccine.
Hexanucleotide Repeats in ALS/FTD Form Length-Dependent RNA Foci, Sequester RNA Binding Proteins, and Are Neurotoxic

PubMed Central

Lee, Youn-Bok; Chen, Han-Jou; Peres, João N.; Gomez-Deza, Jorge; Attig, Jan; Štalekar, Maja; Troakes, Claire; Nishimura, Agnes L.; Scotter, Emma L.; Vance, Caroline; Adachi, Yoshitsugu; Sardone, Valentina; Miller, Jack W.; Smith, Bradley N.; Gallo, Jean-Marc; Ule, Jernej; Hirth, Frank; Rogelj, Boris; Houart, Corinne; Shaw, Christopher E.

2013-01-01

Summary The GGGGCC (G4C2) intronic repeat expansion within C9ORF72 is the most common genetic cause of amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Intranuclear neuronal RNA foci have been observed in ALS and FTD tissues, suggesting that G4C2 RNA may be toxic. Here, we demonstrate that the expression of 38× and 72× G4C2 repeats form intranuclear RNA foci that initiate apoptotic cell death in neuronal cell lines and zebrafish embryos. The foci colocalize with a subset of RNA binding proteins, including SF2, SC35, and hnRNP-H in transfected cells. Only hnRNP-H binds directly to G4C2 repeats following RNA immunoprecipitation, and only hnRNP-H colocalizes with 70% of G4C2 RNA foci detected in C9ORF72 mutant ALS and FTD brain tissues. We show that expanded G4C2 repeats are potently neurotoxic and bind hnRNP-H and other RNA binding proteins. We propose that RNA toxicity and protein sequestration may disrupt RNA processing and contribute to neurodegeneration. PMID:24290757
History, rare, and multiple events of mechanical unfolding of repeat proteins

NASA Astrophysics Data System (ADS)

Sumbul, Fidan; Marchesi, Arin; Rico, Felix

2018-03-01

Mechanical unfolding of proteins consisting of repeat domains is an excellent tool to obtain large statistics. Force spectroscopy experiments using atomic force microscopy on proteins presenting multiple domains have revealed that unfolding forces depend on the number of folded domains (history) and have reported intermediate states and rare events. However, the common use of unspecific attachment approaches to pull the protein of interest holds important limitations to study unfolding history and may lead to discarding rare and multiple probing events due to the presence of unspecific adhesion and uncertainty on the pulling site. Site-specific methods that have recently emerged minimize this uncertainty and would be excellent tools to probe unfolding history and rare events. However, detailed characterization of these approaches is required to identify their advantages and limitations. Here, we characterize a site-specific binding approach based on the ultrastable complex dockerin/cohesin III revealing its advantages and limitations to assess the unfolding history and to investigate rare and multiple events during the unfolding of repeated domains. We show that this approach is more robust, reproducible, and provides larger statistics than conventional unspecific methods. We show that the method is optimal to reveal the history of unfolding from the very first domain and to detect rare events, while being more limited to assess intermediate states. Finally, we quantify the forces required to unfold two molecules pulled in parallel, difficult when using unspecific approaches. The proposed method represents a step forward toward more reproducible measurements to probe protein unfolding history and opens the door to systematic probing of rare and multiple molecule unfolding mechanisms.
An ice-binding and tandem beta-sandwich domain-containing protein in Shewanella frigidimarina is a potential new type of ice adhesin.

PubMed

Vance, Tyler D R; Graham, Laurie A; Davies, Peter L

2018-04-01

Out of the dozen different ice-binding protein (IBP) structures known, the DUF3494 domain is the most widespread, having been passed many times between prokaryotic and eukaryotic microorganisms by horizontal gene transfer. This ~25-kDa β-solenoid domain with an adjacent parallel α-helix is most commonly associated with an N-terminal secretory signal peptide. However, examples of the DUF3494 domain preceded by tandem Bacterial Immunoglobulin-like (BIg) domains are sometimes found, though uncharacterized. Here, we present one such protein (SfIBP_1) from the Antarctic bacterium Shewanella frigidimarina. We have confirmed and characterized the ice-binding activity of its ice-binding domain using thermal hysteresis measurements, fluorescent ice plane affinity analysis, and ice recrystallization inhibition assays. X-ray crystallography was used to solve the structure of the SfIBP_1 ice-binding domain, to further characterize its ice-binding surface and unique method of stabilizing or 'capping' the ends of the solenoid structure. The latter is formed from the interaction of two loops mediated by a combination of tandem prolines and electrostatic interactions. Furthermore, given their domain architecture and membrane association, we propose that these BIg-containing DUF3494 IBPs serve as ice-binding adhesion proteins that are capable of adsorbing their host bacterium onto ice. Submitted new structure to the Protein Data Bank (PDB: 6BG8). © 2018 Federation of European Biochemical Societies.
47 CFR 69.111 - Tandem-switched transport and tandem charge.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 47 Telecommunication 3 2011-10-01 2011-10-01 false Tandem-switched transport and tandem charge. 69... SERVICES (CONTINUED) ACCESS CHARGES Computation of Charges § 69.111 Tandem-switched transport and tandem...-switched transport shall consist of two rate elements, a transmission charge and a tandem switching charge...
Accurate quantification of chromosomal lesions via short tandem repeat analysis using minimal amounts of DNA

PubMed Central

Jann, Johann-Christoph; Nowak, Daniel; Nolte, Florian; Fey, Stephanie; Nowak, Verena; Obländer, Julia; Pressler, Jovita; Palme, Iris; Xanthopoulos, Christina; Fabarius, Alice; Platzbecker, Uwe; Giagounidis, Aristoteles; Götze, Katharina; Letsch, Anne; Haase, Detlef; Schlenk, Richard; Bug, Gesine; Lübbert, Michael; Ganser, Arnold; Germing, Ulrich; Haferlach, Claudia; Hofmann, Wolf-Karsten; Mossner, Maximilian

2017-01-01

Background Cytogenetic aberrations such as deletion of chromosome 5q (del(5q)) represent key elements in routine clinical diagnostics of haematological malignancies. Currently established methods such as metaphase cytogenetics, FISH or array-based approaches have limitations due to their dependency on viable cells, high costs or semi-quantitative nature. Importantly, they cannot be used on low abundance DNA. We therefore aimed to establish a robust and quantitative technique that overcomes these shortcomings. Methods For precise determination of del(5q) cell fractions, we developed an inexpensive multiplex-PCR assay requiring only nanograms of DNA that simultaneously measures allelic imbalances of 12 independent short tandem repeat markers. Results Application of this method to n=1142 samples from n=260 individuals revealed strong intermarker concordance (R²=0.77–0.97) and reproducibility (mean SD: 1.7%). Notably, the assay showed accurate quantification via standard curve assessment (R²>0.99) and high concordance with paired FISH measurements (R²=0.92) even with subnanogram amounts of DNA. Moreover, cytogenetic response was reliably confirmed in del(5q) patients with myelodysplastic syndromes treated with lenalidomide. While the assay demonstrated good diagnostic accuracy in receiver operating characteristic analysis (area under the curve: 0.97), we further observed robust correlation between bone marrow and peripheral blood samples (R²=0.79), suggesting its potential suitability for less-invasive clonal monitoring. Conclusions In conclusion, we present an adaptable tool for quantification of chromosomal aberrations, particularly in problematic samples, which should be easily applicable to further tumour entities. PMID:28600436
Recommendation of short tandem repeat profiling for authenticating human cell lines, stem cells, and tissues.

PubMed

Barallon, Rita; Bauer, Steven R; Butler, John; Capes-Davis, Amanda; Dirks, Wilhelm G; Elmore, Eugene; Furtado, Manohar; Kline, Margaret C; Kohara, Arihiro; Los, Georgyi V; MacLeod, Roderick A F; Masters, John R W; Nardone, Mark; Nardone, Roland M; Nims, Raymond W; Price, Paul J; Reid, Yvonne A; Shewale, Jaiprakash; Sykes, Gregory; Steuer, Anton F; Storts, Douglas R; Thomson, Jim; Taraporewala, Zenobia; Alston-Roberts, Christine; Kerrigan, Liz

2010-10-01

Cell misidentification and cross-contamination have plagued biomedical research for as long as cells have been employed as research tools. Examples of misidentified cell lines continue to surface to this day. Efforts to eradicate the problem by raising awareness of the issue and by asking scientists voluntarily to take appropriate actions have not been successful. Unambiguous cell authentication is an essential step in the scientific process and should be an inherent consideration during peer review of papers submitted for publication or during review of grants submitted for funding. In order to facilitate proper identity testing, accurate, reliable, inexpensive, and standardized methods for authentication of cells and cell lines must be made available. To this end, an international team of scientists is, at this time, preparing a consensus standard on the authentication of human cells using short tandem repeat (STR) profiling. This standard, which will be submitted for review and approval as an American National Standard by the American National Standards Institute, will provide investigators guidance on the use of STR profiling for authenticating human cell lines. Such guidance will include methodological detail on the preparation of the DNA sample, the appropriate numbers and types of loci to be evaluated, and the interpretation and quality control of the results. Associated with the standard itself will be the establishment and maintenance of a public STR profile database under the auspices of the National Center for Biotechnology Information. The consensus standard is anticipated to be adopted by granting agencies and scientific journals as appropriate methodology for authenticating human cell lines, stem cells, and tissues.
Recommendation of short tandem repeat profiling for authenticating human cell lines, stem cells, and tissues

PubMed Central

Barallon, Rita; Bauer, Steven R.; Butler, John; Capes-Davis, Amanda; Dirks, Wilhelm G.; Furtado, Manohar; Kline, Margaret C.; Kohara, Arihiro; Los, Georgyi V.; MacLeod, Roderick A. F.; Masters, John R. W.; Nardone, Mark; Nardone, Roland M.; Nims, Raymond W.; Price, Paul J.; Reid, Yvonne A.; Shewale, Jaiprakash; Sykes, Gregory; Steuer, Anton F.; Storts, Douglas R.; Thomson, Jim; Taraporewala, Zenobia; Alston-Roberts, Christine; Kerrigan, Liz

2010-01-01

Cell misidentification and cross-contamination have plagued biomedical research for as long as cells have been employed as research tools. Examples of misidentified cell lines continue to surface to this day. Efforts to eradicate the problem by raising awareness of the issue and by asking scientists voluntarily to take appropriate actions have not been successful. Unambiguous cell authentication is an essential step in the scientific process and should be an inherent consideration during peer review of papers submitted for publication or during review of grants submitted for funding. In order to facilitate proper identity testing, accurate, reliable, inexpensive, and standardized methods for authentication of cells and cell lines must be made available. To this end, an international team of scientists is, at this time, preparing a consensus standard on the authentication of human cells using short tandem repeat (STR) profiling. This standard, which will be submitted for review and approval as an American National Standard by the American National Standards Institute, will provide investigators guidance on the use of STR profiling for authenticating human cell lines. Such guidance will include methodological detail on the preparation of the DNA sample, the appropriate numbers and types of loci to be evaluated, and the interpretation and quality control of the results. Associated with the standard itself will be the establishment and maintenance of a public STR profile database under the auspices of the National Center for Biotechnology Information. The consensus standard is anticipated to be adopted by granting agencies and scientific journals as appropriate methodology for authenticating human cell lines, stem cells, and tissues. PMID:20614197
The proteins cleaved by endogenous tryptic proteases in normal EDTA plasma by C18 collection of peptides for liquid chromatography micro electrospray ionization and tandem mass spectrometry.

PubMed

Dufresne, Jaimie; Florentinus-Mefailoski, Angelique; Ajambo, Juliet; Ferwa, Ammara; Bowden, Peter; Marshall, John

2017-01-01

The tryptic peptides from ice cold versus room temperature plasma were identified by C18 liquid chromatography and micro electrospray ionization tandem mass spectrometry (LC-ESI-MS/MS). Samples collected on ice showed low levels of endogenous tryptic peptides compared to the same samples incubated at room temperature. Plasma on ice contained peptides from albumin, complement, and apolipoproteins and others that were observed by the X!TANDEM and SEQUEST algorithms. In contrast to ice cold samples, after incubation at room temperature, greater numbers of tryptic peptides from well characterized plasma proteins, and from cellular proteins were observed. A total of 583,927 precursor ions and MS/MS spectra were correlated to 94,669 best fit peptides that reduced to 22,287 correlations to the best accession within a gene symbol and to 7174 correlations to at least 510 gene symbols with ≥ 5 independent MS/MS correlations (peptide counts) that showed FDR q-values ranging from E-9 (i.e. FDR = 0.000000001) to E-227. A set of 528 gene symbols identified by X!TANDEM and SEQUEST including C4B showed ≥ fivefold variation between ice cold versus room temperature incubation. STRING analysis of the protein gene symbols observed from endogenous peptides in normal plasma revealed an extensive protein-interaction network of cellular factors associated with cell signalling and regulation, the formation of membrane bound organelles, cellular exosomes and exocytosis network proteins. Taken together the results indicated that a pool of cellular proteins, or protein complexes, in plasma are apparently not stable and degrade soon after incubation at room temperature.
MET-activating Residues in the B-repeat of the Listeria monocytogenes Invasion Protein InlB*

PubMed Central

Bleymüller, Willem M.; Lämmermann, Nina; Ebbes, Maria; Maynard, Daniel; Geerds, Christina; Niemann, Hartmut H.

2016-01-01

The facultative intracellular pathogen Listeria monocytogenes causes listeriosis, a rare but life-threatening disease. Host cell entry begins with activation of the human receptor tyrosine kinase MET through the bacterial invasion protein InlB, which contains an internalin domain, a B-repeat, and three GW domains. The internalin domain is known to bind MET, but no interaction partner is known for the B-repeat. Adding the B-repeat to the internalin domain potentiates MET activation and is required to stimulate Madin-Darby canine kidney (MDCK) cell scatter. Therefore, it has been hypothesized that the B-repeat may bind a co-receptor on host cells. To test this hypothesis, we mutated residues that might be important for binding an interaction partner. We identified two adjacent residues in strand β2 of the β-grasp fold whose mutation abrogated induction of MDCK cell scatter. Biophysical analysis indicated that these mutations do not alter protein structure. We then tested these mutants in human HT-29 cells that, in contrast to the MDCK cells, were responsive to the internalin domain alone. These assays revealed a dominant negative effect, reducing the activity of a construct of the internalin domain and mutated B-repeat below that of the individual internalin domain. Phosphorylation assays of MET and its downstream targets AKT and ERK confirmed the dominant negative effect. Attempts to identify a host cell receptor for the B-repeat were not successful. We conclude that there is limited support for a co-receptor hypothesis and instead suggest that the B-repeat contributes to MET activation through low affinity homodimerization. PMID:27789707
Tandem mass spectrometry data quality assessment by self-convolution.

PubMed

Choo, Keng Wah; Tham, Wai Mun

2007-09-20

Many algorithms have been developed for deciphering the tandem mass spectrometry (MS) data sets. They can be essentially clustered into two classes. The first performs searches on theoretical mass spectrum database, while the second based itself on de novo sequencing from raw mass spectrometry data. It was noted that the quality of mass spectra affects significantly the protein identification processes in both instances. This prompted the authors to explore ways to measure the quality of MS data sets before subjecting them to the protein identification algorithms, thus allowing for more meaningful searches and increased confidence level of proteins identified. The proposed method measures the qualities of MS data sets based on the symmetric property of b- and y-ion peaks present in a MS spectrum. Self-convolution on MS data and its time-reversal copy was employed. Due to the symmetric nature of b-ions and y-ions peaks, the self-convolution result of a good spectrum would produce a highest mid point intensity peak. To reduce processing time, self-convolution was achieved using Fast Fourier Transform and its inverse transform, followed by the removal of the "DC" (Direct Current) component and the normalisation of the data set. The quality score was defined as the ratio of the intensity at the mid point to the remaining peaks of the convolution result. The method was validated using both theoretical mass spectra, with various permutations, and several real MS data sets. The results were encouraging, revealing a high percentage of positive prediction rates for spectra with good quality scores. We have demonstrated in this work a method for determining the quality of tandem MS data set. By pre-determining the quality of tandem MS data before subjecting them to protein identification algorithms, spurious protein predictions due to poor tandem MS data are avoided, giving scientists greater confidence in the predicted results. We conclude that the algorithm performs well
Tandem repeat variation near the HIC1 (hypermethylated in cancer 1) promoter predicts outcome of oxaliplatin-based chemotherapy in patients with metastatic colorectal cancer.

PubMed

Okazaki, Satoshi; Schirripa, Marta; Loupakis, Fotios; Cao, Shu; Zhang, Wu; Yang, Dongyun; Ning, Yan; Berger, Martin D; Miyamoto, Yuji; Suenaga, Mitsukuni; Iqubal, Syma; Barzi, Afsaneh; Cremolini, Chiara; Falcone, Alfredo; Battaglin, Francesca; Salvatore, Lisa; Borelli, Beatrice; Helentjaris, Timothy G; Lenz, Heinz-Josef

2017-11-15

The hypermethylated in cancer 1/sirtuin 1 (HIC1/SIRT1) axis plays an important role in regulating the nucleotide excision repair pathway, which is the main oxaliplatin-induced damage-repair system. On the basis of prior evidence that the variable number of tandem repeat (VNTR) sequence located near the promoter lesion of HIC1 is associated with HIC1 gene expression, the authors tested the hypothesis that this VNTR is associated with clinical outcome in patients with metastatic colorectal cancer who receive oxaliplatin-based chemotherapy. Four independent cohorts were tested. Patients who received oxaliplatin-based chemotherapy served as the training cohort (n = 218), and those who received treatment without oxaliplatin served as the control cohort (n = 215). Two cohorts of patients who received oxaliplatin-based chemotherapy were used for validation studies (n = 176 and n = 73). The VNTR sequence near HIC1 was analyzed by polymerase chain reaction analysis and gel electrophoresis and was tested for associations with the response rate, progression-free survival, and overall survival. In the training cohort, patients who harbored at least 5 tandem repeats (TRs) in both alleles had a significantly shorter PFS compared with those who had fewer than 4 TRs in at least 1 allele (9.5 vs 11.6 months; hazard ratio, 1.93; P = .012), and these findings remained statistically significant after multivariate analysis (hazard ratio, 2.00; 95% confidence interval, 1.13-3.54; P = .018). This preliminary association was confirmed in the validation cohort, and patients who had at least 5 TRs in both alleles had a worse PFS compared with the other cohort (7.9 vs 9.8 months; hazard ratio, 1.85; P = .044). The current findings suggest that the VNTR sequence near HIC1 could be a predictive marker for oxaliplatin-based chemotherapy in patients with metastatic colorectal cancer. Cancer 2017;123:4506-14. © 2017 American Cancer Society. © 2017 American Cancer Society.
Variable number of tandem repeat profiles and antimicrobial resistance patterns of Staphylococcus haemolyticus strains isolated from blood cultures in children.

PubMed

Hosseinkhani, Faride; Jabalameli, Fereshteh; Nodeh Farahani, Narges; Taherikalani, Morovat; van Leeuwen, Willem B; Emaneini, Mohammad

2016-03-01

Staphylococcus haemolyticus is a healthcare-associated pathogen and can cause a variety of lifethreatening infections. Additionally, multi-drug resistance (MDR), in particular methicillin-resistant S. haemolyticus (MRSH) isolates, have emerged. Dissemination of such strains can be of great concern in the hospital environment. A total number of 20S. haemolyticus isolates from blood cultures obtained from children were included in this study. A high prevalence of MDR-MRSH isolates with high MIC values to vancomycin was found and 35% of the isolates were intermediate resistant to vancomycin. Multilocus variable number of tandem repeats analysis (MLVF) revealed 5 MLVF types among 20 isolates of S. haemolyticus. Twelve isolates shared the same MLVF type and were isolated from different wards in a pediatric hospital in Iran. This is a serious alarm for infection control; i.e. in the absence of adequate infection diagnostics and infection control guidelines, these resistant strains can spread to other sectors of a hospital and possibly among the community. Copyright © 2015 Elsevier B.V. All rights reserved.
Evolutionary Conservation of a Coding Function for D4Z4, the Tandem DNA Repeat Mutated in Facioscapulohumeral Muscular Dystrophy

PubMed Central

Clapp, Jannine ; Mitchell, Laura M. ; Bolland, Daniel J. ; Fantes, Judy ; Corcoran, Anne E. ; Scotting, Paul J. ; Armour, John A. L. ; Hewitt, Jane E.

2007-01-01

Facioscapulohumeral muscular dystrophy (FSHD) is caused by deletions within the polymorphic DNA tandem array D4Z4. Each D4Z4 repeat unit has an open reading frame (ORF), termed “DUX4,” containing two homeobox sequences. Because there has been no evidence of a transcript from the array, these deletions are thought to cause FSHD by a position effect on other genes. Here, we identify D4Z4 homologues in the genomes of rodents, Afrotheria (superorder of elephants and related species), and other species and show that the DUX4 ORF is conserved. Phylogenetic analysis suggests that primate and Afrotherian D4Z4 arrays are orthologous and originated from a retrotransposed copy of an intron-containing DUX gene, DUXC. Reverse-transcriptase polymerase chain reaction and RNA fluorescence and tissue in situ hybridization data indicate transcription of the mouse array. Together with the conservation of the DUX4 ORF for >100 million years, this strongly supports a coding function for D4Z4 and necessitates re-examination of current models of the FSHD disease mechanism. PMID:17668377
Highly efficient purification of protein complexes from mammalian cells using a novel streptavidin-binding peptide and hexahistidine tandem tag system: Application to Bruton's tyrosine kinase

PubMed Central

Li, Yifeng; Franklin, Sarah; Zhang, Michael J; Vondriska, Thomas M

2011-01-01

Tandem affinity purification (TAP) is a generic approach for the purification of protein complexes. The key advantage of TAP is the engineering of dual affinity tags that, when attached to the protein of interest, allow purification of the target protein along with its binding partners through two consecutive purification steps. The tandem tag used in the original method consists of two IgG-binding units of protein A from Staphylococcus aureus (ProtA) and the calmodulin-binding peptide (CBP), and it allows for recovery of 20–30% of the bait protein in yeast. When applied to higher eukaryotes, however, this classical TAP tag suffers from low yields. To improve protein recovery in systems other than yeast, we describe herein the development of a three-tag system comprised of CBP, streptavidin-binding peptide (SBP) and hexa-histidine. We illustrate the application of this approach for the purification of human Bruton's tyrosine kinase (Btk), which results in highly efficient binding and elution of bait protein in both purification steps (>50% recovery). Combined with mass spectrometry for protein identification, this TAP strategy facilitated the first nonbiased analysis of Btk interacting proteins. The high efficiency of the SBP-His6 purification allows for efficient recovery of protein complexes formed with a target protein of interest from a small amount of starting material, enhancing the ability to detect low abundance and transient interactions in eukaryotic cell systems. PMID:21080425
Mutation rates at 42 Y chromosomal short tandem repeats in Chinese Han population in Eastern China.

PubMed

Wu, Weiwei; Ren, Wenyan; Hao, Honglei; Nan, Hailun; He, Xin; Liu, Qiuling; Lu, Dejian

2018-01-31

Mutation analysis of 42 Y chromosomal short tandem repeats (Y-STRs) loci was performed using a sample of 1160 father-son pairs from the Chinese Han population in Eastern China. The results showed that the average mutation rate across the 42 Y-STR loci was 0.0041 (95% CI 0.0036-0.0047) per locus per generation. The locus-specific mutation rates varied from 0.000 to 0.0190. No mutation was found at DYS388, DYS437, DYS448, DYS531, and GATA_H4. DYS627, DYS570, DYS576, and DYS449 could be classified as rapidly mutating Y-STRs, with mutation rates higher than 1.0 × 10 -2 . DYS458, DYS630, and DYS518 were moderately mutating Y-STRs, with mutation rates ranging from 8 × 10 -3 to 1 × 10 -2 . Although the characteristics of the Y-STR mutations were consistent with those in previous studies, mutation rate differences between our data and previous published data were found at some rapidly mutating Y-STRs. The single-copy loci located on the short arm of the Y chromosome (Yp) showed relatively higher mutation rates more frequently than the multi-copy loci. These results will not only extend the data for Y-STR mutations but also be important for kinship analysis, paternal lineage identification, and family relationship reconstruction in forensic Y-STR analysis.
Combinatorial control of Drosophila circular RNA expression by intronic repeats, hnRNPs, and SR proteins.

PubMed

Kramer, Marianne C; Liang, Dongming; Tatomer, Deirdre C; Gold, Beth; March, Zachary M; Cherry, Sara; Wilusz, Jeremy E

2015-10-15

Thousands of eukaryotic protein-coding genes are noncanonically spliced to produce circular RNAs. Bioinformatics has indicated that long introns generally flank exons that circularize in Drosophila, but the underlying mechanisms by which these circular RNAs are generated are largely unknown. Here, using extensive mutagenesis of expression plasmids and RNAi screening, we reveal that circularization of the Drosophila laccase2 gene is regulated by both intronic repeats and trans-acting splicing factors. Analogous to what has been observed in humans and mice, base-pairing between highly complementary transposable elements facilitates backsplicing. Long flanking repeats (∼ 400 nucleotides [nt]) promote circularization cotranscriptionally, whereas pre-mRNAs containing minimal repeats (<40 nt) generate circular RNAs predominately after 3' end processing. Unlike the previously characterized Muscleblind (Mbl) circular RNA, which requires the Mbl protein for its biogenesis, we found that Laccase2 circular RNA levels are not controlled by Mbl or the Laccase2 gene product but rather by multiple hnRNP (heterogeneous nuclear ribonucleoprotein) and SR (serine-arginine) proteins acting in a combinatorial manner. hnRNP and SR proteins also regulate the expression of other Drosophila circular RNAs, including Plexin A (PlexA), suggesting a common strategy for regulating backsplicing. Furthermore, the laccase2 flanking introns support efficient circularization of diverse exons in Drosophila and human cells, providing a new tool for exploring the functional consequences of circular RNA expression across eukaryotes. © 2015 Kramer et al.; Published by Cold Spring Harbor Laboratory Press.

Combinatorial control of Drosophila circular RNA expression by intronic repeats, hnRNPs, and SR proteins

PubMed Central

Kramer, Marianne C.; Liang, Dongming; Tatomer, Deirdre C.; Gold, Beth; March, Zachary M.; Cherry, Sara; Wilusz, Jeremy E.

2015-01-01

Thousands of eukaryotic protein-coding genes are noncanonically spliced to produce circular RNAs. Bioinformatics has indicated that long introns generally flank exons that circularize in Drosophila, but the underlying mechanisms by which these circular RNAs are generated are largely unknown. Here, using extensive mutagenesis of expression plasmids and RNAi screening, we reveal that circularization of the Drosophila laccase2 gene is regulated by both intronic repeats and trans-acting splicing factors. Analogous to what has been observed in humans and mice, base-pairing between highly complementary transposable elements facilitates backsplicing. Long flanking repeats (∼400 nucleotides [nt]) promote circularization cotranscriptionally, whereas pre-mRNAs containing minimal repeats (<40 nt) generate circular RNAs predominately after 3′ end processing. Unlike the previously characterized Muscleblind (Mbl) circular RNA, which requires the Mbl protein for its biogenesis, we found that Laccase2 circular RNA levels are not controlled by Mbl or the Laccase2 gene product but rather by multiple hnRNP (heterogeneous nuclear ribonucleoprotein) and SR (serine–arginine) proteins acting in a combinatorial manner. hnRNP and SR proteins also regulate the expression of other Drosophila circular RNAs, including Plexin A (PlexA), suggesting a common strategy for regulating backsplicing. Furthermore, the laccase2 flanking introns support efficient circularization of diverse exons in Drosophila and human cells, providing a new tool for exploring the functional consequences of circular RNA expression across eukaryotes. PMID:26450910
Linking Y-chromosomal short tandem repeat loci to human male impulsive aggression.

PubMed

Yang, Chun; Ba, Huajie; Cao, Yin; Dong, Guoying; Zhang, Shuyou; Gao, Zhiqin; Zhao, Hanqing; Zhou, Xianju

2017-11-01

Men are more susceptible to impulsive behavior than women. Epidemiological studies revealed that the impulsive aggressive behavior is affected by genetic factors, and the male-specific Y chromosome plays an important role in this behavior. In this study, we investigated the association between the impulsive aggressive behavior and Y-chromosomal short tandem repeats (Y-STRs) loci. The collected biologic samples from 271 offenders with impulsive aggressive behavior and 492 healthy individuals without impulsive aggressive behavior were amplified by PowerPlex R Y23 PCR System and the resultant products were separated by electrophoresis and further genotyped. Then, comparisons in allele and haplotype frequencies of the selected 22 Y-STRs were made in the two groups. Our results showed that there were significant differences in allele frequencies at DYS448 and DYS456 between offenders and controls ( p < .05). Univariate analysis further revealed significant frequency differences for alleles 18 and 22 at DYS448 (0.18 vs 0.27, compared to the controls, p = .003, OR=0.57,95% CI=0.39-0.82; 0.03 vs 0.01, compared to the controls, p = .003, OR=7.45, 95% CI=1.57-35.35, respectively) and for allele 17 at DYS456 (0.07 vs 0.14, compared to the controls, p = .006, OR=0.48, 95% CI =0.28-0.82) between two groups. Interestingly, the frequency of haploid haplotype 22-15 on the DYS448-DYS456 (DYS448-DYS456-22-15) was significantly higher in offenders than in controls (0.033 vs 0.004, compared to the control, p = .001, OR = 8.42, 95%CI =1.81-39.24). Moreover, there were no significant differences in allele frequencies of other Y-STRs loci between two groups. Furthermore, the unconditional logistic regression analysis confirmed that alleles 18 and 22 at DYS448 and allele 17 at DYS456 are associated with male impulsive aggression. However, the DYS448-DYS456-22-15 is less related to impulsive aggression. Our results suggest a link between Y-chromosomal allele types and male
Fifteen non-CODIS autosomal short tandem repeat loci multiplex data from nine population groups living in Taiwan.

PubMed

Hwa, Hsiao-Lin; Chang, Yih-Yuan; Lee, James Chun-I; Lin, Chun-Yen; Yin, Hsiang-Yi; Tseng, Li-Hui; Su, Yi-Ning; Ko, Tsang-Ming

2012-07-01

The analysis of autosomal short tandem repeat (STR) loci is a powerful tool in forensic genetics. We developed a multiplex system in which 15 non-Combined DNA Index System autosomal STRs (D3S1744, D4S2366, D8S1110, D10S2325, D12S1090, D13S765, D14S608, Penta E, D17S1294, D18S536, D18S1270, D20S470, D21S1437, Penta D, and D22S683) could be amplified in one single polymerase chain reaction. DNA samples from 1,098 unrelated subjects of nine population groups living in Taiwan, including Taiwanese Han, indigenous Taiwanese of Taiwan Island, Tao, mainland Chinese, Filipinos, Thais, Vietnamese, Indonesians, and Caucasians, were collected and analyzed using this system. The distributions of the allelic frequencies and the forensic parameters of each population group were presented. The combined discrimination power and the combined power of exclusion were high in all population groups tested in this study. A multidimensional scaling plot of these nine population groups based on the Reynolds' genetic distances calculated from 15 autosomal STRs was constructed, and the genetic substructure in this area was presented. In conclusion, this 15 autosomal STR multiplex system provides highly informative STR data and appears useful in forensic casework and parentage testing in different populations.
Highly Effective DNA Extraction Method for Nuclear Short Tandem Repeat Testing of Skeletal Remains from Mass Graves

PubMed Central

Davoren, Jon; Vanek, Daniel; Konjhodzić, Rijad; Crews, John; Huffine, Edwin; Parsons, Thomas J.

2007-01-01

Aim To quantitatively compare a silica extraction method with a commonly used phenol/chloroform extraction method for DNA analysis of specimens exhumed from mass graves. Methods DNA was extracted from twenty randomly chosen femur samples, using the International Commission on Missing Persons (ICMP) silica method, based on Qiagen Blood Maxi Kit, and compared with the DNA extracted by the standard phenol/chloroform-based method. The efficacy of extraction methods was compared by real time polymerase chain reaction (PCR) to measure DNA quantity and the presence of inhibitors and by amplification with the PowerPlex 16 (PP16) multiplex nuclear short tandem repeat (STR) kit. Results DNA quantification results showed that the silica-based method extracted on average 1.94 ng of DNA per gram of bone (range 0.25-9.58 ng/g), compared with only 0.68 ng/g by the organic method extracted (range 0.0016-4.4880 ng/g). Inhibition tests showed that there were on average significantly lower levels of PCR inhibitors in DNA isolated by the organic method. When amplified with PP16, all samples extracted by silica-based method produced 16 full loci profiles, while only 75% of the DNA extracts obtained by organic technique amplified 16 loci profiles. Conclusions The silica-based extraction method showed better results in nuclear STR typing from degraded bone samples than a commonly used phenol/chloroform method. PMID:17696302
Structure of the circumsporozoite protein gene in 18 strains of Plasmodium falciparum.

PubMed

Weber, J L; Hockmeyer, W T

1985-06-01

Using the cloned circumsporozoite (CS) protein gene of a Brazilian strain of Plasmodium falciparum as probe, we have analyzed the structure of the CS protein gene from 17 other Asian, African, Central and South American parasite strains by nucleic acid hybridization. Each strain appears to have one CS protein gene which hybridizes readily to the Brazilian strain probe. The 5' and 3' thirds of the genes are invariant in size in all 18 strains whereas the central third containing the 12 base pair tandem repeats varies in size over a range of about 100 base pairs. Several differences were found in the locations of Sau3A sites in the genes. The Sau3A sites are significant because each of the minority Asn-Val-Asp-Pro repeats in the cloned gene has a Sau3A site. DNA melting of hybrids revealed a high degree of homology between the sequences of the cloned gene and genes from an Asian strain and an African strain. A 14 base oligodeoxynucleotide with a sequence from the central repeat region hybridized to all strains tested. We conclude that the CS protein gene is highly conserved among strains of P. falciparum and that malaria vaccine development with the CS protein is unlikely to be complicated by strain variation.
An Ultra-High Discrimination Y Chromosome Short Tandem Repeat Multiplex DNA Typing System

PubMed Central

Hanson, Erin K.; Ballantyne, Jack

2007-01-01

In forensic casework, Y chromosome short tandem repeat markers (Y-STRs) are often used to identify a male donor DNA profile in the presence of excess quantities of female DNA, such as is found in many sexual assault investigations. Commercially available Y-STR multiplexes incorporating 12–17 loci are currently used in forensic casework (Promega's PowerPlex® Y and Applied Biosystems' AmpFlSTR® Yfiler®). Despite the robustness of these commercial multiplex Y-STR systems and the ability to discriminate two male individuals in most cases, the coincidence match probabilities between unrelated males are modest compared with the standard set of autosomal STR markers. Hence there is still a need to develop new multiplex systems to supplement these for those cases where additional discriminatory power is desired or where there is a coincidental Y-STR match between potential male participants. Over 400 Y-STR loci have been identified on the Y chromosome. While these have the potential to increase the discrimination potential afforded by the commercially available kits, many have not been well characterized. In the present work, 91 loci were tested for their relative ability to increase the discrimination potential of the commonly used ‘core’ Y-STR loci. The result of this extensive evaluation was the development of an ultra high discrimination (UHD) multiplex DNA typing system that allows for the robust co-amplification of 14 non-core Y-STR loci. Population studies with a mixed African American and American Caucasian sample set (n = 572) indicated that the overall discriminatory potential of the UHD multiplex was superior to all commercial kits tested. The combined use of the UHD multiplex and the Applied Biosystems' AmpFlSTR® Yfiler® kit resulted in 100% discrimination of all individuals within the sample set, which presages its potential to maximally augment currently available forensic casework markers. It could also find applications in human evolutionary
Multiple-locus variable-number tandem repeat analysis for molecular typing of Aspergillus fumigatus

PubMed Central

2010-01-01

Background Multiple-locus variable-number tandem repeat (VNTR) analysis (MLVA) is a prominent subtyping method to resolve closely related microbial isolates to provide information for establishing genetic patterns among isolates and to investigate disease outbreaks. The usefulness of MLVA was recently demonstrated for the avian major pathogen Chlamydophila psittaci. In the present study, we developed a similar method for another pathogen of birds: the filamentous fungus Aspergillus fumigatus. Results We selected 10 VNTR markers located on 4 different chromosomes (1, 5, 6 and 8) of A. fumigatus. These markers were tested with 57 unrelated isolates from different hosts or their environment (53 isolates from avian species in France, China or Morocco, 3 isolates from humans collected at CHU Henri Mondor hospital in France and the reference strain CBS 144.89). The Simpson index for individual markers ranged from 0.5771 to 0.8530. A combined loci index calculated with all the markers yielded an index of 0.9994. In a second step, the panel of 10 markers was used in different epidemiological situations and tested on 277 isolates, including 62 isolates from birds in Guangxi province in China, 95 isolates collected in two duck farms in France and 120 environmental isolates from a turkey hatchery in France. A database was created with the results of the present study http://minisatellites.u-psud.fr/MLVAnet/. Three major clusters of isolates were defined by using the graphing algorithm termed Minimum Spanning Tree (MST). The first cluster comprised most of the avian isolates collected in the two duck farms in France, the second cluster comprised most of the avian isolates collected in poultry farms in China and the third one comprised most of the isolates collected in the turkey hatchery in France. Conclusions MLVA displayed excellent discriminatory power. The method showed a good reproducibility. MST analysis revealed an interesting clustering with a clear separation between
Synthetic Peptide Arrays for Pathway-Level Protein Monitoring by Liquid Chromatography-Tandem Mass Spectrometry*

PubMed Central

Hewel, Johannes A.; Liu, Jian; Onishi, Kento; Fong, Vincent; Chandran, Shamanta; Olsen, Jonathan B.; Pogoutse, Oxana; Schutkowski, Mike; Wenschuh, Holger; Winkler, Dirk F. H.; Eckler, Larry; Zandstra, Peter W.; Emili, Andrew

2010-01-01

Effective methods to detect and quantify functionally linked regulatory proteins in complex biological samples are essential for investigating mammalian signaling pathways. Traditional immunoassays depend on proprietary reagents that are difficult to generate and multiplex, whereas global proteomic profiling can be tedious and can miss low abundance proteins. Here, we report a target-driven liquid chromatography-tandem mass spectrometry (LC-MS/MS) strategy for selectively examining the levels of multiple low abundance components of signaling pathways which are refractory to standard shotgun screening procedures and hence appear limited in current MS/MS repositories. Our stepwise approach consists of: (i) synthesizing microscale peptide arrays, including heavy isotope-labeled internal standards, for use as high quality references to (ii) build empirically validated high density LC-MS/MS detection assays with a retention time scheduling system that can be used to (iii) identify and quantify endogenous low abundance protein targets in complex biological mixtures with high accuracy by correlation to a spectral database using new software tools. The method offers a flexible, rapid, and cost-effective means for routine proteomic exploration of biological systems including “label-free” quantification, while minimizing spurious interferences. As proof-of-concept, we have examined the abundance of transcription factors and protein kinases mediating pluripotency and self-renewal in embryonic stem cell populations. PMID:20467045
Poly-dipeptides encoded by the C9ORF72 repeats block global protein translation.

PubMed

Kanekura, Kohsuke; Yagi, Takuya; Cammack, Alexander J; Mahadevan, Jana; Kuroda, Masahiko; Harms, Matthew B; Miller, Timothy M; Urano, Fumihiko

2016-05-01

The expansion of the GGGGCC hexanucleotide repeat in the non-coding region of the Chromosome 9 open-reading frame 72 (C9orf72) gene is the most common genetic cause of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (ALS). This genetic alteration leads to the accumulation of five types of poly-dipeptides translated from the GGGGCC hexanucleotide repeat. Among these, poly-proline-arginine (poly-PR) and poly-glycine-arginine (poly-GR) peptides are known to be neurotoxic. However, the mechanisms of neurotoxicity associated with these poly-dipeptides are not clear. A proteomics approach identified a number of interacting proteins with poly-PR peptide, including mRNA-binding proteins, ribosomal proteins, translation initiation factors and translation elongation factors. Immunostaining of brain sections from patients with C9orf72 ALS showed that poly-GR was colocalized with a mRNA-binding protein, hnRNPA1. In vitro translation assays showed that poly-PR and poly-GR peptides made insoluble complexes with mRNA, restrained the access of translation factors to mRNA, and blocked protein translation. Our results demonstrate that impaired protein translation mediated by poly-PR and poly-GR peptides plays a role in neurotoxicity and reveal that the pathways altered by the poly-dipeptides-mRNA complexes are potential therapeutic targets for treatment of C9orf72 FTD/ALS. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Profiling of kidney vascular endothelial cell plasma membrane proteins by liquid chromatography-tandem mass spectrometry.

PubMed

Liu, Zan; Xu, Bo; Nameta, Masaaki; Zhang, Ying; Magdeldin, Sameh; Yoshida, Yutaka; Yamamoto, Keiko; Fujinaka, Hidehiko; Yaoita, Eishin; Tasaki, Masayuki; Nakagawa, Yuki; Saito, Kazuhide; Takahashi, Kota; Yamamoto, Tadashi

2013-06-01

Vascular endothelial cells (VECs) play crucial roles in physiological and pathologic conditions in tissues and organs. Most of these roles are related to VEC plasma membrane proteins. In the kidney, VECs are closely associated with structures and functions; however, plasma membrane proteins in kidney VECs remain to be fully elucidated. Rat kidneys were perfused with cationic colloidal silica nanoparticles (CCSN) to label the VEC plasma membrane. The CCSN-labeled plasma membrane fraction was collected by gradient ultracentrifugation. The VEC plasma membrane or whole-kidney lysate proteins were separated by sodium dodecyl sulfate polyacrylamide gel electrophoresis and digested with trypsin in gels for liquid chromatography-tandem mass spectrometry. Enrichment analysis was then performed. The VEC plasma membrane proteins were purified by the CCSN method with high yield (approximately 20 μg from 1 g of rat kidney). By Mascot search, 582 proteins were identified in the VEC plasma membrane fraction, and 1,205 proteins were identified in the kidney lysate. In addition to 16 VEC marker proteins such as integrin beta-1 and intercellular adhesion molecule-2 (ICAM-2), 8 novel proteins such as Deltex 3-like protein and phosphatidylinositol binding clathrin assembly protein (PICALM) were identified. As expected, many key functions of plasma membranes in general and of endothelial cells in particular (i.e., leukocyte adhesion) were significantly overrepresented in the proteome of CCSN-labeled kidney VEC fraction. The CCSN method is a reliable technique for isolation of VEC plasma membrane from the kidney, and proteomic analysis followed by bioinformatics revealed the characteristics of in vivo VECs in the kidney.
Second generation subtyping: a proposed PulseNet protocol for multiple-locus variable-number tandem repeat analysis of Shiga toxin-producing Escherichia coli O157 (STEC O157).

PubMed

Hyytiä-Trees, Eija; Smole, Sandra C; Fields, Patricia A; Swaminathan, Bala; Ribot, Efrain M

2006-01-01

Most bacterial genomes contain tandem duplications of short DNA sequences, termed "variable-number tandem repeats" (VNTR). A subtyping method targeting these repeats, multiple-locus VNTR analysis (MLVA), has emerged as a powerful tool for characterization of clonal organisms such as Shiga toxin-producing Escherichia coli O157 (STEC O157). We modified and optimized a recently published MLVA scheme targeting 29 polymorphic VNTR regions of STEC O157 to render it suitable for routine use by public health laboratories that participate in PulseNet, the national and international molecular subtyping network for foodborne disease surveillance. Nine VNTR loci were included in the final protocol. They were amplified in three PCR reactions, after which the PCR products were sized using capillary electrophoresis. Two hundred geographically diverse, sporadic and outbreak- related STEC O157 isolates were characterized by MLVA and the results were compared with data obtained by pulsed-field gel electrophoresis (PFGE) using XbaI macrorestriction of genomic DNA. A total of 139 unique XbaI PFGE patterns and 162 MLVA types were identified. A subset of 100 isolates characterized by both XbaI and BlnI macrorestriction had 62 unique PFGE and MLVA types. Although the clustering of isolates by the two subtyping systems was generally in agreement, some discrepancies were observed. Importantly, MLVA was able to discriminate among some epidemiologically unrelated isolates which were indistinguishable by PFGE. However, among strains from three of the eight outbreaks included in the study, two single locus MLVA variants and one double locus variant were detected among epidemiologically implicated isolates that were indistinguishable by PFGE. Conversely, in three other outbreaks, isolates that were indistinguishable by MLVA displayed multiple PFGE types. An additional more extensive multi-laboratory validation of the MLVA protocol is in progress in order to address critical issues such as
A Comparative Proteomic Analysis of the Simple Amino Acid Repeat Distributions in Plasmodia Reveals Lineage Specific Amino Acid Selection

PubMed Central

Dalby, Andrew R.

2009-01-01

Background Microsatellites have been used extensively in the field of comparative genomics. By studying microsatellites in coding regions we have a simple model of how genotypic changes undergo selection as they are directly expressed in the phenotype as altered proteins. The simplest of these tandem repeats in coding regions are the tri-nucleotide repeats which produce a repeat of a single amino acid when translated into proteins. Tri-nucleotide repeats are often disease associated, and are also known to be unstable to both expansion and contraction. This makes them sensitive markers for studying proteome evolution, in closely related species. Results The evolutionary history of the family of malarial causing parasites Plasmodia is complex because of the life-cycle of the organism, where it interacts with a number of different hosts and goes through a series of tissue specific stages. This study shows that the divergence between the primate and rodent malarial parasites has resulted in a lineage specific change in the simple amino acid repeat distribution that is correlated to A–T content. The paper also shows that this altered use of amino acids in SAARs is consistent with the repeat distributions being under selective pressure. Conclusions The study shows that simple amino acid repeat distributions can be used to group related species and to examine their phylogenetic relationships. This study also shows that an outgroup species with a similar A–T content can be distinguished based only on the amino acid usage in repeats, and suggest that this might be a useful feature for proteome clustering. The lineage specific use of amino acids in repeat regions suggests that comparative studies of SAAR distributions between proteomes gives an insight into the mechanisms of expansion and the selective pressures acting on the organism. PMID:19597555
Molecular characterization of Shiga-toxigenic Escherichia coli isolated from diverse sources from India by multi-locus variable number tandem repeat analysis (MLVA).

PubMed

Kumar, A; Taneja, N; Sharma, R K; Sharma, H; Ramamurthy, T; Sharma, M

2014-12-01

In a first study from India, a diverse collection of 140 environmental and clinical non-O157 Shiga-toxigenic Escherichia coli strains from a large geographical area in north India was typed by multi-locus variable number tandem repeat analysis (MLVA). The distribution of major virulence genes stx1, stx2 and eae was found to be 78%, 70% and 10%, respectively; 15 isolates were enterohaemorrhagic E. coli (stx1 +/stx2 + and eae +). By MLVA analysis, 44 different alleles were obtained. Dendrogram analysis revealed 104 different genotypes and 19 MLVA-type complexes divided into two main lineages, i.e. mutton and animal stool. Human isolates presented a statistically significant greater odds ratio for clustering with mutton samples compared to animal stool isolates. Five human isolates clustered with animal stool strains suggesting that some of the human infections may be from cattle, perhaps through milk, contact or the environment. Further epidemiological studies are required to explore these sources in context with occurrence of human cases.
A multiple-locus variable-number tandem repeat analysis (MLVA) of Listeria monocytogenes isolated from Norwegian salmon-processing factories and from listeriosis patients.

PubMed

Lunestad, B T; Truong, T T T; Lindstedt, B-A

2013-10-01

The objective of this study was to characterize Listeria monocytogenes isolated from farmed Atlantic salmon (Salmo salar) and the processing environment in three different Norwegian factories, and compare these to clinical isolates by multiple-locus variable-number tandem repeat analysis (MLVA). The 65 L. monocytogenes isolates obtained gave 15 distinct MLVA profiles. There was great heterogeneity in the distribution of MLVA profiles in factories and within each factory. Nine of the 15 MLVA profiles found in the fish-associated isolates were found to match human profiles. The MLVA profile 07-07-09-10-06 was the most common strain in Norwegian listeriosis patients. L. monocytogenes with this profile has previously been associated with at least two known listeriosis outbreaks in Norway, neither determined to be due to fish consumption. However, since this profile was also found in fish and in the processing environment, fish should be considered as a possible food vehicle during sporadic cases and outbreaks of listeriosis.
Classification of proteins with shared motifs and internal repeats in the ECOD database

PubMed Central

Kinch, Lisa N.; Liao, Yuxing

2016-01-01

Abstract Proteins and their domains evolve by a set of events commonly including the duplication and divergence of small motifs. The presence of short repetitive regions in domains has generally constituted a difficult case for structural domain classifications and their hierarchies. We developed the Evolutionary Classification Of protein Domains (ECOD) in part to implement a new schema for the classification of these types of proteins. Here we document the ways in which ECOD classifies proteins with small internal repeats, widespread functional motifs, and assemblies of small domain‐like fragments in its evolutionary schema. We illustrate the ways in which the structural genomics project impacted the classification and characterization of new structural domains and sequence families over the decade. PMID:26833690
DNA triplet repeats mediate heterochromatin-protein-1-sensitive variegated gene silencing.

PubMed

Saveliev, Alexander; Everett, Christopher; Sharpe, Tammy; Webster, Zoë; Festenstein, Richard

2003-04-24

Gene repression is crucial to the maintenance of differentiated cell types in multicellular organisms, whereas aberrant silencing can lead to disease. The organization of DNA into chromatin and heterochromatin is implicated in gene silencing. In chromatin, DNA wraps around histones, creating nucleosomes. Further condensation of chromatin, associated with large blocks of repetitive DNA sequences, is known as heterochromatin. Position effect variegation (PEV) occurs when a gene is located abnormally close to heterochromatin, silencing the affected gene in a proportion of cells. Here we show that the relatively short triplet-repeat expansions found in myotonic dystrophy and Friedreich's ataxia confer variegation of expression on a linked transgene in mice. Silencing was correlated with a decrease in promoter accessibility and was enhanced by the classical PEV modifier heterochromatin protein 1 (HP1). Notably, triplet-repeat-associated variegation was not restricted to classical heterochromatic regions but occurred irrespective of chromosomal location. Because the phenomenon described here shares important features with PEV, the mechanisms underlying heterochromatin-mediated silencing might have a role in gene regulation at many sites throughout the mammalian genome and modulate the extent of gene silencing and hence severity in several triplet-repeat diseases.
Identification of proteins associated with the yeast mitochondrial RNA polymerase by tandem affinity purification

PubMed Central

Markov, Dmitriy A; Savkina, Maria; Anikin, Michael; Del Campo, Mark; Ecker, Karen; Lambowitz, Alan M; De Gnore, Jon P; McAllister, William T

2009-01-01

The abundance of mitochondrial (mt) transcripts varies under different conditions, and is thought to depend upon rates of transcription initiation, transcription termination/attenuation and RNA processing/degradation. The requirement to maintain the balance between RNA synthesis and processing may involve coordination between these processes; however, little is known about factors that regulate the activity of mtRNA polymerase (mtRNAP). Recent attempts to identify mtRNAP–protein interactions in yeast by means of a generalized tandem affinity purification (TAP) protocol were not successful, most likely because they involved a C-terminal mtRNAP–TAP fusion (which is incompatible with mtRNAP function) and because of the use of whole-cell solubilization protocols that did not preserve the integrity of mt protein complexes. Based upon the structure of T7 RNAP (to which mtRNAPs show high sequence similarity), we identified positions in yeast mtRNAP that allow insertion of a small affinity tag, confirmed the mature N-terminus, constructed a functional N-terminal TAP–mtRNAP fusion, pulled down associated proteins, and identified them by LC–MS–MS. Among the proteins found in the pull-down were a DEAD-box protein (Mss116p) and an RNA-binding protein (Pet127p). Previous genetic experiments suggested a role for these proteins in linking transcription and RNA degradation, in that a defect in the mt degradadosome could be suppressed by overexpression of either of these proteins or, independently, by mutations in either mtRNAP or its initiation factor Mtf1p. Further, we found that Mss116p inhibits transcription by mtRNAP in vitro in a steady-state reaction. Our results support the hypothesis that Mss116p and Pet127p are involved in modulation of mtRNAP activity. Copyright © 2009 John Wiley & Sons, Ltd. PMID:19536766
Accurate quantification of chromosomal lesions via short tandem repeat analysis using minimal amounts of DNA.

PubMed

Jann, Johann-Christoph; Nowak, Daniel; Nolte, Florian; Fey, Stephanie; Nowak, Verena; Obländer, Julia; Pressler, Jovita; Palme, Iris; Xanthopoulos, Christina; Fabarius, Alice; Platzbecker, Uwe; Giagounidis, Aristoteles; Götze, Katharina; Letsch, Anne; Haase, Detlef; Schlenk, Richard; Bug, Gesine; Lübbert, Michael; Ganser, Arnold; Germing, Ulrich; Haferlach, Claudia; Hofmann, Wolf-Karsten; Mossner, Maximilian

2017-09-01

Cytogenetic aberrations such as deletion of chromosome 5q (del(5q)) represent key elements in routine clinical diagnostics of haematological malignancies. Currently established methods such as metaphase cytogenetics, FISH or array-based approaches have limitations due to their dependency on viable cells, high costs or semi-quantitative nature. Importantly, they cannot be used on low abundance DNA. We therefore aimed to establish a robust and quantitative technique that overcomes these shortcomings. For precise determination of del(5q) cell fractions, we developed an inexpensive multiplex-PCR assay requiring only nanograms of DNA that simultaneously measures allelic imbalances of 12 independent short tandem repeat markers. Application of this method to n=1142 samples from n=260 individuals revealed strong intermarker concordance (R²=0.77-0.97) and reproducibility (mean SD: 1.7%). Notably, the assay showed accurate quantification via standard curve assessment (R²>0.99) and high concordance with paired FISH measurements (R²=0.92) even with subnanogram amounts of DNA. Moreover, cytogenetic response was reliably confirmed in del(5q) patients with myelodysplastic syndromes treated with lenalidomide. While the assay demonstrated good diagnostic accuracy in receiver operating characteristic analysis (area under the curve: 0.97), we further observed robust correlation between bone marrow and peripheral blood samples (R²=0.79), suggesting its potential suitability for less-invasive clonal monitoring. In conclusion, we present an adaptable tool for quantification of chromosomal aberrations, particularly in problematic samples, which should be easily applicable to further tumour entities. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Clostridium botulinum group I strain genotyping by 15-locus multilocus variable-number tandem-repeat analysis.

PubMed

Fillo, Silvia; Giordani, Francesco; Anniballi, Fabrizio; Gorgé, Olivier; Ramisse, Vincent; Vergnaud, Gilles; Riehm, Julia M; Scholz, Holger C; Splettstoesser, Wolf D; Kieboom, Jasper; Olsen, Jaran-Strand; Fenicia, Lucia; Lista, Florigio

2011-12-01

Clostridium botulinum is a taxonomic designation that encompasses a broad variety of spore-forming, Gram-positive bacteria producing the botulinum neurotoxin (BoNT). C. botulinum is the etiologic agent of botulism, a rare but severe neuroparalytic disease. Fine-resolution genetic characterization of C. botulinum isolates of any BoNT type is relevant for both epidemiological studies and forensic microbiology. A 10-locus multiple-locus variable-number tandem-repeat analysis (MLVA) was previously applied to isolates of C. botulinum type A. The present study includes five additional loci designed to better address proteolytic B and F serotypes. We investigated 79 C. botulinum group I strains isolated from human and food samples in several European countries, including types A (28), B (36), AB (4), and F (11) strains, and 5 nontoxic Clostridium sporogenes. Additional data were deduced from in silico analysis of 10 available fully sequenced genomes. This 15-locus MLVA (MLVA-15) scheme identified 86 distinct genotypes that clustered consistently with the results of amplified fragment length polymorphism (AFLP) and MLVA genotyping in previous reports. An MLVA-7 scheme, a subset of the MLVA-15, performed on a lab-on-a-chip device using a nonfluorescent subset of primers, is also proposed as a first-line assay. The phylogenetic grouping obtained with the MLVA-7 does not differ significantly from that generated by the MLVA-15. To our knowledge, this report is the first to analyze genetic variability among all of the C. botulinum group I serotypes by MLVA. Our data provide new insights into the genetic variability of group I C. botulinum isolates worldwide and demonstrate that this group is genetically highly diverse.
Population genetic study of 10 short tandem repeat loci from 600 domestic dogs in Korea.

PubMed

Moon, Seo Hyun; Jang, Yoon-Jeong; Han, Myun Soo; Cho, Myung-Haing

2016-09-30

Dogs have long shared close relationships with many humans. Due to the large number of dogs in human populations, they are often involved in crimes. Occasionally, canine biological evidence such as saliva, bloodstains and hairs can be found at crime scenes. Accordingly, canine DNA can be used as forensic evidence. The use of short tandem repeat (STR) loci from biological evidence is valuable for forensic investigations. In Korea, canine STR profiling-related crimes are being successfully analyzed, leading to diverse crimes such as animal cruelty, dog-attacks, murder, robbery, and missing and abandoned dogs being solved. However, the probability of random DNA profile matches cannot be analyzed because of a lack of canine STR data. Therefore, in this study, 10 STR loci were analyzed in 600 dogs in Korea (344 dogs belonging to 30 different purebreds and 256 crossbred dogs) to estimate canine forensic genetic parameters. Among purebred dogs, a separate statistical analysis was conducted for five major subgroups, 97 Maltese, 47 Poodles, 31 Shih Tzus, 32 Yorkshire Terriers, and 25 Pomeranians. Allele frequencies, expected (Hexp) and observed heterozygosity (Hobs), fixation index (F), probability of identity (P(ID)), probability of sibling identity (P(ID)sib) and probability of exclusion (PE) were then calculated. The Hexp values ranged from 0.901 (PEZ12) to 0.634 (FHC2079), while the P(ID)sib values were between 0.481 (FHC2079) and 0.304 (PEZ12) and the P(ID)sib was about 3.35 × 10(-)⁵ for the combination of all 10 loci. The results presented herein will strengthen the value of canine DNA to solving dog-related crimes.

Simulation of Two Dimensional Electrophoresis and Tandem Mass Spectrometry for Teaching Proteomics

ERIC Educational Resources Information Center

Fisher, Amanda; Sekera, Emily; Payne, Jill; Craig, Paul

2012-01-01

In proteomics, complex mixtures of proteins are separated (usually by chromatography or electrophoresis) and identified by mass spectrometry. We have created 2DE Tandem MS, a computer program designed for use in the biochemistry, proteomics, or bioinformatics classroom. It contains two simulations--2D electrophoresis and tandem mass spectrometry.…
Development of new multilocus variable number of tandem repeat analysis (MLVA) for Listeria innocua and its application in a food processing plant.

PubMed

Takahashi, Hajime; Ohshima, Chihiro; Nakagawa, Miku; Thanatsang, Krittaporn; Phraephaisarn, Chirapiphat; Chaturongkasumrit, Yuphakhun; Keeratipibul, Suwimon; Kuda, Takashi; Kimura, Bon

2014-01-01

Listeria innocua is an important hygiene indicator bacterium in food industries because it behaves similar to Listeria monocytogenes, which is pathogenic to humans. PFGE is often used to characterize bacterial strains and to track contamination source. However, because PFGE is an expensive, complicated, time-consuming protocol, and poses difficulty in data sharing, development of a new typing method is necessary. MLVA is a technique that identifies bacterial strains on the basis of the number of tandem repeats present in the genome varies depending on the strains. MLVA has gained attention due to its high reproducibility and ease of data sharing. In this study, we developed a MLVA protocol to assess L. innocua and evaluated it by tracking the contamination source of L. innocua in an actual food manufacturing factory by typing the bacterial strains isolated from the factory. Three VNTR regions of the L. innocua genome were chosen for use in the MLVA. The number of repeat units in each VNTR region was calculated based on the results of PCR product analysis using capillary electrophoresis (CE). The calculated number of repetitions was compared with the results of the gene sequence analysis to demonstrate the accuracy of the CE repeat number analysis. The developed technique was evaluated using 60 L. innocua strains isolated from a food factory. These 60 strains were classified into 11 patterns using MLVA. Many of the strains were classified into ST-6, revealing that this MLVA strain type can contaminate each manufacturing process in the factory. The MLVA protocol developed in this study for L. innocua allowed rapid and easy analysis through the use of CE. This technique was found to be very useful in hygiene control in factories because it allowed us to track contamination sources and provided information regarding whether the bacteria were present in the factories.
Insights into the Aggregation Mechanism of PolyQ Proteins with Different Glutamine Repeat Lengths.

PubMed

Yushchenko, Tetyana; Deuerling, Elke; Hauser, Karin

2018-04-24

Polyglutamine (polyQ) diseases, including Huntington's disease, result from the aggregation of an abnormally expanded polyQ repeat in the affected protein. The length of the polyQ repeat is essential for the disease's onset; however, the molecular mechanism of polyQ aggregation is still poorly understood. Controlled conditions and initiation of the aggregation process are prerequisites for the detection of transient intermediate states. We present an attenuated total reflection Fourier-transform infrared spectroscopic approach combined with protein immobilization to study polyQ aggregation dependent on the polyQ length. PolyQ proteins were engineered mimicking the mammalian N-terminus fragment of the Huntingtin protein and containing a polyQ sequence with the number of glutamines below (Q11), close to (Q38), and above (Q56) the disease threshold. A monolayer of the polyQ construct was chemically immobilized on the internal reflection element of the attenuated total reflection cell, and the aggregation was initiated via enzymatic cleavage. Structural changes of the polyQ sequence were monitored by time-resolved infrared difference spectroscopy. We observed faster aggregation kinetics for the longer sequences, and furthermore, we could distinguish β-structured intermediates for the different constructs, allowing us to propose aggregation mechanisms dependent on the repeat length. Q11 forms a β-structured aggregate by intermolecular interaction of stretched monomers, whereas Q38 and Q56 undergo conformational changes to various β-structured intermediates, including intramolecular β-sheets. Copyright © 2018 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Development and Validation of a Multiplexed Protein Quantitation Assay for the Determination of Three Recombinant Proteins in Soybean Tissues by Liquid Chromatography with Tandem Mass Spectrometry.

PubMed

Hill, Ryan C; Oman, Trent J; Shan, Guomin; Schafer, Barry; Eble, Julie; Chen, Cynthia

2015-08-26

Currently, traditional immunochemistry technologies such as enzyme-linked immunosorbent assays (ELISA) are the predominant analytical tool used to measure levels of recombinant proteins expressed in genetically engineered (GE) plants. Recent advances in agricultural biotechnology have created a need to develop methods capable of selectively detecting and quantifying multiple proteins in complex matrices because of increasing numbers of transgenic proteins being coexpressed or "stacked" to achieve tolerance to multiple herbicides or to provide multiple modes of action for insect control. A multiplexing analytical method utilizing liquid chromatography with tandem mass spectrometry (LC-MS/MS) has been developed and validated to quantify three herbicide-tolerant proteins in soybean tissues: aryloxyalkanoate dioxygenase (AAD-12), 5-enol-pyruvylshikimate-3-phosphate synthase (2mEPSPS), and phosphinothricin acetyltransferase (PAT). Results from the validation showed high recovery and precision over multiple analysts and laboratories. Results from this method were comparable to those obtained with ELISA with respect to protein quantitation, and the described method was demonstrated to be suitable for multiplex quantitation of transgenic proteins in GE crops.
Characterization of a tandemly repeated DNA sequence family originally derived by retroposition of tRNA(Glu) in the newt.

PubMed

Nagahashi, S; Endoh, H; Suzuki, Y; Okada, N

1991-11-20

A previous report from this laboratory showed that in vitro transcription of total genomic DNA of the newt Cynopus pyrrhogaster resulted in a discrete sized 8 S RNA, which represented highly repetitive and transcribable sequences with a glutamic acid tRNA-like structure in the newt genome. We isolated four independent clones from a newt genomic library and determined the complete sequences of three 2000 to 2400 base-pair PstI fragments spanning the 8 S RNA gene. The glutamic acid tRNA-related segment in the 8 S RNA gene contains the CCA sequence expected as the 3' terminus of a tRNA molecule. Further, the 11 nucleotides located 13 nucleotides upstream from one of the two transcription initiation sites of the 8 S RNA were found to be repeated in the region upstream from the termination site, suggesting that the original unit, which is shorter than the 8 S RNA, was retrotransposed via cDNA intermediates from the PolIII transcript. In the upstream region of the 8 S RNA gene, a 360 nucleotide unit containing the glutamic acid tRNA-related segment was found to be duplicated (clones NE1 and NE10) or triplicated (clone NE3). Except for the difference in the number of the 360 nucleotide unit, the three sequences of the 2000 to 2400 base-pair PstI fragment were essentially the same with only a few mutations and minor deletions. Inverse polymerase chain reaction and sequence determination of the products, together with a Southern hybridization experiment, demonstrated that the family consists of a tandemly repeated unit of 3300, 3700 or 4100 base-pairs. Thus during evolution, this family in the newt was created by retroposition via cDNA intermediates, followed by duplication or triplication of the 360 nucleotide unit and multiplication of the 3300 to 4100 base-pair region at the DNA level.
PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile.

PubMed

Pereira, Luísa; Alshamali, Farida; Andreassen, Rune; Ballard, Ruth; Chantratita, Wasun; Cho, Nam Soo; Coudray, Clotilde; Dugoujon, Jean-Michel; Espinoza, Marta; González-Andrade, Fabricio; Hadi, Sibte; Immel, Uta-Dorothee; Marian, Catalin; Gonzalez-Martin, Antonio; Mertens, Gerhard; Parson, Walther; Perone, Carlos; Prieto, Lourdes; Takeshita, Haruo; Rangel Villalobos, Héctor; Zeng, Zhaoshu; Zhivotovsky, Lev; Camacho, Rui; Fonseca, Nuno A

2011-09-01

Because of their sensitivity and high level of discrimination, short tandem repeat (STR) maker systems are currently the method of choice in routine forensic casework and data banking, usually in multiplexes up to 15-17 loci. Constraints related to sample amount and quality, frequently encountered in forensic casework, will not allow to change this picture in the near future, notwithstanding the technological developments. In this study, we present a free online calculator named PopAffiliator ( http://cracs.fc.up.pt/popaffiliator ) for individual population affiliation in the three main population groups, Eurasian, East Asian and sub-Saharan African, based on genotype profiles for the common set of STRs used in forensics. This calculator performs affiliation based on a model constructed using machine learning techniques. The model was constructed using a data set of approximately fifteen thousand individuals collected for this work. The accuracy of individual population affiliation is approximately 86%, showing that the common set of STRs routinely used in forensics provide a considerable amount of information for population assignment, in addition to being excellent for individual identification.
Variable number of tandem repeat polymorphisms of DRD4: re-evaluation of selection hypothesis and analysis of association with schizophrenia

PubMed Central

Hattori, Eiji; Nakajima, Mizuho; Yamada, Kazuo; Iwayama, Yoshimi; Toyota, Tomoko; Saitou, Naruya; Yoshikawa, Takeo

2009-01-01

Associations have been reported between the variable number of tandem repeat (VNTR) polymorphisms in the exon 3 of dopamine D4 receptor gene gene and multiple psychiatric illnesses/traits. We examined the distribution of VNTR alleles of different length in a Japanese cohort and found that, as reported earlier, the size of allele ‘7R' was much rarer (0.5%) in Japanese than in Caucasian populations (∼20%). This presents a challenge to an earlier proposed hypothesis that positive selection favoring the allele 7R has contributed to its high frequency. To further address the issue of selection, we carried out sequencing of the VNTR region not only from human but also from chimpanzee samples, and made inference on the ancestral repeat motif and haplotype by use of a phylogenetic analysis program. The most common 4R variant was considered to be the ancestral haplotype as earlier proposed. However, in a gene tree of VNTR constructed on the basis of this inferred ancestral haplotype, the allele 7R had five descendent haplotypes in relatively long lineage, where genetic drift can have major influence. We also tested this length polymorphism for association with schizophrenia, studying two Japanese sample sets (one with 570 cases and 570 controls, and the other with 124 pedigrees). No evidence of association between the allele 7R and schizophrenia was found in any of the two data sets. Collectively, this study suggests that the VNTR variation does not have an effect large enough to cause either selection or a detectable association with schizophrenia in a study of samples of moderate size. PMID:19092778
Revisiting the Roco G-protein cycle.

PubMed

Terheyden, Susanne; Ho, Franz Y; Gilsbach, Bernd K; Wittinghofer, Alfred; Kortholt, Arjan

2015-01-01

Mutations in leucine-rich-repeat kinase 2 (LRRK2) are the most frequent cause of late-onset Parkinson's disease (PD). LRRK2 belongs to the Roco family of proteins which share a conserved Ras-like G-domain (Roc) and a C-terminal of Roc (COR) domain tandem. The nucleotide state of small G-proteins is strictly controlled by guanine-nucleotide-exchange factors (GEFs) and GTPase-activating proteins (GAPs). Because of contradictory structural and biochemical data, the regulatory mechanism of the LRRK2 Roc G-domain and the RocCOR tandem is still under debate. In the present study, we solved the first nucleotide-bound Roc structure and used LRRK2 and bacterial Roco proteins to characterize the RocCOR function in more detail. Nucleotide binding induces a drastic structural change in the Roc/COR domain interface, a region strongly implicated in patients with an LRRK2 mutation. Our data confirm previous assumptions that the C-terminal subdomain of COR functions as a dimerization device. We show that the dimer formation is independent of nucleotide. The affinity for GDP/GTP is in the micromolar range, the result of which is high dissociation rates in the s-1 range. Thus Roco proteins are unlikely to need GEFs to achieve activation. Monomeric LRRK2 and Roco G-domains have a similar low GTPase activity to small G-proteins. We show that GTPase activity in bacterial Roco is stimulated by the nucleotide-dependent dimerization of the G-domain within the complex. We thus propose that the Roco proteins do not require GAPs to stimulate GTP hydrolysis but stimulate each other by one monomer completing the catalytic machinery of the other.
[Discriminatory power of variable number on tandem repeats loci for genotyping Mycobacterium tuberculosis strains in China].

PubMed

Chen, H X; Cai, C; Liu, J Y; Zhang, Z G; Yuan, M; Jia, J N; Sun, Z G; Huang, H R; Gao, J M; Li, W M

2017-06-10

Objective: Using the standard genotype method, variable number of tandem repeats (VNTR), we constructed a VNTR database to cover all provinces and proposed a set of optimized VNTR loci combinations for each province, in order to improve the preventive and control programs on tuberculosis, in China. Methods: A total of 15 loci VNTR was used to analyze 4 116 Mycobacterium tuberculosis strains, isolated from national survey of Drug Resistant Tuberculosis, in 2007. Hunter-Gaston Index (HGI) was also used to analyze the discriminatory power of each VNTR site. A set combination of 12-VNTR, 10-VNTR, 8-VNTR and 5-VNTR was respectively constructed for each province, based on 1) epidemic characteristics of M. tuberculosis lineages in China, with high discriminatory power and genetic stability. Results: Through the completed 15 loci VNTR patterns of 3 966 strains under 96.36 % (3 966/4 116) coverage, we found seven high HGI loci (including QUB11b and MIRU26) as well as low stable loci (including QUB26, MIRU16, Mtub21 and QUB11b) in several areas. In all the 31 provinces, we found an optimization VNTR combination as 10-VNTR loci in Inner Mongolia, Chongqing and Heilongjiang, but with 8-VNTR combination shared in other provinces. Conclusions: It is necessary to not only use the VNTR database for tracing the source of infection and cluster of M. tuberculosis in the nation but also using the set of optimized VNTR combinations in monitoring those local epidemics and M. tuberculosis (genetics in local) population.
Expression of Anaplasma marginale ankyrin repeat-containing proteins during infection of the mammalian host and tick vector

USDA-ARS?s Scientific Manuscript database

Using searches of the NCBI conserved domain database and SMART genomic architecture analysis, we identified three ankyrin repeat-containing genes in Anaplasma marginale: AM705, AM926 and AM638. Recombinant protein was used to immunize mice and generate fusion hybridomas secreting protein-specific mo...
Tandem mass spectrometry for the detection of plant pathogenic fungi and the effects of database composition on protein inferences.

PubMed

Padliya, Neerav D; Garrett, Wesley M; Campbell, Kimberly B; Tabb, David L; Cooper, Bret

2007-11-01

LC-MS/MS has demonstrated potential for detecting plant pathogens. Unlike PCR or ELISA, LC-MS/MS does not require pathogen-specific reagents for the detection of pathogen-specific proteins and peptides. However, the MS/MS approach we and others have explored does require a protein sequence reference database and database-search software to interpret tandem mass spectra. To evaluate the limitations of database composition on pathogen identification, we analyzed proteins from cultured Ustilago maydis, Phytophthora sojae, Fusarium graminearum, and Rhizoctonia solani by LC-MS/MS. When the search database did not contain sequences for a target pathogen, or contained sequences to related pathogens, target pathogen spectra were reliably matched to protein sequences from nontarget organisms, giving an illusion that proteins from nontarget organisms were identified. Our analysis demonstrates that when database-search software is used as part of the identification process, a paradox exists whereby additional sequences needed to detect a wide variety of possible organisms may lead to more cross-species protein matches and misidentification of pathogens.
Variant Alleles, Triallelic Patterns, and Point Mutations Observed in Nuclear Short Tandem Repeat Typing of Populations in Bosnia and Serbia

PubMed Central

Huel, René L. M.; Bašić, Lara; Madacki-Todorović, Kamelija; Smajlović, Lejla; Eminović, Izet; Berbić, Irfan; Miloš, Ana; Parsons, Thomas J.

2007-01-01

Aim To present a compendium of off-ladder alleles and other genotyping irregularities relating to rare/unexpected population genetic variation, observed in a large short tandem repeat (STR) database from Bosnia and Serbia. Methods DNA was extracted from blood stain cards relating to reference samples from a population of 32 800 individuals from Bosnia and Serbia, and typed using Promega’s PowerPlex®16 STR kit. Results There were 31 distinct off-ladder alleles were observed in 10 of the 15 STR loci amplified from the PowerPlex®16 STR kit. Of these 31 alleles, 3 have not been previously reported. Furthermore, 16 instances of triallelic patterns were observed in 9 of the 15 loci. Primer binding site mismatches that affected amplification were observed in two loci, D5S818 and D8S1179. Conclusion Instances of deviations from manufacturer’s allelic ladders should be expected and caution taken to properly designate the correct alleles in large DNA databases. Particular care should be taken in kinship matching or paternity cases as incorrect designation of any of these deviations from allelic ladders could lead to false exclusions. PMID:17696304
Multiple-Locus Variable-Number Tandem-Repeats Analysis of Escherichia coli O157 using PCR multiplexing and multi-colored capillary electrophoresis.

PubMed

Lindstedt, Bjørn-Arne; Vardund, Traute; Kapperud, Georg

2004-08-01

The Multiple-Locus Variable-Number Tandem-Repeats Analysis (MLVA) method is currently being used as the primary typing tool for Shiga-toxin-producing Escherichia coli (STEC) O157 isolates in our laboratory. The initial assay was performed using a single fluorescent dye and the different patterns were assigned using a gel image. Here, we present a significantly improved assay using multiple dye colors and enhanced PCR multiplexing to increase speed, and ease the interpretation of the results. The different MLVA patterns are now based on allele sizes entered as character values, thus removing the uncertainties introduced when analyzing band patterns from the gel image. We additionally propose an easy numbering scheme for the identification of separate isolates that will facilitate exchange of typing data. Seventy-two human and animal strains of Shiga-toxin-producing E. coli O157 were used for the development of the improved MLVA assay. The method is based on capillary separation of multiplexed PCR products of VNTR loci in the E. coli O157 genome labeled with multiple fluorescent dyes. The different alleles at each locus were then assigned to allele numbers, which were used for strain comparison.
Application of Short Tandem Repeat markers in diagnosis of chromosomal aneuploidies and forensic DNA investigation in Pakistan.

PubMed

Chishti, Hafsah Muhammad; Ansar, Muhammad; Ajmal, Muhammad; Hameed, Abdul

2014-09-15

Short Tandem Repeat (STR) genetic markers hold great potential in forensic investigations, molecular diagnostics and molecular genetics research. AmpFlSTR® Identifiler™ PCR amplification kit is a multiplex system for co-amplification of 15 STR markers used worldwide in forensic investigations. This study attempts to assess forensic validity of these STRs in Pakistani population and to investigate its applicability in quick and simultaneous diagnosis and tracing parental source of common chromosomal aneuploidies. Samples from 554 healthy Pakistani individuals from 5 different ethnicities were analyzed for forensic parameters using Identifiler STRs and 74 patients' samples with different aneuploidies were evaluated for diagnostic strengths of these markers. All STRs hold sufficient forensic applicability in Pakistani population with paternity index between 1.5 and 3.5, polymorphic information content from 0.63 to 0.87 and discrimination power ≥0.9 (except TPOX locus). Variation from Hardy-Weinberg equilibrium was observed at some loci reflecting selective breeding and intermarriages trend in Pakistan. Among aneuploidic samples, all trisomies were precisely detectable while aneuploidies involving sex chromosomes or missing chromosomes were not clearly detectable using Identifiler STRs. Parental origin of aneuploidy was traceable in 92.54% patients. The studied STR markers are valuable tools for forensic application in Pakistan and utilizable for quick and simultaneous identification of some common trisomic conditions. Adding more sex chromosome specific STR markers can immensely increase the diagnostic and forensic potential of this system. Copyright © 2014 Elsevier B.V. All rights reserved.
Regulation of Nucleocytoplasmic Shuttling of Bruton's Tyrosine Kinase (Btk) through a Novel SH3-Dependent Interaction with Ankyrin Repeat Domain 54 (ANKRD54)

PubMed Central

Hussain, Alamdar; Mohammad, Dara K.; Mohamed, Abdalla J.; Nguyen, Vivian; Metalnikov, Pavel; Colwill, Karen; Pawson, Tony; Nore, Beston F.

2012-01-01

Bruton's tyrosine kinase (Btk), belonging to the Tec family of tyrosine kinases (TFKs), is essential for B-lymphocyte development. Abrogation of Btk signaling causes human X-linked agammaglobulinemia (XLA) and murine X-linked immunodeficiency (Xid). We employed affinity purification of Flag-tagged Btk, combined with tandem mass spectrometry, to capture and identify novel interacting proteins. We here characterize the interaction with ankryin repeat domain 54 protein (ANKRD54), also known as Lyn-interacting ankyrin repeat protein (Liar). While Btk is a nucleocytoplasmic protein, the Liar pool was found to shuttle at a higher rate than Btk. Importantly, our results suggest that Liar mediates nuclear export of both Btk and another TFK, Txk/Rlk. Liar-mediated Btk shuttling was enriched for activation loop, nonphosphorylated Btk and entirely dependent on Btk's SH3 domain. Liar also showed reduced binding to an aspartic acid phosphomimetic SH3 mutant. Three other investigated nucleus-located proteins, Abl, estrogen receptor β (ERβ), and transcription factor T-bet, were all unaffected by Liar. We mapped the interaction site to the C terminus of the Btk SH3 domain. A biotinylated, synthetic Btk peptide, ARDKNGQEGYIPSNYVTEAEDS, was sufficient for this interaction. Liar is the first protein identified that specifically influences the nucleocytoplasmic shuttling of Btk and Txk and belongs to a rare group of known proteins carrying out this activity in a Crm1-dependent manner. PMID:22527282
A pollen-specific novel calmodulin-binding protein with tetratricopeptide repeats

NASA Technical Reports Server (NTRS)

Safadi, F.; Reddy, V. S.; Reddy, A. S.

2000-01-01

Calcium is essential for pollen germination and pollen tube growth. A large body of information has established a link between elevation of cytosolic Ca(2+) at the pollen tube tip and its growth. Since the action of Ca(2+) is primarily mediated by Ca(2+)-binding proteins such as calmodulin (CaM), identification of CaM-binding proteins in pollen should provide insights into the mechanisms by which Ca(2+) regulates pollen germination and tube growth. In this study, a CaM-binding protein from maize pollen (maize pollen calmodulin-binding protein, MPCBP) was isolated in a protein-protein interaction-based screening using (35)S-labeled CaM as a probe. MPCBP has a molecular mass of about 72 kDa and contains three tetratricopeptide repeats (TPR) suggesting that it is a member of the TPR family of proteins. MPCBP protein shares a high sequence identity with two hypothetical TPR-containing proteins from Arabidopsis. Using gel overlay assays and CaM-Sepharose binding, we show that the bacterially expressed MPCBP binds to bovine CaM and three CaM isoforms from Arabidopsis in a Ca(2+)-dependent manner. To map the CaM-binding domain several truncated versions of the MPCBP were expressed in bacteria and tested for their ability to bind CaM. Based on these studies, the CaM-binding domain was mapped to an 18-amino acid stretch between the first and second TPR regions. Gel and fluorescence shift assays performed with CaM and a CaM-binding synthetic peptide further confirmed MPCBP binding to CaM. Western, Northern, and reverse transcriptase-polymerase chain reaction analysis have shown that MPCBP expression is specific to pollen. MPCBP was detected in both soluble and microsomal proteins. Immunoblots showed the presence of MPCBP in mature and germinating pollen. Pollen-specific expression of MPCBP, its CaM-binding properties, and the presence of TPR motifs suggest a role for this protein in Ca(2+)-regulated events during pollen germination and growth.
Genetic mapping of 15 human X chromosomal forensic short tandem repeat (STR) loci by means of multi-core parallelization.

PubMed

Diegoli, Toni Marie; Rohde, Heinrich; Borowski, Stefan; Krawczak, Michael; Coble, Michael D; Nothnagel, Michael

2016-11-01

Typing of X chromosomal short tandem repeat (X STR) markers has become a standard element of human forensic genetic analysis. Joint consideration of many X STR markers at a time increases their discriminatory power but, owing to physical linkage, requires inter-marker recombination rates to be accurately known. We estimated the recombination rates between 15 well established X STR markers using genotype data from 158 families (1041 individuals) and following a previously proposed likelihood-based approach that allows for single-step mutations. To meet the computational requirements of this family-based type of analysis, we modified a previous implementation so as to allow multi-core parallelization on a high-performance computing system. While we obtained recombination rate estimates larger than zero for all but one pair of adjacent markers within the four previously proposed linkage groups, none of the three X STR pairs defining the junctions of these groups yielded a recombination rate estimate of 0.50. Corroborating previous studies, our results therefore argue against a simple model of independent X chromosomal linkage groups. Moreover, the refined recombination fraction estimates obtained in our study will facilitate the appropriate joint consideration of all 15 investigated markers in forensic analysis. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Towards Development of Clustering Applications for Large-Scale Comparative Genotyping and Kinship Analysis Using Y-Short Tandem Repeats.

PubMed

Seman, Ali; Sapawi, Azizian Mohd; Salleh, Mohd Zaki

2015-06-01

Y-chromosome short tandem repeats (Y-STRs) are genetic markers with practical applications in human identification. However, where mass identification is required (e.g., in the aftermath of disasters with significant fatalities), the efficiency of the process could be improved with new statistical approaches. Clustering applications are relatively new tools for large-scale comparative genotyping, and the k-Approximate Modal Haplotype (k-AMH), an efficient algorithm for clustering large-scale Y-STR data, represents a promising method for developing these tools. In this study we improved the k-AMH and produced three new algorithms: the Nk-AMH I (including a new initial cluster center selection), the Nk-AMH II (including a new dominant weighting value), and the Nk-AMH III (combining I and II). The Nk-AMH III was the superior algorithm, with mean clustering accuracy that increased in four out of six datasets and remained at 100% in the other two. Additionally, the Nk-AMH III achieved a 2% higher overall mean clustering accuracy score than the k-AMH, as well as optimal accuracy for all datasets (0.84-1.00). With inclusion of the two new methods, the Nk-AMH III produced an optimal solution for clustering Y-STR data; thus, the algorithm has potential for further development towards fully automatic clustering of any large-scale genotypic data.
Investigation of Salmonella Enteritidis outbreaks in South Africa using multi-locus variable-number tandem-repeats analysis, 2013-2015.

PubMed

Muvhali, Munyadziwa; Smith, Anthony Marius; Rakgantso, Andronica Moipone; Keddy, Karen Helena

2017-10-02

Salmonella enterica serovar Enteritidis (Salmonella Enteritidis) has become a significant pathogen in South Africa, and the need for improved molecular surveillance of this pathogen has become important. Over the years, multi-locus variable-number tandem-repeats analysis (MLVA) has become a valuable molecular subtyping technique for Salmonella, particularly for highly homogenic serotypes such as Salmonella Enteritidis. This study describes the use of MLVA in the molecular epidemiological investigation of outbreak isolates in South Africa. Between the years 2013 and 2015, the Centre for Enteric Diseases (CED) received 39 Salmonella Enteritidis isolates from seven foodborne illness outbreaks, which occurred in six provinces. MLVA was performed on all isolates. Three MLVA profiles (MLVA profiles 21, 22 and 28) were identified among the 39 isolates. MLVA profile 28 accounted for 77% (30/39) of the isolates. Isolates from a single outbreak were grouped into a single MLVA profile. A minimum spanning tree (MST) created from the MLVA data showed a close relationship between MLVA profiles 21, 22 and 28, with a single VNTR locus difference between them. MLVA has proven to be a reliable method for the molecular epidemiological investigation of Salmonella Enteritidis outbreaks in South Africa. These foodborne outbreaks emphasize the importance of the One Health approach as an essential component for combating the spread of zoonotic pathogens such as Salmonella Enteritidis.
Targeted liquid chromatography tandem mass spectrometry to quantitate wheat gluten using well-defined reference proteins.

PubMed

Schalk, Kathrin; Koehler, Peter; Scherf, Katharina Anne

2018-01-01

Celiac disease (CD) is an inflammatory disorder of the upper small intestine caused by the ingestion of storage proteins (prolamins and glutelins) from wheat, barley, rye, and, in rare cases, oats. CD patients need to follow a gluten-free diet by consuming gluten-free products with gluten contents of less than 20 mg/kg. Currently, the recommended method for the quantitative determination of gluten is an enzyme-linked immunosorbent assay (ELISA) based on the R5 monoclonal antibody. Because the R5 ELISA mostly detects the prolamin fraction of gluten, a new independent method is required to detect prolamins as well as glutelins. This paper presents the development of a method to quantitate 16 wheat marker peptides derived from all wheat gluten protein types by liquid chromatography tandem mass spectrometry (LC-MS/MS) in the multiple reaction monitoring mode. The quantitation of each marker peptide in the chymotryptic digest of a defined amount of the respective reference wheat protein type resulted in peptide-specific yields. This enabled the conversion of peptide into protein type concentrations. Gluten contents were expressed as sum of all determined protein type concentrations. This new method was applied to quantitate gluten in wheat starches and compared to R5 ELISA and gel-permeation high-performance liquid chromatography with fluorescence detection (GP-HPLC-FLD), which resulted in a strong correlation between LC-MS/MS and the other two methods.

Targeted liquid chromatography tandem mass spectrometry to quantitate wheat gluten using well-defined reference proteins

PubMed Central

Schalk, Kathrin; Koehler, Peter

2018-01-01

Celiac disease (CD) is an inflammatory disorder of the upper small intestine caused by the ingestion of storage proteins (prolamins and glutelins) from wheat, barley, rye, and, in rare cases, oats. CD patients need to follow a gluten-free diet by consuming gluten-free products with gluten contents of less than 20 mg/kg. Currently, the recommended method for the quantitative determination of gluten is an enzyme-linked immunosorbent assay (ELISA) based on the R5 monoclonal antibody. Because the R5 ELISA mostly detects the prolamin fraction of gluten, a new independent method is required to detect prolamins as well as glutelins. This paper presents the development of a method to quantitate 16 wheat marker peptides derived from all wheat gluten protein types by liquid chromatography tandem mass spectrometry (LC-MS/MS) in the multiple reaction monitoring mode. The quantitation of each marker peptide in the chymotryptic digest of a defined amount of the respective reference wheat protein type resulted in peptide-specific yields. This enabled the conversion of peptide into protein type concentrations. Gluten contents were expressed as sum of all determined protein type concentrations. This new method was applied to quantitate gluten in wheat starches and compared to R5 ELISA and gel-permeation high-performance liquid chromatography with fluorescence detection (GP-HPLC-FLD), which resulted in a strong correlation between LC-MS/MS and the other two methods. PMID:29425234
Multiple-locus variable number of tandem repeat analysis (MLVA) of Irish verocytotoxigenic Escherichia coli O157 from feedlot cattle: uncovering strain dissemination routes.

PubMed

Murphy, Mary; Minihan, Donal; Buckley, James F; O'Mahony, Micheál; Whyte, Paul; Fanning, Séamus

2008-01-24

The identification of the routes of dissemination of Escherichia coli (E. coli) O157 through a cohort of cattle is a critical step to control this pathogen at farm level. The aim of this study was to identify potential routes of dissemination of E. coli O157 using Multiple-Locus Variable number of tandem repeat Analysis (MLVA). Thirty-eight environmental and sixteen cattle faecal isolates, which were detected in four adjacent pens over a four-month period were sub-typed. MLVA could separate these isolates into broadly defined clusters consisting of twelve MLVA types. Strain diversity was observed within pens, individual cattle and the environment. Application of MLVA is a broadly useful and convenient tool when applied to uncover the dissemination of E. coli O157 in the environment and in supporting improved on-farm management of this important pathogen. These data identified diverse strain types based on amplification of VNTR markers in each case.
Application of a multilocus variable number of tandem repeats analysis to regional outbreak surveillance of Enterohemorrhagic Escherichia coli O157:H7 infections.

PubMed

Konno, Takayuki; Yatsuyanagi, Jun; Saito, Shioko

2011-01-01

A total of 18 strains of EHEC O157:H7 were isolated from distinct cases in Akita Prefecture, Japan from July to September 2007. The genetic relatedness of these isolates was investigated by performing a multilocus variable number of tandem repeats analysis (MLVA) and a pulsed-field gel electrophoresis (PFGE) analysis using XbaI. The PFGE analyses allowed us to group these 18 isolates into three major clusters. The MLVA results correlated closely with those obtained by PFGE, although some variants were found within the clusters obtained by PFGE, thus highlighting the utility of this technique for determining a precise classification when it is difficult to differentiate between isolates with indistinguishable or very similar PFGE patterns. In addition, MLVA is a much easier and more rapid method than PFGE for analysis of the genetic relatedness of strains. Thus, as a second molecular epidemiological subtyping method, MLVA is useful for the regional outbreak surveillance of EHEC O157:H7 infections.
The repeat organizer, a specialized insulator element within the intergenic spacer of the Xenopus rRNA genes.

PubMed Central

Robinett, C C; O'Connor, A; Dunaway, M

1997-01-01

We have identified a novel activity for the region of the intergenic spacer of the Xenopus laevis rRNA genes that contains the 35- and 100-bp repeats. We devised a new assay for this region by constructing DNA plasmids containing a tandem repeat of rRNA reporter genes that were separated by the 35- and 100-bp repeat region and a rRNA gene enhancer. When the 35- and 100-bp repeat region is present in its normal position and orientation at the 3' end of the rRNA reporter genes, the enhancer activates the adjacent downstream promoter but not the upstream rRNA promoter on the same plasmid. Because this element can restrict the range of an enhancer's activity in the context of tandem genes, we have named it the repeat organizer (RO). The ability to restrict enhancer action is a feature of insulator elements, but unlike previously described insulator elements the RO does not block enhancer action in a simple enhancer-blocking assay. Instead, the activity of the RO requires that it be in its normal position and orientation with respect to the other sequence elements of the rRNA genes. The enhancer-binding transcription factor xUBF also binds to the repetitive sequences of the RO in vitro, but these sequences do not activate transcription in vivo. We propose that the RO is a specialized insulator element that organizes the tandem array of rRNA genes into single-gene expression units by promoting activation of a promoter by its proximal enhancers. PMID:9111359
Optimization of sequence alignment for simple sequence repeat regions.

PubMed

Jighly, Abdulqader; Hamwieh, Aladdin; Ogbonnaya, Francis C

2011-07-20

Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs).SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type.When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.
MR-Tandem: parallel X!Tandem using Hadoop MapReduce on Amazon Web Services.

PubMed

Pratt, Brian; Howbert, J Jeffry; Tasman, Natalie I; Nilsson, Erik J

2012-01-01

MR-Tandem adapts the popular X!Tandem peptide search engine to work with Hadoop MapReduce for reliable parallel execution of large searches. MR-Tandem runs on any Hadoop cluster but offers special support for Amazon Web Services for creating inexpensive on-demand Hadoop clusters, enabling search volumes that might not otherwise be feasible with the compute resources a researcher has at hand. MR-Tandem is designed to drop in wherever X!Tandem is already in use and requires no modification to existing X!Tandem parameter files, and only minimal modification to X!Tandem-based workflows. MR-Tandem is implemented as a lightly modified X!Tandem C++ executable and a Python script that drives Hadoop clusters including Amazon Web Services (AWS) Elastic Map Reduce (EMR), using the modified X!Tandem program as a Hadoop Streaming mapper and reducer. The modified X!Tandem C++ source code is Artistic licensed, supports pluggable scoring, and is available as part of the Sashimi project at http://sashimi.svn.sourceforge.net/viewvc/sashimi/trunk/trans_proteomic_pipeline/extern/xtandem/. The MR-Tandem Python script is Apache licensed and available as part of the Insilicos Cloud Army project at http://ica.svn.sourceforge.net/viewvc/ica/trunk/mr-tandem/. Full documentation and a windows installer that configures MR-Tandem, Python and all necessary packages are available at this same URL. brian.pratt@insilicos.com
Supplementation of H1N1pdm09 split vaccine with heterologous tandem repeat M2e5x virus-like particles confers improved cross-protection in ferrets.

PubMed

Music, Nedzad; Reber, Adrian J; Kim, Min-Chul; York, Ian A; Kang, Sang-Moo

2016-01-20

Current influenza vaccines induce strain-specific immunity to the highly variable hemagglutinin (HA) protein. It is therefore a high priority to develop vaccines that induce broadly cross-protective immunity to different strains of influenza. Since influenza A M2 proteins are highly conserved among different strains, five tandem repeats of the extracellular peptide of M2 in a membrane-anchored form on virus-like particles (VLPs) have been suggested to be a promising candidate for universal influenza vaccine. In this study, ferrets were intramuscularly immunized with 2009 H1N1 split HA vaccine ("Split") alone, influenza split vaccine supplemented with M2e5x VLP ("Split+M2e5x"), M2e5x VLP alone ("M2e5x"), or mock immunized. Vaccine efficacy was measured serologically and by protection against a serologically distinct viral challenge. Ferrets immunized with Split+M2e5x induced HA strain specific and conserved M2e immunity. Supplementation of M2e5x VLP to split vaccination significantly increased the immunogenicity of split vaccine compared to split alone. The Split+M2e5x ferret group showed evidence of cross-reactive protection, including faster recovery from weight loss, and reduced inflammation, as inferred from changes in peripheral leukocyte subsets, compared to mock-immunized animals. In addition, ferrets immunized with Split+M2e5x shed lower viral nasal-wash titers than the other groups. Ferrets immunized with M2e5x alone also show some protective effects, while those immunized with split vaccine alone induced no protective effects compared to mock-immunized ferrets. These studies suggest that supplementation of split vaccine with M2e5x-VLP may provide broader and improved cross-protection than split vaccine alone. Published by Elsevier Ltd.
Supplementation of H1N1pdm09 split vaccine with heterologous tandem repeat M2e5x virus-like particles confers improved cross-protection in ferrets

PubMed Central

Music, Nedzad; Reber, Adrian J.; Kim, Min-Chul; York, Ian A.; Kang, Sang-Moo

2015-01-01

Current influenza vaccines induce strain-specific immunity to the highly variable hemagglutinin (HA) protein. It is therefore a high priority to develop vaccines that induce broadly cross-protective immunity to different strains of influenza. Since influenza A M2 proteins are highly conserved among different strains, five tandem repeats of the extracellular peptide of M2 in a membrane-anchored form on virus-like particles (VLPs) have been suggested to be a promising candidate for universal influenza vaccine. In this study, ferrets were intramuscularly immunized with 2009 H1N1 split HA vaccine (“Split”) alone, influenza split vaccine supplemented with M2e5x VLP (“Split+M2e5x”), M2e5x VLP alone (“M2e5x”), or mock immunized. Vaccine efficacy was measured serologically and by protection against a serologically distinct viral challenge. Ferrets immunized with Split+M2e5x induced HA strain specific and conserved M2e immunity. Supplementation of M2e5x VLP to split vaccination significantly increased the immunogenicity of split vaccine compared to split alone. The Split+M2e5x ferret group showed evidence of cross-reactive protection, including faster recovery from weight loss, and reduced inflammation, as inferred from changes in peripheral leukocyte subsets, compared to mock-immunized animals. In addition, ferrets immunized with Split+M2e5x shed lower viral nasal-wash titers than the other groups. Ferrets immunized with M2e5x alone also show some protective effects, while those immunized with split vaccine alone induced no protective effects compared to mock-immunized ferrets. These studies suggest that supplementation of split vaccine with M2e5x-VLP may provide broader and improved cross-protection than split vaccine alone. PMID:26709639
MR-Tandem: parallel X!Tandem using Hadoop MapReduce on Amazon Web Services

PubMed Central

Pratt, Brian; Howbert, J. Jeffry; Tasman, Natalie I.; Nilsson, Erik J.

2012-01-01

Summary: MR-Tandem adapts the popular X!Tandem peptide search engine to work with Hadoop MapReduce for reliable parallel execution of large searches. MR-Tandem runs on any Hadoop cluster but offers special support for Amazon Web Services for creating inexpensive on-demand Hadoop clusters, enabling search volumes that might not otherwise be feasible with the compute resources a researcher has at hand. MR-Tandem is designed to drop in wherever X!Tandem is already in use and requires no modification to existing X!Tandem parameter files, and only minimal modification to X!Tandem-based workflows. Availability and implementation: MR-Tandem is implemented as a lightly modified X!Tandem C++ executable and a Python script that drives Hadoop clusters including Amazon Web Services (AWS) Elastic Map Reduce (EMR), using the modified X!Tandem program as a Hadoop Streaming mapper and reducer. The modified X!Tandem C++ source code is Artistic licensed, supports pluggable scoring, and is available as part of the Sashimi project at http://sashimi.svn.sourceforge.net/viewvc/sashimi/trunk/trans_proteomic_pipeline/extern/xtandem/. The MR-Tandem Python script is Apache licensed and available as part of the Insilicos Cloud Army project at http://ica.svn.sourceforge.net/viewvc/ica/trunk/mr-tandem/. Full documentation and a windows installer that configures MR-Tandem, Python and all necessary packages are available at this same URL. Contact: brian.pratt@insilicos.com PMID:22072385
Mitogen-activated protein kinase is required for the behavioral desensitization that occurs after repeated injections of angiotensin II

PubMed Central

Vento, Peter J.; Daniels, Derek

2013-01-01

Angiotensin II (AngII) acts on central angiotensin type 1 (AT1) receptors to increase water and saline intake. Prolonged exposure to AngII in cell culture models results in a desensitization of the AT1 receptor that is thought to involve receptor internalization, and a behavioral correlate of this desensitization has been shown in rats after repeated central injections of AngII. Specifically, rats given repeated injections of AngII drink less water than controls after a subsequent test injection of AngII. Under the same conditions, however, repeated injections of AngII have no effect on AngII-induced saline intake. Given earlier studies indicating that separate intracellular signaling pathways mediate AngII-induced water and saline intake, we hypothesized that the desensitization observed in rats may be incomplete, leaving the receptor able to activate mitogen-activated protein (MAP) kinases (ERK1/2), which play a role in AngII-induced saline intake without affecting water intake. In support of this hypothesis, we found no difference in MAP kinase phosphorylation after an AngII test injection in rats given prior treatment with repeated injections of vehicle, AngII, or Sar1,Ile4,Ile8-AngII (SII), an AngII analog that activates MAP kinase without G protein coupling. In addition, we found that pretreatment with the MAP kinase inhibitor U0126 completely blocked the desensitizing effect of repeated AngII injections on water intake. Furthermore, AngII-induced water intake was reduced similarly by repeated injections of AngII or SII. The results suggest that G protein-independent signaling is sufficient to produce behavioral desensitization of the angiotensin system and that the desensitization requires MAP kinase activation. PMID:22581747
Mitogen-activated protein kinase is required for the behavioural desensitization that occurs after repeated injections of angiotensin II.

PubMed

Vento, Peter J; Daniels, Derek

2012-12-01

Angiotensin II (Ang II) acts on central angiotensin type 1 (AT(1)) receptors to increase water and saline intake. Prolonged exposure to Ang II in cell culture models results in a desensitization of the AT(1) receptor that is thought to involve receptor internalization, and a behavioural correlate of this desensitization has been shown in rats after repeated central injections of Ang II. Specifically, rats given repeated injections of Ang II drink less water than control animals after a subsequent test injection of Ang II. In the same conditions, however, repeated injections of Ang II have no effect on Ang II-induced saline intake. Given earlier studies indicating that separate intracellular signalling pathways mediate Ang II-induced water and saline intake, we hypothesized that the desensitization observed in rats may be incomplete, leaving the receptor able to activate mitogen-activated protein (MAP) kinases (ERK1/2), which play a role in Ang II-induced saline intake without affecting water intake. In support of this hypothesis, we found no difference in MAP kinase phosphorylation after an Ang II test injection in rats given prior treatment with repeated injections of vehicle, Ang II or Sar(1),Ile(4),Ile(8)-Ang II (SII), an Ang II analogue that activates MAP kinase without G protein coupling. In addition, we found that pretreatment with the MAP kinase inhibitor U0126 completely blocked the desensitizing effect of repeated Ang II injections on water intake. Furthermore, Ang II-induced water intake was reduced to a similar extent by repeated injections of Ang II or SII. The results suggest that G protein-independent signalling is sufficient to produce behavioural desensitization of the angiotensin system and that the desensitization requires MAP kinase activation.
Evolutionary dynamics of the immunodominant repeats of the Plasmodium vivax malaria-vaccine candidate circumsporozoite protein (CSP)

PubMed Central

Patil, Aarti; Orjuela-Sánchez, Pamela; da Silva-Nunes, Mônica; Ferreira, Marcelo U.

2010-01-01

The circumsporozoite protein (CSP) of Plasmodium vivax, a major target for malaria vaccine development, has immunodominant B-cell epitopes mapped to central nonapeptide repeat arrays. To determine whether rearrangements of repeat motifs during mitotic DNA replication of parasites create significant CSP diversity under conditions of low effective meiotic recombination rates, we examined csp alleles from sympatric P. vivax isolates systematically sampled from an area of low malaria endemicity in Brazil over a period of 14 months. Nine unique csp types, comprising six different nonapeptide repeats, were observed in 45 isolates analyzed. Identical or nearly identical repeats predominated in most arrays, consistent with their recent expansion. We found strong linkage disequilibrium at sites across the chromosome 8 segment flanking the csp locus, consistent with rare meiotic recombination in this region. We conclude that CSP repeat diversity may not be severely constrained by rare meiotic recombination in areas of low malaria endemicity. New repeat variants may be readily created by nonhomologous recombination even when meiotic recombination is rare, with potential implications for CSP-based vaccine development. PMID:20097310
Identifying proteins that bind to specific RNAs - focus on simple repeat expansion diseases

PubMed Central

Jazurek, Magdalena; Ciesiolka, Adam; Starega-Roslan, Julia; Bilinska, Katarzyna; Krzyzosiak, Wlodzimierz J.

2016-01-01

RNA–protein complexes play a central role in the regulation of fundamental cellular processes, such as mRNA splicing, localization, translation and degradation. The misregulation of these interactions can cause a variety of human diseases, including cancer and neurodegenerative disorders. Recently, many strategies have been developed to comprehensively analyze these complex and highly dynamic RNA–protein networks. Extensive efforts have been made to purify in vivo-assembled RNA–protein complexes. In this review, we focused on commonly used RNA-centric approaches that involve mass spectrometry, which are powerful tools for identifying proteins bound to a given RNA. We present various RNA capture strategies that primarily depend on whether the RNA of interest is modified. Moreover, we briefly discuss the advantages and limitations of in vitro and in vivo approaches. Furthermore, we describe recent advances in quantitative proteomics as well as the methods that are most commonly used to validate robust mass spectrometry data. Finally, we present approaches that have successfully identified expanded repeat-binding proteins, which present abnormal RNA–protein interactions that result in the development of many neurological diseases. PMID:27625393
Short tandem repeat profiling: part of an overall strategy for reducing the frequency of cell misidentification.

PubMed

Nims, Raymond W; Sykes, Greg; Cottrill, Karin; Ikonomi, Pranvera; Elmore, Eugene

2010-12-01

The role of cell authentication in biomedical science has received considerable attention, especially within the past decade. This quality control attribute is now beginning to be given the emphasis it deserves by granting agencies and by scientific journals. Short tandem repeat (STR) profiling, one of a few DNA profiling technologies now available, is being proposed for routine identification (authentication) of human cell lines, stem cells, and tissues. The advantage of this technique over methods such as isoenzyme analysis, karyotyping, human leukocyte antigen typing, etc., is that STR profiling can establish identity to the individual level, provided that the appropriate number and types of loci are evaluated. To best employ this technology, a standardized protocol and a data-driven, quality-controlled, and publically searchable database will be necessary. This public STR database (currently under development) will enable investigators to rapidly authenticate human-based cultures to the individual from whom the cells were sourced. Use of similar approaches for non-human animal cells will require developing other suitable loci sets. While implementing STR analysis on a more routine basis should significantly reduce the frequency of cell misidentification, additional technologies may be needed as part of an overall authentication paradigm. For instance, isoenzyme analysis, PCR-based DNA amplification, and sequence-based barcoding methods enable rapid confirmation of a cell line's species of origin while screening against cross-contaminations, especially when the cells present are not recognized by the species-specific STR method. Karyotyping may also be needed as a supporting tool during establishment of an STR database. Finally, good cell culture practices must always remain a major component of any effort to reduce the frequency of cell misidentification.
Association of STin2 Variable Number of Tandem Repeat (VNTR) Polymorphism of Serotonin Transporter Gene with Lifelong Premature Ejaculation: A Case-Control Study in Han Chinese Subjects

PubMed Central

Huang, Yuanyuan; Zhang, Xiansheng; Gao, Jingjing; Tang, Dongdong; Gao, Pan; Peng, Dangwei; Liang, Chaozhao

2016-01-01

Background The STin2 VNTR polymorphism has a variable number of tandem repeats in intron 2 of the serotonin transporter gene. We aimed to explore the relationship between STin2 VNTR polymorphism and lifelong premature ejaculation (LPE). Material/Methods We recruited a total of 115 outpatients who complained of ejaculating prematurely and who were diagnosed as LPE, and 101 controls without PE complaint. Allelic variations of STin2 VNTR were genotyped using PCR-based technology. We evaluated the associations between STin2 VNTR allelic and genotypic frequencies and LPE, as well as the intravaginal ejaculation latency time (IELT) of different STin2 VNTR genotypes among LPE patients. Results The patients and controls did not differ significantly in terms of any characteristic except age. A significantly higher frequency of STin2.12/12 genotype was found among LPE patients versus controls (P=0.026). Frequency of patients carrying at least 1 copy of the 10-repeat allele was significantly lower compared to the control group (28.3% vs. 41.8%, OR=0.55; 95%CI=0.31–0.97, P=0.040). In the LPE group, the mean IELT showed significant difference in STin2.12/12 genotype when compared to those with STin2.12/10 and STin2.10/10 genotypes. The mean IELT in10-repeat allele carriers was 50% longer compared to homozygous carriers of the STin2.12 allele. Conclusions Our results indicate the presence of STin2.10 allele is a protective factor for LPE. Men carrying the higher expression genotype STin2. 12/12 have shorter IELT than 10-repeat allele carriers. PMID:27713390
New Multilocus Variable-Number Tandem-Repeat Analysis (MLVA) Scheme for Fine-Scale Monitoring and Microevolution-Related Study of Ralstonia pseudosolanacearum Phylotype I Populations

PubMed Central

Guinard, Jérémy; Latreille, Anne; Guérin, Fabien; Poussier, Stéphane

2016-01-01

ABSTRACT Bacterial wilt caused by the Ralstonia solanacearum species complex (RSSC) is considered one of the most harmful plant diseases in the world. Special attention should be paid to R. pseudosolanacearum phylotype I due to its large host range, its worldwide distribution, and its high evolutionary potential. So far, the molecular epidemiology and population genetics of this bacterium are poorly understood. Until now, the genetic structure of the RSSC has been analyzed on the worldwide and regional scales. Emerging questions regarding evolutionary forces in RSSC adaptation to hosts now require genetic markers that are able to monitor RSSC field populations. In this study, we aimed to evaluate the multilocus variable-number tandem-repeat analysis (MLVA) approach for its ability to discriminate genetically close phylotype I strains and for population genetics studies. We developed a new MLVA scheme (MLVA-7) allowing us to genotype 580 R. pseudosolanacearum phylotype I strains extracted from susceptible and resistant hosts and from different habitats (stem, soil, and rhizosphere). Based on specificity, polymorphism, and the amplification success rate, we selected seven fast-evolving variable-number tandem-repeat (VNTR) markers. The newly developed MLVA-7 scheme showed higher discriminatory power than the previously published MLVA-13 scheme when applied to collections sampled from the same location on different dates and to collections from different locations on very small scales. Our study provides a valuable tool for fine-scale monitoring and microevolution-related study of R. pseudosolanacearum phylotype I populations. IMPORTANCE Understanding the evolutionary dynamics of adaptation of plant pathogens to new hosts or ecological niches has become a key point for the development of innovative disease management strategies, including durable resistance. Whereas the molecular mechanisms underlying virulence or pathogenicity changes have been studied thoroughly, the
Arabidopsis leucine-rich repeat extensin (LRX) proteins modify cell wall composition and influence plant growth.

PubMed

Draeger, Christian; Ndinyanka Fabrice, Tohnyui; Gineau, Emilie; Mouille, Grégory; Kuhn, Benjamin M; Moller, Isabel; Abdou, Marie-Therese; Frey, Beat; Pauly, Markus; Bacic, Antony; Ringli, Christoph

2015-06-24

Leucine-rich repeat extensins (LRXs) are extracellular proteins consisting of an N-terminal leucine-rich repeat (LRR) domain and a C-terminal extensin domain containing the typical features of this class of structural hydroxyproline-rich glycoproteins (HRGPs). The LRR domain is likely to bind an interaction partner, whereas the extensin domain has an anchoring function to insolubilize the protein in the cell wall. Based on the analysis of the root hair-expressed LRX1 and LRX2 of Arabidopsis thaliana, LRX proteins are important for cell wall development. The importance of LRX proteins in non-root hair cells and on the structural changes induced by mutations in LRX genes remains elusive. The LRX gene family of Arabidopsis consists of eleven members, of which LRX3, LRX4, and LRX5 are expressed in aerial organs, such as leaves and stem. The importance of these LRX genes for plant development and particularly cell wall formation was investigated. Synergistic effects of mutations with gradually more severe growth retardation phenotypes in double and triple mutants suggest a similar function of the three genes. Analysis of cell wall composition revealed a number of changes to cell wall polysaccharides in the mutants. LRX3, LRX4, and LRX5, and most likely LRX proteins in general, are important for cell wall development. Due to the complexity of changes in cell wall structures in the lrx mutants, the exact function of LRX proteins remains to be determined. The increasingly strong growth-defect phenotypes in double and triple mutants suggests that the LRX proteins have similar functions and that they are important for proper plant development.
Simultaneous quantification of protein phosphorylation sites using liquid chromatography-tandem mass spectrometry-based targeted proteomics: a linear algebra approach for isobaric phosphopeptides.

PubMed

Xu, Feifei; Yang, Ting; Sheng, Yuan; Zhong, Ting; Yang, Mi; Chen, Yun

2014-12-05

As one of the most studied post-translational modifications (PTM), protein phosphorylation plays an essential role in almost all cellular processes. Current methods are able to predict and determine thousands of phosphorylation sites, whereas stoichiometric quantification of these sites is still challenging. Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS)-based targeted proteomics is emerging as a promising technique for site-specific quantification of protein phosphorylation using proteolytic peptides as surrogates of proteins. However, several issues may limit its application, one of which relates to the phosphopeptides with different phosphorylation sites and the same mass (i.e., isobaric phosphopeptides). While employment of site-specific product ions allows for these isobaric phosphopeptides to be distinguished and quantified, site-specific product ions are often absent or weak in tandem mass spectra. In this study, linear algebra algorithms were employed as an add-on to targeted proteomics to retrieve information on individual phosphopeptides from their common spectra. To achieve this simultaneous quantification, a LC-MS/MS-based targeted proteomics assay was first developed and validated for each phosphopeptide. Given the slope and intercept of calibration curves of phosphopeptides in each transition, linear algebraic equations were developed. Using a series of mock mixtures prepared with varying concentrations of each phosphopeptide, the reliability of the approach to quantify isobaric phosphopeptides containing multiple phosphorylation sites (≥ 2) was discussed. Finally, we applied this approach to determine the phosphorylation stoichiometry of heat shock protein 27 (HSP27) at Ser78 and Ser82 in breast cancer cells and tissue samples.
Ca2+-stabilized adhesin helps an Antarctic bacterium reach out and bind ice.

PubMed

Vance, Tyler D R; Olijve, Luuk L C; Campbell, Robert L; Voets, Ilja K; Davies, Peter L; Guo, Shuaiqi

2014-07-04

The large size of a 1.5-MDa ice-binding adhesin [MpAFP (Marinomonas primoryensis antifreeze protein)] from an Antarctic Gram-negative bacterium, M. primoryensis, is mainly due to its highly repetitive RII (Region II). MpAFP_RII contains roughly 120 tandem copies of an identical 104-residue repeat. We have previously determined that a single RII repeat folds as a Ca2+-dependent immunoglobulin-like domain. Here, we solved the crystal structure of RII tetra-tandemer (four tandem RII repeats) to a resolution of 1.8 Å. The RII tetra-tandemer reveals an extended (~190-Å × ~25-Å), rod-like structure with four RII-repeats aligned in series with each other. The inter-repeat regions of the RII tetra-tandemer are strengthened by Ca2+ bound to acidic residues. SAXS (small-angle X-ray scattering) profiles indicate the RII tetra-tandemer is significantly rigidified upon Ca2+ binding, and that the protein's solution structure is in excellent agreement with its crystal structure. We hypothesize that >600 Ca2+ help rigidify the chain of ~120 104-residue repeats to form a ~0.6 μm rod-like structure in order to project the ice-binding domain of MpAFP away from the bacterial cell surface. The proposed extender role of RII can help the strictly aerobic, motile bacterium bind ice in the upper reaches of the Antarctic lake where oxygen and nutrients are most abundant. Ca2+-induced rigidity of tandem Ig-like repeats in large adhesins might be a general mechanism used by bacteria to bind to their substrates and help colonize specific niches.
Bone protein “extractomics”: comparing the efficiency of bone protein extractions of Gallus gallus in tandem mass spectrometry, with an eye towards paleoproteomics

PubMed Central

DeHart, Caroline J.; Schweitzer, Mary H.; Thomas, Paul M.; Kelleher, Neil L.

2016-01-01

Proteomic studies of bone require specialized extraction protocols to demineralize and solubilize proteins from within the bone matrix. Although various protocols exist for bone protein recovery, little is known about how discrete steps in each protocol affect the subset of the bone proteome recovered by mass spectrometry (MS) analyses. Characterizing these different “extractomes” will provide critical data for development of novel and more efficient protein extraction methodologies for fossils. Here, we analyze 22 unique sub-extractions of chicken bone and directly compare individual extraction components for their total protein yield and diversity and coverage of bone proteins identified by MS. We extracted proteins using different combinations and ratios of demineralizing reagents, protein-solubilizing reagents, and post-extraction buffer removal methods, then evaluated tryptic digests from 20 µg aliquots of each fraction by tandem MS/MS on a 12T FT-ICR mass spectrometer. We compared total numbers of peptide spectral matches, peptides, and proteins identified from each fraction, the redundancy of protein identifications between discrete steps of extraction methods, and the sequence coverage obtained for select, abundant proteins. Although both alpha chains of collagen I (the most abundant protein in bone) were found in all fractions, other collagenous and non-collagenous proteins (e.g., apolipoprotein, osteonectin, hemoglobin) were differentially identified. We found that when a standardized amount of extracted proteins was analyzed, extraction steps that yielded the most protein (by weight) from bone were often not the ones that produced the greatest diversity of bone proteins, or the highest degree of protein coverage. Generally, the highest degrees of diversity and coverage were obtained from demineralization fractions, and the proteins found in the subsequent solubilization fractions were highly redundant with those in the previous fraction. Based on

Identification of host proteins, Spata3 and Dkk2, interacting with Toxoplasma gondii micronemal protein MIC3.

PubMed

Wang, Yifan; Fang, Rui; Yuan, Yuan; Pan, Ming; Hu, Min; Zhou, Yanqin; Shen, Bang; Zhao, Junlong

2016-07-01

As an obligate intracellular protozoan, Toxoplasma gondii is a successful pathogen infecting a variety of animals, including humans. As an adhesin involving in host invasion, the micronemal protein MIC3 plays important roles in host cell attachment, as well as modulation of host EGFR signaling cascade. However, the specific host proteins that interact with MIC3 are unknown and the identification of such proteins will increase our understanding of how MIC3 exerts its functions. This study was designed to identify host proteins interacting with MIC3 by yeast two-hybrid screens. Using MIC3 as bait, a library expressing mouse proteins was screened, uncovering eight mouse proteins that showed positive interactions with MIC3. Two of which, spermatogenesis-associated protein 3 (Spata3) and dickkopf-related protein 2 (Dkk2), were further confirmed to interact with MIC3 by additional protein-protein interaction tests. The results also revealed that the tandem repeat EGF domains of MIC3 were critical in mediating the interactions with the identified host proteins. This is the first study to show that MIC3 interacts with host proteins that are involved in reproduction, growth, and development. The results will provide a clearer understanding of the functions of adhesion-associated micronemal proteins in T. gondii.
Markov state models of protein misfolding

NASA Astrophysics Data System (ADS)

Sirur, Anshul; De Sancho, David; Best, Robert B.

2016-02-01

Markov state models (MSMs) are an extremely useful tool for understanding the conformational dynamics of macromolecules and for analyzing MD simulations in a quantitative fashion. They have been extensively used for peptide and protein folding, for small molecule binding, and for the study of native ensemble dynamics. Here, we adapt the MSM methodology to gain insight into the dynamics of misfolded states. To overcome possible flaws in root-mean-square deviation (RMSD)-based metrics, we introduce a novel discretization approach, based on coarse-grained contact maps. In addition, we extend the MSM methodology to include "sink" states in order to account for the irreversibility (on simulation time scales) of processes like protein misfolding. We apply this method to analyze the mechanism of misfolding of tandem repeats of titin domains, and how it is influenced by confinement in a chaperonin-like cavity.
Phosphate Control of Oxytetracycline Production by Streptomyces rimosus Is at the Level of Transcription from Promoters Overlapped by Tandem Repeats Similar to Those of the DNA-Binding Sites of the OmpR Family

PubMed Central

McDowall, Kenneth J.; Thamchaipenet, Arinthip; Hunter, Iain S.

1999-01-01

Physiological studies have shown that Streptomyces rimosus produces the polyketide antibiotic oxytetracycline abundantly when its mycelial growth is limited by phosphate starvation. We show here that transcripts originating from the promoter for one of the biosynthetic genes, otcC (encoding anhydrotetracycline oxygenase), and from a promoter for the divergent otcX genes peak in abundance at the onset of antibiotic production induced by phosphate starvation, indicating that the synthesis of oxytetracycline is controlled, at least in part, at the level of transcription. Furthermore, analysis of the sequences of the promoters for otcC, otcX, and the polyketide synthase (otcY) genes revealed tandem repeats having significant similarity to the DNA-binding sites of ActII-Orf4 and DnrI, which are Streptomyces antibiotic regulatory proteins (SARPs) related to the OmpR family of transcription activators. Together, the above results suggest that oxytetracycline production by S. rimosus requires a SARP-like transcription factor that is either produced or activated or both under conditions of low phosphate concentrations. We also provide evidence consistent with the otrA resistance gene being cotranscribed with otcC as part of a polycistronic message, suggesting a simple mechanism of coordinate regulation which ensures that resistance to the antibiotic increases in proportion to production. PMID:10322002
The development and application of a multiplex short tandem repeat (STR) system for identifying subspecies, individuals and sex in tigers.

PubMed

Zou, Zheng-Ting; Uphyrkina, Olga V; Fomenko, Pavel; Luo, Shu-Jin

2015-07-01

Poaching and trans-boundary trafficking of tigers and body parts are threatening the world's last remaining wild tigers. Development of an efficient molecular genetic assay for tracing the origins of confiscated specimens will assist in law enforcement and wildlife forensics for this iconic flagship species. We developed a multiplex genotyping system "tigrisPlex" to simultaneously assess 22 short tandem repeat (STR, or microsatellite) loci and a gender-identifying SRY gene, all amplified in 4 reactions using as little as 1 ng of template DNA. With DNA samples used for between-run calibration, the system generates STR genotypes that are directly compatible with voucher tiger subspecies genetic profiles, hence making it possible to identify subspecies via bi-parentally inherited markers. We applied "tigrisPlex" to 12 confiscated specimens from Russia and identified 6 individuals (3 females and 3 males), each represented by duplicated samples and all designated as Amur tigers (Panthera tigris altaica) with high confidence. This STR multiplex system can serve as an effective and versatile approach for genetic profiling of both wild and captive tigers as well as confiscated tiger products, fulfilling various conservation needs for identifying the origins of tiger samples. © 2015 International Society of Zoological Sciences, Institute of Zoology/Chinese Academy of Sciences and Wiley Publishing Asia Pty Ltd.
Variable number of tandem repeats and pulsed-field gel electrophoresis cluster analysis of enterohemorrhagic Escherichia coli serovar O157 strains.

PubMed

Yokoyama, Eiji; Uchimura, Masako

2007-11-01

Ninety-five enterohemorrhagic Escherichia coli serovar O157 strains, including 30 strains isolated from 13 intrafamily outbreaks and 14 strains isolated from 3 mass outbreaks, were studied by pulsed-field gel electrophoresis (PFGE) and variable number of tandem repeats (VNTR) typing, and the resulting data were subjected to cluster analysis. Cluster analysis of the VNTR typing data revealed that 57 (60.0%) of 95 strains, including all epidemiologically linked strains, formed clusters with at least 95% similarity. Cluster analysis of the PFGE patterns revealed that 67 (70.5%) of 95 strains, including all but 1 of the epidemiologically linked strains, formed clusters with 90% similarity. The number of epidemiologically unlinked strains forming clusters was significantly less by VNTR cluster analysis than by PFGE cluster analysis. The congruence value between PFGE and VNTR cluster analysis was low and did not show an obvious correlation. With two-step cluster analysis, the number of clustered epidemiologically unlinked strains by PFGE cluster analysis that were divided by subsequent VNTR cluster analysis was significantly higher than the number by VNTR cluster analysis that were divided by subsequent PFGE cluster analysis. These results indicate that VNTR cluster analysis is more efficient than PFGE cluster analysis as an epidemiological tool to trace the transmission of enterohemorrhagic E. coli O157.
How proteins bind to DNA: target discrimination and dynamic sequence search by the telomeric protein TRF1

PubMed Central

2017-01-01

Abstract Target search as performed by DNA-binding proteins is a complex process, in which multiple factors contribute to both thermodynamic discrimination of the target sequence from overwhelmingly abundant off-target sites and kinetic acceleration of dynamic sequence interrogation. TRF1, the protein that binds to telomeric tandem repeats, faces an intriguing variant of the search problem where target sites are clustered within short fragments of chromosomal DNA. In this study, we use extensive (>0.5 ms in total) MD simulations to study the dynamical aspects of sequence-specific binding of TRF1 at both telomeric and non-cognate DNA. For the first time, we describe the spontaneous formation of a sequence-specific native protein–DNA complex in atomistic detail, and study the mechanism by which proteins avoid off-target binding while retaining high affinity for target sites. Our calculated free energy landscapes reproduce the thermodynamics of sequence-specific binding, while statistical approaches allow for a comprehensive description of intermediate stages of complex formation. PMID:28633355
Two novel monoclonal antibodies against the MUC4 tandem repeat reacting with an antigen overexpressed by lung cancer.

PubMed

Botti, C; Seregni, E; Ménard, S; Collini, P; Tagliabue, E; Campiglio, M; Vergani, B; Ghirelli, C; Aiello, P; Pilotti, S; Bombardieri, E

2000-01-01

In this study we investigated the immunochemical and cytochemical reactivity of two monoclonal antibodies against the 16-amino acid tandem repeat of MUC4 to demonstrate a possible variation of the mucin core peptide expression related to lung cancer. The immunocytochemical anti-MUC4 reactivity was analyzed in four lung cancer cell lines (Calu-1, Calu-3, H460, SKMES) and in other tumor cell lines, as well as in frozen materials from 21 lung adenocarcinomas (ACs), including five bronchioloalveolar carcinomas (BACs), and 11 squamous cell lung carcinomas (SqCCs). A weak fluorescence anti-MUC4 positivity (range: 10.3-16.2) was observed only in acetone-fixed lung cancer cell lines Calu-1, Calu-3 and H460. These three lung cancer cell lines also showed a cytoplasmic immunoperoxidase reactivity. The immunostaining in lung cancer tissues showed a granular cytoplasmic reactivity: 15/21 (71%) and 17/21 (80%) ACs were positive with BC-LuC18.2 and BC-LuCF12, respectively. All BACs were positive. Moderate to strong reactivity was present in well-differentiated ACs. In the normal lung parenchyma counterparts weak reactivity was found only in bronchiolar cells. All SqCCs were negative. Anti-MUC4 reactivity was also observed in the alveolar mucus. In conclusion, our anti-MUC4 MAbs detect a secretion product present in mucus and this product is elaborated by lung cancer cells and overexpressed in well-differentiated lung ACs.
Two different size classes of 5S rDNA units coexisting in the same tandem array in the razor clam Ensis macha: is this region suitable for phylogeographic studies?

PubMed

Fernández-Tajes, Juan; Méndez, Josefina

2009-12-01

For a study of 5S ribosomal genes (rDNA) in the razor clam Ensis macha, the 5S rDNA region was amplified and sequenced. Two variants, so-called type I or short repeat (approximately 430 bp) and type II or long repeat (approximately 735 bp), appeared to be the main components of the 5S rDNA of this species. Their spacers differed markedly, both in length and nucleotide composition. The organization of the two variants was investigated by amplifying the genomic DNA with primers based on the sequence of the type I and type II spacers. PCR amplification products with primers EMLbF and EMSbR showed that the long and short repeats are associated within the same tandem array, suggesting an intermixed arrangement of both spacers. Nevertheless, amplifications carried out with inverse primers EMSinvF/R and EMLinvF/R revealed that some short and long repeats are contiguous in the same tandem array. This is the first report of the coexistence of two variable spacers in the same tandem array in bivalve mollusks.
A tandem affinity purification tag of TGA2 for isolation of interacting proteins in Arabidopsis thaliana

PubMed Central

Stotz, Henrik U; Findling, Simone; Nukarinen, Ella; Weckwerth, Wolfram; Mueller, Martin J; Berger, Susanne

2014-01-01

Tandem affinity purification (TAP) tagging provides a powerful tool for isolating interacting proteins in vivo. TAP-tag purification offers particular advantages for the identification of stimulus-induced protein interactions. Type II bZIP transcription factors (TGA2, TGA5 and TGA6) play key roles in pathways that control salicylic acid, ethylene, xenobiotic and reactive oxylipin signaling. Although proteins interacting with these transcription factors have been identified through genetic and yeast 2-hybrid screening, others are still elusive. We have therefore generated a C-terminal TAP-tag of TGA2 to isolate additional proteins that interact with this transcription factor. Three lines most highly expressing TAP-tagged TGA2 were functional in that they partially complemented reactive oxylipin-responsive gene expression in a tga2 tga5 tga6 triple mutant. TAP-tagged TGA2 in the most strongly overexpressing line was proteolytically less stable than in the other 2 lines. Only this overexpressing line could be used in a 2-step purification process, resulting in isolation of co-purifying bands of larger molecular weight than TGA2. TAP-tagged TGA2 was used to pull down NPR1, a protein known to interact with this transcription factor. Mass spectrometry was used to identify peptides that co-purified with TAP-tagged TGA2. Having generated this TGA2 TAP-tag line will therefore be an asset to researchers interested in stimulus-induced signal transduction processes. PMID:25482810
Development of a Tandem Repeat-Based Polymerase Chain Displacement Reaction Method for Highly Sensitive Detection of 'Candidatus Liberibacter asiaticus'.

PubMed

Lou, Binghai; Song, Yaqin; RoyChowdhury, Moytri; Deng, Chongling; Niu, Ying; Fan, Qijun; Tang, Yan; Zhou, Changyong

2018-02-01

Huanglongbing (HLB) is one of the most destructive diseases in citrus production worldwide. Early detection of HLB pathogens can facilitate timely removal of infected citrus trees in the field. However, low titer and uneven distribution of HLB pathogens in host plants make reliable detection challenging. Therefore, the development of effective detection methods with high sensitivity is imperative. This study reports the development of a novel method, tandem repeat-based polymerase chain displacement reaction (TR-PCDR), for the detection of 'Candidatus Liberibacter asiaticus', a widely distributed HLB-associated bacterium. A uniquely designed primer set (TR2-PCDR-F/TR2-PCDR-1R) and a thermostable Taq DNA polymerase mutant with strand displacement activity were used for TR-PCDR amplification. Performed in a regular thermal cycler, TR-PCDR could produce more than two amplicons after each amplification cycle. Sensitivity of the developed TR-PCDR was 10 copies of target DNA fragment. The sensitive level was proven to be 100× higher than conventional PCR and similar to real-time PCR. Data from the detection of 'Ca. L. asiaticus' with filed samples using the above three methods also showed similar results. No false-positive TR-PCDR amplification was observed from healthy citrus samples and water controls. These results thereby illustrated that the developed TR-PCDR method can be applied to the reliable, highly sensitive, and cost-effective detection of 'Ca. L. asiaticus'.
Basic Fibroblast Growth Factor Fused with Tandem Collagen-Binding Domains from Clostridium histolyticum Collagenase ColG Increases Bone Formation.

PubMed

Sekiguchi, Hiroyuki; Uchida, Kentaro; Matsushita, Osamu; Inoue, Gen; Nishi, Nozomu; Masuda, Ryo; Hamamoto, Nana; Koide, Takaki; Shoji, Shintaro; Takaso, Masashi

2018-01-01

Basic fibroblast growth factor 2 (bFGF) accelerates bone formation during fracture healing. Because the efficacy of bFGF decreases rapidly following its diffusion from fracture sites, however, repeated dosing is required to ensure a sustained therapeutic effect. We previously developed a fusion protein comprising bFGF, a polycystic kidney disease domain (PKD; s2b), and collagen-binding domain (CBD; s3) sourced from the Clostridium histolyticum class II collagenase, ColH, and reported that the combination of this fusion protein with a collagen-like peptide, poly(Pro-Hyp-Gly) 10 , induced mesenchymal cell proliferation and callus formation at fracture sites. In addition, C. histolyticum produces class I collagenase (ColG) with tandem CBDs (s3a and s3b) at the C-terminus. We therefore hypothesized that a bFGF fusion protein containing ColG-derived tandem CBDs (s3a and s3b) would show enhanced collagen-binding activity, leading to improved bone formation. Here, we examined the binding affinity of four collagen anchors derived from the two clostridial collagenases to H-Gly-Pro-Arg-Gly-(Pro-Hyp-Gly) 12 -NH 2 , a collagenous peptide, by surface plasmon resonance and found that tandem CBDs (s3a-s3b) have the highest affinity for the collagenous peptide. We also constructed four fusion proteins consisting of bFGF and s3 (bFGF-s3), s2b-s3b (bFGF-s2b-s3), s3b (bFGF-s3b), and s3a-s3b (bFGF-s3a-s3b) and compared their biological activities to those of a previous fusion construct (bFGF-s2b-s3) using a cell proliferation assay in vitro and a mouse femoral fracture model in vivo. Among these CB-bFGFs, bFGF-s3a-s3b showed the highest capacity to induce mesenchymal cell proliferation and callus formation in the mice fracture model. The poly(Pro-Hyp-Gly) 10 /bFGF-s3a-s3b construct may therefore have the potential to promote bone formation in clinical settings.
High-Resolution Mapping of a Repeat Protein Folding Free Energy Landscape.

PubMed

Fossat, Martin J; Dao, Thuy P; Jenkins, Kelly; Dellarole, Mariano; Yang, Yinshan; McCallum, Scott A; Garcia, Angel E; Barrick, Doug; Roumestand, Christian; Royer, Catherine A

2016-12-06

A complete description of the pathways and mechanisms of protein folding requires a detailed structural and energetic characterization of the conformational ensemble along the entire folding reaction coordinate. Simulations can provide this level of insight for small proteins. In contrast, with the exception of hydrogen exchange, which does not monitor folding directly, experimental studies of protein folding have not yielded such structural and energetic detail. NMR can provide residue specific atomic level structural information, but its implementation in protein folding studies using chemical or temperature perturbation is problematic. Here we present a highly detailed structural and energetic map of the entire folding landscape of the leucine-rich repeat protein, pp32 (Anp32), obtained by combining pressure-dependent site-specific 1 H- 15 N HSQC data with coarse-grained molecular dynamics simulations. The results obtained using this equilibrium approach demonstrate that the main barrier to folding of pp32 is quite broad and lies near the unfolded state, with structure apparent only in the C-terminal region. Significant deviation from two-state unfolding under pressure reveals an intermediate on the folded side of the main barrier in which the N-terminal region is disordered. A nonlinear temperature dependence of the population of this intermediate suggests a large heat capacity change associated with its formation. The combination of pressure, which favors the population of folding intermediates relative to chemical denaturants; NMR, which allows their observation; and constrained structure-based simulations yield unparalleled insight into protein folding mechanisms. Copyright Â© 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Identification of a repeated domain within mammalian alpha-synemin that interacts directly with talin.

PubMed

Sun, Ning; Critchley, David R; Paulin, Denise; Li, Zhenlin; Robson, Richard M

2008-05-01

The type VI intermediate filament (IF) protein synemin is a unique member of the IF protein superfamily. Synemin associates with the major type III IF protein desmin forming heteropolymeric intermediate filaments (IFs) within developed mammalian striated muscle cells. These IFs encircle and link all adjacent myofibrils together at their Z-lines, as well as link the Z-lines of the peripheral layer of cellular myofibrils to the costameres located periodically along and subjacent to the sarcolemma. Costameres are multi-protein assemblies enriched in the cytoskeletal proteins vinculin, alpha-actinin, and talin. We report herein a direct interaction of human alpha-synemin with the cytoskeletal protein talin by protein-protein interaction assays. The 312 amino acid insert (SNTIII) present only within alpha-synemin binds to the rod domain of talin in vitro and co-localizes with talin at focal adhesion sites within mammalian muscle cells. Confocal microscopy studies showed that synemin co-localizes with talin within the costameres of human skeletal muscle cells. Analysis of the primary sequences of human alpha- and beta-synemins revealed that SNTIII is composed of seven tandem repeats, each containing a specific Ser/Thr-X-Arg-His/Gln (S/T-X-R-H/Q) motif. Our results suggest human alpha-synemin plays an essential role in linking the heteropolymeric IFs to adherens-type junctions, such as the costameres within mammalian striated muscle cells, via its interaction with talin, thereby helping provide mechanical integration for the muscle cell cytoskeleton.
Multi-locus variable-number tandem repeat analysis of Chinese Brucella strains isolated from 1953 to 2013.

PubMed

Tian, Guo-Zhong; Cui, Bu-Yun; Piao, Dong-Ri; Zhao, Hong-Yan; Li, Lan-Yu; Liu, Xi; Xiao, Pei; Zhao, Zhong-Zhi; Xu, Li-Qing; Jiang, Hai; Li, Zhen-Jun

2017-05-02

Brucellosis was a common human and livestock disease caused by Brucella strains, the category B priority pathogens by the US Center for Disease Control (CDC). Identified as a priority disease in human and livestock populations, the increasing incidence in recent years in China needs urgent control measures for this disease but the molecular background important for monitoring the epidemiology of Brucella strains at the national level is still lacking. A total of 600 Brucella isolates collected during 60 years (from 1953 to 2013) in China were genotyped by multiple locus variable-number tandem repeat analysis (MLVA) and the variation degree of MLVA11 loci was calculated by the Hunter Gaston Diversity Index (HGDI) values. The charts and map were processed by Excel 2013, and cluster analysis and epidemiological distribution was performed using BioNumerics (version 5.1). The 600 representative Brucella isolates fell into 104 genotypes with 58 singleton genotypes by the MLVA11 assay, including B. melitensis biovars 2 and 3 (five main genotypes), B. abortus biovars 1 and 3 (two main genotypes), B. suis biovars 1 and 3 (three main genotypes), and B. canis (two main genotypes) respectively. While most B. suis biovar 1 and biovar 3 were respectively found in northern provinces and southern provinces, B. melitensis and B. abortus strains were dominant in China. Canine Brucellosis was only found in animals without any human cases reported. Eight Brucellosis epidemic peaks emerged during the 60 years between 1953 and 2013: 1955 - 1959, 1962 - 1969, 1971 - 1975, 1977 - 1983, 1985 - 1989, 1992 - 1997, 2000 - 2008 and 2010 - 2013 in China. Brucellosis has its unique molecular epidemiological patterns with specific spatial and temporal distribution according to MLVA. IDOP-D-16-00101.
Linkage analysis with multiplexed short tandem repeat polymorphisms using infrared fluorescence and M13 tailed primers

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oetting, W.S.; Lee, H.K.; Flanders, D.J.

The use of short tandem repeat polymorphisms (STRPs) as marker loci for linkage analysis is becoming increasingly important due to their large numbers in the human genome and their high degree of polymorphism. Fluorescence-based detection of the STRP pattern with an automated DNA sequencer has improved the efficiency of this technique by eliminating the need for radioactivity and producing a digitized autoradiogram-like image that can be used for computer analysis. In an effort to simplify the procedure and to reduce the cost of fluorescence STRP analysis, we have developed a technique known as multiplexing STRPs with tailed primers (MSTP) usingmore » primers that have a 19-bp extension, identical to the sequence of an M13 sequencing primer, on the 5{prime} end of the forward primer in conjunction with multiplexing several primer pairs in a single polymerase chain reaction (PCR) amplification. The banding pattern is detected with the addition of the M13 primer-dye conjugate as the sole primer conjugated to the fluorescent dye, eliminating the need for direct conjugation of the infrared fluorescent dye to the STRP primers. The use of MSTP for linkage analysis greatly reduces the number of PCR reactions. Up to five primer pairs can be multiplexed together in the same reaction. At present, a set of 148 STRP markers spaced at an average genetic distance of 28 cM throughout the autosomal genome can be analyzed in 37 sets of multiplexed amplification reactions. We have automated the analysis of these patterns for linkage using software that both detects the STRP banding pattern and determines their sizes. This information can then be exported in a user-defined format from a database manager for linkage analysis. 15 refs., 2 figs., 4 tabs.« less
Novel isoprenylated proteins identified by an expression library screen.

PubMed

Biermann, B J; Morehead, T A; Tate, S E; Price, J R; Randall, S K; Crowell, D N

1994-10-14

Isoprenylated proteins are involved in eukaryotic cell growth and signal transduction. The protein determinant for prenylation is a short carboxyl-terminal motif containing a cysteine, to which the isoprenoid is covalently attached via thioether linkage. To date, isoprenylated proteins have almost all been identified by demonstrating the attachment of an isoprenoid to previously known proteins. Thus, many isoprenylated proteins probably remain undiscovered. To identify novel isoprenylated proteins for subsequent biochemical study, colony blots of a Glycine max cDNA expression library were [3H]farnesyl-labeled in vitro. Proteins identified by this screen contained several different carboxyl termini that conform to consensus farnesylation motifs. These proteins included known farnesylated proteins (DnaJ homologs) and several novel proteins, two of which contained six or more tandem repeats of a hexapeptide having the consensus sequence (E/G)(G/P)EK(P/K)K. Thus, plants contain a diverse array of genes encoding farnesylated proteins, and our results indicate that fundamental differences in the identities of farnesylated proteins may exist between plants and other eukaryotes. Expression library screening by direct labeling can be adapted to identify isoprenylated proteins from other organisms, as well as proteins with other post-translational modifications.
Multilocus variable-number tandem repeat analysis for molecular typing and phylogenetic analysis of Shigella flexneri

PubMed Central

2009-01-01

Background Shigella flexneri is one of the causative agents of shigellosis, a major cause of childhood mortality in developing countries. Multilocus variable-number tandem repeat (VNTR) analysis (MLVA) is a prominent subtyping method to resolve closely related bacterial isolates for investigation of disease outbreaks and provide information for establishing phylogenetic patterns among isolates. The present study aimed to develop an MLVA method for S. flexneri and the VNTR loci identified were tested on 242 S. flexneri isolates to evaluate their variability in various serotypes. The isolates were also analyzed by pulsed-field gel electrophoresis (PFGE) to compare the discriminatory power and to evaluate the usefulness of MLVA as a tool for phylogenetic analysis of S. flexneri. Results Thirty-six VNTR loci were identified by exploring the repeat sequence loci in genomic sequences of Shigella species and by testing the loci on nine isolates of different subserotypes. The VNTR loci in different serotype groups differed greatly in their variability. The discriminatory power of an MLVA assay based on four most variable VNTR loci was higher, though not significantly, than PFGE for the total isolates, a panel of 2a isolates, which were relatively diverse, and a panel of 4a/Y isolates, which were closely-related. Phylogenetic groupings based on PFGE patterns and MLVA profiles were considerably concordant. The genetic relationships among the isolates were correlated with serotypes. The phylogenetic trees constructed using PFGE patterns and MLVA profiles presented two distinct clusters for the isolates of serotype 3 and one distinct cluster for each of the serotype groups, 1a/1b/NT, 2a/2b/X/NT, 4a/Y, and 6. Isolates that had different serotypes but had closer genetic relatedness than those with the same serotype were observed between serotype Y and subserotype 4a, serotype X and subserotype 2b, subserotype 1a and 1b, and subserotype 3a and 3b. Conclusions The 36 VNTR loci
Spectrum of Phenylalanine Hydroxylase Gene Mutations in Hamadan and Lorestan Provinces of Iran and Their Associations with Variable Number of Tandem Repeat Alleles.

PubMed

Alibakhshi, Reza; Moradi, Keivan; Biglari, Mostafa; Shafieenia, Samaneh

2018-05-01

Phenylketonuria (PKU) is one of the most common known inherited metabolic diseases. The present study aimed to investigate the status of molecular defects in phenylalanine hydroxylase ( PAH ) gene in western Iranian PKU patients (predominantly from Kermanshah, Hamadan, and Lorestan provinces) during 2014-2016. Additionally, the results were compared with similar studies in Iran. Nucleotide sequence analysis of all 13 exons and their flanking intronic regions of the PAH gene was performed in 18 western Iranian PKU patients. Moreover, a variable number of tandem repeat (VNTR) located in the PAH gene was studied. The results revealed a mutational spectrum encompassing 11 distinct mutations distributed along the PAH gene sequence on 34 of the 36 mutant alleles (diagnostic efficiency of 94.4%). Also, four PAH VNTR alleles (with repeats of 3, 7, 8 and 9) were detected. The three most frequent mutations were IVS9+5G>A, IVS7-5T>C, and p.P281L with the frequency of 27.8%, 11%, and 11%, respectively. The results showed that there is not only a consanguineous relation, but also a difference in PAH characters of mutations between Kermanshah and the other two parts of western Iran (Hamadan and Lorestan). Also, it seems that the spectrum of mutations in western Iran is relatively distinct from other parts of the country, suggesting that this region might be a special PAH gene distribution region. Moreover, our findings can be useful in the identification of genotype to phenotype relationship in patients, and provide future abilities for confirmatory diagnostic testing, prognosis, and predict the severity of PKU patients.
Differential interaction and aggregation of 3-repeat and 4-repeat tau isoforms with 14-3-3{zeta} protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sadik, Golam; Tanaka, Toshihisa, E-mail: tanaka@psy.med.osaka-u.ac.jp; Kato, Kiyoko

2009-05-22

Tau isoforms, 3-repeat (3R) and 4-repeat tau (4R), are differentially involved in neuronal development and in several tauopathies. 14-3-3 protein binds to tau and 14-3-3/tau association has been found both in the development and in tauopathies. To understand the role of 14-3-3 in the differential regulation of tau isoforms, we have performed studies on the interaction and aggregation of 3R-tau and 4R-tau, either phosphorylated or unphosphorylated, with 14-3-3{zeta}. We show by surface plasmon resonance studies that the interaction between unphosphorylated 3R-tau and 14-3-3{zeta} is {approx}3-folds higher than that between unphosphorylated 4R-tau and 14-3-3{zeta}. Phosphorylation of tau by protein kinase Amore » (PKA) increases the affinity of both 3R- and 4R-tau for 14-3-3{zeta} to a similar level. An in vitro aggregation assay employing both transmission electron microscopy and fluorescence spectroscopy revealed the aggregation of unphosphorylated 4R-tau to be significantly higher than that of unphosphorylated 3R-tau following the induction of 14-3-3{zeta}. The filaments formed from 3R- and 4R-tau were almost similar in morphology. In contrast, the aggregation of both 3R- and 4R-tau was reduced to a similar low level after phosphorylation with PKA. Taken together, these results suggest that 14-3-3{zeta} exhibits a similar role for tau isoforms after PKA-phosphorylation, but a differential role for unphosphorylated tau. The significant aggregation of 4R-tau by 14-3-3{zeta} suggests that 14-3-3 may act as an inducer in the generation of 4R-tau-predominant neurofibrillary tangles in tauopathies.« less
A proteomics method using immunoaffinity fluorogenic derivatization-liquid chromatography/tandem mass spectrometry (FD-LC-MS/MS) to identify a set of interacting proteins.

PubMed

Nakata, Katsunori; Saitoh, Ryoichi; Ishigai, Masaki; Imai, Kazuhiro

2018-02-01

Biological functions in organisms are usually controlled by a set of interacting proteins, and identifying the proteins that interact is useful for understanding the mechanism of the functions. Immunoprecipitation is a method that utilizes the affinity of an antibody to isolate and identify the proteins that have interacted in a biological sample. In this study, the FD-LC-MS/MS method, which involves fluorogenic derivatization followed by separation and quantification by HPLC and finally identification of proteins by HPLC-tandem mass spectrometry, was used to identify proteins in immunoprecipitated samples, using heat shock protein 90 (HSP90) as a model of an interacting protein in HepaRG cells. As a result, HSC70 protein, which was known to form a complex with HSP90, was isolated, together with three different types of HSP90-beta. The results demonstrated that the proposed immunoaffinity-FD-LC-MS/MS method could be useful for simultaneously detecting and identifying the proteins that interact with a certain protein. Copyright © 2017 John Wiley & Sons, Ltd.

Tandem mass spectrometry of human tryptic blood peptides calculated by a statistical algorithm and captured by a relational database with exploration by a general statistical analysis system.

PubMed

Bowden, Peter; Beavis, Ron; Marshall, John

2009-11-02

A goodness of fit test may be used to assign tandem mass spectra of peptides to amino acid sequences and to directly calculate the expected probability of mis-identification. The product of the peptide expectation values directly yields the probability that the parent protein has been mis-identified. A relational database could capture the mass spectral data, the best fit results, and permit subsequent calculations by a general statistical analysis system. The many files of the Hupo blood protein data correlated by X!TANDEM against the proteins of ENSEMBL were collected into a relational database. A redundant set of 247,077 proteins and peptides were correlated by X!TANDEM, and that was collapsed to a set of 34,956 peptides from 13,379 distinct proteins. About 6875 distinct proteins were only represented by a single distinct peptide, 2866 proteins showed 2 distinct peptides, and 3454 proteins showed at least three distinct peptides by X!TANDEM. More than 99% of the peptides were associated with proteins that had cumulative expectation values, i.e. probability of false positive identification, of one in one hundred or less. The distribution of peptides per protein from X!TANDEM was significantly different than those expected from random assignment of peptides.
Multiple-locus, variable number of tandem repeat analysis (MLVA) of the fish-pathogen Francisella noatunensis

PubMed Central

2011-01-01

Background Since Francisella noatunensis was first isolated from cultured Atlantic cod in 2004, it has emerged as a global fish pathogen causing disease in both warm and cold water species. Outbreaks of francisellosis occur in several important cultured fish species making a correct management of this disease a matter of major importance. Currently there are no vaccines or treatments available. A strain typing system for use in studies of F. noatunensis epizootics would be an important tool for disease management. However, the high genetic similarity within the Francisella spp. makes strain typing difficult, but such typing of the related human pathogen Francisella tullarensis has been performed successfully by targeting loci with higher genetic variation than the traditional signature sequences. These loci are known as Variable Numbers of Tandem Repeat (VNTR). The aim of this study is to identify possible useful VNTRs in the genome of F. noatunensis. Results Seven polymorphic VNTR loci were identified in the preliminary genome sequence of F. noatunensis ssp. noatunensis GM2212 isolate. These VNTR-loci were sequenced in F. noatunensis isolates collected from Atlantic cod (Gadus morhua) from Norway (n = 21), Three-line grunt (Parapristipoma trilineatum) from Japan (n = 1), Tilapia (Oreochromis spp.) from Indonesia (n = 3) and Atlantic salmon (Salmo salar) from Chile (n = 1). The Norwegian isolates presented in this study show both nine allelic profiles and clades, and that the majority of the farmed isolates belong in two clades only, while the allelic profiles from wild cod are unique. Conclusions VNTRs can be used to separate isolates belonging to both subspecies of F. noatunensis. Low allelic diversity in F. noatunensis isolates from outbreaks in cod culture compared to isolates wild cod, indicate that transmission of these isolates may be a result of human activity. The sequence based MLVA system presented in this study should provide a good starting point for
[Polymorphism analysis of 20 autosomal short-tandem repeat loci in southern Chinese Han population].

PubMed

Chen, Ling; Lu, Hui-Jie; DU, Wei-An; Qiu, Ping-Ming; Liu, Chao

2016-02-20

To evaluate the value of PowerPlex ® 21 System (Promega) and study the genetic polymorphism of its 20 short-tandem repeat (STR) loci in southern Chinese Han population. We conducted genotyping experiments using PowerPlex ® 21 System on 20 autosomal STR loci (D3S1358, D1S1656, D6S1043, D13S317, Penta E, D16S539, D18S51, D2S1338, CSF1PO, Penta D, TH01, vWA, D21S11, D7S820, D5S818, TPOX, D8S1179, D12S391, D19S433 and FGA) in 2367 unrelated Chinese Han individuals living in South China. The allele frequencies and parameters commonly used in forensic science were statistically analyzed in these individuals and compared with the reported data of other populations. The PowerPlex ® 21 System had a power of discrimination (PD) ranging from 0.7839 to 0.9852 and a power of exclusion (PE) ranging from 0.2974 to 0.8099 for the 20 loci. No significant deviation from Hardy-Weinberg expectations was found for all the loci except for D5S818. This southern Chinese Han population had significant differences in the allele frequencies from 8 ethnic groups reported in China, and showed significant differences at 8 to 20 STR foci from 5 foreign populations. The allele frequency at the locus D1S1656 in this southern Chinese Han population differed significantly from those in the 5 foreign populations and from 3 reported Han populations in Beijing, Zhejiang Province and Fujian Province of China. The neighbor-joining phylogenetictree showed clustering of all the Asian populations in one branch, while the northern Italian and Argentina populations clustered in a separate branch. This southern Chinese Han population had the nearest affinity with the Yi ethnic population in Yunnan Province of China. The 20 STR loci are highly polymorphic in this southern Chinese Han population, suggesting the value of this set of STR loci in forensic personal identification, paternity testing and anthropological study.
Clostridium botulinum Group I Strain Genotyping by 15-Locus Multilocus Variable-Number Tandem-Repeat Analysis ▿ †

PubMed Central

Fillo, Silvia; Giordani, Francesco; Anniballi, Fabrizio; Gorgé, Olivier; Ramisse, Vincent; Vergnaud, Gilles; Riehm, Julia M.; Scholz, Holger C.; Splettstoesser, Wolf D.; Kieboom, Jasper; Olsen, Jaran-Strand; Fenicia, Lucia; Lista, Florigio

2011-01-01

Clostridium botulinum is a taxonomic designation that encompasses a broad variety of spore-forming, Gram-positive bacteria producing the botulinum neurotoxin (BoNT). C. botulinum is the etiologic agent of botulism, a rare but severe neuroparalytic disease. Fine-resolution genetic characterization of C. botulinum isolates of any BoNT type is relevant for both epidemiological studies and forensic microbiology. A 10-locus multiple-locus variable-number tandem-repeat analysis (MLVA) was previously applied to isolates of C. botulinum type A. The present study includes five additional loci designed to better address proteolytic B and F serotypes. We investigated 79 C. botulinum group I strains isolated from human and food samples in several European countries, including types A (28), B (36), AB (4), and F (11) strains, and 5 nontoxic Clostridium sporogenes. Additional data were deduced from in silico analysis of 10 available fully sequenced genomes. This 15-locus MLVA (MLVA-15) scheme identified 86 distinct genotypes that clustered consistently with the results of amplified fragment length polymorphism (AFLP) and MLVA genotyping in previous reports. An MLVA-7 scheme, a subset of the MLVA-15, performed on a lab-on-a-chip device using a nonfluorescent subset of primers, is also proposed as a first-line assay. The phylogenetic grouping obtained with the MLVA-7 does not differ significantly from that generated by the MLVA-15. To our knowledge, this report is the first to analyze genetic variability among all of the C. botulinum group I serotypes by MLVA. Our data provide new insights into the genetic variability of group I C. botulinum isolates worldwide and demonstrate that this group is genetically highly diverse. PMID:22012011
Multiple-locus variable-number tandem-repeats analysis of Listeria monocytogenes using multicolour capillary electrophoresis and comparison with pulsed-field gel electrophoresis typing.

PubMed

Lindstedt, Bjørn-Arne; Tham, Wilhelm; Danielsson-Tham, Marie-Louise; Vardund, Traute; Helmersson, Seved; Kapperud, Georg

2008-02-01

The multiple-locus variable-number tandem-repeats analysis (MLVA) method for genotyping has proven to be a fast and reliable typing tool in several bacterial species. MLVA is in our laboratory the routine typing method for Salmonella enterica subsp. enterica serovar Typhimurium and Escherichia coli O157. The gram-positive bacteria Listeria monocytogenes, while not isolated as frequent as S. Typhimurium and E. coli, causes severe illness with an overall mortality rate of 30%. Thus, it is important that any outbreak of this pathogen is detected early and a fast trace to the source can be performed. In view of this, we have used the information provided by two fully sequenced L. monocytogenes strains to develop a MLVA assay coupled with high-resolution capillary electrophoresis and compared it to pulsed-field gel electrophoresis (PFGE) in two sets of isolates, one Norwegian (79 isolates) and one Swedish (61 isolates) set. The MLVA assay could resolve all of the L. monocytogenes serotypes tested, and was slightly more discriminatory than PFGE for the Norwegian isolates (28 MLVA profiles and 24 PFGE profiles) and opposite for the Swedish isolates (42 MLVA profiles and 43 PFGE profiles).
Fast tandem mass spectra-based protein identification regardless of the number of spectra or potential modifications examined.

PubMed

Falkner, Jayson; Andrews, Philip

2005-05-15

Comparing tandem mass spectra (MSMS) against a known dataset of protein sequences is a common method for identifying unknown proteins; however, the processing of MSMS by current software often limits certain applications, including comprehensive coverage of post-translational modifications, non-specific searches and real-time searches to allow result-dependent instrument control. This problem deserves attention as new mass spectrometers provide the ability for higher throughput and as known protein datasets rapidly grow in size. New software algorithms need to be devised in order to address the performance issues of conventional MSMS protein dataset-based protein identification. This paper describes a novel algorithm based on converting a collection of monoisotopic, centroided spectra to a new data structure, named 'peptide finite state machine' (PFSM), which may be used to rapidly search a known dataset of protein sequences, regardless of the number of spectra searched or the number of potential modifications examined. The algorithm is verified using a set of commercially available tryptic digest protein standards analyzed using an ABI 4700 MALDI TOFTOF mass spectrometer, and a free, open source PFSM implementation. It is illustrated that a PFSM can accurately search large collections of spectra against large datasets of protein sequences (e.g. NCBI nr) using a regular desktop PC; however, this paper only details the method for identifying peptide and subsequently protein candidates from a dataset of known protein sequences. The concept of using a PFSM as a peptide pre-screening technique for MSMS-based search engines is validated by using PFSM with Mascot and XTandem. Complete source code, documentation and examples for the reference PFSM implementation are freely available at the Proteome Commons, http://www.proteomecommons.org and source code may be used both commercially and non-commercially as long as the original authors are credited for their work.
Absolute quantification of protein NP24 in tomato fruit by liquid chromatography/tandem mass spectrometry using stable isotope-labelled tryptic peptide standard.

PubMed

Ippoushi, Katsunari; Sasanuma, Motoe; Oike, Hideaki; Kobori, Masuko; Maeda-Yamamoto, Mari

2015-04-15

Protein NP24 is a thaumatin-like protein contained in tomato (Lycopersicon esculentum Mill.). This protein is reported to be a putative tomato allergen and is listed as a food allergen in Structural Database of Allergenic Proteins (SDAP). In this research, we developed the quantitative analysis of NP24 by employing the protein absolute quantification (AQUA) technology composed of stable isotope-labelled internal standard (SIIS) peptide (GQTWVINAPR[(13)C6,(15)N4]) and liquid chromatography/tandem mass spectrometry (LC/MS/MS). A linear relationship (r(2)>0.99) was found throughout the concentration range (2.0-500 fmol/μL). The coefficients of variation (CVs) measured on each of the five days when NP24 contained in the tomato skin was analysed did not exceed 13%. Our developed assay of NP24 will contribute to the allergological examination of tomato and its derived products. Copyright © 2014 Elsevier Ltd. All rights reserved.
Sel1 repeat protein LpnE is a Legionella pneumophila virulence determinant that influences vacuolar trafficking.

PubMed

Newton, Hayley J; Sansom, Fiona M; Dao, Jenny; McAlister, Adrian D; Sloan, Joan; Cianciotto, Nicholas P; Hartland, Elizabeth L

2007-12-01

The environmental pathogen Legionella pneumophila possesses five proteins with Sel1 repeats (SLRs) from the tetratricopeptide repeat protein family. Three of these proteins, LpnE, EnhC, and LidL, have been implicated in the ability of L. pneumophila to efficiently establish infection and/or manipulate host cell trafficking events. Previously, we showed that LpnE is important for L. pneumophila entry into macrophages and epithelial cells. In further virulence studies here, we show that LpnE is also required for efficient infection of Acanthamoeba castellanii by L. pneumophila and for replication of L. pneumophila in the lungs of A/J mice. In addition, we found that the role of LpnE in host cell invasion is dependent on the eight SLR regions of the protein. A truncated form of LpnE lacking the two C-terminal SLR domains was unable to complement the invasion defect of an lpnE mutant of L. pneumophila 130b in both the A549 and THP-1 cell lines. The lpnE mutant displayed impaired avoidance of LAMP-1 association, suggesting that LpnE influenced trafficking of the L. pneumophila vacuole, similar to the case for EnhC and LidL. We also found that LpnE was present in L. pneumophila culture supernatants and that its export was independent of both the Lsp type II secretion system and the Dot/Icm type IV secretion system. The fact that LpnE was exported suggested that the protein may interact with a eukaryotic protein. Using LpnE as bait, we screened a HeLa cell cDNA library for interacting partners, using the yeast two-hybrid system. Examination of the protein-protein interaction between LpnE and a eukaryotic protein, obscurin-like protein 1, suggested that LpnE can interact with eukaryotic proteins containing immunoglobulin-like folds via the SLR regions. This investigation has further characterized the contribution of LpnE to L. pneumophila virulence and, more specifically, the importance of the SLR regions to LpnE function.
Invasive Species Management on Military Lands: Clustered Regularly Interspaced Short Palindromic Repeat/ CRISPR associated protein 9 (CRISPR/Cas9) based Gene Drives

DTIC Science & Technology

2017-06-30

Clustered Regularly Interspaced Short Palindromic Repeat/ CRISPR -associated protein 9 ( CRISPR /Cas9)-based Gene Drives En vi ro nm en ta l L ab or at...Management on Military Lands Clustered Regularly Interspaced Short Palindromic Repeat/ CRISPR -associated protein 9 ( CRISPR /Cas9)-based Gene Drives Ping... CRISPR /Cas9-based Gene Drives for Invasive Species Management on Military Lands” ERDC/EL SR-17-2 ii Abstract Applications of genetic engineering
Bipartite Topology of Treponema pallidum Repeat Proteins C/D and I

PubMed Central

Anand, Arvind; LeDoyt, Morgan; Karanian, Carson; Luthra, Amit; Koszelak-Rosenblum, Mary; Malkowski, Michael G.; Puthenveetil, Robbins; Vinogradova, Olga; Radolf, Justin D.

2015-01-01

We previously identified Treponema pallidum repeat proteins TprC/D, TprF, and TprI as candidate outer membrane proteins (OMPs) and subsequently demonstrated that TprC is not only a rare OMP but also forms trimers and has porin activity. We also reported that TprC contains N- and C-terminal domains (TprCN and TprCC) orthologous to regions in the major outer sheath protein (MOSPN and MOSPC) of Treponema denticola and that TprCC is solely responsible for β-barrel formation, trimerization, and porin function by the full-length protein. Herein, we show that TprI also possesses bipartite architecture, trimeric structure, and porin function and that the MOSPC-like domains of native TprC and TprI are surface-exposed in T. pallidum, whereas their MOSPN-like domains are tethered within the periplasm. TprF, which does not contain a MOSPC-like domain, lacks amphiphilicity and porin activity, adopts an extended inflexible structure, and, in T. pallidum, is tightly bound to the protoplasmic cylinder. By thermal denaturation, the MOSPN and MOSPC-like domains of TprC and TprI are highly thermostable, endowing the full-length proteins with impressive conformational stability. When expressed in Escherichia coli with PelB signal sequences, TprC and TprI localize to the outer membrane, adopting bipartite topologies, whereas TprF is periplasmic. We propose that the MOSPN-like domains enhance the structural integrity of the cell envelope by anchoring the β-barrels within the periplasm. In addition to being bona fide T. pallidum rare outer membrane proteins, TprC/D and TprI represent a new class of dual function, bipartite bacterial OMP. PMID:25805501
The Impact of Multilocus Variable-Number Tandem-Repeat Analysis on PulseNet Canada Escherichia coli O157:H7 Laboratory Surveillance and Outbreak Support, 2008-2012.

PubMed

Rumore, Jillian Leigh; Tschetter, Lorelee; Nadon, Celine

2016-05-01

The lack of pattern diversity among pulsed-field gel electrophoresis (PFGE) profiles for Escherichia coli O157:H7 in Canada does not consistently provide optimal discrimination, and therefore, differentiating temporally and/or geographically associated sporadic cases from potential outbreak cases can at times impede investigations. To address this limitation, DNA sequence-based methods such as multilocus variable-number tandem-repeat analysis (MLVA) have been explored. To assess the performance of MLVA as a supplemental method to PFGE from the Canadian perspective, a retrospective analysis of all E. coli O157:H7 isolated in Canada from January 2008 to December 2012 (inclusive) was conducted. A total of 2285 E. coli O157:H7 isolates and 63 clusters of cases (by PFGE) were selected for the study. Based on the qualitative analysis, the addition of MLVA improved the categorization of cases for 60% of clusters and no change was observed for ∼40% of clusters investigated. In such situations, MLVA serves to confirm PFGE results, but may not add further information per se. The findings of this study demonstrate that MLVA data, when used in combination with PFGE-based analyses, provide additional resolution to the detection of clusters lacking PFGE diversity as well as demonstrate good epidemiological concordance. In addition, MLVA is able to identify cluster-associated isolates with variant PFGE pattern combinations that may have been previously missed by PFGE alone. Optimal laboratory surveillance in Canada is achieved with the application of PFGE and MLVA in tandem for routine surveillance, cluster detection, and outbreak response.
Highly diverse variable number tandem repeat loci in the E. coli O157:H7 and O55:H7 genomes for high-resolution molecular typing.

PubMed

Keys, C; Kemper, S; Keim, P

2005-01-01

Evaluation of the Escherichia coli genome for variable number tandem repeat (VNTR) loci in order to provide a subtyping tool with greater discrimination and more efficient capacity. Twenty-nine putative VNTR loci were identified from the E. coli genomic sequence. Their variability was validated by characterizing the number of repeats at each locus in a set of 56 E. coli O157:H7/HN and O55:H7 isolates. An optimized multiplex assay system was developed to facility high capacity analysis. Locus diversity values ranged from 0.23 to 0.95 while the number of alleles ranged from two to 29. This multiple-locus VNTR analysis (MLVA) data was used to describe genetic relationships among these isolates and was compared with PFGE (pulse field gel electrophoresis) data from a subset of the same strains. Genetic similarity values were highly correlated between the two approaches, through MLVA was capable of discrimination amongst closely related isolates when PFGE similar values were equal to 1.0. Highly variable VNTR loci exist in the E. coli O157:H7 genome and are excellent estimators of genetic relationships, in particular for closely related isolates. Escherichia coli O157:H7 MLVA offers a complimentary analysis to the more traditional PFGE approach. Application of MLVA to an outbreak cluster could generate superior molecular epidemiology and result in a more effective public health response.
RRW: repeated random walks on genome-scale protein networks for local cluster discovery

PubMed Central

Macropol, Kathy; Can, Tolga; Singh, Ambuj K

2009-01-01

Background We propose an efficient and biologically sensitive algorithm based on repeated random walks (RRW) for discovering functional modules, e.g., complexes and pathways, within large-scale protein networks. Compared to existing cluster identification techniques, RRW implicitly makes use of network topology, edge weights, and long range interactions between proteins. Results We apply the proposed technique on a functional network of yeast genes and accurately identify statistically significant clusters of proteins. We validate the biological significance of the results using known complexes in the MIPS complex catalogue database and well-characterized biological processes. We find that 90% of the created clusters have the majority of their catalogued proteins belonging to the same MIPS complex, and about 80% have the majority of their proteins involved in the same biological process. We compare our method to various other clustering techniques, such as the Markov Clustering Algorithm (MCL), and find a significant improvement in the RRW clusters' precision and accuracy values. Conclusion RRW, which is a technique that exploits the topology of the network, is more precise and robust in finding local clusters. In addition, it has the added flexibility of being able to find multi-functional proteins by allowing overlapping clusters. PMID:19740439
Selection and Validation of a Multilocus Variable-Number Tandem-Repeat Analysis Panel for Typing Shigella spp.▿ †

PubMed Central

Gorgé, Olivier; Lopez, Stéphanie; Hilaire, Valérie; Lisanti, Olivier; Ramisse, Vincent; Vergnaud, Gilles

2008-01-01

The Shigella genus has historically been separated into four species, based on biochemical assays. The classification within each species relies on serotyping. Recently, genome sequencing and DNA assays, in particular the multilocus sequence typing (MLST) approach, greatly improved the current knowledge of the origin and phylogenetic evolution of Shigella spp. The Shigella and Escherichia genera are now considered to belong to a unique genomospecies. Multilocus variable-number tandem-repeat (VNTR) analysis (MLVA) provides valuable polymorphic markers for genotyping and performing phylogenetic analyses of highly homogeneous bacterial pathogens. Here, we assess the capability of MLVA for Shigella typing. Thirty-two potentially polymorphic VNTRs were selected by analyzing in silico five Shigella genomic sequences and subsequently evaluated. Eventually, a panel of 15 VNTRs was selected (i.e., MLVA15 analysis). MLVA15 analysis of 78 strains or genome sequences of Shigella spp. and 11 strains or genome sequences of Escherichia coli distinguished 83 genotypes. Shigella population cluster analysis gave consistent results compared to MLST. MLVA15 analysis showed capabilities for E. coli typing, providing classification among pathogenic and nonpathogenic E. coli strains included in the study. The resulting data can be queried on our genotyping webpage (http://mlva.u-psud.fr). The MLVA15 assay is rapid, highly discriminatory, and reproducible for Shigella and Escherichia strains, suggesting that it could significantly contribute to epidemiological trace-back analysis of Shigella infections and pathogenic Escherichia outbreaks. Typing was performed on strains obtained mostly from collections. Further studies should include strains of much more diverse origins, including all pathogenic E. coli types. PMID:18216214
An update on polygalacturonase-inhibiting protein (PGIP), a leucine-rich repeat protein that protects crop plants against pathogens

PubMed Central

Kalunke, Raviraj M.; Tundo, Silvio; Benedetti, Manuel; Cervone, Felice; De Lorenzo, Giulia; D'Ovidio, Renato

2015-01-01

Polygalacturonase inhibiting proteins (PGIPs) are cell wall proteins that inhibit the pectin-depolymerizing activity of polygalacturonases secreted by microbial pathogens and insects. These ubiquitous inhibitors have a leucine-rich repeat structure that is strongly conserved in monocot and dicot plants. Previous reviews have summarized the importance of PGIP in plant defense and the structural basis of PG-PGIP interaction; here we update the current knowledge about PGIPs with the recent findings on the composition and evolution of pgip gene families, with a special emphasis on legume and cereal crops. We also update the information about the inhibition properties of single pgip gene products against microbial PGs and the results, including field tests, showing the capacity of PGIP to protect crop plants against fungal, oomycetes and bacterial pathogens. PMID:25852708
[Progress in the spectral library based protein identification strategy].

PubMed

Yu, Derui; Ma, Jie; Xie, Zengyan; Bai, Mingze; Zhu, Yunping; Shu, Kunxian

2018-04-25

Exponential growth of the mass spectrometry (MS) data is exhibited when the mass spectrometry-based proteomics has been developing rapidly. It is a great challenge to develop some quick, accurate and repeatable methods to identify peptides and proteins. Nowadays, the spectral library searching has become a mature strategy for tandem mass spectra based proteins identification in proteomics, which searches the experiment spectra against a collection of confidently identified MS/MS spectra that have been observed previously, and fully utilizes the abundance in the spectrum, peaks from non-canonical fragment ions, and other features. This review provides an overview of the implement of spectral library search strategy, and two key steps, spectral library construction and spectral library searching comprehensively, and discusses the progress and challenge of the library search strategy.
Aggregation landscapes of Huntingtin exon 1 protein fragments and the critical repeat length for the onset of Huntington’s disease

PubMed Central

Chen, Mingchen; Wolynes, Peter G.

2017-01-01

Huntington’s disease (HD) is a neurodegenerative disease caused by an abnormal expansion in the polyglutamine (polyQ) track of the Huntingtin (HTT) protein. The severity of the disease depends on the polyQ repeat length, arising only in patients with proteins having 36 repeats or more. Previous studies have shown that the aggregation of N-terminal fragments (encoded by HTT exon 1) underlies the disease pathology in mouse models and that the HTT exon 1 gene product can self-assemble into amyloid structures. Here, we provide detailed structural mechanisms for aggregation of several protein fragments encoded by HTT exon 1 by using the associative memory, water-mediated, structure and energy model (AWSEM) to construct their free energy landscapes. We find that the addition of the N-terminal 17-residue sequence (NT17) facilitates polyQ aggregation by encouraging the formation of prefibrillar oligomers, whereas adding the C-terminal polyproline sequence (P10) inhibits aggregation. The combination of both terminal additions in HTT exon 1 fragment leads to a complex aggregation mechanism with a basic core that resembles that found for the aggregation of pure polyQ repeats using AWSEM. At the extrapolated physiological concentration, although the grand canonical free energy profiles are uphill for HTT exon 1 fragments having 20 or 30 glutamines, the aggregation landscape for fragments with 40 repeats has become downhill. This computational prediction agrees with the critical length found for the onset of HD and suggests potential therapies based on blocking early binding events involving the terminal additions to the polyQ repeats. PMID:28400517
Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.

PubMed

Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L

1987-08-01

To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded.
Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.

PubMed Central

Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L

1987-01-01

To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded. Images PMID:2823109
Simulation of two dimensional electrophoresis and tandem mass spectrometry for teaching proteomics.

PubMed

Fisher, Amanda; Sekera, Emily; Payne, Jill; Craig, Paul

2012-01-01

In proteomics, complex mixtures of proteins are separated (usually by chromatography or electrophoresis) and identified by mass spectrometry. We have created 2DE Tandem MS, a computer program designed for use in the biochemistry, proteomics, or bioinformatics classroom. It contains two simulations-2D electrophoresis and tandem mass spectrometry. The two simulations are integrated together and are designed to teach the concept of proteome analysis of prokaryotic and eukaryotic organisms. 2DE-Tandem MS can be used as a freestanding simulation, or in conjunction with a wet lab, to introduce proteomics in the undergraduate classroom. 2DE Tandem MS is a free program available on Sourceforge at https://sourceforge.net/projects/jbf/. It was developed using Java Swing and functions in Mac OSX, Windows, and Linux, ensuring that every student sees a consistent and informative graphical user interface no matter the computer platform they choose. Java must be installed on the host computer to run 2DE Tandem MS. Example classroom exercises are provided in the Supporting Information. Copyright © 2012 Wiley Periodicals, Inc.

Comparison of seven techniques for typing international epidemic strains of Clostridium difficile: restriction endonuclease analysis, pulsed-field gel electrophoresis, PCR-ribotyping, multilocus sequence typing, multilocus variable-number tandem-repeat analysis, amplified fragment length polymorphism, and surface layer protein A gene sequence typing.

PubMed

Killgore, George; Thompson, Angela; Johnson, Stuart; Brazier, Jon; Kuijper, Ed; Pepin, Jacques; Frost, Eric H; Savelkoul, Paul; Nicholson, Brad; van den Berg, Renate J; Kato, Haru; Sambol, Susan P; Zukowski, Walter; Woods, Christopher; Limbago, Brandi; Gerding, Dale N; McDonald, L Clifford

2008-02-01

Using 42 isolates contributed by laboratories in Canada, The Netherlands, the United Kingdom, and the United States, we compared the results of analyses done with seven Clostridium difficile typing techniques: multilocus variable-number tandem-repeat analysis (MLVA), amplified fragment length polymorphism (AFLP), surface layer protein A gene sequence typing (slpAST), PCR-ribotyping, restriction endonuclease analysis (REA), multilocus sequence typing (MLST), and pulsed-field gel electrophoresis (PFGE). We assessed the discriminating ability and typeability of each technique as well as the agreement among techniques in grouping isolates by allele profile A (AP-A) through AP-F, which are defined by toxinotype, the presence of the binary toxin gene, and deletion in the tcdC gene. We found that all isolates were typeable by all techniques and that discrimination index scores for the techniques tested ranged from 0.964 to 0.631 in the following order: MLVA, REA, PFGE, slpAST, PCR-ribotyping, MLST, and AFLP. All the techniques were able to distinguish the current epidemic strain of C. difficile (BI/027/NAP1) from other strains. All of the techniques showed multiple types for AP-A (toxinotype 0, binary toxin negative, and no tcdC gene deletion). REA, slpAST, MLST, and PCR-ribotyping all included AP-B (toxinotype III, binary toxin positive, and an 18-bp deletion in tcdC) in a single group that excluded other APs. PFGE, AFLP, and MLVA grouped two, one, and two different non-AP-B isolates, respectively, with their AP-B isolates. All techniques appear to be capable of detecting outbreak strains, but only REA and MLVA showed sufficient discrimination to distinguish strains from different outbreaks.
Genetic considerations in human sex-mate selection: partners share human leukocyte antigen but not short-tandem-repeat identity markers.

PubMed

Israeli, Moshe; Kristt, Don; Nardi, Yuval; Klein, Tirza

2014-05-01

Previous studies support a role for MHC on mating preference, yet it remains unsettled as to whether mating occurs preferentially between individuals sharing human leukocyte antigen (HLA) determinants or not. Investigating sex-mate preferences in the contemporary Israeli population is of further curiosity being a population with distinct genetic characteristics, where multifaceted cultural considerations influence mate selection. Pairs of male-female sex partners were evaluated in three groups. Two groups represented unmarried (n = 1002) or married (n = 308) couples and a control group of fictitious male-female couples. HLA and short-tandem-repeat (STR) genetic identification markers were assessed for the frequency of shared antigens and alleles. Human leukocyte antigen results showed that Class I and/ or Class II single antigen as well as double antigen sharing was more common in sex partners than in control group couples (P < 0.001). Married versus unmarried pairs were not distinguishable. In contrast, STR-DNA markers failed to differentiate between sex-mates and controls (P = 0.78). Sex partnerships shared HLA determinants more frequently than randomly constituted male-female pairs. The observed phenomenon does not reflect a syngenetic background between sex-mates as STR markers were not selectively shared. Thus, sex-mate selection in man may contravene the evolutionary pressure for genetic diversity in regard to HLA. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Locus-specific mutational events in a multilocus variable-number tandem repeat analysis of Escherichia coli O157:H7.

PubMed

Noller, Anna C; McEllistrem, M Catherine; Shutt, Kathleen A; Harrison, Lee H

2006-02-01

Multilocus variable-number tandem repeat analysis (MLVA) is a validated molecular subtyping method for detecting and evaluating Escherichia coli O157:H7 outbreaks. In a previous study, five outbreaks with a total of 21 isolates were examined by MLVA. Nearly 20% of the epidemiologically linked strains were single-locus variants (SLV) of their respective predominant outbreak clone. This result prompted an investigation into the mutation rates of the seven MLVA loci (TR1 to TR7). With an outbreak strain that was an SLV at the TR1 locus of the predominant clone, parallel and serial batch culture experiments were performed. In a parallel experiment, none (0/384) of the strains analyzed had mutations at the seven MLVA loci. In contrast, in the two 5-day serial experiments, 4.3% (41/960) of the strains analyzed had a significant variation in at least one of these loci (P < 0.001). The TR2 locus accounted for 85.3% (35/41) of the mutations, with an average mutation rate of 3.5 x 10(-3); the mutations rates for TR1 and TR5 were 10-fold lower. Single additions accounted for 77.1% (27/35) of the mutation events in TR2 and all (6/6) of the additions in TR1 and TR5. The remaining four loci had no slippage events detected. The mutation rates were locus specific and may impact the interpretation of MLVA data for epidemiologic investigations.
Genetic variation in a compound short tandem repeat/Alu haplotype system at the SB19.3 locus: properties and interpretation.

PubMed

Gaspar, Paulo; Seixas, Susana; Rocha, Jorge

2004-04-01

The genetic variation at a compound nonrecombining haplotype system, consisting of the previously reported SB19.3 Alu insertion polymorphism and a newly identified adjacent short tandem repeat (STR), was studied in population samples from Portugal and São Tomé (Gulf of Guinea, West Africa). Age estimates based on the linked microsatellite variation suggest that the Alu insertion occurred about 190,000 years ago. In accordance with the global patterns of distribution of human genetic variation, the highest haplotype diversity was found in the African sample. This excess in African diversity was due to both a substantial reduction in heterozygosity at the Alu polymorphism and a lower STR variability associated with the predominant Alu insertion allele in the Portuguese sample. The high level of interpopulation differentiation observed at the Alu locus (F(ST) = 0.43) was interpreted under alternative selective and demographic scenarios. The need for compatibility between patterns of variation at the STR and Alu loci could be used to restrict the range of selection coefficients in selection-driven genetic hitchhiking frameworks and to favor demographic scenarios dominated by larger pre-expansion African population sizes. Taken together, the data show that the SB19.3 Alu-STR system is an informative marker that can be included in more extended batteries of compound haplotypes used in human evolutionary studies.
Identification of exhumed remains of fire tragedy victims using conventional methods and autosomal/Y-chromosomal short tandem repeat DNA profiling.

PubMed

Calacal, Gayvelline C; Delfin, Frederick C; Tan, Michelle Music M; Roewer, Lutz; Magtanong, Danilo L; Lara, Myra C; Fortun, Raquel dR; De Ungria, Maria Corazon A

2005-09-01

In a fire tragedy in Manila in December 1998, one of the worst tragic incidents which resulted in the reported death of 23 children, identity could not be established initially resulting in the burial of still unidentified bodies. Underscoring the importance of identifying each of the human remains, the bodies were exhumed 3 months after the tragedy. We describe here our work, which was the first national case handled by local laboratories wherein conventional and molecular-based techniques were successfully applied in forensic identification. The study reports analysis of DNA obtained from skeletal remains exposed to conditions of burning, burial, and exhumation. DNA typing methods using autosomal and Y-chromosomal short tandem repeat (Y-STR) markers reinforced postmortem examinations using conventional identification techniques. The strategy resulted in the identification of 18 out of the 21 human remains analyzed, overcoming challenges encountered due to the absence of established procedures for the recovery of mass disaster remains. There was incomplete antemortem information to match the postmortem data obtained from the remains of 3 female child victims. Two victims were readily identified due to the availability of antemortem tissues. In the absence of this biologic material, parentage testing was performed using reference blood samples collected from parents and relatives. Data on patrilineal lineage based on common Y-STR haplotypes augmented autosomal DNA typing, particularly in deficiency cases.
Characterization of Spindle Checkpoint Kinase Mps1 Reveals Domain with Functional and Structural Similarities to Tetratricopeptide Repeat Motifs of Bub1 and BubR1 Checkpoint Kinases*

PubMed Central

Lee, Semin; Thebault, Philippe; Freschi, Luca; Beaufils, Sylvie; Blundell, Tom L.; Landry, Christian R.; Bolanos-Garcia, Victor M.; Elowe, Sabine

2012-01-01

Kinetochore targeting of the mitotic kinases Bub1, BubR1, and Mps1 has been implicated in efficient execution of their functions in the spindle checkpoint, the self-monitoring system of the eukaryotic cell cycle that ensures chromosome segregation occurs with high fidelity. In all three kinases, kinetochore docking is mediated by the N-terminal region of the protein. Deletions within this region result in checkpoint failure and chromosome segregation defects. Here, we use an interdisciplinary approach that includes biophysical, biochemical, cell biological, and bioinformatics methods to study the N-terminal region of human Mps1. We report the identification of a tandem repeat of the tetratricopeptide repeat (TPR) motif in the N-terminal kinetochore binding region of Mps1, with close homology to the tandem TPR motif of Bub1 and BubR1. Phylogenetic analysis indicates that TPR Mps1 was acquired after the split between deutorostomes and protostomes, as it is distinguishable in chordates and echinoderms. Overexpression of TPR Mps1 resulted in decreased efficiency of both chromosome alignment and mitotic arrest, likely through displacement of endogenous Mps1 from the kinetochore and decreased Mps1 catalytic activity. Taken together, our multidisciplinary strategy provides new insights into the evolution, structural organization, and function of Mps1 N-terminal region. PMID:22187426
Characterization of spindle checkpoint kinase Mps1 reveals domain with functional and structural similarities to tetratricopeptide repeat motifs of Bub1 and BubR1 checkpoint kinases.

PubMed

Lee, Semin; Thebault, Philippe; Freschi, Luca; Beaufils, Sylvie; Blundell, Tom L; Landry, Christian R; Bolanos-Garcia, Victor M; Elowe, Sabine

2012-02-17

Kinetochore targeting of the mitotic kinases Bub1, BubR1, and Mps1 has been implicated in efficient execution of their functions in the spindle checkpoint, the self-monitoring system of the eukaryotic cell cycle that ensures chromosome segregation occurs with high fidelity. In all three kinases, kinetochore docking is mediated by the N-terminal region of the protein. Deletions within this region result in checkpoint failure and chromosome segregation defects. Here, we use an interdisciplinary approach that includes biophysical, biochemical, cell biological, and bioinformatics methods to study the N-terminal region of human Mps1. We report the identification of a tandem repeat of the tetratricopeptide repeat (TPR) motif in the N-terminal kinetochore binding region of Mps1, with close homology to the tandem TPR motif of Bub1 and BubR1. Phylogenetic analysis indicates that TPR Mps1 was acquired after the split between deutorostomes and protostomes, as it is distinguishable in chordates and echinoderms. Overexpression of TPR Mps1 resulted in decreased efficiency of both chromosome alignment and mitotic arrest, likely through displacement of endogenous Mps1 from the kinetochore and decreased Mps1 catalytic activity. Taken together, our multidisciplinary strategy provides new insights into the evolution, structural organization, and function of Mps1 N-terminal region.
The Candidate Phylum Poribacteria by Single-Cell Genomics: New Insights into Phylogeny, Cell-Compartmentation, Eukaryote-Like Repeat Proteins, and Other Genomic Features

PubMed Central

Kamke, Janine; Rinke, Christian; Schwientek, Patrick; Mavromatis, Kostas; Ivanova, Natalia; Sczyrba, Alexander; Woyke, Tanja; Hentschel, Ute

2014-01-01

The candidate phylum Poribacteria is one of the most dominant and widespread members of the microbial communities residing within marine sponges. Cell compartmentalization had been postulated along with their discovery about a decade ago and their phylogenetic association to the Planctomycetes, Verrucomicrobia, Chlamydiae superphylum was proposed soon thereafter. In the present study we revised these features based on genomic data obtained from six poribacterial single cells. We propose that Poribacteria form a distinct monophyletic phylum contiguous to the PVC superphylum together with other candidate phyla. Our genomic analyses supported the possibility of cell compartmentalization in form of bacterial microcompartments. Further analyses of eukaryote-like protein domains stressed the importance of such proteins with features including tetratricopeptide repeats, leucin rich repeats as well as low density lipoproteins receptor repeats, the latter of which are reported here for the first time from a sponge symbiont. Finally, examining the most abundant protein domain family on poribacterial genomes revealed diverse phyH family proteins, some of which may be related to dissolved organic posphorus uptake. PMID:24498082
The RNase P RNA from cyanobacteria: short tandemly repeated repetitive (STRR) sequences are present within the RNase P RNA gene in heterocyst-forming cyanobacteria.

PubMed Central

Vioque, A

1997-01-01

The RNase P RNA gene (rnpB) from 10 cyanobacteria has been characterized. These new RNAs, together with the previously available ones, provide a comprehensive data set of RNase P RNA from diverse cyanobacterial lineages. All heterocystous cyanobacteria, but none of the non-heterocystous strains analyzed, contain short tandemly repeated repetitive (STRR) sequences that increase the length of helix P12. Site-directed mutagenesis experiments indicate that the STRR sequences are not required for catalytic activity in vitro. STRR sequences seem to have recently and independently invaded the RNase P RNA genes in heterocyst-forming cyanobacteria because closely related strains contain unrelated STRR sequences. Most cyanobacteria RNase P RNAs lack the sequence GGU in the loop connecting helices P15 and P16 that has been established to interact with the 3'-end CCA in precursor tRNA substrates in other bacteria. This character is shared with plastid RNase P RNA. Helix P6 is longer than usual in most cyanobacteria as well as in plastid RNase P RNA. PMID:9254706
Heterogeneous expression pattern of tandem duplicated sHsps genes during fruit ripening in two tomato species

NASA Astrophysics Data System (ADS)

Arce, DP; Krsticevic, FJ; Ezpeleta, J.; Ponce, SD; Pratta, GR; Tapia, E.

2016-04-01

The small heat shock proteins (sHSPs) have been found to play a critical role in physiological stress conditions in protecting proteins from irreversible aggregation. To characterize the gene expression profile of four sHsps with a tandem gene structure arrangement in the domesticated Solanum lycopersicum (Heinz 1706) genome and its wild close relative Solanum pimpinellifolium (LA1589), differential gene expression analysis using RNA-Seq was conducted in three ripening stages in both cultivars fruits. Gene promoter analysis was performed to explain the heterogeneous pattern of gene expression found for these tandem duplicated sHsps. In silico analysis results contribute to refocus wet experiment analysis in tomato sHsp family proteins.
Tandem duplications of a degenerated GTP-binding domain at the origin of GTPase receptors Toc159 and thylakoidal SRP

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hernandez Torres, Jorge; Maldonado, Monica Alexandra Arias; Chomilier, Jacques

2007-12-14

The evolutionary origin of some nuclear encoded proteins that translocate proteins across the chloroplast envelope remains unknown. Therefore, sequences of GTPase proteins constituting the Arabidopsis thaliana translocon at the outer membrane of chloroplast (atToc) complexes were analyzed by means of HCA. In particular, atToc159 and related proteins (atToc132, atToc120, and atToc90) do not have proven homologues of prokaryotic or eukaryotic ancestry. We established that the three domains commonly referred to as A, G, and M originate from the GTPase G domain, tandemly repeated, and probably evolving toward an unstructured conformation in the case of the A domain. It resulted frommore » this study a putative common ancestor for these proteins and a new domain definition, in particular the splitting of A into three domains (A1, A2, and A3), has been proposed. The family of Toc159, previously containing A. thaliana and Pisum sativum, has been extended to Medicago truncatula and Populus trichocarpa and it has been revised for Oryza sativa. They have also been compared to GTPase subunits involved in the cpSRP system. A distant homology has been revealed among Toc and cpSRP GTP-hydrolyzing proteins of A. thaliana, and repetitions of a GTPase domain were also found in cpSRP protein receptors, by means of HCA analysis.« less
High-Pressure NMR and SAXS Reveals How Capping Modulates Folding Cooperativity of the pp32 Leucine-rich Repeat Protein.

PubMed

Zhang, Yi; Berghaus, Melanie; Klein, Sean; Jenkins, Kelly; Zhang, Siwen; McCallum, Scott A; Morgan, Joel E; Winter, Roland; Barrick, Doug; Royer, Catherine A

2018-04-27

Many repeat proteins contain capping motifs, which serve to shield the hydrophobic core from solvent and maintain structural integrity. While the role of capping motifs in enhancing the stability and structural integrity of repeat proteins is well documented, their contribution to folding cooperativity is not. Here we examined the role of capping motifs in defining the folding cooperativity of the leucine-rich repeat protein, pp32, by monitoring the pressure- and urea-induced unfolding of an N-terminal capping motif (N-cap) deletion mutant, pp32-∆N-cap, and a C-terminal capping motif destabilization mutant pp32-Y131F/D146L, using residue-specific NMR and small-angle X-ray scattering. Destabilization of the C-terminal capping motif resulted in higher cooperativity for the unfolding transition compared to wild-type pp32, as these mutations render the stability of the C-terminus similar to that of the rest of the protein. In contrast, deletion of the N-cap led to strong deviation from two-state unfolding. In both urea- and pressure-induced unfolding, residues in repeats 1-3 of pp32-ΔN-cap lost their native structure first, while the C-terminal half was more stable. The residue-specific free energy changes in all regions of pp32-ΔN-cap were larger in urea compared to high pressure, indicating a less cooperative destabilization by pressure. Moreover, in contrast to complete structural disruption of pp32-ΔN-cap at high urea concentration, its pressure unfolded state remained compact. The contrasting effects of the capping motifs on folding cooperativity arise from the differential local stabilities of pp32, whereas the contrasting effects of pressure and urea on the pp32-ΔN-cap variant arise from their distinct mechanisms of action. Copyright © 2018 Elsevier Ltd. All rights reserved.
Repeat region of Brugia malayi sheath protein (Shp-1) carries Dominant B epitopes recognized in filarial endemic population.

PubMed

Jawaharlal, Jeya Prita Parasurama; Madhumathi, Jayaprakasam; Prince, Rajaiah Prabhu; Kaliraj, Perumal

2014-09-01

Transmission of lymphatic filariasis is mediated through microfilariae (L1 stage of the parasite) which is encased in an eggshell called sheath. The sheath protein Shp-1 stabilizes the structure due to the unique repeat region with Met-Pro-Pro-Gln-Gly sequences. Microfilarial proteins could be used as transmission blocking vaccines. Since the repeat region of Shp-1 was predicted to carry putative B epitopes, this region was used to analyze its reactivity with clinical samples towards construction of peptide vaccine. In silico analysis of Shp-1 showed the presence of B epitopes in the region 49-107. The polypeptide epitopic region Shp-149-107 was cloned and expressed in Escherichia coli. Antibody reactivity of the Shp-149-107 construct was evaluated in filarial endemic population by ELISA. Putatively immune endemic normals (EN) showed significantly high reactivity (P < 0.05) when compared to all the other categories. Antibody reactivity of Shp-1 repeat region was similar to that of whole protein proving that this region carries B epitopes responsible for its humoral response in humans. Thus this can be employed for inducing anti-microfilarial immunity in the infected population that may lead to reduction in transmission intensity and also it could be used along with other epitopes from different stages of the parasite in order to manage the disease effectively.
Solution structure of a repeated unit of the ABA-1 nematode polyprotein allergen of Ascaris reveals a novel fold and two discrete lipid-binding sites.

PubMed

Meenan, Nicola A G; Ball, Graeme; Bromek, Krystyna; Uhrín, Dušan; Cooper, Alan; Kennedy, Malcolm W; Smith, Brian O

2011-04-19

Nematode polyprotein allergens (NPAs) are an unusual class of lipid-binding proteins found only in nematodes. They are synthesized as large, tandemly repetitive polyproteins that are post-translationally cleaved into multiple copies of small lipid binding proteins with virtually identical fatty acid and retinol (Vitamin A)-binding characteristics. They are probably central to transport and distribution of small hydrophobic compounds between the tissues of nematodes, and may play key roles in nutrient scavenging, immunomodulation, and IgE antibody-based responses in infection. In some species the repeating units are diverse in amino acid sequence, but, in ascarid and filarial nematodes, many of the units are identical or near-identical. ABA-1A is the most common repeating unit of the NPA of Ascaris suum, and is closely similar to that of Ascaris lumbricoides, the large intestinal roundworm of humans. Immune responses to NPAs have been associated with naturally-acquired resistance to infection in humans, and the immune repertoire to them is under strict genetic control. The solution structure of ABA-1A was determined by protein nuclear magnetic resonance spectroscopy. The protein adopts a novel seven-helical fold comprising a long central helix that participates in two hollow four-helical bundles on either side. Discrete hydrophobic ligand-binding pockets are found in the N-terminal and C-terminal bundles, and the amino acid sidechains affected by ligand (fatty acid) binding were identified. Recombinant ABA-1A contains tightly-bound ligand(s) of bacterial culture origin in one of its binding sites. This is the first mature, post-translationally processed, unit of a naturally-occurring tandemly-repetitive polyprotein to be structurally characterized from any source, and it belongs to a new structural class. NPAs have no counterparts in vertebrates, so represent potential targets for drug or immunological intervention. The nature of the (as yet) unidentified bacterial
A strategy of gene overexpression based on tandem repetitive promoters in Escherichia coli.

PubMed

Li, Mingji; Wang, Junshu; Geng, Yanping; Li, Yikui; Wang, Qian; Liang, Quanfeng; Qi, Qingsheng

2012-02-06

For metabolic engineering, many rate-limiting steps may exist in the pathways of accumulating the target metabolites. Increasing copy number of the desired genes in these pathways is a general method to solve the problem, for example, the employment of the multi-copy plasmid-based expression system. However, this method may bring genetic instability, structural instability and metabolic burden to the host, while integrating of the desired gene into the chromosome may cause inadequate transcription or expression. In this study, we developed a strategy for obtaining gene overexpression by engineering promoter clusters consisted of multiple core-tac-promoters (MCPtacs) in tandem. Through a uniquely designed in vitro assembling process, a series of promoter clusters were constructed. The transcription strength of these promoter clusters showed a stepwise enhancement with the increase of tandem repeats number until it reached the critical value of five. Application of the MCPtacs promoter clusters in polyhydroxybutyrate (PHB) production proved that it was efficient. Integration of the phaCAB genes with the 5CPtacs promoter cluster resulted in an engineered E.coli that can accumulate 23.7% PHB of the cell dry weight in batch cultivation. The transcription strength of the MCPtacs promoter cluster can be greatly improved by increasing the tandem repeats number of the core-tac-promoter. By integrating the desired gene together with the MCPtacs promoter cluster into the chromosome of E. coli, we can achieve high and stale overexpression with only a small size. This strategy has an application potential in many fields and can be extended to other bacteria.
Predicting repeat protein folding kinetics from an experimentally determined folding energy landscape

PubMed Central

Street, Timothy O; Barrick, Doug

2009-01-01

The Notch ankyrin domain is a repeat protein whose folding has been characterized through equilibrium and kinetic measurements. In previous work, equilibrium folding free energies of truncated constructs were used to generate an experimentally determined folding energy landscape (Mello and Barrick, Proc Natl Acad Sci USA 2004;101:14102–14107). Here, this folding energy landscape is used to parameterize a kinetic model in which local transition probabilities between partly folded states are based on energy values from the landscape. The landscape-based model correctly predicts highly diverse experimentally determined folding kinetics of the Notch ankyrin domain and sequence variants. These predictions include monophasic folding and biphasic unfolding, curvature in the unfolding limb of the chevron plot, population of a transient unfolding intermediate, relative folding rates of 19 variants spanning three orders of magnitude, and a change in the folding pathway that results from C-terminal stabilization. These findings indicate that the folding pathway(s) of the Notch ankyrin domain are thermodynamically selected: the primary determinants of kinetic behavior can be simply deduced from the local stability of individual repeats. PMID:19177351
Lateral diffusion of proteins on supported lipid bilayers: additive friction of synaptotagmin 7 C2A-C2B tandem domains.

PubMed

Vasquez, Joseph K; Chantranuvatana, Kan; Giardina, Daniel T; Coffman, Matthew D; Knight, Jefferson D

2014-12-23

The synaptotagmin (Syt) family of proteins contains tandem C2 domains, C2A and C2B, which bind membranes in the presence of Ca(2+) to trigger vesicle fusion during exocytosis. Despite recent progress, the role and extent of interdomain interactions between C2A and C2B in membrane binding remain unclear. To test whether the two domains interact on a planar lipid bilayer (i.e., experience thermodynamic interdomain contacts), diffusion of fluorescent-tagged C2A, C2B, and C2AB domains from human Syt7 was measured using total internal reflection fluorescence microscopy with single-particle tracking. The C2AB tandem exhibits a lateral diffusion constant approximately half the value of the isolated single domains and does not change when additional residues are engineered into the C2A-C2B linker. This is the expected result if C2A and C2B are separated when membrane-bound; theory predicts that C2AB diffusion would be faster if the two domains were close enough together to have interdomain contact. Stopped-flow measurements of membrane dissociation kinetics further support an absence of interdomain interactions, as dissociation kinetics of the C2AB tandem remain unchanged when rigid or flexible linker extensions are included. Together, the results suggest that the two C2 domains of Syt7 bind independently to planar membranes, in contrast to reported interdomain cooperativity in Syt1.
Wound induced Beta vulgaris polygalacturonase-inhibiting protein genes encode a longer leucine-rich repeat domain and inhibit fungal polygalacturonases

USDA-ARS?s Scientific Manuscript database

Polygalacturonase-inhibiting proteins (PGIPs) are leucine-rich repeat (LRR) proteins involved in plant defense. Sugar beet (Beta vulgaris L.) PGIP genes, BvPGIP1, BvPGIP2 and BvPGIP3, were isolated from two breeding lines, F1016 and F1010. Full-length cDNA sequences of the three BvPGIP genes encod...
Disruption of a Rice Pentatricopeptide Repeat Protein Causes a Seedling-Specific Albino Phenotype and Its Utilization to Enhance Seed Purity in Hybrid Rice Production1[W][OA

PubMed Central

Su, Ning; Hu, Mao-Long; Wu, Dian-Xing; Wu, Fu-Qing; Fei, Gui-Lin; Lan, Ying; Chen, Xiu-Ling; Shu, Xiao-Li; Zhang, Xin; Guo, Xiu-Ping; Cheng, Zhi-Jun; Lei, Cai-Lin; Qi, Cun-Kou; Jiang, Ling; Wang, Haiyang; Wan, Jian-Min

2012-01-01

The pentatricopeptide repeat (PPR) gene family represents one of the largest gene families in higher plants. Accumulating data suggest that PPR proteins play a central and broad role in modulating the expression of organellar genes in plants. Here we report a rice (Oryza sativa) mutant named young seedling albino (ysa) derived from the rice thermo/photoperiod-sensitive genic male-sterile line Pei'ai64S, which is a leading male-sterile line for commercial two-line hybrid rice production. The ysa mutant develops albino leaves before the three-leaf stage, but the mutant gradually turns green and recovers to normal green at the six-leaf stage. Further investigation showed that the change in leaf color in ysa mutant is associated with changes in chlorophyll content and chloroplast development. Map-based cloning revealed that YSA encodes a PPR protein with 16 tandem PPR motifs. YSA is highly expressed in young leaves and stems, and its expression level is regulated by light. We showed that the ysa mutation has no apparent negative effects on several important agronomic traits, such as fertility, stigma extrusion rate, selfed seed-setting rate, hybrid seed-setting rate, and yield heterosis under normal growth conditions. We further demonstrated that ysa can be used as an early marker for efficient identification and elimination of false hybrids in commercial hybrid rice production, resulting in yield increases by up to approximately 537 kg ha−1. PMID:22430843
Plasmodium cysteine repeat modular proteins 1-4: complex proteins with roles throughout the malaria parasite life cycle.

PubMed

Thompson, Joanne; Fernandez-Reyes, Delmiro; Sharling, Lisa; Moore, Sally G; Eling, Wijnand M; Kyes, Sue A; Newbold, Christopher I; Kafatos, Fotis C; Janse, Chris J; Waters, Andrew P

2007-06-01

The Cysteine Repeat Modular Proteins (PCRMP1-4) of Plasmodium, are encoded by a small gene family that is conserved in malaria and other Apicomplexan parasites. They are very large, predicted surface proteins with multipass transmembrane domains containing motifs that are conserved within families of cysteine-rich, predicted surface proteins in a range of unicellular eukaryotes, and a unique combination of protein-binding motifs, including a >100 kDa cysteine-rich modular region, an epidermal growth factor-like domain and a Kringle domain. PCRMP1 and 2 are expressed in life cycle stages in both the mosquito and vertebrate. They colocalize with PfEMP1 (P. falciparum Erythrocyte Membrane Antigen-1) during its export from P. falciparum blood-stage parasites and are exposed on the surface of haemolymph- and salivary gland-sporozoites in the mosquito, consistent with a role in host tissue targeting and invasion. Gene disruption of pcrmp1 and 2 in the rodent malaria model, P. berghei, demonstrated that both are essential for transmission of the parasite from the mosquito to the mouse and has established their discrete and important roles in sporozoite targeting to the mosquito salivary gland. The unprecedented expression pattern and structural features of the PCRMPs thus suggest a variety of roles mediating host-parasite interactions throughout the parasite life cycle.

Serine-Aspartate Repeat Protein D Increases Staphylococcus aureus Virulence and Survival in Blood.

PubMed

Askarian, Fatemeh; Uchiyama, Satoshi; Valderrama, J Andrés; Ajayi, Clement; Sollid, Johanna U E; van Sorge, Nina M; Nizet, Victor; van Strijp, Jos A G; Johannessen, Mona

2017-01-01

Staphylococcus aureus expresses a panel of cell wall-anchored adhesins, including proteins belonging to the microbial surface components recognizing adhesive matrix molecule (MSCRAMM) family, exemplified by the serine-aspartate repeat protein D (SdrD), which serve key roles in colonization and infection. Deletion of sdrD from S. aureus subsp. aureus strain NCTC8325-4 attenuated bacterial survival in human whole blood ex vivo, which was associated with increased killing by human neutrophils. Remarkably, SdrD was able to inhibit innate immune-mediated bacterial killing independently of other S. aureus proteins, since addition of recombinant SdrD protein and heterologous expression of SdrD in Lactococcus lactis promoted bacterial survival in human blood. SdrD contributes to bacterial virulence in vivo, since fewer S. aureus subsp. aureus NCTC8325-4 ΔsdrD bacteria than bacteria of the parent strain were recovered from blood and several organs using a murine intravenous infection model. Collectively, our findings reveal a new property of SdrD as an important key contributor to S. aureus survival and the ability to escape the innate immune system in blood. Copyright © 2016 Askarian et al.
Serine-Aspartate Repeat Protein D Increases Staphylococcus aureus Virulence and Survival in Blood

PubMed Central

Uchiyama, Satoshi; Valderrama, J. Andrés; Ajayi, Clement; Sollid, Johanna U. E.; van Sorge, Nina M.; Nizet, Victor; van Strijp, Jos A. G.

2016-01-01

ABSTRACT Staphylococcus aureus expresses a panel of cell wall-anchored adhesins, including proteins belonging to the microbial surface components recognizing adhesive matrix molecule (MSCRAMM) family, exemplified by the serine-aspartate repeat protein D (SdrD), which serve key roles in colonization and infection. Deletion of sdrD from S. aureus subsp. aureus strain NCTC8325-4 attenuated bacterial survival in human whole blood ex vivo, which was associated with increased killing by human neutrophils. Remarkably, SdrD was able to inhibit innate immune-mediated bacterial killing independently of other S. aureus proteins, since addition of recombinant SdrD protein and heterologous expression of SdrD in Lactococcus lactis promoted bacterial survival in human blood. SdrD contributes to bacterial virulence in vivo, since fewer S. aureus subsp. aureus NCTC8325-4 ΔsdrD bacteria than bacteria of the parent strain were recovered from blood and several organs using a murine intravenous infection model. Collectively, our findings reveal a new property of SdrD as an important key contributor to S. aureus survival and the ability to escape the innate immune system in blood. PMID:27795358
SPINDLY, a tetratricopeptide repeat protein involved in gibberellin signal transduction in Arabidopsis.

PubMed Central

Jacobsen, S E; Binkowski, K A; Olszewski, N E

1996-01-01

Gibberellins (GAs) are a major class of plant hormones that control many developmental processes, including seed development and germination, flower and fruit development, and flowering time. Genetic studies with Arabidopsis thaliana have identified two genes involved in GA perception or signal transduction. A semidominant mutation at the GIBBERELLIN INSENSITIVE (GAI) locus results in plants resembling GA-deficient mutants but exhibiting reduced sensitivity to GA. Recessive mutations at the SPINDLY (SPY) locus cause a phenotype that is consistent with constitutive activation of GA signal transduction. Here we show that a strong allele of spy is completely epistatic to gai, indicating that SPY acts downstream of GAI. We have cloned the SPY gene and shown that it encodes a new type of signal transduction protein, which contains a tetratricopeptide repeat region, likely serving as a protein interaction domain, and a novel C-terminal region. Mutations in both domains increase GA signal transduction. The presence of a similar gene in Caenorhabditis elegans suggests that SPY represents a class of signal transduction proteins that is present throughout the eukaryotes. Images Fig. 1 Fig. 2 Fig. 3 PMID:8799194
Sense-encoded poly-GR dipeptide repeat proteins correlate to neurodegeneration and uniquely co-localize with TDP-43 in dendrites of repeat-expanded C9orf72 amyotrophic lateral sclerosis.

PubMed

Saberi, Shahram; Stauffer, Jennifer E; Jiang, Jie; Garcia, Sandra Diaz; Taylor, Amy E; Schulte, Derek; Ohkubo, Takuya; Schloffman, Cheyenne L; Maldonado, Marcus; Baughn, Michael; Rodriguez, Maria J; Pizzo, Don; Cleveland, Don; Ravits, John

2018-03-01

Hexanucleotide repeat expansions in C9orf72 are the most common genetic cause of amyotrophic lateral sclerosis (C9 ALS). The main hypothesized pathogenic mechanisms are C9orf72 haploinsufficiency and/or toxicity from one or more of bi-directionally transcribed repeat RNAs and their dipeptide repeat proteins (DPRs) poly-GP, poly-GA, poly-GR, poly-PR and poly-PA. Recently, nuclear import and/or export defects especially caused by arginine-containing poly-GR or poly-PR have been proposed as significant contributors to pathogenesis based on disease models. We quantitatively studied and compared DPRs, nuclear pore proteins and C9orf72 protein in clinically related and clinically unrelated regions of the central nervous system, and compared them to phosphorylated TDP-43 (pTDP-43), the hallmark protein of ALS. Of the five DPRs, only poly-GR was significantly abundant in clinically related areas compared to unrelated areas (p < 0.001), and formed dendritic-like aggregates in the motor cortex that co-localized with pTDP-43 (p < 0.0001). While most poly-GR dendritic inclusions were pTDP-43 positive, only 4% of pTDP-43 dendritic inclusions were poly-GR positive. Staining for arginine-containing poly-GR and poly-PR in nuclei of neurons produced signals that were not specific to C9 ALS. We could not detect significant differences of nuclear markers RanGap, Lamin B1, and Importin β1 in C9 ALS, although we observed subtle nuclear changes in ALS, both C9 and non-C9, compared to control. The C9orf72 protein itself was diffusely expressed in cytoplasm of large neurons and glia, and nearly 50% reduced, in both clinically related frontal cortex and unrelated occipital cortex, but not in cerebellum. In summary, sense-encoded poly-GR DPR was unique, and localized to dendrites and pTDP43 in motor regions of C9 ALS CNS. This is consistent with new emerging ideas about TDP-43 functions in dendrites.
Sense-encoded poly-GR dipeptide repeat proteins correlate to neurodegeneration and uniquely co-localize with TDP-43 in dendrites of repeat expanded C9orf72 amyotrophic lateral sclerosis

PubMed Central

Saberi, Shahram; Stauffer, Jennifer E.; Jiang, Jie; Garcia, Sandra Diaz; Taylor, Amy E; Schulte, Derek; Ohkubo, Takuya; Schloffman, Cheyenne L.; Maldonado, Marcus; Baughn, Michael; Rodriguez, Maria J; Pizzo, Don; Cleveland, Don; Ravits, John

2018-01-01

Hexanucleotide repeat expansions in C9orf72 are the most common genetic cause of amyotrophic lateral sclerosis (C9 ALS). The main hypothesized pathogenic mechanisms are C9orf72 haploinsufficiency and/or toxicity from one or more of bi-directionally transcribed repeat RNAs and their dipeptide repeat proteins (DPRs) poly-GP, poly-GA, poly-GR, poly-PR and poly-PA. Recently, nuclear import and/or export defects especially caused by arginine-containing poly-GR or poly-PR have been proposed as significant contributors to pathogenesis based on disease models. We quantitatively studied and compared DPRs, nuclear pore proteins and C9orf72 protein in clinically-related and clinically-unrelated regions of the central nervous system, and compared them to phosphorylated TDP-43 (pTDP-43), the hallmark protein of ALS. Of the five DPRs, only poly-GR was significantly abundant in clinically-related areas compared to unrelated areas (p<0.001), and formed dendritic-like aggregates in the motor cortex that co-localized with pTDP-43 (p<0.0001). While most poly-GR dendritic inclusions were pTDP-43-positive, only 4% of pTDP-43 dendritic inclusions were poly-GR-positive. Staining for arginine-containing poly-GR and poly-PR in nuclei of neurons produced signals that were not specific to C9 ALS. We could not detect significant differences of nuclear markers RanGap, Lamin B1, and Importin β1 in C9 ALS, although we observed subtle nuclear changes in ALS, both C9 and non-C9, compared to control. The C9orf72 protein itself was diffusely expressed in cytoplasm of large neurons and glia, and nearly 50% reduced, in both clinically-related frontal cortex and unrelated occipital cortex, but not in cerebellum. In summary, sense-encoded poly-GR DPR was unique, and localized to neurites and pTDP43 in motor regions of C9 ALS CNS. This is consistent with new emerging ideas about TDP-43 functions in dendrites. PMID:29196813
Covalently Linked Tandem Lesions in DNA

PubMed Central

Patrzyc, Helen B.; Dawidzik, Jean B.; Budzinski, Edwin E.; Freund, Harold G.; Wilton, John H.; Box, Harold C.

2013-01-01

Reactive oxygen species (ROS) generate a type of DNA damage called tandem lesions, two adjacent nucleotides both modified. A subcategory of tandem lesions consists of adjacent nucleotides linked by a covalent bond. Covalently linked tandem lesions generate highly characteristic liquid chromotography-tandem mass spectrometry (LC-MS/MS) elution profiles. We have used this property to comprehensively survey X-irradiated DNA for covalently linked tandem lesions. A total of 15 tandem lesions were detected in DNA irradiated in deoxygenated aqueous solution, five tandem lesions were detected in DNA that was irradiated in oxygenated solution. PMID:23106212
CRISPRcompar: a website to compare clustered regularly interspaced short palindromic repeats.

PubMed

Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine

2008-07-01

Clustered regularly interspaced short palindromic repeat (CRISPR) elements are a particular family of tandem repeats present in prokaryotic genomes, in almost all archaea and in about half of bacteria, and which participate in a mechanism of acquired resistance against phages. They consist in a succession of direct repeats (DR) of 24-47 bp separated by similar sized unique sequences (spacers). In the large majority of cases, the direct repeats are highly conserved, while the number and nature of the spacers are often quite diverse, even among strains of a same species. Furthermore, the acquisition of new units (DR + spacer) was shown to happen almost exclusively on one side of the locus. Therefore, the CRISPR presents an interesting genetic marker for comparative and evolutionary analysis of closely related bacterial strains. CRISPRcompar is a web service created to assist biologists in the CRISPR typing process. Two tools facilitates the in silico investigation: CRISPRcomparison and CRISPRtionary. This website is freely accessible at http://crispr.u-psud.fr/CRISPRcompar/.
Pathogenic Leptospira species express surface-exposed proteins belonging to the bacterial immunoglobulin superfamily

PubMed Central

Matsunaga, James; Barocchi, Michele A.; Croda, Julio; Young, Tracy A.; Sanchez, Yolanda; Siqueira, Isadora; Bolin, Carole A.; Reis, Mitermayer G.; Riley, Lee W.; Haake, David A.; Ko, Albert I.

2005-01-01

Summary Proteins with bacterial immunoglobulin-like (Big) domains, such as the Yersinia pseudotuberculosis invasin and Escherichia coli intimin, are surface-expressed proteins that mediate host mammalian cell invasion or attachment. Here, we report the identification and characterization of a new family of Big domain proteins, referred to as Lig (leptospiral Ig-like) proteins, in pathogenic Leptospira. Screening of L. interrogans and L. kirschneri expression libraries with sera from leptospirosis patients identified 13 lambda phage clones that encode tandem repeats of the 90 amino acid Big domain. Two lig genes, designated ligA and ligB, and one pseudo-gene, ligC, were identified. The ligA and ligB genes encode amino-terminal lipoprotein signal peptides followed by 10 or 11 Big domain repeats and, in the case of ligB, a unique carboxy-terminal non-repeat domain. The organization of ligC is similar to that of ligB but contains mutations that disrupt the reading frame. The lig sequences are present in pathogenic but not saprophytic Leptospira species. LigA and LigB are expressed by a variety of virulent leptospiral strains. Loss of Lig protein and RNA transcript expression is correlated with the observed loss of virulence during culture attenuation of pathogenic strains. High-pressure freeze substitution followed by immunocytochemical electron microscopy confirmed that the Lig proteins were localized to the bacterial surface. Immunoblot studies with patient sera found that the Lig proteins are a major antigen recognized during the acute host infection. These observations demonstrate that the Lig proteins are a newly identified surface protein of pathogenic Leptospira, which by analogy to other bacterial immunoglobulin superfamily virulence factors, may play a role in host cell attachment and invasion during leptospiral pathogenesis. PMID:12890019
Pathogenic Leptospira species express surface-exposed proteins belonging to the bacterial immunoglobulin superfamily.

PubMed

Matsunaga, James; Barocchi, Michele A; Croda, Julio; Young, Tracy A; Sanchez, Yolanda; Siqueira, Isadora; Bolin, Carole A; Reis, Mitermayer G; Riley, Lee W; Haake, David A; Ko, Albert I

2003-08-01

Proteins with bacterial immunoglobulin-like (Big) domains, such as the Yersinia pseudotuberculosis invasin and Escherichia coli intimin, are surface-expressed proteins that mediate host mammalian cell invasion or attachment. Here, we report the identification and characterization of a new family of Big domain proteins, referred to as Lig (leptospiral Ig-like) proteins, in pathogenic Leptospira. Screening of L. interrogans and L. kirschneri expression libraries with sera from leptospirosis patients identified 13 lambda phage clones that encode tandem repeats of the 90 amino acid Big domain. Two lig genes, designated ligA and ligB, and one pseudogene, ligC, were identified. The ligA and ligB genes encode amino-terminal lipoprotein signal peptides followed by 10 or 11 Big domain repeats and, in the case of ligB, a unique carboxy-terminal non-repeat domain. The organization of ligC is similar to that of ligB but contains mutations that disrupt the reading frame. The lig sequences are present in pathogenic but not saprophytic Leptospira species. LigA and LigB are expressed by a variety of virulent leptospiral strains. Loss of Lig protein and RNA transcript expression is correlated with the observed loss of virulence during culture attenuation of pathogenic strains. High-pressure freeze substitution followed by immunocytochemical electron microscopy confirmed that the Lig proteins were localized to the bacterial surface. Immunoblot studies with patient sera found that the Lig proteins are a major antigen recognized during the acute host infection. These observations demonstrate that the Lig proteins are a newly identified surface protein of pathogenic Leptospira, which by analogy to other bacterial immunoglobulin superfamily virulence factors, may play a role in host cell attachment and invasion during leptospiral pathogenesis.
Rapid and high resolution genotyping of all Escherichia coli serotypes using 10 genomic repeat-containing loci.

PubMed

Løbersli, Inger; Haugum, Kjersti; Lindstedt, Bjørn-Arne

2012-01-01

Our laboratory has previously published two multiple-locus variable-number tandem-repeats analysis (MLVA) methods for rapid genotyping of Escherichia coli (E. coli), which are now in routine use for surveillance and outbreak detection. The first assay developed was specific for E. coli O157:H7; however this assay was not suitable for genotyping other E. coli serotypes. A new generic MLVA-assay was then developed with the capability of genotyping all E. coli serotypes. This generic E. coli MLVA (GECM7) was based on polymorphism in seven variable number of tandem repeats (VNTR) loci. GECM7 worked well with the majority of E. coli serotypes; however we wanted to increase the resolution for this method based in part of comparison with PFGE typing of E. coli O26:H11, where PFGE appeared to display higher resolution. The GECM7 method was improved by adding three new repeat-loci to a total of ten (GECM10), and a considerable increase in resolution was observed (from 296 to 507 genotypes on the same set of strains). Copyright © 2011 Elsevier B.V. All rights reserved.
Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures.

PubMed

Hayes, Michael L; Giang, Karolyn; Mulligan, R Michael

2012-05-14

Pentatricopeptide repeat (PPR) proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82) and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3' UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative selection even in the absence of an editing site target
[Usefulness of the variable numbers of tandem repeats (VNTR) analysis for complex infections of Mycobacterium avium and Mycobacterium intracellulare].

PubMed

Tsunematsu, Noriko; Goto, Mieko; Saiki, Yumiko; Baba, Michiko; Udagawa, Tadashi; Kazumi, Yuko

2008-09-01

The bacilli which were isolated from a patient suspected of the mixed infections with Mycobacterium avium and Mycobacterium intracellulare, were analyzed. The genotypes of M. avium in the sedimented fractions of treated sputum and in some colonies isolated from Ogawa medium were compared by the Variable Numbers of Tandem Repeats (VNTR). A woman, aged 57. Mycobacterial species isolated from some colonies by culture in 2004 and 2006 and from the treated sputum in 2006, were determined by DNA sequencing analysis of the 16S rRNA gene. Also, by using VNTR, the genotype of mycobacteria was analyzed. [Results] (1) The colony isolated from Ogawa medium in 2004 was monoclonal M. avium. (2) By VNTR analyses of specimens in 2006, multiple acid-fast bacteria were found in the sputum sediment and in isolated bacteria from Ogawa medium. (3) By analyses of 16S rRNA DNA sequence, M. avium and M. intracellulare were found in the colonies isolated from the sputum sediment and the Ogawa medium in 2006. (4) The same VNTR patterns were obtained in M. avium in 2004 and 2006 when single colony was analyzed. (5) From the showerhead and culvert of the bathroom in the patient's house, M. avium was not detected. By VNTR analyses, it was considered that the mixed infections of M. avium and M. intracellulare had been generated during treatment in this case. Therefore, in the case of suspected complex infection, VNTR analysis would be a useful genotyping method in M. avium complex infection.
Characterization and evolutionary analysis of tributyltin-binding protein and pufferfish saxitoxin and tetrodotoxin-binding protein genes in toxic and nontoxic pufferfishes.

PubMed

Hashiguchi, Y; Lee, J M; Shiraishi, M; Komatsu, S; Miki, S; Shimasaki, Y; Mochioka, N; Kusakabe, T; Oshima, Y

2015-05-01

Understanding the evolutionary mechanisms of toxin accumulation in pufferfishes has been long-standing problem in toxicology and evolutionary biology. Pufferfish saxitoxin and tetrodotoxin-binding protein (PSTBP) is involved in the transport and accumulation of tetrodotoxin and is one of the most intriguing proteins related to the toxicity of pufferfishes. PSTBPs are fusion proteins consisting of two tandem repeated tributyltin-binding protein type 2 (TBT-bp2) domains. In this study, we examined the evolutionary dynamics of TBT-bp2 and PSTBP genes to understand the evolution of toxin accumulation in pufferfishes. Database searches and/or PCR-based cDNA cloning in nine pufferfish species (6 toxic and 3 nontoxic) revealed that all species possessed one or more TBT-bp2 genes, but PSTBP genes were found only in 5 toxic species belonging to genus Takifugu. These toxic Takifugu species possessed two or three copies of PSTBP genes. Phylogenetic analysis of TBT-bp2 and PSTBP genes suggested that PSTBPs evolved in the common ancestor of Takifugu species by repeated duplications and fusions of TBT-bp2 genes. In addition, a detailed comparison of Takifugu TBT-bp2 and PSTBP gene sequences detected a signature of positive selection under the pressure of gene conversion. The complicated evolutionary dynamics of TBT-bp2 and PSTBP genes may reflect the diversity of toxicity in pufferfishes. © 2015 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2015 European Society For Evolutionary Biology.
DXYS156: a multi-purpose short tandem repeat locus for determination of sex, paternal and maternal geographic origins and DNA fingerprinting.

PubMed

Calì, Francesco; Forster, P; Kersting, Christian; Mirisola, Mario G; D'Anna, Rosalba; De Leo, Giacomo; Romano, Valentino

2002-06-01

In forensic science and in legal medicine Y chromosomal typing is indispensable for sex determination, for paternity testing in the absence of the father and for distinguishing males in multiple rape cases. Another potential application is the estimation of paternal geographic origin or family name from a crime stain to narrow down the range of suspects and thus reduce costs of mass screenings. However, Y typing alone cannot provide a sufficiently resolved DNA fingerprint as required for court convictions. Thus, there is a dilemma whether or not to sacrifice valuable material for the sake of extensive Y chromosomal investigations when stain DNA is limited (typically allowing only few PCR amplifications). We here describe a Y-chromosome-specific nucleotide insertion in the duplicate short tandem repeat (STR) locus DXYS156 which allows us to distinguish males from females as does the commonly used amelogenin system, but with the advantage that this locus is multi-allelic, thus substantially contributing towards DNA fingerprinting of a sample and furthermore enabling the detection of sample contamination. Yet another bonus is that both the X and the Y copies of DXYS156 have alleles specific to different parts of the world, offering separate estimates of maternal and paternal descent of that sample. We therefore recommend the inclusion of DXYS156 in standard multiplexing kits for forensic, archaeological and genealogical applications.
soc-2 encodes a leucine-rich repeat protein implicated in fibroblast growth factor receptor signaling

PubMed Central

Selfors, Laura M.; Schutzman, Jennifer L.; Borland, Christina Z.; Stern, Michael J.

1998-01-01

Activation of fibroblast growth factor (FGF) receptors elicits diverse cellular responses including growth, mitogenesis, migration, and differentiation. The intracellular signaling pathways that mediate these important processes are not well understood. In Caenorhabditis elegans, suppressors of clr-1 identify genes, termed soc genes, that potentially mediate or activate signaling through the EGL-15 FGF receptor. We demonstrate that three soc genes, soc-1, soc-2, and sem-5, suppress the activity of an activated form of the EGL-15 FGF receptor, consistent with the soc genes functioning downstream of EGL-15. We show that soc-2 encodes a protein composed almost entirely of leucine-rich repeats, a domain implicated in protein–protein interactions. We identified a putative human homolog, SHOC-2, which is 54% identical to SOC-2. We find that shoc-2 maps to 10q25, shoc-2 mRNA is expressed in all tissues assayed, and SHOC-2 protein is cytoplasmically localized. Within the leucine-rich repeats of both SOC-2 and SHOC-2 are two YXNX motifs that are potential tyrosine-phosphorylated docking sites for the SEM-5/GRB2 Src homology 2 domain. However, phosphorylation of these residues is not required for SOC-2 function in vivo, and SHOC-2 is not observed to be tyrosine phosphorylated in response to FGF stimulation. We conclude that this genetic system has allowed for the identification of a conserved gene implicated in mediating FGF receptor signaling in C. elegans. PMID:9618511
The hydrophobic repeated domain of the Clostridium cellulovorans cellulose-binding protein (CbpA) has specific interactions with endoglucanases.

PubMed Central

Takagi, M; Hashida, S; Goldstein, M A; Doi, R H

1993-01-01

We overexpressed one of the hydrophobic repeated domains (HBDs) (110 amino acid residues) of the cellulose-binding protein (CbpA) from Clostridium cellulovorans by making a hybrid protein with the Escherichia coli maltose-binding protein (MalE). The HBD was purified to homogeneity, and interactions between the HBD and endoglucanases were analyzed by a novel interaction Western blotting (immunoblotting) method. The HBD had specific interactions with endoglucanases (EngB and EngD) from C. cellulovorans. These results indicated that the HBD was an endoglucanase binding site of CbpA. Images PMID:8226657
A Novel Peptide Derived from the Fusion Protein Heptad Repeat Inhibits Replication of Subacute Sclerosing Panencephalitis Virus In Vitro and In Vivo.

PubMed

Watanabe, Masahiro; Hashimoto, Koichi; Abe, Yusaku; Kodama, Eiichi N; Nabika, Ryota; Oishi, Shinya; Ohara, Shinichiro; Sato, Masatoki; Kawasaki, Yukihiko; Fujii, Nobutaka; Hosoya, Mitsuaki

2016-01-01

Subacute sclerosing panencephalitis (SSPE) is a persistent, progressive, and fatal degenerative disease resulting from persistent measles virus (MV) infection of the central nervous system. Most drugs used to treat SSPE have been reported to have limited effects. Therefore, novel therapeutic strategies are urgently required. The SSPE virus, a variant MV strain, differs virologically from wild-type MV strain. One characteristic of the SSPE virus is its defective production of cell-free virus, which leaves cell-to-cell infection as the major mechanism of viral dissemination. The fusion protein plays an essential role in this cell-to-cell spread. It contains two critical heptad repeat regions that form a six-helix bundle in the trimer similar to most viral fusion proteins. In the case of human immunodeficiency virus type-1 (HIV-1), a synthetic peptide derived from the heptad repeat region of the fusion protein enfuvirtide inhibits viral replication and is clinically approved as an anti-HIV-1 agent. The heptad repeat regions of HIV-1 are structurally and functionally similar to those of the MV fusion protein. We therefore designed novel peptides derived from the fusion protein heptad repeat region of the MV and examined their effects on the measles and SSPE virus replication in vitro and in vivo. Some of these synthetic novel peptides demonstrated high antiviral activity against both the measles (Edmonston strain) and SSPE (Yamagata-1 strain) viruses at nanomolar concentrations with no cytotoxicity in vitro. In particular, intracranial administration of one of the synthetic peptides increased the survival rate from 0% to 67% in an SSPE virus-infected nude mouse model.
A Novel Peptide Derived from the Fusion Protein Heptad Repeat Inhibits Replication of Subacute Sclerosing Panencephalitis Virus In Vitro and In Vivo

PubMed Central

Watanabe, Masahiro; Hashimoto, Koichi; Abe, Yusaku; Kodama, Eiichi N.; Nabika, Ryota; Oishi, Shinya; Ohara, Shinichiro; Sato, Masatoki; Kawasaki, Yukihiko; Fujii, Nobutaka; Hosoya, Mitsuaki

2016-01-01

Subacute sclerosing panencephalitis (SSPE) is a persistent, progressive, and fatal degenerative disease resulting from persistent measles virus (MV) infection of the central nervous system. Most drugs used to treat SSPE have been reported to have limited effects. Therefore, novel therapeutic strategies are urgently required. The SSPE virus, a variant MV strain, differs virologically from wild-type MV strain. One characteristic of the SSPE virus is its defective production of cell-free virus, which leaves cell-to-cell infection as the major mechanism of viral dissemination. The fusion protein plays an essential role in this cell-to-cell spread. It contains two critical heptad repeat regions that form a six-helix bundle in the trimer similar to most viral fusion proteins. In the case of human immunodeficiency virus type-1 (HIV-1), a synthetic peptide derived from the heptad repeat region of the fusion protein enfuvirtide inhibits viral replication and is clinically approved as an anti-HIV-1 agent. The heptad repeat regions of HIV-1 are structurally and functionally similar to those of the MV fusion protein. We therefore designed novel peptides derived from the fusion protein heptad repeat region of the MV and examined their effects on the measles and SSPE virus replication in vitro and in vivo. Some of these synthetic novel peptides demonstrated high antiviral activity against both the measles (Edmonston strain) and SSPE (Yamagata-1 strain) viruses at nanomolar concentrations with no cytotoxicity in vitro. In particular, intracranial administration of one of the synthetic peptides increased the survival rate from 0% to 67% in an SSPE virus-infected nude mouse model. PMID:27612283
Genetic polymorphism of the 26 short tandem repeat loci in the Chinese Hebei Han population using two commercial forensic kits.

PubMed

Lei, Liang; Xu, Jie; Du, Qingqing; Fu, Lihong; Zhang, Xiaojing; Yu, Feng; Ma, Chunling; Cong, Bin; Li, Shujin

2015-01-01

We determined the allele frequencies and forensic parameters for the 26 short tandem repeat (STR) autosomal markers in two commercial kits (the Investigator HDplex and AmpFLSTR(®) Identifiler(®) systems) for 183 unrelated individuals from the Han population of the Hebei Province of China. The 26 STRs were all in Hardy-Weinberg equilibrium. No linkage disequilibrium was detected between any pair of loci. The combined power of discrimination and the combined power of exclusion for the 26 STR loci were 1-7.74E-31 and 1-1.21E-11, respectively. Six rare alleles of D10S2325 were identified and named 20, 21, 22, 23, 24, and 31. All the length of the six rare alleles were out of the range of allelic ladder. We calculated the population pairwise genetic distance based on the allele frequencies, using published population data including German, central Polish, south Dutch, northeastern Polish, south Brazilian, Korean, Sichuan Han of China, and Shanghai Han of China. Also we examined the population pairwise genetic distance of loci included in Identifiler system between Hebei Han and other ethnic population of China. These 26 autosomal STR loci could provide highly informative polymorphic data for paternity testing and forensic identification in the Hebei Han population in China. Because they are all in linkage equilibrium, they could be used together to solve deficient kinship cases or cases with mutations.
Infrared fluorescent automated detection of thirteen short tandem repeat polymorphisms and one gender-determining system of the CODIS core system.

PubMed

Ricci, U; Sani, I; Guarducci, S; Biondi, C; Pelagatti, S; Lazzerini, V; Brusaferri, A; Lapini, M; Andreucci, E; Giunti, L; Giovannucci Uzielli, M L

2000-11-01

We used an infrared (IR) automated fluorescence monolaser sequencer for the analysis of 13 autosomal short tandem repeat (STR) systems (TPOX, D3S1358, FGA, CSF1PO, D5S818, D7S820, D8S1179, TH01, vWA, D13S317, D16S359, D18S51, D21S11) and the X-Y homologous gene amelogenin system. These two systems represent the core of the combined DNA index systems (CODIS). Four independent multiplex reactions, based on the polymerase chain reaction (PCR) technique and on the direct labeling of the forward primer of every primer pair, with a new molecule (IRDye800), were set up, permitting the exact characterization of the alleles by comparison with ladders of specific sequenced alleles. This is the first report of the whole analysis of the STRs of the CODIS core using an IR automated DNA sequencer. The protocol was used to solve paternity/maternity tests and for population studies. The electrophoretic system also proved useful for the correct typing of those loci differing in size by only 2 bp. A sensibility study demonstrated that the test can detect an average of 10 pg of undegraded human DNA. We also performed a preliminary study analyzing some forensic samples and mixed stains, which suggested the usefulness of using this analytical system for human identification as well as for forensic purposes.

Comparative analysis of the folding dynamics and kinetics of an engineered knotted protein and its variants derived from HP0242 of Helicobacter pylori

NASA Astrophysics Data System (ADS)

Wang, Liang-Wei; Liu, Yu-Nan; Lyu, Ping-Chiang; Jackson, Sophie E.; Hsu, Shang-Te Danny

2015-09-01

Understanding the mechanism by which a polypeptide chain thread itself spontaneously to attain a knotted conformation has been a major challenge in the field of protein folding. HP0242 is a homodimeric protein from Helicobacter pylori with intertwined helices to form a unique pseudo-knotted folding topology. A tandem HP0242 repeat has been constructed to become the first engineered trefoil-knotted protein. Its small size renders it a model system for computational analyses to examine its folding and knotting pathways. Here we report a multi-parametric study on the folding stability and kinetics of a library of HP0242 variants, including the trefoil-knotted tandem HP0242 repeat, using far-UV circular dichroism and fluorescence spectroscopy. Equilibrium chemical denaturation of HP0242 variants shows the presence of highly populated dimeric and structurally heterogeneous folding intermediates. Such equilibrium folding intermediates retain significant amount of helical structures except those at the N- and C-terminal regions in the native structure. Stopped-flow fluorescence measurements of HP0242 variants show that spontaneous refolding into knotted structures can be achieved within seconds, which is several orders of magnitude faster than previously observed for other knotted proteins. Nevertheless, the complex chevron plots indicate that HP0242 variants are prone to misfold into kinetic traps, leading to severely rolled-over refolding arms. The experimental observations are in general agreement with the previously reported molecular dynamics simulations. Based on our results, kinetic folding pathways are proposed to qualitatively describe the complex folding processes of HP0242 variants.
A new family of dispersed repeats from Brassica nigra: characterization and localization.

PubMed

Kapila, R; Negi, M S; This, P; Delseny, M; Srivastava, P S; Lakshmikumaran, M

1996-11-01

The 459-bp HindIII (pBN-4) and the 1732-bp Eco RI (pBNE8) fragments from the Brassica nigra genome were cloned and shown to be members of a dispersed repeat family. Of the three major diploid Brassica species, the repeat pBN-4 was found to be highly specific for the B. nigra genome. The family also hybridized to Sinapis arvensis showing that B. nigra had a closer relationship with the S. arvensis genome than with B. oleracea or B. campestris. The clone pBNE8 showed homology to a number of tRNA species indicating that this family of repeats may have originated from a tRNA sequence. The species-specific 459-bp repeat pBN-4 was localized on the B. nigra chromosomes using monosomic addition lines. In addition to the localization of pBN-4, the chromosomal distribution of two other species-specific repeats, pBN34 and pBNBH35 (reported earlier), was studied. The dispersed repeats pBN-4 and pBNBH35 were found to be present on all of the chromosomes, whereas the tandem repeat pBN34 was localized on two chromosomes.
Large-scale modelling of the divergent spectrin repeats in nesprins: giant modular proteins.

PubMed

Autore, Flavia; Pfuhl, Mark; Quan, Xueping; Williams, Aisling; Roberts, Roland G; Shanahan, Catherine M; Fraternali, Franca

2013-01-01

Nesprin-1 and nesprin-2 are nuclear envelope (NE) proteins characterized by a common structure of an SR (spectrin repeat) rod domain and a C-terminal transmembrane KASH [Klarsicht-ANC-Syne-homology] domain and display N-terminal actin-binding CH (calponin homology) domains. Mutations in these proteins have been described in Emery-Dreifuss muscular dystrophy and attributed to disruptions of interactions at the NE with nesprins binding partners, lamin A/C and emerin. Evolutionary analysis of the rod domains of the nesprins has shown that they are almost entirely composed of unbroken SR-like structures. We present a bioinformatical approach to accurate definition of the boundaries of each SR by comparison with canonical SR structures, allowing for a large-scale homology modelling of the 74 nesprin-1 and 56 nesprin-2 SRs. The exposed and evolutionary conserved residues identify important pbs for protein-protein interactions that can guide tailored binding experiments. Most importantly, the bioinformatics analyses and the 3D models have been central to the design of selected constructs for protein expression. 1D NMR and CD spectra have been performed of the expressed SRs, showing a folded, stable, high content α-helical structure, typical of SRs. Molecular Dynamics simulations have been performed to study the structural and elastic properties of consecutive SRs, revealing insights in the mechanical properties adopted by these modules in the cell.
Crystallization of a pentapeptide-repeat protein by reductive cyclic pentylation of free amines with glutaraldehyde

DOE Office of Scientific and Technical Information (OSTI.GOV)

Vetting, Matthew W., E-mail: vetting@aecom.yu.edu; Hegde, Subray S.; Blanchard, John S.

2009-05-01

A method to modify proteins with glutaraldehyde under reducing conditions is presented. Treatment with glutaraldehyde and dimethylaminoborane was found to result in cyclic pentylation of free amines and facilitated the structural determination of a protein previously recalcitrant to the formation of diffraction quality crystals. The pentapeptide-repeat protein EfsQnr from Enterococcus faecalis protects DNA gyrase from inhibition by fluoroquinolones. EfsQnr was cloned and purified to homogeneity, but failed to produce diffraction-quality crystals in initial crystallization screens. Treatment of EfsQnr with glutaraldehyde and the strong reducing agent borane–dimethylamine resulted in a derivatized protein which produced crystals that diffracted to 1.6 Å resolution;more » their structure was subsequently determined by single-wavelength anomalous dispersion. Analysis of the derivatized protein using Fourier transform ion cyclotron resonance mass spectrometry indicated a mass increase of 68 Da per free amino group. Electron-density maps about a limited number of structurally ordered lysines indicated that the modification was a cyclic pentylation of free amines, producing piperidine groups.« less
A Method for WD40 Repeat Detection and Secondary Structure Prediction

PubMed Central

Wang, Yang; Jiang, Fan; Zhuo, Zhu; Wu, Xian-Hui; Wu, Yun-Dong

2013-01-01

WD40-repeat proteins (WD40s), as one of the largest protein families in eukaryotes, play vital roles in assembling protein-protein/DNA/RNA complexes. WD40s fold into similar β-propeller structures despite diversified sequences. A program WDSP (WD40 repeat protein Structure Predictor) has been developed to accurately identify WD40 repeats and predict their secondary structures. The method is designed specifically for WD40 proteins by incorporating both local residue information and non-local family-specific structural features. It overcomes the problem of highly diversified protein sequences and variable loops. In addition, WDSP achieves a better prediction in identifying multiple WD40-domain proteins by taking the global combination of repeats into consideration. In secondary structure prediction, the average Q3 accuracy of WDSP in jack-knife test reaches 93.7%. A disease related protein LRRK2 was used as a representive example to demonstrate the structure prediction. PMID:23776530
Translation of dipeptide repeat proteins from the C9ORF72 expanded repeat is associated with cellular stress.

PubMed

Sonobe, Yoshifumi; Ghadge, Ghanashyam; Masaki, Katsuhisa; Sendoel, Ataman; Fuchs, Elaine; Roos, Raymond P

2018-08-01

Expansion of a hexanucleotide repeat (HRE), GGGGCC, in the C9ORF72 gene is recognized as the most common cause of familial amyotrophic lateral sclerosis (FALS), frontotemporal dementia (FTD) and ALS-FTD, as well as 5-10% of sporadic ALS. Despite the location of the HRE in the non-coding region (with respect to the main C9ORF72 gene product), dipeptide repeat proteins (DPRs) that are thought to be toxic are translated from the HRE in all three reading frames from both the sense and antisense transcript. Here, we identified a CUG that has a good Kozak consensus sequence as the translation initiation codon. Mutation of this CTG significantly suppressed polyglycine-alanine (GA) translation. GA was translated when the G 4 C 2 construct was placed as the second cistron in a bicistronic construct. CRISPR/Cas9-induced knockout of a non-canonical translation initiation factor, eIF2A, impaired GA translation. Transfection of G 4 C 2 constructs induced an integrated stress response (ISR), while triggering the ISR led to a continuation of translation of GA with a decline in conventional cap-dependent translation. These in vitro observations were confirmed in chick embryo neural cells. The findings suggest that DPRs translated from an HRE in C9ORF72 aggregate and lead to an ISR that then leads to continuing DPR production and aggregation, thereby creating a continuing pathogenic cycle. Copyright © 2018 Elsevier Inc. All rights reserved.
Tandem-multimeric F3-gelonin fusion toxins for enhanced anti-cancer activity for prostate cancer treatment.

PubMed

Shin, Meong Cheol; Min, Kyoung Ah; Cheong, Heesun; Moon, Cheol; Huang, Yongzhuo; He, Huining; Yang, Victor C

2017-05-30

Despite significant progress in prostate cancer treatment, yet, it remains the leading diagnosed cancer and is responsible for high incidence of cancer related deaths in the U.S. Because of the insufficient efficacy of small molecule anti-cancer drugs, significant interest has been drawn to more potent macromolecular agents such as gelonin, a plant-derived ribosome inactivating protein (RIP) that efficiently inhibits protein translation. However, in spite of the great potency to kill tumor cells, gelonin lacks ability to internalize tumor cells and furthermore, cannot distinguish between tumor and normal cells. To address this challenge, we genetically engineered gelonin fusion proteins with varied numbers of F3 peptide possessing homing ability to various cancer cells and angiogenic blood vessels. The E. coli produced F3-gelonin fusion proteins possessed equipotent activity to inhibit protein translation in cell-free protein translation systems to unmodified gelonin; however, they displayed higher cell uptake that led to significantly augmented cytotoxicity. Compared with gelonin fusion with one F3 peptide (F3-Gel), tandem-multimeric F3-gelonins showed even greater cell internalization and tumor cell killing ability. Moreover, when tested against LNCaP s.c. xenograft tumor bearing mice, more significant tumor growth inhibition was observed from the mice treated with tandem-multimeric F3-gelonins. Overall, this research demonstrated the potential of utilizing tandem multimeric F3-modified gelonin as highly effective anticancer agents to overcome the limitations of current chemotherapeutic drugs. Copyright © 2017. Published by Elsevier B.V.
Tandem truncated rotavirus VP8* subunit protein with T cell epitope as non-replicating parenteral vaccine is highly immunogenic.

PubMed

Wen, Xiaobo; Cao, Dianjun; Jones, Ronald W; Hoshino, Yasutaka; Yuan, Lijuan

2015-01-01

The two currently available live oral rotavirus vaccines, Rotarix(®) and RotaTeq(®), are highly efficacious in the developed countries. However, the efficacy of such vaccines in resource deprived countries in Africa and Southeast Asia is low. We reported previously that a bacterially-expressed rotavirus P2-P[8] ΔVP8* subunit vaccine candidate administered intramuscularly elicited high-titers of neutralizing antibodies in guinea pigs and mice and significantly shortened the duration of diarrhea in neonatal gnotobiotic pigs upon oral challenge with virulent human rotavirus Wa strain. To further improve its vaccine potential and provide wider coverage against rotavirus strains of global and regional epidemiologic importance, we constructed 2 tandem recombinant VP8* proteins, P2-P[8] ΔVP8*-P[8] ΔVP8* and P2-P[8] ΔVP8*-P[6] ΔVP8* based on Escherichia coli expression system. The two resulting recombinant tandem proteins were highly soluble and P2-P[8] ΔVP8*-P[8] ΔVP8* was generated with high yield. Moreover, guinea pigs immunized intramuscularly by 3 doses of the P2-P[8] ΔVP8*-P[8] ΔVP8* or P2-P[8] ΔVP8*-P[6] ΔVP8* vaccine with aluminum phosphate adjuvant developed high titers of homotypic and heterotypic neutralizing antibodies against human rotaviruses bearing G1-G4, G8, G9 and G12 with P[8], P[4] or P[6] combination. The results suggest that these 2 subunit vaccines in monovalent or bivalent formulation can provide antigenic coverage to almost all the rotavirus G (VP7) types and major P (VP4) types of global as well as regional epidemiologic importance.
Whole genome sequencing of Salmonella Typhimurium illuminates distinct outbreaks caused by an endemic multi-locus variable number tandem repeat analysis type in Australia, 2014.

PubMed

Phillips, Anastasia; Sotomayor, Cristina; Wang, Qinning; Holmes, Nadine; Furlong, Catriona; Ward, Kate; Howard, Peter; Octavia, Sophie; Lan, Ruiting; Sintchenko, Vitali

2016-09-15

Salmonella Typhimurium (STM) is an important cause of foodborne outbreaks worldwide. Subtyping of STM remains critical to outbreak investigation, yet current techniques (e.g. multilocus variable number tandem repeat analysis, MLVA) may provide insufficient discrimination. Whole genome sequencing (WGS) offers potentially greater discriminatory power to support infectious disease surveillance. We performed WGS on 62 STM isolates of a single, endemic MLVA type associated with two epidemiologically independent, food-borne outbreaks along with sporadic cases in New South Wales, Australia, during 2014. Genomes of case and environmental isolates were sequenced using HiSeq (Illumina) and the genetic distance between them was assessed by single nucleotide polymorphism (SNP) analysis. SNP analysis was compared to the epidemiological context. The WGS analysis supported epidemiological evidence and genomes of within-outbreak isolates were nearly identical. Sporadic cases differed from outbreak cases by a small number of SNPs, although their close relationship to outbreak cases may represent an unidentified common food source that may warrant further public health follow up. Previously unrecognised mini-clusters were detected. WGS of STM can discriminate foodborne community outbreaks within a single endemic MLVA clone. Our findings support the translation of WGS into public health laboratory surveillance of salmonellosis.
Chicken Interferon-induced Protein with Tetratricopeptide Repeats 5 Antagonizes Replication of RNA Viruses.

PubMed

Santhakumar, Diwakar; Rohaim, Mohammed Abdel Mohsen Shahaat; Hussein, Hussein A; Hawes, Pippa; Ferreira, Helena Lage; Behboudi, Shahriar; Iqbal, Munir; Nair, Venugopal; Arns, Clarice W; Munir, Muhammad

2018-05-01

The intracellular actions of interferon (IFN)-regulated proteins, including IFN-induced proteins with tetratricopeptide repeats (IFITs), attribute a major component of the protective antiviral host defense. Here we applied genomics approaches to annotate the chicken IFIT locus and currently identified a single IFIT (chIFIT5) gene. The profound transcriptional level of this effector of innate immunity was mapped within its unique cis-acting elements. This highly virus- and IFN-responsive chIFIT5 protein interacted with negative sense viral RNA structures that carried a triphosphate group on its 5' terminus (ppp-RNA). This interaction reduced the replication of RNA viruses in lentivirus-mediated IFIT5-stable chicken fibroblasts whereas CRISPR/Cas9-edited chIFIT5 gene knockout fibroblasts supported the replication of RNA viruses. Finally, we generated mosaic transgenic chicken embryos stably expressing chIFIT5 protein or knocked-down for endogenous chIFIT5 gene. Replication kinetics of RNA viruses in these transgenic chicken embryos demonstrated the antiviral potential of chIFIT5 in ovo. Taken together, these findings propose that IFIT5 specifically antagonize RNA viruses by sequestering viral nucleic acids in chickens, which are unique in innate immune sensing and responses to viruses of both poultry and human health significance.
High-temperature protein G is essential for activity of the Escherichia coli clustered regularly interspaced short palindromic repeats (CRISPR)/Cas system.

PubMed

Yosef, Ido; Goren, Moran G; Kiro, Ruth; Edgar, Rotem; Qimron, Udi

2011-12-13

Prokaryotic DNA arrays arranged as clustered regularly interspaced short palindromic repeats (CRISPR), along with their associated proteins, provide prokaryotes with adaptive immunity by RNA-mediated targeting of alien DNA or RNA matching the sequences between the repeats. Here, we present a thorough screening system for the identification of bacterial proteins participating in immunity conferred by the Escherichia coli CRISPR system. We describe the identification of one such protein, high-temperature protein G (HtpG), a homolog of the eukaryotic chaperone heat-shock protein 90. We demonstrate that in the absence of htpG, the E. coli CRISPR system loses its suicidal activity against λ prophage and its ability to provide immunity from lysogenization. Transcomplementation of htpG restores CRISPR activity. We further show that inactivity of the CRISPR system attributable to htpG deficiency can be suppressed by expression of Cas3, a protein that is essential for its activity. Accordingly, we also find that the steady-state level of overexpressed Cas3 is significantly enhanced following HtpG expression. We conclude that HtpG is a newly identified positive modulator of the CRISPR system that is essential for maintaining functional levels of Cas3.
High-temperature protein G is essential for activity of the Escherichia coli clustered regularly interspaced short palindromic repeats (CRISPR)/Cas system

PubMed Central

Yosef, Ido; Goren, Moran G.; Kiro, Ruth; Edgar, Rotem; Qimron, Udi

2011-01-01

Prokaryotic DNA arrays arranged as clustered regularly interspaced short palindromic repeats (CRISPR), along with their associated proteins, provide prokaryotes with adaptive immunity by RNA-mediated targeting of alien DNA or RNA matching the sequences between the repeats. Here, we present a thorough screening system for the identification of bacterial proteins participating in immunity conferred by the Escherichia coli CRISPR system. We describe the identification of one such protein, high-temperature protein G (HtpG), a homolog of the eukaryotic chaperone heat-shock protein 90. We demonstrate that in the absence of htpG, the E. coli CRISPR system loses its suicidal activity against λ prophage and its ability to provide immunity from lysogenization. Transcomplementation of htpG restores CRISPR activity. We further show that inactivity of the CRISPR system attributable to htpG deficiency can be suppressed by expression of Cas3, a protein that is essential for its activity. Accordingly, we also find that the steady-state level of overexpressed Cas3 is significantly enhanced following HtpG expression. We conclude that HtpG is a newly identified positive modulator of the CRISPR system that is essential for maintaining functional levels of Cas3. PMID:22114197
Immunoreactive Coxiella burnetii Nine Mile proteins separated by 2D electrophoresis and identified by tandem mass spectrometry

PubMed Central

Deringer, James R.; Chen, Chen; Samuel, James E.; Brown, Wendy C.

2011-01-01

Coxiella burnetii is a Gram-negative obligate intracellular pathogen and the causative agent of Q fever in humans. Q fever causes acute flu-like symptoms and may develop into a chronic disease leading to endocarditis. Its potential as a bioweapon has led to its classification as a category B select agent. An effective inactivated whole-cell vaccine (WCV) currently exists but causes severe granulomatous/necrotizing reactions in individuals with prior exposure, and is not licensed for use in most countries. Current efforts to reduce or eliminate the deleterious reactions associated with WCVs have focused on identifying potential subunit vaccine candidates. Both humoral and T cell-mediated responses are required for protection in animal models. In this study, nine novel immunogenic C. burnetii proteins were identified in extracted whole-cell lysates using 2D electrophoresis, immunoblotting with immune guinea pig sera, and tandem MS. The immunogenic C. burnetii proteins elicited antigen-specific IgG in guinea pigs vaccinated with whole-cell killed Nine Mile phase I vaccine, suggesting a T cell-dependent response. Eleven additional proteins previously shown to react with immune human sera were also antigenic in guinea pigs, showing the relevance of the guinea pig immunization model for antigen discovery. The antigens described here warrant further investigation to validate their potential use as subunit vaccine candidates. PMID:21030434
Quantification of Major Royal Jelly Protein 1 in Fresh Royal Jelly by Ultraperformance Liquid Chromatography-Tandem Mass Spectrometry.

PubMed

Lin, Na; Chen, Si; Zhang, Hong; Li, Junmin; Fu, Linglin

2018-02-07

Major royal jelly protein 1 (MRJP1) is the most abundant protein in royal jelly (RJ), and the level of MRJP1 has been suggested as a promising parameter for standardization and evaluation of RJ authenticity in quality. Here, a quantitative method was developed for the quantification of MRJP1 in RJ based on a signature peptide and a stable isotope-labeled internal standard peptide FFDYDFGSDER*(R*, 13 C 6 , 15 N 4 ) by ultraperformance liquid chromatography-tandem mass spectrometry. Recoveries of the established method ranged from 85.33 to 95.80%, and both the intra- and interday precision were RSD < 4.97%. Quantification results showed that content of MRJP1 in fresh RJ was 41.96-55.01 mg/g. Abnormal levels of MRJP1 were found in three commercial RJs and implied that these samples were of low quality and might be adulterated. Results of the present work suggested that the developed method could be successfully applied to quantify MRJP1 in RJ and also could evaluate the quality of RJ.
Requirement of the Cytosolic Interaction between PATHOGENESIS-RELATED PROTEIN10 and LEUCINE-RICH REPEAT PROTEIN1 for Cell Death and Defense Signaling in Pepper[W

PubMed Central

Choi, Du Seok; Hwang, In Sun; Hwang, Byung Kook

2012-01-01

Plants recruit innate immune receptors such as leucine-rich repeat (LRR) proteins to recognize pathogen attack and activate defense genes. Here, we identified the pepper (Capsicum annuum) pathogenesis-related protein10 (PR10) as a leucine-rich repeat protein1 (LRR1)–interacting partner. Bimolecular fluorescence complementation and coimmunoprecipitation assays confirmed the specific interaction between LRR1 and PR10 in planta. Avirulent Xanthomonas campestris pv vesicatoria infection induces PR10 expression associated with the hypersensitive cell death response. Transient expression of PR10 triggers hypersensitive cell death in pepper and Nicotiana benthamiana leaves, which is amplified by LRR1 coexpression as a positive regulator. LRR1 promotes the ribonuclease activity and phosphorylation of PR10, leading to enhanced cell death signaling. The LRR1-PR10 complex is formed in the cytoplasm, resulting in its secretion into the apoplastic space. Engineered nuclear confinement of both proteins revealed that the cytoplasmic localization of the PR10-LRR1 complex is essential for cell death–mediated defense signaling. PR10/LRR1 silencing in pepper compromises resistance to avirulent X. campestris pv vesicatoria infection. By contrast, PR10/LRR1 overexpression in Arabidopsis thaliana confers enhanced resistance to Pseudomonas syringae pv tomato and Hyaloperonospora arabidopsidis. Together, these results suggest that the cytosolic LRR-PR10 complex is responsible for cell death–mediated defense signaling. PMID:22492811
Exaggerated phosphorylation of brain tau protein in CRH KO mice exposed to repeated immobilization stress.

PubMed

Kvetnansky, Richard; Novak, Petr; Vargovic, Peter; Lejavova, Katarina; Horvathova, Lubica; Ondicova, Katarina; Manz, George; Filipcik, Peter; Novak, Michal; Mravec, Boris

2016-07-01

Neuroendocrine and behavioral stress responses are orchestrated by corticotropin-releasing hormone (CRH) and norepinephrine (NE) synthesizing neurons. Recent findings indicate that stress may promote development of neurofibrillary pathology in Alzheimer's disease. Therefore, we investigated relationships among stress, tau protein phosphorylation, and brain NE using wild-type (WT) and CRH-knockout (CRH KO) mice. We assessed expression of phosphorylated tau (p-tau) at the PHF-1 epitope and NE concentrations in the locus coeruleus (LC), A1/C1 and A2/C2 catecholaminergic cell groups, hippocampus, amygdala, nucleus basalis magnocellularis, and frontal cortex of unstressed, singly stressed or repeatedly stressed mice. Moreover, gene expression and protein levels of tyrosine hydroxylase (TH) and CRH receptor mRNA were determined in the LC. Plasma corticosterone levels were also measured. Exposure to a single stress increases tau phosphorylation throughout the brain in WT mice when compared to singly stressed CRH KO animals. In contrast, repeatedly stressed CRH KO mice showed exaggerated tau phosphorylation relative to WT controls. We also observed differences in extent of tau phosphorylation between investigated structures, e.g. the LC and hippocampus. Moreover, CRH deficiency leads to different responses to stress in gene expression of TH, NE concentrations, CRH receptor mRNA, and plasma corticosterone levels. Our data indicate that CRH effects on tau phosphorylation are dependent on whether stress is single or repeated, and differs between brain regions. Our findings indicate that CRH attenuates mechanisms responsible for development of stress-induced tau neuropathology, particularly in conditions of chronic stress. However, the involvement of central catecholaminergic neurons in these mechanisms remains unclear and is in need of further investigation.
Leucine-rich-repeat-containing variable lymphocyte receptors as modules to target plant-expressed proteins

DOE PAGES

Velásquez, André C.; Nomura, Kinya; Cooper, Max D.; ...

2017-04-19

The ability to target and manipulate protein-based cellular processes would accelerate plant research; yet, the technology to specifically and selectively target plant-expressed proteins is still in its infancy. Leucine-rich repeats (LRRs) are ubiquitously present protein domains involved in mediating protein–protein interactions. LRRs confer the binding specificity to the highly diverse variable lymphocyte receptor (VLR) antibodies (including VLRA, VLRB and VLRC types) that jawless vertebrates make as the functional equivalents of jawed vertebrate immunoglobulin-based antibodies. Here, VLRBs targeting an effector protein from a plant pathogen, HopM1, were developed by immunizing lampreys and using yeast surface display to select for high-affinity VLRBs.more » HopM1-specific VLRBs (VLRM1) were expressed in planta in the cytosol, the trans-Golgi network, and the apoplast. Expression of VLRM1 was higher when the protein localized to an oxidizing environment that would favor disulfide bridge formation (when VLRM1 was not localized to the cytoplasm), as disulfide bonds are necessary for proper VLR folding. VLRM1 specifically interacted in planta with HopM1 but not with an unrelated bacterial effector protein while HopM1 failed to interact with a non-specific VLRB. Later, VLRs may be used as flexible modules to bind proteins or carbohydrates of interest in planta, with broad possibilities for their use by binding directly to their targets and inhibiting their action, or by creating chimeric proteins with new specificities in which endogenous LRR domains are replaced by those present in VLRs.« less
Leucine-rich-repeat-containing variable lymphocyte receptors as modules to target plant-expressed proteins

DOE Office of Scientific and Technical Information (OSTI.GOV)

Velásquez, André C.; Nomura, Kinya; Cooper, Max D.

The ability to target and manipulate protein-based cellular processes would accelerate plant research; yet, the technology to specifically and selectively target plant-expressed proteins is still in its infancy. Leucine-rich repeats (LRRs) are ubiquitously present protein domains involved in mediating protein–protein interactions. LRRs confer the binding specificity to the highly diverse variable lymphocyte receptor (VLR) antibodies (including VLRA, VLRB and VLRC types) that jawless vertebrates make as the functional equivalents of jawed vertebrate immunoglobulin-based antibodies. Here, VLRBs targeting an effector protein from a plant pathogen, HopM1, were developed by immunizing lampreys and using yeast surface display to select for high-affinity VLRBs.more » HopM1-specific VLRBs (VLRM1) were expressed in planta in the cytosol, the trans-Golgi network, and the apoplast. Expression of VLRM1 was higher when the protein localized to an oxidizing environment that would favor disulfide bridge formation (when VLRM1 was not localized to the cytoplasm), as disulfide bonds are necessary for proper VLR folding. VLRM1 specifically interacted in planta with HopM1 but not with an unrelated bacterial effector protein while HopM1 failed to interact with a non-specific VLRB. Later, VLRs may be used as flexible modules to bind proteins or carbohydrates of interest in planta, with broad possibilities for their use by binding directly to their targets and inhibiting their action, or by creating chimeric proteins with new specificities in which endogenous LRR domains are replaced by those present in VLRs.« less
Preprocessing Significantly Improves the Peptide/Protein Identification Sensitivity of High-resolution Isobarically Labeled Tandem Mass Spectrometry Data*

PubMed Central

Sheng, Quanhu; Li, Rongxia; Dai, Jie; Li, Qingrun; Su, Zhiduan; Guo, Yan; Li, Chen; Shyr, Yu; Zeng, Rong

2015-01-01

Isobaric labeling techniques coupled with high-resolution mass spectrometry have been widely employed in proteomic workflows requiring relative quantification. For each high-resolution tandem mass spectrum (MS/MS), isobaric labeling techniques can be used not only to quantify the peptide from different samples by reporter ions, but also to identify the peptide it is derived from. Because the ions related to isobaric labeling may act as noise in database searching, the MS/MS spectrum should be preprocessed before peptide or protein identification. In this article, we demonstrate that there are a lot of high-frequency, high-abundance isobaric related ions in the MS/MS spectrum, and removing isobaric related ions combined with deisotoping and deconvolution in MS/MS preprocessing procedures significantly improves the peptide/protein identification sensitivity. The user-friendly software package TurboRaw2MGF (v2.0) has been implemented for converting raw TIC data files to mascot generic format files and can be downloaded for free from https://github.com/shengqh/RCPA.Tools/releases as part of the software suite ProteomicsTools. The data have been deposited to the ProteomeXchange with identifier PXD000994. PMID:25435543
The La-related protein 1-specific domain repurposes HEAT-like repeats to directly bind a 5'TOP sequence.

PubMed

Lahr, Roni M; Mack, Seshat M; Héroux, Annie; Blagden, Sarah P; Bousquet-Antonelli, Cécile; Deragon, Jean-Marc; Berman, Andrea J

2015-09-18

La-related protein 1 (LARP1) regulates the stability of many mRNAs. These include 5'TOPs, mTOR-kinase responsive mRNAs with pyrimidine-rich 5' UTRs, which encode ribosomal proteins and translation factors. We determined that the highly conserved LARP1-specific C-terminal DM15 region of human LARP1 directly binds a 5'TOP sequence. The crystal structure of this DM15 region refined to 1.86 Å resolution has three structurally related and evolutionarily conserved helix-turn-helix modules within each monomer. These motifs resemble HEAT repeats, ubiquitous helical protein-binding structures, but their sequences are inconsistent with consensus sequences of known HEAT modules, suggesting this structure has been repurposed for RNA interactions. A putative mTORC1-recognition sequence sits within a flexible loop C-terminal to these repeats. We also present modelling of pyrimidine-rich single-stranded RNA onto the highly conserved surface of the DM15 region. These studies lay the foundation necessary for proceeding toward a structural mechanism by which LARP1 links mTOR signalling to ribosome biogenesis. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.