Han, Runlin; Zwiefka, Antoni; Caswell, Clayton C; Xu, Yi; Keene, Douglas R; Lukomska, Ewa; Zhao, Zhihong; Höök, Magnus; Lukomski, Slawomir
2006-08-01
Collagen triple helix, composed of the repeating Gly-Xaa-Yaa (GXY) sequence, is a structural element found in all multicellular animals and also in some prokaryotes. Long GXY polymers are highly regarded components used in food, cosmetic, biomedical, and pharmaceutical industries. In this study, we explore a new concept for the production of recombinant GXY polymers which are based on the sequence of "prokaryotic collagens", the streptococcal collagen-like proteins Scl1 and Scl2. Analysis of 50 Scl variants identified the amino acid distribution and GXY-repeat usage that are involved in the stabilization of the triple helix in Scls. Using circular dichroism spectroscopy and electron microscopy, we show that significantly different recombinant rScl polypeptides form stable, unhydroxylated homotrimeric triple helices that can be produced both intra- and extracellularly in the Escherichia coli. These rScl constructs containing 20 to 129 GXY repeats had mid-point melting temperatures between 32 and 39 degrees C. Altogether, Scl-derived collagens, which are different from the mammalian collagens, can form stable triple helices under physiological conditions and can be used for the production of recombinant GXY polymers with a wide variety of potential applications.
Bacterial collagen-like proteins that form triple-helical structures
Yu, Zhuoxin; An, Bo; Ramshaw, John A.M.; Brodsky, Barbara
2014-01-01
A large number of collagen-like proteins have been identified in bacteria during the past ten years, principally from analysis of genome databases. These bacterial collagens share the distinctive Gly-Xaa-Yaa repeating amino acid sequence of animal collagens which underlies their unique triple-helical structure. A number of the bacterial collagens have been expressed in E. coli, and they all adopt a triple-helix conformation. Unlike animal collagens, these bacterial proteins do not contain the post-translationally modified amino acid, hydroxyproline, which is known to stabilize the triple-helix structure and may promote self-assembly. Despite the absence of collagen hydroxylation, the triple-helix structures of the bacterial collagens studied exhibit a high thermal stability of 35–39 °C, close to that seen for mammalian collagens. These bacterial collagens are readily produced in large quantities by recombinant methods, either in the original amino acid sequence or in genetically manipulated sequences. This new family of recombinant, easy to modify collagens could provide a novel system for investigating structural and functional motifs in animal collagens and could also form the basis of new biomedical materials with designed structural properties and functions. PMID:24434612
McElroy, Kerensa; Mouton, Laurence; Du Pasquier, Louis; Qi, Weihong; Ebert, Dieter
2011-09-01
Collagen-like proteins containing glycine-X-Y repeats have been identified in several pathogenic bacteria potentially involved in virulence. Recently, a collagen-like surface protein, Pcl1a, was identified in Pasteuria ramosa, a spore-forming parasite of Daphnia. Here we characterise 37 novel putative P. ramosa collagen-like protein genes (PCLs). PCR amplification and sequencing across 10 P. ramosa strains showed they were polymorphic, distinguishing genotypes matching known differences in Daphnia/P. ramosa interaction specificity. Thirty PCLs could be divided into four groups based on sequence similarity, conserved N- and C-terminal regions and G-X-Y repeat structure. Group 1, Group 2 and Group 3 PCLs formed triplets within the genome, with one member from each group represented in each triplet. Maximum-likelihood trees suggested that these groups arose through multiple instances of triplet duplication. For Group 1, 2, 3 and 4 PCLs, X was typically proline and Y typically threonine, consistent with other bacterial collagen-like proteins. The amino acid composition of Pcl2 closely resembled Pcl1a, with X typically being glutamic acid or aspartic acid and Y typically being lysine or glutamine. Pcl2 also showed sequence similarity to Pcl1a and contained a predicted signal peptide, cleavage site and transmembrane domain, suggesting that it is a surface protein. Copyright © 2011 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Non-linearity of the collagen triple helix in solution and implications for collagen function.
Walker, Kenneth T; Nan, Ruodan; Wright, David W; Gor, Jayesh; Bishop, Anthony C; Makhatadze, George I; Brodsky, Barbara; Perkins, Stephen J
2017-06-16
Collagen adopts a characteristic supercoiled triple helical conformation which requires a repeating (Xaa-Yaa-Gly) n sequence. Despite the abundance of collagen, a combined experimental and atomistic modelling approach has not so far quantitated the degree of flexibility seen experimentally in the solution structures of collagen triple helices. To address this question, we report an experimental study on the flexibility of varying lengths of collagen triple helical peptides, composed of six, eight, ten and twelve repeats of the most stable Pro-Hyp-Gly (POG) units. In addition, one unblocked peptide, (POG) 10unblocked , was compared with the blocked (POG) 10 as a control for the significance of end effects. Complementary analytical ultracentrifugation and synchrotron small angle X-ray scattering data showed that the conformations of the longer triple helical peptides were not well explained by a linear structure derived from crystallography. To interpret these data, molecular dynamics simulations were used to generate 50 000 physically realistic collagen structures for each of the helices. These structures were fitted against their respective scattering data to reveal the best fitting structures from this large ensemble of possible helix structures. This curve fitting confirmed a small degree of non-linearity to exist in these best fit triple helices, with the degree of bending approximated as 4-17° from linearity. Our results open the way for further studies of other collagen triple helices with different sequences and stabilities in order to clarify the role of molecular rigidity and flexibility in collagen extracellular and immune function and disease. © 2017 The Author(s).
Vandersmissen, Liesbeth; De Buck, Emmy; Saels, Veerle; Coil, David A; Anné, Jozef
2010-05-01
Legionella pneumophila is a Gram-negative, facultative intracellular pathogen and the causative agent of Legionnaires' disease, a severe pneumonia in humans. Analysis of the Legionella sequenced genomes revealed a gene with a variable number of tandem repeats (VNTRs), whose number varies between strains. We examined the strain distribution of this gene among a collection of 108 clinical, environmental and hot spring serotype I strains. Twelve variants were identified, but no correlation was observed between the number of repeat units and clinical and environmental strains. The encoded protein contains the C-terminal consensus motif of outer membrane proteins and has a large region of collagen-like repeats that is encoded by the VNTR region. We have therefore annotated this protein Lcl for Legionella collagen-like protein. Lcl was shown to contribute to the adherence and invasion of host cells and it was demonstrated that the number of repeat units present in lcl had an influence on these adhesion characteristics.
The influence of specific binding of collagen-silk chimeras to silk biomaterials on hMSC behavior
An, Bo; DesRochers, Teresa M.; Qin, Guokui; Xia, Xiaoxia; Thiagarajan, Geetha; Brodsky, Barbara; Kaplan, David
2012-01-01
Collagen-like proteins in the bacteria Streptococcus pyogenes adopt a triple-helix structure with a thermal stability similar to that of animal collagens, can be expressed in high yield in E. coli and can be easily modified through molecular biology techniques. However, potential applications for such recombinant collagens are limited by their lack of higher order structure to achieve the physical properties needed for most biomaterials. To overcome this problem, the S. pyrogenes collagen domain was fused to a repetitive Bombyx mori silk consensus sequence, as a strategy to direct specific non-covalent binding onto solid silk materials whose superior stability, mechanical and material properties have been previously established. This approach resulted in the successful binding of these new collagen-silk chimeric proteins to silk films and porous scaffolds, and the binding affinity could be controlled by varying the number of repeats in the silk sequence. To explore the potential of collagen-silk chimera for regulating biological activity, integrin (Int) and fibronectin (Fn) binding sequences from mammalian collagens were introduced into the bacterial collagen domain. The attachment of bioactive collagen-silk chimeras to solid silk biomaterials promoted hMSC spreading and proliferation substantially in comparison to the controls. The ability to combine the biomaterial features of silk with the biological activities of collagen allowed more rapid cell interactions with silk-based biomaterials, improved regulation of stem cell growth and differentiation, as well as the formation of artificial extracellular matrices useful for tissue engineering applications. PMID:23088839
Crystal structure of the second fibronectin type III (FN3) domain from human collagen α1 type XX.
Zhao, Jingfeng; Ren, Jixia; Wang, Nan; Cheng, Zhong; Yang, Runmei; Lin, Gen; Guo, Yi; Cai, Dayong; Xie, Yong; Zhao, Xiaohong
2017-12-01
Collagen α1 type XX, which contains fibronectin type III (FN3) repeats involving six FN3 domains (referred to as the FN#1-FN#6 domains), is an unusual member of the fibril-associated collagens with interrupted triple helices (FACIT) subfamily of collagens. The results of standard protein BLAST suggest that the FN3 repeats might contribute to collagen α1 type XX acting as a cytokine receptor. To date, solution NMR structures of the FN#3, FN#4 and FN#6 domains have been determined. To obtain further structural evidence to understand the relationship between the structure and function of the FN3 repeats from collagen α1 type XX, the crystal structure of the FN#2 domain from human collagen α1 type XX (residues Pro386-Pro466; referred to as FN2-HCXX) was solved at 2.5 Å resolution. The crystal structure of FN2-HCXX shows an immunoglobulin-like fold containing a β-sandwich structure, which is formed by a three-stranded β-sheet (β1, β2 and β5) packed onto a four-stranded β-sheet (β3, β4, β6 and β7). Two consensus domains, tencon and fibcon, are structural analogues of FN2-HCXX. Fn8, an FN3 domain from human oncofoetal fibronectin, is the closest structural analogue of FN2-HCXX derived from a naturally occurring sequence. Based solely on the structural similarity of FN2-HCXX to other FN3 domains, the detailed functions of FN2-HCXX and the FN3 repeats in collagen α1 type XX cannot be identified.
The influence of specific binding of collagen-silk chimeras to silk biomaterials on hMSC behavior.
An, Bo; DesRochers, Teresa M; Qin, Guokui; Xia, Xiaoxia; Thiagarajan, Geetha; Brodsky, Barbara; Kaplan, David L
2013-01-01
Collagen-like proteins in the bacteria Streptococcus pyogenes adopt a triple-helix structure with a thermal stability similar to that of animal collagens, can be expressed in high yield in Escherichia coli and can be easily modified through molecular biology techniques. However, potential applications for such recombinant collagens are limited by their lack of higher order structure to achieve the physical properties needed for most biomaterials. To overcome this problem, the S. pyogenes collagen domain was fused to a repetitive Bombyx mori silk consensus sequence, as a strategy to direct specific non-covalent binding onto solid silk materials whose superior stability, mechanical and material properties have been previously established. This approach resulted in the successful binding of these new collagen-silk chimeric proteins to silk films and porous scaffolds, and the binding affinity could be controlled by varying the number of repeats in the silk sequence. To explore the potential of collagen-silk chimera for regulating biological activity, integrin (Int) and fibronectin (Fn) binding sequences from mammalian collagens were introduced into the bacterial collagen domain. The attachment of bioactive collagen-silk chimeras to solid silk biomaterials promoted hMSC spreading and proliferation substantially in comparison to the controls. The ability to combine the biomaterial features of silk with the biological activities of collagen allowed more rapid cell interactions with silk-based biomaterials, improved regulation of stem cell growth and differentiation, as well as the formation of artificial extracellular matrices useful for tissue engineering applications. Copyright © 2012 Elsevier Ltd. All rights reserved.
Schroeter, Elena R; DeHart, Caroline J; Cleland, Timothy P; Zheng, Wenxia; Thomas, Paul M; Kelleher, Neil L; Bern, Marshall; Schweitzer, Mary H
2017-02-03
Sequence data from biomolecules such as DNA and proteins, which provide critical information for evolutionary studies, have been assumed to be forever outside the reach of dinosaur paleontology. Proteins, which are predicted to have greater longevity than DNA, have been recovered from two nonavian dinosaurs, but these results remain controversial. For proteomic data derived from extinct Mesozoic organisms to reach their greatest potential for investigating questions of phylogeny and paleobiology, it must be shown that peptide sequences can be reliably and reproducibly obtained from fossils and that fragmentary sequences for ancient proteins can be increasingly expanded. To test the hypothesis that peptides can be repeatedly detected and validated from fossil tissues many millions of years old, we applied updated extraction methodology, high-resolution mass spectrometry, and bioinformatics analyses on a Brachylophosaurus canadensis specimen (MOR 2598) from which collagen I peptides were recovered in 2009. We recovered eight peptide sequences of collagen I: two identical to peptides recovered in 2009 and six new peptides. Phylogenetic analyses place the recovered sequences within basal archosauria. When only the new sequences are considered, B. canadensis is grouped more closely to crocodylians, but when all sequences (current and those reported in 2009) are analyzed, B. canadensis is placed more closely to basal birds. The data robustly support the hypothesis of an endogenous origin for these peptides, confirm the idea that peptides can survive in specimens tens of millions of years old, and bolster the validity of the 2009 study. Furthermore, the new data expand the coverage of B. canadensis collagen I (a 33.6% increase in collagen I alpha 1 and 116.7% in alpha 2). Finally, this study demonstrates the importance of reexamining previously studied specimens with updated methods and instrumentation, as we obtained roughly the same amount of sequence data as the previous study with substantially less sample material. Data are available via ProteomeXchange with identifier PXD005087.
Identification of a polymorphic collagen-like protein in the crustacean bacteria Pasteuria ramosa.
Mouton, Laurence; Traunecker, Emmanuel; McElroy, Kerensa; Du Pasquier, Louis; Ebert, Dieter
2009-12-01
Pasteuria ramosa is a spore-forming bacterium that infects Daphnia species. Previous results demonstrated a high specificity of host clone/parasite genotype interactions. Surface proteins of bacteria often play an important role in attachment to host cells prior to infection. We analyzed surface proteins of P. ramosa spores by two-dimensional gel electrophoresis. For the first time, we prove that two isolates selected for their differences in infectivity reveal few but clear-cut differences in protein patterns. Using internal sequencing and LC/MS/MS, we identified a collagen-like protein named Pcl1a (Pasteuria collagen-like protein 1a). This protein, reconstructed with the help of Pasteuria genome sequences, contains three domains: a 75-amino-acid amino-terminal domain with a potential transmembrane helix domain, a central collagen-like region (CLR) containing Gly-Xaa-Yaa (GXY) repeats, and a 7-amino-acid carboxy-terminal domain. The CLR region is polymorphic among the two isolates with amino-acid substitutions and a variable number of GXY triplets. Collagen-like proteins are rare in prokaryotes, although they have been described in several pathogenic bacteria, including Bacillus cereus, Bacillus anthracis and Bacillus thuringiensis, closely related to Pasteuria species, in which they could be involved in the adherence of bacteria to host cells.
Efficient production of artificially designed gelatins with a Bacillus brevis system.
Kajino, T; Takahashi, H; Hirai, M; Yamada, Y
2000-01-01
Artificially designed gelatins comprising tandemly repeated 30-amino-acid peptide units derived from human alphaI collagen were successfully produced with a Bacillus brevis system. The DNA encoding the peptide unit was synthesized by taking into consideration the codon usage of the host cells, but no clones having a tandemly repeated gene were obtained through the above-mentioned strategy. Minirepeat genes could be selected in vivo from a mixture of every possible sequence encoding an artificial gelatin by randomly ligating the mixed sequence unit and transforming it into Escherichia coli. Larger repeat genes constructed by connecting minirepeat genes obtained by in vivo selection were also stable in the expression host cells. Gelatins derived from the eight-unit and six-unit repeat genes were extracellularly produced at the level of 0.5 g/liter and easily purified by ammonium sulfate fractionation and anion-exchange chromatography. The purified artificial gelatins had the predicted N-terminal sequences and amino acid compositions and a solgel property similar to that of the native gelatin. These results suggest that the selection of a repeat unit sequence stable in an expression host is a shortcut for the efficient production of repetitive proteins and that it can conveniently be achieved by the in vivo selection method. This study revealed the possible industrial application of artificially designed repetitive proteins.
Hydroxyapatite-binding peptides for bone growth and inhibition
Bertozzi, Carolyn R [Berkeley, CA; Song, Jie [Shrewsbury, MA; Lee, Seung-Wuk [Walnut Creek, CA
2011-09-20
Hydroxyapatite (HA)-binding peptides are selected using combinatorial phage library display. Pseudo-repetitive consensus amino acid sequences possessing periodic hydroxyl side chains in every two or three amino acid sequences are obtained. These sequences resemble the (Gly-Pro-Hyp).sub.x repeat of human type I collagen, a major component of extracellular matrices of natural bone. A consistent presence of basic amino acid residues is also observed. The peptides are synthesized by the solid-phase synthetic method and then used for template-driven HA-mineralization. Microscopy reveal that the peptides template the growth of polycrystalline HA crystals .about.40 nm in size.
Hydroxyproline Ring Pucker Causes Frustration of Helix Parameters in the Collagen Triple Helix
NASA Astrophysics Data System (ADS)
Ying Chow, W.; Bihan, Dominique; Forman, Chris J.; Slatter, David A.; Reid, David G.; Wales, David J.; Farndale, Richard W.; Duer, Melinda J.
2015-07-01
Collagens, the most abundant proteins in mammals, are defined by their triple-helical structures and distinctive Gly-Xaa-Yaa repeating sequence, where Xaa is often proline and Yaa, hydroxyproline (Hyp/O). It is known that hydroxyproline in the Yaa position stabilises the triple helix, and that lack of proline hydroxylation in vivo leads to dysfunctional collagen extracellular matrix assembly, due to a range of factors such as a change in hydration properties. In addition, we note that in model peptides, when Yaa is unmodified proline, the Xaa proline has a strong propensity to adopt an endo ring conformation, whilst when Yaa is hydroxyproline, the Xaa proline adopts a range of endo and exo conformations. Here we use a combination of solid-state NMR spectroscopy and potential energy landscape modelling of synthetic triple-helical collagen peptides to understand this effect. We show that hydroxylation of the Yaa proline causes the Xaa proline ring conformation to become metastable, which in turn confers flexibility on the triple helix.
Hydroxyproline Ring Pucker Causes Frustration of Helix Parameters in the Collagen Triple Helix
Ying Chow, W.; Bihan, Dominique; Forman, Chris J.; Slatter, David A.; Reid, David G.; Wales, David J.; Farndale, Richard W.; Duer, Melinda J.
2015-01-01
Collagens, the most abundant proteins in mammals, are defined by their triple-helical structures and distinctive Gly-Xaa-Yaa repeating sequence, where Xaa is often proline and Yaa, hydroxyproline (Hyp/O). It is known that hydroxyproline in the Yaa position stabilises the triple helix, and that lack of proline hydroxylation in vivo leads to dysfunctional collagen extracellular matrix assembly, due to a range of factors such as a change in hydration properties. In addition, we note that in model peptides, when Yaa is unmodified proline, the Xaa proline has a strong propensity to adopt an endo ring conformation, whilst when Yaa is hydroxyproline, the Xaa proline adopts a range of endo and exo conformations. Here we use a combination of solid-state NMR spectroscopy and potential energy landscape modelling of synthetic triple-helical collagen peptides to understand this effect. We show that hydroxylation of the Yaa proline causes the Xaa proline ring conformation to become metastable, which in turn confers flexibility on the triple helix. PMID:26220399
Qiu, Yimin; Mekkat, Arya; Yu, Hongtao; Yigit, Sezin; Hamaia, Samir; Farndale, Richard W; Kaplan, David L; Lin, Yu-Shan; Brodsky, Barbara
2018-05-11
Gly missense mutations in type I collagen, which replace a conserved Gly in the repeating (Gly-Xaa-Yaa) n sequence with a larger residue, are known to cause Osteogenesis Imperfecta (OI). The clinical consequences of such mutations range from mild to lethal, with more serious clinical severity associated with larger Gly replacement residues. Here, we investigate the influence of the identity of the residue replacing Gly within and adjacent to the integrin binding 502 GFPGER 507 sequence on triple-helix structure, stability and integrin binding using a recombinant bacterial collagen system. Recombinant collagens were constructed with Gly substituted by Ala, Ser or Val at four positions within the integrin binding region. All constructs formed a stable triple-helix structure with a small decrease in melting temperature. Trypsin was used to probe local disruption of the triple helix, and Gly to Val replacements made the triple helix trypsin sensitive at three of the four sites. Any mutation at Gly505, eliminated integrin binding, while decreased integrin binding affinity was observed in the replacement of Gly residues at Gly502 following the order Val > Ser > Ala. Molecular dynamics simulations indicated that all Gly replacements led to transient disruption of triple-helix interchain hydrogen bonds in the region of the Gly replacement. These computational and experimental results lend insight into the complex molecular basis of the varying clinical severity of OI. Copyright © 2018. Published by Elsevier Inc.
Mutations in the collagen XII gene define a new form of extracellular matrix-related myopathy.
Hicks, Debbie; Farsani, Golara Torabi; Laval, Steven; Collins, James; Sarkozy, Anna; Martoni, Elena; Shah, Ashoke; Zou, Yaqun; Koch, Manuel; Bönnemann, Carsten G; Roberts, Mark; Lochmüller, Hanns; Bushby, Kate; Straub, Volker
2014-05-01
Bethlem myopathy (BM) [MIM 158810] is a slowly progressive muscle disease characterized by contractures and proximal weakness, which can be caused by mutations in one of the collagen VI genes (COL6A1, COL6A2 and COL6A3). However, there may be additional causal genes to identify as in ∼50% of BM cases no mutations in the COL6 genes are identified. In a cohort of -24 patients with a BM-like phenotype, we first sequenced 12 candidate genes based on their function, including genes for known binding partners of collagen VI, and those enzymes involved in its correct post-translational modification, assembly and secretion. Proceeding to whole-exome sequencing (WES), we identified mutations in the COL12A1 gene, a member of the FACIT collagens (fibril-associated collagens with interrupted triple helices) in five individuals from two families. Both families showed dominant inheritance with a clinical phenotype resembling classical BM. Family 1 had a single-base substitution that led to the replacement of one glycine residue in the triple-helical domain, breaking the Gly-X-Y repeating pattern, and Family 2 had a missense mutation, which created a mutant protein with an unpaired cysteine residue. Abnormality at the protein level was confirmed in both families by the intracellular retention of collagen XII in patient dermal fibroblasts. The mutation in Family 2 leads to the up-regulation of genes associated with the unfolded protein response (UPR) pathway and swollen, dysmorphic rough-ER. We conclude that the spectrum of causative genes in extracellular matrix (ECM)-related myopathies be extended to include COL12A1.
Hyperunstable matrix proteins in the byssus of Mytilus galloprovincialis.
Sagert, Jason; Waite, J Herbert
2009-07-01
The marine mussel Mytilus galloprovincialis is tethered to rocks in the intertidal zone by a holdfast known as the byssus. Functioning as a shock absorber, the byssus is composed of threads, the primary molecular components of which are collagen-containing proteins (preCOLs) that largely dictate the higher order self-assembly and mechanical properties of byssal threads. The threads contain additional matrix components that separate and perhaps lubricate the collagenous microfibrils during deformation in tension. In this study, the thread matrix proteins (TMPs), a glycine-, tyrosine- and asparagine-rich protein family, were shown to possess unique repeated sequence motifs, significant transcriptional heterogeneity and were distributed throughout the byssal thread. Deamidation was shown to occur at a significant rate in a recombinant TMP and in the byssal thread as a function of time. Furthermore, charge heterogeneity presumably due to deamidation was observed in TMPs extracted from threads. The TMPs were localized to the preCOL-containing secretory granules in the collagen gland of the foot and are assumed to provide a viscoelastic matrix around the collagenous fibers in byssal threads.
Slatter, David A.; Bihan, Dominique G.; Jarvis, Gavin E.; Stone, Rachael; Pugh, Nicholas; Giddu, Sumana; Farndale, Richard W.
2012-01-01
Recently, the ability of polymeric collagen-like peptides to regulate cell behavior has generated great interest. A triple-helical peptide known as collagen-related peptide (CRP) contains the sequence (Gly-Pro-Hyp)10. With Gly-Pro-Cys triplets appended to both of its termini, designated CRPcys, chemical cross-linking using heterobifunctional reagents generates CRPcys-XL, a potent, widely used, polymeric agonist for platelet Glycoprotein VI, whereas non-cross-linked, monomeric CRPcys antagonizes Glycoprotein VI. Here, we describe how cysteine in these triplets may also undergo random air-induced oxidation, especially upon prolonged storage or repeated freeze–thawing, to form disulphide bonds, resulting in a lesser degree of polymerization than with chemical cross-linking. We investigated the monomeric and polymeric states of these and other cysteine-containing collagen-derived peptides, using gel filtration and dynamic light scattering, allowing the size of a CRP-XL aggregate to be estimated. The effect of cysteine thiols upon peptide adsorption to surfaces and subsequent platelet responses was investigated. This demonstrated that cysteine is required for strong binding to glass coverslips and to plastic plates used in ELISA assays. PMID:22555281
Slatter, David A; Bihan, Dominique G; Jarvis, Gavin E; Stone, Rachael; Pugh, Nicholas; Giddu, Sumana; Farndale, Richard W
2012-07-01
Recently, the ability of polymeric collagen-like peptides to regulate cell behavior has generated great interest. A triple-helical peptide known as collagen-related peptide (CRP) contains the sequence (Gly-Pro-Hyp)(10). With Gly-Pro-Cys triplets appended to both of its termini, designated CRP(cys), chemical cross-linking using heterobifunctional reagents generates CRP(cys)-XL, a potent, widely used, polymeric agonist for platelet Glycoprotein VI, whereas non-cross-linked, monomeric CRP(cys) antagonizes Glycoprotein VI. Here, we describe how cysteine in these triplets may also undergo random air-induced oxidation, especially upon prolonged storage or repeated freeze-thawing, to form disulphide bonds, resulting in a lesser degree of polymerization than with chemical cross-linking. We investigated the monomeric and polymeric states of these and other cysteine-containing collagen-derived peptides, using gel filtration and dynamic light scattering, allowing the size of a CRP-XL aggregate to be estimated. The effect of cysteine thiols upon peptide adsorption to surfaces and subsequent platelet responses was investigated. This demonstrated that cysteine is required for strong binding to glass coverslips and to plastic plates used in ELISA assays. Copyright © 2012 Elsevier Inc. All rights reserved.
Rich, R L; Deivanayagam, C C; Owens, R T; Carson, M; Höök, A; Moore, D; Symersky, J; Yang, V W; Narayana, S V; Höök, M
1999-08-27
Most mammalian cells and some pathogenic bacteria are capable of adhering to collagenous substrates in processes mediated by specific cell surface adherence molecules. Crystal structures of collagen-binding regions of the human integrin alpha(2)beta(1) and a Staphylococcus aureus adhesin reveal a "trench" on the surface of both of these proteins. This trench can accommodate a collagen triple-helical structure and presumably represents the ligand-binding site (Emsley, J., King, S. L., Bergelson, J. M., and Liddington, R. C. (1997) J. Biol. Chem. 272, 28512-28517; Symersky, J., Patti, J. M., Carson, M., House-Pompeo, K., Teale, M., Moore, D., Jin, L., Schneider, A., DeLucas, L. J., Höök, M., and Narayana, S. V. L. (1997) Nat. Struct. Biol. 4, 833-838). We report here the crystal structure of the alpha subunit I domain from the alpha(1)beta(1) integrin. This collagen-binding protein also contains a trench on one face in which the collagen triple helix may be docked. Furthermore, we compare the collagen-binding mechanisms of the human alpha(1) integrin I domain and the A domain from the S. aureus collagen adhesin, Cna. Although the S. aureus and human proteins have unrelated amino acid sequences, secondary structure composition, and cation requirements for effective ligand binding, both proteins bind at multiple sites within one collagen molecule, with the sites in collagen varying in their affinity for the adherence molecule. We propose that (i) these evolutionarily dissimilar adherence proteins recognize collagen via similar mechanisms, (ii) the multisite, multiclass protein/ligand interactions observed in these two systems result from a binding-site trench, and (iii) this unusual binding mechanism may be thematic for proteins binding extended, rigid ligands that contain repeating structural motifs.
Crystal and Molecular Structure of a Collagen-Like Peptide at 1.9 overset{circ}{A} Resolution
NASA Astrophysics Data System (ADS)
Bella, Jordi; Eaton, Mark; Brodsky, Barbara; Berman, Helen M.
1994-10-01
The structure of a protein triple helix has been determined at 1.9 angstrom resolution by x-ray crystallographic studies of a collagen-like peptide containing a single substitution of the consensus sequence. This peptide adopts a triple-helical structure that confirms the basic features determined from fiber diffraction studies on collagen: supercoiling of polyproline II helices and interchain hydrogen bonding that follows the model II of Rich and Crick. In addition, the structure provides new information concerning the nature of this protein fold. Each triple helix is surrounded by a cylinder of hydration, with an extensive hydrogen bonding network between water molecules and peptide acceptor groups. Hydroxyproline residues have a critical role in this water network. The interaxial spacing of triple helices in the crystal is similar to that in collagen fibrils, and the water networks linking adjacent triple helices in the crystal structure are likely to be present in connective tissues. The breaking of the repeating (X-Y-Gly)_n pattern by a Gly-->Ala substitution results in a subtle alteration of the conformation, with a local untwisting of the triple helix. At the substitution site, direct interchain hydrogen bonds are replaced with interstitial water bridges between the peptide groups. Similar conformational changes may occur in Gly-->X mutated collagens responsible for the diseases osteogenesis imperfecta, chondrodysplasias, and Ehlers-Danlos syndrome IV.
Font, B; Eichenberger, D; Goldschmidt, D; Boutillon, M M; Hulmes, D J
1998-06-15
Fibromodulin belongs to the family of small, leucine-rich proteoglycans which have been reported to interact with collagens and to inhibit type I collagen fibrillogenesis. Decorin and fibromodulin exhibit a noticeable degree of sequence similarity. However, as previously reported [Font, B., Eichenberger, D., Rosenberg, L. M. & van der Rest, M. (1996) Matrix Biol. 15, 341-348] the domains of these molecules implicated in the interactions with type XII and type XIV collagens are different, these being the dermatan sulphate/chondroitin sulphate chain for decorin and the core protein for fibromodulin. At the present time the fibromodulin domains implicated in the interactions with fibrillar collagens remain unknown. In experiments reported here, we have sought to identify the structural requirements for fibromodulin interaction with collagen and for the control of type I collagen fibrillogenesis. Circular dichroism spectra and fibrillogenesis inhibition studies show that fibromodulin structure and its collagen fibrillogenesis control function are strictly dependent on the presence of intact disulphide bridge(s). In addition, we show that the binding of fibromodulin (or fibromodulin-derived fragments) to type I collagen is not necessarily correlated with fibrillogenesis inhibition. To isolate fibromodulin domains, the native proteoglycan was submitted to mild proteolysis. We have isolated an alpha-chymotrypsin-resistant fragment which contains the bulk of the N-terminal and central region of the molecule including the leucine-rich repeats 4 and 6 reported for decorin to be involved in type I collagen binding. This fragment does not bind to type I collagen. Using enzymes with different specificities, a number of large fragments of fibromodulin were obtained, suggesting a compact structure for this molecule which is relatively resistant to proteolysis. None of these N-glycosylated fragments were able to bind to type I collagen in co-sedimentation experiments. Taken together these results suggest that fibromodulin-type I collagen interactions leading to fibrillogenesis inhibition require more than one binding domain. One of these domains could be the C-terminal end of the molecule containing the disulphide loop which is absent in the chymotrypsin-resistant fragment.
Miljkovic, Marija; Bertani, Iris; Fira, Djordje; Jovcic, Branko; Novovic, Katarina; Venturi, Vittorio; Kojic, Milan
2016-01-01
AggLb is the largest (318.6 kDa) aggregation-promoting protein of Lactobacillus paracasei subsp. paracasei BGNJ1-64 responsible for forming large cell aggregates, which causes auto-aggregation, collagen binding and pathogen exclusion in vitro. It contains an N-terminus leader peptide, followed by six successive collagen binding domains, 20 successive repeats (CnaB-like domains) and an LPXTG sorting signal at the C-terminus for cell wall anchoring. Experimental information about the roles of the domains of AggLb is currently unknown. To define the domain that confers cell aggregation and the key domains for interactions of specific affinity between AggLb and components of the extracellular matrix, we constructed a series of variants of the aggLb gene and expressed them in Lactococcus lactis subsp. lactis BGKP1-20 using a lactococcal promoter. All of the variants contained a leader peptide, an inter collagen binding-CnaB domain region (used to raise an anti-AggLb antibody), an anchor domain and a different number of collagen binding and CnaB-like domains. The role of the collagen binding repeats of the N-terminus in auto-aggregation and binding to collagen and fibronectin was confirmed. Deletion of the collagen binding repeats II, III, and IV resulted in a loss of the strong auto-aggregation, collagen and fibronectin binding abilities whereas the biofilm formation capability was increased. The strong auto-aggregation, collagen and fibronectin binding abilities of AggLb were negatively correlated to biofilm formation.
The Tyrosine Sulfate Domain of Fibromodulin Binds Collagen and Enhances Fibril Formation.
Tillgren, Viveka; Mörgelin, Matthias; Önnerfjord, Patrik; Kalamajski, Sebastian; Aspberg, Anders
2016-11-04
Small leucine-rich proteoglycans interact with other extracellular matrix proteins and are important regulators of matrix assembly. Fibromodulin has a key role in connective tissues, binding collagen through two identified binding sites in its leucine-rich repeat domain and regulating collagen fibril formation in vitro and in vivo Some nine tyrosine residues in the fibromodulin N-terminal domain are O-sulfated, a posttranslational modification often involved in protein interactions. The N-terminal domain mimics heparin, binding proteins with clustered basic amino acid residues. Because heparin affects collagen fibril formation, we investigated whether tyrosine sulfate is involved in fibromodulin interactions with collagen. Using full-length fibromodulin and its N-terminal tyrosine-sulfated domain purified from tissue, as well as recombinant fibromodulin fragments, we found that the N-terminal domain binds collagen. The tyrosine-sulfated domain and the leucine-rich repeat domain both bound to three specific sites along the collagen type I molecule, at the N terminus and at 100 and 220 nm from the N terminus. The N-terminal domain shortened the collagen fibril formation lag phase and tyrosine sulfation was required for this effect. The isolated leucine-rich repeat domain inhibited the fibril formation rate, and full-length fibromodulin showed a combination of these effects. The fibrils formed in the presence of fibromodulin or its fragments showed more organized structure. Fibromodulin and its tyrosine sulfate domain remained bound on the formed fiber. Taken together, this suggests a novel, regulatory function for tyrosine sulfation in collagen interaction and control of fibril formation. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Subramanian, Sundar Raman; Singam, Ettayapuram Ramaprasad Azhagiya; Berinski, Michael; Subramanian, Venkatesan; Wade, Rebecca C
2016-08-25
Sequence-specific cleavage of collagen by mammalian collagenase plays a pivotal role in cell function. Collagenases are matrix metalloproteinases that cleave the peptide bond at a specific position on fibrillar collagen. The collagenase Hemopexin-like (HPX) domain has been proposed to be responsible for substrate recognition, but the mechanism by which collagenases identify the cleavage site on fibrillar collagen is not clearly understood. In this study, Brownian dynamics simulations coupled with atomic-detail and coarse-grained molecular dynamics simulations were performed to dock matrix metalloproteinase-1 (MMP-1) on a collagen IIIα1 triple helical peptide. We find that the HPX domain recognizes the collagen triple helix at a conserved R-X11-R motif C-terminal to the cleavage site to which the HPX domain of collagen is guided electrostatically. The binding of the HPX domain between the two arginine residues is energetically stabilized by hydrophobic contacts with collagen. From the simulations and analysis of the sequences and structural flexibility of collagen and collagenase, a mechanistic scheme by which MMP-1 can recognize and bind collagen for proteolysis is proposed.
Soeda, Atsuko; Mamiya, Takashi; Hiroshima, Yoshinori; Sugiyama, Hiroaki; Shidara, Sayoko; Dai, Yuichi; Nakahara, Akira; Ikezawa, Kazuto
2014-10-01
Collagenous gastritis (CG) is a rare disorder characterized by the thick collagenous subepithelial bands associated with mucosal inflammation. There have been approximately fifty reports in the literature since it was first described in 1989. According to previous reports, CG is heterogeneous and classified into two groups-(1) cases limited to the gastric mucosa in children or young adults, and (2) CG associated with collagenous colitis in elderly adults presenting with chronic watery diarrhea. In Japan, only nine previous cases were reported, and all of them were young adults. We report a case of CG with collagenous duodenitis in a 22-year-old female. She had repeated upper gastrointestinal bleeding from a Dieulafoy lesion of the fornix, but had no symptoms of malabsorption or diarrhea. Endoscopic findings revealed striking nodularity with a smooth islet-shaped normal area in the antrum and the body. The pathological findings of nodular mucosa showed the deposition of collagen bands just under the mucoepithelial lesion. In addition, she had collagenous duodenitis in part of the bulbs, and a colonoscopy showed no abnormalities. We provide a literature review of CG and collagenous gastroduodenitis without colonic involvement.
Ishikawa, Yoshihiro; Bächinger, Hans Peter
2013-11-01
Collagen biosynthesis occurs in the rough endoplasmic reticulum, and many molecular chaperones and folding enzymes are involved in this process. The folding mechanism of type I procollagen has been well characterized, and protein disulfide isomerase (PDI) has been suggested as a key player in the formation of the correct disulfide bonds in the noncollagenous carboxyl-terminal and amino-terminal propeptides. Prolyl 3-hydroxylase 1 (P3H1) forms a hetero-trimeric complex with cartilage-associated protein and cyclophilin B (CypB). This complex is a multifunctional complex acting as a prolyl 3-hydroxylase, a peptidyl prolyl cis-trans isomerase, and a molecular chaperone. Two major domains are predicted from the primary sequence of P3H1: an amino-terminal domain and a carboxyl-terminal domain corresponding to the 2-oxoglutarate- and iron-dependent dioxygenase domains similar to the α-subunit of prolyl 4-hydroxylase and lysyl hydroxylases. The amino-terminal domain contains four CXXXC sequence repeats. The primary sequence of cartilage-associated protein is homologous to the amino-terminal domain of P3H1 and also contains four CXXXC sequence repeats. However, the function of the CXXXC sequence repeats is not known. Several publications have reported that short peptides containing a CXC or a CXXC sequence show oxido-reductase activity similar to PDI in vitro. We hypothesize that CXXXC motifs have oxido-reductase activity similar to the CXXC motif in PDI. We have tested the enzyme activities on model substrates in vitro using a GCRALCG peptide and the P3H1 complex. Our results suggest that this complex could function as a disulfide isomerase in the rough endoplasmic reticulum.
Loughlin, J; Irven, C; Hardwick, L J; Butcher, S; Walsh, S; Wordsworth, P; Sykes, B
1995-09-01
Ehlers-Danlos syndrome (EDS) is a group of heritable disorders of connective tissue with skin, ligaments and blood vessels being the main sites affected. The commonest variant (EDS II) exhibits an autosomal dominant mode of inheritance and is characterized by joint hypermobility, cigarette paper scars, lax skin and excessive bruising. As yet no gene has been linked to EDS II, nor has linkage been established to a specific region of the genome. However, several candidate genes encoding proteins of the extracellular matrix have been excluded. Using an intragenic simple sequence repeat polymorphism, we report linkage of the COL5A1 gene, which encodes the alpha 1(V) chain of type V collagen, to EDS II. A maximum LOD score (Zmax) for linkage of 8.3 at theta = 0.00 was generated for a single large pedigree.
Xu, Tingting; Zhou, Cong-Zhao; Xiao, Jianxi; Liu, Jinsong
2018-02-20
Naturally occurring interruptions in nonfibrillar collagen play key roles in molecular flexibility, collagen degradation, and ligand binding. The structural feature of the interruption sequences and the molecular basis for their functions have not been well studied. Here, we focused on a G5G type natural interruption sequence G-POALO-G from human type XIX collagen, a homotrimer collagen, as this sequence possesses distinct properties compared with those of a pathological similar Gly mutation sequence in collagen mimic peptides. We determined the crystal structures of the host-guest peptide (GPO) 3 -GPOALO-(GPO) 4 to 1.03 Å resolution in two crystal forms. In these structures, the interruption zone brings localized disruptions to the triple helix and introduces a light 6-8° bend with the same directional preference to the whole molecule, which may correspond structurally to the first physiological kink site in type XIX collagen. Furthermore, at the G5G interruption site, the presence of Ala and Leu residues, both with free N-H groups, allows the formation of more direct and water-mediated interchain hydrogen bonds than in the related Gly → Ala structure. These could partly explain the difference in thermal stability between the different interruptions. In addition, our structures provide a detailed view of the dynamic property of such an interrupted zone with respect to hydrogen bonding topology, torsion angles, and helical parameters. Our results, for the first time, also identified the binding of zinc to the end of the triple helix. These findings will shed light on how the interruption sequence influences the conformation of the collagen molecule and provide a structural basis for further functional studies.
Itoh, Aiko; Nonaka, Yasuhiro; Ogawa, Takashi; Nakamura, Takanori; Nishi, Nozomu
2017-11-01
We previously reported that galectin-9 (Gal-9), an immunomodulatory animal lectin, could bind to insoluble collagen preparations and exerted direct cytocidal effects on immune cells. In the present study, we found that mature insoluble elastin is capable of binding Gal-9 and other members of the human galectin family. Lectin blot analysis of a series of commercial water-soluble elastin preparations, PES-(A) ~ PES-(E), revealed that only PES-(E) contained substances recognized by Gal-9. Gal-9-interacting substances in PES-(E) were affinity-purified, digested with trypsin and then analyzed by reversed-phase HPLC. Peptide fragments derived from five members of the small leucine-rich repeat proteoglycan family, versican, lumican, osteoglycin/mimecan, prolargin, and fibromodulin, were identified by N-terminal amino acid sequence analysis. The results indicate that Gal-9 and possibly other galectins recognize glycans attached to small leucine-rich repeat proteoglycans associated with insoluble elastin and also indicate the possibility that mature insoluble elastin serves as an extracellular reservoir for galectins.
Phadnis, S V; Atilade, A; Bowring, J; Kyrgiou, M; Young, M P A; Evans, H; Paraskevaidis, E; Walker, P
2011-12-01
To study the distribution of collagen in the regenerated cervical tissue after excisional treatment for cervical intraepithelial neoplasia (CIN). Cohort study. A large tertiary teaching hospital in London. Women who underwent repeat excisional treatment for treatment failure or persistent CIN. Eligible women who underwent a repeat excisional treatment for treatment failure, including hysterectomy, between January 2002 and December 2007 in our colposcopy unit were identified by the Infoflex(®) database and SNOMED encoded histopathology database. Collagen expression was assessed using picro-Sirius red stain and the intensity of staining was compared in paired specimens from the first and second treatments. Differences in collagen expression were examined in the paired excisional treatment specimens. A total of 17 women were included. Increased collagen expression in the regenerated cervical tissue of the second cone compared with the first cone was noted in six women, decreased expression was noted in five women, and the pattern of collagen distribution was equivocal in six women. There is no overall change in collagen distribution during regeneration following excisional treatment for CIN. © 2011 The Authors BJOG An International Journal of Obstetrics and Gynaecology © 2011 RCOG.
Why fibrous proteins are romantic.
Cohen, C
1998-01-01
Here I give a personal account of the great history of fibrous protein structure. I describe how Astbury first recognized the essential simplicity of fibrous proteins and their paradigmatic role in protein structure. The poor diffraction patterns yielded by these proteins were then deciphered by Pauling, Crick, Ramachandran and others (in part by model building) to reveal alpha-helical coiled coils, beta-sheets, and the collagen triple helical coiled coil-all characterized by different local sequence periodicities. Longer-range sequence periodicities (or "magic numbers") present in diverse fibrous proteins, such as collagen, tropomyosin, paramyosin, myosin, and were then shown to account for the characteristic axial repeats observed in filaments of these proteins. More recently, analysis of fibrous protein structure has been extended in many cases to atomic resolution, and some systems, such as "leucine zippers," are providing a deeper understanding of protein design than similar studies of globular proteins. In the last sections, I provide some dramatic examples of fibrous protein dynamics. One example is the so-called "spring-loaded" mechanism for viral fusion by the hemagglutinin protein of influenza. Another is the possible conformational changes in prion proteins, implicated in "mad cow disease," which may be related to similar transitions in a variety of globular and fibrous proteins. Copyright 1998 Academic Press.
Mammoth and Mastodon collagen sequences; survival and utility
NASA Astrophysics Data System (ADS)
Buckley, M.; Larkin, N.; Collins, M.
2011-04-01
Near-complete collagen (I) sequences are proposed for elephantid and mammutid taxa, based upon available African elephant genomic data and supported with LC-MALDI-MS/MS and LC-ESI-MS/MS analyses of collagen digests from proboscidean bone. Collagen sequence coverage was investigated from several specimens of two extinct mammoths ( Mammuthus trogontherii and Mammuthus primigenius), the extinct American mastodon ( Mammut americanum), the extinct straight-tusked elephant ( Elephas ( Palaeoloxodon) antiquus) and extant Asian ( Elephas maximus) and African ( Loxodonta africana) elephants and compared between the two ionization techniques used. Two suspected mammoth fossils from the British Middle Pleistocene (Cromerian) deposits of the West Runton Forest Bed were analysed to investigate the potential use of peptide mass spectrometry for fossil identification. Despite the age of the fossils, sufficient peptides were obtained to identify these as elephantid, and sufficient sequence variation to discriminate elephantid and mammutid collagen (I). In-depth LC-MS analyses further failed to identify a peptide that could be used to reliably distinguish between the three genera of elephantids ( Elephas, Loxodonta and Mammuthus), an observation consistent with predicted amino acid substitution rates between these species.
Nanolayered Features of Collagen-like Peptides
NASA Technical Reports Server (NTRS)
Valluzzi, Regina; Bini, Elisabetta; Haas, Terry; Cebe, Peggy; Kaplan, David L.
2003-01-01
We have been investigating collagen-like model oligopeptides as molecular bases for complex ordered biomimetic materials. The collagen-like molecules incorporate aspects of native collagen sequence and secondary structure. Designed modifications to native primary and secondary structure have been incorporated to control the nanostructure and microstructure of the collagen-like materials produced. We find that the collagen-like molecules form a number of lyotropic rod liquid crystalline phases, which because of their strong temperature dependence in the liquid state can also be viewed as solvent intercalated thermotropic liquid crystals. The liquid crystalline phases formed by the molecules can be captured in the solid state by drying off solvent, resulting in solid nanopatterned (chemically and physically) thermally stable (to greater than 100 C) materials. Designed sequences which stabilize smectic phases have allowed a variety of nanoscale multilayered biopolymeric materials to be developed. Preliminary investigations suggest that chemical patterns running perpendicular to the smectic layer plane can be functionalized and used to localize a variety of organic, inorganic, and organometallic moieties in very simple multilayered nanocomposites. The phase behavior of collagen-like oligopeptide materials is described, emphasizing the correlation between mesophase, molecular orientation, and chemical patterning at the microscale and nanoscale. In many cases, the textures observed for smectic and hexatic phase collagens are remarkably similar to the complex (and not fully understood) helicoids observed in biological collagen-based tissues. Comparisons between biological morphologies and collagen model liquid crystalline (and solidified materials) textures may help us understand the molecular features which impart order and function to the extracellular matrix and to collagen-based mineralized tissues. Initial studies have utilized synthetic collagen-like peptides while future work will also focus on similar sequences generated via genetic engineering methods.
Prediction of molecular mimicry candidates in human pathogenic bacteria.
Doxey, Andrew C; McConkey, Brendan J
2013-08-15
Molecular mimicry of host proteins is a common strategy adopted by bacterial pathogens to interfere with and exploit host processes. Despite the availability of pathogen genomes, few studies have attempted to predict virulence-associated mimicry relationships directly from genomic sequences. Here, we analyzed the proteomes of 62 pathogenic and 66 non-pathogenic bacterial species, and screened for the top pathogen-specific or pathogen-enriched sequence similarities to human proteins. The screen identified approximately 100 potential mimicry relationships including well-characterized examples among the top-scoring hits (e.g., RalF, internalin, yopH, and others), with about 1/3 of predicted relationships supported by existing literature. Examination of homology to virulence factors, statistically enriched functions, and comparison with literature indicated that the detected mimics target key host structures (e.g., extracellular matrix, ECM) and pathways (e.g., cell adhesion, lipid metabolism, and immune signaling). The top-scoring and most widespread mimicry pattern detected among pathogens consisted of elevated sequence similarities to ECM proteins including collagens and leucine-rich repeat proteins. Unexpectedly, analysis of the pathogen counterparts of these proteins revealed that they have evolved independently in different species of bacterial pathogens from separate repeat amplifications. Thus, our analysis provides evidence for two classes of mimics: complex proteins such as enzymes that have been acquired by eukaryote-to-pathogen horizontal transfer, and simpler repeat proteins that have independently evolved to mimic the host ECM. Ultimately, computational detection of pathogen-specific and pathogen-enriched similarities to host proteins provides insights into potentially novel mimicry-mediated virulence mechanisms of pathogenic bacteria.
Prediction of molecular mimicry candidates in human pathogenic bacteria
Doxey, Andrew C; McConkey, Brendan J
2013-01-01
Molecular mimicry of host proteins is a common strategy adopted by bacterial pathogens to interfere with and exploit host processes. Despite the availability of pathogen genomes, few studies have attempted to predict virulence-associated mimicry relationships directly from genomic sequences. Here, we analyzed the proteomes of 62 pathogenic and 66 non-pathogenic bacterial species, and screened for the top pathogen-specific or pathogen-enriched sequence similarities to human proteins. The screen identified approximately 100 potential mimicry relationships including well-characterized examples among the top-scoring hits (e.g., RalF, internalin, yopH, and others), with about 1/3 of predicted relationships supported by existing literature. Examination of homology to virulence factors, statistically enriched functions, and comparison with literature indicated that the detected mimics target key host structures (e.g., extracellular matrix, ECM) and pathways (e.g., cell adhesion, lipid metabolism, and immune signaling). The top-scoring and most widespread mimicry pattern detected among pathogens consisted of elevated sequence similarities to ECM proteins including collagens and leucine-rich repeat proteins. Unexpectedly, analysis of the pathogen counterparts of these proteins revealed that they have evolved independently in different species of bacterial pathogens from separate repeat amplifications. Thus, our analysis provides evidence for two classes of mimics: complex proteins such as enzymes that have been acquired by eukaryote-to-pathogen horizontal transfer, and simpler repeat proteins that have independently evolved to mimic the host ECM. Ultimately, computational detection of pathogen-specific and pathogen-enriched similarities to host proteins provides insights into potentially novel mimicry-mediated virulence mechanisms of pathogenic bacteria. PMID:23715053
Okano, Kazuhiro; Schnaper, H William; Bomsztyk, Karol; Hayashida, Tomoko
2006-09-08
Although it is clear that transforming growth factor-beta1 (TGF-beta1) is critical for renal fibrogenesis, the complexity of the involved mechanisms is increasingly apparent. TGF-beta1 stimulates phosphorylation of Smad2/3 and activates other signaling molecules as well. The molecular link between these other kinases and Smads is not known. We sought new binding partners for Smad3 in renal cells and identified receptor for activated protein kinase C 1 (RACK1) as a novel binding partner of Smad3. The linker region of Smad3 and the tryptophan-aspartic acid repeat 6 and 7 of RACK1 are sufficient for the association. RACK1 also interacts with Smad3 in the human kidney epithelial cell line, HKC. Silencing RACK1 increases transcriptional activity of TGF-beta1-responsive promoter sequences of the Smad binding element (SBE), p3TP-Lux, and alpha2(I) collagen. Conversely, overexpressed RACK1 negatively modulates alpha2(I) collagen transcriptional activity in TGF-beta1-stimulated cells. RACK1 did not affect phosphorylation of Smad3 at the C terminus or in the linker region. However, RACK1 reduced direct binding of Smad3 to the SBE motif. Mutating a RACK1 tyrosine at residue 246, but not at 228, decreased the inhibitory effect of RACK1 on both alpha2(I) collagen promoter activity and Smad binding to SBE induced by TGF-beta1. These results suggest that RACK1 modulates transcription of alpha2(I) collagen by TGF-beta1 through interference with Smad3 binding to the gene promoter.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, O.; Masters, C.; Lewis, M.B.
1994-09-01
In an 8-year-old girl and her father, both of whom have severe type III OI, we have previously used RNA/RNA hybrid analysis to demonstrate a mismatch in the region of {alpha}1(I) mRNA coding for aa 558-861. We used SSCP to further localize the abnormality to a subregion coding for aa 579-679. This region was subcloned and sequenced. Each patient`s cDNA has a deletion of the sequences coding for the last residue of exon 34, and all of exons 35 and 36 (aa 604-639), followed by an insertion of 156 nt from the 3{prime}-end of intron 36. PCR amplification of leukocytemore » DNA from the patients and the clinically normal paternal grandmother yielded two fragments: a 1007 bp fragment predicted from normal genomic sequences and a 445 bp fragment. Subcloning and sequencing of the shorter genomic PCR product confirmed the presence of a 565 bp genomic deletion from the end of exon 34 to the middle of intron 36. The abnormal protein is apparently synthesized and incorporated into helix. The inserted nucleotides are in frame with the collagenous sequence and contain no stop codons. They encode a 52 aa non-collagenous region. The fibroblast procollagen of the patients has both normal and electrophoretically delayed pro{alpha}(I) bands. The electrophoretically delayed procollagen is very sensitive to pepsin or trypsin digestion, as predicted by its non-collagenous sequence, and cannot be visualized as collagen. This unique OI collagen mutation is an excellent candidate for molecular targeting to {open_quotes}turn off{close_quotes} a dominant mutant allele.« less
An, Bo; Abbonante, Vittorio; Xu, Huifang; Gavriilidou, Despoina; Yoshizumi, Ayumi; Bihan, Dominique; Farndale, Richard W.; Kaplan, David L.; Balduini, Alessandra; Leitinger, Birgit; Brodsky, Barbara
2016-01-01
A bacterial collagen-like protein Scl2 has been developed as a recombinant collagen model system to host human collagen ligand-binding sequences, with the goal of generating biomaterials with selective collagen bioactivities. Defined binding sites in human collagen for integrins, fibronectin, heparin, and MMP-1 have been introduced into the triple-helical domain of the bacterial collagen and led to the expected biological activities. The modular insertion of activities is extended here to the discoidin domain receptors (DDRs), which are collagen-activated receptor tyrosine kinases. Insertion of the DDR-binding sequence from human collagen III into bacterial collagen led to specific receptor binding. However, even at the highest testable concentrations, the construct was unable to stimulate DDR autophosphorylation. The recombinant collagen expressed in Escherichia coli does not contain hydroxyproline (Hyp), and complementary synthetic peptide studies showed that replacement of Hyp by Pro at the critical Gly-Val-Met-Gly-Phe-Hyp position decreased the DDR-binding affinity and consequently required a higher concentration for the induction of receptor activation. The ability of the recombinant bacterial collagen to bind the DDRs without inducing kinase activation suggested it could interfere with the interactions between animal collagen and the DDRs, and such an inhibitory role was confirmed in vitro and with a cell migration assay. This study illustrates that recombinant collagen can complement synthetic peptides in investigating structure-activity relationships, and this system has the potential for the introduction or inhibition of specific biological activities. PMID:26702058
Collagen-Like Proteins in Pathogenic E. coli Strains
Ghosh, Neelanjana; McKillop, Thomas J.; Jowitt, Thomas A.; Howard, Marjorie; Davies, Heather; Holmes, David F.; Roberts, Ian S.; Bella, Jordi
2012-01-01
The genome sequences of enterohaemorrhagic E. coli O157:H7 strains show multiple open-reading frames with collagen-like sequences that are absent from the common laboratory strain K-12. These putative collagens are included in prophages embedded in O157:H7 genomes. These prophages carry numerous genes related to strain virulence and have been shown to be inducible and capable of disseminating virulence factors by horizontal gene transfer. We have cloned two collagen-like proteins from E. coli O157:H7 into a laboratory strain and analysed the structure and conformation of the recombinant proteins and several of their constituting domains by a variety of spectroscopic, biophysical, and electron microscopy techniques. We show that these molecules exhibit many of the characteristics of vertebrate collagens, including trimer formation and the presence of a collagen triple helical domain. They also contain a C-terminal trimerization domain, and a trimeric α-helical coiled-coil domain with an unusual amino acid sequence almost completely lacking leucine, valine or isoleucine residues. Intriguingly, these molecules show high thermal stability, with the collagen domain being more stable than those of vertebrate fibrillar collagens, which are much longer and post-translationally modified. Under the electron microscope, collagen-like proteins from E. coli O157:H7 show a dumbbell shape, with two globular domains joined by a hinged stalk. This morphology is consistent with their likely role as trimeric phage side-tail proteins that participate in the attachment of phage particles to E. coli target cells, either directly or through assembly with other phage tail proteins. Thus, collagen-like proteins in enterohaemorrhagic E. coli genomes may have a direct role in the dissemination of virulence-related genes through infection of harmless strains by induced bacteriophages. PMID:22701585
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jiang, Tao; Meyer, Travis A.; Modlin, Charles
In this paper, we describe the co-assembly of two different building units: collagen-mimetic peptides and DNA origami. Two peptides CP ++ and sCP ++ are designed with a sequence comprising a central block (Pro-Hyp-Gly) and two positively charged domains (Pro-Arg-Gly) at both N- and C-termini. Co-assembly of peptides and DNA origami two-layer (TL) nanosheets affords the formation of one-dimensional nanowires with repeating periodicity of similar to 10 nm. Structural analyses suggest a face-to-face stacking of DNA nanosheets with peptides aligned perpendicularly to the sheet surfaces. We demonstrate the potential of selective peptide-DNA association between face-to-face and edge-to-edge packing by tailoringmore » the size of DNA nanostructures. Finally, this study presents an attractive strategy to create hybrid biomolecular assemblies from peptide and DNA-based building blocks that takes advantage of the intrinsic chemical and physical properties of the respective components to encode structural and, potentially, functional complexity within readily accessible biomimetic materials.« less
Jiang, Tao; Meyer, Travis A.; Modlin, Charles; ...
2017-09-26
In this paper, we describe the co-assembly of two different building units: collagen-mimetic peptides and DNA origami. Two peptides CP ++ and sCP ++ are designed with a sequence comprising a central block (Pro-Hyp-Gly) and two positively charged domains (Pro-Arg-Gly) at both N- and C-termini. Co-assembly of peptides and DNA origami two-layer (TL) nanosheets affords the formation of one-dimensional nanowires with repeating periodicity of similar to 10 nm. Structural analyses suggest a face-to-face stacking of DNA nanosheets with peptides aligned perpendicularly to the sheet surfaces. We demonstrate the potential of selective peptide-DNA association between face-to-face and edge-to-edge packing by tailoringmore » the size of DNA nanostructures. Finally, this study presents an attractive strategy to create hybrid biomolecular assemblies from peptide and DNA-based building blocks that takes advantage of the intrinsic chemical and physical properties of the respective components to encode structural and, potentially, functional complexity within readily accessible biomimetic materials.« less
Yamauchi, Mitsuo; Noyes, Claudia; Kuboki, Yoshinori; Mechanic, Gerald L.
1982-01-01
A three-chained peptide from type I collagen, crosslinked by hydroxyaldolhistidine, has been isolated from a tryptic digest of 5 M guanidine·HCl-insoluble bovine skin collagen (a small but as yet unknown percentage of the total collagen in whole skin). OsO4/NaIO4 specifically cleaved the crosslink at its double bond into a two-chained crosslink peptide and a single peptide. The sequence of the two-chained peptide containing the bifunctional crosslink was determined after amino acid analysis of the separated peptides. The crosslink consists of an aldehyde derived from hydroxylysine-87 in the aldehyde-containing cyanogen bromide fragment α1CB5ald and an aldehyde derived from the lysine in the COOH-terminal nonhelical region of the α1CB6ald fragment. The α1CB6ald portion of the peptide exhibited structural microheterogeneity, containing the inverted sequence Ala-Lys-His instead of the normal sequence Lys-Ala-His. This indicates that another structural gene exists for α1(I) chain. The original three-chained peptide did not contain any glycosylated hydroxylysine or glycosylated hydroxyaldolhistidine. The lack of glycosylation of hydroxylysine-87 in α1CB5, which is usually glycosylated, allowed formation of the aldehyde, and this, coupled with the sequence inversion, may have allowed formation of the nonreducible crosslink hydroxyaldolhistidine. We suggest that the role of glycosylation, a posttranslational modification, of specific hydroxylysine residues is to prevent their oxidative deamination to aldehydes, thereby precluding formation of complex stable crosslinks. Complex crosslinks would decrease the rate of collagen turnover. The decrease, with time, would increase the population of stable crosslinked collagen molecules, which would eventually accumulate with age. PMID:6961443
Characterisation of a collagen gene subfamily from the potato cyst nematode Globodera pallida.
Gray, L J; Curtis, R H; Jones, J T
2001-01-24
We have isolated two full-length genomic DNA sequences, which encode the cuticle collagen proteins GP-COL-1 and GP-COL-2, from the potato cyst nematode Globodera pallida. A third, partial collagen gene ORF termed gp-col-t(t=truncated) has also been isolated and appears to represent an unexpressed pseudogene. The gp-col-1 and gp-col-2 genes both contain three short (<97 bp) introns which disrupt coding regions predicted to specify proteins with molecular weights of 33 and 32.7 kDa respectively. All three sequences show high similarity to each other and to the previously isolated G. pallida cDNA clone gp-col-8. The conserved pattern of cysteine residues and non-(Gly-X-Y)(n) region sequence similarity observed in all four G. pallida genes suggests that these molecules form part of the same subfamily of collagens. Southern analysis indicates that this subfamily is likely to contain further members. The G. pallida collagen sequences show striking similarity to twelve genes from Caenorhabditis elegans which collectively represent the recently classified Group 1a collagen subfamily. No data exists on the function of this subfamily in C. elegans. gp-col-1 and gp-col-2 are developmentally regulated with transcripts of both genes detected in adult virgin and gravid females but not in pre-parasitic second stage juveniles. A similar expression pattern is observed for the Group 1a collagen lemmi 5 from Meloidogyne incognita perhaps indicating a generic link between subfamily and function during the various changes in cuticular structure which accompany nematode growth and reproduction. Immunochemical studies indicate that the GP-COL-1 protein is specifically located in the hypodermis of G. pallida adult females.
Buckley, Mike
2016-03-24
Collagen is one of the most ubiquitous proteins in the animal kingdom and the dominant protein in extracellular tissues such as bone, skin and other connective tissues in which it acts primarily as a supporting scaffold. It has been widely investigated scientifically, not only as a biomedical material for regenerative medicine, but also for its role as a food source for both humans and livestock. Due to the long-term stability of collagen, as well as its abundance in bone, it has been proposed as a source of biomarkers for species identification not only for heat- and pressure-rendered animal feed but also in ancient archaeological and palaeontological specimens, typically carried out by peptide mass fingerprinting (PMF) as well as in-depth liquid chromatography (LC)-based tandem mass spectrometric methods. Through the analysis of the three most common domesticates species, cow, sheep, and pig, this research investigates the advantages of each approach over the other, investigating sites of sequence variation with known functional properties of the collagen molecule. Results indicate that the previously identified species biomarkers through PMF analysis are not among the most variable type 1 collagen peptides present in these tissues, the latter of which can be detected by LC-based methods. However, it is clear that the highly repetitive sequence motif of collagen throughout the molecule, combined with the variability of the sites and relative abundance levels of hydroxylation, can result in high scoring false positive peptide matches using these LC-based methods. Additionally, the greater alpha 2(I) chain sequence variation, in comparison to the alpha 1(I) chain, did not appear to be specific to any particular functional properties, implying that intra-chain functional constraints on sequence variation are not as great as inter-chain constraints. However, although some of the most variable peptides were only observed in LC-based methods, until the range of publicly available collagen sequences improves, the simplicity of the PMF approach and suitable range of peptide sequence variation observed makes it the ideal method for initial taxonomic identification prior to further analysis by LC-based methods only when required.
Recombinant Collagenlike Proteins
NASA Technical Reports Server (NTRS)
Fertala, Andzej
2007-01-01
A group of collagenlike recombinant proteins containing high densities of biologically active sites has been invented. The method used to express these proteins is similar to a method of expressing recombinant procollagens and collagens described in U. S. Patent 5,593,859, "Synthesis of human procollagens and collagens in recombinant DNA systems." Customized collagenous proteins are needed for biomedical applications. In particular, fibrillar collagens are attractive for production of matrices needed for tissue engineering and drug delivery. Prior to this invention, there was no way of producing customized collagenous proteins for these and other applications. Heretofore, collagenous proteins have been produced by use of such biological systems as yeasts, bacteria, and transgenic animals and plants. These products are normal collagens that can also be extracted from such sources as tendons, bones, and hides. These products cannot be made to consist only of biologically active, specific amino acid sequences that may be needed for specific applications. Prior to this invention, it had been established that fibrillar collagens consist of domains that are responsible for such processes as interaction with cells, binding of growth factors, and interaction with a number of structural proteins present in the extracellular matrix. A normal collagen consists of a sequence of domains that can be represented by a corresponding sequence of labels, e.g., D1D2D3D4. A collagenlike protein of the present invention contains regions of collagen II that contain multiples of a single domain (e.g., D1D1D1D1 or D4D4D4D4) chosen for its specific biological activity. By virtue of the multiplicity of the chosen domain, the density of sites having that specific biological activity is greater than it is in a normal collagen. A collagenlike protein according to this invention can thus be made to have properties that are necessary for tissue engineering.
Bornstein, P; McKay, J; Liska, D J; Apone, S; Devarayalu, S
1988-01-01
The first intron of the human collagen alpha 1(I) gene contains several positively and negatively acting elements. We have studied the transcription of collagen-human growth hormone fusion genes, containing deletions and rearrangements of collagen intronic sequences, by transient transfection of chick tendon fibroblasts and NIH 3T3 cells. In chick tendon fibroblasts, but not in 3T3 cells, inversion of intronic sequences containing a previously studied 274-base-pair segment, A274, resulted in markedly reduced human growth hormone mRNA levels as determined by an RNase protection assay. This inhibitory effect was largely alleviated when deletions were introduced in the collagen promoter of plasmids containing negatively oriented intronic sequences. Evidence for interaction of the promoter with the intronic segment, A274, was obtained by gel mobility shift assays. We suggest that promoter-intron interactions, mediated by DNA-binding proteins, regulate collagen gene transcription. Inversion of intronic segments containing critical interactive elements might then lead to an altered geometry and reduced activity of a transcriptional complex in those cells with sufficiently high levels of appropriate transcription factors. We further suggest that the deleted promoter segment plays a key role in directing DNA interactions involved in transcriptional control. Images PMID:3211130
Comment on "Protein sequences from mastodon and Tyrannosaurus rex revealed by mass spectrometry".
Buckley, Mike; Walker, Angela; Ho, Simon Y W; Yang, Yue; Smith, Colin; Ashton, Peter; Oates, Jane Thomas; Cappellini, Enrico; Koon, Hannah; Penkman, Kirsty; Elsworth, Ben; Ashford, Dave; Solazzo, Caroline; Andrews, Phillip; Strahler, John; Shapiro, Beth; Ostrom, Peggy; Gandhi, Hasand; Miller, Webb; Raney, Brian; Zylber, Maria Ines; Gilbert, M Thomas P; Prigodich, Richard V; Ryan, Michael; Rijsdijk, Kenneth F; Janoo, Anwar; Collins, Matthew J
2008-01-04
We used authentication tests developed for ancient DNA to evaluate claims by Asara et al. (Reports, 13 April 2007, p. 280) of collagen peptide sequences recovered from mastodon and Tyrannosaurus rex fossils. Although the mastodon samples pass these tests, absence of amino acid composition data, lack of evidence for peptide deamidation, and association of alpha1(I) collagen sequences with amphibians rather than birds suggest that T. rex does not.
Nian, Linge; Hu, Yue; Fu, Caihong; Song, Chen; Wang, Jie; Xiao, Jianxi
2018-01-01
The development of novel assays to detect collagen fragments is of utmost importance for diagnostic, prognostic and therapeutic decisions in various collagen-related diseases, and one essential question is to discover probe peptides that can specifically recognize target collagen sequences. Herein we have developed the fluorescence self-quenching assay as a convenient tool to screen the capability of a series of fluorescent probe peptides of variable lengths to bind with target collagen peptides. We have revealed that the targeting ability of probe peptides is length-dependent, and have discovered a relatively short probe peptide FAM-G(POG) 8 capable to identify the target peptide. We have further demonstrated that fluorescence self-quenching assay together with this short probe peptide can be applied to specifically detect the desired collagen fragment in complex biological media. Fluorescence self-quenching assay provides a powerful new tool to discover effective peptides for the recognition of collagen biomarkers, and it may have great potential to identify probe peptides for various protein biomarkers involved in pathological conditions. Copyright © 2017 Elsevier B.V. All rights reserved.
Characterization of the COL2A1 VNTR polymorphism
DOE Office of Scientific and Technical Information (OSTI.GOV)
Berg, E.S.; Olaisen, B.
1993-05-01
The variable number of tandem repeat (VNTR) region 3{prime} to the collagen type II gene (COL2A1) was amplified in vitro by the polymerase chain reaction. Subsequent high-resolution gel electrophoresis showed that the five earlier reported alleles could be further subtyped. A total of 17 allelic variants with a heterozygosity of 73.0% were found in 202 unrelated Norwegians. DNA sequencing of 19 COL2A1 alleles has been performed. The internal organization of the VNTR was common for all alleles, as previously shown for a few alleles. Moreover, the polymorphism in the COL2A1 locus is mainly due to variation in the numbers ofmore » copies of two repeat units, containing 34 and 31 bp, respectively, and/or to small deletions in either of the two units. DNA sequencing of alleles with the same electrophoretic size revealed no heterogeneity such as an alternating order of the different units, a feature that might have been expected to be the result of unequal crossing-over events. The observed ordered structure of the VNTR and the possibility of single-stranded DNA from the cores in the VNTR forming hairpins and loops suggest that the COL2A1 polymorphism may have evolved mainly by replication slippage mechanisms. 23 refs., 2 figs., 3 tabs.« less
Buckley, Michael; Warwood, Stacey; van Dongen, Bart; Kitchener, Andrew C; Manning, Phillip L
2017-05-31
A decade ago, reports that organic-rich soft tissue survived from dinosaur fossils were apparently supported by proteomics-derived sequence information of exceptionally well-preserved bone. This initial claim to the sequencing of endogenous collagen peptides from an approximately 68 Myr Tyrannosaurus rex fossil was highly controversial, largely on the grounds of potential contamination from either bacterial biofilms or from laboratory practice. In a subsequent study, collagen peptide sequences from an approximately 78 Myr Brachylophosaurus canadensis fossil were reported that have remained largely unchallenged. However, the endogeneity of these sequences relies heavily on a single peptide sequence, apparently unique to both dinosaurs. Given the potential for cross-contamination from modern bone analysed by the same team, here we extract collagen from bone samples of three individuals of ostrich, Struthio camelus The resulting LC-MS/MS data were found to match all of the proposed sequences for both the original Tyrannosaurus and Brachylophosaurus studies. Regardless of the true nature of the dinosaur peptides, our finding highlights the difficulty of differentiating such sequences with confidence. Our results not only imply that cross-contamination cannot be ruled out, but that appropriate measures to test for endogeneity should be further evaluated. © 2017 The Authors.
Warwood, Stacey; van Dongen, Bart; Kitchener, Andrew C.; Manning, Phillip L.
2017-01-01
A decade ago, reports that organic-rich soft tissue survived from dinosaur fossils were apparently supported by proteomics-derived sequence information of exceptionally well-preserved bone. This initial claim to the sequencing of endogenous collagen peptides from an approximately 68 Myr Tyrannosaurus rex fossil was highly controversial, largely on the grounds of potential contamination from either bacterial biofilms or from laboratory practice. In a subsequent study, collagen peptide sequences from an approximately 78 Myr Brachylophosaurus canadensis fossil were reported that have remained largely unchallenged. However, the endogeneity of these sequences relies heavily on a single peptide sequence, apparently unique to both dinosaurs. Given the potential for cross-contamination from modern bone analysed by the same team, here we extract collagen from bone samples of three individuals of ostrich, Struthio camelus. The resulting LC–MS/MS data were found to match all of the proposed sequences for both the original Tyrannosaurus and Brachylophosaurus studies. Regardless of the true nature of the dinosaur peptides, our finding highlights the difficulty of differentiating such sequences with confidence. Our results not only imply that cross-contamination cannot be ruled out, but that appropriate measures to test for endogeneity should be further evaluated. PMID:28566488
Aouacheria, Abdel; Geourjon, Christophe; Aghajari, Nushin; Navratil, Vincent; Deléage, Gilbert; Lethias, Claire; Exposito, Jean-Yves
2006-12-01
Collagens are thought to represent one of the most important molecular innovations in the metazoan line. Basement membrane type IV collagen is present in all Eumetazoa and was found in Homoscleromorpha, a sponge group with a well-organized epithelium, which may represent the first stage of tissue differentiation during animal evolution. In contrast, spongin seems to be a demosponge-specific collagenous protein, which can totally substitute an inorganic skeleton, such as in the well-known bath sponge. In the freshwater sponge Ephydatia mülleri, we previously characterized a family of short-chain collagens that are likely to be main components of spongins. Using a combination of sequence- and structure-based methods, we present evidence of remote homology between the carboxyl-terminal noncollagenous NC1 domain of spongin short-chain collagens and type IV collagen. Unexpectedly, spongin short-chain collagen-related proteins were retrieved in nonsponge animals, suggesting that a family related to spongin constitutes an evolutionary sister to the type IV collagen family. Formation of the ancestral NC1 domain and divergence of the spongin short-chain collagen-related and type IV collagen families may have occurred before the parazoan-eumetazoan split, the earliest divergence among extant animal phyla. Molecular phylogenetics based on NC1 domain sequences suggest distinct evolutionary histories for spongin short-chain collagen-related and type IV collagen families that include spongin short-chain collagen-related gene loss in the ancestors of Ecdyzosoa and of vertebrates. The fact that a majority of invertebrates encodes spongin short-chain collagen-related proteins raises the important question to the possible function of its members. Considering the importance of collagens for animal structure and substratum attachment, both families may have played crucial roles in animal diversification.
Kovacs, A; Kandala, J C; Weber, K T; Guntaka, R V
1996-01-19
Type I and III fibrillar collagens are the major structural proteins of the extracellular matrix found in various organs including the myocardium. Abnormal and progressive accumulation of fibrillar type I collagen in the interstitial spaces compromises organ function and therefore, the study of transcriptional regulation of this gene and specific targeting of its expression is of major interest. Transient transfection of adult cardiac fibroblasts indicate that the polypurine-polypyrimidine sequence of alpha 1(I) collagen promoter between nucleotides - 200 and -140 represents an overall positive regulatory element. DNase I footprinting and electrophoretic mobility shift assays suggest that multiple factors bind to different elements of this promoter region. We further demonstrate that the unique polypyrimidine sequence between -172 and -138 of the promoter represents a suitable target for a single-stranded polypurine oligonucleotide (TFO) to form a triple helix DNA structure. Modified electrophoretic mobility shift assays show that this TFO specifically inhibits the protein-DNA interaction within the target region. In vitro transcription assays and transient transfection experiments demonstrate that the transcriptional activity of the promoter is inhibited by this oligonucleotide. We propose that TFOs represent a therapeutic potential to specifically influence the expression of alpha 1(I) collagen gene in various disease states where abnormal type I collagen accumulation is known to occur.
Modelling the mechanics of partially mineralized collagen fibrils, fibres and tissue
Liu, Yanxin; Thomopoulos, Stavros; Chen, Changqing; Birman, Victor; Buehler, Markus J.; Genin, Guy M.
2014-01-01
Progressive stiffening of collagen tissue by bioapatite mineral is important physiologically, but the details of this stiffening are uncertain. Unresolved questions about the details of the accommodation of bioapatite within and upon collagen's hierarchical structure have posed a central hurdle, but recent microscopy data resolve several major questions. These data suggest how collagen accommodates bioapatite at the lowest relevant hierarchical level (collagen fibrils), and suggest several possibilities for the progressive accommodation of bioapatite at higher hierarchical length scales (fibres and tissue). We developed approximations for the stiffening of collagen across spatial hierarchies based upon these data, and connected models across hierarchies levels to estimate mineralization-dependent tissue-level mechanics. In the five possible sequences of mineralization studied, percolation of the bioapatite phase proved to be an important determinant of the degree of stiffening by bioapatite. The models were applied to study one important instance of partially mineralized tissue, which occurs at the attachment of tendon to bone. All sequences of mineralization considered reproduced experimental observations of a region of tissue between tendon and bone that is more compliant than either tendon or bone, but the size and nature of this region depended strongly upon the sequence of mineralization. These models and observations have implications for engineered tissue scaffolds at the attachment of tendon to bone, bone development and graded biomimetic attachment of dissimilar hierarchical materials in general. PMID:24352669
Hong, Hui; Chaplot, Shreyak; Chalamaiah, Meram; Roy, Bimol C; Bruce, Heather L; Wu, Jianping
2017-08-30
The low-molecular-weight (LMW) peptides derived from collagen have shown a potential for various nutritional and pharmaceutical applications. However, production of LMW peptides from vertebrate collagen remains a challenge. Herein, we report a new method to produce LMW collagen peptides using pepsin pretreatment that removed cross-linked telopeptides in collagen molecules. After the pretreatment, the proportion of LMW collagen peptides (<1.4 kDa) that were obtained from pepsin-soluble collagen increased to 32.59% compared to heat-soluble collagen peptides (16.10%). Fourier transform infrared spectroscopy results indicated that telopeptide cleavage retained the triple-helical conformation of collagen. Liquid chromatography-tandem mass spectrometry analysis suggested that Gly-X-Y (X is often proline, while Y is either hydroxyproline or hydroxylysine) repeats were not the main factors that hindered the enzymatic hydrolysis of collagen molecules. However, cross-link quantification demonstrated that trivalent cross-links that included pyridinolines and pyrroles were the primary obstacles to producing small peptides from collagen of spent hens. This study demonstrated for the first time that removing cross-linked telopeptides could enhance the production of LMW peptides from spent hen collagen, which is also of interest to manufacturers who produce LMW collagen peptides from other vertebrate animals, such as bovids and porcids.
Pallela, Ramjee; Bojja, Sreedhar; Janapala, Venkateswara Rao
2011-07-01
Collagens were isolated and partially characterized from the marine demosponge, Ircinia fusca from Gulf of Mannar (GoM), India, with an aim to develop potentially applicable collagens from unused and under-used resources. The yield of insoluble, salt soluble and acid soluble forms of collagens was 31.71 ± 1.59, 20.69 ± 1.03, and 17.38 ± 0.87 mg/g dry weight, respectively. Trichrome staining, Scanning & Transmission Electron microscopic (SEM & TEM) studies confirmed the presence of collagen in the isolated, terminally globular irciniid filaments. The partially purified (gel filtration chromatography), non-fibrillar collagens appeared as basement type collagenous sheets under light microscopy whereas the purified fibrillar collagens appeared as fibrils with a repeated band periodicity of 67 nm under Atomic Force Microscope (AFM). The non-fibrillar and fibrillar collagens were seen to have affinity for anti-collagen type IV and type I antibodies raised against human collagens, respectively. The macromolecules, i.e., total protein, carbohydrate and lipid contents within the tissues were also quantified. The present information on the three characteristic irciniid collagens (filamentous, fibrillar and non-fibrillar) could assist the future attempts to unravel the therapeutically important, safer collagens from marine sponges for their use in pharmaceutical and cosmeceutical industries. Copyright © 2011 Elsevier B.V. All rights reserved.
Mackey, Abigail L.; Brandstetter, Simon; Schjerling, Peter; Bojsen-Moller, Jens; Qvortrup, Klaus; Pedersen, Mette M.; Doessing, Simon; Kjaer, Michael; Magnusson, S. Peter; Langberg, Henning
2011-01-01
The purpose of this study was to test the hypothesis that remodeling of skeletal muscle extracellular matrix (ECM) is involved in protecting human muscle against injury. Biopsies were obtained from medial gastrocnemius muscles after a single bout of electrical stimulation (B) or a repeated bout (RB) 30 d later, or 30 d after a single stimulation bout (RBc). A muscle biopsy was collected from the control leg for comparison with the stimulated leg. Satellite cell content, tenascin C, and muscle regeneration were assessed by immunohistochemistry; real-time PCR was used to measure mRNA levels of collagens, laminins, heat-shock proteins (HSPs), inflammation, and related growth factors. The large responses of HSPs, CCL2, and tenascin C detected 48 h after a single bout were attenuated in the RB trial, indicative of protection against injury. Satellite cell content and 12 target genes, including IGF-1, were elevated 30 d after a single bout. Among those displaying the greatest difference vs. control muscle, ECM laminin-β1 and collagen types I and III were elevated ∼6- to 9-fold (P<0.001). The findings indicate that the sequenced events of load-induced early deadhesion and later strengthening of skeletal muscle ECM play a role in protecting human muscle against future injury.—Mackey, A. L., Brandstetter, S., Schjerling, P., Bojsen-Moller, J., Qvortrup, K., Pedersen, M. M., Doessing, S. Kjaer, M., Magnusson, S. P., Langberg, H. Sequenced response of extracellular matrix deadhesion and fibrotic regulators after muscle damage is involved in protection against future injury in human skeletal muscle. PMID:21368102
A micro-mechanical model to determine changes of collagen fibrils under cyclic loading
NASA Astrophysics Data System (ADS)
Chen, Michelle L.; Susilo, Monica E.; Ruberti, Jeffrey A.; Nguyen, Thao D.
Dynamic mechanical loading induces growth and remodeling in biological tissues. It can alter the degradation rate and intrinsic mechanical properties of collagen through cellular activity. Experiments showed that repeated cyclic loading of a dense collagen fibril substrate increased collagen stiffness and strength, lengthened the substrate, but did not significantly change the fibril areal fraction or fibril anisotropy (Susilo, et al. ``Collagen Network Hardening Following Cyclic Tensile Loading'', Interface Focus, submitted). We developed a model for the collagen fibril substrate (Tonge, et al. ``A micromechanical modeling study of the mechanical stabilization of enzymatic degradation of collagen tissues'', Biophys J, in press.) to probe whether changes in the fibril morphology and mechanical properties can explain the tissue-level properties observed during cyclic loading. The fibrils were modeled as a continuous distribution of wavy elastica, based on experimental measurements of fibril density and collagen anisotropy, and can experience damage after a critical stress threshold. Other mechanical properties in the model were fit to the stress response measured before and after the extended cyclic loading to determine changes in the strength and stiffness of collagen fibrils.
Type I Collagen and Collagen Mimetics as Angiogenesis Promoting Superpolymers
DOE Office of Scientific and Technical Information (OSTI.GOV)
Twardowski, T.; Fertala, A.; Orgel, J.P.R.O.
Angiogenesis, the development of blood vessels from the pre-existing vasculature, is a key component of embryogenesis and tissue regeneration. Angiogenesis also drives pathologies such as tumor growth and metastasis, and hemangioma development in newborns. On the other hand, promotion of angiogenesis is needed in tissues with vascular insufficiencies, and in bioengineering, to endow tissue substitutes with appropriate microvasculatures. Therefore, much research has focused on defining mechanisms of angiogenesis, and identifying pro- and anti-angiogenic molecules. Type I collagen, the most abundant protein in humans, potently stimulates angiogenesis in vitro and in vivo. Crucial to its angiogenic activity appears to be ligationmore » and possibly clustering of endothelial cell (EC) surface {alpha}1{beta}1/{alpha}2{beta}1 integrin receptors by the GFPGER502-507 sequence of the collagen fibril. However, additional aspects of collagen structure and function that may modulate its angiogenic properties are discussed. Moreover, type I collagen and fibrin, another angiogenic polymer, share several structural features. These observations suggest strategies for creating 'angiogenic superpolymers', including: modifying type I collagen to influence its biological half-life, immunogenicity, and integrin binding capacity; genetically engineering fibrillar collagens to include additional integrin binding sites or angiogenic determinants, and remove unnecessary or deleterious sequences without compromising fibril integrity; and exploring the suitability of poly(ortho ester), PEG-lysine copolymer, tubulin, and cholesteric cuticle as collagen mimetics, and suggesting means of modifying them to display ideal angiogenic properties. The collagenous and collagen mimetic angiogenic superpolymers described here may someday prove useful for many applications in tissue engineering and human medicine.« less
Weighing the mass spectrometric evidence for authentic Tyrannosaurus rex collagen
Buckley, Mike; Walker, Angela; Ho, Simon Y. W.; Yang, Yue; Smith, Colin; Ashton, Peter; Oates, Jane Thomas; Cappellini, Enrico; Koon, Hannah; Penkman, Kirsty; Elsworth, Ben; Ashford, Dave; Solazzo, Caroline; Andrews, Phil; Strahler, John; Shapiro, Beth; Ostrom, Peggy; Gandhi, Hasand; Miller, Webb; Raney, Brian; Zylber, Maria Ines; Gilbert, M. Thomas P.; Prigodich, Richard V.; Ryan, Michael; Rijsdijk, Kenneth F.; Janoo, Anwar; Collins, Matthew J.
2009-01-01
We use authentication tests developed for ancient DNA to evaluate claims by Asara et al. of collagen peptide sequences recovered from mastodon and Tyrannosaurus rex fossils. Although the mastodon passes, absence of amino acid composition data, lack of evidence for peptide deamidation, and association of the α1(I) peptide sequences with amphibians not birds, suggests that T. rex does not. PMID:18174420
Stefanovic, Branko
2013-01-01
Type I collagen is the most abundant protein in human body. The protein turns over slowly and its replacement synthesis is low. However, in wound healing or in pathological fibrosis the cells can increase production of type I collagen several hundred fold. This increase is predominantly due to posttranscriptional regulation, including increased half-life of collagen messenger RNAs (mRNAs) and their increased translatability. Type I collagen is composed of two α1 and one α2 polypeptides that fold into a triple helix. This stoichiometry is strictly regulated to prevent detrimental synthesis of α1 homotrimers. Collagen polypeptides are co-translationally modified and the rate of modifications is in dynamic equilibrium with the rate of folding, suggesting coordinated translation of collagen α1(I) and α2(I) polypeptides. Collagen α1(I) mRNA has in the 3' untranslated region (UTR) a C-rich sequence that binds protein αCP, this binding stabilizes the mRNA in collagen producing cells. In the 5' UTR both collagen mRNAs have a conserved stem-loop (5' SL) structure. The 5' SL is critical for high collagen expression, knock in mice with disruption of the 5' SL are resistant to liver fibrosis. the 5' SL binds protein LARP6 with strict sequence specificity and high affinity. LARP6 recruits RNA helicase A to facilitate translation initiation and associates collagen mRNAs with vimentin and nonmuscle myosin filaments. Binding to vimentin stabilizes collagen mRNAs, while nonmuscle myosin regulates coordinated translation of α1(I) and α2(I) mRNAs. When nonmuscle myosin filaments are disrupted the cells secrete only α1 homotrimers. Thus, the mechanism governing high collagen expression involves two RNA binding proteins and development of cytoskeletal filaments. Copyright © 2013 John Wiley & Sons, Ltd.
Self-healing Characteristics of Collagen Coatings with Respect to Surface Abrasion
Kim, Chang-Lae; Kim, Dae-Eun
2016-01-01
A coating based on collagen with self-healing properties was developed for applications in mechanical components that are prone to abrasion due to contact with a counter surface. The inherent swelling behavior of collagen in water was exploited as the fundamental mechanism behind self-healing of a wear scar formed on the surface. The effects of freeze-drying process and water treatment of the collagen coatings on their mechanical and self-healing properties were analyzed. Water was also used as the medium to trigger the self-healing effect of the collagen coatings after the wear test. It was found that collagen coatings without freeze-drying did not demonstrate any self-healing effect whereas the coatings treated by freeze-drying process showed remarkable self-healing effect. Overall, collagen coatings that were freeze-dried and water treated showed the best friction and self-healing properties. Repeated self-healing ability of these coatings with respect to wear scar was also demonstrated. It was also confirmed that the self-healing property of the collagen coating was effective over a relatively wide range of temperature. PMID:27010967
Self-healing Characteristics of Collagen Coatings with Respect to Surface Abrasion
NASA Astrophysics Data System (ADS)
Kim, Chang-Lae; Kim, Dae-Eun
2016-03-01
A coating based on collagen with self-healing properties was developed for applications in mechanical components that are prone to abrasion due to contact with a counter surface. The inherent swelling behavior of collagen in water was exploited as the fundamental mechanism behind self-healing of a wear scar formed on the surface. The effects of freeze-drying process and water treatment of the collagen coatings on their mechanical and self-healing properties were analyzed. Water was also used as the medium to trigger the self-healing effect of the collagen coatings after the wear test. It was found that collagen coatings without freeze-drying did not demonstrate any self-healing effect whereas the coatings treated by freeze-drying process showed remarkable self-healing effect. Overall, collagen coatings that were freeze-dried and water treated showed the best friction and self-healing properties. Repeated self-healing ability of these coatings with respect to wear scar was also demonstrated. It was also confirmed that the self-healing property of the collagen coating was effective over a relatively wide range of temperature.
Regenerative capacity of mdx mouse muscles after repeated applications of myo-necrotic bupivacaine.
Itagaki, Y; Saida, K; Iwamura, K
1995-01-01
We injected bupivacaine (BPVC), which produces muscle fiber necrosis, repeatedly into the soleus muscles of mdx mice, which represent a model of human Duchenne muscular dystrophy, over a 12-month period. Cytological and morphometric analysis revealed that the regenerative capacity of repeatedly BPVC-injected mdx muscles was almost equal to that of the saline-injected mdx muscles. At 9 months of age the endomysial collagen content of mdx muscles was 4.6 times that of control mice muscles, and was 7.2 times that of control mice muscle at 12 months. These results suggest that the regenerative capacity of the mdx muscle is quite large and that myo-necrosis induced by an extrinsic cause, such as BPVC, may not be an important factor in the disease progress. However, endomysial collagen, for which the mechanism of increase may be related to the defect of dystrophin, may play an important role in gradual decline of regeneration.
Genetics Home Reference: dermatofibrosarcoma protuberans
... part of a large molecule called type I collagen, which strengthens and supports many tissues in the ... the chimeric sequence formed by the fusion of collagen gene COL1A1 and the platelet derived growth factor ...
Analysis of sequence repeats of proteins in the PDB.
Mary Rajathei, David; Selvaraj, Samuel
2013-12-01
Internal repeats in protein sequences play a significant role in the evolution of protein structure and function. Applications of different bioinformatics tools help in the identification and characterization of these repeats. In the present study, we analyzed sequence repeats in a non-redundant set of proteins available in the Protein Data Bank (PDB). We used RADAR for detecting internal repeats in a protein, PDBeFOLD for assessing structural similarity, PDBsum for finding functional involvement and Pfam for domain assignment of the repeats in a protein. Through the analysis of sequence repeats, we found that identity of the sequence repeats falls in the range of 20-40% and, the superimposed structures of the most of the sequence repeats maintain similar overall folding. Analysis sequence repeats at the functional level reveals that most of the sequence repeats are involved in the function of the protein through functionally involved residues in the repeat regions. We also found that sequence repeats in single and two domain proteins often contained conserved sequence motifs for the function of the domain. Copyright © 2013 Elsevier Ltd. All rights reserved.
Repeated folding stress-induced morphological changes in the dermal equivalent.
Arai, Koji Y; Sugimoto, Mami; Ito, Kanako; Ogura, Yuki; Akutsu, Nobuko; Amano, Satoshi; Adachi, Eijiro; Nishiyama, Toshio
2014-11-01
Repeated mechanical stresses applied to the same region of the skin are thought to induce morphological changes known as wrinkle. However, the underlying mechanisms are not fully understood. To study the mechanisms, we examined effects of repeated mechanical stress on the dermal equivalent. We developed a novel device to apply repeated folding stress to the dermal equivalent. After applying the mechanical stress, morphological changes of the dermal equivalent and expression of several genes related to extracellular matrix turn over and cell contraction were examined. The repeated folding stress induced a noticeable decrease in the width of the dermal equivalent. The mechanical stress altered orientations of collagen fibrils. Hydroxyproline contents, dry weights and cell viability of the dermal equivalents were not affected by the mechanical stress. On the other hand, Rho-associated coiled-coil-containing kinase (ROCK) specific inhibitor Y27632 completely suppressed the decrease in the width of the dermal equivalent. The present results revealed that either degradation of collagen or changes in the number of cells were not responsible for the decrease in the width of the dermal equivalent and indicate that the repeated mechanical stress induces unidirectional contraction in the dermal equivalent through the RhoA-ROCK signaling pathway. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Berbís, M. Álvaro; André, Sabine; Cañada, F. Javier
2014-01-03
Highlights: •Galectin-3 is composed of a carbohydrate recognition domain and an N-terminal tail. •Synthetic peptides derived from the tail are shown to interact with the CRD. •This interaction is modulated by Ser- and Tyr-phosphorylation of the peptides. -- Abstract: Galectin-3 (Gal-3) is a multi-functional effector protein that functions in the cytoplasm and the nucleus, as well as extracellularly following non-classical secretion. Structurally, Gal-3 is unique among galectins with its carbohydrate recognition domain (CRD) attached to a rather long N-terminal tail composed mostly of collagen-like repeats (nine in the human protein) and terminating in a short non-collagenous terminal peptide sequence uniquemore » in this lectin family and not yet fully explored. Although several Ser and Tyr sites within the N-terminal tail can be phosphorylated, the physiological significance of this post-translational modification remains unclear. Here, we used a series of synthetic (phospho)peptides derived from the tail to assess phosphorylation-mediated interactions with {sup 15}N-labeled Gal-3 CRD. HSQC-derived chemical shift perturbations revealed selective interactions at the backface of the CRD that were attenuated by phosphorylation of Tyr 107 and Tyr 118, while phosphorylation of Ser 6 and Ser 12 was essential. Controls with sequence scrambling underscored inherent specificity. Our studies shed light on how phosphorylation of the N-terminal tail may impact on Gal-3 function and prompt further studies using phosphorylated full-length protein.« less
Hyaluronic acid injections protect patellar tendon from detraining-associated damage.
Frizziero, Antonio; Salamanna, Francesca; Giavaresi, Gianluca; Ferrari, Andrea; Martini, Lucia; Marini, Marina; Veicsteinas, Arsenio; Maffulli, Nicola; Masiero, Stefano; Fini, Milena
2015-09-01
Having previously demonstrated that detraining affects patellar tendon (PT) proteoglycan content and collagen fiber organization, we undertook the present study with two aims: to improve knowledge on the adaptation of PT and its enthesis to detraining from a histological and histomorphometric point of view, and to investigate the hypothesis that repeated peri-patellar injections of hyaluronic acid (HA) on detrained PT may reduce and limit detrained associated-damage. Twenty-four male Sprague-Dawley rats were divided into 3 groups: Untrained (n=6), Trained (n=6) (10 wks-treadmill) and Detrained (n=12). In the detrained rats, the left tendon was untreated while the right tendon received repeated peri-patellar injections of either HA or saline (NaCl). Structure and morphology of PTs (modified Movin score, tear density, collagen type I and III) and enthesis (cell morphology, chondrocyte cluster formation, tidemark integrity, matrix staining and vascularization) were evaluated. The left PT and enthesis of the Detrained groups showed altered structure and morphology with the highest Movin score values, the highest percentage of collagen III and the lowest of collagen I; the lowest score values were observed in the Trained and Detrained-HA groups. Detrained-NaCl PTs showed the highest collagen III and the lowest collagen I values with respect to Detrained-HA PTs. This study strengthens previously published data showing the alteration in tendon and enthesis morphology due to discontinuation of training, and provides new data showing that treatment with HA is effective in the maintenance of the structural properties of PT and enthesis in Detrained rats. Such beneficial effects could play a significant role in the management of conservative and rehabilitation strategies in athletes that change type, intensity and duration of training.
Dinosaur peptides suggest mechanisms of protein survival.
San Antonio, James D; Schweitzer, Mary H; Jensen, Shane T; Kalluri, Raghu; Buckley, Michael; Orgel, Joseph P R O
2011-01-01
Eleven collagen peptide sequences recovered from chemical extracts of dinosaur bones were mapped onto molecular models of the vertebrate collagen fibril derived from extant taxa. The dinosaur peptides localized to fibril regions protected by the close packing of collagen molecules, and contained few acidic amino acids. Four peptides mapped to collagen regions crucial for cell-collagen interactions and tissue development. Dinosaur peptides were not represented in more exposed parts of the collagen fibril or regions mediating intermolecular cross-linking. Thus functionally significant regions of collagen fibrils that are physically shielded within the fibril may be preferentially preserved in fossils. These results show empirically that structure-function relationships at the molecular level could contribute to selective preservation in fossilized vertebrate remains across geological time, suggest a 'preservation motif', and bolster current concepts linking collagen structure to biological function. This non-random distribution supports the hypothesis that the peptides are produced by the extinct organisms and suggests a chemical mechanism for survival.
Xu, Qiangjian; Li, Quanli; Chen, Jialong; Zhang, Weibo; Wu, Xiaoting; Cao, Ying
2015-03-01
To investigate the inhibition effect of dopamine on the activity of matrix metalloproteinases (MMP) and the effect of dopamine on degradation of dentin collagen for its potential use in caries treatment and dentin adhesive. In the experiment of MMP activity test, 2.0 g/L dopamine + 1.0 g/L highly purified collagenase type VIII from Clostridium histolyticum served as the experimental group, and deionized water + 1.0 g/L highly purified collagenase type VIII from Clostridium histolyticum served as the negative control group, and 2% chlorhexidine + 1.0 g/L highly purified collagenase type VIII from Clostridium histolyticum served as the positive control group, and the mixture volume ratio of the two ingredients in every group was 1:9. After 15 minutes, the enzyme activity of each sample was tested by MMP activity colerimetric quantitative detection kits, and the test was repeated 5 times in each group. In the experiment of collagen degradation, the dentin slices were demineralized with 37% phosphoric acid for 1 min. In sequence, 2 dentin slices were used to observe the morphology, and the remaining 30 dentine slices were randomly divided into three groups (n = 10) according to random number table: the negative control ones were stored in 100 µl deionized water and 900 µl collagenase (7 days, 37 °C), the positive control ones were stored in 100 µl chlorhexidine and 900 µl collagenase (7 days, 37 °C) and the experimental specimens were stored in 100 µl dopamine and 900 µl collagenase (7 days, 37 °C). The degraded collagen was investigated by assaying hydroxyproline. The framework of collagen was evaluated with field emission scanning electron microscope (FE-SEM). The statistical results of completely random design ANOVA showed that the MMP activity and the amount of degraded collagen of the negative control group [(0.089 ± 0.011) µmol · min⁻¹ · mg⁻¹ and (2 837 ± 201) µg/cm²] were significantly higher than those of the positive control group [(0.038 ± 0.006) µmol · min⁻¹ · mg⁻¹ and (1 288 ± 172) µg/cm²] and the experimental group [(0.030 ± 0.009) µmol · min⁻¹ · mg⁻¹ and (1 389 ± 255) µg/cm²] (P < 0.05). SEM observation indicated that the structural integrity of the collagen network on dentin still existed in experiment samples and positive control groups, however, collagen fibrils were destructed and the structural integrity disappeared in the negative control groups. Dopamine may inhibit MMP activity and reduce the amount of degraded collagen.
Three-dimensional bioprinting of rat embryonic neural cells.
Lee, Wonhye; Pinckney, Jason; Lee, Vivian; Lee, Jong-Hwan; Fischer, Krisztina; Polio, Samuel; Park, Je-Kyun; Yoo, Seung-Schik
2009-05-27
We present a direct cell printing technique to pattern neural cells in a three-dimensional (3D) multilayered collagen gel. A layer of collagen precursor was printed to provide a scaffold for the cells, and the rat embryonic neurons and astrocytes were subsequently printed on the layer. A solution of sodium bicarbonate was applied to the cell containing collagen layer as nebulized aerosols, which allowed the gelation of the collagen. This process was repeated layer-by-layer to construct the 3D cell-hydrogel composites. Upon characterizing the relationship between printing resolutions and the growth of printed neural cells, single/multiple layers of neural cell-hydrogel composites were constructed and cultured. The on-demand capability to print neural cells in a multilayered hydrogel scaffold offers flexibility in generating artificial 3D neural tissue composites.
Designed to Fail: A Novel Mode of Collagen Fibril Disruption and Its Relevance to Tissue Toughness
Veres, Samuel P.; Lee, J. Michael
2012-01-01
Collagen fibrils are nanostructured biological cables essential to the structural integrity of many of our tissues. Consequently, understanding the structural basis of their robust mechanical properties is of great interest. Here we present what to our knowledge is a novel mode of collagen fibril disruption that provides new insights into both the structure and mechanics of native collagen fibrils. Using enzyme probes for denatured collagen and scanning electron microscopy, we show that mechanically overloading collagen fibrils from bovine tail tendons causes them to undergo a sequential, two-stage, selective molecular failure process. Denatured collagen molecules—meaning molecules with a reduced degree of time-averaged helicity compared to those packed in undamaged fibrils—were first created within kinks that developed at discrete, repeating locations along the length of fibrils. There, collagen denaturation within the kinks was concentrated within certain subfibrils. Additional denatured molecules were then created along the surface of some disrupted fibrils. The heterogeneity of the disruption within fibrils suggests that either mechanical load is not carried equally by a fibril's subcomponents or that the subcomponents do not possess homogenous mechanical properties. Meanwhile, the creation of denatured collagen molecules, which necessarily involves the energy intensive breaking of intramolecular hydrogen bonds, provides a physical basis for the toughness of collagen fibrils. PMID:22735538
Sequence repeats and protein structure
NASA Astrophysics Data System (ADS)
Hoang, Trinh X.; Trovato, Antonio; Seno, Flavio; Banavar, Jayanth R.; Maritan, Amos
2012-11-01
Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.
Stoichevska, Violet; Peng, Yong Y; Vashi, Aditya V; Werkmeister, Jerome A; Dumsday, Geoff J; Ramshaw, John A M
2017-03-01
Recombinant bacterial collagens provide a new opportunity for safe biomedical materials. They are readily expressed in Escherichia coli in good yield and can be readily purified by simple approaches. However, recombinant proteins are limited in that direct secondary modification during expression is generally not easily achieved. Thus, inclusion of unusual amino acids, cyclic peptides, sugars, lipids, and other complex functions generally needs to be achieved chemically after synthesis and extraction. In the present study, we have illustrated that bacterial collagens that have had their sequences modified to include cysteine residue(s), which are not normally present in bacterial collagen-like sequences, enable a range of specific chemical modification reactions to be produced. Various model reactions were shown to be effective for modifying the collagens. The ability to include alkyne (or azide) functions allows the extensive range of substitutions that are available via "click" chemistry to be accessed. When bifunctional reagents were used, some crosslinking occurred to give higher molecular weight polymeric proteins, but gels were not formed. © 2016 Wiley Periodicals, Inc. J Biomed Mater Res Part A: 105A: 806-813, 2017. © 2016 Wiley Periodicals, Inc.
Sicot, F X; Mesnage, M; Masselot, M; Exposito, J Y; Garrone, R; Deutsch, J; Gaill, F
2000-09-29
The annelid Alvinella pompejana is probably the most heat-tolerant metazoan organism known. Previous results have shown that the level of thermal stability of its interstitial collagen is significantly greater than that of coastal annelids and of vent organisms, such as the vestimentiferan Riftia pachyptila, living in colder parts of the deep-sea hydrothermal environment. In order to investigate the molecular basis of this thermal behavior, we cloned and sequenced a large cDNA molecule coding the fibrillar collagen of Alvinella, including one half of the helical domain and the entire C-propeptide domain. For comparison, we also cloned the 3' part of the homologous cDNA from Riftia. Comparison of the corresponding helical domains of these two species, together with that of the previously sequenced domain of the coastal lugworm Arenicola marina, showed that the increase in proline content and in the number of stabilizing triplets correlate with the outstanding thermostability of the interstitial collagen of A. pompejana. Phylogenetic analysis showed that triple helical and the C-propeptide parts of the same collagen molecule evolve at different rates, in favor of an adaptive mechanism at the molecular level. Copyright 2000 Academic Press.
2014-02-28
sockets.6 The commercially available INFUSE system (Medtronic Spinal and Biologics, Memphis, TN), compris- ing an absorbable collagen sponge plus...a collagen sponge carrier) by Medtronics27 for bone healing in rabbits. Even the 25mg rhBMP-2 dose used showed significantly greater re- generated...visualization No 3D morphological analysis for small-animal modelsCan be repeated over course of healing for temporal trends Potential risk of X-ray
Dinosaur Peptides Suggest Mechanisms of Protein Survival
San Antonio, James D.; Schweitzer, Mary H.; Jensen, Shane T.; Kalluri, Raghu; Buckley, Michael; Orgel, Joseph P. R. O.
2011-01-01
Eleven collagen peptide sequences recovered from chemical extracts of dinosaur bones were mapped onto molecular models of the vertebrate collagen fibril derived from extant taxa. The dinosaur peptides localized to fibril regions protected by the close packing of collagen molecules, and contained few acidic amino acids. Four peptides mapped to collagen regions crucial for cell-collagen interactions and tissue development. Dinosaur peptides were not represented in more exposed parts of the collagen fibril or regions mediating intermolecular cross-linking. Thus functionally significant regions of collagen fibrils that are physically shielded within the fibril may be preferentially preserved in fossils. These results show empirically that structure-function relationships at the molecular level could contribute to selective preservation in fossilized vertebrate remains across geological time, suggest a ‘preservation motif’, and bolster current concepts linking collagen structure to biological function. This non-random distribution supports the hypothesis that the peptides are produced by the extinct organisms and suggests a chemical mechanism for survival. PMID:21687667
Collagen cross-linking: insights on the evolution of metazoan extracellular matrix.
Rodriguez-Pascual, Fernando; Slatter, David Anthony
2016-11-23
Collagens constitute a large family of extracellular matrix (ECM) proteins that play a fundamental role in supporting the structure of various tissues in multicellular animals. The mechanical strength of fibrillar collagens is highly dependent on the formation of covalent cross-links between individual fibrils, a process initiated by the enzymatic action of members of the lysyl oxidase (LOX) family. Fibrillar collagens are present in a wide variety of animals, therefore often being associated with metazoan evolution, where the emergence of an ancestral collagen chain has been proposed to lead to the formation of different clades. While LOX-generated collagen cross-linking metabolites have been detected in different metazoan families, there is limited information about when and how collagen acquired this particular modification. By analyzing telopeptide and helical sequences, we identified highly conserved, potential cross-linking sites throughout the metazoan tree of life. Based on this analysis, we propose that they have importantly contributed to the formation and further expansion of fibrillar collagens.
Dinosaur Peptides Suggest Mechanisms of Protein Survival
DOE Office of Scientific and Technical Information (OSTI.GOV)
San Antonio, James D.; Schweitzer, Mary H.; Jensen, Shane T.
Eleven collagen peptide sequences recovered from chemical extracts of dinosaur bones were mapped onto molecular models of the vertebrate collagen fibril derived from extant taxa. The dinosaur peptides localized to fibril regions protected by the close packing of collagen molecules, and contained few acidic amino acids. Four peptides mapped to collagen regions crucial for cell-collagen interactions and tissue development. Dinosaur peptides were not represented in more exposed parts of the collagen fibril or regions mediating intermolecular cross-linking. Thus functionally significant regions of collagen fibrils that are physically shielded within the fibril may be preferentially preserved in fossils. These results showmore » empirically that structure-function relationships at the molecular level could contribute to selective preservation in fossilized vertebrate remains across geological time, suggest a 'preservation motif', and bolster current concepts linking collagen structure to biological function. This non-random distribution supports the hypothesis that the peptides are produced by the extinct organisms and suggests a chemical mechanism for survival.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chien, Yung-Ching; Tao, Jinhui; Saeki, Kuniko
In calcified tissues such as bones and teeth, mineralization is regulated by an extracellular matrix, which includes non-collagenous proteins (NCP). This natural process has been adapted or mimicked to restore tissues following physical damage or demineralization by using polyanionic acids in place of NCPs, but the remineralized tissues fail to fully recover their mechanical properties. Here we show that pre-treatment with certain amphiphilic peptoids, a class of peptide-like polymers consisting of N-substituted glycines that have defined monomer sequences, enhances ordering and mineralization of collagen and induces functional remineralization of dentin lesions in vitro. In the vicinity of dentin tubules, themore » newly formed apatite nano-crystals are co-aligned with the c-axis parallel to the tubular periphery and recovery of tissue ultrastructure is accompanied by development of high mechanical strength. The observed effects are highly sequence-dependent with alternating polar and non-polar groups leading to positive outcomes while diblock sequences have no effect. The observations suggest aromatic groups interact with the collagen while the hydrophilic side chains bind the mineralizing constituents and highlight the potential of synthetic sequence-defined biomimetic polymers to serve as NCP mimics in tissue remineralization.« less
Comparative Analysis of the First Complete Enterococcus faecium Genome
Lam, Margaret M. C.; Seemann, Torsten; Bulach, Dieter M.; Gladman, Simon L.; Chen, Honglei; Haring, Volker; Moore, Robert J.; Ballard, Susan; Grayson, M. Lindsay; Johnson, Paul D. R.; Howden, Benjamin P.
2012-01-01
Vancomycin-resistant enterococci (VRE) are one of the leading causes of nosocomial infections in health care facilities around the globe. In particular, infections caused by vancomycin-resistant Enterococcus faecium are becoming increasingly common. Comparative and functional genomic studies of E. faecium isolates have so far been limited owing to the lack of a fully assembled E. faecium genome sequence. Here we address this issue and report the complete 3.0-Mb genome sequence of the multilocus sequence type 17 vancomycin-resistant Enterococcus faecium strain Aus0004, isolated from the bloodstream of a patient in Melbourne, Australia, in 1998. The genome comprises a 2.9-Mb circular chromosome and three circular plasmids. The chromosome harbors putative E. faecium virulence factors such as enterococcal surface protein, hemolysin, and collagen-binding adhesin. Aus0004 has a very large accessory genome (38%) that includes three prophage and two genomic islands absent among 22 other E. faecium genomes. One of the prophage was present as inverted 50-kb repeats that appear to have facilitated a 683-kb chromosomal inversion across the replication terminus, resulting in a striking replichore imbalance. Other distinctive features include 76 insertion sequence elements and a single chromosomal copy of Tn1549 containing the vanB vancomycin resistance element. A complete E. faecium genome will be a useful resource to assist our understanding of this emerging nosocomial pathogen. PMID:22366422
Okamoto, Masaaki; Naito, Mariko; Miyanohara, Mayu; Imai, Susumu; Nomura, Yoshiaki; Saito, Wataru; Momoi, Yasuko; Takada, Kazuko; Miyabe-Nishiwaki, Takako; Tomonaga, Masaki; Hanada, Nobuhiro
2016-12-01
Streptococcus troglodytae TKU31 was isolated from the oral cavity of a chimpanzee (Pan troglodytes) and was found to be the most closely related species of the mutans group streptococci to Streptococcus mutans. The complete sequence of TKU31 genome consists of a single circular chromosome that is 2,097,874 base pairs long and has a G + C content of 37.18%. It possesses 2082 coding sequences (CDSs), 65 tRNAs and five rRNA operons (15 rRNAs). Two clustered regularly interspaced short palindromic repeats, six insertion sequences and two predicted prophage elements were identified. The genome of TKU31 harbors some putative virulence associated genes, including gtfB, gtfC and gtfD genes encoding glucosyltransferase and gbpA, gbpB, gbpC and gbpD genes encoding glucan-binding cell wall-anchored protein. The deduced amino acid identity of the rhamnose-glucose polysaccharide F gene (rgpF), which is one of the serotype determinants, is 91% identical with that of S. mutans LJ23 (serotype k) strain. However, two other virulence-associated genes cnm and cbm, which encode the collagen-binding proteins, were not found in the TKU31 genome. The complete genome sequence of S. troglodytae TKU31 has been deposited at DDBJ/European Nucleotide Archive/GenBank under the accession no. AP014612. © 2016 The Societies and John Wiley & Sons Australia, Ltd.
Hill, Ryan C.; Wither, Matthew J.; Nemkov, Travis; Barrett, Alexander; D'Alessandro, Angelo; Dzieciatkowska, Monika; Hansen, Kirk C.
2015-01-01
Bone samples from several vertebrates were collected from the Ziegler Reservoir fossil site, in Snowmass Village, Colorado, and processed for proteomics analysis. The specimens come from Pleistocene megafauna Bison latifrons, dating back ∼120,000 years. Proteomics analysis using a simplified sample preparation procedure and tandem mass spectrometry (MS/MS) was applied to obtain protein identifications. Several bioinformatics resources were used to obtain peptide identifications based on sequence homology to extant species with annotated genomes. With the exception of soil sample controls, all samples resulted in confident peptide identifications that mapped to type I collagen. In addition, we analyzed a specimen from the extinct B. latifrons that yielded peptide identifications mapping to over 33 bovine proteins. Our analysis resulted in extensive fibrillar collagen sequence coverage, including the identification of posttranslational modifications. Hydroxylysine glucosylgalactosylation, a modification thought to be involved in collagen fiber formation and bone mineralization, was identified for the first time in an ancient protein dataset. Meta-analysis of data from other studies indicates that this modification may be common in well-preserved prehistoric samples. Additional peptide sequences from extracellular matrix (ECM) and non-ECM proteins have also been identified for the first time in ancient tissue samples. These data provide a framework for analyzing ancient protein signatures in well-preserved fossil specimens, while also contributing novel insights into the molecular basis of organic matter preservation. As such, this analysis has unearthed common posttranslational modifications of collagen that may assist in its preservation over time. The data are available via ProteomeXchange with identifier PXD001827. PMID:25948757
DOE Office of Scientific and Technical Information (OSTI.GOV)
Orgel, Joseph P.R.O.; Eid, Aya; Antipova, Olga
Decorin is the archetypal small leucine rich repeat proteoglycan of the vertebrate extracellular matrix (ECM). With its glycosaminoglycuronan chain, it is responsible for stabilizing inter-fibrillar organization. Type I collagen is the predominant member of the fibrillar collagen family, fulfilling both organizational and structural roles in animal ECMs. In this study, interactions between decoron (the decorin core protein) and binding sites in the d and e1 bands of the type I collagen fibril were investigated through molecular modeling of their respective X-ray diffraction structures. Previously, it was proposed that a model-based, highly curved concave decoron interacts with a single collagen molecule,more » which would form extensive van der Waals contacts and give rise to strong non-specific binding. However, the large well-ordered aggregate that is the collagen fibril places significant restraints on modes of ligand binding and necessitates multi-collagen molecular contacts. We present here a relatively high-resolution model of the decoron-fibril collagen complex. We find that the respective crystal structures complement each other well, although it is the monomeric form of decoron that shows the most appropriate shape complementarity with the fibril surface and favorable calculated energies of interaction. One molecule of decoron interacts with four to six collagen molecules, and the binding specificity relies on a large number of hydrogen bonds and electrostatic interactions, primarily with the collagen motifs KXGDRGE and AKGDRGE (d and e{sub 1} bands). This work helps us to understand collagen-decorin interactions and the molecular architecture of the fibrillar ECM in health and disease.« less
Orgel, Joseph P R O; Eid, Aya; Antipova, Olga; Bella, Jordi; Scott, John E
2009-09-15
Decorin is the archetypal small leucine rich repeat proteoglycan of the vertebrate extracellular matrix (ECM). With its glycosaminoglycuronan chain, it is responsible for stabilizing inter-fibrillar organization. Type I collagen is the predominant member of the fibrillar collagen family, fulfilling both organizational and structural roles in animal ECMs. In this study, interactions between decoron (the decorin core protein) and binding sites in the d and e(1) bands of the type I collagen fibril were investigated through molecular modeling of their respective X-ray diffraction structures. Previously, it was proposed that a model-based, highly curved concave decoron interacts with a single collagen molecule, which would form extensive van der Waals contacts and give rise to strong non-specific binding. However, the large well-ordered aggregate that is the collagen fibril places significant restraints on modes of ligand binding and necessitates multi-collagen molecular contacts. We present here a relatively high-resolution model of the decoron-fibril collagen complex. We find that the respective crystal structures complement each other well, although it is the monomeric form of decoron that shows the most appropriate shape complementarity with the fibril surface and favorable calculated energies of interaction. One molecule of decoron interacts with four to six collagen molecules, and the binding specificity relies on a large number of hydrogen bonds and electrostatic interactions, primarily with the collagen motifs KXGDRGE and AKGDRGE (d and e(1) bands). This work helps us to understand collagen-decorin interactions and the molecular architecture of the fibrillar ECM in health and disease.
Residual transglutaminase in collagen - effects, detection, quantification, and removal.
Schloegl, W; Klein, A; Fürst, R; Leicht, U; Volkmer, E; Schieker, M; Jus, S; Guebitz, G M; Stachel, I; Meyer, M; Wiggenhorn, M; Friess, W
2012-02-01
In the present study, we developed an enzyme-linked immunosorbent assay (ELISA) for microbial transglutaminase (mTG) from Streptomyces mobaraensis to overcome the lack of a quantification method for mTG. We further performed a detailed follow-on-analysis of insoluble porcine collagen type I enzymatically modified with mTG primarily focusing on residuals of mTG. Repeated washing (4 ×) reduced mTG-levels in the washing fluids but did not quantitatively remove mTG from the material (p < 0.000001). Substantial amounts of up to 40% of the enzyme utilized in the crosslinking mixture remained associated with the modified collagen. Binding was non-covalent as could be demonstrated by Western blot analysis. Acidic and alkaline dialysis of mTG treated collagen material enabled complete removal the enzyme. Treatment with guanidinium chloride, urea, or sodium chloride was less effective in reducing the mTG content. Copyright © 2011 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tiller, G.E.; Polumbo, P.A.; Weis, M.A.
1995-02-01
Defects in type II collagen have been demonstrated in a phenotypic continuum of chondrodysplasias that includes achondrogenesis II, hypochondrogenesis, spondyloepiphyseal dysplasia congenita (SEDC), Kniest dysplasia, and Stickler syndrome. We have determined that cartilage from a terminated fetus with an inherited form of SEDC contained both normal {alpha}1(II) collagen chains and chains that lacked amino acids 256-273 of the triple-helical domain. PCR amplification of this region of COL2A1, from genomic DNA, yielded products of normal size, while amplification of cDNA yielded a normal sized species and a shorter fragment missing exon 20. Sequence analysis of genomic DNA from the fetus revealedmore » a G{yields}T transversion at position +5 of intron 20; the affected father was also heterozygous for the mutation. Allele-specific PCR and heteroduplex analysis of a VNTR in COL2A1 independently confirmed the unaffected status of a fetus in a subsequent pregnancy. Thermodynamic calculations suggest that the mutation prevents normal splicing of exon 20 by interfering with binding of U{sub 1} small-nuclear RNA to pre-mRNA, thus leading to skipping of exon 20 in transcripts from the mutant allele. Electron micrographs of diseased cartilage showed intracellular inclusion bodies, which were stained by an antibody to {alpha}1(II) procollagen. Our findings support the hypothesis that {alpha}-chain length alterations that preserve the Gly-X-Y repeat motif of the triple helix result in partial intracellular retention of {alpha}1(II) procollagen and produce mild to moderate chondrodysplasia phenotypes. 50 refs., 6 figs., 1 tab.« less
Comparison of simple sequence repeats in 19 Archaea.
Trivedi, S
2006-12-05
All organisms that have been studied until now have been found to have differential distribution of simple sequence repeats (SSRs), with more SSRs in intergenic than in coding sequences. SSR distribution was investigated in Archaea genomes where complete chromosome sequences of 19 Archaea were analyzed with the program SPUTNIK to find di- to penta-nucleotide repeats. The number of repeats was determined for the complete chromosome sequences and for the coding and non-coding sequences. Different from what has been found for other groups of organisms, there is an abundance of SSRs in coding regions of the genome of some Archaea. Dinucleotide repeats were rare and CG repeats were found in only two Archaea. In general, trinucleotide repeats are the most abundant SSR motifs; however, pentanucleotide repeats are abundant in some Archaea. Some of the tetranucleotide and pentanucleotide repeat motifs are organism specific. In general, repeats are short and CG-rich repeats are present in Archaea having a CG-rich genome. Among the 19 Archaea, SSR density was not correlated with genome size or with optimum growth temperature. Pentanucleotide density had an inverse correlation with the CG content of the genome.
Quantification of three-dimensional cell-mediated collagen remodeling using graph theory.
Bilgin, Cemal Cagatay; Lund, Amanda W; Can, Ali; Plopper, George E; Yener, Bülent
2010-09-30
Cell cooperation is a critical event during tissue development. We present the first precise metrics to quantify the interaction between mesenchymal stem cells (MSCs) and extra cellular matrix (ECM). In particular, we describe cooperative collagen alignment process with respect to the spatio-temporal organization and function of mesenchymal stem cells in three dimensions. We defined two precise metrics: Collagen Alignment Index and Cell Dissatisfaction Level, for quantitatively tracking type I collagen and fibrillogenesis remodeling by mesenchymal stem cells over time. Computation of these metrics was based on graph theory and vector calculus. The cells and their three dimensional type I collagen microenvironment were modeled by three dimensional cell-graphs and collagen fiber organization was calculated from gradient vectors. With the enhancement of mesenchymal stem cell differentiation, acceleration through different phases was quantitatively demonstrated. The phases were clustered in a statistically significant manner based on collagen organization, with late phases of remodeling by untreated cells clustering strongly with early phases of remodeling by differentiating cells. The experiments were repeated three times to conclude that the metrics could successfully identify critical phases of collagen remodeling that were dependent upon cooperativity within the cell population. Definition of early metrics that are able to predict long-term functionality by linking engineered tissue structure to function is an important step toward optimizing biomaterials for the purposes of regenerative medicine.
Parmar, Paresh A.; St-Pierre, Jean-Philippe; Chow, Lesley W.; Puetzer, Jennifer L.; Stoichevska, Violet; Peng, Yong Y.; Werkmeister, Jerome A.; Ramshaw, John A. M.; Stevens, Molly M.
2017-01-01
Collagen I foams are used in the clinic as scaffolds to promote articular cartilage repair as they provide a bioactive environment for cells with chondrogenic potential. However, collagen I as a base material does not allow for precise control over bioactivity. Alternatively, recombinant bacterial collagens can be used as “blank slate” collagen molecules to offer a versatile platform for incorporation of selected bioactive sequences and fabrication into 3D scaffolds. Here, we show the potential of Streptococcal collagen-like 2 (Scl2) protein foams modified with peptides designed to specifically and noncovalently bind hyaluronic acid and chondroitin sulfate to improve chondrogenesis of human mesenchymal stem cells (hMSCs) compared to collagen I foams. Specific compositions of functionalized Scl2 foams lead to improved chondrogenesis compared to both nonfunctionalized Scl2 and collagen I foams, as indicated by gene expression, extracellular matrix accumulation, and compression moduli. hMSCs cultured in functionalized Scl2 foams exhibit decreased collagens I and X gene and protein expression, suggesting an advantage over collagen I foams in promoting a chondrocytic phenotype. These highly modular foams can be further modified to improve specific aspects chondrogenesis. As such, these scaffolds also have the potential to be tailored for other regenerative medicine applications. PMID:27219220
[Mutation Analysis of 19 STR Loci in 20 723 Cases of Paternity Testing].
Bi, J; Chang, J J; Li, M X; Yu, C Y
2017-06-01
To observe and analyze the confirmed cases of paternity testing, and to explore the mutation rules of STR loci. The mutant STR loci were screened from 20 723 confirmed cases of paternity testing by Goldeneye 20A system.The mutation rates, and the sources, fragment length, steps and increased or decreased repeat sequences of mutant alleles were counted for the analysis of the characteristics of mutation-related factors. A total of 548 mutations were found on 19 STR loci, and 557 mutation events were observed. The loci mutation rate was 0.07‰-2.23‰. The ratio of paternal to maternal mutant events was 3.06:1. One step mutation was the main mutation, and the number of the increased repeat sequences was almost the same as the decreased repeat sequences. The repeat sequences were more likely to decrease in two steps mutation and above. Mutation mainly occurred in the medium allele, and the number of the increased repeat sequences was almost the same as the decreased repeat sequences. In long allele mutations, the decreased repeat sequences were significantly more than the increased repeat sequences. The number of the increased repeat sequences was almost the same as the decreased repeat sequences in paternal mutation, while the decreased repeat sequences were more than the increased in maternal mutation. There are significant differences in the mutation rate of each locus. When one or two loci do not conform to the genetic law, other detection system should be added, and PI value should be calculated combined with the information of the mutate STR loci in order to further clarify the identification opinions. Copyright© by the Editorial Department of Journal of Forensic Medicine
Symoens, Sofie; Steyaert, Wouter; Demuynck, Lynn; De Paepe, Anne; Diderich, Karin E M; Malfait, Fransiska; Coucke, Paul J
2017-04-01
Type I collagen is the predominant protein of connective tissues such as skin and bone. Mutations in the type I collagen genes (COL1A1 and COL1A2) mainly cause osteogenesis imperfecta (OI). We describe a patient with clinical signs of Ehlers-Danlos syndrome (EDS), including fragile skin, easy bruising, recurrent luxations, and fractures resembling mild OI. Biochemical collagen analysis of the patients' dermal fibroblasts showed faint overmodification of the type I collagen bands, a finding specific for structural defects in type I collagen. Bidirectional Sanger sequencing detected an in-frame deletion in exon 44 of COL1A1 (c.3150_3158del), resulting in the deletion of three amino acids (p.Ala1053_Gly1055del) in the collagen triple helix. This COL1A1 mutation was hitherto identified in four probands with lethal OI, and never in EDS patients. As the peaks on the electropherogram corresponding to the mutant allele were decreased in intensity, we performed next generation sequencing of COL1A1 to study mosaicism in skin and blood. While approximately 9% of the reads originating from fibroblast gDNA harbored the COL1A1 deletion, the deletion was not detected in gDNA from blood. Most likely, the mild clinical symptoms observed in our patient can be explained by the mosaic state of the mutation. © 2017 Wiley Periodicals, Inc.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-01-01
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363
NASA Astrophysics Data System (ADS)
Deniset-Besseau, A.; De Sa Peixoto, P.; Duboisset, J.; Loison, C.; Hache, F.; Benichou, E.; Brevet, P.-F.; Mosser, G.; Schanne-Klein, M.-C.
2010-02-01
Collagen is characterized by triple helical domains and plays a central role in the formation of fibrillar and microfibrillar networks, basement membranes, as well as other structures of the connective tissue. Remarkably, fibrillar collagen exhibits efficient Second Harmonic Generation (SHG) and SHG microscopy proved to be a sensitive tool to score fibrotic pathologies. However, the nonlinear optical response of fibrillar collagen is not fully characterized yet and quantitative data are required to further process SHG images. We therefore performed Hyper-Rayleigh Scattering (HRS) experiments and measured a second order hyperpolarisability of 1.25 10-27 esu for rat-tail type I collagen. This value is surprisingly large considering that collagen presents no strong harmonophore in its amino-acid sequence. In order to get insight into the physical origin of this nonlinear process, we performed HRS measurements after denaturation of the collagen triple helix and for a collagen-like short model peptide [(Pro-Pro-Gly)10]3. It showed that the collagen large nonlinear response originates in the tight alignment of a large number of weakly efficient harmonophores, presumably the peptide bonds, resulting in a coherent amplification of the nonlinear signal along the triple helix. To illustrate this mechanism, we successfully recorded SHG images in collagen liquid solutions by achieving liquid crystalline ordering of the collagen triple helices.
NASA Astrophysics Data System (ADS)
Li, Qi; Akihiro, Kijima
2007-01-01
The microsatellite-enriched library was constructed using magnetic bead hybridization selection method, and the microsatellite DNA sequences were analyzed in Pacific abalone Haliotis discus hannai. Three hundred and fifty white colonies were screened using PCR-based technique, and 84 clones were identified to potentially contain microsatellite repeat motif. The 84 clones were sequenced, and 42 microsatellites and 4 minisatellites with a minimum of five repeats were found (13.1% of white colonies screened). Besides the motif of CA contained in the oligoprobe, we also found other 16 types of microsatellite repeats including a dinucleotide repeat, two tetranucleotide repeats, twelve pentanucleotide repeats and a hexanucleotide repeat. According to Weber (1990), the microsatellite sequences obtained could be categorized structurally into perfect repeats (73.3%), imperfect repeats (13.3%), and compound repeats (13.4%). Among the microsatellite repeats, relatively short arrays (<20 repeats) were most abundant, accounting for 75.0%. The largest length of microsatellites was 48 repeats, and the average number of repeats was 13.4. The data on the composition and length distribution of microsatellites obtained in the present study can be useful for choosing the repeat motifs for microsatellite isolation in other abalone species.
Noguchi, Satoru; Ogawa, Megumu; Malicdan, May Christine; Nonaka, Ikuya; Nishino, Ichizo
2017-02-01
Congenital muscular dystrophies with collagen VI deficiency are inherited muscle disorders with a broad spectrum of clinical presentation and are caused by mutations in one of COL6A1-3 genes. Muscle pathology is characterized by fiber size variation and increased interstitial fibrosis and adipogenesis. In this study, we define critical events that contribute to muscle weakness and fibrosis in a mouse model with collagen VI deficiency. The Col6a1 GT/GT mice develop non-progressive weakness from younger age, accompanied by stunted muscle growth due to reduced IGF-1 signaling activity. In addition, the Col6a1 GT/GT mice have high numbers of interstitial skeletal muscle mesenchymal progenitor cells, which dramatically increase with repeated myofiber necrosis/regeneration. Our results suggest that impaired neonatal muscle growth and the activation of the mesenchymal cells in skeletal muscles contribute to the pathology of collagen VI deficient muscular dystrophy, and more importantly, provide the insights on the therapeutic strategies for collagen VI deficiency. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L
2013-01-30
Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.
2013-01-01
Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705
Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm
Glunčić, Matko; Paar, Vladimir
2013-01-01
The main feature of global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram. In this way, we obtain very fast, efficient and highly automatized repeat finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of ‘magnifying glass’ effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes). PMID:22977183
Larsen, Svend Arild; Mogensen, Line; Dietz, Rune; Baagøe, Hans Jørgen; Andersen, Mogens; Werge, Thomas; Rasmussen, Henrik Berg
2005-12-01
In this study we have identified and characterized dopamine receptor D4 (DRD4) exon III tandem repeats in 33 public available nucleotide sequences from different mammalian species. We found that the tandem repeat in canids could be described in a novel and simple way, namely, as a structure composed of 15- and 12- bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, and donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp tandem repeat, while a tandem repeat consisting of 27-bp modules was identified in a sequence from European badger. Both these tandem repeats were composed of 9-bp basic units, which were closely related with the 9-bp repeat modules identified in the mink and ferret. Tandem repeats could not be identified in sequences from rodents. All tandem repeats possessed a high GC content with a strong bias for C. On phylogenetic analysis of the tandem repeats evolutionary related species were clustered into the same groups. The degree of conservation of the tandem repeats varied significantly between species. The deduced amino acid sequences of most of the tandem repeats exhibited a high propensity for disorder. This was also the case with an amino acid sequence of the human DRD4 exon III tandem repeat, which was included in the study for comparative purposes. We identified proline-containing motifs for SH3 and WW domain binding proteins, potential phosphorylation sites, PDZ domain binding motifs, and FHA domain binding motifs in the amino acid sequences of the tandem repeats. The numbers of potential functional sites varied pronouncedly between species. Our observations provide a platform for future studies of the architecture and evolution of the DRD4 exon III tandem repeat, and they suggest that differences in the structure of this tandem repeat contribute to specialization and generation of diversity in receptor function.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-11-16
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Okimoto, R; Chamberlin, H M; Macfarlane, J L; Wolstenholme, D R
1991-01-01
Within a 7 kb segment of the mtDNA molecule of the root knot nematode, Meloidogyne javanica, that lacks standard mitochondrial genes, are three sets of strictly tandemly arranged, direct repeat sequences: approximately 36 copies of a 102 ntp sequence that contains a TaqI site; 11 copies of a 63 ntp sequence, and 5 copies of an 8 ntp sequence. The 7 kb repeat-containing segment is bounded by putative tRNAasp and tRNAf-met genes and the arrangement of sequences within this segment is: the tRNAasp gene; a unique 1,528 ntp segment that contains two highly stable hairpin-forming sequences; the 102 ntp repeat set; the 8 ntp repeat set; a unique 1,068 ntp segment; the 63 ntp repeat set; and the tRNAf-met gene. The nucleotide sequences of the 102 ntp copies and the 63 ntp copies have been conserved among the species examined. Data from Southern hybridization experiments indicate that 102 ntp and 63 ntp repeats occur in the mtDNAs of three, two and two races of M.incognita, M.hapla and M.arenaria, respectively. Nucleotide sequences of the M.incognita Race-3 102 ntp repeat were found to be either identical or highly similar to those of the M.javanica 102 ntp repeat. Differences in migration distance and number of 102 ntp repeat-containing bands seen in Southern hybridization autoradiographs of restriction-digested mtDNAs of M.javanica and the different host races of M.incognita, M.hapla and M.arenaria are sufficient to distinguish the different host races of each species. Images PMID:2027769
2010-01-01
Background Molecular characterization of collagen-VI related myopathies currently relies on standard sequencing, which yields a detection rate approximating 75-79% in Ullrich congenital muscular dystrophy (UCMD) and 60-65% in Bethlem myopathy (BM) patients as PCR-based techniques tend to miss gross genomic rearrangements as well as copy number variations (CNVs) in both the coding sequence and intronic regions. Methods We have designed a custom oligonucleotide CGH array in order to investigate the presence of CNVs in the coding and non-coding regions of COL6A1, A2, A3, A5 and A6 genes and a group of genes functionally related to collagen VI. A cohort of 12 patients with UCMD/BM negative at sequencing analysis and 2 subjects carrying a single COL6 mutation whose clinical phenotype was not explicable by inheritance were selected and the occurrence of allelic and genetic heterogeneity explored. Results A deletion within intron 1A of the COL6A2 gene, occurring in compound heterozygosity with a small deletion in exon 28, previously detected by routine sequencing, was identified in a BM patient. RNA studies showed monoallelic transcription of the COL6A2 gene, thus elucidating the functional effect of the intronic deletion. No pathogenic mutations were identified in the remaining analyzed patients, either within COL6A genes, or in genes functionally related to collagen VI. Conclusions Our custom CGH array may represent a useful complementary diagnostic tool, especially in recessive forms of the disease, when only one mutant allele is detected by standard sequencing. The intronic deletion we identified represents the first example of a pure intronic mutation in COL6A genes. PMID:20302629
Surface Wave Elastometry of the Cornea in Porcine and Human Donor Eyes
Dupps, William J.; Netto, Marcelo V.; Herekar, Satish; Krueger, Ronald R.
2007-01-01
PURPOSE To introduce a nondestructive technique for characterization of corneal stiffness, determine measurement precision, and investigate comparative stiffness values along central, radial, and circumferential vectors in porcine corneas. The effects of epithelial debridement, relaxing incisions, and crosslink-mediated stiffening on surface wave velocity are also studied. METHODS A handheld prototype system was used to measure ultrasound surface wave propagation time between two fixed-distance transducers along a ten-position map. Repeatability was assessed with replicate measurements in 6 porcine corneas. In 12 porcine globes with controlled intraocular pressure (IOP), serial measurements were performed before and after epithelial removal, then after 250- and 750-μm-deep relaxing incisions. In human globes with constant intravitreal pressure, central wave velocity and transcorneal IOP measurements were compared before and after collagen cross-linking. RESULTS Measurement repeatability across all regions was between 2.2% and 8.1%. Epithelial removal resulted in increases in measured stiffness in 67% of eyes, but statistical power was insufficient to detect a systematic change. Wave velocity across a central incision decreased significantly after 250-μm keratotomy (P<.001), but did not undergo a significant further decrease with deeper keratotomy. Meridional stiffness changes consistent with coupling effects were detected after keratotomy. Surface wave velocity and transcorneal IOP measurements increased markedly after collagen cross-linking despite maintenance of a constant IOP. CONCLUSIONS Handheld corneal elastometry provides a repeatable measure of regional stiffness changes after relaxing incisions and collagen cross-linking in in vitro experiments. Surface wave elastometry allows focal assessment of corneal biomechanical properties that are relevant in refractive surgery, ectatic disease, and glaucoma. PMID:17269246
Hansen, Uwe; Hussain, Muzaffar; Villone, Daniela; Herrmann, Mathias; Robenek, Horst; Peters, Georg; Sinha, Bhanu; Bruckner, Peter
2006-05-01
Besides a number of cell wall-anchored adhesins, the majority of Staphylococcus aureus strains produce anchorless, cell wall-associated proteins, such as Eap (extracellular adherence protein). Eap contains four to six tandem repeat (EAP)-domains. Eap mediates diverse biological functions, including adherence and immunomodulation, thus contributing to S. aureus pathogenesis. Eap binding to host macromolecules is unusually promiscuous and includes matrix or matricellular proteins as well as plasma proteins. The structural basis of this promiscuity is poorly understood. Here, we show that in spite of the preferential location of the binding epitopes within triple helical regions in some collagens there is a striking specificity of Eap binding to different collagen types. Collagen I, but not collagen II, is a binding substrate in monomolecular form. However, collagen I is virtually unrecognized by Eap when incorporated into banded fibrils. By contrast, microfibrils containing collagen VI as well as basement membrane-associated networks containing collagen IV, or aggregates containing fibronectin bound Eap as effectively as the monomeric proteins. Therefore, Eap-binding to extracellular matrix ligands is promiscuous at the molecular level but not indiscriminate with respect to supramolecular structures containing the same macromolecules. In addition, Eap bound to banded fibrils after their partial disintegration by matrix-degrading proteinases, including matrix metalloproteinase 1. Therefore, adherence to matrix suprastructures by S. aureus can be supported by inflammatory reactions.
Yang, Wei; Liu, Fuguo; Xu, Chenqi; Sun, Cuixia; Yuan, Fang; Gao, Yanxiang
2015-05-27
The aggregation of lactoferrin and (-)-epigallocatechin gallate (EGCG) was inhibited by polyphenols, oligosaccharides, and collagen peptide in this study. Polyphenols, oligosaccharides, or collagen peptide can effectively prevent the formation of lactoferrin-EGCG aggregates, respectively. The addition sequence of lactoferrin, polyphenols (oligosaccharides or collagen peptide) and EGCG can affect the turbidity and particle size of the ternary complexes in the buffer solution; however, it hardly affected the ζ-potential and fluorescence characteristics. With either positive or negative charge, polyphenols and collagen peptide disrupted the formation of lactoferrin-EGCG aggregate mainly through the mechanism of its competition with EGCG molecules which surrounded the lactoferrin molecule surface with weaker binding affinities, forming polyphenols or a collagen peptide-lactoferrin-EGCG ternary complex; for neutral oligosaccharides, the ternary complex was generated mainly through steric effects, accompanied by a change in the lactoferrin secondary structure induced by gallic acid, chlorogenic acid, and xylo-oligosaccharide. Polyphenols, oligosaccharides, or collagen peptide restraining the formation of lactoferrin-EGCG aggregate could be applied in the design of clear products in the food, pharmaceutical, and cosmetic industries.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Orgel, J.P.; Antipova, O.; Sagi, I.
Fibrillar collagens form the structural basis of organs and tissues including the vasculature, bone, and tendon. They are also dynamic, organizational scaffolds that present binding and recognition sites for ligands, cells, and platelets. We interpret recently published X-ray diffraction findings and use atomic force microscopy data to illustrate the significance of new insights into the functional organization of the collagen fibril. These data indicate that collagen's most crucial functional domains localize primarily to the overlap region, comprising a constellation of sites we call the 'master control region.' Moreover, the collagen's most exposed aspect contains its most stable part - themore » C-terminal region that controls collagen assembly, cross-linking, and blood clotting. Hidden beneath the fibril surface exists a constellation of 'cryptic' sequences poised to promote hemostasis and cell - collagen interactions in tissue injury and regeneration. These findings begin to address several important, and previously unresolved, questions: How functional domains are organized in the fibril, which domains are accessible, and which require proteolysis or structural trauma to become exposed? Here we speculate as to how collagen fibrillar organization impacts molecular processes relating to tissue growth, development, and repair.« less
Three reasons protein disorder analysis makes more sense in the light of collagen
Oates, Matt E.; Tompa, Peter; Gough, Julian
2016-01-01
Abstract We have identified that the collagen helix has the potential to be disruptive to analyses of intrinsically disordered proteins. The collagen helix is an extended fibrous structure that is both promiscuous and repetitive. Whilst its sequence is predicted to be disordered, this type of protein structure is not typically considered as intrinsic disorder. Here, we show that collagen‐encoding proteins skew the distribution of exon lengths in genes. We find that previous results, demonstrating that exons encoding disordered regions are more likely to be symmetric, are due to the abundance of the collagen helix. Other related results, showing increased levels of alternative splicing in disorder‐encoding exons, still hold after considering collagen‐containing proteins. Aside from analyses of exons, we find that the set of proteins that contain collagen significantly alters the amino acid composition of regions predicted as disordered. We conclude that research in this area should be conducted in the light of the collagen helix. PMID:26941008
Biomolecular characterization and protein sequences of the Campanian hadrosaur B. canadensis.
Schweitzer, Mary H; Zheng, Wenxia; Organ, Chris L; Avci, Recep; Suo, Zhiyong; Freimark, Lisa M; Lebleu, Valerie S; Duncan, Michael B; Vander Heiden, Matthew G; Neveu, John M; Lane, William S; Cottrell, John S; Horner, John R; Cantley, Lewis C; Kalluri, Raghu; Asara, John M
2009-05-01
Molecular preservation in non-avian dinosaurs is controversial. We present multiple lines of evidence that endogenous proteinaceous material is preserved in bone fragments and soft tissues from an 80-million-year-old Campanian hadrosaur, Brachylophosaurus canadensis [Museum of the Rockies (MOR) 2598]. Microstructural and immunological data are consistent with preservation of multiple bone matrix and vessel proteins, and phylogenetic analyses of Brachylophosaurus collagen sequenced by mass spectrometry robustly support the bird-dinosaur clade, consistent with an endogenous source for these collagen peptides. These data complement earlier results from Tyrannosaurus rex (MOR 1125) and confirm that molecular preservation in Cretaceous dinosaurs is not a unique event.
Variation, Repetition, And Choice
Abreu-Rodrigues, Josele; Lattal, Kennon A; dos Santos, Cristiano V; Matos, Ricardo A
2005-01-01
Experiment 1 investigated the controlling properties of variability contingencies on choice between repeated and variable responding. Pigeons were exposed to concurrent-chains schedules with two alternatives. In the REPEAT alternative, reinforcers in the terminal link depended on a single sequence of four responses. In the VARY alternative, a response sequence in the terminal link was reinforced only if it differed from the n previous sequences (lag criterion). The REPEAT contingency generated low, constant levels of sequence variation whereas the VARY contingency produced levels of sequence variation that increased with the lag criterion. Preference for the REPEAT alternative tended to increase directly with the degree of variation required for reinforcement. Experiment 2 examined the potential confounding effects in Experiment 1 of immediacy of reinforcement by yoking the interreinforcer intervals in the REPEAT alternative to those in the VARY alternative. Again, preference for REPEAT was a function of the lag criterion. Choice between varying and repeating behavior is discussed with respect to obtained behavioral variability, probability of reinforcement, delay of reinforcement, and switching within a sequence. PMID:15828592
Repeated whiskey binges promote liver injury in rats fed a choline-deficient diet.
Nieto, Natalia; Rojkind, Marcos
2007-02-01
Alcoholic liver disease is associated with nutritional deficiency and it may aggravate within the context of fatty liver. We investigated the relationship between alcohol intake (whiskey binge drinking) and a choline-deficient diet (CD) and assessed whether stellate cells could contribute to liver injury in this model. Rats fed the CD diet plus whiskey showed increased liver damage compared to rats fed the CD diet, as demonstrated by H&E staining, elevated transaminases, steatosis, TNF-alpha levels, enhanced CYP2E1 activity, impaired antioxidant defense, elevated lipid peroxidation, and protein carbonyls. The combined treatment triggered an apoptotic response as determined by elevated Bax, caspase-3 activity, cytochrome-c release, and decreased Bcl-2 and Bcl-XL. Stellate cells were activated as increased expression of alpha-Sma was observed over that by the CD diet alone. The combined treatment shifted extracellular matrix remodeling towards a pro-fibrogenic response due to up-regulation of collagen I, TIMP1, and Hsp47 proteins, along with down-regulation of MMP13, MMP2, and MMP9 expression, proteases which degrade collagen I. These events were accompanied by increased phosphorylation of p38, a kinase that elevates collagen I. Repeated alcohol binges in the context of mild steatosis may promote activation of stellate cells and contribute to liver injury.
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.
Rehm, Charlotte; Wurmthaler, Lena A; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S
2015-01-01
In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.
Rehm, Charlotte; Wurmthaler, Lena A.; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S.
2015-01-01
In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1–5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6–9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria. PMID:26695179
NASA Astrophysics Data System (ADS)
Deniset-Besseau, A.; Strupler, M.; Duboisset, J.; De Sa Peixoto, P.; Benichou, E.; Fligny, C.; Tharaux, P.-L.; Mosser, G.; Brevet, P.-F.; Schanne-Klein, M.-C.
2009-09-01
Collagen is a major protein of the extracellular matrix that is characterized by triple helical domains. It plays a central role in the formation of fibrillar and microfibrillar networks, basement membranes, as well as other structures of the connective tissue. Remarkably, fibrillar collagen exhibits efficient Second Harmonic Generation (SHG) so that SHG microscopy proved to be a sensitive tool to probe the three-dimensional architecture of fibrillar collagen and to assess the progression of fibrotic pathologies. We obtained sensitive and reproducible measurements of the fibrosis extent, but we needed quantitative data at the molecular level to further process SHG images. We therefore performed Hyper- Rayleigh Scattering (HRS) experiments and measured a second order hyperpolarisability of 1.25 10-27 esu for rat-tail type I collagen. This value is surprisingly large considering that collagen presents no strong harmonophore in its aminoacid sequence. In order to get insight into the physical origin of this nonlinear process, we performed HRS measurements after denaturation of the collagen triple helix and for a collagen-like short model peptide [(Pro-Pro- Gly)10]3. It showed that the collagen large nonlinear response originates in the tight alignment of a large number of weakly efficient harmonophores, presumably the peptide bonds, resulting in a coherent amplification of the nonlinear signal along the triple helix. To illustrate this mechanism, we successfully recorded SHG images in collagenous biomimetic matrices.
Clayton, William; Eaton, Carla Jane; Dupont, Pierre-Yves; Gillanders, Tim; Cameron, Nick; Saikia, Sanjay; Scott, Barry
2017-01-01
Epichloë grass endophytes comprise a group of filamentous fungi of both sexual and asexual species. Known for the beneficial characteristics they endow upon their grass hosts, the identification of these endophyte species has been of great interest agronomically and scientifically. The use of simple sequence repeat loci and the variation in repeat elements has been used to rapidly identify endophyte species and strains, however, little is known of how the structure of repeat elements changes between species and strains, and where these repeat elements are located in the fungal genome. We report on an in-depth analysis of the structure and genomic location of the simple sequence repeat locus B10, commonly used for Epichloë endophyte species identification. The B10 repeat was found to be located within an exon of a putative bZIP transcription factor, suggesting possible impacts on polypeptide sequence and thus protein function. Analysis of this repeat in the asexual endophyte hybrid Epichloë uncinata revealed that the structure of B10 alleles reflects the ancestral species that hybridized to give rise to this species. Understanding the structure and sequence of these simple sequence repeats provides a useful set of tools for readily distinguishing strains and for gaining insights into the ancestral species that have undergone hybridization events.
TRAP: automated classification, quantification and annotation of tandemly repeated sequences.
Sobreira, Tiago José P; Durham, Alan M; Gruber, Arthur
2006-02-01
TRAP, the Tandem Repeats Analysis Program, is a Perl program that provides a unified set of analyses for the selection, classification, quantification and automated annotation of tandemly repeated sequences. TRAP uses the results of the Tandem Repeats Finder program to perform a global analysis of the satellite content of DNA sequences, permitting researchers to easily assess the tandem repeat content for both individual sequences and whole genomes. The results can be generated in convenient formats such as HTML and comma-separated values. TRAP can also be used to automatically generate annotation data in the format of feature table and GFF files.
Faragher, S G; Dalgarno, L
1986-07-20
The 3' untranslated (UT) sequences of the genomic RNAs of five geographic variants of the alphavirus Ross River virus (RRV) were determined and compared with the 3' UT sequence of RRV T48, the prototype strain. Part of the 3' UT region of Getah virus, a close serological relative of RRV, was also sequenced. The RRV 3' UT region varies markedly in length between variants. Large deletions or insertions, sequence rearrangements and single nucleotide substitutions are observed. A sequence tract of 49 to 58 nucleotides, which is repeated as four blocks in the RRV T48 3' UT region, occurs only once in the 3' UT region of one RRV strain (NB5092), indicating that the existence of repeat sequence blocks is not essential for RRV replication. However, the precise sequence of the 3' proximal copy of the repeat block and its position relative to the poly(A) tail were identical in all RRV isolates examined, suggesting that it has an important role in RRV replication. Nucleotide substitutions between RRV variants are distributed non-randomly along the length of the 3' UT region. The sequence of 120 to 130 nucleotides adjacent to the poly(A) tail is strongly conserved. Getah virus RNA contains three repeat sequence blocks in the 3' UT region. These are similar in sequence to those in RRV RNA but differ in their arrangement. Homology between the RRV and Getah 3' UT sequences is greatest in the 3' proximal repeat sequence block that shows three differences in 49 nucleotides. The 3' proximal repeat in Getah RNA occurs at the same position, relative to the poly(A) tail, as in all RRV variants. The RRV and Getah virus 3' UT sequences show extensive homology in the region between the 3' proximal repeat and the poly(A) tail but, apart from the repeat blocks themselves, they show no significant homology elsewhere.
Qin, Lei; Bi, Jing-Ran; Li, Dong-Mei; Dong, Meng; Zhao, Zi-Yuan; Dong, Xiu-Ping; Zhou, Da-Yong; Zhu, Bei-Wei
2016-11-16
We aimed to explore the differences of thermal behaviors between insoluble collagen fibrils (ICFs) and pepsin-solubilized collagens (PSCs) from sea cucumber Stichopus japonicus . The unfolding/refolding sequences of secondary structures of ICFs and PSCs during the heating and cooling cycle (5 → 70 → 5 °C) were identified by Fourier transform infrared spectrometry combined with curve-fitting and 2D correlation techniques. ICFs showed a higher proportion of α-helical structures and higher thermostability than PSCs, and thus had more-stable triple helical structures. The sequences of changes affecting the secondary structures during heating were essentially the same between ICFs and PSCs. In all cases, α-helix structure was the most important conformation and it disappeared to form a β-sheet structure. In the cooling cycle, ICFs showed a partially refolding ability, and the proportion of β-sheet structure rose before the increasing proportion of α-helix structure. PSCs did not obviously refold during the cooling stage.
Mechanical model for a collagen fibril pair in extracellular matrix.
Chan, Yue; Cox, Grant M; Haverkamp, Richard G; Hill, James M
2009-04-01
In this paper, we model the mechanics of a collagen pair in the connective tissue extracellular matrix that exists in abundance throughout animals, including the human body. This connective tissue comprises repeated units of two main structures, namely collagens as well as axial, parallel and regular anionic glycosaminoglycan between collagens. The collagen fibril can be modeled by Hooke's law whereas anionic glycosaminoglycan behaves more like a rubber-band rod and as such can be better modeled by the worm-like chain model. While both computer simulations and continuum mechanics models have been investigated for the behavior of this connective tissue typically, authors either assume a simple form of the molecular potential energy or entirely ignore the microscopic structure of the connective tissue. Here, we apply basic physical methodologies and simple applied mathematical modeling techniques to describe the collagen pair quantitatively. We found that the growth of fibrils was intimately related to the maximum length of the anionic glycosaminoglycan and the relative displacement of two adjacent fibrils, which in return was closely related to the effectiveness of anionic glycosaminoglycan in transmitting forces between fibrils. These reveal the importance of the anionic glycosaminoglycan in maintaining the structural shape of the connective tissue extracellular matrix and eventually the shape modulus of human tissues. We also found that some macroscopic properties, like the maximum molecular energy and the breaking fraction of the collagen, were also related to the microscopic characteristics of the anionic glycosaminoglycan.
Krebs, Kristi; Ruusmann, Anu; Simonlatser, Grethel; Velling, Teet
2015-12-01
FLNa is a ubiquitous cytoskeletal protein that links transmembrane receptors, including integrins, to F-actin and functions as a signalling intermediate. We investigated FLNa's role in the function of integrin-type collagen receptors, EGF-EGFR signalling and regulation of PKB/Akt and ERK1/2. Using FLNa-deficient M2 human melanoma cells, and same cells expressing EGFP-FLNa (M2F) or its Ig-like repeats 1-8+24, 8-15+24 and 16-24, we found that in M2F and M2 8-15+24 cells, EGF induced the increased phosphorylation of PKB/Akt and ERK1/2. In M2F cells EGF induced the localisation of these kinases to cell nucleus and lamellipodia, respectively, and the ERK1/2 phosphorylation-dependent co-immunoprecipitation of FLNa with ERK1/2. Only M2F and M2 8-15+24 cells adhered to and spread on type I collagen whereas on fibronectin all cells behaved similarly. α1β1 and α2β1 were the integrin-type collagen receptors expressed on these cells with primarily α1β1 localising to focal contacts and affecting cell adhesion and migration in a manner dependent on FLNa or its Ig-like repeats 8-15. Our results suggest a role for FLNa repeats 8-15 in the α1-subunit-dependent regulation of integrin α1β1 function, EGF-EGFR signalling to PKB/Akt and ERK1/2, identify ERK1/2 in EGF-induced FLNa-associated protein complexes, and show that the function of different integrins is subjected to differential regulation by FLNa. Copyright © 2015. Published by Elsevier GmbH.
Yan, Xiaoyan; Zhang, Chuanbao; Liang, Tingyu; Yang, Fan; Wang, Haoyuan; Wu, Fan; Wang, Wen; Wang, Zheng; Cheng, Wen; Xu, Jiangnan; Jiang, Tao; Chen, Jing; Ding, Yaozhong
2017-10-17
Collagen XVII expression has recently been demonstrated to be correlated with the tumor malignance. While Collagen XVII is known to be widely distributed in neurons of the human brain, its precise role in pathogenesis of glioblastoma multiforme (GBM) is unknown. In this study, we identified and characterized a new PTEN-COL17A1 fusion gene in GMB using transcriptome sequencing. Although fusion gene did not result in measurable fusion protein production, its presence is accompanied with high levels of COL17A1 expression, revealed a novel regulatory mechanism of Collagen XVII expression by PTEN-COL17A1 gene fusion. Knocked down Collagen XVII expression in glioma cell lines resulted in decreased tumor invasiveness, along with significant reduction of MMP9 expression, while increased Collagen XVII expression promotes invasive activities of glioma cells and associated with GBM recurrences. Together, our results uncovered a new PTEN-COL17A1 fusion gene and its novel regulatory role in Collagen XVII expression and GBM malignance, and demonstrated that COL17A1 could serve as a useful prognostic biomarker and therapeutic targets for GBM.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Choi, Yoon Jung; Lee, Jue Yeon; Lee, Seung Jin
Highlights: Black-Right-Pointing-Pointer CBP sequence is identified from BSP and has collagen binding activity. Black-Right-Pointing-Pointer CBP directly activates the MAPK signaling, especially ERK1/2. Black-Right-Pointing-Pointer CBP increase osteoblastic differentiation by the activation of Runx2. Black-Right-Pointing-Pointer CBP decrease adipogenic differentiation by the inhibition of PPAR{gamma}. -- Abstract: Bone sialoprotein (BSP) is a mineralized, tissue-specific, non-collagenous protein that is normally expressed only in mineralized tissues such as bone, dentin, cementum, and calcified cartilage, and at sites of new mineral formation. The binding of BSP to collagen is thought to be important for initiating bone mineralization and bone cell adhesion to the mineralized matrix. Severalmore » recent studies have isolated stem cells from muscle tissue, but their functional properties are still unclear. In this study, we examined the effects of a synthetic collagen-binding peptide (CBP) on the differentiation efficiency of muscle-derived stem cells (MDSCs). The CBP sequence (NGVFKYRPRYYLYKHAYFYPHLKRFPVQ) corresponds to residues 35-62 of bone sialoprotein (BSP), which are located within the collagen-binding domain in BSP. Interestingly, this synthetic CBP inhibited adipogenic differentiation but increased osteogenic differentiation in MDSCs. The CBP also induced expression of osteoblastic marker proteins, including alkaline phosphatase (ALP), type I collagen, Runt-related transcription factor 2 (Runx2), and osteocalcin; prevented adipogenic differentiation in MDSCs; and down-regulated adipose-specific mRNAs, such as adipocyte protein 2 (aP2) and peroxisome proliferator-activated receptor {gamma}. The CBP increased Extracellular signal-regulated kinases (ERK) 1/2 protein phosphorylation, which is important in lineage determination. These observations suggest that this CBP determines the osteogenic or adipogenic lineage in MDSCs by activating ERK1/2. Taken together, a novel CBP could be a useful candidate for regenerating bone and treating osteoporosis, which result from an imbalance in osteogenesis and adipogenesis differentiation.« less
Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K
2017-04-01
There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
Sweeney, Shawn M.; Orgel, Joseph P.; Fertala, Andrzej; McAuliffe, Jon D.; Turner, Kevin R.; Di Lullo, Gloria A.; Chen, Steven; Antipova, Olga; Perumal, Shiamalee; Ala-Kokko, Leena; Forlino, Antonella; Cabral, Wayne A.; Barnes, Aileen M.; Marini, Joan C.; Antonio, James D. San
2008-01-01
Type I collagen, the predominant protein of vertebrates, polymerizes with type III and V collagens and non-collagenous molecules into large cable-like fibrils, yet how the fibril interacts with cells and other binding partners remains poorly understood. To help reveal insights into the collagen structure-function relationship, a data base was assembled including hundreds of type I collagen ligand binding sites and mutations on a two-dimensional model of the fibril. Visual examination of the distribution of functional sites, and statistical analysis of mutation distributions on the fibril suggest it is organized into two domains. The “cell interaction domain” is proposed to regulate dynamic aspects of collagen biology, including integrin-mediated cell interactions and fibril remodeling. The “matrix interaction domain” may assume a structural role, mediating collagen cross-linking, proteoglycan interactions, and tissue mineralization. Molecular modeling was used to superimpose the positions of functional sites and mutations from the two-dimensional fibril map onto a three-dimensional x-ray diffraction structure of the collagen microfibril in situ, indicating the existence of domains in the native fibril. Sequence searches revealed that major fibril domain elements are conserved in type I collagens through evolution and in the type II/XI collagen fibril predominant in cartilage. Moreover, the fibril domain model provides potential insights into the genotype-phenotype relationship for several classes of human connective tissue diseases, mechanisms of integrin clustering by fibrils, the polarity of fibril assembly, heterotypic fibril function, and connective tissue pathology in diabetes and aging. PMID:18487200
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sweeney, Shawn M.; Orgel, Joseph P.; Fertala, Andrzej
Type I collagen, the predominant protein of vertebrates, polymerizes with type III and V collagens and non-collagenous molecules into large cable-like fibrils, yet how the fibril interacts with cells and other binding partners remains poorly understood. To help reveal insights into the collagen structure-function relationship, a data base was assembled including hundreds of type I collagen ligand binding sites and mutations on a two-dimensional model of the fibril. Visual examination of the distribution of functional sites, and statistical analysis of mutation distributions on the fibril suggest it is organized into two domains. The 'cell interaction domain' is proposed to regulatemore » dynamic aspects of collagen biology, including integrin-mediated cell interactions and fibril remodeling. The 'matrix interaction domain' may assume a structural role, mediating collagen cross-linking, proteoglycan interactions, and tissue mineralization. Molecular modeling was used to superimpose the positions of functional sites and mutations from the two-dimensional fibril map onto a three-dimensional x-ray diffraction structure of the collagen microfibril in situ, indicating the existence of domains in the native fibril. Sequence searches revealed that major fibril domain elements are conserved in type I collagens through evolution and in the type II/XI collagen fibril predominant in cartilage. Moreover, the fibril domain model provides potential insights into the genotype-phenotype relationship for several classes of human connective tissue diseases, mechanisms of integrin clustering by fibrils, the polarity of fibril assembly, heterotypic fibril function, and connective tissue pathology in diabetes and aging.« less
Methods for sequencing GC-rich and CCT repeat DNA templates
Robinson, Donna L.
2007-02-20
The present invention is directed to a PCR-based method of cycle sequencing DNA and other polynucleotide sequences having high CG content and regions of high GC content, and includes for example DNA strands with a high Cytosine and/or Guanosine content and repeated motifs such as CCT repeats.
The Contribution of Short Repeats of Low Sequence Complexity to Large Conifer Genomes
A. Schmidt; R.L. Doudrick; J.S. Heslop-Harrison; T. Schmidt
2000-01-01
Abstract: The abundance and genomic organization of six simple sequence repeats, consisting of di-, tri-, and tetranucleotide sequence motifs, and a minisatellite repeat have been analyzed in different gymnosperms by Southern hybridization. Within the gymnosperm genomes investigated, the abundance and genomic organization of micro- and...
USDA-ARS?s Scientific Manuscript database
Simple sequence repeat (SSR) markers are widely used tools for inferences about genetic diversity, phylogeography and spatial genetic structure. Their applications assume that variation among alleles is essentially caused by an expansion or contraction of the number of repeats and that, accessorily,...
Kitamura, Akira; Ishida, Yoshihito; Kubota, Hiroshi; Pack, Chan-Gi; Homma, Takayuki; Ito, Shinya; Araki, Kazutaka; Kinjo, Masataka; Nagata, Kazuhiro
2018-02-26
Heat shock protein 47 kDa (HSP47), an ER-resident and collagen-specific molecular chaperone, recognizes collagenous hydrophobic amino acid sequences (Gly-Pro-Hyp) and assists in secretion of correctly folded collagen. Elevated collagen production is correlated with HSP47 expression in various diseases, including fibrosis and keloid. HSP47 knockdown ameliorates liver fibrosis by inhibiting collagen secretion, and inhibition of the interaction of HSP47 with procollagen also prevents collagen secretion. Therefore, a high-throughput system for screening of drugs capable of inhibiting the interaction between HSP47 and collagen would aid the development of novel therapies for fibrotic diseases. In this study, we established a straightforward method for rapidly and quantitatively measuring the interaction between HSP47 and collagen in solution using fluorescence correlation spectroscopy (FCS). The diffusion rate of HSP47 labeled with Alexa Fluor 488 (HSP47-AF), a green fluorescent dye, decreased upon addition of type I or III collagen, whereas that of dye-labeled protein disulfide isomerase (PDI) or bovine serum albumin (BSA) did not, indicating that specific binding of HSP47 to collagen could be detected using FCS. Using this method, we calculated the dissociation constant of the interaction between HSP47 and collagen. The binding ratio between HSP47-AF and collagen did not change in the presence of sodium chloride, confirming that the interaction was hydrophobic in nature. In addition, we observed dissociation of collagen from HSP47 at low pH and re-association after recovery to neutral pH. These observations indicate that this system is appropriate for detecting the interaction between HSP47 and collagen, and could be applied to high-throughput screening for drugs capable of suppressing and/or curing fibrosis. Copyright © 2018 Elsevier Inc. All rights reserved.
Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin
2015-04-01
This study was aimed to explore the features of clustered regularly interspaced short palindromic repeats (CRISPR) structures in Shigella by using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that the CRISPRs existed in the four groups of Shigella, and the flanking sequences of upstream CRISPRs could be classified into the same group with those of the downstream. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences had the same group with corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain "stem" and "ring". Some spacers were found to homologize with part sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.
Kapila, R; Das, S; Srivastava, P S; Lakshmikumaran, M
1996-08-01
DNA sequences representing a tandemly repeated DNA family of the Sinapis arvensis genome were cloned and characterized. The 700-bp tandem repeat family is represented by two clones, pSA35 and pSA52, which are 697 and 709 bp in length, respectively. Dot matrix analysis of the sequences indicates the presence of repeated elements within each monomeric unit. Sequence analysis of the repetitive region of clones pSA35 and pSA52 shows that there are several copies of a 7-bp repeat element organized in tandem. The consensus sequence of this repeat element is 5'-TTTAGGG-3'. These elements are highly mutated and the difference in length between the two clones is due to different copy numbers of these elements. The repetitive region of clone pSA35 has 26 copies of the element TTTAGGG, whereas clone pSA52 has 28 copies. The repetitive region in both clones is flanked on either side by inverted repeats that may be footprints of a transposition event. Sequence comparison indicates that the element TTTAGGG is identical to telomeric repeats present in Arabidopsis, maize, tomato, and other plants. However, Bal31 digestion kinetics indicates non-telomeric localization of the 700-bp tandem repeats. The clones represent a novel repeat family as (i) they contain telomere-like motifs as subrepeats within each unit; and (ii) they do not hybridize to related crucifers and are species-specific in nature.
Boros, D L; Singh, K P; Gerard, H C; Hudson, A P; White, S L; Cutroneo, K R
2005-08-01
Schistosomiasis mansoni disseminated worm eggs in mice and humans induce granulomatous inflammations and cumulative fibrosis causing morbidity and possibly mortality. In this study, intrahepatic and I.V. injections of a double-stranded oligodeoxynucleotide decoy containing the TGF-beta regulatory element found in the distal promoter of the COL1A1 gene into worm-infected mice suppressed TGF-beta1, COL1A1, tissue inhibitor of metalloproteinase-1, and decreased COL3A1 mRNAs to a lesser extent. Sequence comparisons within the mouse genome found homologous sequences within the COL3A1, TGF-beta1, and TIMP-1 5' flanking regions. Cold competition gel mobility shift assays using these homologous sequences with 5' and 3' flanking regions found in the natural COL1A1 gene showed competition. Competitive gel mobility assays in a separate experiment showed no competition using a 5-base mutated or scrambled sequence. Explanted liver granulomas from saline-injected mice incorporated 10.45 +/- 1.7% (3)H-proline into newly synthesized collagen, whereas decoy-treated mice showed no collagen synthesis. Compared with the saline control schistosomiasis mice phosphorothioate double-stranded oligodeoxynucleotide treatment decreased total liver collagen content (i.e. hydroxy-4-proline) by 34%. This novel molecular approach has the potential to be employed as a novel antifibrotic treatment modality. (c) 2005 Wiley-Liss, Inc.
NASA Astrophysics Data System (ADS)
Kurotobi, K.; Suzuki, Y.; Nakajima, H.; Suzuki, H.; Iwaki, M.
2003-05-01
He + ion implanted collagen-coated tubes with a fluence of 1 × 10 14 ions/cm 2 were exhibited antithrombogenicity. To investigate the mechanisms of antithrombogenicity of these samples, plasma protein adsorption assay and platelet adhesion experiments were performed. The adsorption of fibrinogen (Fg) and von Willebrand factor (vWf) was minimum on the He + ion implanted collagen with a fluence of 1 × 10 14 ions/cm 2. Platelet adhesion (using platelet rich plasma) was inhibited on the He + ion implanted collagen with a fluence of 1 × 10 14 ions/cm 2 and was accelerated on the untreated collagen and ion implanted collagen with fluences of 1 × 10 13, 1 × 10 15 and 1 × 10 16 ions/cm 2. Platelet activation with washed platelets was observed on untreated collagen and He + ion implanted collagen with a fluence of 1 × 10 14 ions/cm 2 and was inhibited with fluences of 1 × 10 13, 1 × 10 15 and 1 × 10 16 ions/cm 2. Generally, platelets can react with a specific ligand inside the collagen (GFOGER sequence). The results of platelets adhesion experiments using washed platelets indicated that there were no ligands such as GFOGER on the He + ion implanted collagen over a fluence of 1 × 10 13 ions/cm 2. On the 1 × 10 14 ions/cm 2 implanted collagen, no platelet activation was observed due to the influence of plasma proteins. From the above, it is concluded that the decrease of adsorbed Fg and vWf caused the antithrombogenicity of He + ion implanted collagen with a fluence of 1 × 10 14 ions/cm 2 and that plasma protein adsorption took an important role repairing the graft surface.
NASA Astrophysics Data System (ADS)
Tian, Zhenhua; Wu, Kun; Liu, Wentao; Shen, Lirui; Li, Guoying
2015-04-01
The thermal stability of collagen solution (5 mg/mL) crosslinked by glutaraldehyde (GTA) [GTA/collagen (w/w) = 0.5] was measured by differential scanning calorimetry and Fourier transform infrared spectroscopy (FTIR), and the thermally induced structural changes were analyzed using two-dimensional (2D) correlation spectra. The denaturation temperature (Td) and enthalpy change (ΔH) of crosslinked collagen were respectively about 27 °C and 88 J/g higher than those of native collagen, illuminating the thermal stability increased. With the increase of temperature, the red-shift of absorption bands and the decreased AIII/A1455 value obtained from FTIR spectra indicated that hydrogen bonds were weakened and the unwinding of triple helix occurred for both native and crosslinked collagens; whereas the less changes in red-shifting and AIII/A1455 values for crosslinked collagen also confirmed the increase in thermal stability. Additionally, the 2D correlation analysis provided information about the thermally induced structural changes. In the 2D synchronous spectra, the intensities of auto-peaks at 1655 and 1555 cm-1, respectively assigned to amide I band (Cdbnd O stretching vibration) and amide II band (combination of Nsbnd H bending and Csbnd N stretching vibrations) in helical conformation were weaker for crosslinked collagen than those for native collagen, indicating that the helical structure of crosslinked collagen was less sensitive to temperature. Moreover, the sequence of the band intensity variations showed that the band at 1555 cm-1 moved backwards owing to the addition of GTA, demonstrating that the response of helical structure of crosslinked collagen to the increased temperature lagged. It was speculated that the stabilization of collagen by GTA was due to the reinforcement of triple helical structure.
Tian, Zhenhua; Wu, Kun; Liu, Wentao; Shen, Lirui; Li, Guoying
2015-04-05
The thermal stability of collagen solution (5 mg/mL) crosslinked by glutaraldehyde (GTA) [GTA/collagen (w/w)=0.5] was measured by differential scanning calorimetry and Fourier transform infrared spectroscopy (FTIR), and the thermally induced structural changes were analyzed using two-dimensional (2D) correlation spectra. The denaturation temperature (Td) and enthalpy change (ΔH) of crosslinked collagen were respectively about 27°C and 88 J/g higher than those of native collagen, illuminating the thermal stability increased. With the increase of temperature, the red-shift of absorption bands and the decreased AIII/A1455 value obtained from FTIR spectra indicated that hydrogen bonds were weakened and the unwinding of triple helix occurred for both native and crosslinked collagens; whereas the less changes in red-shifting and AIII/A1455 values for crosslinked collagen also confirmed the increase in thermal stability. Additionally, the 2D correlation analysis provided information about the thermally induced structural changes. In the 2D synchronous spectra, the intensities of auto-peaks at 1655 and 1555 cm(-1), respectively assigned to amide I band (CO stretching vibration) and amide II band (combination of NH bending and CN stretching vibrations) in helical conformation were weaker for crosslinked collagen than those for native collagen, indicating that the helical structure of crosslinked collagen was less sensitive to temperature. Moreover, the sequence of the band intensity variations showed that the band at 1555 cm(-1) moved backwards owing to the addition of GTA, demonstrating that the response of helical structure of crosslinked collagen to the increased temperature lagged. It was speculated that the stabilization of collagen by GTA was due to the reinforcement of triple helical structure. Copyright © 2015 Elsevier B.V. All rights reserved.
Bone Collagen: New Clues to its Mineralization Mechanism From Recessive Osteogenesis Imperfecta
Eyre, David R.; Ann Weis, Mary
2013-01-01
Until 2006 the only mutations known to cause osteogenesis imperfecta (OI) were in the two genes coding for type I collagen chains. These dominant mutations affecting the expression or primary sequence of collagen α1(I) and α2(I) chains account for over 90% of OI cases. Since then a growing list of mutant genes causing the 5–10% of recessive cases has rapidly emerged. They include CRTAP, LEPRE1 and PPIB, which encode three proteins forming the prolyl 3-hydroxylase complex; PLOD2 and FKBP10, which encode respectively lysyl hydroxylase 2 and a foldase required for its activity in forming mature cross-links in bone collagen; SERPIN H1, which encodes the collagen chaperone HSP47; SERPIN F1, which encodes pigment epithelium-derived factor required for osteoid mineralization; and BMP1, which encodes the type I procollagen C-propeptidase. All cause fragile bone in infancy, which can include over-mineralization or under-mineralization defects as well as abnormal collagen post-translational modifications. Consistently both dominant and recessive variants lead to abnormal cross-linking chemistry in bone collagen. These recent discoveries strengthen the potential for a common pathogenic mechanism of misassembled collagen fibrils. Of the new genes identified, eight encode proteins required for collagen post-translational modification, chaperoning of newly synthesized collagen chains into native molecules or transport through the endoplasmic reticulum and Golgi for polymerization, cross-linking and mineralization. In reviewing these findings, we conclude that a common theme is emerging in the pathogenesis of brittle bone disease of mishandled collagen assembly with important insights on post-translational features of bone collagen that have evolved to optimize it as a biomineral template. PMID:23508630
Stability of Tandem Repeats in the Drosophila Melanogaster HSR-Omega Nuclear RNA
Hogan, N. C.; Slot, F.; Traverse, K. L.; Garbe, J. C.; Bendena, W. G.; Pardue, M. L.
1995-01-01
The Drosophila melanogaster Hsr-omega locus produces a nuclear RNA containing >5 kb of tandem repeat sequences. These repeats are unique to Hsr-omega and show concerted evolution similar to that seen with classical satellite DNAs. In D. melanogaster the monomer is ~280 bp. Sequences of 191/2 monomers differ by 8 +/- 5% (mean +/- SD), when all pairwise comparisons are considered. Differences are single nucleotide substitutions and 1-3 nucleotide deletions/insertions. Changes appear to be randomly distributed over the repeat unit. Outer repeats do not show the decrease in monomer homogeneity that might be expected if homogeneity is maintained by recombination. However, just outside the last complete repeat at each end, there are a few fragments of sequence similar to the monomer. The sequences in these flanking regions are not those predicted for sequences decaying in the absence of recombination. Instead, the fragmentation of the sequence homology suggests that flanking regions have undergone more severe disruptions, possibly during an insertion or amplification event. Hsr-omega alleles differing in the number of repeats are detected and appear to be stable over a few thousand generations; however, both increases and decreases in repeat numbers have been observed. The new alleles appear to be as stable as their predecessors. No alleles of less than ~5 kb nor more than ~16 kb of repeats were seen in any stocks examined. The evidence that there is a limit on the minimum number of repeats is consistent with the suggestion that these repeats are important in the function of the unusual Hsr-omega nuclear RNA. PMID:7540581
Typing Clostridium difficile strains based on tandem repeat sequences
2009-01-01
Background Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124
First investigation of the collagen D-band ultrastructure in fossilized vertebrate integument.
Lingham-Soliar, Theagarten; Wesley-Smith, James
2008-10-07
The ultrastructure of dermal fibres of a 200Myr thunniform ichthyosaur, Ichthyosaurus, specifically the 67nm axial repeat D-banding of the fibrils, which characterizes collagen, is presented for the first time by means of scanning electron microscopy (SEM) analysis. The fragment of material investigated is part of previously described fossilized skin comprising an architecture of layers of oppositely oriented fibre bundles. The wider implication, as indicated by the extraordinary quality of preservation, is the robustness of the collagen molecule at the ultrastructural level, which presumably contributed to its survival during the initial processes of decomposition prior to mineralization. Investigation of the elemental composition of the sample by SEM-energy dispersive X-ray spectroscopy indicates that calcite and phosphate played important roles in the rapid mineralization and fine replication of the collagen fibres and fibrils. The exceedingly small sample used in the investigation and high level of information achieved indicate the potential for minimal damage to prized museum specimens; for example, ultrastructural investigations by SEM may be used to help resolve highly contentious questions, for example, 'protofeathers' in the Chinese dinosaurs.
First investigation of the collagen D-band ultrastructure in fossilized vertebrate integument
Lingham-Soliar, Theagarten; Wesley-Smith, James
2008-01-01
The ultrastructure of dermal fibres of a 200 Myr thunniform ichthyosaur, Ichthyosaurus, specifically the 67 nm axial repeat D-banding of the fibrils, which characterizes collagen, is presented for the first time by means of scanning electron microscopy (SEM) analysis. The fragment of material investigated is part of previously described fossilized skin comprising an architecture of layers of oppositely oriented fibre bundles. The wider implication, as indicated by the extraordinary quality of preservation, is the robustness of the collagen molecule at the ultrastructural level, which presumably contributed to its survival during the initial processes of decomposition prior to mineralization. Investigation of the elemental composition of the sample by SEM–energy dispersive X-ray spectroscopy indicates that calcite and phosphate played important roles in the rapid mineralization and fine replication of the collagen fibres and fibrils. The exceedingly small sample used in the investigation and high level of information achieved indicate the potential for minimal damage to prized museum specimens; for example, ultrastructural investigations by SEM may be used to help resolve highly contentious questions, for example, ‘protofeathers’ in the Chinese dinosaurs. PMID:18577504
A unique evolution of the kidney phenotype in a patient with autosomal recessive Alport syndrome.
Vischini, Gisella; Kapp, Meghan E; Wheeler, Ferrin C; Hopp, Laszlo; Fogo, Agnes B
2018-03-09
Alport syndrome is due to mutations in one of the genes encoding (α3,4,5) type IV collagen resulting in defective type IV collagen, a key component of the glomerular basement membrane (GBM). The GBM is initially thin, and with ongoing remodeling, develops a thickened basket-woven appearance. We report a unique case of a 9-year-old boy who was biopsied for hematuria and proteinuria, diagnosed as IgA nephropathy, with normal GBM appearance and thickness. Due to a family history of hematuria and chronic kidney disease, he subsequently underwent genetic evaluation and a mutation of α3 type IV collagen (COL4A3) was detected. Additional studies of the initial biopsy demonstrated abnormal type IV collagen immunostaining. A repeat biopsy 4years later showed characteristic glomerular basement membrane morphology of Alport syndrome, and scarring consistent with sequelae of IgA nephropathy. This is the first description of this unusual transition from an initial normal appearance of the glomerular basement membrane to the classic Alport phenotype. Copyright © 2018. Published by Elsevier Inc.
Dermal damage promoted by repeated low-level UV-A1 exposure despite tanning response in human skin.
Wang, Frank; Smith, Noah R; Tran, Bao Anh Patrick; Kang, Sewon; Voorhees, John J; Fisher, Gary J
2014-04-01
Solar UV irradiation causes photoaging, characterized by fragmentation and reduced production of type I collagen fibrils that provide strength to skin. Exposure to UV-B irradiation (280-320 nm) causes these changes by inducing matrix metalloproteinase 1 and suppressing type I collagen synthesis. The role of UV-A irradiation (320-400 nm) in promoting similar molecular alterations is less clear yet important to consider because it is 10 to 100 times more abundant in natural sunlight than UV-B irradiation and penetrates deeper into the dermis than UV-B irradiation. Most (approximately 75%) of solar UV-A irradiation is composed of UV-A1 irradiation (340-400 nm), which is also the primary component of tanning beds. To evaluate the effects of low levels of UV-A1 irradiation, as might be encountered in daily life, on expression of matrix metalloproteinase 1 and type I procollagen (the precursor of type I collagen). In vivo biochemical analyses were conducted after UV-A1 irradiation of normal human skin at an academic referral center. Participants included 22 healthy individuals without skin disease. Skin pigmentation was measured by a color meter (chromometer) under the L* variable (luminescence), which ranges from 0 (black) to 100 (white). Gene expression in skin samples was assessed by real-time polymerase chain reaction. Lightly pigmented human skin (L* >65) was exposed up to 4 times (1 exposure/d) to UV-A1 irradiation at a low dose (20 J/cm2), mimicking UV-A levels from strong sun exposure lasting approximately 2 hours. A single exposure to low-dose UV-A1 irradiation darkened skin slightly and did not alter matrix metalloproteinase 1 or type I procollagen gene expression. With repeated low-dose UV-A1 irradiation, skin darkened incrementally with each exposure. Despite this darkening, 2 or more exposures to low-dose UV-A1 irradiation significantly induced matrix metalloproteinase 1 gene expression, which increased progressively with successive exposures. Repeated UV-A1 exposures did not suppress type I procollagen expression. A limited number of low-dose UV-A1 exposures, as commonly experienced in daily life, potentially promotes photoaging by affecting breakdown, rather than synthesis, of collagen. Progressive skin darkening in response to repeated low-dose UV-A1 exposures in lightly pigmented individuals does not prevent UV-A1-induced collagenolytic changes. Therefore, for optimal protection against skin damage, sunscreen formulations should filter all UV wavelengths, including UV-A1 irradiation.
The role of collagen charge clusters in the modulation of matrix metalloproteinase activity.
Lauer, Janelle L; Bhowmick, Manishabrata; Tokmina-Roszyk, Dorota; Lin, Yan; Van Doren, Steven R; Fields, Gregg B
2014-01-24
Members of the matrix metalloproteinase (MMP) family selectively cleave collagens in vivo. Several substrate structural features that direct MMP collagenolysis have been identified. The present study evaluated the role of charged residue clusters in the regulation of MMP collagenolysis. A series of 10 triple-helical peptide (THP) substrates were constructed in which either Lys-Gly-Asp or Gly-Asp-Lys motifs replaced Gly-Pro-Hyp (where Hyp is 4-hydroxy-L-proline) repeats. The stabilities of THPs containing the two different motifs were analyzed, and kinetic parameters for substrate hydrolysis by six MMPs were determined. A general trend for virtually all enzymes was that, as Gly-Asp-Lys motifs were moved from the extreme N and C termini to the interior next to the cleavage site sequence, kcat/Km values increased. Additionally, all Gly-Asp-Lys THPs were as good or better substrates than the parent THP in which Gly-Asp-Lys was not present. In turn, the Lys-Gly-Asp THPs were also always better substrates than the parent THP, but the magnitude of the difference was considerably less compared with the Gly-Asp-Lys series. Of the MMPs tested, MMP-2 and MMP-9 most greatly favored the presence of charged residues with preference for the Gly-Asp-Lys series. Lys-Gly-(Asp/Glu) motifs are more commonly found near potential MMP cleavage sites than Gly-(Asp/Glu)-Lys motifs. As Lys-Gly-Asp is not as favored by MMPs as Gly-Asp-Lys, the Lys-Gly-Asp motif appears advantageous over the Gly-Asp-Lys motif by preventing unwanted MMP hydrolysis. More specifically, the lack of Gly-Asp-Lys clusters may diminish potential MMP-2 and MMP-9 collagenolytic activity. The present study indicates that MMPs have interactions spanning the P23-P23' subsites of collagenous substrates.
Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian
2009-11-01
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to genome-widely analyze the CRISPR in extreme halophilic archaea, of which the whole genome sequences are available at present time. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were greatly conserved, and two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as recognition site by having palindromic structures in flanking regions, and the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.
The Apis mellifera Filamentous Virus Genome
Gauthier, Laurent; Cornman, Scott; Hartmann, Ulrike; Cousserans, François; Evans, Jay D.; de Miranda, Joachim R.; Neumann, Peter
2015-01-01
A complete reference genome of the Apis mellifera Filamentous virus (AmFV) was determined using Illumina Hiseq sequencing. The AmFV genome is a double stranded DNA molecule of approximately 498,500 nucleotides with a GC content of 50.8%. It encompasses 247 non-overlapping open reading frames (ORFs), equally distributed on both strands, which cover 65% of the genome. While most of the ORFs lacked threshold sequence alignments to reference protein databases, twenty-eight were found to display significant homologies with proteins present in other large double stranded DNA viruses. Remarkably, 13 ORFs had strong similarity with typical baculovirus domains such as PIFs (per os infectivity factor genes: pif-1, pif-2, pif-3 and p74) and BRO (Baculovirus Repeated Open Reading Frame). The putative AmFV DNA polymerase is of type B, but is only distantly related to those of the baculoviruses. The ORFs encoding proteins involved in nucleotide metabolism had the highest percent identity to viral proteins in GenBank. Other notable features include the presence of several collagen-like, chitin-binding, kinesin and pacifastin domains. Due to the large size of the AmFV genome and the inconsistent affiliation with other large double stranded DNA virus families infecting invertebrates, AmFV may belong to a new virus family. PMID:26184284
The Apis mellifera Filamentous Virus Genome.
Gauthier, Laurent; Cornman, Scott; Hartmann, Ulrike; Cousserans, François; Evans, Jay D; de Miranda, Joachim R; Neumann, Peter
2015-07-09
A complete reference genome of the Apis mellifera Filamentous virus (AmFV) was determined using Illumina Hiseq sequencing. The AmFV genome is a double stranded DNA molecule of approximately 498,500 nucleotides with a GC content of 50.8%. It encompasses 247 non-overlapping open reading frames (ORFs), equally distributed on both strands, which cover 65% of the genome. While most of the ORFs lacked threshold sequence alignments to reference protein databases, twenty-eight were found to display significant homologies with proteins present in other large double stranded DNA viruses. Remarkably, 13 ORFs had strong similarity with typical baculovirus domains such as PIFs (per os infectivity factor genes: pif-1, pif-2, pif-3 and p74) and BRO (Baculovirus Repeated Open Reading Frame). The putative AmFV DNA polymerase is of type B, but is only distantly related to those of the baculoviruses. The ORFs encoding proteins involved in nucleotide metabolism had the highest percent identity to viral proteins in GenBank. Other notable features include the presence of several collagen-like, chitin-binding, kinesin and pacifastin domains. Due to the large size of the AmFV genome and the inconsistent affiliation with other large double stranded DNA virus families infecting invertebrates, AmFV may belong to a new virus family.
Genome Wide Characterization of Simple Sequence Repeats in Cucumber
USDA-ARS?s Scientific Manuscript database
The whole genome sequence of the cucumber cultivar Gy14 was recently sequenced at 15× coverage with the Roche 454 Titanium technology. The microsatellite DNA sequences (simple sequence repeats, SSRs) in the assembled scaffolds were computationally explored and characterized. A total of 112,073 SSRs ...
Lee, Michael; Hills, Mark; Conomos, Dimitri; Stutz, Michael D.; Dagg, Rebecca A.; Lau, Loretta M.S.; Reddel, Roger R.; Pickett, Hilda A.
2014-01-01
Telomeres are terminal repetitive DNA sequences on chromosomes, and are considered to comprise almost exclusively hexameric TTAGGG repeats. We have evaluated telomere sequence content in human cells using whole-genome sequencing followed by telomere read extraction in a panel of mortal cell strains and immortal cell lines. We identified a wide range of telomere variant repeats in human cells, and found evidence that variant repeats are generated by mechanistically distinct processes during telomerase- and ALT-mediated telomere lengthening. Telomerase-mediated telomere extension resulted in biased repeat synthesis of variant repeats that differed from the canonical sequence at positions 1 and 3, but not at positions 2, 4, 5 or 6. This indicates that telomerase is most likely an error-prone reverse transcriptase that misincorporates nucleotides at specific positions on the telomerase RNA template. In contrast, cell lines that use the ALT pathway contained a large range of variant repeats that varied greatly between lines. This is consistent with variant repeats spreading from proximal telomeric regions throughout telomeres in a stochastic manner by recombination-mediated templating of DNA synthesis. The presence of unexpectedly large numbers of variant repeats in cells utilizing either telomere maintenance mechanism suggests a conserved role for variant sequences at human telomeres. PMID:24225324
Srivastava, Deepika; Shanker, Asheesh
2016-12-01
Basal angiosperms or Magnoliids is an important clade of commercially important plants which mainly include spices and edible fruits. In this study, 17 chloroplast genome sequences belonging to clade Magnoliids were screened for the identification of chloroplast simple sequence repeats (cpSSRs). Simple sequence repeats or microsatellites are short stretches of DNA up to 1-6 base pair in length. These repeats are ubiquitous and play important role in the development of molecular markers and to study the mapping of traits of economic, medical or ecological interest. A total of 479 SSRs were detected, showing average density of 1 SSR/6.91 kb. Depending on the repeat units, the length of SSRs ranged from 12 to 24 bp for mono-, 12 to 18 bp for di-, 12 to 26 bp for tri-, 12 to 24 bp for tetra-, 15 bp for penta- and 18 bp for hexanucleotide repeats. Mononucleotide repeats were the most frequent (207, 43.21 %) followed by tetranucleotide repeats (130, 27.13 %). Penta- and hexanucleotide repeats were least frequent or absent in these chloroplast genomes.
Watanabe, Keijirou; Hida, Mariko; Sasaki, Takako; Yano, Hiroyuki; Kawano, Kenji; Yoshioka, Hidekatsu; Matsuo, Noritaka
2016-02-01
Type XI collagen is a cartilage-specific extracellular matrix, and is important for collagen fibril formation and skeletal morphogenesis. We have previously reported that NF-Y regulated the proximal promoter activity of the mouse collagen α1(XI) gene (Col11a1) in chondrocytes (Hida et. al. In Vitro Cell. Dev. Biol. Anim. 2014). However, the mechanism of the Col11a1 gene regulation in chondrocytes has not been fully elucidated. In this study, we further characterized the proximal promoter activity of the mouse Col11a1 gene in chondrocytes. Cell transfection experiments with deletion and mutation constructs indicated that the downstream region of the NF-Y binding site (-116 to +1) is also necessary to regulate the proximal promoter activity of the mouse Col11a1 gene. This minimal promoter region has no TATA box and GC-rich sequence; we therefore examined whether the GC-rich sequence (-96 to -67) is necessary for the transcription regulation of the Col11a1 gene. Luciferase assays using a series of mutation constructs exhibited that the GC-rich sequence is a critical element of Col11a1 promoter activity in chondrocytes. Moreover, in silico analysis of this region suggested that one of the most effective candidates was transcription factor Sp1. Consistent with the prediction, overexpression of Sp1 significantly increased the promoter activity. Furthermore, knockdown of Sp1 expression by siRNA transfection suppressed the proximal promoter activity and the expression of endogenous transcript of the mouse Col11a1 gene. Taken together, these results indicate that the transcription factor Sp1 upregulates the proximal promoter activity of the mouse Col11a1 gene in chondrocytes.
Molecular characterization and distribution of a 145-bp tandem repeat family in the genus Populus.
Rajagopal, J; Das, S; Khurana, D K; Srivastava, P S; Lakshmikumaran, M
1999-10-01
This report aims to describe the identification and molecular characterization of a 145-bp tandem repeat family that accounts for nearly 1.5% of the Populus genome. Three members of this repeat family were cloned and sequenced from Populus deltoides and P. ciliata. The dimers of the repeat were sequenced in order to confirm the head-to-tail organization of the repeat. Hybridization-based analysis using the 145-bp tandem repeat as a probe on genomic DNA gave rise to ladder patterns which were identified to be a result of methylation and (or) sequence heterogeneity. Analysis of the methylation pattern of the repeat family using methylation-sensitive isoschizomers revealed variable methylation of the C residues and lack of methylation of the A residues. Sequence comparisons between the monomers revealed a high degree of sequence divergence that ranged between 6% and 11% in P. deltoides and between 4.2% and 8.3% in P. ciliata. This indicated the presence of sub-families within the 145-bp tandem family of repeats. Divergence was mainly due to the accumulation of point mutations and was concentrated in the central region of the repeat. The 145-bp tandem repeat family did not show significant homology to known tandem repeats from plants. A short stretch of 36 bp was found to show homology of 66.7% to a centromeric repeat from Chironomus plumosus. Dot-blot analysis and Southern hybridization data revealed the presence of the repeat family in 13 of the 14 Populus species examined. The absence of the 145-bp repeat from P. euphratica suggested that this species is relatively distant from other members of the genus, which correlates with taxonomic classifications. The widespread occurrence of the tandem family in the genus indicated that this family may be of ancient origin.
Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.
Benslimane, A A; Dron, M; Hartmann, C; Rode, A
1986-01-01
Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammalians. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. Images PMID:3774553
Two tandemly repeated telomere-associated sequences in Nicotiana plumbaginifolia.
Chen, C M; Wang, C T; Wang, C J; Ho, C H; Kao, Y Y; Chen, C C
1997-12-01
Two tandemly repeated telomere-associated sequences, NP3R and NP4R, have been isolated from Nicotiana plumbaginifolia. The length of a repeating unit for NP3R and NP4R is 165 and 180 nucleotides respectively. The abundance of NP3R, NP4R and telomeric repeats is, respectively, 8.4 x 10(4), 6 x 10(3) and 1.5 x 10(6) copies per haploid genome of N. plumbaginifolia. Fluorescence in situ hybridization revealed that NP3R is located at the ends and/or in interstitial regions of all 10 chromosomes and NP4R on the terminal regions of three chromosomes in the haploid genome of N. plumbaginifolia. Sequence homology search revealed that not only are NP3R and NP4R homologous to HRS60 and GRS, respectively, two tandem repeats isolated from N. tabacum, but that NP3R and NP4R are also related to each other, suggesting that they originated from a common ancestral sequence. The role of these repeated sequences in chromosome healing is discussed based on the observation that two to three copies of a telomere-similar sequence were present in each repeating unit of NP3R and NP4R.
Development of Pineapple Microsatellite Markers and Germplasm Genetic Diversity Analysis
Tong, Helin; Chen, You; Wang, Jingyi; Chen, Yeyuan; Sun, Guangming; He, Junhu; Wu, Yaoting
2013-01-01
Two methods were used to develop pineapple microsatellite markers. Genomic library-based SSR development: using selectively amplified microsatellite assay, 86 sequences were generated from pineapple genomic library. 91 (96.8%) of the 94 Simple Sequence Repeat (SSR) loci were dinucleotide repeats (39 AC/GT repeats and 52 GA/TC repeats, accounting for 42.9% and 57.1%, resp.), and the other three were mononucleotide repeats. Thirty-six pairs of SSR primers were designed; 24 of them generated clear bands of expected sizes, and 13 of them showed polymorphism. EST-based SSR development: 5659 pineapple EST sequences obtained from NCBI were analyzed; among 1397 nonredundant EST sequences, 843 were found containing 1110 SSR loci (217 of them contained more than one SSR locus). Frequency of SSRs in pineapple EST sequences is 1SSR/3.73 kb, and 44 types were found. Mononucleotide, dinucleotide, and trinucleotide repeats dominate, accounting for 95.6% in total. AG/CT and AGC/GCT were the dominant type of dinucleotide and trinucleotide repeats, accounting for 83.5% and 24.1%, respectively. Thirty pairs of primers were designed for each of randomly selected 30 sequences; 26 of them generated clear and reproducible bands, and 22 of them showed polymorphism. Eighteen pairs of primers obtained by the one or the other of the two methods above that showed polymorphism were selected to carry out germplasm genetic diversity analysis for 48 breeds of pineapple; similarity coefficients of these breeds were between 0.59 and 1.00, and they can be divided into four groups accordingly. Amplification products of five SSR markers were extracted and sequenced, corresponding repeat loci were found and locus mutations are mainly in copy number of repeats and base mutations in the flanking region. PMID:24024187
Osteogenesis imperfecta type III/Ehlers-Danlos overlap syndrome in a Chinese man.
Lu, Yanqin; Wang, Yanzhou; Rauch, Frank; Li, Hu; Zhang, Yao; Zhai, Naixiang; Zhang, Jian; Ren, Xiuzhi; Han, Jinxiang
2018-02-01
Osteogenesis imperfecta (OI) and Ehlers-Danlos syndrome (EDS) are rare genetic disorders that are typically inherited in an autosomal dominant manner. Few cases of OI/EDS overlap syndrome have been documented. Described here is a 30-year-old Chinese male with OI type III and EDS. Sequencing of genomic DNA revealed a heterozygous COL1A1 mutation (c.671G>A, p.Gly224Asp) that affected the N-anchor domain of the alpha 1 chain of collagen type I. Ultrastructural analysis of a skin biopsy specimen revealed thin collagen fibers with irregular alignment of collagen fibers. These findings have expanded the genotypic spectrum of the OI/EDS overlap syndrome.
De novo self-assembling collagen heterotrimers using explicit positive and negative design.
Xu, Fei; Zhang, Lei; Koder, Ronald L; Nanda, Vikas
2010-03-23
We sought to computationally design model collagen peptides that specifically associate as heterotrimers. Computational design has been successfully applied to the creation of new protein folds and functions. Despite the high abundance of collagen and its key role in numerous biological processes, fibrous proteins have received little attention as computational design targets. Collagens are composed of three polypeptide chains that wind into triple helices. We developed a discrete computational model to design heterotrimer-forming collagen-like peptides. Stability and specificity of oligomerization were concurrently targeted using a combined positive and negative design approach. The sequences of three 30-residue peptides, A, B, and C, were optimized to favor charge-pair interactions in an ABC heterotrimer, while disfavoring the 26 competing oligomers (i.e., AAA, ABB, BCA). Peptides were synthesized and characterized for thermal stability and triple-helical structure by circular dichroism and NMR. A unique A:B:C-type species was not achieved. Negative design was partially successful, with only A + B and B + C competing mixtures formed. Analysis of computed versus experimental stabilities helps to clarify the role of electrostatics and secondary-structure propensities determining collagen stability and to provide important insight into how subsequent designs can be improved.
Shaw, Gregory; Lee-Barthel, Ann; Ross, Megan LR; Wang, Bing; Baar, Keith
2017-01-01
Background: Musculoskeletal injuries are the most common complaint in active populations. More than 50% of all injuries in sports can be classified as sprains, strains, ruptures, or breaks of musculoskeletal tissues. Nutritional and/or exercise interventions that increase collagen synthesis and strengthen these tissues could have an important effect on injury rates. Objective: This study was designed to determine whether gelatin supplementation could increase collagen synthesis. Design: Eight healthy male subjects completed a randomized, double-blinded, crossover-design study in which they consumed either 5 or 15 g of vitamin C–enriched gelatin or a placebo control. After the initial drink, blood was taken every 30 min to determine amino acid content in the blood. A larger blood sample was taken before and 1 h after consumption of gelatin for treatment of engineered ligaments. One hour after the initial supplement, the subjects completed 6 min of rope-skipping to stimulate collagen synthesis. This pattern of supplementation was repeated 3 times/d with ≥6 h between exercise bouts for 3 d. Blood was drawn before and 4, 24, 48, and 72 h after the first exercise bout for determination of amino-terminal propeptide of collagen I content. Results: Supplementation with increasing amounts of gelatin increased circulating glycine, proline, hydroxyproline, and hydroxylysine, peaking 1 h after the supplement was given. Engineered ligaments treated for 6 d with serum from samples collected before or 1 h after subjects consumed a placebo or 5 or 15 g gelatin showed increased collagen content and improved mechanics. Subjects who took 15 g gelatin 1 h before exercise showed double the amino-terminal propeptide of collagen I in their blood, indicating increased collagen synthesis. Conclusion: These data suggest that adding gelatin to an intermittent exercise program improves collagen synthesis and could play a beneficial role in injury prevention and tissue repair. This trial was registered at the Australian New Zealand Clinical Trials Registry as ACTRN12616001092482. PMID:27852613
Vitamin C-enriched gelatin supplementation before intermittent activity augments collagen synthesis.
Shaw, Gregory; Lee-Barthel, Ann; Ross, Megan Lr; Wang, Bing; Baar, Keith
2017-01-01
Musculoskeletal injuries are the most common complaint in active populations. More than 50% of all injuries in sports can be classified as sprains, strains, ruptures, or breaks of musculoskeletal tissues. Nutritional and/or exercise interventions that increase collagen synthesis and strengthen these tissues could have an important effect on injury rates. This study was designed to determine whether gelatin supplementation could increase collagen synthesis. Eight healthy male subjects completed a randomized, double-blinded, crossover-design study in which they consumed either 5 or 15 g of vitamin C-enriched gelatin or a placebo control. After the initial drink, blood was taken every 30 min to determine amino acid content in the blood. A larger blood sample was taken before and 1 h after consumption of gelatin for treatment of engineered ligaments. One hour after the initial supplement, the subjects completed 6 min of rope-skipping to stimulate collagen synthesis. This pattern of supplementation was repeated 3 times/d with ≥6 h between exercise bouts for 3 d. Blood was drawn before and 4, 24, 48, and 72 h after the first exercise bout for determination of amino-terminal propeptide of collagen I content. Supplementation with increasing amounts of gelatin increased circulating glycine, proline, hydroxyproline, and hydroxylysine, peaking 1 h after the supplement was given. Engineered ligaments treated for 6 d with serum from samples collected before or 1 h after subjects consumed a placebo or 5 or 15 g gelatin showed increased collagen content and improved mechanics. Subjects who took 15 g gelatin 1 h before exercise showed double the amino-terminal propeptide of collagen I in their blood, indicating increased collagen synthesis. These data suggest that adding gelatin to an intermittent exercise program improves collagen synthesis and could play a beneficial role in injury prevention and tissue repair. This trial was registered at the Australian New Zealand Clinical Trials Registry as ACTRN12616001092482. © 2017 American Society for Nutrition.
A novel tandem repeat sequence located on human chromosome 4p: isolation and characterization.
Kogi, M; Fukushige, S; Lefevre, C; Hadano, S; Ikeda, J E
1997-06-01
In an effort to analyze the genomic region of the distal half of human chromosome 4p, to where Huntington disease and other diseases have been mapped, we have isolated the cosmid clone (CRS447) that was likely to contain a region with specific repeat sequences. Clone CRS447 was subjected to detailed analysis, including chromosome mapping, restriction mapping, and DNA sequencing. Chromosome mapping by both a human-CHO hybrid cell panel and FISH revealed that CRS447 was predominantly located in the 4p15.1-15.3 region. CRS447 was shown to consist of tandem repeats of 4.7-kb units present on chromosome 4p. A single EcoRI unit was subcloned (pRS447), and the complete sequence was determined as 4752 nucleotides. When pRS447 was used as a probe, the number of copies of this repeat per haploid genome was estimated to be 50-70. Sequence analysis revealed that it contained two internal CA repeats and one putative ORF. Database search established that this sequence was unreported. However, two homologous STS markers were found in the database. We concluded that CRS447/pRS447 is a novel tandem repeat sequence that is mainly specific to human chromosome 4p.
Collagen Content Limits Optical Coherence Tomography Image Depth in Porcine Vocal Fold Tissue.
Garcia, Jordan A; Benboujja, Fouzi; Beaudette, Kathy; Rogers, Derek; Maurer, Rie; Boudoux, Caroline; Hartnick, Christopher J
2016-11-01
Vocal fold scarring, a condition defined by increased collagen content, is challenging to treat without a method of noninvasively assessing vocal fold structure in vivo. The goal of this study was to observe the effects of vocal fold collagen content on optical coherence tomography imaging to develop a quantifiable marker of disease. Excised specimen study. Massachusetts Eye and Ear Infirmary. Porcine vocal folds were injected with collagenase to remove collagen from the lamina propria. Optical coherence tomography imaging was performed preinjection and at 0, 45, 90, and 180 minutes postinjection. Mean pixel intensity (or image brightness) was extracted from images of collagenase- and control-treated hemilarynges. Texture analysis of the lamina propria at each injection site was performed to extract image contrast. Two-factor repeated measure analysis of variance and t tests were used to determine statistical significance. Picrosirius red staining was performed to confirm collagenase activity. Mean pixel intensity was higher at injection sites of collagenase-treated vocal folds than control vocal folds (P < .0001). Fold change in image contrast was significantly increased in collagenase-treated vocal folds than control vocal folds (P = .002). Picrosirius red staining in control specimens revealed collagen fibrils most prominent in the subepithelium and above the thyroarytenoid muscle. Specimens treated with collagenase exhibited a loss of these structures. Collagen removal from vocal fold tissue increases image brightness of underlying structures. This inverse relationship may be useful in treating vocal fold scarring in patients. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Genome-wide characterization of centromeric satellites from multiple mammalian genomes.
Alkan, Can; Cardone, Maria Francesca; Catacchio, Claudia Rita; Antonacci, Francesca; O'Brien, Stephen J; Ryder, Oliver A; Purgato, Stefania; Zoli, Monica; Della Valle, Giuliano; Eichler, Evan E; Ventura, Mario
2011-01-01
Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.
Survey and Analysis of Microsatellites in the Silkworm, Bombyx mori
Prasad, M. Dharma; Muthulakshmi, M.; Madhu, M.; Archak, Sunil; Mita, K.; Nagaraju, J.
2005-01-01
We studied microsatellite frequency and distribution in 21.76-Mb random genomic sequences, 0.67-Mb BAC sequences from the Z chromosome, and 6.3-Mb EST sequences of Bombyx mori. We mined microsatellites of ≥15 bases of mononucleotide repeats and ≥5 repeat units of other classes of repeats. We estimated that microsatellites account for 0.31% of the genome of B. mori. Microsatellite tracts of A, AT, and ATT were the most abundant whereas their number drastically decreased as the length of the repeat motif increased. In general, tri- and hexanucleotide repeats were overrepresented in the transcribed sequences except TAA, GTA, and TGA, which were in excess in genomic sequences. The Z chromosome sequences contained shorter repeat types than the rest of the chromosomes in addition to a higher abundance of AT-rich repeats. Our results showed that base composition of the flanking sequence has an influence on the origin and evolution of microsatellites. Transitions/transversions were high in microsatellites of ESTs, whereas the genomic sequence had an equal number of substitutions and indels. The average heterozygosity value for 23 polymorphic microsatellite loci surveyed in 13 diverse silkmoth strains having 2–14 alleles was 0.54. Only 36 (18.2%) of 198 microsatellite loci were polymorphic between the two divergent silkworm populations and 10 (5%) loci revealed null alleles. The microsatellite map generated using these polymorphic markers resulted in 8 linkage groups. B. mori microsatellite loci were the most conserved in its immediate ancestor, B. mandarina, followed by the wild saturniid silkmoth, Antheraea assama. PMID:15371363
Collagen Sequence Analysis of the Extinct Giant Ground Sloths Lestodon and Megatherium
Buckley, Michael; Fariña, Richard A.; Lawless, Craig; Tambusso, P. Sebastián; Varela, Luciano; Carlini, Alfredo A.; Powell, Jaime E.; Martinez, Jorge G.
2015-01-01
For over 200 years, fossils of bizarre extinct creatures have been described from the Americas that have ranged from giant ground sloths to the ‘native’ South American ungulates, groups of mammals that evolved in relative isolation on South America. Ground sloths belong to the South American xenarthrans, a group with modern although morphologically and ecologically very different representatives (anteaters, armadillos and sloths), which has been proposed to be one of the four main eutherian clades. Recently, proteomics analyses of bone collagen have recently been used to yield a molecular phylogeny for a range of mammals including the unusual ‘Malagasy aardvark’ shown to be most closely related to the afrotherian tenrecs, and the south American ungulates supporting their morphological association with condylarths. However, proteomics results generate partial sequence information that could impact upon the phylogenetic placement that has not been appropriately tested. For comparison, this paper examines the phylogenetic potential of proteomics-based sequencing through the analysis of collagen extracted from two extinct giant ground sloths, Lestodon and Megatherium. The ground sloths were placed as sister taxa to extant sloths, but with a closer relationship between Lestodon and the extant sloths than the basal Megatherium. These results highlight that proteomics methods could yield plausible phylogenies that share similarities with other methods, but have the potential to be more useful in fossils beyond the limits of ancient DNA survival. PMID:26540101
NASA Astrophysics Data System (ADS)
Dominguez, L. A.; Taira, T.; Hjorleifsdottir, V.; Santoyo, M. A.
2015-12-01
Repeating earthquake sequences are sets of events that are thought to rupture the same area on the plate interface and thus provide nearly identical waveforms. We systematically analyzed seismic records from 2001 through 2014 to identify repeating earthquakes with highly correlated waveforms occurring along the subduction zone of the Cocos plate. Using the correlation coefficient (cc) and spectral coherency (coh) of the vertical components as selection criteria, we found a set of 214 sequences whose waveforms exceed cc≥95% and coh≥95%. Spatial clustering along the trench shows large variations in repeating earthquakes activity. Particularly, the rupture zone of the M8.1, 1985 earthquake shows an almost absence of characteristic repeating earthquakes, whereas the Guerrero Gap zone and the segment of the trench close to the Guerrero-Oaxaca border shows a significantly larger number of repeating earthquakes sequences. Furthermore, temporal variations associated to stress changes due to major shows episodes of unlocking and healing of the interface. Understanding the different components that control the location and recurrence time of characteristic repeating sequences is a key factor to pinpoint areas where large megathrust earthquakes may nucleate and consequently to improve the seismic hazard assessment.
Algorithm to find distant repeats in a single protein sequence
Banerjee, Nirjhar; Sarani, Rangarajan; Ranjani, Chellamuthu Vasuki; Sowmiya, Govindaraj; Michael, Daliah; Balakrishnan, Narayanasamy; Sekar, Kanagaraj
2008-01-01
Distant repeats in protein sequence play an important role in various aspects of protein analysis. A keen analysis of the distant repeats would enable to establish a firm relation of the repeats with respect to their function and three-dimensional structure during the evolutionary process. Further, it enlightens the diversity of duplication during the evolution. To this end, an algorithm has been developed to find all distant repeats in a protein sequence. The scores from Point Accepted Mutation (PAM) matrix has been deployed for the identification of amino acid substitutions while detecting the distant repeats. Due to the biological importance of distant repeats, the proposed algorithm will be of importance to structural biologists, molecular biologists, biochemists and researchers involved in phylogenetic and evolutionary studies. PMID:19052663
NASA Astrophysics Data System (ADS)
Ernenwein, Dawn M.
2011-12-01
Bottom-up self-assembly of peptides has driven the research progress for the following two projects: protein delivery vehicles of collagen microflorettes and the assembly of gold nanoparticles with coiled-coil peptides. Collagen is the most abundant protein in the mammals yet due to immunogenic responses, batch-to-batch variability and lack of sequence modifications, synthetic collagen has been designed to self-assemble into native collagen-like structures. In particular with this research, metal binding ligands were incorporated on the termini of collagen-like peptides to generate micron-sized particles, microflorettes. The over-arching goal of the first research project is to engineer MRI-active microflorettes, loaded with His-tagged growth factors with differential release rates while bound to stem cells that can be implemented toward regenerative cell-based therapies. His-tagged proteins, such as green fluorescent protein, have successfully been incorporated on the surface and throughout the microflorettes. Protein release was monitored under physiological conditions and was related to particle degradation. In human plasma full release was obtained within six days. Stability of the microflorettes under physiological conditions was also examined for the development of a therapeutically relevant delivery agent. Additionally, MRI active microflorettes have been generated through the incorporation of a gadolinium binding ligand, DOTA within the collagen-based peptide sequence. To probe peptide-promoted self-assemblies of gold nanoparticles (GNPs) by non-covalent, charge complementary interactions, a highly anionic coiled-coil peptide was designed and synthesized. Upon formation of peptide-GNP interactions, the hydrophobic domain of the coiled-coil were shown to promote the self-assembly of peptide-GNPs clustering. Hydrophobic forces were found to play an important role in the assembly process, as a peptide with an equally overall negative charge, but lacking an ordered hydrophobic face had no effect on GNP assembly. The self-assembly system herein is advantageous due to its reversible nature upon addition of high salt concentrations which masks the surface charge. There is great potential for using this uniquely designed self-assembled peptide-gold nanoparticle system for exploring the interplay between peptide ligation and GNP self-assembly.
Tang, Bo; Cullins, David L; Zhou, Jing; Zawaski, Janice A; Park, Hyelee; Brand, David D; Hasty, Karen A; Gaber, M Waleed; Stuart, John M; Kang, Andrew H; Myers, Linda K
2010-01-01
Rheumatoid arthritis (RA) is a systemic disease manifested by chronic inflammation in multiple articular joints, including the knees and small joints of the hands and feet. We have developed a unique modification to a clinically accepted method for delivering therapies directly to the synovium. Our therapy is based on our previous discovery of an analog peptide (A9) with amino acid substitutions made at positions 260 (I to A), 261 (A to B), and 263 (F to N) that could profoundly suppress immunity to type II collagen (CII) and arthritis in the collagen-induced arthritis model (CIA). We engineered an adenoviral vector to contain the CB11 portion of recombinant type II collagen and used PCR to introduce point mutations at three sites within (CII124-402, 260A, 261B, 263D), (rCB11-A9) so that the resulting molecule contained the A9 sequence at the exact site of the wild-type sequence. We used this construct to target intra-articular tissues of mice and utilized the collagen-induced arthritis model to show that this treatment strategy provided a sustained, local therapy for individual arthritic joints, effective whether given to prevent arthritis or as a treatment. We also developed a novel system for in vivo bioimaging, using the firefly luciferase reporter gene to allow serial bioluminescence imaging to show that luciferase can be detected as late as 18 days post injection into the joint. Our therapy is unique in that we target synovial cells to ultimately shut down T cell-mediated inflammation. Its effectiveness is based on its ability to transform potential inflammatory T cells and/or bystander T cells into therapeutic (regulatory-like) T cells which secrete interleukin (IL)-4. We believe this approach has potential to effectively suppress RA with minimal side effects.
2010-01-01
Introduction Rheumatoid arthritis (RA) is a systemic disease manifested by chronic inflammation in multiple articular joints, including the knees and small joints of the hands and feet. We have developed a unique modification to a clinically accepted method for delivering therapies directly to the synovium. Our therapy is based on our previous discovery of an analog peptide (A9) with amino acid substitutions made at positions 260 (I to A), 261 (A to B), and 263 (F to N) that could profoundly suppress immunity to type II collagen (CII) and arthritis in the collagen-induced arthritis model (CIA). Methods We engineered an adenoviral vector to contain the CB11 portion of recombinant type II collagen and used PCR to introduce point mutations at three sites within (CII124-402, 260A, 261B, 263D), (rCB11-A9) so that the resulting molecule contained the A9 sequence at the exact site of the wild-type sequence. Results We used this construct to target intra-articular tissues of mice and utilized the collagen-induced arthritis model to show that this treatment strategy provided a sustained, local therapy for individual arthritic joints, effective whether given to prevent arthritis or as a treatment. We also developed a novel system for in vivo bioimaging, using the firefly luciferase reporter gene to allow serial bioluminescence imaging to show that luciferase can be detected as late as 18 days post injection into the joint. Conclusions Our therapy is unique in that we target synovial cells to ultimately shut down T cell-mediated inflammation. Its effectiveness is based on its ability to transform potential inflammatory T cells and/or bystander T cells into therapeutic (regulatory-like) T cells which secrete interleukin (IL)-4. We believe this approach has potential to effectively suppress RA with minimal side effects. PMID:20615221
Characterization of (CA)n microsatellite repeats from large-insert clones.
Litt, M; Browne, D
2001-05-01
The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit determination of sequences flanking the microsatellites. When cosmids or large-insert phage clones are used as primary sources of (CA)n repeat markers, they have traditionally been subcloned into plasmid vectors such as pUC18 or M13 mp 18/19 cloning vectors to obtain fragments of suitable size for DNA sequencing. This unit presents an alternative approach whereby a set of degenerate sequencing primers that anneal directly to (CA)n microsatellites can be used to determine sequences that are inaccessible with vector-derived primers. Because the primers anneal to the repeat and not to the vector, they can be used with subclones containing inserts of several kilobases and should, in theory, always give sequence in the regions directly flanking the repeat. Degeneracy at the 3 end of each of these primers prevents elongation of primers that have annealed out-of-register. The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit.
Bhatia, S; Singh Negi, M; Lakshmikumaran, M
1996-11-01
EcoRI restriction of the B. nigra rDNA recombinants, isolated from a lambda genomic library, showed that the 3.9-kb fragment corresponded to the Intergenic Spacer (IGS), which was sequenced and found to be 3,928 bp in size. Sequence and dot-matrix analyses showed that the organization of the B. nigra rDNA IGS was typical of most rDNA spacers, consisting of a central repetitive region and flanking unique sequences on either side. The repetitive region was composed of two repeat families-RF 'A' and RF 'B.' The B. nigra RF 'A' consisted of a tandem array of three full-length copies of a 106-bp sequence element. RF 'B' was composed of 66 tandemly repeated elements. Each 'B' element was only 21-bp in size and this is the smallest repeat unit identified in plant rDNA to date. The putative transcription initiation site (TIS) was identified as nucleotide position 3,110. Based on the sequence analysis it was suggested that the present organization of the repeat families was generated by successive cycles of deletions and amplifications and was being maintained by homogenization processes such as gene conversion and crossing-over.A detailed comparison of the rDNA IGS sequences of the three diploid Brassica species-namely, B. nigra, B. campestris, and B. oleracea-was carried out. First, comparisons revealed that B. campestris and B. oleracea were close to each other as the repeat families in both showed high sequence homology between each other. Second, the repeat elements in both the species were organized in an interspersed manner. Third, a 52-bp sequence, present just downstream of the repeats in B. campestris, was found to be identical to the B. oleracea repeats, thereby suggesting a common progenitor. On the other hand, in B. nigra no interspersion pattern of organization of repeats was observed. Further, the B. nigra RF 'A' was identified as distinct from the repeat families of B. campestris and B. oleracea. Based on this analysis, it was suggested that during speciation B. campestris and B. oleracea evolved in one lineage whereas B. nigra diverged into a separate lineage. The comparative analysis of the IGS helped in identifying not only conserved ancestral sequence motifs of possible functional significance such as promoters and enhancers, but also sequences which showed variation between the three diploid species and were therefore identified as species-specific sequences.
Characterization of PepB, a group B streptococcal oligopeptidase.
Lin, B; Averett, W F; Novak, J; Chatham, W W; Hollingshead, S K; Coligan, J E; Egan, M L; Pritchard, D G
1996-01-01
Group B streptococci were recently reported to possess a cell-associated collagenase. Although the enzyme hydrolyzed the synthetic collagen-like substrate N-(3-[2-furyl]acryloyl)-Leu-Gly-Pro-Ala, we found that neither the highly purified enzyme nor crude group B streptococcal cell lysate solubilized a film of reconstituted rat tail collagen, an activity regarded as obligatory for a true collagenase. We cloned and sequenced the gene for the enzyme (pepB). The deduced amino acid sequence showed 66.4% identity to the PepF oligopeptidase from Lactococcus lactis, a member of the M3 or thimet family of zinc metallopeptidases. The group B streptococcal enzyme also showed oligopeptidase activity and degraded a variety of small bioactive peptides, including bradykinin, neurotensin, and peptide fragments of substance P and adrenocorticotropin. PMID:8757883
Zeng, Shaokui; Yin, Juanjuan; Yang, Shuqi; Zhang, Chaohua; Yang, Ping; Wu, Wenlong
2012-12-01
Acid-solubilized collagen (ASC) and pepsin-solubilized collagen (PSC) were extracted from the skin of cobia (Rachycentron canadum). The yields of ASC and PSC were 35.5% and 12.3%, respectively. Based on the protein patterns and carboxymethyl-cellulose chromatography, ASC and PSC were composed of α1α2α3 heterotrimers and were characterised as type I collagen with no disulfide bond. Their amounts of imino acids were 203 and 191 residues per 1000 residues, respectively. LC-MS/MS analysis demonstrated the high sequences similarities of ASC and PSC. Fourier transform infrared spectroscopy spectra showed that the amide I, II and III peaks of PSC were obtained at a lower wave number compared with ASC. The thermal denaturation temperatures of ASC and PSC, as measured by viscometry, were 34.62 and 33.97°C, respectively. The transition temperatures (T(max)) were 38.17 and 36.03°C, respectively, as determined by differential scanning calorimetry (DSC). Both collagens were soluble at acidic pH and below 2% (w/v) NaCl concentration. Copyright © 2012 Elsevier Ltd. All rights reserved.
Kreikemeyer, Bernd; Nakata, Masanobu; Oehmcke, Sonja; Gschwendtner, Caroline; Normann, Jana; Podbielski, Andreas
2005-09-30
The Streptococcus pyogenes collagen type I-binding protein Cpa (collagen-binding protein of group A streptococci) expressed by 28 serotypes of group A streptococci has been extensively characterized at the gene and protein levels. Evidence for three distinct families of cpa genes was found, all of which shared a common sequence encoding a 60-amino acid domain that accounted for selective binding to type I collagen. Surface plasmon resonance-based affinity measurements and functional studies indicated that the expression of Cpa was consistent with an attachment role for bacteria to tissue containing collagen type I. A cpa mutant displayed a significantly decreased internalization rate when incubated with HEp-2 cells but had no effect on the host cell viability. By utilizing serum from patients with a positive titer for streptolysin/DNase antibody, an increased anti-Cpa antibody titer was noted for patients with a clinical history of arthritis or osteomyelitis. Taken together, these results suggest Cpa may be a relevant matrix adhesin contributing to the pathogenesis of S. pyogenes infection of bones and joints.
Effects of Calendula officinalis on human gingival fibroblasts.
Saini, Pragtipal; Al-Shibani, Nouf; Sun, Jun; Zhang, Weiping; Song, Fengyu; Gregson, Karen S; Windsor, L Jack
2012-04-01
Calendula officinalis is commonly called the marigold. It is a staple topical remedy in homeopathic medicine. It is rich in quercetin, carotenoids, lutein, lycopene, rutin, ubiquinone, xanthophylls, and other anti-oxidants. It has anti-inflammatory properties. Quercetin, one of the active components in Calendula, has been shown to inhibit recombinant human matrix metalloproteinase (MMP) activity and decrease the expression of tumor necrosis factor-α, interleukin-1β (IL), IL-6 and IL-8 in phorbol 12-myristate 13-acetate and calcium ionophore-stimulated human mast cells. To examine the effects of Calendula on human gingival fibroblast (HGF) mediated collagen degradation and MMP activity. Lactate dehydrogenate assays were performed to determine the non-toxic concentrations of Calendula, doxycycline and quercetin. Cell-mediated collagen degradation assays were performed to examine the inhibitory effect on cell-mediated collagen degradation. Gelatin zymography was performed to examine their effects on MMP-2 activity. The experiments were repeated three times and ANOVA used for statistical analyses. Calendula at 2-3% completely inhibited the MMP-2 activity in the zymograms. Doxycycline inhibited HGF-mediated collagen degradation at 0.005, 0.01, 0.02 and 0.05%, and MMP-2 activity completely at 0.05%. Quercetin inhibited HGF-mediated collagen degradation at 0.005, 0.01 and 0.02%, and MMP-2 activity in a dose-dependent manner. Calendula inhibited HGF-mediated collagen degradation and MMP-2 activity more than the same correlated concentration of pure quercetin. Calendula inhibits HGF-mediated collagen degradation and MMP-2 activity more than the corresponding concentration of quercetin. This may be attributed to additional components in Calendula other than quercetin. Published by Elsevier Ltd.
Bhate, Manjiri; Wang, Xin; Baum, Jean; Brodsky, Barbara
2002-05-21
The collagen model peptide T1-892 includes a C-terminal nucleation domain, (Gly-Pro-Hyp)(4), and an N-terminal (Gly-X-Y)(6) sequence taken from type I collagen. In osteogenesis imperfecta (OI) and other collagen diseases, single base mutations often convert one Gly to a larger residue, and T1-892 homologues modeling such mutations were synthesized with Gly to Ala substitutions in either the (Gly-Pro-Hyp)(4) domain, Gly25Ala, or the (Gly-X-Y)(6) domain, Gly10Ala. CD and NMR studies show the Gly10Ala peptide forms a normal triple-helix at the C-terminal end and propagates from the C- to the N-terminus until the Gly --> Ala substitution is encountered. At this point, triple-helix folding is terminated and cannot be reinitiated, leaving a nonhelical N-terminus. A decreased thermal stability is observed as a result of the shorter length of the triple-helix. In contrast, introduction of the Gly to Ala replacement at position 25, in the nucleation domain, shifts the monomer/trimer equilibrium toward the monomer form. The increased monomer and lower trimer populations are reflected in the dramatic decrease in triple-helix content and stability. Unlike the Ala replacement at position 10, the Ala substitution in the (Gly-Pro-Hyp)(4) region can still be incorporated into a triple-helix, but at a greatly decreased rate of folding, since the original efficient nucleation site is no longer operative. The specific consequences of Gly to Ala replacements in two distinctive sequences in this triple-helical peptide may help clarify the variability in OI clinical severity resulting from mutations at different sites along type I collagen chains.
Bolzán, Alejandro D
2017-07-01
By definition, telomeric sequences are located at the very ends or terminal regions of chromosomes. However, several vertebrate species show blocks of (TTAGGG)n repeats present in non-terminal regions of chromosomes, the so-called interstitial telomeric sequences (ITSs), interstitial telomeric repeats or interstitial telomeric bands, which include those intrachromosomal telomeric-like repeats located near (pericentromeric ITSs) or within the centromere (centromeric ITSs) and those telomeric repeats located between the centromere and the telomere (i.e., truly interstitial telomeric sequences) of eukaryotic chromosomes. According with their sequence organization, localization and flanking sequences, ITSs can be classified into four types: 1) short ITSs, 2) subtelomeric ITSs, 3) fusion ITSs, and 4) heterochromatic ITSs. The first three types have been described mainly in the human genome, whereas heterochromatic ITSs have been found in several vertebrate species but not in humans. Several lines of evidence suggest that ITSs play a significant role in genome instability and evolution. This review aims to summarize our current knowledge about the origin, function, instability and evolution of these telomeric-like repeats in vertebrate chromosomes. Copyright © 2017 Elsevier B.V. All rights reserved.
Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine
2009-01-01
Clustered regularly interspaced short palindromic repeats (CRISPRs) are DNA sequences composed of a succession of repeats (23- to 47-bp long) separated by unique sequences called spacers. Polymorphism can be observed in different strains of a species and may be used for genotyping. We describe protocols and bioinformatics tools that allow the identification of CRISPRs from sequenced genomes, their comparison, and their component determination (the direct repeats and the spacers). A schematic representation of the spacer organization can be produced, allowing an easy comparison between strains.
Mapping Simple Repeated DNA Sequences in Heterochromatin of Drosophila Melanogaster
Lohe, A. R.; Hilliker, A. J.; Roberts, P. A.
1993-01-01
Heterochromatin in Drosophila has unusual genetic, cytological and molecular properties. Highly repeated DNA sequences (satellites) are the principal component of heterochromatin. Using probes from cloned satellites, we have constructed a chromosome map of 10 highly repeated, simple DNA sequences in heterochromatin of mitotic chromosomes of Drosophila melanogaster. Despite extensive sequence homology among some satellites, chromosomal locations could be distinguished by stringent in situ hybridizations for each satellite. Only two of the localizations previously determined using gradient-purified bulk satellite probes are correct. Eight new satellite localizations are presented, providing a megabase-level chromosome map of one-quarter of the genome. Five major satellites each exhibit a multichromosome distribution, and five minor satellites hybridize to single sites on the Y chromosome. Satellites closely related in sequence are often located near one another on the same chromosome. About 80% of Y chromosome DNA is composed of nine simple repeated sequences, in particular (AAGAC)(n) (8 Mb), (AAGAG)(n) (7 Mb) and (AATAT)(n) (6 Mb). Similarly, more than 70% of the DNA in chromosome 2 heterochromatin is composed of five simple repeated sequences. We have also generated a high resolution map of satellites in chromosome 2 heterochromatin, using a series of translocation chromosomes whose breakpoints in heterochromatin were ordered by N-banding. Finally, staining and banding patterns of heterochromatic regions are correlated with the locations of specific repeated DNA sequences. The basis for the cytochemical heterogeneity in banding appears to depend exclusively on the different satellite DNAs present in heterochromatin. PMID:8375654
Cech, Jennifer N; Peichel, Catherine L
2015-12-01
Centromere sequences exist as gaps in many genome assemblies due to their repetitive nature. Here we take an unbiased approach utilizing centromere protein A (CENP-A) chomatin immunoprecipitation followed by high-throughput sequencing to identify the centromeric repeat sequence in the threespine stickleback fish (Gasterosteus aculeatus). A 186-bp, AT-rich repeat was validated as centromeric using both fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on interphase nuclei and metaphase spreads. This repeat hybridizes strongly to the centromere on all chromosomes, with the exception of weak hybridization to the Y chromosome. Together, our work provides the first validated sequence information for the threespine stickleback centromere.
2012-01-01
Background Staphylococcus aureus Repeat (STAR) elements are a type of interspersed intergenic direct repeat. In this study the conservation and variation in these elements was explored by bioinformatic analyses of published staphylococcal genome sequences and through sequencing of specific STAR element loci from a large set of S. aureus isolates. Results Using bioinformatic analyses, we found that the STAR elements were located in different genomic loci within each staphylococcal species. There was no correlation between the number of STAR elements in each genome and the evolutionary relatedness of staphylococcal species, however higher levels of repeats were observed in both S. aureus and S. lugdunensis compared to other staphylococcal species. Unexpectedly, sequencing of the internal spacer sequences of individual repeat elements from multiple isolates showed conservation at the sequence level within deep evolutionary lineages of S. aureus. Whilst individual STAR element loci were demonstrated to expand and contract, the sequences associated with each locus were stable and distinct from one another. Conclusions The high degree of lineage and locus-specific conservation of these intergenic repeat regions suggests that STAR elements are maintained due to selective or molecular forces with some of these elements having an important role in cell physiology. The high prevalence in two of the more virulent staphylococcal species is indicative of a potential role for STAR elements in pathogenesis. PMID:23020678
Repeatless and repeat-based centromeres in potato: implications for centromere evolution.
Gong, Zhiyun; Wu, Yufeng; Koblízková, Andrea; Torres, Giovana A; Wang, Kai; Iovene, Marina; Neumann, Pavel; Zhang, Wenli; Novák, Petr; Buell, C Robin; Macas, Jirí; Jiang, Jiming
2012-09-01
Centromeres in most higher eukaryotes are composed of long arrays of satellite repeats. By contrast, most newly formed centromeres (neocentromeres) do not contain satellite repeats and instead include DNA sequences representative of the genome. An unknown question in centromere evolution is how satellite repeat-based centromeres evolve from neocentromeres. We conducted a genome-wide characterization of sequences associated with CENH3 nucleosomes in potato (Solanum tuberosum). Five potato centromeres (Cen4, Cen6, Cen10, Cen11, and Cen12) consisted primarily of single- or low-copy DNA sequences. No satellite repeats were identified in these five centromeres. At least one transcribed gene was associated with CENH3 nucleosomes. Thus, these five centromeres structurally resemble neocentromeres. By contrast, six potato centromeres (Cen1, Cen2, Cen3, Cen5, Cen7, and Cen8) contained megabase-sized satellite repeat arrays that are unique to individual centromeres. The satellite repeat arrays likely span the entire functional cores of these six centromeres. At least four of the centromeric repeats were amplified from retrotransposon-related sequences and were not detected in Solanum species closely related to potato. The presence of two distinct types of centromeres, coupled with the boom-and-bust cycles of centromeric satellite repeats in Solanum species, suggests that repeat-based centromeres can rapidly evolve from neocentromeres by de novo amplification and insertion of satellite repeats in the CENH3 domains.
Repeatless and Repeat-Based Centromeres in Potato: Implications for Centromere Evolution[C][W
Gong, Zhiyun; Wu, Yufeng; Koblížková, Andrea; Torres, Giovana A.; Wang, Kai; Iovene, Marina; Neumann, Pavel; Zhang, Wenli; Novák, Petr; Buell, C. Robin; Macas, Jiří; Jiang, Jiming
2012-01-01
Centromeres in most higher eukaryotes are composed of long arrays of satellite repeats. By contrast, most newly formed centromeres (neocentromeres) do not contain satellite repeats and instead include DNA sequences representative of the genome. An unknown question in centromere evolution is how satellite repeat-based centromeres evolve from neocentromeres. We conducted a genome-wide characterization of sequences associated with CENH3 nucleosomes in potato (Solanum tuberosum). Five potato centromeres (Cen4, Cen6, Cen10, Cen11, and Cen12) consisted primarily of single- or low-copy DNA sequences. No satellite repeats were identified in these five centromeres. At least one transcribed gene was associated with CENH3 nucleosomes. Thus, these five centromeres structurally resemble neocentromeres. By contrast, six potato centromeres (Cen1, Cen2, Cen3, Cen5, Cen7, and Cen8) contained megabase-sized satellite repeat arrays that are unique to individual centromeres. The satellite repeat arrays likely span the entire functional cores of these six centromeres. At least four of the centromeric repeats were amplified from retrotransposon-related sequences and were not detected in Solanum species closely related to potato. The presence of two distinct types of centromeres, coupled with the boom-and-bust cycles of centromeric satellite repeats in Solanum species, suggests that repeat-based centromeres can rapidly evolve from neocentromeres by de novo amplification and insertion of satellite repeats in the CENH3 domains. PMID:22968715
Kim, Minseong; Kim, WonJin; Kim, GeunHyung
2017-12-20
Optimally designed three-dimensional (3D) biomedical scaffolds for skeletal muscle tissue regeneration pose significant research challenges. Currently, most studies on scaffolds focus on the two-dimensional (2D) surface structures that are patterned in the micro-/nanoscales with various repeating sizes and shapes to induce the alignment of myoblasts and myotube formation. The 2D patterned surface clearly provides effective analytical results of pattern size and shape of the myoblast alignment and differentiation. However, it is inconvenient in terms of the direct application for clinical usage due to the limited thickness and 3D shapeability. Hence, the present study suggests an innovative hydrogel or synthetic structure that consists of uniaxially surface-patterned cylindrical struts for skeleton muscle regeneration. The alignment of the pattern on the hydrogel (collagen) and poly(ε-caprolactone) struts was attained with the fibrillation of poly(vinyl alcohol) and the leaching process. Various cell culture results indicate that the C2C12 cells on the micropatterned collagen structure were fully aligned, and that a significantly high level of myotube formation was achieved when compared to the collagen structures that were not treated with the micropatterning process.
Selective Activation of Transcription by a Novel CCAAT Binding Factor
NASA Astrophysics Data System (ADS)
Maity, Sankar N.; Golumbek, Paul T.; Karsenty, Gerard; de Crombrugghe, Benoit
1988-07-01
A novel CCAAT binding factor (CBF) composed of two different subunits has been extensively purified from rat liver. Both subunits are needed for specific binding to DNA. Addition of this purified protein to nuclear extracts of NIH 3T3 fibroblasts stimulates transcription from several promoters including the α 2(I) collagen, the α 1(I) collagen, the Rous sarcoma virus long terminal repeat (RSV-LTR), and the adenovirus major late promoter. Point mutations in the CCAAT motif that show either no binding or a decreased binding of CBF likewise abolish or reduce activation of transcription by CBF. Activation of transcription requires, therefore, the specific binding of CBF to its recognition sites.
Mlinarec, Jelena; Chester, Mike; Siljak-Yakovlev, Sonja; Papes, Drazena; Leitch, Andrew R; Besendorfer, Visnja
2009-01-01
The structure, abundance and location of repetitive DNA sequences on chromosomes can characterize the nature of higher plant genomes. Here we report on three new repeat DNA families isolated from Anemone hortensis L.; (i) AhTR1, a family of satellite DNA (stDNA) composed of a 554-561 bp long EcoRV monomer; (ii) AhTR2, a stDNA family composed of a 743 bp long HindIII monomer and; (iii) AhDR, a repeat family composed of a 945 bp long HindIII fragment that exhibits some sequence similarity to Ty3/gypsy-like retroelements. Fluorescence in-situ hybridization (FISH) to metaphase chromosomes of A. hortensis (2n = 16) revealed that both AhTR1 and AhTR2 sequences co-localized with DAPI-positive AT-rich heterochromatic regions. AhTR1 sequences occur at intercalary DAPI bands while AhTR2 sequences occur at 8-10 terminally located heterochromatic blocks. In contrast AhDR sequences are dispersed over all chromosomes as expected of a Ty3/gypsy-like element. AhTR2 and AhTR1 repeat families include polyA- and polyT-tracks, AT/TA-motifs and a pentanucleotide sequence (CAAAA) that may have consequences for chromatin packing and sequence homogeneity. AhTR2 repeats also contain TTTAGGG motifs and degenerate variants. We suggest that they arose by interspersion of telomeric repeats with subtelomeric repeats, before hybrid unit(s) amplified through the heterochromatic domain. The three repetitive DNA families together occupy approximately 10% of the A. hortensis genome. Comparative analyses of eight Anemone species revealed that the divergence of the A. hortensis genome was accompanied by considerable modification and/or amplification of repeats.
Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using
Weier, H.U.G.; Gray, J.W.
1995-06-27
A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.
Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using
Weier, Heinz-Ulrich G.; Gray, Joe W.
1995-01-01
A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.
NASA Astrophysics Data System (ADS)
Zeng, Like
Production of brand new protein-based materials with precise control over the amino acid sequences at single residue level has been made possible by genetic engineering, through which artificial genes can be developed that encode protein-based materials with desired features. As an example, silk-elastinlike protein polymers (SELPs), composed of tandem repeats of amino acid sequence motifs from Bombyx mori (silkworm) silk and mammalian elastin, have been produced in this approach. SELPs have been studied extensively in the past two decades, however, the fundamental mechanism governing the self-assembly process to date still remains largely unresolved. Further, regardless of the unprecedented success when exploited in areas including drug delivery, gene therapy, and tissue augmentation, SELPs scaffolds as a three-dimensional cell culture model system are complicated by the inability of SELPs to provide the embedded tissue cells with appropriate biochemical stimuli essential for cell survival and function. In this dissertation, it is reported that the self-assembly of silk-elastinlike protein polymers (SELPs) into nanofibers in aqueous solutions can be modulated by tuning the curing temperature, the size of the silk blocks, and the charge of the elastin blocks. A core-sheath model was proposed for nanofiber formation, with the silk blocks in the cores and the hydrated elastin blocks in the sheaths. The folding of the silk blocks into stable cores -- affected by the size of the silk blocks and the charge of the elastin blocks -- plays a critical role in the assembly of silk-elastin nanofibers. The assembled nanofibers further form nanofiber clusters on the microscale, and the nanofiber clusters then coalesce into nanofiber micro-assemblies, interconnection of which eventually leads to the formation of three-dimensional scaffolds with distinct nanoscale and microscale features. SELP-Collagen hybrid scaffolds were also fabricated to enable independent control over the scaffolds' biochemical input and matrix stiffness. It is reported herein that in the hybrid scaffolds, collagen provides essential biochemical cues needed to promote cell attachment and function while SELP imparts matrix stiffness tunability. To obtain tissue-specificity in matrix stiffness that spans over several orders of magnitude covering from soft brain to stiff cartilage, the hybrid SELP-Collagen scaffolds were crosslinked by transglutaminase at physiological conditions compatible for simultaneous cell encapsulation. The effect of the increase in matrix stiffness induced by such enzymatic crosslinking on cellular viability and proliferation was also evaluated using in vitro cell assays.
De novo identification of highly diverged protein repeats by probabilistic consistency.
Biegert, A; Söding, J
2008-03-15
An estimated 25% of all eukaryotic proteins contain repeats, which underlines the importance of duplication for evolving new protein functions. Internal repeats often correspond to structural or functional units in proteins. Methods capable of identifying diverged repeated segments or domains at the sequence level can therefore assist in predicting domain structures, inferring hypotheses about function and mechanism, and investigating the evolution of proteins from smaller fragments. We present HHrepID, a method for the de novo identification of repeats in protein sequences. It is able to detect the sequence signature of structural repeats in many proteins that have not yet been known to possess internal sequence symmetry, such as outer membrane beta-barrels. HHrepID uses HMM-HMM comparison to exploit evolutionary information in the form of multiple sequence alignments of homologs. In contrast to a previous method, the new method (1) generates a multiple alignment of repeats; (2) utilizes the transitive nature of homology through a novel merging procedure with fully probabilistic treatment of alignments; (3) improves alignment quality through an algorithm that maximizes the expected accuracy; (4) is able to identify different kinds of repeats within complex architectures by a probabilistic domain boundary detection method and (5) improves sensitivity through a new approach to assess statistical significance. Server: http://toolkit.tuebingen.mpg.de/hhrepid; Executables: ftp://ftp.tuebingen.mpg.de/pub/protevo/HHrepID
Detecting and Characterizing Repeating Earthquake Sequences During Volcanic Eruptions
NASA Astrophysics Data System (ADS)
Tepp, G.; Haney, M. M.; Wech, A.
2017-12-01
A major challenge in volcano seismology is forecasting eruptions. Repeating earthquake sequences often precede volcanic eruptions or lava dome activity, providing an opportunity for short-term eruption forecasting. Automatic detection of these sequences can lead to timely eruption notification and aid in continuous monitoring of volcanic systems. However, repeating earthquake sequences may also occur after eruptions or along with magma intrusions that do not immediately lead to an eruption. This additional challenge requires a better understanding of the processes involved in producing these sequences to distinguish those that are precursory. Calculation of the inverse moment rate and concepts from the material failure forecast method can lead to such insights. The temporal evolution of the inverse moment rate is observed to differ for precursory and non-precursory sequences, and multiple earthquake sequences may occur concurrently. These observations suggest that sequences may occur in different locations or through different processes. We developed an automated repeating earthquake sequence detector and near real-time alarm to send alerts when an in-progress sequence is identified. Near real-time inverse moment rate measurements can further improve our ability to forecast eruptions by allowing for characterization of sequences. We apply the detector to eruptions of two Alaskan volcanoes: Bogoslof in 2016-2017 and Redoubt Volcano in 2009. The Bogoslof eruption produced almost 40 repeating earthquake sequences between its start in mid-December 2016 and early June 2017, 21 of which preceded an explosive eruption, and 2 sequences in the months before eruptive activity. Three of the sequences occurred after the implementation of the alarm in late March 2017 and successfully triggered alerts. The nearest seismometers to Bogoslof are over 45 km away, requiring a detector that can work with few stations and a relatively low signal-to-noise ratio. During the Redoubt eruption, earthquake sequences were observed in the months leading up to the eruptive activity beginning in March 2009 as well as immediately preceding 7 of the 19 explosive events. In contrast to Bogoslof, Redoubt has a local monitoring network which allows for better detection and more detailed analysis of the repeating earthquake sequences.
Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine
2007-01-01
Background In Archeae and Bacteria, the repeated elements called CRISPRs for "clustered regularly interspaced short palindromic repeats" are believed to participate in the defence against viruses. Short sequences called spacers are stored in-between repeated elements. In the current model, motifs comprising spacers and repeats may target an invading DNA and lead to its degradation through a proposed mechanism similar to RNA interference. Analysis of intra-species polymorphism shows that new motifs (one spacer and one repeated element) are added in a polarised fashion. Although their principal characteristics have been described, a lot remains to be discovered on the way CRISPRs are created and evolve. As new genome sequences become available it appears necessary to develop automated scanning tools to make available CRISPRs related information and to facilitate additional investigations. Description We have produced a program, CRISPRFinder, which identifies CRISPRs and extracts the repeated and unique sequences. Using this software, a database is constructed which is automatically updated monthly from newly released genome sequences. Additional tools were created to allow the alignment of flanking sequences in search for similarities between different loci and to build dictionaries of unique sequences. To date, almost six hundred CRISPRs have been identified in 475 published genomes. Two Archeae out of thirty-seven and about half of Bacteria do not possess a CRISPR. Fine analysis of repeated sequences strongly supports the current view that new motifs are added at one end of the CRISPR adjacent to the putative promoter. Conclusion It is hoped that availability of a public database, regularly updated and which can be queried on the web will help in further dissecting and understanding CRISPR structure and flanking sequences evolution. Subsequent analyses of the intra-species CRISPR polymorphism will be facilitated by CRISPRFinder and the dictionary creator. CRISPRdb is accessible at PMID:17521438
2013-01-01
Background Candida albicans is a ubiquitous opportunistic fungal pathogen that afflicts immunocompromised human hosts. With rare and transient exceptions the yeast is diploid, yet despite its clinical relevance the respective sequences of its two homologous chromosomes have not been completely resolved. Results We construct a phased diploid genome assembly by deep sequencing a standard laboratory wild-type strain and a panel of strains homozygous for particular chromosomes. The assembly has 700-fold coverage on average, allowing extensive revision and expansion of the number of known SNPs and indels. This phased genome significantly enhances the sensitivity and specificity of allele-specific expression measurements by enabling pooling and cross-validation of signal across multiple polymorphic sites. Additionally, the diploid assembly reveals pervasive and unexpected patterns in allelic differences between homologous chromosomes. Firstly, we see striking clustering of indels, concentrated primarily in the repeat sequences in promoters. Secondly, both indels and their repeat-sequence substrate are enriched near replication origins. Finally, we reveal an intimate link between repeat sequences and indels, which argues that repeat length is under selective pressure for most eukaryotes. This connection is described by a concise one-parameter model that explains repeat-sequence abundance in C. albicans as a function of the indel rate, and provides a general framework to interpret repeat abundance in species ranging from bacteria to humans. Conclusions The phased genome assembly and insights into repeat plasticity will be valuable for better understanding allele-specific phenomena and genome evolution. PMID:24025428
Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies.
Wincker, P; Jubier-Maurin, V; Roizès, G
1987-01-01
Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies. Images PMID:3684566
Plant chromosomes from end to end: telomeres, heterochromatin and centromeres.
Lamb, Jonathan C; Yu, Weichang; Han, Fangpu; Birchler, James A
2007-04-01
Recent evidence indicates that heterochromatin in plants is composed of heterogeneous sequences, which are usually composed of transposable elements or tandem repeat arrays. These arrays are associated with chromatin modifications that produce a closed configuration that limits transcription. Centromere sequences in plants are usually composed of tandem repeat arrays that are homogenized across the genome. Analysis of such arrays in closely related taxa suggests a rapid turnover of the repeat unit that is typical of a particular species. In addition, two lines of evidence for an epigenetic component of centromere specification have been reported, namely an example of a neocentromere formed over sequences without the typical repeat array and examples of centromere inactivation. Although the telomere repeat unit is quite prevalent in the plant kingdom, unusual repeats have been found in some families. Recently, it was demonstrated that the introduction of telomere sequences into plants cells causes truncation of the chromosomes, and that this technique can be used to produce artificial chromosome platforms.
SSRscanner: a program for reporting distribution and exact location of simple sequence repeats.
Anwar, Tamanna; Khan, Asad U
2006-02-20
Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They are also found at longer lengths (> 6 repeating units) of tracts. Most of the computer programs that find SSRs do not report its exact position. A computer program SSRscanner was written to find out distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search repeats of any length and produce outputs with their exact position on chromosome and their frequency of occurrence in the sequence. This program has been written in PERL and is freely available for non-commercial users by request from the authors. Please contact the authors by E-mail: huzzi99@hotmail.com.
Wang, Jun; Chang, Yaoguang; Wu, Fanxiu; Xu, Xiaoqi; Xue, Changhu
2018-04-15
Fucosylated chondroitin sulfate (fCS) is the major carbohydrate constituent of sea cucumber. However, the distribution of fCS in the sea cucumber body wall has not been fully described. We addressed this in the present study employing Apostichopus japonicus as the material, a sea cucumber species with significant commercial importance. It was found that fCS was covalently attached to collagen fibrils via O-glycosidic linkages. Transmission electron microscopy analysis revealed that fCS precipitate was present in gap regions of collagen fibrils as roughly globular or ellipsoidal dots. The fCS dots arranged circumferentially around the fibrils with an axial repeat period that matched the periodicity of the fibrils. Physicochemical analysis indicated that the presence of fCS significantly increased the negative charge of the fibrils. These findings provide novel insight into fCS distribution in the sea cucumber body wall and its supramolecular organization with other macromolecules. Copyright © 2018 Elsevier Ltd. All rights reserved.
2010-01-01
Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the repeat may be disseminated by HGT and intra-genomic shuffling. Conclusions We describe novel features of PARCELs (Palindromic Amphipathic Repeat Coding ELements), a set of widely distributed repeat protein domains and coding sequences that were likely acquired through HGT by diverse unicellular microbes, further mobilized and diversified within genomes, and co-opted for expression in the membrane proteome of some taxa. Disseminated by multiple gene-centric vehicles, ORFs harboring these elements enhance accessory gene pools as part of the "mobilome" connecting genomes of various clades, in taxa sharing common niches. PMID:20626840
A TALE-inspired computational screen for proteins that contain approximate tandem repeats.
Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias
2017-01-01
TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen.
A TALE-inspired computational screen for proteins that contain approximate tandem repeats
Krwawicz, Joanna
2017-01-01
TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen. PMID:28617832
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing.
Hribová, Eva; Neumann, Pavel; Matsumoto, Takashi; Roux, Nicolas; Macas, Jirí; Dolezel, Jaroslav
2010-09-16
Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection.
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing
2010-01-01
Background Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. Results In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. Conclusion A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic diversity studies and in marker-assisted selection. PMID:20846365
Optimization of sequence alignment for simple sequence repeat regions.
Jighly, Abdulqader; Hamwieh, Aladdin; Ogbonnaya, Francis C
2011-07-20
Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs).SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type.When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.
Do, Hoang Dang Khoa; Kim, Joo-Hwan
2017-01-01
Chloroplast genomes (cpDNA) are highly valuable resources for evolutionary studies of angiosperms, since they are highly conserved, are small in size, and play critical roles in plants. Slipped-strand mispairing (SSM) was assumed to be a mechanism for generating repeat units in cpDNA. However, research on the employment of different small repeated sequences through SSM events, which may induce the accumulation of distinct types of repeats within the same region in cpDNA, has not been documented. Here, we sequenced two chloroplast genomes from the endemic species Heloniopsis tubiflora (Korea) and Xerophyllum tenax (USA) to cover the gap between molecular data and explore "hot spots" for genomic events in Melanthiaceae. Comparative analysis of 23 complete cpDNA sequences revealed that there were different stages of deletion in the rps16 region across the Melanthiaceae. Based on the partial or complete loss of rps16 gene in cpDNA, we have firstly reported potential molecular markers for recognizing two sections ( Veratrum and Fuscoveratrum ) of Veratrum . Melathiaceae exhibits a significant change in the junction between large single copy and inverted repeat regions, ranging from trnH_GUG to a part of rps3 . Our results show an accumulation of tandem repeats in the rpl23-ycf2 regions of cpDNAs. Small conserved sequences exist and flank tandem repeats in further observation of this region across most of the examined taxa of Liliales. Therefore, we propose three scenarios in which different small repeated sequences were used during SSM events to generate newly distinct types of repeats. Occasionally, prior to the SSM process, point mutation event and double strand break repair occurred and induced the formation of initial repeat units which are indispensable in the SSM process. SSM may have likely occurred more frequently for short repeats than for long repeat sequences in tribe Parideae (Melanthiaceae, Liliales). Collectively, these findings add new evidence of dynamic results from SSM in chloroplast genomes which can be useful for further evolutionary studies in angiosperms. Additionally, genomics events in cpDNA are potential resources for mining molecular markers in Liliales.
Molecular and bioinformatic analysis of the FB-NOF transposable element.
Badal, Martí; Portela, Anna; Xamena, Noel; Cabré, Oriol
2006-04-12
The Drosophila melanogaster transposable element FB-NOF is known to play a role in genome plasticity through the generation of all sort of genomic rearrangements. Moreover, several insertional mutants due to FB mobilizations have been reported. Its structure and sequence, however, have been poorly studied mainly as a consequence of the long, complex and repetitive sequence of FB inverted repeats. This repetitive region is composed of several 154 bp blocks, each with five almost identical repeats. In this paper, we report the sequencing process of 2 kb long FB inverted repeats of a complete FB-NOF element, with high precision and reliability. This achievement has been possible using a new map of the FB repetitive region, which identifies unambiguously each repeat with new features that can be used as landmarks. With this new vision of the element, a list of FB-NOF in the D. melanogaster genomic clones has been done, improving previous works that used only bioinformatic algorithms. The availability of many FB and FB-NOF sequences allowed an analysis of the FB insertion sequences that showed no sequence specificity, but a preference for A/T rich sequences. The position of NOF into FB is also studied, revealing that it is always located after a second repeat in a random block. With the results of this analysis, we propose a model of transposition in which NOF jumps from FB to FB, using an unidentified transposase enzyme that should specifically recognize the second repeat end of the FB blocks.
The repetitive landscape of the chicken genome.
Wicker, Thomas; Robertson, Jon S; Schulze, Stefan R; Feltus, F Alex; Magrini, Vincent; Morrison, Jason A; Mardis, Elaine R; Wilson, Richard K; Peterson, Daniel G; Paterson, Andrew H; Ivarie, Robert
2005-01-01
Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7 x coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available.
The repetitive landscape of the chicken genome
Wicker, Thomas; Robertson, Jon S.; Schulze, Stefan R.; Feltus, F. Alex; Magrini, Vincent; Morrison, Jason A.; Mardis, Elaine R.; Wilson, Richard K.; Peterson, Daniel G.; Paterson, Andrew H.; Ivarie, Robert
2005-01-01
Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7× coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available. PMID:15256510
NASA Astrophysics Data System (ADS)
Makhotkina, L. Yu; Sharifullin, S. N.
2016-06-01
Research results shows that RF-plasma treatment increases the adhesion of the coating film to the leather uppers and resistance to abrasion and repeated bending of uppers, which define the ability of material to preserve its consumer properties and characterize longer safety of special purpose footwear form during its wearing.
Danilowicz, Claudia; Hermans, Laura; Coljee, Vincent; Prévost, Chantal
2017-01-01
Abstract During DNA recombination and repair, RecA family proteins must promote rapid joining of homologous DNA. Repeated sequences with >100 base pair lengths occupy more than 1% of bacterial genomes; however, commitment to strand exchange was believed to occur after testing ∼20–30 bp. If that were true, pairings between different copies of long repeated sequences would usually become irreversible. Our experiments reveal that in the presence of ATP hydrolysis even 75 bp sequence-matched strand exchange products remain quite reversible. Experiments also indicate that when ATP hydrolysis is present, flanking heterologous dsDNA regions increase the reversibility of sequence matched strand exchange products with lengths up to ∼75 bp. Results of molecular dynamics simulations provide insight into how ATP hydrolysis destabilizes strand exchange products. These results inspired a model that shows how pairings between long repeated sequences could be efficiently rejected even though most homologous pairings form irreversible products. PMID:28854739
Trinh, T. Q.; Sinden, R. R.
1993-01-01
We describe a system to measure the frequency of both deletions and duplications between direct repeats. Short 17- and 18-bp palindromic and nonpalindromic DNA sequences were cloned into the EcoRI site within the chloramphenicol acetyltransferase gene of plasmids pBR325 and pJT7. This creates an insert between direct repeated EcoRI sites and results in a chloramphenicol-sensitive phenotype. Selection for chloramphenicol resistance was utilized to select chloramphenicol resistant revertants that included those with precise deletion of the insert from plasmid pBR325 and duplication of the insert in plasmid pJT7. The frequency of deletion or duplication varied more than 500-fold depending on the sequence of the short sequence inserted into the EcoRI site. For the nonpalindromic inserts, multiple internal direct repeats and the length of the direct repeats appear to influence the frequency of deletion. Certain palindromic DNA sequences with the potential to form DNA hairpin structures that might stabilize the misalignment of direct repeats had a high frequency of deletion. Other DNA sequences with the potential to form structures that might destabilize misalignment of direct repeats had a very low frequency of deletion. Duplication mutations occurred at the highest frequency when the DNA between the direct repeats contained no direct or inverted repeats. The presence of inverted repeats dramatically reduced the frequency of duplications. The results support the slippage-misalignment model, suggesting that misalignment occurring during DNA replication leads to deletion and duplication mutations. The results also support the idea that the formation of DNA secondary structures during DNA replication can facilitate and direct specific mutagenic events. PMID:8325478
2014-01-01
Background DNA repeats, such as transposable elements, minisatellites and palindromic sequences, are abundant in sequences and have been shown to have significant and functional roles in the evolution of the host genomes. In a previous study, we introduced the concept of a repeat DNA module, a flexible motif present in at least two occurences in the sequences. This concept was embedded into ModuleOrganizer, a tool allowing the detection of repeat modules in a set of sequences. However, its implementation remains difficult for larger sequences. Results Here we present Visual ModuleOrganizer, a Java graphical interface that enables a new and optimized version of the ModuleOrganizer tool. To implement this version, it was recoded in C++ with compressed suffix tree data structures. This leads to less memory usage (at least 120-fold decrease in average) and decreases by at least four the computation time during the module detection process in large sequences. Visual ModuleOrganizer interface allows users to easily choose ModuleOrganizer parameters and to graphically display the results. Moreover, Visual ModuleOrganizer dynamically handles graphical results through four main parameters: gene annotations, overlapping modules with known annotations, location of the module in a minimal number of sequences, and the minimal length of the modules. As a case study, the analysis of FoldBack4 sequences clearly demonstrated that our tools can be extended to comparative and evolutionary analyses of any repeat sequence elements in a set of genomic sequences. With the increasing number of sequences available in public databases, it is now possible to perform comparative analyses of repeated DNA modules in a graphic and friendly manner within a reasonable time period. Availability Visual ModuleOrganizer interface and the new version of the ModuleOrganizer tool are freely available at: http://lcb.cnrs-mrs.fr/spip.php?rubrique313. PMID:24678954
Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.
Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V
1985-09-01
The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this.
USDA-ARS?s Scientific Manuscript database
Expressed sequence tag (EST) simple sequence repeats (SSRs) in Prunus were mined, and flanking primers designed and used for genome-wide characterization and selection of primers to optimize marker distribution and reliability. A total of 12,618 contigs were assembled from 84,727 ESTs, along with 34...
Microsatellite analysis in the genome of Acanthaceae: An in silico approach.
Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar
2015-01-01
Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future.
Alcivar-Warren, Acacia; Meehan-Meola, Dawn; Wang, Yongping; Guo, Ximing; Zhou, Linghua; Xiang, Jianhai; Moss, Shaun; Arce, Steve; Warren, William; Xu, Zhenkang; Bell, Kireina
2006-01-01
To develop genetic and physical maps for shrimp, accurate information on the actual number of chromosomes and a large number of genetic markers is needed. Previous reports have shown two different chromosome numbers for the Pacific whiteleg shrimp, Penaeus vannamei, the most important penaeid shrimp species cultured in the Western hemisphere. Preliminary results obtained by direct sequencing of clones from a Sau3A-digested genomic library of P. vannamei ovary identified a large number of (TAACC/GGTTA)-containing SSRs. The objectives of this study were to (1) examine the frequency of (TAACC)n repeats in 662 P. vannamei genomic clones that were directly sequenced, and perform homology searches of these clones, (2) confirm the number of chromosomes in testis of P. vannamei, and (3) localize the TAACC repeats in P. vannamei chromosome spreads using fluorescence in situ hybridization (FISH). Results for objective 1 showed that 395 out of the 662 clones sequenced contained single or multiple SSRs with three or more repeat motifs, 199 of which contained variable tandem repeats of the pentanucleotide (TAACC/GGTTA)n, with 3 to 14 copies per sequence. The frequency of (TAACC)n repeats in P. vannamei is 4.68 kb for SSRs with five or more repeat motifs. Sequence comparisons using the BLASTN nonredundant and expressed sequence tag (EST) databases indicated that most of the TAACC-containing clones were similar to either the core pentanucleotide repeat in PVPENTREP locus (GenBank accession no. X82619) or portions of 28S rRNA. Transposable elements (transposase for Tn1000 and reverse transcriptase family members), hypothetical or unnamed protein products, and genes of known function such as 18S and 28S rRNAs, heat shock protein 70, and thrombospondin were identified in non-TAACC-containing clones. For objective 2, the meiotic chromosome number of P. vannamei was confirmed as N = 44. For objective 3, four FISH probes (P1 to P4) containing different numbers of TAACC repeats produced positive signals on telomeres of P. vannamei chromosomes. A few chromosomes had positive signals interstitially. Probe signal strength and chromosome coverage differed in the general order of P1>P2>P3>P4, which correlated with the length of TAACC repeats within the probes: 83, 66, 35, and 30 bp, respectively, suggesting that the TAACC repeats, and not the flanking sequences, produced the TAACC signals at chromosome ends and TAACC is likely the telomere sequence for P. vannamei.
Kim, Min Jee; Im, Hyun Hwak; Lee, Kwang Youll; Han, Yeon Soo; Kim, Iksoo
2014-06-01
Abstract The complete nucleotide sequences of the mitochondrial genome from the whiter-spotted flower chafer, Protaetia brevitarsis (Coleoptera: Scarabaeidae), was determined. The 20,319-bp long circular genome is the longest among completely sequenced Coleoptera. As is typical in animals, the P. brevitarsis genome consisted of two ribosomal RNAs, 22 transfer RNAs, 13 protein-coding genes and one A + T-rich region. Although the size of the coding genes was typical, the non-coding A + T-rich region was 5654 bp, which is the longest in insects. The extraordinary length of this region was composed of 28,117-bp tandem repeats and 782-bp tandem repeats. These repeat sequences were encompassed by three non-repeat sequences constituting 1804 bp.
Parson, Walther; Ballard, David; Budowle, Bruce; Butler, John M; Gettings, Katherine B; Gill, Peter; Gusmão, Leonor; Hares, Douglas R; Irwin, Jodi A; King, Jonathan L; Knijff, Peter de; Morling, Niels; Prinz, Mechthild; Schneider, Peter M; Neste, Christophe Van; Willuweit, Sascha; Phillips, Christopher
2016-05-01
The DNA Commission of the International Society for Forensic Genetics (ISFG) is reviewing factors that need to be considered ahead of the adoption by the forensic community of short tandem repeat (STR) genotyping by massively parallel sequencing (MPS) technologies. MPS produces sequence data that provide a precise description of the repeat allele structure of a STR marker and variants that may reside in the flanking areas of the repeat region. When a STR contains a complex arrangement of repeat motifs, the level of genetic polymorphism revealed by the sequence data can increase substantially. As repeat structures can be complex and include substitutions, insertions, deletions, variable tandem repeat arrangements of multiple nucleotide motifs, and flanking region SNPs, established capillary electrophoresis (CE) allele descriptions must be supplemented by a new system of STR allele nomenclature, which retains backward compatibility with the CE data that currently populate national DNA databases and that will continue to be produced for the coming years. Thus, there is a pressing need to produce a standardized framework for describing complex sequences that enable comparison with currently used repeat allele nomenclature derived from conventional CE systems. It is important to discern three levels of information in hierarchical order (i) the sequence, (ii) the alignment, and (iii) the nomenclature of STR sequence data. We propose a sequence (text) string format the minimal requirement of data storage that laboratories should follow when adopting MPS of STRs. We further discuss the variant annotation and sequence comparison framework necessary to maintain compatibility among established and future data. This system must be easy to use and interpret by the DNA specialist, based on a universally accessible genome assembly, and in place before the uptake of MPS by the general forensic community starts to generate sequence data on a large scale. While the established nomenclature for CE-based STR analysis will remain unchanged in the future, the nomenclature of sequence-based STR genotypes will need to follow updated rules and be generated by expert systems that translate MPS sequences to match CE conventions in order to guarantee compatibility between the different generations of STR data. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
SSRscanner: a program for reporting distribution and exact location of simple sequence repeats
Anwar, Tamanna; Khan, Asad U
2006-01-01
Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They are also found at longer lengths (> 6 repeating units) of tracts. Most of the computer programs that find SSRs do not report its exact position. A computer program SSRscanner was written to find out distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search repeats of any length and produce outputs with their exact position on chromosome and their frequency of occurrence in the sequence. Availability This program has been written in PERL and is freely available for non-commercial users by request from the authors. Please contact the authors by E-mail: huzzi99@hotmail.com PMID:17597863
Structure and stability of the ankyrin domain of the Drosophila Notch receptor.
Zweifel, Mark E; Leahy, Daniel J; Hughson, Frederick M; Barrick, Doug
2003-11-01
The Notch receptor contains a conserved ankyrin repeat domain that is required for Notch-mediated signal transduction. The ankyrin domain of Drosophila Notch contains six ankyrin sequence repeats previously identified as closely matching the ankyrin repeat consensus sequence, and a putative seventh C-terminal sequence repeat that exhibits lower similarity to the consensus sequence. To better understand the role of the Notch ankyrin domain in Notch-mediated signaling and to examine how structure is distributed among the seven ankyrin sequence repeats, we have determined the crystal structure of this domain to 2.0 angstroms resolution. The seventh, C-terminal, ankyrin sequence repeat adopts a regular ankyrin fold, but the first, N-terminal ankyrin repeat, which contains a 15-residue insertion, appears to be largely disordered. The structure reveals a substantial interface between ankyrin polypeptides, showing a high degree of shape and charge complementarity, which may be related to homotypic interactions suggested from indirect studies. However, the Notch ankyrin domain remains largely monomeric in solution, demonstrating that this interface alone is not sufficient to promote tight association. Using the structure, we have classified reported mutations within the Notch ankyrin domain that are known to disrupt signaling into those that affect buried residues and those restricted to surface residues. We show that the buried substitutions greatly decrease protein stability, whereas the surface substitutions have only a marginal affect on stability. The surface substitutions are thus likely to interfere with Notch signaling by disrupting specific Notch-effector interactions and map the sites of these interactions.
Functionally conserved cis-regulatory elements of COL18A1 identified through zebrafish transgenesis.
Kague, Erika; Bessling, Seneca L; Lee, Josephine; Hu, Gui; Passos-Bueno, Maria Rita; Fisher, Shannon
2010-01-15
Type XVIII collagen is a component of basement membranes, and expressed prominently in the eye, blood vessels, liver, and the central nervous system. Homozygous mutations in COL18A1 lead to Knobloch Syndrome, characterized by ocular defects and occipital encephalocele. However, relatively little has been described on the role of type XVIII collagen in development, and nothing is known about the regulation of its tissue-specific expression pattern. We have used zebrafish transgenesis to identify and characterize cis-regulatory sequences controlling expression of the human gene. Candidate enhancers were selected from non-coding sequence associated with COL18A1 based on sequence conservation among mammals. Although these displayed no overt conservation with orthologous zebrafish sequences, four regions nonetheless acted as tissue-specific transcriptional enhancers in the zebrafish embryo, and together recapitulated the major aspects of col18a1 expression. Additional post-hoc computational analysis on positive enhancer sequences revealed alignments between mammalian and teleost sequences, which we hypothesize predict the corresponding zebrafish enhancers; for one of these, we demonstrate functional overlap with the orthologous human enhancer sequence. Our results provide important insight into the biological function and regulation of COL18A1, and point to additional sequences that may contribute to complex diseases involving COL18A1. More generally, we show that combining functional data with targeted analyses for phylogenetic conservation can reveal conserved cis-regulatory elements in the large number of cases where computational alignment alone falls short. Copyright 2009 Elsevier Inc. All rights reserved.
Shu, Bin; Ni, Guo-Xin; Zhang, Lian-Yang; Li, Xiang-Ping; Jiang, Wan-Ling; Zhang, Li-Qun
2013-05-01
This study explored the inhibitory effect of the high-power helium-neon (He-Ne) laser on the growth of scars post trauma. For the in vitro study, human wound fibroblasts were exposed to the high-power He-Ne laser for 30 min, once per day with different power densities (10, 50, 100, and 150 mW/cm(2)). After 3 days of repeated irradiation with the He-Ne laser, fibroblast proliferation and collagen synthesis were evaluated. For in vivo evaluation, a wounded animal model of hypertrophic scar formation was established. At postoperative day 21, the high-power He-Ne laser irradiation (output power 120 mW, 6 mm in diameter, 30 min each session, every other day) was performed on 20 scars. At postoperative day 35, the hydroxyproline content, apoptosis rate, PCNA protein expression and FADD mRNA level were assessed. The in vitro study showed that the irradiation group that received the power densities of 100 and 150 mW/cm(2) showed decreases in the cell proliferation index, increases in the percentage of cells in the G0/G1 phase, and decreases in collagen synthesis and type I procollagen gene expression. In the in vivo animal studies, regions exposed to He-Ne irradiation showed a significant decrease in scar thickness as well as decreases in hydroxyproline levels and PCNA protein expression. Results from the in vitro and in vivo studies suggest that repeated irradiation with a He-Ne laser at certain power densities inhibits fibroblast proliferation and collagen synthesis, thereby inhibits the growth of hypertrophic scars.
NASA Astrophysics Data System (ADS)
Schütz, R.; Rabin, I.; Hahn, O.; Fratzl, P.; Masic, A.
2010-08-01
The collection generally known as Qumran scrolls or Dead Sea Scrolls (DSS) comprises some 900 highly fragmented manuscripts (mainly written on parchment) from the Second Temple period. In the years since their manufacture the writing materials have undergone serious deterioration due to a combination of natural ageing and environmental effects. Therefore, understanding quantitatively state of conservation of such manuscripts is a challenging task and a deep knowledge of damage pathways on all hierarchical levels (from molecular up to macroscopic) results of fundamental importance for a correct protection and conservation strategy. However, the degradation of parchments is very complex and not well understood process. Parchment is a final product of processing of animal skin and consist mainly of type I collagen, which is the most abundant constituent of the dermal matrix. Collagen molecule is built by folding of three polypeptide α-chains into a right-handed triple helix. Every α-chain is made by a repetitive sequence of (Gly-X-Y)n, where X and Y are often proline and hydroxyproline. Parallel and staggered collagen triple helices associate into fibrils, which than assemble into fibers. Deterioration of parchment is caused by chemical changes due to gelatinization, oxidation and hydrolysis of the collagen chains, promoted by several factors, summarized as biological and microbiological (bacteria, fungi etc.), heat, light, humidity and pollutants (1, 2). In this work we have focused on studying the collagen within parchments on two different levels of organization (molecular and fibrilar) by applying polarized Raman spectroscopic technique. Beside spectral information related to chemical bonding, polarization anisotropy of some collagen bands (i.e. amide I) has been used to explore organization of collagen on higher levels (three-dimensional arrangement of the triple-helix molecules and their alignment within a fibril of collagen). To this aim we have compared native and gelatinized (random coiled collagen), stretched and not stretched rat tail tendon (RTT), bovine skin collagen, new and artificially aged parchments and collagen fibers from the Temple scroll (Figure 1).
Hori, Hisae; Hattori, Shunji; Inouye, Sakae; Kimura, Akinori; Irie, Shinkichi; Miyazawa, Hiroshi; Sakaguchi, Masahiro
2002-10-01
Anaphylaxis to measles, mumps, and rubella vaccines has been reported. It has been found that most of these reactions to live vaccines are caused by type I allergy with the bovine gelatin present in the vaccines as an allergen. Gelatin mainly includes denatured type I collagen, which consists of alpha1 and alpha2 chains. We previously reported that allergic reactions to gelatin are caused by the type I collagen alpha2 (alpha2[I]) chain. To aid in the development of gelatin that has little or no allergenicity in human subjects, we investigated epitopes of bovine alpha2(I) chain with use of IgE in gelatin-sensitive children. Serum samples were collected from 15 patients who had systemic allergic reactions to vaccines and high levels of specific IgE to bovine gelatin. Eleven overlapping recombinant proteins that cover bovine alpha2(I) were prepared with a bacterial expression vector. We examined IgE reactivity to these recombinant proteins by means of ELISA. Fifteen peptides covering a major reactive recombinant protein were synthesized. The IgE-reacting epitope was identified by means of IgE-ELISA inhibition with these synthetic peptides and pooled serum from the patients. We found that of the 15 patients, 13 showed IgE reactivity to a recombinant protein (no. 3) spanning the central region of the collagenous domain ((418)Gly-(662)Pro). Furthermore, all 13 patients showed IgE reactivity to the 4-kd recombinant protein (no. 3a) spanning the region from (461)Pro to (500)Glu. In IgE-ELISA inhibition we found that a minimum IgE epitope of gelatin allergen was composed of the 10-amino-acid sequence (485)Ile-Pro-Gly-Glu-Phe-Gly-Leu-Pro-Gly-Pro(494). This sequence is not observed in the human type I collagen alpha1 and alpha2 chains, nor is it found in the bovine type I collagen alpha1 chain. We found that Ile-Pro-Gly-Glu-Phe-Gly-Leu-Pro-Gly-Pro is a major IgE epitope of the alpha2 chain of bovine type I collagen in patients with gelatin allergy. The degree of anaphylaxis to gelatin in vaccines might be reduced by digestion of this IgE-binding site in gelatin.
Laminin peptide YIGSR induces collagen synthesis in Hs27 human dermal fibroblasts
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yoon, Jong Hyuk; Kim, Jaeyoon; Lee, Hyeongjoo
Highlights: Black-Right-Pointing-Pointer We identify a function of the YIGSR peptide to enhance collagen synthesis in Hs27. Black-Right-Pointing-Pointer YIGSR peptide enhanced collagen type 1 synthesis both of gene and protein levels. Black-Right-Pointing-Pointer There were no changes in cell proliferation and MMP-1 level in YIGSR treatment. Black-Right-Pointing-Pointer The YIGSR effect on collagen synthesis mediated activation of FAK, pyk2 and ERK. Black-Right-Pointing-Pointer The YIGSR-induced FAK and ERK activation was modulated by FAK and MEK inhibitors. -- Abstract: The dermal ECM is synthesized from fibroblasts and is primarily compromised of fibrillar collagen and elastic fibers, which support the mechanical strength and resiliency of skin,more » respectively. Laminin, a major glycoprotein located in the basement membrane, promotes cell adhesion, cell growth, differentiation, and migration. The laminin tyrosine-isoleucine-glycine-serine-arginine (YIGSR) peptide, corresponding to the 929-933 sequence of the {beta}1 chain, is known to be a functional motif with effects on the inhibition of tumor metastasis, the regulation of sensory axonal response and the inhibition of angiogenesis through high affinity to the 67 kDa laminin receptor. In this study, we identified a novel function of the YIGSR peptide to enhance collagen synthesis in human dermal fibroblasts. To elucidate this novel function regarding collagen synthesis, we treated human dermal fibroblasts with YIGSR peptide in both a time- and dose-dependent manner. According to subsequent experiments, we found that the YIGSR peptide strongly enhanced collagen type 1 synthesis without changing cell proliferation or cellular MMP-1 level. This YIGSR peptide-mediated collagen type 1 synthesis was modulated by FAK inhibitor and MEK inhibitor. This study clearly reveals that YIGSR peptide plays a novel function on the collagen type 1 synthesis of dermal fibroblasts and also suggests that YIGSR is a strong candidate peptide for the treatment of skin aging and wrinkles.« less
Molecular architecture of classical cytological landmarks: Centromeres and telomeres
DOE Office of Scientific and Technical Information (OSTI.GOV)
Meyne, J.
1994-11-01
Both the human telomere repeat and the pericentromeric repeat sequence (GGAAT)n were isolated based on evolutionary conservation. Their isolation was based on the premise that chromosomal features as structurally and functionally important as telomeres and centromeres should be highly conserved. Both sequences were isolated by high stringency screening of a human repetitive DNA library with rodent repetitive DNA. The pHuR library (plasmid Human Repeat) used for this project was enriched for repetitive DNA by using a modification of the standard DNA library preparation method. Usually DNA for a library is cut with restriction enzymes, packaged, infected, and the library ismore » screened. A problem with this approach is that many tandem repeats don`t have any (or many) common restriction sites. Therefore, many of the repeat sequences will not be represented in the library because they are not restricted to a viable length for the vector used. To prepare the pHuR library, human DNA was mechanically sheared to a small size. These relatively short DNA fragments were denatured and then renatured to C{sub o}t 50. Theoretically only repetitive DNA sequences should renature under C{sub o}t 50 conditions. The single-stranded regions were digested using S1 nuclease, leaving the double-stranded, renatured repeat sequences.« less
Wei, Yunzhou; Chesne, Megan T.; Terns, Rebecca M.; Terns, Michael P.
2015-01-01
CRISPR-Cas systems are RNA-based immune systems that protect prokaryotes from invaders such as phages and plasmids. In adaptation, the initial phase of the immune response, short foreign DNA fragments are captured and integrated into host CRISPR loci to provide heritable defense against encountered foreign nucleic acids. Each CRISPR contains a ∼100–500 bp leader element that typically includes a transcription promoter, followed by an array of captured ∼35 bp sequences (spacers) sandwiched between copies of an identical ∼35 bp direct repeat sequence. New spacers are added immediately downstream of the leader. Here, we have analyzed adaptation to phage infection in Streptococcus thermophilus at the CRISPR1 locus to identify cis-acting elements essential for the process. We show that the leader and a single repeat of the CRISPR locus are sufficient for adaptation in this system. Moreover, we identified a leader sequence element capable of stimulating adaptation at a dormant repeat. We found that sequences within 10 bp of the site of integration, in both the leader and repeat of the CRISPR, are required for the process. Our results indicate that information at the CRISPR leader-repeat junction is critical for adaptation in this Type II-A system and likely other CRISPR-Cas systems. PMID:25589547
Xu, Huihui; Wu, Xiaoyan; Qin, Hao; Tian, Wenfang; Chen, Junliang; Sun, Lina; Fang, Mingming
2015-01-01
Diabetic nephropathy (DN) is one of the most common complications associated with diabetes and characterized by renal microvascular injury along with accelerated synthesis of extracellular matrix proteins causing tubulointerstitial fibrosis. Production of type I collagen, the major component of extracellular matrix, is augmented during renal fibrosis after chronic exposure to hyperglycemia. However, the transcriptional modulator responsible for the epigenetic manipulation leading to induction of type I collagen genes is not clearly defined. We show here that tubulointerstitial fibrosis as a result of DN was diminished in myocardin-related transcription factor A (MRTF-A) -deficient mice. In cultured renal tubular epithelial cells and the kidneys of mice with DN, MRTF-A was induced by glucose and synergized with glucose to activate collagen transcription. Notably, MRTF-A silencing led to the disappearance of prominent histone modifications indicative of transcriptional activation, including acetylated histone H3K18/K27 and trimethylated histone H3K4. Detailed analysis revealed that MRTF-A recruited p300, a histone acetyltransferase, and WD repeat-containing protein 5 (WDR5), a key component of the histone H3K4 methyltransferase complex, to the collagen promoters and engaged these proteins in transcriptional activation. Estradiol suppressed collagen production by dampening the expression and binding activity of MRTF-A and interfering with the interaction between p300 and WDR5 in renal epithelial cells. Therefore, targeting the MRTF-A–associated epigenetic machinery might yield interventional strategies against DN-associated renal fibrosis. PMID:25349198
Waye, J S; Willard, H F
1986-09-01
The centromeric regions of all human chromosomes are characterized by distinct subsets of a diverse tandemly repeated DNA family, alpha satellite. On human chromosome 17, the predominant form of alpha satellite is a 2.7-kilobase-pair higher-order repeat unit consisting of 16 alphoid monomers. We present the complete nucleotide sequence of the 16-monomer repeat, which is present in 500 to 1,000 copies per chromosome 17, as well as that of a less abundant 15-monomer repeat, also from chromosome 17. These repeat units were approximately 98% identical in sequence, differing by the exclusion of precisely 1 monomer from the 15-monomer repeat. Homologous unequal crossing-over is suggested as a probable mechanism by which the different repeat lengths on chromosome 17 were generated, and the putative site of such a recombination event is identified. The monomer organization of the chromosome 17 higher-order repeat unit is based, in part, on tandemly repeated pentamers. A similar pentameric suborganization has been previously demonstrated for alpha satellite of the human X chromosome. Despite the organizational similarities, substantial sequence divergence distinguishes these subsets. Hybridization experiments indicate that the chromosome 17 and X subsets are more similar to each other than to the subsets found on several other human chromosomes. We suggest that the chromosome 17 and X alpha satellite subsets may be related components of a larger alphoid subfamily which have evolved from a common ancestral repeat into the contemporary chromosome-specific subsets.
Post-translational control of collagen fibrillogenesis in mineralizing cultures of chick osteoblasts
NASA Technical Reports Server (NTRS)
Gerstenfeld, L. C.; Riva, A.; Hodgens, K.; Eyre, D. R.; Landis, W. J.
1993-01-01
Cultured osteoblasts from chick embryo calvaria were used as a model system to investigate the post-translational extracellular mechanisms controlling the macroassembly of collagen fibrils. The results of these studies demonstrated that cultured osteoblasts secreted a collagenous extracellular matrix that assembled and mineralized in a defined temporal and spatial sequence. The assembly of collagen occurred in a polarized fashion, such that successive orthogonal arrays of fibrils formed between successive cell layers proceeding from the culture surface toward the media. Mineralization followed in the same manner, being observed first in the deepest and oldest fibril layers. Collagen fibrillogenesis, the kinetics of cross-link formation, and collagen stability in the extracellular matrix of the cultures were examined over a 30 day culture period. Between days 8 and 12 in culture, collagen fibril diameters increased from < 30 nm to an average of 30-45 nm. Thereafter, diameters ranged in size from 20 to 200 nm. Quantitation of the collagen cross-linking residues, hydroxylysyl pyridinoline (HP) and lysyl pyridinoline (LP), showed that these mature cross-links increased from undetectable levels to concentrations found in normal chick bone. Analysis of the kinetics of their formation by pulse-chase labeling the cultures with [3H]lysine showed a doubling time of approximately 5 days. The relationships between cross-link formation, fibrillogenesis, and collagen stability were examined in cultures treated with beta-aminopropionitrile (beta-APN), a potent inhibitor of lysyl oxidase and cross-link formation. In beta-APN-treated cultures, total collagen synthesis was increased twofold, with no change in mRNA levels for type I collagen, whereas the amount of collagen accumulated in the cell layer was decreased by 50% and mineral deposition was reduced. The rate of collagen retention in the matrix was assessed by pulse-chase analysis of [3H]proline over a 16 day period in control and beta-APN-treated cultures. In control cultures, about 20% of the labeled collagen was lost from the cell layers over a 16 day period compared with > 80% in the presence of beta-APN. The beta-APN-treated cultures also showed a wider diversity of fibril diameters with a median in the > 45-60 nm range. In summary, these data suggest that cross-linking and assembly of collagen fibrils secreted by osteoblasts in vitro occur in a fashion similar to that found in vivo. The rate of cross-link formation is relatively constant and may be correlated with increasing collagen mass.(ABSTRACT TRUNCATED AT 400 WORDS).
Shaw, D R; Richter, H; Giorda, R; Ohmachi, T; Ennis, H L
1989-09-01
A Dictyostelium discoideum repetitive element composed of long repeats of the codon (AAC) is found in developmentally regulated transcripts. The concentration of (AAC) sequences is low in mRNA from dormant spores and growing cells and increases markedly during spore germination and multicellular development. The sequence hybridizes to many different sized Dictyostelium DNA restriction fragments indicating that it is scattered throughout the genome. Four cDNA clones isolated contain (AAC) sequences in the deduced coding region. Interestingly, the (AAC)-rich sequences are present in all three reading frames in the deduced proteins, i.e., AAC (asparagine), ACA (threonine) and CAA (glutamine). Three of the clones contain only one of these in-frame so that the individual proteins carry either asparagine, threonine, or glutamine clusters, not mixtures. However, one clone is both glutamine- and asparagine-rich. The (AAC) portion of the transcripts are reiterated 300 times in the haploid genome while the other portions of the cDNAs represent single copy genes, whose sequences show no similarity other than the (AAC) repeats. The repeated sequence is similar to the opa or M sequence found in Drosophila melanogaster notch and homeo box genes and in fly developmentally regulated transcripts. The transcripts are present on polysomes suggesting that they are translated. Although the function of these repeats is unknown, long amino acid repeats are a characteristic feature of extracellular proteins of lower eukaryotes.
Urvoas, Agathe; Guellouz, Asma; Valerio-Lepiniec, Marie; Graille, Marc; Durand, Dominique; Desravines, Danielle C; van Tilbeurgh, Herman; Desmadril, Michel; Minard, Philippe
2010-11-26
Repeat proteins have a modular organization and a regular architecture that make them attractive models for design and directed evolution experiments. HEAT repeat proteins, although very common, have not been used as a scaffold for artificial proteins, probably because they are made of long and irregular repeats. Here, we present and validate a consensus sequence for artificial HEAT repeat proteins. The sequence was defined from the structure-based sequence analysis of a thermostable HEAT-like repeat protein. Appropriate sequences were identified for the N- and C-caps. A library of genes coding for artificial proteins based on this sequence design, named αRep, was assembled using new and versatile methodology based on circular amplification. Proteins picked randomly from this library are expressed as soluble proteins. The biophysical properties of proteins with different numbers of repeats and different combinations of side chains in hypervariable positions were characterized. Circular dichroism and differential scanning calorimetry experiments showed that all these proteins are folded cooperatively and are very stable (T(m) >70 °C). Stability of these proteins increases with the number of repeats. Detailed gel filtration and small-angle X-ray scattering studies showed that the purified proteins form either monomers or dimers. The X-ray structure of a stable dimeric variant structure was solved. The protein is folded with a highly regular topology and the repeat structure is organized, as expected, as pairs of alpha helices. In this protein variant, the dimerization interface results directly from the variable surface enriched in aromatic residues located in the randomized positions of the repeats. The dimer was crystallized both in an apo and in a PEG-bound form, revealing a very well defined binding crevice and some structure flexibility at the interface. This fortuitous binding site could later prove to be a useful binding site for other low molecular mass partners. Copyright © 2010 Elsevier Ltd. All rights reserved.
TRedD—A database for tandem repeats over the edit distance
Sokol, Dina; Atagun, Firat
2010-01-01
A ‘tandem repeat’ in DNA is a sequence of two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats are common in the genomes of both eukaryotic and prokaryotic organisms. They are significant markers for human identity testing, disease diagnosis, sequence homology and population studies. In this article, we describe a new database, TRedD, which contains the tandem repeats found in the human genome. The database is publicly available online, and the software for locating the repeats is also freely available. The definition of tandem repeats used by TRedD is a new and innovative definition based upon the concept of ‘evolutive tandem repeats’. In addition, we have developed a tool, called TandemGraph, to graphically depict the repeats occurring in a sequence. This tool can be coupled with any repeat finding software, and it should greatly facilitate analysis of results. Database URL: http://tandem.sci.brooklyn.cuny.edu/ PMID:20624712
Yamakawa, Tomohiro; Ohigashi, Hiroyuki; Hashimoto, Daigo; Hayase, Eiko; Takahashi, Shuichiro; Miyazaki, Miyono; Minomi, Kenjiro; Onozawa, Masahiro; Niitsu, Yoshiro; Teshima, Takanori
2018-03-29
Chronic graft-versus-host disease (GVHD) after allogeneic hematopoietic stem cell transplantation (SCT) is characterized by multiorgan fibrosis and profoundly affects the quality of life of transplant survivors. Heat shock protein 47 (HSP47), a collagen-specific molecular chaperone, plays a critical role in collagen synthesis in myofibroblasts. We explored the role of HSP47 in the fibrotic process of cutaneous chronic GVHD in mice. Immunohistochemical analysis showed massive fibrosis with elevated amounts of collagen deposits and accumulation of F4/80 + macrophages, as well as myofibroblasts expressing HSP47 and retinol-binding protein 1 in the skin after allogeneic SCT. Repeated injection of anti-colony-stimulating factor (CSF-1) receptor-blocking antibodies significantly reduced HSP47 + myofibroblasts in the skin, indicating a macrophage-dependent accumulation of myofibroblasts. Vitamin A-coupled liposomes carrying HSP47 small interfering RNA (siRNA) (VA-lip HSP47) delivered HSP47 siRNA to cells expressing vitamin A receptors and knocked down their HSP47 in vitro. Intravenously injected VA-lip HSP47 were specifically distributed to skin fibrotic lesions and did not affect collagen synthesis in healthy skin. VA-lip HSP47 knocked down HSP47 expression in myofibroblasts and significantly reduced collagen deposition without inducing systemic immunosuppression. It also abrogated fibrosis in the salivary glands. These results highlight a cascade of fibrosis in chronic GVHD; macrophage production of transforming growth factor β mediates fibroblast differentiation to HSP47 + myofibroblasts that produce collagen. VA-lip HSP47 represent a novel strategy to modulate fibrosis in chronic GVHD by targeting HSP47 + myofibroblasts without inducing immunosuppression. © 2018 by The American Society of Hematology.
The Contribution of Interchain Salt Bridges to Triple-Helical Stability in Collagen
Gurry, Thomas; Nerenberg, Paul S.; Stultz, Collin M.
2010-01-01
Abstract Studies on collagen and collagen-like peptides suggest that triple-helical stability can vary along the amino acid chain. In this regard, it has been shown that lysine residues in the Y position and acidic residues in the X′ position of (GPO)3GXYGX′Y′(GPO)3 peptides lead to triple-helical structures with melting temperatures similar to (GPO)8 (where O is hydroxyproline), which is generally regarded as the most stable collagen-like sequence of this length. This enhanced stability has been attributed to the formation of salt bridges between adjacent collagen chains. In this study, we explore the relationship between interchain salt bridge formation and triple-helical stability using detailed molecular simulations. Although our results confirm that salt bridges promote triple-helical stability, we find that not all salt bridges are created equal. In particular, lysine-glutamate salt bridges are most stabilizing when formed between residues in the middle strand (B) and the trailing strand (C), whereas lysine-aspartate salt bridges are most stabilizing when formed between residues in the leading (A) and middle (B) strand—the latter observation being consistent with recent NMR data on a heterotrimeric model peptide. Overall, we believe these data clarify the role of salt bridges in modulating triple-helical stability and can be used to guide the design of collagen-like peptides that have specific interchain interactions. PMID:20513408
Garcia, Yolanda; Hemantkumar, Naik; Collighan, Russell; Griffin, Martin; Rodriguez-Cabello, Jose Carlos; Pandit, Abhay
2009-04-01
Collagen, the main structural component of the extracellular matrix (ECM), provides tensile stiffness to different structures and organs against rupture. However, collagen tissue-engineered implants are hereto still lacking in mechanical strength. Attempts to create stiffer scaffolds have resulted in increased brittleness of the material, reducing the versatility of the original component. The hypothesis behind this research is that the introduction of an elastic element in the scaffold will enhance the mechanical properties of the collagen-based scaffolds, as elastin does in the ECM to prevent irreversible deformation. In this study, an elastin-like polymer (ELP) designed and synthesized using recombinant DNA methodology is used with the view to providing increased proteolytic resistance and increased functionality to the scaffolds by carrying specific sequences for microbial transglutaminase cross-linking, endothelial cell adhesion, and drug delivery. Evaluation of the effects that cross-linking ELP-collagen has on the physicochemical properties of the scaffold such as porosity, presence of cross-linking, thermal behavior, and mechanical strength demonstrated that the introduction of enzymatically resistant covalent bonds between collagen and ELP increases the mechanical strength of the scaffolds in a dose-dependent manner without significantly affecting the porosity or thermal properties of the original scaffold. Importantly, the scaffolds also showed selective behavior, in a dose (ELP)-dependent manner toward human umbilical vein endothelial cells and smooth muscle cells when compared to fibroblasts.
Park, Eun Hye; Kim, Seokho; Jo, Ji Yoon; Kim, Su Jin; Hwang, Yeonsil; Kim, Jin-Man; Song, Si Young; Lee, Dong-Ki; Koh, Sang Seok
2013-03-01
Collagen triple helix repeat containing-1 (CTHRC1) is a secreted protein involved in vascular remodeling, bone formation and developmental morphogenesis. CTHRC1 has recently been shown to be expressed in human cancers such as breast cancer and melanoma. In this study, we show that CTHRC1 is highly expressed in human pancreatic cancer tissues and plays a role in the progression and metastasis of the disease. CTHRC1 promoted primary tumor growth and metastatic spread of cancer cells to distant organs in orthotopic xenograft tumor mouse models. Overexpression of CTHRC1 in cancer cells resulted in increased motility and adhesiveness, whereas these cellular activities were diminished by down-regulation of the protein. CTHRC1 activated several key signaling molecules, including Src, focal adhesion kinase, paxillin, mitogen-activated protein kinase kinase (MEK), extracellular signal-regulated kinase and Rac1. Treatment with chemical inhibitors of Src, MEK or Rac1 and expression of dominant-negative Rac1 attenuated CTHRC1-induced cell migration and adhesion. Collectively, our results suggest that CTHRC1 has a role in pancreatic cancer progression and metastasis by regulating migration and adhesion activities of cancer cells.
Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta
2012-11-07
Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40 amino acid tandem repeat proteins and also from known cell wall proteins with repeat sequences. Several putative roles in plant physiology can be inferred from the characteristics found.
2012-01-01
Background Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. Results ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. Conclusions We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40 amino acid tandem repeat proteins and also from known cell wall proteins with repeat sequences. Several putative roles in plant physiology can be inferred from the characteristics found. PMID:23134664
A candidate gene for choanal atresia in alpaca.
Reed, Kent M; Bauer, Miranda M; Mendoza, Kristelle M; Armién, Aníbal G
2010-03-01
Choanal atresia (CA) is a common nasal craniofacial malformation in New World domestic camelids (alpaca and llama). CA results from abnormal development of the nasal passages and is especially debilitating to newborn crias. CA in camelids shares many of the clinical manifestations of a similar condition in humans (CHARGE syndrome). Herein we report on the regulatory gene CHD7 of alpaca, whose homologue in humans is most frequently associated with CHARGE. Sequence of the CHD7 coding region was obtained from a non-affected cria. The complete coding region was 9003 bp, corresponding to a translated amino acid sequence of 3000 aa. Additional genomic sequences corresponding to a significant portion of the CHD7 gene were identified and assembled from the 2x alpaca whole genome sequence, providing confirmatory sequence for much of the CHD7 coding region. The alpaca CHD7 mRNA sequence was 97.9% similar to the human sequence, with the greatest sequence difference being an insertion in exon 38 that results in a polyalanine repeat (A12). Polymorphism in this repeat was tested for association with CA in alpaca by cloning and sequencing the repeat from both affected and non-affected individuals. Variation in length of the poly-A repeat was not associated with CA. Complete sequencing of the CHD7 gene will be necessary to determine whether other mutations in CHD7 are the cause of CA in camelids.
Naveilhan, P; Baudet, C; Jabbour, W; Wion, D
1994-09-01
A model that may explain the limited division potential of certain cells such as human fibroblasts in culture is presented. The central postulate of this theory is that there exists, prior to certain key exons that code for materials needed for cell division, a unique sequence of specific repeating segments of DNA. One copy of such repeating segments is deleted during each cell cycle in cells that are not protected from such deletion through methylation of their cytosine residues. According to this theory, the means through which such repeated sequences are removed, one per cycle, is through the sequential action of enzymes that act much as bacterial restriction enzymes do--namely to produce scissions in both strands of DNA in areas that correspond to the DNA base sequence recognition specificities of such enzymes. After the first scission early in a replicative cycle, that enzyme becomes inhibited, but the cleavage of the first site exposes the closest site in the repetitive element to the action of a second restriction enzyme after which that enzyme also becomes inhibited. Then repair occurs, regenerating the original first site. Through this sequential activation and inhibition of two different restriction enzymes, only one copy of the repeating sequence is deleted during each cell cycle. In effect, the repeating sequence operates as a precise counter of the numbers of cell doubling that have occurred since the cells involved differentiated during development.
Schroeter, Elena R.; Feranec, Robert S.; Vashishth, Deepak
2016-01-01
Vertebrate fossils have been collected for hundreds of years and are stored in museum collections around the world. These remains provide a readily available resource to search for preserved proteins; however, the vast majority of palaeoproteomic studies have focused on relatively recently collected bones with a well-known handling history. Here, we characterize proteins from the nasal turbinates of the first Castoroides ohioensis skull ever discovered. Collected in 1845, this is the oldest museum-curated specimen characterized using palaeoproteomic tools. Our mass spectrometry analysis detected many collagen I peptides, a peptide from haemoglobin beta, and in vivo and diagenetic post-translational modifications. Additionally, the identified collagen I sequences provide enough resolution to place C. ohioensis within Rodentia. This study illustrates the utility of archived museum specimens for both the recovery of preserved proteins and phylogenetic analyses. PMID:27306052
Kuipers, A G J; Kamstra, S A; de Jeu, M J; Visser, R G F
2002-01-01
Highly repetitive DNA sequences were isolated from genomic DNA libraries of Alstroemeria psittacina and A. inodora. Among the repetitive sequences that were isolated, tandem repeats as well as dispersed repeats could be discerned. The tandem repeats belonged to a family of interlinked Sau3A subfragments with sizes varying from 68-127 bp, and constituted a larger HinfI repeat of approximately 400 bp. Southern hybridization showed a similar molecular organization of the tandem repeats in each of the Brazilian Alstroemeria species tested. None of the repeats hybridized with DNA from Chilean Alstroemeria species, which indicates that they are specific for the Brazilian species. In-situ localization studies revealed the tandem repeats to be localized in clusters on the chromosomes of A. inodora and A. psittacina: distal hybridization sites were found on chromosome arms 2PS, 6PL, 7PS, 7PL and 8PL, interstitial sites on chromosome arms 2PL, 3PL, 4PL and 5PL. The applicability of the tandem repeats for cytogenetic analysis of interspecific hybrids and their role in heterochromatin organization are discussed.
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.
Fungtammasan, Arkarachai; Ananda, Guruprasad; Hile, Suzanne E; Su, Marcia Shu-Wei; Sun, Chen; Harris, Robert; Medvedev, Paul; Eckert, Kristin; Makova, Kateryna D
2015-05-01
Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using flank-based mapping, a computational pipeline that can detect the full spectrum of STR alleles from short-read data, can adapt to emerging read-mapping algorithms, and can be applied to heterogeneous genetic samples (e.g., tumors, viruses, and genomes of organelles). We used STR-FM to study STR error rates and patterns in publicly available human and in-house generated ultradeep plasmid sequencing data sets. We discovered that STRs sequenced with a PCR-free protocol have up to ninefold fewer errors than those sequenced with a PCR-containing protocol. We constructed an error correction model for genotyping STRs that can distinguish heterozygous alleles containing STRs with consecutive repeat numbers. Applying our model and pipeline to Illumina sequencing data with 100-bp reads, we could confidently genotype several disease-related long trinucleotide STRs. Utilizing this pipeline, for the first time we determined the genome-wide STR germline mutation rate from a deeply sequenced human pedigree. Additionally, we built a tool that recommends minimal sequencing depth for accurate STR genotyping, depending on repeat length and sequencing read length. The required read depth increases with STR length and is lower for a PCR-free protocol. This suite of tools addresses the pressing challenges surrounding STR genotyping, and thus is of wide interest to researchers investigating disease-related STRs and STR evolution. © 2015 Fungtammasan et al.; Published by Cold Spring Harbor Laboratory Press.
USDA-ARS?s Scientific Manuscript database
Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...
USDA-ARS?s Scientific Manuscript database
Simple sequence repeats (SSR) markers were developed from a small insert genomic library for Bipolaris sorokiniana, a mitosporic fungal pathogen that causes spot blotch and root rot in switchgrass. About 59% of sequenced clones (n=384) harbored various SSR motifs. After eliminating the redundant seq...
Are the TTAGG and TTAGGG telomeric repeats phylogenetically conserved in aculeate Hymenoptera?
NASA Astrophysics Data System (ADS)
Menezes, Rodolpho S. T.; Bardella, Vanessa B.; Cabral-de-Mello, Diogo C.; Lucena, Daercio A. A.; Almeida, Eduardo A. B.
2017-10-01
Despite the (TTAGG)n telomeric repeat supposed being the ancestral DNA motif of telomeres in insects, it was repeatedly lost within some insect orders. Notably, parasitoid hymenopterans and the social wasp Metapolybia decorata (Gribodo) lack the (TTAGG)n sequence, but in other representatives of Hymenoptera, this motif was noticed, such as different ant species and the honeybee. These findings raise the question of whether the insect telomeric repeat is or not phylogenetically predominant in Hymenoptera. Thus, we evaluated the occurrence of both the (TTAGG)n sequence and the vertebrate telomere sequence (TTAGGG)n using dot-blotting hybridization in 25 aculeate species of Hymenoptera. Our results revealed the absence of (TTAGG)n sequence in all tested species, elevating the number of hymenopteran families lacking this telomeric sequence to 13 out of the 15 tested families so far. The (TTAGGG)n was not observed in any tested species. Based on our data and compiled information, we suggest that the (TTAGG)n sequence was putatively lost in the ancestor of Apocrita with at least two subsequent independent regains (in Formicidae and Apidae).
Macas, Jiří; Neumann, Pavel; Navrátilová, Alice
2007-01-01
Background Extraordinary size variation of higher plant nuclear genomes is in large part caused by differences in accumulation of repetitive DNA. This makes repetitive DNA of great interest for studying the molecular mechanisms shaping architecture and function of complex plant genomes. However, due to methodological constraints of conventional cloning and sequencing, a global description of repeat composition is available for only a very limited number of higher plants. In order to provide further data required for investigating evolutionary patterns of repeated DNA within and between species, we used a novel approach based on massive parallel sequencing which allowed a comprehensive repeat characterization in our model species, garden pea (Pisum sativum). Results Analysis of 33.3 Mb sequence data resulted in quantification and partial sequence reconstruction of major repeat families occurring in the pea genome with at least thousands of copies. Our results showed that the pea genome is dominated by LTR-retrotransposons, estimated at 140,000 copies/1C. Ty3/gypsy elements are less diverse and accumulated to higher copy numbers than Ty1/copia. This is in part due to a large population of Ogre-like retrotransposons which alone make up over 20% of the genome. In addition to numerous types of mobile elements, we have discovered a set of novel satellite repeats and two additional variants of telomeric sequences. Comparative genome analysis revealed that there are only a few repeat sequences conserved between pea and soybean genomes. On the other hand, all major families of pea mobile elements are well represented in M. truncatula. Conclusion We have demonstrated that even in a species with a relatively large genome like pea, where a single 454-sequencing run provided only 0.77% coverage, the generated sequences were sufficient to reconstruct and analyze major repeat families corresponding to a total of 35–48% of the genome. These data provide a starting point for further investigations of legume plant genomes based on their global comparative analysis and for the development of more sophisticated approaches for data mining. PMID:18031571
Molecular basis of length polymorphism in the human zeta-globin gene complex.
Goodbourn, S E; Higgs, D R; Clegg, J B; Weatherall, D J
1983-01-01
The length polymorphism between the human zeta-globin gene and its pseudogene is caused by an allele-specific variation in the copy number of a tandemly repeating 36-base-pair sequence. This sequence is related to a tandemly repeated 14-base-pair sequence in the 5' flanking region of the human insulin gene, which is known to cause length polymorphism, and to a repetitive sequence in intervening sequence (IVS) 1 of the pseudo-zeta-globin gene. Evidence is presented that the latter is also of variable length, probably because of differences in the copy number of the tandem repeat. The homology between the three length polymorphisms may be an indication of the presence of a more widespread group of related sequences in the human genome, which might be useful for generalized linkage studies. PMID:6308667
Hemalatha, G. R.; Rao, D. Satyanarayana; Guruprasad, L.
2007-01-01
We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure. PMID:17538688
Complete mitochondrial genome of the larch hawk moth, Sphinx morio (Lepidoptera: Sphingidae).
Kim, Min Jee; Choi, Sei-Woong; Kim, Iksoo
2013-12-01
The larch hawk moth, Sphinx morio, belongs to the lepidopteran family Sphingidae that has long been studied as a family of model insects in a diverse field. In this study, we describe the complete mitochondrial genome (mitogenome) sequences of the species in terms of general genomic features and characteristic short repetitive sequences found in the A + T-rich region. The 15,299-bp-long genome consisted of a typical set of genes (13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes) and one major non-coding A + T-rich region, with the typical arrangement found in Lepidoptera. The 316-bp-long A + T-rich region located between srRNA and tRNA(Met) harbored the conserved sequence blocks that are typically found in lepidopteran insects. Additionally, the A + T-rich region of S. morio contained three characteristic repeat sequences that are rarely found in Lepidoptera: two identical 12-bp repeat, three identical 5-bp-long tandem repeat, and six nearly identical 5-6 bp long repeat sequences.
Kumar, Pankaj; Chaitanya, Pasumarthy S; Nagarajaram, Hampapathalu A
2011-01-01
PSSRdb (Polymorphic Simple Sequence Repeats database) (http://www.cdfd.org.in/PSSRdb/) is a relational database of polymorphic simple sequence repeats (PSSRs) extracted from 85 different species of prokaryotes. Simple sequence repeats (SSRs) are the tandem repeats of nucleotide motifs of the sizes 1-6 bp and are highly polymorphic. SSR mutations in and around coding regions affect transcription and translation of genes. Such changes underpin phase variations and antigenic variations seen in some bacteria. Although SSR-mediated phase variation and antigenic variations have been well-studied in some bacteria there seems a lot of other species of prokaryotes yet to be investigated for SSR mediated adaptive and other evolutionary advantages. As a part of our on-going studies on SSR polymorphism in prokaryotes we compared the genome sequences of various strains and isolates available for 85 different species of prokaryotes and extracted a number of SSRs showing length variations and created a relational database called PSSRdb. This database gives useful information such as location of PSSRs in genomes, length variation across genomes, the regions harboring PSSRs, etc. The information provided in this database is very useful for further research and analysis of SSRs in prokaryotes.
Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.
Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V
1985-01-01
The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this. Images PMID:3016521
Effecting skin renewal: a multifaceted approach.
Widgerow, Alan D; Grekin, Steven K
2011-06-01
The skin undergoes intrinsic aging as a normal course, but exposure to ultraviolet (UV) light results in major cumulative damage that manifests as the typical aged photodamaged skin. UV irradiation produces a sequence of changes within the skin layers starting with signaling processes following DNA damage and culminating in nonabsorbed fragmentation of collagen and other proteins within the extracellular matrix. These fragments promote the synthesis of matrix metalloproteinases (MMPs) that further aggravate the damage to the ground substance and add to fragment accumulation. This study describes a unique sequential approach to controlling this photodamage - inhibition of signaling, inhibition of MMPs, proteasome stimulation and mopping up of fragments, stimulation of procollagen and collagen production, and uniform packaging of new collagen fibers. Thus, a multifaceted approach is introduced with presentation of a unique product formulation based on these research principles. © 2011 Wiley Periodicals, Inc.
Spectroscopic insights into quadruplexes of five-repeat telomere DNA sequences upon G-block damage.
Dvořáková, Zuzana; Vorlíčková, Michaela; Renčiuk, Daniel
2017-11-01
The DNA lesions, resulting from oxidative damage, were shown to destabilize human telomere four-repeat quadruplex and to alter its structure. Long telomere DNA, as a repetitive sequence, offers, however, other mechanisms of dealing with the lesion: extrusion of the damaged repeat into loop or shifting the quadruplex position by one repeat. Using circular dichroism and UV absorption spectroscopy and polyacrylamide electrophoresis, we studied consequences of lesions at different positions of the model five-repeat human telomere DNA sequences on the structure and stability of their quadruplexes in sodium and in potassium. The repeats affected by lesion are preferentially positioned as terminal overhangs of the core quadruplex structurally similar to the four-repeat one. Forced affecting of the inner repeats leads to presence of variety of more parallel folds in potassium. In sodium the designed models form mixture of two dominant antiparallel quadruplexes whose population varies with the position of the affected repeat. The shapes of quadruplex CD spectra, namely the height of dominant peaks, significantly correlate with melting temperatures. Lesion in one guanine tract of a more than four repeats long human telomere DNA sequence may cause re-positioning of its quadruplex arrangement associated with a shift of the structure to less common quadruplex conformations. The type of the quadruplex depends on the loop position and external conditions. The telomere DNA quadruplexes are quite resistant to the effect of point mutations due to the telomere DNA repetitive nature, although their structure and, consequently, function might be altered. Copyright © 2017. Published by Elsevier B.V.
Microsatellite analysis in the genome of Acanthaceae: An in silico approach
Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar
2015-01-01
Background: Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. Objective: The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. Materials and Methods: The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Results: Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. Conclusion: The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future. PMID:25709226
Van Kreijl, C F; Bos, J L
1977-01-01
The repeating nucleotide sequence of 68 base pairs in the mtDNA from an ethidium-induced cytoplasmic petite mutant of yeast has been determined. For sequence analysis specifically primed and terminated RNA copies, obtained by in vitro transcription of the separated strands, were use. The sequence consists of 66 consecutive AT base pairs flanked by two GC pairs and comprises nearly all of the mutant mitochondrial genome. The sequence, moreover, also represents the first part of wild-type mtDNA sequence so far. Images PMID:198740
A Lossy Compression Technique Enabling Duplication-Aware Sequence Alignment
Freschi, Valerio; Bogliolo, Alessandro
2012-01-01
In spite of the recognized importance of tandem duplications in genome evolution, commonly adopted sequence comparison algorithms do not take into account complex mutation events involving more than one residue at the time, since they are not compliant with the underlying assumption of statistical independence of adjacent residues. As a consequence, the presence of tandem repeats in sequences under comparison may impair the biological significance of the resulting alignment. Although solutions have been proposed, repeat-aware sequence alignment is still considered to be an open problem and new efficient and effective methods have been advocated. The present paper describes an alternative lossy compression scheme for genomic sequences which iteratively collapses repeats of increasing length. The resulting approximate representations do not contain tandem duplications, while retaining enough information for making their comparison even more significant than the edit distance between the original sequences. This allows us to exploit traditional alignment algorithms directly on the compressed sequences. Results confirm the validity of the proposed approach for the problem of duplication-aware sequence alignment. PMID:22518086
Tandemly repeated sequences in mtDNA control region of whitefish, Coregonus lavaretus.
Brzuzan, P
2000-06-01
Length variation of the mitochondrial DNA control region was observed with PCR amplification of a sample of 138 whitefish (Coregonus lavaretus). Nucleotide sequences of representative PCR products showed that the variation was due to the presence of an approximately 100-bp motif tandemly repeated two, three, or five times in the region between the conserved sequence block-3 (CSB-3) and the gene for phenylalanine tRNA. This is the first report on the tandem array composed of long repeat units in mitochondrial DNA of salmonids.
Blackburn, Patrick R; Xu, Zhi; Tumelty, Kathleen E; Zhao, Rose W; Monis, William J; Harris, Kimberly G; Gass, Jennifer M; Cousin, Margot A; Boczek, Nicole J; Mitkov, Mario V; Cappel, Mark A; Francomano, Clair A; Parisi, Joseph E; Klee, Eric W; Faqeih, Eissa; Alkuraya, Fowzan S; Layne, Matthew D; McDonnell, Nazli B; Atwal, Paldeep S
2018-04-05
AEBP1 encodes the aortic carboxypeptidase-like protein (ACLP) that associates with collagens in the extracellular matrix (ECM) and has several roles in development, tissue repair, and fibrosis. ACLP is expressed in bone, the vasculature, and dermal tissues and is involved in fibroblast proliferation and mesenchymal stem cell differentiation into collagen-producing cells. Aebp1 -/- mice have abnormal, delayed wound repair correlating with defects in fibroblast proliferation. In this study, we describe four individuals from three unrelated families that presented with a unique constellation of clinical findings including joint laxity, redundant and hyperextensible skin, poor wound healing with abnormal scarring, osteoporosis, and other features reminiscent of Ehlers-Danlos syndrome (EDS). Analysis of skin biopsies revealed decreased dermal collagen with abnormal collagen fibrils that were ragged in appearance. Exome sequencing revealed compound heterozygous variants in AEBP1 (c.1470delC [p.Asn490_Met495delins(40)] and c.1743C>A [p.Cys581 ∗ ]) in the first individual, a homozygous variant (c.1320_1326del [p.Arg440Serfs ∗ 3]) in the second individual, and a homozygous splice site variant (c.1630+1G>A) in two siblings from the third family. We show that ACLP enhances collagen polymerization and binds to several fibrillar collagens via its discoidin domain. These studies support the conclusion that bi-allelic pathogenic variants in AEBP1 are the cause of this autosomal-recessive EDS subtype. Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Heideman, Simone G; van Ede, Freek; Nobre, Anna C
2018-05-24
In daily life, temporal expectations may derive from incidental learning of recurring patterns of intervals. We investigated the incidental acquisition and utilisation of combined temporal-ordinal (spatial/effector) structure in complex visual-motor sequences using a modified version of a serial reaction time (SRT) task. In this task, not only the series of targets/responses, but also the series of intervals between subsequent targets was repeated across multiple presentations of the same sequence. Each participant completed three sessions. In the first session, only the repeating sequence was presented. During the second and third session, occasional probe blocks were presented, where a new (unlearned) spatial-temporal sequence was introduced. We first confirm that participants not only got faster over time, but that they were slower and less accurate during probe blocks, indicating that they incidentally learned the sequence structure. Having established a robust behavioural benefit induced by the repeating spatial-temporal sequence, we next addressed our central hypothesis that implicit temporal orienting (evoked by the learned temporal structure) would have the largest influence on performance for targets following short (as opposed to longer) intervals between temporally structured sequence elements, paralleling classical observations in tasks using explicit temporal cues. We found that indeed, reaction time differences between new and repeated sequences were largest for the short interval, compared to the medium and long intervals, and that this was the case, even when comparing late blocks (where the repeated sequence had been incidentally learned), to early blocks (where this sequence was still unfamiliar). We conclude that incidentally acquired temporal expectations that follow a sequential structure can have a robust facilitatory influence on visually-guided behavioural responses and that, like more explicit forms of temporal orienting, this effect is most pronounced for sequence elements that are expected at short inter-element intervals. Copyright © 2017 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Evolutionary conservation of sequence and secondary structures inCRISPR repeats
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kunin, Victor; Sorek, Rotem; Hugenholtz, Philip
Clustered Regularly Interspaced Palindromic Repeats (CRISPRs) are a novel class of direct repeats, separated by unique spacer sequences of similar length, that are present in {approx}40% of bacterial and all archaeal genomes analyzed to date. More than 40 gene families, called CRISPR-associated sequences (CAS), appear in conjunction with these repeats and are thought to be involved in the propagation and functioning of CRISPRs. It has been proposed that the CRISPR/CAS system samples, maintains a record of, and inactivates invasive DNA that the cell has encountered, and therefore constitutes a prokaryotic analog of an immune system. Here we analyze CRISPR repeatsmore » identified in 195 microbial genomes and show that they can be organized into multiple clusters based on sequence similarity. All individual repeats in any given cluster were inferred to form characteristic RNA secondary structure, ranging from non-existent to pronounced. Stable secondary structures included G:U base pairs and exhibited multiple compensatory base changes in the stem region, indicating evolutionary conservation and functional importance. We also show that the repeat-based classification corresponds to, and expands upon, a previously reported CAS gene-based classification including specific relationships between CRISPR and CAS subtypes.« less
Ba Abdullah, Mohammed M; Palermo, Richard D; Palser, Anne L; Grayson, Nicholas E; Kellam, Paul; Correia, Samantha; Szymula, Agnieszka; White, Robert E
2017-12-01
Epstein-Barr virus (EBV) is a ubiquitous pathogen of humans that can cause several types of lymphoma and carcinoma. Like other herpesviruses, EBV has diversified through both coevolution with its host and genetic exchange between virus strains. Sequence analysis of the EBV genome is unusually challenging because of the large number and lengths of repeat regions within the virus. Here we describe the sequence assembly and analysis of the large internal repeat 1 of EBV (IR1; also known as the BamW repeats) for more than 70 strains. The diversity of the latency protein EBV nuclear antigen leader protein (EBNA-LP) resides predominantly within the exons downstream of IR1. The integrity of the putative BWRF1 open reading frame (ORF) is retained in over 80% of strains, and deletions truncating IR1 always spare BWRF1. Conserved regions include the IR1 latency promoter (Wp) and one zone upstream of and two within BWRF1. IR1 is heterogeneous in 70% of strains, and this heterogeneity arises from sequence exchange between strains as well as from spontaneous mutation, with interstrain recombination being more common in tumor-derived viruses. This genetic exchange often incorporates regions of <1 kb, and allelic gene conversion changes the frequency of small regions within the repeat but not close to the flanks. These observations suggest that IR1-and, by extension, EBV-diversifies through both recombination and breakpoint repair, while concerted evolution of IR1 is driven by gene conversion of small regions. Finally, the prototype EBV strain B95-8 contains four nonconsensus variants within a single IR1 repeat unit, including a stop codon in the EBNA-LP gene. Repairing IR1 improves EBNA-LP levels and the quality of transformation by the B95-8 bacterial artificial chromosome (BAC). IMPORTANCE Epstein-Barr virus (EBV) infects the majority of the world population but causes illness in only a small minority of people. Nevertheless, over 1% of cancers worldwide are attributable to EBV. Recent sequencing projects investigating virus diversity to see if different strains have different disease impacts have excluded regions of repeating sequence, as they are more technically challenging. Here we analyze the sequence of the largest repeat in EBV (IR1). We first characterized the variations in protein sequences encoded across IR1. In studying variations within the repeat of each strain, we identified a mutation in the main laboratory strain of EBV that impairs virus function, and we suggest that tumor-associated viruses may be more likely to contain DNA mixed from two strains. The patterns of this mixing suggest that sequences can spread between strains (and also within the repeat) by copying sequence from another strain (or repeat unit) to repair DNA damage. Copyright © 2017 Ba abdullah et al.
Pilotte, Nils; Papaiakovou, Marina; Grant, Jessica R; Bierwert, Lou Ann; Llewellyn, Stacey; McCarthy, James S; Williams, Steven A
2016-03-01
The soil transmitted helminths are a group of parasitic worms responsible for extensive morbidity in many of the world's most economically depressed locations. With growing emphasis on disease mapping and eradication, the availability of accurate and cost-effective diagnostic measures is of paramount importance to global control and elimination efforts. While real-time PCR-based molecular detection assays have shown great promise, to date, these assays have utilized sub-optimal targets. By performing next-generation sequencing-based repeat analyses, we have identified high copy-number, non-coding DNA sequences from a series of soil transmitted pathogens. We have used these repetitive DNA elements as targets in the development of novel, multi-parallel, PCR-based diagnostic assays. Utilizing next-generation sequencing and the Galaxy-based RepeatExplorer web server, we performed repeat DNA analysis on five species of soil transmitted helminths (Necator americanus, Ancylostoma duodenale, Trichuris trichiura, Ascaris lumbricoides, and Strongyloides stercoralis). Employing high copy-number, non-coding repeat DNA sequences as targets, novel real-time PCR assays were designed, and assays were tested against established molecular detection methods. Each assay provided consistent detection of genomic DNA at quantities of 2 fg or less, demonstrated species-specificity, and showed an improved limit of detection over the existing, proven PCR-based assay. The utilization of next-generation sequencing-based repeat DNA analysis methodologies for the identification of molecular diagnostic targets has the ability to improve assay species-specificity and limits of detection. By exploiting such high copy-number repeat sequences, the assays described here will facilitate soil transmitted helminth diagnostic efforts. We recommend similar analyses when designing PCR-based diagnostic tests for the detection of other eukaryotic pathogens.
Evolution Analysis of Simple Sequence Repeats in Plant Genome.
Qin, Zhen; Wang, Yanping; Wang, Qingmei; Li, Aixian; Hou, Fuyun; Zhang, Liming
2015-01-01
Simple sequence repeats (SSRs) are widespread units on genome sequences, and play many important roles in plants. In order to reveal the evolution of plant genomes, we investigated the evolutionary regularities of SSRs during the evolution of plant species and the plant kingdom by analysis of twelve sequenced plant genome sequences. First, in the twelve studied plant genomes, the main SSRs were those which contain repeats of 1-3 nucleotides combination. Second, in mononucleotide SSRs, the A/T percentage gradually increased along with the evolution of plants (except for P. patens). With the increase of SSRs repeat number the percentage of A/T in C. reinhardtii had no significant change, while the percentage of A/T in terrestrial plants species gradually declined. Third, in dinucleotide SSRs, the percentage of AT/TA increased along with the evolution of plant kingdom and the repeat number increased in terrestrial plants species. This trend was more obvious in dicotyledon than monocotyledon. The percentage of CG/GC showed the opposite pattern to the AT/TA. Forth, in trinucleotide SSRs, the percentages of combinations including two or three A/T were in a rising trend along with the evolution of plant kingdom; meanwhile with the increase of SSRs repeat number in plants species, different species chose different combinations as dominant SSRs. SSRs in C. reinhardtii, P. patens, Z. mays and A. thaliana showed their specific patterns related to evolutionary position or specific changes of genome sequences. The results showed that, SSRs not only had the general pattern in the evolution of plant kingdom, but also were associated with the evolution of the specific genome sequence. The study of the evolutionary regularities of SSRs provided new insights for the analysis of the plant genome evolution.
Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N
2003-09-01
Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.
Kuroda, Tsuyoshi; Tomimatsu, Erika; Grondin, Simon; Miyazaki, Makoto
2016-11-01
We investigated how perceived duration of empty time intervals would be modulated by the length of sounds marking those intervals. Three sounds were successively presented in Experiment 1. Each sound was short (S) or long (L), and the temporal position of the middle sound's onset was varied. The lengthening of each sound resulted in delayed perception of the onset; thus, the middle sound's onset had to be presented earlier in the SLS than in the LSL sequence so that participants perceived the three sounds as presented at equal interonset intervals. In Experiment 2, a short sound and a long sound were alternated repeatedly, and the relative duration of the SL interval to the LS interval was varied. This repeated sequence was perceived as consisting of equal interonset intervals when the onsets of all sounds were aligned at physically equal intervals. If the same onset delay as in the preceding experiment had occurred, participants should have perceived equality between the interonset intervals in the repeated sequence when the SL interval was physically shortened relative to the LS interval. The effects of sound length seemed to be canceled out when the presentation of intervals was repeated. Finally, the perceived duration of the interonset intervals in the repeated sequence was not influenced by whether the participant's native language was French or Japanese, or by how the repeated sequence was perceptually segmented into rhythmic groups.
Genetic and DNA sequence analysis of the kanamycin resistance transposon Tn903.
Grindley, N D; Joyce, C M
1980-01-01
The kanamycin resistance transposon Tn903 consists of a unique region of about 1000 base pairs bounded by a pair of 1050-base-pair inverted repeat sequences. Each repeat contains two Pvu II endonuclease cleavage sites separated by 520 base pairs. We have constructed derivatives of Tn903 in which this 520-base-pair fragment is deleted from one or both repeats. Those derivatives that lack both 520-base-pair fragments cannot transpose, whereas those that lack just one remain transposition proficient. One such transposable derivative, Tn903 delta I, has been selected for further study. We have determined the sequence of the intact inverted repeat. The 18 base pairs at each end are identical and inverted relative to one another, a structure characteristic of insertion sequences. Additional experiments indicate that a single inverted repeat from Tn903 can, in fact, transpose; we propose that this element be called IS903. To correlate the DNA sequence with genetic activities, we have created mutations by inserting a 10-base-pair DNA fragment at several sites within the intact repeat of Tn903 delta 1, and we have examined the effect of such insertions on transposability. The results suggest that IS903 encodes a 307-amino-acid polypeptide (a "transposase") that is absolutely required for transposition of IS903 or Tn903. Images PMID:6261245
Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.
Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru
2015-01-01
The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.
Ni, Xiangyang; Westpheling, Janet
1997-01-01
The chi63 promoter directs glucose-sensitive, chitin-dependent transcription of a gene involved in the utilization of chitin as carbon source. Analysis of 5′ and 3′ deletions of the promoter region revealed that a 350-bp segment is sufficient for wild-type levels of expression and regulation. The analysis of single base changes throughout the promoter region, introduced by random and site-directed mutagenesis, identified several sequences to be important for activity and regulation. Single base changes at −10, −12, −32, −33, −35, and −37 upstream of the transcription start site resulted in loss of activity from the promoter, suggesting that bases in these positions are important for RNA polymerase interaction. The sequences centered around −10 (TATTCT) and −35 (TTGACC) in this promoter are, in fact, prototypical of eubacterial promoters. Overlapping the RNA polymerase binding site is a perfect 12-bp direct repeat sequence. Some base changes within this direct repeat resulted in constitutive expression, suggesting that this sequence is an operator for negative regulation. Other base changes resulted in loss of glucose repression while retaining the requirement for chitin induction, suggesting that this sequence is also involved in glucose repression. The fact that cis-acting mutations resulted in glucose resistance but not inducer independence rules out the possibility that glucose repression acts exclusively by inducer exclusion. The fact that mutations that affect glucose repression and chitin induction fall within the same direct repeat sequence module suggests that the direct repeat sequence facilitates both chitin induction and glucose repression. PMID:9371809
Zeng, Lu; Kortschak, R Daniel; Raison, Joy M; Bertozzi, Terry; Adelson, David L
2018-01-01
Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package.
Zeng, Lu; Kortschak, R. Daniel; Raison, Joy M.
2018-01-01
Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package. PMID:29538441
Oshima, Masao; Kikuchi, Rie; Imamura, Jun; Handa, Hirokazu
2010-01-01
CMS (cytoplasmic male sterile) rapeseed is produced by asymmetrical somatic cell fusion between the Brassica napus cv. Westar and the Raphanus sativus Kosena CMS line (Kosena radish). The CMS rapeseed contains a CMS gene, orf125, which is derived from Kosena radish. Our sequence analyses revealed that the orf125 region in CMS rapeseed originated from recombination between the orf125/orfB region and the nad1C/ccmFN1 region by way of a 63 bp repeat. A precise sequence comparison among the related sequences in CMS rapeseed, Kosena radish and normal rapeseed showed that the orf125 region in CMS rapeseed consisted of the Kosena orf125/orfB region and the rapeseed nad1C/ccmFN1 region, even though Kosena radish had both the orf125/orfB region and the nad1C/ccmFN1 region in its mitochondrial genome. We also identified three tandem repeat sequences in the regions surrounding orf125, including a 63 bp repeat, which were involved in several recombination events. Interestingly, differences in the recombination activity for each repeat sequence were observed, even though these sequences were located adjacent to each other in the mitochondrial genome. We report results indicating that recombination events within the mitochondrial genomes are regulated at the level of specific repeat sequences depending on the cellular environment.
Locke, John; Podemski, Lynn; Roy, Ken; Pilgrim, David; Hodgetts, Ross
1999-01-01
Chromosome 4 from Drosophila melanogaster has several unusual features that distinguish it from the other chromosomes. These include a diffuse appearance in salivary gland polytene chromosomes, an absence of recombination, and the variegated expression of P-element transgenes. As part of a larger project to understand these properties, we are assembling a physical map of this chromosome. Here we report the sequence of two cosmids representing ∼5% of the polytenized region. Both cosmid clones contain numerous repeated DNA sequences, as identified by cross hybridization with labeled genomic DNA, BLAST searches, and dot matrix analysis, which are positioned between and within the transcribed sequences. The repetitive sequences include three copies of the mobile element Hoppel, one copy of the mobile element HB, and 18 DINE repeats. DINE is a novel, short repeated sequence dispersed throughout both cosmid sequences. One cosmid includes the previously described cubitus interruptus (ci) gene and two new genes: that a gene with a predicted amino acid sequence similar to ribosomal protein S3a which is consistent with the Minute(4)101 locus thought to be in the region, and a novel member of the protein family that includes plexin and met–hepatocyte growth factor receptor. The other cosmid contains only the two short 5′-most exons from the zinc-finger-homolog-2 (zfh-2) gene. This is the first extensive sequence analysis of noncoding DNA from chromosome 4. The distribution of the various repeats suggests its organization is similar to the β-heterochromatic regions near the base of the major chromosome arms. Such a pattern may account for the diffuse banding of the polytene chromosome 4 and the variegation of many P-element transgenes on the chromosome. PMID:10022978
Galectin-1 suppresses alpha2(I) collagen through Smad3 in renal epithelial cells.
Okano, K; Uchida, K; Nitta, K; Hayashida, T
2008-10-01
Transforming growth factor (TGF-beta1) promotes renal fibrogenesis through activation of Smads. Galectin-1 is reported to prevent experimental glomerulonephritis. Here we investigated the fact that transfected galectin-1 significantly suppressed the transcription of alpha2(I) collagen (COL1A2) in TGF-beta1- activated human renal epithelial cells. Conversely, galectin-1 silencing RNA reduced secretion of type I collagen by HKC cells. Galectin-1 significantly decreased activation of a TGF-beta1-responsive reporter construct and of a minimal reporter construct that contains four repeats of the Smad binding element (SBE). Galectin-1 had no effect on phosphorylation of Smad3 at the linker region and C-terminus, whereas it decreased affinity of Smad3 to the SBE. Additionally, the inhibitory effect of galectin-1 disappeared using a mutated reporter construct, 376 m-LUC, in which a potential Smad recognition site within the promoter is mutated. Taken together, the results suggest that galectin-1 decreases Smad3-complex from binding to the SBE, down-regulating transcription of COL1A2 in TGF-beta1-stimulated renal epithelial cells.
Goren, Moran G; Yosef, Ido; Auster, Oren; Qimron, Udi
2012-10-12
We analyzed sequences of newly inserted repeats in an Escherichia coli CRISPR (clustered regularly interspaced short palindromic repeats) array in vivo and showed that a base previously thought to belong to the repeat is actually derived from a protospacer. Based on further experimental results, we propose to use the term "duplicon" for a repeated sequence in a CRISPR array that serves as a template for a new duplicon. Our findings suggest the possibility of redrawing the borders between repeats, spacers, and protospacer adjacent motifs. Copyright © 2012 Elsevier Ltd. All rights reserved.
Diamant, Eran; Palti, Yniv; Gur-Arie, Riva; Cohen, Helit; Hallerman, Eric M; Kashi, Yechezkel
2004-04-01
Multilocus sequencing of housekeeping genes has been used previously for bacterial strain typing and for inferring evolutionary relationships among strains of Escherichia coli. In this study, we used shorter intergenic sequences that contained simple sequence repeats (SSRs) of repeating mononucleotide motifs (mononucleotide repeats [MNRs]) to infer the phylogeny of pathogenic and commensal E. coli strains. Seven noncoding loci (four MNRs and three non-SSRs) were sequenced in 27 strains, including enterohemorrhagic (six isolates of O157:H7), enteropathogenic, enterotoxigenic, B, and K-12 strains. The four MNRs were also sequenced in 20 representative strains of the E. coli reference (ECOR) collection. Sequence polymorphism was significantly higher at the MNR loci, including the flanking sequences, indicating a higher mutation rate in the sequences flanking the MNR tracts. The four MNR loci were amplifiable by PCR in the standard ECOR A, B1, and D groups, but only one (yaiN) in the B2 group was amplified, which is consistent with previous studies that suggested that B2 is the most ancient group. High sequence compatibility was found between the four MNR loci, indicating that they are in the same clonal frame. The phylogenetic trees that were constructed from the sequence data were in good agreement with those of previous studies that used multilocus enzyme electrophoresis. The results demonstrate that MNR loci are useful for inferring phylogenetic relationships and provide much higher sequence variation than housekeeping genes. Therefore, the use of MNR loci for multilocus sequence typing should prove efficient for clinical diagnostics, epidemiology, and evolutionary study of bacteria.
Diamant, Eran; Palti, Yniv; Gur-Arie, Riva; Cohen, Helit; Hallerman, Eric M.; Kashi, Yechezkel
2004-01-01
Multilocus sequencing of housekeeping genes has been used previously for bacterial strain typing and for inferring evolutionary relationships among strains of Escherichia coli. In this study, we used shorter intergenic sequences that contained simple sequence repeats (SSRs) of repeating mononucleotide motifs (mononucleotide repeats [MNRs]) to infer the phylogeny of pathogenic and commensal E. coli strains. Seven noncoding loci (four MNRs and three non-SSRs) were sequenced in 27 strains, including enterohemorrhagic (six isolates of O157:H7), enteropathogenic, enterotoxigenic, B, and K-12 strains. The four MNRs were also sequenced in 20 representative strains of the E. coli reference (ECOR) collection. Sequence polymorphism was significantly higher at the MNR loci, including the flanking sequences, indicating a higher mutation rate in the sequences flanking the MNR tracts. The four MNR loci were amplifiable by PCR in the standard ECOR A, B1, and D groups, but only one (yaiN) in the B2 group was amplified, which is consistent with previous studies that suggested that B2 is the most ancient group. High sequence compatibility was found between the four MNR loci, indicating that they are in the same clonal frame. The phylogenetic trees that were constructed from the sequence data were in good agreement with those of previous studies that used multilocus enzyme electrophoresis. The results demonstrate that MNR loci are useful for inferring phylogenetic relationships and provide much higher sequence variation than housekeeping genes. Therefore, the use of MNR loci for multilocus sequence typing should prove efficient for clinical diagnostics, epidemiology, and evolutionary study of bacteria. PMID:15066845
Berasaín, P; Goñi, F; McGonigle, S; Dowd, A; Dalton, J P; Frangione, B; Carmona, C
1997-02-01
The invasive stages of the parasitic trematode Fasciola hepatica release proteinases into the medium in which they are maintained. In this study, we investigated the interaction of F. hepatica excretory/secretory (E/S) products and 2 cysteine proteinases (CL1 and CL2) purified from these products with extracellular matrix and basement membrane macromolecules. Fasciola hepatica E/S products contained collagenolytic activity on fibrillar types I and III collagen as well as basement membrane type IV collagen. CL1 and CL2 were capable of degrading acid-soluble type III and type IV collagen but not insoluble type I collagen. In contrast, neither the E/S products nor the purified CL1 and CL2 showed elastinolytic activity. Fibronectin and laminin were degraded by E/S products and by CL1 and CL2. Sequence analysis of fibronectin degradation products showed that the fragments obtained corresponded to complete biologically active domains. These results indicate that the cysteine proteinases secreted by F. hepatica may be involved in the process of tissue invasion by the parasite.
Quinn, J S; Guglich, E; Seutin, G; Lau, R; Marsolais, J; Parna, L; Boag, P T; White, B N
1992-02-01
The first tandemly repeated sequence examined in a passerine bird, a 431-bp PstI fragment named pMAT1, has been cloned from the genome of the brown-headed cowbird (Molothrus ater). The sequence represents about 5-10% of the genome (about 4 x 10(5) copies) and yields prominent ethidium bromide stained bands when genomic DNA cut with a variety of restriction enzymes is electrophoresed in agarose gels. A particularly striking ladder of fragments is apparent when the DNA is cut with HinfI, indicative of a tandem arrangement of the monomer. The cloned PstI monomer has been sequenced, revealing no internal repeated structure. There are sequences that hybridize with pMAT1 found in related nine-primaried oscines but not in more distantly related oscines, suboscines, or nonpasserine species. Little sequence similarity to tandemly repeated PstI cut sequences from the merlin (Falco columbarius), saurus crane (Grus antigone), or Puerto Rican parrot (Amazona vittata) or to HinfI digested sequence from the Toulouse goose (Anser anser) was detected. The isolated sequence was used as a probe to examine DNA samples of eight members of the tribe Icterini. This examination revealed phylogenetically informative characters. The repeat contains cutting sites from a number of restriction enzymes, which, if sufficiently polymorphic, would provide new phylogenetic characters. Sequences like these, conserved within a species, but variable between closely related species, may be very useful for phylogenetic studies of closely related taxa.
Simon-Lukasik, Kristine V.; Persikov, Anton V.; Brodsky, Barbara; Ramshaw, John A. M.; Laws, William R.; Alexander Ross, J. B.; Ludescher, Richard D.
2003-01-01
We report tryptophan fluorescence measurements of emission intensity, iodide quenching, and anisotropy that describe the environment and dynamics at X and Y sites in stable collagen-like peptides of sequence (Gly-X-Y)n. About 90% of tryptophans at both sites have similar solvent exposed fluorescence properties and a lifetime of 8.5–9 ns. Analysis of anisotropy decays using an associative model indicates that these long lifetime populations undergo rapid depolarizing motion with a 0.5 ns correlation time; however, the extent of fast motion at the Y site is considerably less than the essentially unrestricted motion at the X site. About 10% of tryptophans at both sites have a shorter (∼3 ns) lifetime indicating proximity to a protein quenching group; these minor populations are immobile on the peptide surface, depolarizing only by overall trimer rotation. Iodide quenching indicates that tryptophans at the X site are more accessible to solvent. Side chains at X sites are more solvent accessible and considerably more mobile than residues at Y sites and can more readily fluctuate among alternate intermolecular interactions in collagen fibrils. This fluorescence analysis of collagen-like peptides lays a foundation for studies on the structure, dynamics, and function of collagen and of triple-helical junctions in gelatin gels. PMID:12524302
“One code to find them all”: a perl tool to conveniently parse RepeatMasker output files
2014-01-01
Background Of the different bioinformatic methods used to recover transposable elements (TEs) in genome sequences, one of the most commonly used procedures is the homology-based method proposed by the RepeatMasker program. RepeatMasker generates several output files, including the .out file, which provides annotations for all detected repeats in a query sequence. However, a remaining challenge consists of identifying the different copies of TEs that correspond to the identified hits. This step is essential for any evolutionary/comparative analysis of the different copies within a family. Different possibilities can lead to multiple hits corresponding to a unique copy of an element, such as the presence of large deletions/insertions or undetermined bases, and distinct consensus corresponding to a single full-length sequence (like for long terminal repeat (LTR)-retrotransposons). These possibilities must be taken into account to determine the exact number of TE copies. Results We have developed a perl tool that parses the RepeatMasker .out file to better determine the number and positions of TE copies in the query sequence, in addition to computing quantitative information for the different families. To determine the accuracy of the program, we tested it on several RepeatMasker .out files corresponding to two organisms (Drosophila melanogaster and Homo sapiens) for which the TE content has already been largely described and which present great differences in genome size, TE content, and TE families. Conclusions Our tool provides access to detailed information concerning the TE content in a genome at the family level from the .out file of RepeatMasker. This information includes the exact position and orientation of each copy, its proportion in the query sequence, and its quality compared to the reference element. In addition, our tool allows a user to directly retrieve the sequence of each copy and obtain the same detailed information at the family level when a local library with incomplete TE class/subclass information was used with RepeatMasker. We hope that this tool will be helpful for people working on the distribution and evolution of TEs within genomes.
Rodríguez-Arias, Marta; Montagud-Romero, Sandra; Rubio-Araiz, Ana; Aguilar, María A; Martín-García, Elena; Cabrera, Roberto; Maldonado, Rafael; Porcu, Francesca; Colado, María Isabel; Miñarro, José
2017-01-01
Social stress in adulthood enhances cocaine self-administration, an effect that has been related with an increase in extracellular signal-regulated kinase and p38α mitogen-activated protein kinase phosphorylation. A detrimental effect of cocaine on blood-brain barrier (BBB) integrity has also been reported. This study evaluates the effects of repeated social defeat (RSD) during adolescence on the reinforcing and motivational effects of cocaine in adult mice and the changes induced by RSD on BBB permeability. Cocaine self-administration, conditioned place preference and quantitative analysis of claudin-5, laminin, collagen-IV and IgG immunoreactivity took place 3 weeks after RSD. Mice socially defeated during adolescence developed conditioned place preference and exhibited reinstated preference with a non-effective dose of cocaine (1 mg/kg). RSD mice needed significantly more sessions than control animals for the preference induced by 25 mg/kg of cocaine to be extinguished. However, acquisition of cocaine self-administration (0.5 mg/kg per injection) was delayed in the RSD group. Mice exposed to RSD displayed significant changes in BBB structure in adulthood, with a marked reduction in expression of the tight junction protein claudin-5 and an increase in basal laminin degradation (reflected by a decrease in laminin and collagen-IV expression) in the nucleus accumbens and hippocampus. The detrimental effect induced by cocaine (25 mg/kg) on collagen-IV expression in the hippocampus was more pronounced in RSD mice. In summary, our findings suggest that stress and cocaine can increase the long-term vulnerability of the brain to subsequent environmental insults as a consequence of a sustained disruption of the BBB. © 2015 Society for the Study of Addiction.
Collagen triple helix repeat containing 1 is a new promigratory marker of arthritic pannus.
Shekhani, Mohammed Talha; Forde, Toni S; Adilbayeva, Altynai; Ramez, Mohamed; Myngbay, Askhat; Bexeitov, Yergali; Lindner, Volkhard; Adarichev, Vyacheslav A
2016-07-19
The formation of destructive hypercellular pannus is critical to joint damage in rheumatoid arthritis (RA). The collagen triple helix repeat containing 1 (CTHRC1) protein expressed by activated stromal cells of diverse origin has previously been implicated in tissue remodeling and carcinogenesis. We recently discovered that the synovial Cthrc1 mRNA directly correlates with arthritis severity in mice. This study characterizes the role of CTHRC1 in arthritic pannus formation. Synovial joints of mice with collagen antibody-induced arthritis (CAIA) and human RA-fibroblast-like synoviocytes (FLS) were immunostained for CTHRC1, FLS and macrophage-specific markers. CTHRC1 levels in plasma from patients with RA were measured using sandwich ELISA. The migratory response of fibroblasts was studied with a transwell migration assay and time-lapse microscopy. Velocity and directness of cell migration was analyzed by recording the trajectories of cells treated with rhCTHRC1. Immunohistochemical analysis of normal and inflamed synovium revealed highly inducible expression of CTHRC1 in arthritis (10.9-fold). At the tissue level, CTHRC1-expressing cells occupied the same niche as large fibroblast-like cells positive for α-smooth muscle actin (α-SMA) and cadherin 11 (CDH11). CTHRC1 was produced by activated FLS predominantly located at the synovial intimal lining and at the bone-pannus interface. Cultured RA-FLS expressed CDH11, α-SMA, and CTHRC1. Upon treatment with exogenous rhCTHRC1, embryonic fibroblasts and RA-FLS significantly increased migration velocity, directness, and cell length along the front-tail axis (1.4-fold, p < 0.01). CTHRC1 was established as a novel marker of activated synoviocytes in murine experimental arthritis and RA. The pro-migratory effect of CTHRC1 on synoviocytes is considered one of the mechanisms promoting hypercellularity of the arthritic pannus.
Chagnot, Caroline; Agus, Allison; Renier, Sandra; Peyrin, Frédéric; Talon, Régine; Astruc, Thierry; Desvaux, Mickaël
2013-01-01
Enterohemorrhagic Escherichia coli (EHEC) O157:H7 are responsible for repeated food-poisoning cases often caused by contaminated burgers. EHEC infection is predominantly a pediatric illness, which can lead to life-threatening diseases. Ruminants are the main natural reservoir for EHEC and food contamination almost always originates from faecal contamination. In beef meat products, primary bacterial contamination occurs at the dehiding stage of slaughtering. The extracellular matrix (ECM) is the most exposed part of the skeletal muscles in beef carcasses. Investigating the adhesion to the main muscle fibrous ECM proteins, insoluble fibronectin, collagen I, III and IV, laminin-α2 and elastin, results demonstrated that the preceding growth conditions had a great influence on subsequent bacterial attachment. In the tested experimental conditions, maximal adhesion to fibril-forming collagens I or III occurred at 25°C and pH 7. Once initially adhered, exposure to lower temperatures, as applied to meat during cutting and storage, or acidification, as in the course of post-mortem physiological modifications of muscle, had no effect on detachment, except at pHu. In addition, dense biofilm formation occurred on immobilized collagen I or III and was induced in growth medium supplemented with collagen I in solution. From this first comprehensive investigation of EHEC adhesion to ECM proteins with respect to muscle biology and meat processing, new research directions for the development of innovative practices to minimize the risk of meat contamination are further discussed.
Ni, Y; Nesrallah, J; Agnew, M; Geske, F J; Favaloro, E J
2013-01-01
Introduction Laboratory diagnosis of von Willebrand disease (VWD) requires determination of both von Willebrand factor (VWF) protein levels and activity. Current VWF activity tests include the ristocetin cofactor assay and the collagen-binding assay (VWF:CB). The goal of this investigation is to characterize a new collagen-binding assay and to determine its effectiveness in identifying VWD. Methods Analytical studies were carried out to characterize the performance of a new VWF:CB ELISA. Additionally, samples from a normal population were tested as were well-characterized type 1 and type 2 VWD samples. Results Repeatability and within-laboratory precision studies resulted in coefficients of variation (CVs) of ≤11%. A linear range of 1–354% (0.01–3.54 IU/mL) was determined, along with a limit of detection and a lower limit of quantitation of 1.6% and 4.0% (0.016 and 0.04 IU/mL), respectively. Samples tested from apparently healthy individuals resulted in a normal range of 54–217% (0.54–2.17 IU/mL). Known VWD type 1 and type 2 samples were also analyzed by the ELISA, with 99% of samples having VWF:CB below the normal reference range and an estimated 96% sensitivity and 87% specificity using a VWF collagen-binding/antigen cutoff ratio of 0.50. Conclusion This new VWF:CB ELISA provides an accurate measure of collagen-binding activity that aids in the diagnosis and differentiation of type 1 from type 2 VWD. PMID:23107512
Effects of "D"-Amphetamine and Ethanol on Variable and Repetitive Key-Peck Sequences in Pigeons
ERIC Educational Resources Information Center
Ward, Ryan D.; Bailey, Ericka M.; Odum, Amy L.
2006-01-01
This experiment assessed the effects of "d"-Amphetamine and ethanol on reinforced variable and repetitive key-peck sequences in pigeons. Pigeons responded on two keys under a multiple schedule of Repeat and Vary components. In the Repeat component, completion of a target sequence of right, right, left, left resulted in food. In the Vary component,…
Alverson, Andrew J; Zhuo, Shi; Rice, Danny W; Sloan, Daniel B; Palmer, Jeffrey D
2011-01-20
The mitochondrial genomes of seed plants are exceptionally fluid in size, structure, and sequence content, with the accumulation and activity of repetitive sequences underlying much of this variation. We report the first fully sequenced mitochondrial genome of a legume, Vigna radiata (mung bean), and show that despite its unexceptional size (401,262 nt), the genome is unusually depauperate in repetitive DNA and "promiscuous" sequences from the chloroplast and nuclear genomes. Although Vigna lacks the large, recombinationally active repeats typical of most other seed plants, a PCR survey of its modest repertoire of short (38-297 nt) repeats nevertheless revealed evidence for recombination across all of them. A set of novel control assays showed, however, that these results could instead reflect, in part or entirely, artifacts of PCR-mediated recombination. Consequently, we recommend that other methods, especially high-depth genome sequencing, be used instead of PCR to infer patterns of plant mitochondrial recombination. The average-sized but repeat- and feature-poor mitochondrial genome of Vigna makes it ever more difficult to generalize about the factors shaping the size and sequence content of plant mitochondrial genomes.
Connolly, T M; Jacobs, J W; Condra, C
1992-04-05
A protein that blocks collagen-stimulated platelet aggregation has been identified and isolated from the soluble fraction of salivary glands from Haementeria officinalis leeches. We have named this protein leech antiplatelet protein (LAPP). LAPP was isolated from soluble crude salivary gland extract by heparin-agarose, size exclusion, and C18 reverse phase high-performance chromatography. Its molecular weight is approximately 16,000 on sodium dodecyl sulfate-polyacrylamide gel electrophoresis under both reduced and nonreduced conditions. The sequences of peptides generated by V8 digestion of LAPP as well as its amino acid composition suggested no homology to other known proteins. The IC50 for LAPP to inhibit platelet aggregation was approximately 60 nM. This inhibitory activity is specific for collagen-induced aggregation. Platelet aggregation in response to ADP, arachidonic acid, U46619, thrombin, and ionophore A23187 was not inhibited by LAPP at a concentration that blocked platelet aggregation to collagen by 100%. In contrast, crude salivary gland-soluble extract contained activity(ies) which inhibited aggregation to all these agonists except thrombin at 1 unit/ml and 2 microM A23187. Thus, the H. officinalis leech has evolved multiple mechanisms to prevent hemostasis, including an inhibitor of collagen-stimulated platelet aggregation. The identification and isolation of LAPP demonstrates the existence of a new type of platelet inhibitor that should be useful to better understand the mechanism of collagen stimulation of platelets.
Highly Informative Simple Sequence Repeat (SSR) Markers for Fingerprinting Hazelnut
USDA-ARS?s Scientific Manuscript database
Simple sequence repeat (SSR) or microsatellite markers have many applications in breeding and genetic studies of plants, including fingerprinting of cultivars and investigations of genetic diversity, and therefore provide information for better management of germplasm collections. They are repeatab...
Distribution and sequence homogeneity of an abundant satellite DNA in the beetle, Tenebrio molitor.
Davis, C A; Wyatt, G R
1989-01-01
The mealworm beetle, Tenebrio molitor, contains an unusually abundant and homogeneous satellite DNA which constitutes up to 60% of its genome. The satellite DNA is shown to be present in all of the chromosomes by in situ hybridization. 18 dimers of the repeat unit were cloned and sequenced. The consensus sequence is 142 nt long and lacks any internal repeat structure. Monomers of the sequence are very similar, showing on average a 2% divergence from the calculated consensus. Variant nucleotides are scattered randomly throughout the sequence although some variants are more common than others. Neighboring repeat units are no more alike than randomly chosen ones. The results suggest that some mechanism, perhaps gene conversion, is acting to maintain the homogeneity of the satellite DNA despite its abundance and distribution on all of the chromosomes. Images PMID:2762148
Izuchi, Yukari; Takashima, Tsuneo; Hatano, Naoya
2016-01-01
The demand for leather goods has grown globally in recent years. Industry revenue is forecast to reach $91.2 billion by 2018. There is an ongoing labelling problem in the leather items market, in that it is currently impossible to identify the species that a given piece of leather is derived from. To address this issue, we developed a rapid and simple method for the specific identification of leather derived from cattle, horses, pigs, sheep, goats, and deer by analysing peptides produced by the trypsin-digestion of proteins contained in leather goods using liquid chromatography/mass spectrometry. We determined species-specific amino acid sequences by liquid chromatography/tandem mass spectrometry analysis using the Mascot software program and demonstrated that collagen α-1(I), collagen α-2(I), and collagen α-1(III) from the dermal layer of the skin are particularly useful in species identification. PMID:27313979
Structure to function: Spider silk and human collagen
NASA Astrophysics Data System (ADS)
Rabotyagova, Olena S.
Nature has the ability to assemble a variety of simple molecules into complex functional structures with diverse properties. Collagens, silks and muscles fibers are some examples of fibrous proteins with self-assembling properties. One of the great challenges facing Science is to mimic these designs in Nature to find a way to construct molecules that are capable of organizing into functional supra-structures by self-assembly. In order to do so, a construction kit consisting of molecular building blocks along with a complete understanding on how to form functional materials is required. In this current research, the focus is on spider silk and collagen as fibrous protein-based biopolymers that can shed light on how to generate nanostructures through the complex process of self-assembly. Spider silk in fiber form offers a unique combination of high elasticity, toughness, and mechanical strength, along with biological compatibility and biodegrability. Spider silk is an example of a natural block copolymer, in which hydrophobic and hydrophilic blocks are linked together generating polymers that organize into functional materials with extraordinary properties. Since silks resemble synthetic block copolymer systems, we adopted the principles of block copolymer design from the synthetic polymer literature to build block copolymers based on spider silk sequences. Moreover, we consider spider silk to be an important model with which to study the relationships between structure and properties in our system. Thus, the first part of this work was dedicated to a novel family of spider silk block copolymers, where we generated a new family of functional spider silk-like block copolymers through recombinant DNA technology. To provide fundamental insight into relationships between peptide primary sequence, block composition, and block length and observed morphological and structural features, we used these bioengineered spider silk block copolymers to study secondary structure, morphological features and assembly. Aside from fundamental perspectives, we anticipate that these results will provide a blueprint for the design of precise materials for a range of potential applications such as controlled release devices, functional coatings, components of tissue regeneration materials and environmentally friendly polymers in future studies. In the second part of this work, human collagen type I was studied as another representative of the family of fibrous proteins. Collagen type I is the most abundant extracellular matrix protein in the human body, providing the basis for tissue structure and directing cellular functions. Collagen has a complex structural hierarchy, organized at different length scales, including the characteristic triple helical feature. In the present study we assessed the relationship between collagen structure (native vs. denatured) and sensitivity to UV radiation with a focus on changes in the primary structure, conformation, microstructure and material properties. Free radical reactions are involved in collagen degradation and a mechanism for UV-induced collagen degradation related to structure was proposed. The results from this study demonstrated the role of collagen supramolecular organization (triple helix) in the context of the effects of electromagnetic radiation on extracellular matrices. Owing to the fact that both silks and collagens are proteins that have found widespread interest for biomaterial related needs, we anticipate that the current studies will serve as a foundation for future biomaterial designs with controlled properties. Furthermore, fundamental insight into self-assembly and environmentally-2mediated degradation, will build a foundation for fundamental understanding of the remodeling and functions of these types of fibrous proteins in vivo and in vitro. This type of insight is essential for many areas of scientific inquiry, from drug delivery, to scaffolds for tissue engineering, and to the stability of materials in space.
Amino acid sequence analysis of the annexin super-gene family of proteins.
Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J
1991-06-15
The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. The structure confirms most of the predictions and shows the power of techniques for the determination of tertiary structural information from the amino acid sequences of an aligned protein family.
Malone, Andrew F; Funk, Steven D; Alhamad, Tarek; Miner, Jeffrey H
2017-06-01
Many COL4A5 splice region variants have been described in patients with X-linked Alport syndrome, but few have been confirmed by functional analysis to actually cause defective splicing. We sought to demonstrate that a novel COL4A5 splice region variant in a family with Alport syndrome is pathogenic using functional studies. We also describe an alternative method of diagnosis. Targeted next-generation sequencing results of an individual with Alport syndrome were analyzed and the results confirmed by Sanger sequencing in family members. A splicing reporter minigene assay was used to examine the variant's effect on splicing in transfected cells. Plucked hair follicles from patients and controls were examined for collagen IV proteins using immunofluorescence microscopy. A novel splice region mutation in COL4A5, c.1780-6T>G, was identified and segregated with disease in this family. This variant caused frequent skipping of exon 25, resulting in a frameshift and truncation of collagen α5(IV) protein. We also developed and validated a new approach to characterize the expression of collagen α5(IV) protein in the basement membranes of plucked hair follicles. Using this approach we demonstrated reduced collagen α5(IV) protein in affected male and female individuals in this family, supporting frequent failure of normal splicing. Differing normal to abnormal transcript ratios in affected individuals carrying splice region variants may contribute to variable disease severity observed in Alport families. Examination of plucked hair follicles in suspected X-linked Alport syndrome patients may offer a less invasive alternative method of diagnosis and serve as a pathogenicity test for COL4A5 variants of uncertain significance.
Malone, Andrew F.; Funk, Steven D.; Alhamad, Tarek; Miner, Jeffrey H.
2016-01-01
Introduction Many COL4A5 splice region variants have been described in patients with X-linked Alport syndrome, but few have been confirmed by functional analysis to actually cause defective splicing. We sought to demonstrate that a novel COL4A5 splice region variant in a family with Alport syndrome is pathogenic using functional studies. We also describe an alternative method of diagnosis. Methods We analyzed targeted next-generation sequencing results of an individual with Alport syndrome and confirmed results by Sanger sequencing in family members. A splicing reporter minigene assay was used to examine the variant’s effect on splicing in transfected cells. Plucked hair follicles from patients and controls were examined for collagen IV proteins using immunofluorescence microscopy. Results A novel splice region mutation in COL4A5, c.1780-6T>G, was identified and segregated with disease in this family. This variant caused frequent skipping of exon 25, resulting in a frameshift and truncation of collagen α5(IV) protein. We also developed and validated a new approach to characterize the expression of collagen α5(IV) protein in the basement membranes of plucked hair follicles. We demonstrated reduced collagen α5(IV) protein in affected male and female individuals in this family, supporting frequent failure of normal splicing. Conclusions Differing normal to abnormal transcript ratios in affected individuals carrying splice region variants may contribute to variable disease severity observed in Alport families. Examination of plucked hair follicles in suspected X-linked Alport syndrome patients may offer a less invasive alternative method of diagnosis and serve as a pathogenicity test for COL4A5 variants of uncertain significance. PMID:28013382
Discovery of Escherichia coli CRISPR sequences in an undergraduate laboratory.
Militello, Kevin T; Lazatin, Justine C
2017-05-01
Clustered regularly interspaced short palindromic repeats (CRISPRs) represent a novel type of adaptive immune system found in eubacteria and archaebacteria. CRISPRs have recently generated a lot of attention due to their unique ability to catalog foreign nucleic acids, their ability to destroy foreign nucleic acids in a mechanism that shares some similarity to RNA interference, and the ability to utilize reconstituted CRISPR systems for genome editing in numerous organisms. In order to introduce CRISPR biology into an undergraduate upper-level laboratory, a five-week set of exercises was designed to allow students to examine the CRISPR status of uncharacterized Escherichia coli strains and to allow the discovery of new repeats and spacers. Students started the project by isolating genomic DNA from E. coli and amplifying the iap CRISPR locus using the polymerase chain reaction (PCR). The PCR products were analyzed by Sanger DNA sequencing, and the sequences were examined for the presence of CRISPR repeat sequences. The regions between the repeats, the spacers, were extracted and analyzed with BLASTN searches. Overall, CRISPR loci were sequenced from several previously uncharacterized E. coli strains and one E. coli K-12 strain. Sanger DNA sequencing resulted in the discovery of 36 spacer sequences and their corresponding surrounding repeat sequences. Five of the spacers were homologous to foreign (non-E. coli) DNA. Assessment of the laboratory indicates that improvements were made in the ability of students to answer questions relating to the structure and function of CRISPRs. Future directions of the laboratory are presented and discussed. © 2016 by The International Union of Biochemistry and Molecular Biology, 45(3):262-269, 2017. © 2016 The International Union of Biochemistry and Molecular Biology.
Sequence investigation of 34 forensic autosomal STRs with massively parallel sequencing.
Zhang, Suhua; Niu, Yong; Bian, Yingnan; Dong, Rixia; Liu, Xiling; Bao, Yun; Jin, Chao; Zheng, Hancheng; Li, Chengtao
2018-05-01
STRs vary not only in the length of the repeat units and the number of repeats but also in the region with which they conform to an incremental repeat pattern. Massively parallel sequencing (MPS) offers new possibilities in the analysis of STRs since they can simultaneously sequence multiple targets in a single reaction and capture potential internal sequence variations. Here, we sequenced 34 STRs applied in the forensic community of China with a custom-designed panel. MPS performance were evaluated from sequencing reads analysis, concordance study and sensitivity testing. High coverage sequencing data were obtained to determine the constitute ratios and heterozygous balance. No actual inconsistent genotypes were observed between capillary electrophoresis (CE) and MPS, demonstrating the reliability of the panel and the MPS technology. With the sequencing data from the 200 investigated individuals, 346 and 418 alleles were obtained via CE and MPS technologies at the 34 STRs, indicating MPS technology provides higher discrimination than CE detection. The whole study demonstrated that STR genotyping with the custom panel and MPS technology has the potential not only to reveal length and sequence variations but also to satisfy the demands of high throughput and high multiplexing with acceptable sensitivity.
Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor.
Kohany, Oleksiy; Gentles, Andrew J; Hankus, Lukasz; Jurka, Jerzy
2006-10-25
Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintenance of the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases. We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors, and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein coding regions in sequences; searching and including Pubmed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments. Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at http://www.girinst.org/repbase/submission.html (RepbaseSubmitter) and http://www.girinst.org/censor/index.php (Censor).
Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence
2017-01-01
During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana. We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays, although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. PMID:28223399
Repeating aftershocks of the great 2004 Sumatra and 2005 Nias earthquakes
NASA Astrophysics Data System (ADS)
Yu, Wen-che; Song, Teh-Ru Alex; Silver, Paul G.
2013-05-01
We investigate repeating aftershocks associated with the great 2004 Sumatra-Andaman (Mw 9.2) and 2005 Nias-Simeulue (Mw 8.6) earthquakes by cross-correlating waveforms recorded by the regional seismographic station PSI and teleseismic stations. We identify 10 and 18 correlated aftershock sequences associated with the great 2004 Sumatra and 2005 Nias earthquakes, respectively. The majority of the correlated aftershock sequences are located near the down-dip end of a large afterslip patch. We determine the precise relative locations of event pairs among these sequences and estimate the source rupture areas. The correlated event pairs identified are appropriately referred to as repeating aftershocks, in that the source rupture areas are comparable and significantly overlap within a sequence. We use the repeating aftershocks to estimate afterslip based on the slip-seismic moment scaling relationship and to infer the temporal decay rate of the recurrence interval. The estimated afterslip resembles that measured from the near-field geodetic data to the first order. The decay rate of repeating aftershocks as a function of lapse time t follows a power-law decay 1/tp with the exponent p in the range 0.8-1.1. Both types of observations indicate that repeating aftershocks are governed by post-seismic afterslip.
Plourde, Marie; Gingras, Hélène; Roy, Gaétan; Lapointe, Andréanne; Leprohon, Philippe; Papadopoulou, Barbara; Corbeil, Jacques; Ouellette, Marc
2014-01-01
Gene amplification of specific loci has been described in all kingdoms of life. In the protozoan parasite Leishmania, the product of amplification is usually part of extrachromosomal circular or linear amplicons that are formed at the level of direct or inverted repeated sequences. A bioinformatics screen revealed that repeated sequences are widely distributed in the Leishmania genome and the repeats are chromosome-specific, conserved among species, and generally present in low copy number. Using sensitive PCR assays, we provide evidence that the Leishmania genome is continuously being rearranged at the level of these repeated sequences, which serve as a functional platform for constitutive and stochastic amplification (and deletion) of genomic segments in the population. This process is adaptive as the copy number of advantageous extrachromosomal circular or linear elements increases upon selective pressure and is reversible when selection is removed. We also provide mechanistic insights on the formation of circular and linear amplicons through RAD51 recombinase-dependent and -independent mechanisms, respectively. The whole genome of Leishmania is thus stochastically rearranged at the level of repeated sequences, and the selection of parasite subpopulations with changes in the copy number of specific loci is used as a strategy to respond to a changing environment. PMID:24844805
Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao
2018-05-01
Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We have analyzed 843,473 STR loci with two to six basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to either the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we have also purified the suitable 218 STR loci with four basepair repeat units and 53 loci with five basepair repeat units both for short read sequencing and PCR based technologies, which would be candidates to the actual forensic DNA typing in Japanese population.
Connective tissue growth factor hammerhead ribozyme attenuates human hepatic stellate cell function
Gao, Run-Ping; Brigstock, David R
2009-01-01
AIM: To determine the effect of hammerhead ribozyme targeting connective tissue growth factor (CCN2) on human hepatic stellate cell (HSC) function. METHODS: CCN2 hammerhead ribozyme cDNA plus two self-cleaving sequences were inserted into pTriEx2 to produce pTriCCN2-Rz. Each vector was individually transfected into cultured LX-2 human HSCs, which were then stimulated by addition of transforming growth factor (TGF)-β1 to the culture medium. Semi-quantitative RT-PCR was used to determine mRNA levels for CCN2 or collagen I, while protein levels of each molecule in cell lysates and conditioned medium were measured by ELISA. Cell-cycle progression of the transfected cells was assessed by flow cytometry. RESULTS: In pTriEx2-transfected LX-2 cells, TGF-β1 treatment caused an increase in the mRNA level for CCN2 or collagen I, and an increase in produced and secreted CCN2 or extracellular collagen I protein levels. pTriCCN2-Rz-transfected LX-2 cells showed decreased basal CCN2 or collagen mRNA levels, as well as produced and secreted CCN2 or collagen I protein. Furthermore, the TGF-β1-induced increase in mRNA or protein for CCN2 or collagen I was inhibited partially in pTriCCN2-Rz-transfected LX-2 cells. Inhibition of CCN2 using hammerhead ribozyme cDNA resulted in fewer of the cells transitioning into S phase. CONCLUSION: Endogenous CCN2 is a mediator of basal or TGF-β1-induced collagen I production in human HSCs and regulates entry of the cells into S phase. PMID:19673024
Begum, Rabeya; Alam, Sheikh Shamimul; Menzel, Gerhard; Schmidt, Thomas
2009-01-01
Background and Aims Dendrobium species show tremendous morphological diversity and have broad geographical distribution. As repetitive sequence analysis is a useful tool to investigate the evolution of chromosomes and genomes, the aim of the present study was the characterization of repetitive sequences from Dendrobium moschatum for comparative molecular and cytogenetic studies in the related species Dendrobium aphyllum, Dendrobium aggregatum and representatives from other orchid genera. Methods In order to isolate highly repetitive sequences, a c0t-1 DNA plasmid library was established. Repeats were sequenced and used as probes for Southern hybridization. Sequence divergence was analysed using bioinformatic tools. Repetitive sequences were localized along orchid chromosomes by fluorescence in situ hybridization (FISH). Key Results Characterization of the c0t-1 library resulted in the detection of repetitive sequences including the (GA)n dinucleotide DmoO11, numerous Arabidopsis-like telomeric repeats and the highly amplified dispersed repeat DmoF14. The DmoF14 repeat is conserved in six Dendrobium species but diversified in representative species of three other orchid genera. FISH analyses showed the genome-wide distribution of DmoF14 in D. moschatum, D. aphyllum and D. aggregatum. Hybridization with the telomeric repeats demonstrated Arabidopsis-like telomeres at the chromosome ends of Dendrobium species. However, FISH using the telomeric probe revealed two pairs of chromosomes with strong intercalary signals in D. aphyllum. FISH showed the terminal position of 5S and 18S–5·8S–25S rRNA genes and a characteristic number of rDNA sites in the three Dendrobium species. Conclusions The repeated sequences isolated from D. moschatum c0t-1 DNA constitute major DNA families of the D. moschatum, D. aphyllum and D. aggregatum genomes with DmoF14 representing an ancient component of orchid genomes. Large intercalary telomere-like arrays suggest chromosomal rearrangements in D. aphyllum while the number and localization of rRNA genes as well as the species-specific distribution pattern of an abundant microsatellite reflect the genomic diversity of the three Dendrobium species. PMID:19635741
Collagenolytic Matrix Metalloproteinase Activities toward Peptomeric Triple-Helical Substrates.
Stawikowski, Maciej J; Stawikowska, Roma; Fields, Gregg B
2015-05-19
Although collagenolytic matrix metalloproteinases (MMPs) possess common domain organizations, there are subtle differences in their processing of collagenous triple-helical substrates. In this study, we have incorporated peptoid residues into collagen model triple-helical peptides and examined MMP activities toward these peptomeric chimeras. Several different peptoid residues were incorporated into triple-helical substrates at subsites P3, P1, P1', and P10' individually or in combination, and the effects of the peptoid residues were evaluated on the activities of full-length MMP-1, MMP-8, MMP-13, and MMP-14/MT1-MMP. Most peptomers showed little discrimination between MMPs. However, a peptomer containing N-methyl Gly (sarcosine) in the P1' subsite and N-isobutyl Gly (NLeu) in the P10' subsite was hydrolyzed efficiently only by MMP-13 [nomenclature relative to the α1(I)772-786 sequence]. Cleavage site analysis showed hydrolysis at the Gly-Gln bond, indicating a shifted binding of the triple helix compared to the parent sequence. Favorable hydrolysis by MMP-13 was not due to sequence specificity or instability of the substrate triple helix but rather was based on the specific interactions of the P7' peptoid residue with the MMP-13 hemopexin-like domain. A fluorescence resonance energy transfer triple-helical peptomer was constructed and found to be readily processed by MMP-13, not cleaved by MMP-1 and MMP-8, and weakly hydrolyzed by MT1-MMP. The influence of the triple-helical structure containing peptoid residues on the interaction between MMP subsites and individual substrate residues may provide additional information about the mechanism of collagenolysis, the understanding of collagen specificity, and the design of selective MMP probes.
Lim, K Yoong; Kovarik, Ales; Matyasek, Roman; Chase, Mark W; Knapp, Sandra; McCarthy, Elizabeth; Clarkson, James J; Leitch, Andrew R
2006-12-01
Combining phylogenetic reconstructions of species relationships with comparative genomic approaches is a powerful way to decipher evolutionary events associated with genome divergence. Here, we reconstruct the history of karyotype and tandem repeat evolution in species of diploid Nicotiana section Alatae. By analysis of plastid DNA, we resolved two clades with high bootstrap support, one containing N. alata, N. langsdorffii, N. forgetiana and N. bonariensis (called the n = 9 group) and another containing N. plumbaginifolia and N. longiflora (called the n = 10 group). Despite little plastid DNA sequence divergence, we observed, via fluorescent in situ hybridization, substantial chromosomal repatterning, including altered chromosome numbers, structure and distribution of repeats. Effort was focussed on 35S and 5S nuclear ribosomal DNA (rDNA) and the HRS60 satellite family of tandem repeats comprising the elements HRS60, NP3R and NP4R. We compared divergence of these repeats in diploids and polyploids of Nicotiana. There are dramatic shifts in the distribution of the satellite repeats and complete replacement of intergenic spacers (IGSs) of 35S rDNA associated with divergence of the species in section Alatae. We suggest that sequence homogenization has replaced HRS60 family repeats at sub-telomeric regions, but that this process may not occur, or occurs more slowly, when the repeats are found at intercalary locations. Sequence homogenization acts more rapidly (at least two orders of magnitude) on 35S rDNA than 5S rDNA and sub-telomeric satellite sequences. This rapid rate of divergence is analogous to that found in polyploid species, and is therefore, in plants, not only associated with polyploidy.
Correlation between fibroin amino acid sequence and physical silk properties.
Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek
2003-09-12
The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet.
Evidence for Long-Timescale Patterns of Synaptic Inputs in CA1 of Awake Behaving Mice.
Kolb, Ilya; Talei Franzesi, Giovanni; Wang, Michael; Kodandaramaiah, Suhasa B; Forest, Craig R; Boyden, Edward S; Singer, Annabelle C
2018-02-14
Repeated sequences of neural activity are a pervasive feature of neural networks in vivo and in vitro In the hippocampus, sequential firing of many neurons over periods of 100-300 ms reoccurs during behavior and during periods of quiescence. However, it is not known whether the hippocampus produces longer sequences of activity or whether such sequences are restricted to specific network states. Furthermore, whether long repeated patterns of activity are transmitted to single cells downstream is unclear. To answer these questions, we recorded intracellularly from hippocampal CA1 of awake, behaving male mice to examine both subthreshold activity and spiking output in single neurons. In eight of nine recordings, we discovered long (900 ms) reoccurring subthreshold fluctuations or "repeats." Repeats generally were high-amplitude, nonoscillatory events reoccurring with 10 ms precision. Using statistical controls, we determined that repeats occurred more often than would be expected from unstructured network activity (e.g., by chance). Most spikes occurred during a repeat, and when a repeat contained a spike, the spike reoccurred with precision on the order of ≤20 ms, showing that long repeated patterns of subthreshold activity are strongly connected to spike output. Unexpectedly, we found that repeats occurred independently of classic hippocampal network states like theta oscillations or sharp-wave ripples. Together, these results reveal surprisingly long patterns of repeated activity in the hippocampal network that occur nonstochastically, are transmitted to single downstream neurons, and strongly shape their output. This suggests that the timescale of information transmission in the hippocampal network is much longer than previously thought. SIGNIFICANCE STATEMENT We found long (≥900 ms), repeated, subthreshold patterns of activity in CA1 of awake, behaving mice. These repeated patterns ("repeats") occurred more often than expected by chance and with 10 ms precision. Most spikes occurred within repeats and reoccurred with a precision on the order of 20 ms. Surprisingly, there was no correlation between repeat occurrence and classical network states such as theta oscillations and sharp-wave ripples. These results provide strong evidence that long patterns of activity are repeated and transmitted to downstream neurons, suggesting that the hippocampus can generate longer sequences of repeated activity than previously thought. Copyright © 2018 the authors 0270-6474/18/381822-14$15.00/0.
CRF: detection of CRISPR arrays using random forest.
Wang, Kai; Liang, Chun
2017-01-01
CRISPRs (clustered regularly interspaced short palindromic repeats) are particular repeat sequences found in wide range of bacteria and archaea genomes. Several tools are available for detecting CRISPR arrays in the genomes of both domains. Here we developed a new web-based CRISPR detection tool named CRF (CRISPR Finder by Random Forest). Different from other CRISPR detection tools, a random forest classifier was used in CRF to filter out invalid CRISPR arrays from all putative candidates and accordingly enhanced detection accuracy. In CRF, particularly, triplet elements that combine both sequence content and structure information were extracted from CRISPR repeats for classifier training. The classifier achieved high accuracy and sensitivity. Moreover, CRF offers a highly interactive web interface for robust data visualization that is not available among other CRISPR detection tools. After detection, the query sequence, CRISPR array architecture, and the sequences and secondary structures of CRISPR repeats and spacers can be visualized for visual examination and validation. CRF is freely available at http://bioinfolab.miamioh.edu/crf/home.php.
Expanded complexity of unstable repeat diseases
Polak, Urszula; McIvor, Elizabeth; Dent, Sharon Y.R.; Wells, Robert D.; Napierala, Marek
2015-01-01
Unstable Repeat Diseases (URDs) share a common mutational phenomenon of changes in the copy number of short, tandemly repeated DNA sequences. More than 20 human neurological diseases are caused by instability, predominantly expansion, of microsatellite sequences. Changes in the repeat size initiate a cascade of pathological processes, frequently characteristic of a unique disease or a small subgroup of the URDs. Understanding of both the mechanism of repeat instability and molecular consequences of the repeat expansions is critical to developing successful therapies for these diseases. Recent technological breakthroughs in whole genome, transcriptome and proteome analyses will almost certainly lead to new discoveries regarding the mechanisms of repeat instability, the pathogenesis of URDs, and will facilitate development of novel therapeutic approaches. The aim of this review is to give a general overview of unstable repeats diseases, highlight the complexities of these diseases, and feature the emerging discoveries in the field. PMID:23233240
Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.
Mayer, K; Schüller, C; Wambutt, R; Murphy, G; Volckaert, G; Pohl, T; Düsterhöft, A; Stiekema, W; Entian, K D; Terryn, N; Harris, B; Ansorge, W; Brandt, P; Grivell, L; Rieger, M; Weichselgartner, M; de Simone, V; Obermaier, B; Mache, R; Müller, M; Kreis, M; Delseny, M; Puigdomenech, P; Watson, M; Schmidtheini, T; Reichert, B; Portatelle, D; Perez-Alonso, M; Boutry, M; Bancroft, I; Vos, P; Hoheisel, J; Zimmermann, W; Wedler, H; Ridley, P; Langham, S A; McCullagh, B; Bilham, L; Robben, J; Van der Schueren, J; Grymonprez, B; Chuang, Y J; Vandenbussche, F; Braeken, M; Weltjens, I; Voet, M; Bastiaens, I; Aert, R; Defoor, E; Weitzenegger, T; Bothe, G; Ramsperger, U; Hilbert, H; Braun, M; Holzer, E; Brandt, A; Peters, S; van Staveren, M; Dirske, W; Mooijman, P; Klein Lankhorst, R; Rose, M; Hauf, J; Kötter, P; Berneiser, S; Hempel, S; Feldpausch, M; Lamberth, S; Van den Daele, H; De Keyser, A; Buysshaert, C; Gielen, J; Villarroel, R; De Clercq, R; Van Montagu, M; Rogers, J; Cronin, A; Quail, M; Bray-Allen, S; Clark, L; Doggett, J; Hall, S; Kay, M; Lennard, N; McLay, K; Mayes, R; Pettett, A; Rajandream, M A; Lyne, M; Benes, V; Rechmann, S; Borkova, D; Blöcker, H; Scharfe, M; Grimm, M; Löhnert, T H; Dose, S; de Haan, M; Maarse, A; Schäfer, M; Müller-Auer, S; Gabel, C; Fuchs, M; Fartmann, B; Granderath, K; Dauner, D; Herzl, A; Neumann, S; Argiriou, A; Vitale, D; Liguori, R; Piravandi, E; Massenet, O; Quigley, F; Clabauld, G; Mündlein, A; Felber, R; Schnabl, S; Hiller, R; Schmidt, W; Lecharny, A; Aubourg, S; Chefdor, F; Cooke, R; Berger, C; Montfort, A; Casacuberta, E; Gibbons, T; Weber, N; Vandenbol, M; Bargues, M; Terol, J; Torres, A; Perez-Perez, A; Purnelle, B; Bent, E; Johnson, S; Tacon, D; Jesse, T; Heijnen, L; Schwarz, S; Scholler, P; Heber, S; Francs, P; Bielke, C; Frishman, D; Haase, D; Lemcke, K; Mewes, H W; Stocker, S; Zaccaria, P; Bevan, M; Wilson, R K; de la Bastide, M; Habermann, K; Parnell, L; Dedhia, N; Gnoj, L; Schutz, K; Huang, E; Spiegel, L; Sehkon, M; Murray, J; Sheet, P; Cordes, M; Abu-Threideh, J; Stoneking, T; Kalicki, J; Graves, T; Harmon, G; Edwards, J; Latreille, P; Courtney, L; Cloud, J; Abbott, A; Scott, K; Johnson, D; Minx, P; Bentley, D; Fulton, B; Miller, N; Greco, T; Kemp, K; Kramer, J; Fulton, L; Mardis, E; Dante, M; Pepin, K; Hillier, L; Nelson, J; Spieth, J; Ryan, E; Andrews, S; Geisel, C; Layman, D; Du, H; Ali, J; Berghoff, A; Jones, K; Drone, K; Cotton, M; Joshu, C; Antonoiu, B; Zidanic, M; Strong, C; Sun, H; Lamar, B; Yordan, C; Ma, P; Zhong, J; Preston, R; Vil, D; Shekher, M; Matero, A; Shah, R; Swaby, I K; O'Shaughnessy, A; Rodriguez, M; Hoffmann, J; Till, S; Granat, S; Shohdy, N; Hasegawa, A; Hameed, A; Lodhi, M; Johnson, A; Chen, E; Marra, M; Martienssen, R; McCombie, W R
1999-12-16
The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.
Hyriopsis cumingii Hic52-A novel nacreous layer matrix protein with a collagen-like structure.
Liu, Xiaojun; Pu, Jingwen; Zeng, Shimei; Jin, Can; Dong, Shaojian; Li, Jiale
2017-09-01
Nacre is a product of a precisely regulated biomineralization process and a major contributor to the luster of pearls. Nacre is composed of calcium carbonate and an organic matrix of proteins that is secreted from mollusc mantle tissue and is exclusively associated with shell formation. In this study, hic52, a novel matrix protein gene from mantle of Hyriopsis cumingii, was cloned and functionally analyzed. The full-length cDNA of hic52 encoded 542 amino acids and contained a signal peptide of 18 amino acids. Excluding the signal peptide, the theoretical molecular mass of the polypeptide was 52.2kDa. The predicted isoelectric point was 10.37, indicating a basic shell protein. The amino acid sequence of hic52 featured high proportion of Gly (28.8%) and Gln (12.4%) residues. The predicted tertiary structure was characterized as having similarities to collagen I, alpha 1 and alpha 2 in the structure. The polypeptide sequence shared no homology with collagen. The hic52 expression pattern by quantitative real-time PCR and in situ hybridization exhibits at the dorsal epithelial cells of the mantle. Expression increased during the stages of pearl sac development. The data showed that hic52 is probably a framework shell protein that mediates and controls the nacreous biomineralization process. Copyright © 2017 Elsevier B.V. All rights reserved.
Isenberg, Jeff S; Yu, Christine; Roberts, David D
2008-02-15
ABT-510 is a potent mimetic of an anti-angiogenic sequence from the second type 1 repeat of thrombospondin-1. ABT-510 and the original d-Ile mimetic from which it was derived, GDGV(dI)TRIR, are similarly active for inhibiting vascular outgrowth in a B16 melanoma explant assay. Because GDGV(dI)TRIR and thrombospondin-1 modulate nitric oxide signaling by inhibiting the fatty translocase activity of CD36, we examined the ability ABT-510 to modulate fatty acid uptake into vascular cells and downstream nitric oxide/cGMP signaling. Remarkably, ABT-510 is less active than GDGV(dI)TRIR for inhibiting myristic acid uptake into both endothelial and vascular smooth muscle cells. Correspondingly, ABT-510 is less potent than GDGV(dI)TRIR for blocking a myristate-stimulated increase in cell adhesion to collagen and nitric oxide-driven accumulation of cGMP. ABT-510 at concentrations sufficient to inhibit CD36 fatty acid translocase activity synergizes with thrombin in aggregating platelets and blunts the activity of NO to delay aggregation, but again less than GDGV(dI)TRIR. In contrast, ABT-510 is more potent than GDGV(dI)TRIR for inducing caspase activation in vascular cells. Thus, we propose that ABT-510 is a drug with at least two mechanisms of action, and its potent anti-tumor activity may be in part independent of CD36 fatty acid translocase inhibition.
Isenberg, Jeff S.; Yu, Christine; Roberts, David D.
2008-01-01
ABT-510 is a potent mimetic of an anti-angiogenic sequence from the second type 1 repeat of thrombospondin-1. ABT-510 and the original d-Ile mimetic from which it was derived, GDGV(dI)TRIR, are similarly active for inhibiting vascular outgrowth in a B16 melanoma explant assay. Because GDGV(dI)TRIR and thrombospondin-1 modulate nitric oxide signaling by inhibiting the fatty translocase activity of CD36, we examined the ability ABT-510 to modulate fatty acid uptake into vascular cells and downstream nitric oxide/cGMP signaling. Remarkably, ABT-510 is less active than GDGV(dI)TRIR for inhibiting myristic acid uptake into both endothelial and vascular smooth muscle cells. Correspondingly, ABT-510 is less potent than GDGV(dI)TRIR for blocking a myristate-stimulated increase in cell adhesion to collagen and nitric oxide-driven accumulation of cGMP. ABT-510 at concentrations sufficient to inhibit CD36 fatty acid translocase activity synergizes with thrombin in aggregating platelets and blunts the activity of NO to delay aggregation, but again less than GDGV(dI)TRIR. In contrast, ABT-510 is more potent than GDGV(dI)TRIR for inducing caspase activation in vascular cells. Thus, we propose that ABT-510 is a drug with at least two mechanisms of action, and its potent anti-tumor activity may be in part independent of CD36 fatty acid translocase inhibition. PMID:18068687
Characterization of species-specific repeated DNA sequences from B. nigra.
Gupta, V; Lakshmisita, G; Shaila, M S; Jagannathan, V; Lakshmikumaran, M S
1992-07-01
The construction and characterization of two genome-specific recombinant DNA clones from B. nigra are described. Southern analysis showed that the two clones belong to a dispersed repeat family. They differ from each other in their length, distribution and sequence, though the average GC content is nearly the same (45%). These B genome-specific repeats have been used to analyse the phylogenetic relationships between cultivated and wild species of the family Brassicaceae.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Greenspan, D.S.; Northrup, H.; Au, K.S.
1995-02-10
COL5A1, the gene for the {alpha}1 chain of type V collagen, has been considered a candidate gene for certain diseases based on chromosomal location and/or disease phenotype. We have employed 3{prime}-untranslated region RFLPs to exclude COL5A1 as a candidate gene in families with tuberous sclerosis 1, Ehlers-Danlos syndrome type H, and nail-patella syndrome. In addition, we describe a polymorphic simple sequence repeat (SSR) within a COL5A1 intron. This SSR is used to exclude COL5A1 as a candidate gene in hereditary hemorrhagic telangiectasia (Osler-Rendu-Weber disease) and to add COL5A1 to the existing map of {open_quotes}index{close_quotes} markers of chromosome 9 by evaluationmore » of the COL5A1 locus on the CEPH 40-family reference pedigree set. This genetic mapping places COL5A1 between markers D9S66 and D9S67. 14 refs., 1 fig., 2 tabs.« less
Jobke, B.; Bolbos, R.; Saadat, E.; Cheng, J.; Li, X.; Majumdar, S.
2012-01-01
The application of biomolecular magnetic resonance imaging becomes increasingly important in the context of early cartilage changes in degenerative and inflammatory joint disease before gross morphological changes become apparent. In this limited technical report, we investigate the correlation of MRI T1, T2 and T1
Solov'ev, V V; Kel', A E; Kolchanov, N A
1989-01-01
The factors, determining the presence of inverted and symmetrical repeats in genes coding for globular proteins, have been analysed. An interesting property of genetical code has been revealed in the analysis of symmetrical repeats: the pairs of symmetrical codons corresponded to pairs of amino acids with mostly similar physical-chemical parameters. This property may explain the presence of symmetrical repeats and palindromes only in genes coding for beta-structural proteins-polypeptides, where amino acids with similar physical-chemical properties occupy symmetrical positions. A stochastic model of evolution of polynucleotide sequences has been used for analysis of inverted repeats. The modelling demonstrated that only limiting of sequences (uneven frequencies of used codons) is enough for arising of nonrandom inverted repeats in genes.
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-01-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163
Reneker, Jeff; Shyu, Chi-Ren; Zeng, Peiyu; Polacco, Joseph C.; Gassmann, Walter
2004-01-01
We have developed a web server for the life sciences community to use to search for short repeats of DNA sequence of length between 3 and 10 000 bases within multiple species. This search employs a unique and fast hash function approach. Our system also applies information retrieval algorithms to discover knowledge of cross-species conservation of repeat sequences. Furthermore, we have incorporated a part of the Gene Ontology database into our information retrieval algorithms to broaden the coverage of the search. Our web server and tutorial can be found at http://acmes.rnet.missouri.edu. PMID:15215469
Gupta, Rashmi; Mirdha, Bijay Ranjan; Guleria, Randeep; Kumar, Lalit; Luthra, Kalpana; Agarwal, Sanjay Kumar; Sreenivas, Vishnubhatla
2013-01-01
Pneumocystis jirovecii is an opportunistic pathogen that causes severe pneumonia in immunocompromised patients. To study the genetic diversity of P. jirovecii in India the upstream conserved sequence (UCS) region of Pneumocystis genome was amplified, sequenced and genotyped from a set of respiratory specimens obtained from 50 patients with a positive result for nested mitochondrial large subunit ribosomal RNA (mtLSU rRNA) PCR during the years 2005-2008. Of these 50 cases, 45 showed a positive PCR for UCS region. Variations in the tandem repeats in UCS region were characterized by sequencing all the positive cases. Of the 45 cases, one case showed five repeats, 11 cases showed four repeats, 29 cases showed three repeats and four cases showed two repeats. By running amplified DNA from all these cases on a high-resolution gel, mixed infection was observed in 12 cases (26.7%, 12/45). Forty three of 45 cases included in this study had previously been typed at mtLSU rRNA and internal transcribed spacer (ITS) region by our group. In the present study, the genotypes at those two regions were combined with UCS repeat patterns to construct allelic profiles of 43 cases. A total of 36 allelic profiles were observed in 43 isolates indicating high genetic variability. A statistically significant association was observed between mtLSU rRNA genotype 1, ITS type Ea and UCS repeat pattern 4. Copyright © 2012 Elsevier B.V. All rights reserved.
Evolution and selection of Rhg1, a copy-number variant nematode-resistance locus
Lee, Tong Geon; Kumar, Indrajit; Diers, Brian W; Hudson, Matthew E
2015-01-01
The soybean cyst nematode (SCN) resistance locus Rhg1 is a tandem repeat of a 31.2 kb unit of the soybean genome. Each 31.2-kb unit contains four genes. One allele of Rhg1, Rhg1-b, is responsible for protecting most US soybean production from SCN. Whole-genome sequencing was performed, and PCR assays were developed to investigate allelic variation in sequence and copy number of the Rhg1 locus across a population of soybean germplasm accessions. Four distinct sequences of the 31.2-kb repeat unit were identified, and some Rhg1 alleles carry up to three different types of repeat unit. The total number of copies of the repeat varies from 1 to 10 per haploid genome. Both copy number and sequence of the repeat correlate with the resistance phenotype, and the Rhg1 locus shows strong signatures of selection. Significant linkage disequilibrium in the genome outside the boundaries of the repeat allowed the Rhg1 genotype to be inferred using high-density single nucleotide polymorphism genotyping of 15 996 accessions. Over 860 germplasm accessions were found likely to possess Rhg1 alleles. The regions surrounding the repeat show indications of non-neutral evolution and high genetic variability in populations from different geographic locations, but without evidence of fixation of the resistant genotype. A compelling explanation of these results is that balancing selection is in operation at Rhg1. PMID:25735447
SSR allelic variation in almond (Prunus dulcis Mill.).
Xie, Hua; Sui, Yi; Chang, Feng-Qi; Xu, Yong; Ma, Rong-Cai
2006-01-01
Sixteen SSR markers including eight EST-SSR and eight genomic SSRs were used for genetic diversity analysis of 23 Chinese and 15 international almond cultivars. EST- and genomic SSR markers previously reported in species of Prunus, mainly peach, proved to be useful for almond genetic analysis. DNA sequences of 117 alleles of six of the 16 SSR loci were analysed to reveal sequence variation among the 38 almond accessions. For the four SSR loci with AG/CT repeats, no insertions or deletions were observed in the flanking regions of the 98 alleles sequenced. Allelic size variation of these loci resulted exclusively from differences in the structures of repeat motifs, which involved interruptions or occurrences of new motif repeats in addition to varying number of AG/CT repeats. Some alleles had a high number of uninterrupted repeat motifs, indicating that SSR mutational patterns differ among alleles at a given SSR locus within the almond species. Allelic homoplasy was observed in the SSR loci because of base substitutions, interruptions or compound repeat motifs. Substitutions in the repeat regions were found at two SSR loci, suggesting that point mutations operate on SSRs and hinder the further SSR expansion by introducing repeat interruptions to stabilize SSR loci. Furthermore, it was shown that some potential point mutations in the flanking regions are linked with new SSR repeat motif variation in almond and peach.
Schnare, Murray N.; Collings, James C.; Spencer, David F.; Gray, Michael W.
2000-01-01
In Crithidia fasciculata, the ribosomal RNA (rRNA) gene repeats range in size from ∼11 to 12 kb. This length heterogeneity is localized to a region of the intergenic spacer (IGS) that contains tandemly repeated copies of a 19mer sequence. The IGS also contains four copies of an ∼55 nt repeat that has an internal inverted repeat and is also present in the IGS of Leishmania species. We have mapped the C.fasciculata transcription initiation site as well as two other reverse transcriptase stop sites that may be analogous to the A0 and A′ pre-rRNA processing sites within the 5′ external transcribed spacer (ETS) of other eukaryotes. Features that could influence processing at these sites include two stretches of conserved primary sequence and three secondary structure elements present in the 5′ ETS. We also characterized the C.fasciculata U3 snoRNA, which has the potential for base-pairing with pre-rRNA sequences. Finally, we demonstrate that biosynthesis of large subunit rRNA in both C.fasciculata and Trypanosoma brucei involves 3′-terminal addition of three A residues that are not present in the corresponding DNA sequences. PMID:10982863
Tasaki, E; Hirayama, J; Tazumi, A; Hayashi, K; Hara, Y; Ueno, H; Moore, J E; Millar, B C; Matsuda, M
2012-02-01
Novel clustered regularly-interspaced short palindromic repeats (CRISPRs) locus [7,500 base pairs (bp) in length] occurred in the urease-positive thermophilic Campylobacter (UPTC) Japanese isolate, CF89-12. The 7,500 bp gene loci consisted of the 5'-methylaminomethyl-2-thiouridylate methyltransferase gene, putative (P) CRISPR associated (p-Cas), putative open reading frames, Cas1 and Cas2, leader sequence region (146 bp), 12 CRISPRs consensus sequence repeats (each 36 bp) separated by a non-repetitive unique spacer region of similar length (26-31 bp) and the phosphatidyl glycerophosphatase A gene. When the CRISPRs loci in the UPTC CF89-12 and five C. jejuni isolates were compared with one another, these six isolates contained p-Cas, Cas1 and Cas2 within the loci. Four to 12 CRISPRs consensus sequence repeats separated by a non-repetitive unique spacer region occurred in six isolates and the nucleotide sequences of those repeats gave approximately 92-100% similarity with each other. However, no sequence similarity occurred in the unique spacer regions among these isolates. The putative σ(70) transcriptional promoter and the hypothetical ρ-independent terminator structures for the CRISPRs and Cas were detected. No in vivo transcription of p-Cas, Cas1 and Cas2 was confirmed in the UPTC cells.
Metal stabilization of collagen and de novo designed mimetic peptides
Parmar, Avanish S.; Xu, Fei; Pike, Douglas H.; Belure, Sandeep V.; Hasan, Nida F.; Drzewiecki, Kathryn E.; Shreiber, David I.; Nanda, Vikas
2017-01-01
We explore the design of metal binding sites to modulate triple-helix stability of collagen and collagen-mimetic peptides. Globular proteins commonly utilize metals to connect tertiary structural elements that are well separated in sequence, constraining structure and enhancing stability. It is more challenging to engineer structural metals into fibrous protein scaffolds, which lack the extensive tertiary contacts seen in globular proteins. In the collagen triple helix, the structural adjacency of the carboxy-termini of the three chains makes this region an attractive target for introducing metal binding sites. We engineered His3 sites based on structural modeling constraints into a series of designed homotrimeric and heterotrimeric peptides, assessing the capacity of metal binding to improve stability and in the case of heterotrimers, affect specificity of assembly. Notable enhancements in stability for both homo and heteromeric systems were observed upon addition of zinc(II) and several other metal ions only when all three histidine ligands were present. Metal binding affinities were consistent with the expected Irving-Williams series for imidazole. Unlike other metals tested, copper(II) also bound to peptides lacking histidine ligands. Acetylation of the peptide N-termini prevented copper binding, indicating proline backbone amide metal-coordination at this site. Copper similarly stabilized animal extracted Type I collagen in a metal specific fashion, highlighting the potential importance of metal homeostasis within the extracellular matrix. PMID:26225466
Metal Stabilization of Collagen and de Novo Designed Mimetic Peptides.
Parmar, Avanish S; Xu, Fei; Pike, Douglas H; Belure, Sandeep V; Hasan, Nida F; Drzewiecki, Kathryn E; Shreiber, David I; Nanda, Vikas
2015-08-18
We explore the design of metal binding sites to modulate triple-helix stability of collagen and collagen-mimetic peptides. Globular proteins commonly utilize metals to connect tertiary structural elements that are well separated in sequence, constraining structure and enhancing stability. It is more challenging to engineer structural metals into fibrous protein scaffolds, which lack the extensive tertiary contacts seen in globular proteins. In the collagen triple helix, the structural adjacency of the carboxy-termini of the three chains makes this region an attractive target for introducing metal binding sites. We engineered His3 sites based on structural modeling constraints into a series of designed homotrimeric and heterotrimeric peptides, assessing the capacity of metal binding to improve stability and in the case of heterotrimers, affect specificity of assembly. Notable enhancements in stability for both homo- and heteromeric systems were observed upon addition of zinc(II) and several other metal ions only when all three histidine ligands were present. Metal binding affinities were consistent with the expected Irving-Williams series for imidazole. Unlike other metals tested, copper(II) also bound to peptides lacking histidine ligands. Acetylation of the peptide N-termini prevented copper binding, indicating proline backbone amide metal-coordination at this site. Copper similarly stabilized animal extracted Type I collagen in a metal-specific fashion, highlighting the potential importance of metal homeostasis within the extracellular matrix.
Tooth agenesis in osteogenesis imperfecta related to mutations in the collagen type I genes.
Malmgren, B; Andersson, K; Lindahl, K; Kindmark, A; Grigelioniene, G; Zachariadis, V; Dahllöf, G; Åström, E
2017-01-01
Osteogenesis imperfecta (OI) is a heterogeneous group of disorders of connective tissue, mainly caused by mutations in the collagen type I genes (COL1A1 and COL1A2). Tooth agenesis is a common feature of OI. We investigated the association between tooth agenesis and collagen type I mutations in individuals with OI. In this cohort study, 128 unrelated individuals with OI were included. Panoramic radiographs were analyzed regarding dentinogenesis imperfecta (DGI) and congenitally missing teeth. The collagen I genes were sequenced in all individuals, and in 25, multiplex ligation-dependent probe amplification was performed. Mutations in the COL1A1 and COL1A2 genes were found in 104 of 128 individuals. Tooth agenesis was diagnosed in 17% (hypodontia 11%, oligodontia 6%) and was more frequent in those with DGI (P = 0.016), and in those with OI type III, 47%, compared to those with OI types I, 12% (P = 0.003), and IV, 13% (P = 0.017). Seventy-five percent of the individuals with oligodontia (≥6 missing teeth) had qualitative mutations, but there was no association with OI type, gender, or presence of DGI. The prevalence of tooth agenesis is high (17%) in individuals with OI, and OI caused by a qualitative collagen I mutation is associated with oligodontia. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Tenascin-X, Collagen, Elastin and the Ehlers-Danlos Syndrome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bristow, James; Carey, William; Schalkwijk, Joost
2005-08-31
Tenascin-X is an extracellular matrix protein initially identified because of its overlap with the human CYP21B gene. Because studies of gene and protein function of other tenascins had been poorly predictive of essential functions in vivo, we used a genetic approach that critically relied on an understanding of the genomic locus to uncover an association between inactivating tenascin-X mutations and novel recessive and dominant forms of Ehlers-Danlos syndrome. Tenascin-X provides the first example of a gene outside of the fibrillar collagens and their processing enzymes that causes Ehlers-Danlos syndrome. Tenascin-X null mice recapitulate the skin findings of the human disease,more » confirming a causative role for this gene in Ehlers-Danlos syndrome. Further evaluation of these mice showed that tenascin-X is an important regulator of collagen deposition in vivo, suggesting a novel mechanism of disease in this form of Ehlers-Danlos syndrome. Further studies suggest that tenascin-X may do this through both direct and indirect interactions with the collagen fibril. Recent studies show that TNX effects on matrix extend beyond the collagen to the elastogenic pathway and matrix remodeling enzymes. Tenascin-X serves as a compelling example of how human experiments of nature can guide us to an understanding of genes whose function may not be evident from their sequence or in vitro studies of their encoded proteins.« less
Wu, Ying; Liu, Fang; Yang, Dai-Gang; Li, Wei; Zhou, Xiao-Jian; Pei, Xiao-Yu; Liu, Yan-Gai; He, Kun-Lun; Zhang, Wen-Sheng; Ren, Zhong-Ying; Zhou, Ke-Hai; Ma, Xiong-Feng; Li, Zhong-Hu
2018-01-01
Cotton is one of the most economically important fiber crop plants worldwide. The genus Gossypium contains a single allotetraploid group (AD) and eight diploid genome groups (A–G and K). However, the evolution of repeat sequences in the chloroplast genomes and the phylogenetic relationships of Gossypium species are unclear. Thus, we determined the variations in the repeat sequences and the evolutionary relationships of 40 cotton chloroplast genomes, which represented the most diverse in the genus, including five newly sequenced diploid species, i.e., G. nandewarense (C1-n), G. armourianum (D2-1), G. lobatum (D7), G. trilobum (D8), and G. schwendimanii (D11), and an important semi-wild race of upland cotton, G. hirsutum race latifolium (AD1). The genome structure, gene order, and GC content of cotton species were similar to those of other higher plant plastid genomes. In total, 2860 long sequence repeats (>10 bp in length) were identified, where the F-genome species had the largest number of repeats (G. longicalyx F1: 108) and E-genome species had the lowest (G. stocksii E1: 53). Large-scale repeat sequences possibly enrich the genetic information and maintain genome stability in cotton species. We also identified 10 divergence hotspot regions, i.e., rpl33-rps18, psbZ-trnG (GCC), rps4-trnT (UGU), trnL (UAG)-rpl32, trnE (UUC)-trnT (GGU), atpE, ndhI, rps2, ycf1, and ndhF, which could be useful molecular genetic markers for future population genetics and phylogenetic studies. Site-specific selection analysis showed that some of the coding sites of 10 chloroplast genes (atpB, atpE, rps2, rps3, petB, petD, ccsA, cemA, ycf1, and rbcL) were under protein sequence evolution. Phylogenetic analysis based on the whole plastomes suggested that the Gossypium species grouped into six previously identified genetic clades. Interestingly, all 13 D-genome species clustered into a strong monophyletic clade. Unexpectedly, the cotton species with C, G, and K-genomes were admixed and nested in a large clade, which could have been due to their recent radiation, incomplete lineage sorting, and introgression hybridization among different cotton lineages. In conclusion, the results of this study provide new insights into the evolution of repeat sequences in chloroplast genomes and interspecific relationships in the genus Gossypium. PMID:29619041
Liu, Qian; Xu, Xue-Nian; Zhou, Yan; Cheng, Na; Dong, Yu-Ting; Zheng, Hua-Jun; Zhu, Yong-Qiang; Zhu, Yong-Qiang
2013-08-01
To find and clone new antigen genes from the lambda-ZAP cDNA expression library of adult Clonorchis sinensis, and determine the immunological characteristics of the recombinant proteins. The cDNA expression library of adult C. sinensis was screened by pooled sera of clonorchiasis patients. The sequences of the positive phage clones were compared with the sequences in EST database, and the full-length sequence of the gene (Cs22 gene) was obtained by RT-PCR. cDNA fragments containing 2 and 3 times tandem repeat sequences were generated by jumping PCR. The sequence encoding the mature peptide or the tandem repeat sequence was respectively cloned into the prokaryotic expression vector pET28a (+), and then transformed into E. coli Rosetta DE3 cells for expression. The recombinant proteins (rCs22-2r, rCs22-3r, rCs22M-2r, and rCs22M-3r) were purified by His-bind-resin (Ni-NTA) affinity chromatography. The immunogenicity of rCs22-2r and rCs22-3r was identified by ELISA. To evaluate the immunological diagnostic value of rCs22-2r and rCs22-3r, serum samples from 35 clonorchiasis patients, 31 healthy individuals, 15 schistosomiasis patients, 15 paragonimiasis westermani patients and 13 cysticercosis patients were examined by ELISA. To locate antigenic determinants, the pooled sera of clonorchiasis patients and healthy persons were analyzed for specific antibodies by ELISA with recombinant protein rCs22M-2r and rCs22M-3r containing the tandem repeat sequences. The full-length sequence of Cs22 antigen gene of C. sinensis was obtained. It contained 13 times tandem repeat sequences of EQQDGDEEGMGGDGGRGKEKGKVEGEDGAGEQKEQA. Bioinformatics analysis indicated that the protein (Cs22) belonged to GPI-anchored proteins family. The recombinant proteins rCs22-2r and rCs22-3r showed a certain level of immunogenicity. The positive rate by ELISA coated with the purified PrCs22-2r and PrCs22-3r for sera of clonorchiasis patients both were 45.7% (16/35), and 3.2% (1/31) for those of healthy persons. There was no cross reaction with sera of schistosomiasis and cysticercosis patients. The cross reaction with sera of paragonimiasis westermani patients was 1/15. The recombinant proteins rCs22M-2r and rCs22M-3r which only contained tandem repeats were specifically recognized by pooled sera of clonorchiasis patients. The Cs22 antigen gene of Clonorchis sinensis is obtained, and the recombinant proteins have certain diagnostic value. The antigenic determinant is located in tandem repeat sequences.
Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence.
Maheshwari, Shamoni; Ishii, Takayoshi; Brown, C Titus; Houben, Andreas; Comai, Luca
2017-03-01
During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays , although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. © 2017 Maheshwari et al.; Published by Cold Spring Harbor Laboratory Press.
Structural analysis of two length variants of the rDNA intergenic spacer from Eruca sativa.
Lakshmikumaran, M; Negi, M S
1994-03-01
Restriction enzyme analysis of the rRNA genes of Eruca sativa indicated the presence of many length variants within a single plant and also between different cultivars which is unusual for most crucifers studied so far. Two length variants of the rDNA intergenic spacer (IGS) from a single individual E. sativa (cv. Itsa) plant were cloned and characterized. The complete nucleotide sequences of both the variants (3 kb and 4 kb) were determined. The intergenic spacer contains three families of tandemly repeated DNA sequences denoted as A, B and C. However, the long (4 kb) variant shows the presence of an additional repeat, denoted as D, which is a duplication of a 224 bp sequence just upstream of the putative transcription initiation site. Repeat units belonging to the three different families (A, B and C) were in the size range of 22 to 30 bp. Such short repeat elements are present in the IGS of most of the crucifers analysed so far. Sequence analysis of the variants (3 kb and 4 kb) revealed that the length heterogeneity of the spacer is located at three different regions and is due to the varying copy numbers of repeat units belonging to families A and B. Length variation of the spacer is also due to the presence of a large duplication (D repeats) in the 4 kb variant which is absent in the 3 kb variant. The putative transcription initiation site was identified by comparisons with the rDNA sequences from other plant species.
White, J H; Johnson, A L; Lowndes, N F; Johnston, L H
1991-01-01
By fusing the CDC9 structural gene to the PGK upstream sequences and the CDC9 upstream to lacZ, we showed that the cell cycle expression of CDC9 is largely due to transcriptional regulation. To investigate the role of six ATGATT upstream repeats in CDC9 regulation, synthetic copies of the sequence were attached to a heterologous gene. The repeats stimulated transcription strongly and additively, but, unlike conventional yeast UAS elements, only when present in one orientation. Transcription driven by the repeats declines in cells held at START of the cell cycle or in stationary phase, as occurs with CDC9. However, the repeats by themselves cannot impart cell cycle regulation to a heterologous gene. CDC9 may therefore be controlled by an activating system operating through the repeats that is sensitive to cellular proliferation and a separate mechanism that governs the periodic expression in the cell cycle. Images PMID:1901644
Choi, Yoon Jung; Lee, Jue Yeon; Lee, Seung Jin; Chung, Chong-Pyoung; Park, Yoon Jeong
2012-03-09
Bone sialoprotein (BSP) is a mineralized, tissue-specific, non-collagenous protein that is normally expressed only in mineralized tissues such as bone, dentin, cementum, and calcified cartilage, and at sites of new mineral formation. The binding of BSP to collagen is thought to be important for initiating bone mineralization and bone cell adhesion to the mineralized matrix. Several recent studies have isolated stem cells from muscle tissue, but their functional properties are still unclear. In this study, we examined the effects of a synthetic collagen-binding peptide (CBP) on the differentiation efficiency of muscle-derived stem cells (MDSCs). The CBP sequence (NGVFKYRPRYYLYKHAYFYPHLKRFPVQ) corresponds to residues 35-62 of bone sialoprotein (BSP), which are located within the collagen-binding domain in BSP. Interestingly, this synthetic CBP inhibited adipogenic differentiation but increased osteogenic differentiation in MDSCs. The CBP also induced expression of osteoblastic marker proteins, including alkaline phosphatase (ALP), type I collagen, Runt-related transcription factor 2 (Runx2), and osteocalcin; prevented adipogenic differentiation in MDSCs; and down-regulated adipose-specific mRNAs, such as adipocyte protein 2 (aP2) and peroxisome proliferator-activated receptor γ. The CBP increased Extracellular signal-regulated kinases (ERK) 1/2 protein phosphorylation, which is important in lineage determination. These observations suggest that this CBP determines the osteogenic or adipogenic lineage in MDSCs by activating ERK1/2. Taken together, a novel CBP could be a useful candidate for regenerating bone and treating osteoporosis, which result from an imbalance in osteogenesis and adipogenesis differentiation. Copyright © 2012 Elsevier Inc. All rights reserved.
In Vitro Expansion of CAG, CAA, and Mixed CAG/CAA Repeats.
Figura, Grzegorz; Koscianska, Edyta; Krzyzosiak, Wlodzimierz J
2015-08-11
Polyglutamine diseases, including Huntington's disease and a number of spinocerebellar ataxias, are caused by expanded CAG repeats that are located in translated sequences of individual, functionally-unrelated genes. Only mutant proteins containing polyglutamine expansions have long been thought to be pathogenic, but recent evidence has implicated mutant transcripts containing long CAG repeats in pathogenic processes. The presence of two pathogenic factors prompted us to attempt to distinguish the effects triggered by mutant protein from those caused by mutant RNA in cellular models of polyglutamine diseases. We used the SLIP (Synthesis of Long Iterative Polynucleotide) method to generate plasmids expressing long CAG repeats (forming a hairpin structure), CAA-interrupted CAG repeats (forming multiple unstable hairpins) or pure CAA repeats (not forming any secondary structure). We successfully modified the original SLIP protocol to generate repeats of desired length starting from constructs containing short repeat tracts. We demonstrated that the SLIP method is a time- and cost-effective approach to manipulate the lengths of expanded repeat sequences.
Collagen fibre characterisation in arterial tissue under load using SALS.
Gaul, R T; Nolan, D R; Lally, C
2017-11-01
The collagen fibre architecture of arterial tissue is known to play a key role in its resultant mechanical behaviour, while maladaptive remodelling of this architecture may be linked to disease. Many of the techniques currently used to analyse collagen fibre architecture require time consuming tissue preparation procedures and are destructive in nature. The aim of this study is to fully explore Small Angle Light Scattering (SALS) as a means to non-destructively assess collagen fibre architecture in arterial tissue and subsequently gain insights into load induced reorientation. The optimised configuration of the SALS system for arterial tissue was determined using quantitative comparisons to histological analyses of porcine carotid artery as its basis. Once established, layer specific fibre orientation and the influence of tissue loading was determined for thin sections of carotid artery using SALS. This process was subsequently repeated for intact carotid artery layers. A single family of circumferentially orientated collagen fibres were found in the intima (- 0.1 ± 1.4° (5.5°)) and media (- 1.7 ± 1.9° (4.7°)) while two perpendicular families of fibres were identified in the adventitia (- 6.4 ± 0.7° (37.7°)) and (118.3 ± 2.7 (39.9°)). An increase in fibre alignment in response to a 20% circumferential strain was also identified using SALS, characterised by an increase in scattered light eccentricity. determined using SALS agreed with those found using traditional destructive techniques, however SALS has the important benefits of allowing vessel layers to remain intact, and has a fast processing time. SALS unique ability to identify load induced reorganisation in intact arterial layers offers an efficient means to gain crucial insights into arterial disease and its development over time. Copyright © 2017 Elsevier Ltd. All rights reserved.
Lee, Christine K; Mokhtari, Tara; Connolly, Ian D; Li, Gordon; Shuer, Lawrence M; Chang, Steven D; Steinberg, Gary K; Hayden Gephart, Melanie
2017-12-01
Posterior fossa decompression surgeries for Chiari malformations are susceptible to postoperative complications such as pseudomeningocele, external cerebrospinal fluid (CSF) leak, and meningitis. Various dural substitutes have been used to improve surgical outcomes. This study examined whether the collagen matrix dural substitute type correlated with the incidence of postoperative complications after posterior fossa decompression in adult patients with Chiari I malformations. A retrospective cohort study was conducted of 81 adult patients who underwent an elective decompressive surgery for treatment of symptomatic Chiari I malformations, with duraplasty involving a dural substitute derived from either bovine or porcine collagen matrix. Demographics and treatment characteristics were correlated with surgical outcomes. A total of 81 patients were included in the study. Compared with bovine dural substitute, porcine dural substitute was associated with a significantly higher risk of pseudomeningocele occurrence (odds ratio, 5.78; 95% confidence interval, 1.65-27.15; P = 0.01) and a higher overall complication rate (odds ratio, 3.70; 95% confidence interval, 1.23-12.71; P = 0.03) by univariate analysis. There was no significant difference in the rate of meningitis, repeat operations, or overall complication rate between the 2 dural substitutes. In addition, estimated blood loss was a significant risk factor for meningitis (P = 0.03). Multivariate analyses again showed that porcine dural substitute was associated with pseudomeningocele occurrence, although the association with higher overall complication rate did not reach significance. Dural substitutes generated from porcine collagen, compared with those from bovine collagen, were associated with a higher likelihood of pseudomeningocele development in adult patients undergoing Chiari I malformation decompression and duraplasty. Copyright © 2017 Elsevier Inc. All rights reserved.
A simplified approach to quasi-linear viscoelastic modeling
Nekouzadeh, Ali; Pryse, Kenneth M.; Elson, Elliot L.; Genin, Guy M.
2007-01-01
The fitting of quasi-linear viscoelastic (QLV) constitutive models to material data often involves somewhat cumbersome numerical convolution. A new approach to treating quasi-linearity in one dimension is described and applied to characterize the behavior of reconstituted collagen. This approach is based on a new principle for including nonlinearity and requires considerably less computation than other comparable models for both model calibration and response prediction, especially for smoothly applied stretching. Additionally, the approach allows relaxation to adapt with the strain history. The modeling approach is demonstrated through tests on pure reconstituted collagen. Sequences of “ramp-and-hold” stretching tests were applied to rectangular collagen specimens. The relaxation force data from the “hold” was used to calibrate a new “adaptive QLV model” and several models from literature, and the force data from the “ramp” was used to check the accuracy of model predictions. Additionally, the ability of the models to predict the force response on a reloading of the specimen was assessed. The “adaptive QLV model” based on this new approach predicts collagen behavior comparably to or better than existing models, with much less computation. PMID:17499254
Fan, Guangyi; Jiao, Yu; Zhang, He; Huang, Ronglian; Zheng, Zhe; Bian, Chao; Deng, Yuewen; Wang, Qingheng; Wang, Zhongduo; Liang, Xinming; Liang, Haiying; Shi, Chengcheng; Zhao, Xiaoxia; Sun, Fengming; Hao, Ruijuan; Bai, Jie; Liu, Jialiang; Chen, Wenbin; Liang, Jinlian; Liu, Weiqing; Xu, Zhe; Shi, Qiong; Xu, Xun
2017-01-01
Abstract Nacre, the iridescent material found in pearls and shells of molluscs, is formed through an extraordinary process of matrix-assisted biomineralization. Despite recent advances, many aspects of the biomineralization process and its evolutionary origin remain unknown. The pearl oyster Pinctada fucata martensii is a well-known master of biomineralization, but the molecular mechanisms that underlie its production of shells and pearls are not fully understood. We sequenced the highly polymorphic genome of the pearl oyster and conducted multi-omic and biochemical studies to probe nacre formation. We identified a large set of novel proteins participating in matrix-framework formation, many in expanded families, including components similar to that found in vertebrate bones such as collagen-related VWA-containing proteins, chondroitin sulfotransferases, and regulatory elements. Considering that there are only collagen-based matrices in vertebrate bones and chitin-based matrices in most invertebrate skeletons, the presence of both chitin and elements of collagen-based matrices in nacre suggests that elements of chitin- and collagen-based matrices have deep roots and might be part of an ancient biomineralizing matrix. Our results expand the current shell matrix-framework model and provide new insights into the evolution of diverse biomineralization systems. PMID:28873964
Isolation and biochemical characterisation of a novel collagen from Catostylus tagi.
Calejo, M T; Morais, Z B; Fernandes, A I
2009-01-01
A preliminary biochemical approach to the study of collagen isolated from the medusa Catostylus tagi is reported and results are discussed in view of its use as a natural matrix for biomedical applications. Collagen from the jellyfish umbrella was isolated by pepsin digestion and purified by dialysis and salt precipitation. As expected, glycine represented almost one-third of the total amino acids. Aromatic amino-acid content was very low and imino acids were fewer than in collagens from fish and mammalian sources. Results from SDS-PAGE, ion-exchange chromatography and N-terminal amino-acid sequencing revealed an alpha1alpha2alpha3 heterotrimer, similar to vertebrate type V/XI. The molecular mass of two of the polypeptide chains was close to 85 kDa and 100 kDa for the third. However, the two chains presenting similar molecular mass, showed differences in charge and primary structure. Further characterisation showed a glycosylated protein with the carbohydrate moiety comprising almost 7% of the total mass, a denaturation temperature of 29.9 degrees C and multiple isoelectric points. Incubation with glutamyl endopeptidase resulted in significant digestion, in agreement with the protein's high content of Asp and Glu.
Guizard, Sébastien; Piégu, Benoît; Arensburger, Peter; Guillou, Florian; Bigot, Yves
2016-08-19
The program RepeatMasker and the database Repbase-ISB are part of the most widely used strategy for annotating repeats in animal genomes. They have been used to show that avian genomes have a lower repeat content (8-12 %) than the sequenced genomes of many vertebrate species (30-55 %). However, the efficiency of such a library-based strategies is dependent on the quality and completeness of the sequences in the database that is used. An alternative to these library based methods are methods that identify repeats de novo. These alternative methods have existed for a least a decade and may be more powerful than the library based methods. We have used an annotation strategy involving several complementary de novo tools to determine the repeat content of the model genome galGal4 (1.04 Gbp), including identifying simple sequence repeats (SSRs), tandem repeats and transposable elements (TEs). We annotated over one Gbp. of the galGal4 genome and showed that it is composed of approximately 19 % SSRs and TEs repeats. Furthermore, we estimate that the actual genome of the red jungle fowl contains about 31-35 % repeats. We find that library-based methods tend to overestimate TE diversity. These results have a major impact on the current understanding of repeats distributions throughout chromosomes in the red jungle fowl. Our results are a proof of concept of the reliability of using de novo tools to annotate repeats in large animal genomes. They have also revealed issues that will need to be resolved in order to develop gold-standard methodologies for annotating repeats in eukaryote genomes.
Rational design of alpha-helical tandem repeat proteins with closed architectures
Doyle, Lindsey; Hallinan, Jazmine; Bolduc, Jill; Parmeggiani, Fabio; Baker, David; Stoddard, Barry L.; Bradley, Philip
2015-01-01
Tandem repeat proteins, which are formed by repetition of modular units of protein sequence and structure, play important biological roles as macromolecular binding and scaffolding domains, enzymes, and building blocks for the assembly of fibrous materials1,2. The modular nature of repeat proteins enables the rapid construction and diversification of extended binding surfaces by duplication and recombination of simple building blocks3,4. The overall architecture of tandem repeat protein structures – which is dictated by the internal geometry and local packing of the repeat building blocks – is highly diverse, ranging from extended, super-helical folds that bind peptide, DNA, and RNA partners5–9, to closed and compact conformations with internal cavities suitable for small molecule binding and catalysis10. Here we report the development and validation of computational methods for de novo design of tandem repeat protein architectures driven purely by geometric criteria defining the inter-repeat geometry, without reference to the sequences and structures of existing repeat protein families. We have applied these methods to design a series of closed alpha-solenoid11 repeat structures (alpha-toroids) in which the inter-repeat packing geometry is constrained so as to juxtapose the N- and C-termini; several of these designed structures have been validated by X-ray crystallography. Unlike previous approaches to tandem repeat protein engineering12–20, our design procedure does not rely on template sequence or structural information taken from natural repeat proteins and hence can produce structures unlike those seen in nature. As an example, we have successfully designed and validated closed alpha-solenoid repeats with a left-handed helical architecture that – to our knowledge – is not yet present in the protein structure database21. PMID:26675735
Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai
2017-01-01
The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.
Wang, Q Z; Huang, M; Downie, S R; Chen, Z X
2016-05-23
Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes
Huang, Yongjie; Mrázek, Jan
2014-01-01
Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877
Simple sequence repeat markers that identify Claviceps species and strains
USDA-ARS?s Scientific Manuscript database
Claviceps purpurea is a pathogen that infects most members of the Pooideae subfamily and causes ergot, a floral disease in which the ovary is replaced with a sclerotium. This study was initiated to develop Simple Sequence Repeat (SSRs) markers for rapid identification of C. purpurea. SSRs were desi...
Biological sequence compression algorithms.
Matsumoto, T; Sadakane, K; Imai, H
2000-01-01
Today, more and more DNA sequences are becoming available. The information about DNA sequences are stored in molecular biology databases. The size and importance of these databases will be bigger and bigger in the future, therefore this information must be stored or communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. The standard compression algorithms such as gzip or compress cannot compress DNA sequences, but only expand them in size. On the other hand, CTW (Context Tree Weighting Method) can compress DNA sequences less than two bits per symbol. These algorithms do not use special structures of biological sequences. Two characteristic structures of DNA sequences are known. One is called palindromes or reverse complements and the other structure is approximate repeats. Several specific algorithms for DNA sequences that use these structures can compress them less than two bits per symbol. In this paper, we improve the CTW so that characteristic structures of DNA sequences are available. Before encoding the next symbol, the algorithm searches an approximate repeat and palindrome using hash and dynamic programming. If there is a palindrome or an approximate repeat with enough length then our algorithm represents it with length and distance. By using this preprocessing, a new program achieves a little higher compression ratio than that of existing DNA-oriented compression algorithms. We also describe new compression algorithm for protein sequences.
Psychosocial aspects of osteogenesis imperfecta.
Shea-Landry, G L; Cole, D E
1986-01-01
Osteogenesis imperfecta is a heterogeneous group of inherited disorders characterized by bone fragility and recurrent fractures. It is currently classified into four types on clinical grounds and appears to arise from different disorders of bone collagen synthesis. The biochemical identification of disturbances in collagen metabolism and the genetic delineation of new mutations of collagen genes have made prenatal diagnosis by molecular methods feasible in some cases. Most people with osteogenesis imperfecta suffer frequent fractures (and sometimes consequent serious disability), for which there are few effective preventive measures. This disorder may have a profound psychosocial influence on patients and their families. In this report the extent of this influence is reviewed and aspects important to the medical community are highlighted; these include the emotional burdens imposed by unfounded suspicions of child abuse, the social and financial costs of repeated hospitalization and immobility, and the frustrations generated by the lack of helpful, practical information for families and health care workers. An important social outcome has been the rise of self-help organizations, exemplified by the Canadian Osteogenesis Imperfecta Society. For Canadian families the society has been an important vehicle for exchange of information and an active, positive response to a lifelong, often severely disabling disorder. PMID:3756737
Exploring Molecular and Mechanical Gradients in Structural Bioscaffolds†
Waite, J. Herbert; Lichtenegger, Helga C.; Stucky, Galen D.; Hansma, Paul
2007-01-01
Most organisms consist of a functionally adaptive assemblage of hard and soft tissues. Despite the obvious advantages of reinforcing soft protoplasm with a hard scaffold, such composites can lead to tremendous mechanical stresses where the two meet. Although little is known about how nature relieves these stresses, it is generally agreed that fundamental insights about molecular adaptation at hard/soft interfaces could profoundly influence how we think about biomaterials. Based on two noncellular tissues, mussel byssus and polychaete jaws, recent studies suggest that one natural strategy to minimize interfacial stresses between adjoining stiff and soft tissue appears to be the creation of a “fuzzy” boundary, which avoids abrupt changes in mechanical properties. Instead there is a gradual mechanical change that accompanies the transcendence from stiff to soft and vice versa. In byssal threads, the biochemical medium for achieving such a gradual mechanical change involves the elegant use of collagen-based self-assembling block copolymers. There are three distinct diblock copolymer types in which one block is always collagenous, whereas the other can be either elastin-like (soft), amorphous polyglycine (intermediate), or silk-like (stiff). Gradients of these are made by an incrementally titrated expression of the three proteins in secretory cells the titration phenotype of which is linked to their location. Thus, reflecting exactly the composition of each thread, the distal cells secrete primarily the silk– and polyglycine–collagen diblocks, whereas the proximal cells secrete the elastin– and polyglycine–collagen diblocks. Those cells in between exhibit gradations of collagens with silk or elastin blocks. Spontaneous self-assembly appears to be by pH triggered metal binding by histidine (HIS)-rich sequences at both the amino and carboxy termini of the diblocks. In the polychaete jaws, HIS-rich sequences are expanded into a major block domain. Histidine predominates at over 20 mol % near the distal tip and diminishes to about 5 mol % near the proximal base. The abundance of histidine is directly correlated to transition metal content (Zn or Cu) as well as hardness determined by nanoindentation. EXAFS analyses of the jaws indicate that transition metals such as Zn are directly bound to histidine ligands and may serve as cross-linkers. PMID:15196007
Alu repeats: A source for the genesis of primate microsatellites
DOE Office of Scientific and Technical Information (OSTI.GOV)
Arcot, S.S.; Batzer, M.A.; Wang, Zhenyuan
1995-09-01
As a result of their abundance, relatively uniform distribution, and high degree of polymorphism, microsatellites and minisatellites have become valuable tools in genetic mapping, forensic identity testing, and population studies. In recent years, a number of microsatellite repeats have been found to be associated with Alu interspersed repeated DNA elements. The association of an Alu element with a microsatellite repeat could result from the integration of an Alu element within a preexisting microsatellite repeat. Alternatively, Alu elements could have a direct role in the origin of microsatellite repeats. Errors introduced during reverse transcription of the primary transcript derived from anmore » Alu {open_quotes}master{close_quote} gene or the accumulation of random mutations in the middle A-rich regions and oligo(dA)-rich tails of Alu elements after insertion and subsequent expansion and contraction of these sequences could result in the genesis of a microsatellite repeat. We have tested these hypotheses by a direct evolutionary comparison of the sequences of some recent Alu elements that are found only in humans and are absent from nonhuman primates, as well as some older Alu elements that are present at orthologous positions in a number of nonhuman primates. The origin of {open_quotes}young{close_quotes} Alu insertions, absence of sequences that resemble microsatellite repeats at the orthologous loci in chimpanzees, and the gradual expansion of microsatellite repeats in some old Alu repeats at orthologous positions within the genomes of a number of nonhuman primates suggest that Alu elements are a source for the genesis of primate microsatellite repeats. 48 refs., 5 figs., 3 tabs.« less
Short-Sequence DNA Repeats in Prokaryotic Genomes
van Belkum, Alex; Scherer, Stewart; van Alphen, Loek; Verbrugh, Henri
1998-01-01
Short-sequence DNA repeat (SSR) loci can be identified in all eukaryotic and many prokaryotic genomes. These loci harbor short or long stretches of repeated nucleotide sequence motifs. DNA sequence motifs in a single locus can be identical and/or heterogeneous. SSRs are encountered in many different branches of the prokaryote kingdom. They are found in genes encoding products as diverse as microbial surface components recognizing adhesive matrix molecules and specific bacterial virulence factors such as lipopolysaccharide-modifying enzymes or adhesins. SSRs enable genetic and consequently phenotypic flexibility. SSRs function at various levels of gene expression regulation. Variations in the number of repeat units per locus or changes in the nature of the individual repeat sequences may result from recombination processes or polymerase inadequacy such as slipped-strand mispairing (SSM), either alone or in combination with DNA repair deficiencies. These rather complex phenomena can occur with relative ease, with SSM approaching a frequency of 10−4 per bacterial cell division and allowing high-frequency genetic switching. Bacteria use this random strategy to adapt their genetic repertoire in response to selective environmental pressure. SSR-mediated variation has important implications for bacterial pathogenesis and evolutionary fitness. Molecular analysis of changes in SSRs allows epidemiological studies on the spread of pathogenic bacteria. The occurrence, evolution and function of SSRs, and the molecular methods used to analyze them are discussed in the context of responsiveness to environmental factors, bacterial pathogenicity, epidemiology, and the availability of full-genome sequences for increasing numbers of microorganisms, especially those that are medically relevant. PMID:9618442
2014-03-01
then locks into the microscope stage for extreme stability. Extremely stable intravital images can then be collected with nearly no breathing...Szulczewski, PJ Keely, KW Eliceiri. Novel Intravital Imaging Approaches to Characterize Collagen Alignment in Defined Mammary Tumor Models. Microscopy and...repeated 3 times on different days. 13 Figure 5: New fixturing for intravital FLIM imaging through a rodent mammary imaging window. Stage is raised
NASA Technical Reports Server (NTRS)
Landis, W. J.; Hodgens, K. J.; Arena, J.; Song, M. J.; McEwen, B. F.
1996-01-01
Aspects of the ultrastructural interaction between collagen and mineral crystals in embryonic chick bone have been examined by the novel technique of high voltage electron microscopic tomography to obtain three-dimensional information concerning extracellular calcification in this tissue. Newly mineralizing osteoid along periosteal surfaces of mid-diaphyseal regions from normal chick tibiae was embedded, cut into 0.25 microns thick sections, and documented at 1.0 MV in the Albany AEI-EM7 high voltage electron microscope. The areas of the tissue studied contained electron dense mineral crystals associated with collagen fibrils, some marked by crystals disposed along their cylindrically shaped lengths. Tomographic reconstructions of one site with two mineralizing fibrils were computed from a 5 degrees tilt series of micrographs over a +/- 60 degrees range. Reconstructions showed that the mineral crystals were platelets of irregular shape. Their sizes were variable, measured here up to 80 x 30 x 8 nm in length, width, and thickness, respectively. The longest crystal dimension, corresponding to the c-axis crystallographically, was generally parallel to the collagen fibril long axis. Individual crystals were oriented parallel to one another in each fibril examined. They were also parallel in the neighboring but apparently spatially separate fibrils. Crystals were periodically (approximately 67 nm repeat distance) arranged along the fibrils and their location appeared to correspond to collagen hole and overlap zones defined by geometrical imaging techniques. The crystals appeared to be continuously distributed along a fibril, their size and number increasing in a tapered fashion from a relatively narrow tip containing smaller and infrequent crystals to wider regions having more densely packed and larger crystals. Defined for the first time by direct visual 3D imaging, these data describe the size, shape, location, orientation, and development of early crystals in normal bone collagen. The results suggest that platelet-shaped crystals are arranged in channels or grooves which are formed by collagen hole zones in register and that crystal sizes may exceed the dimensions of hole zones. Such data agree with those from mineral-matrix interaction in normally calcifying avian tendon obtained by similar high voltage tomographic means, but in addition they indicate a possible gradual and continuous deposition of crystals in collagen of bone unlike tendon and imply that individual collagen fibrils in local regions of osteoid are organized such that they all may be aligned in a coherent manner.
Collagen like peptide bioconjugates for targeted drug delivery applications
NASA Astrophysics Data System (ADS)
Luo, Tianzhi
Collagen is the most abundant protein in mammals, and there has been long-standing interest in understanding and controlling collagen assembly in the design of new materials. Collagen-like peptides (CLP), also known as collagen-mimetic peptides (CMP), are short synthetic peptides which mimic the triple helical conformation of native collagens. In the past few decades, collagen like peptides and their conjugated hybrids have become a new class of biomaterials that possesses unique structures and properties. In addition to traditional applications of using CLPs to decipher the role of different amino acid residues and tripeptide motifs in stabilizing the collagen triple helix and mimicking collagen fibril formation, with the introduction of specific interactions including electrostatic interactions, pi-pi stacking interaction and metal-ligand coordination, a variety of artificial collagen-like peptides with well-defined sequences have been designed to create higher order assemblies with specific biological functions. The CLPs have also been widely used as bioactive domains or physical cross-linkers to fabricate hydrogels, which have shown potential to improve cell adhesion, proliferation and ECM macromolecule production. Despite this widespread use, the utilization of CLPs as domains in stimuli responsive bioconjugates represents a relatively new area for the development of functional polymeric materials. In this work, a new class of thermoresponsive diblock conjugates, containing collagen-like peptides and a thermoresponsive polymer, namely poly(diethylene glycol methyl ether methacrylate) (PDEGMEMA), is introduced. The CLP domain maintains its triple helix conformation after conjugation with the polymer. The engineered LCST of these conjugates has enabled temperature-induced assembly under aqueous conditions, at physiologically relevant temperatures, into well-defined vesicles with diameters of approximately 50-200 nm. The formation of nanostructures was driven by the coil/globule conformational transition of the PDEGMEMA building block above its LCST with stabilization of the nanostructures by the hydrophilic CLP. To the best of our knowledge, this is the first report on such assembled nanostructures from collagen-like peptide containing copolymers. Due to the strong propensity for CLPs to bind to natural collagen via strand invasion processes, these nanosized vesicles may be used as drug carriers for targeted delivery. In addition to synthetic polymers, the collagen like peptide is then conjugated with a thermoresponsive elastin-like peptide (ELP). The resulting ELP-CLP diblock conjugates show a remarkable reduction in the inverse transition temperature of the ELP domain, attributed to the anchoring effect of the CLP triple helix. The lower transition temperature of the conjugate enables facile formation of well-defined vesicles at physiological temperature and the unexpected resolubilization of the vesicles at elevated temperatures upon unfolding of the CLP domain. Given the ability of CLPs to modify collagens, this work provides not only a simple and versatile avenue for controlling the inverse transition behavior of elastin-like peptides, but also suggest future opportunities for these thermoresponsive nanostructures in biologically relevant environments. In the last section, the potential of using the ELP-CLP nanoparticles as drug delivery vehicles for targeting collagen containing matrices is evaluated. A sustained release of clinically relevant amount of encapsulated modelled drug is achieved within three weeks, followed by a thermally controlled burst release. As expected, the ELP-CLP nanoparticles show strong retention on collagen substrate, via specific binding through collagen triple helix hybridization. Additionally, cell viability and proliferation studies using fibroblasts and chondrocytes suggest the nanoparticles are non-cytotoxic. Additionally, almost no TNF-alpha expression from macrophages is observed, suggesting that the nanoparticles do not initiate inflammatory response. Endowed with specific collagen binding, controlled thermoresponsiveness, excellent cytocompatibility, and non-immune responsiveness, we believe the ELP-CLP nanoparticles are promising candidates as drug delivery vehicles for targeting collagen containing matrices. Considering the critical role of collagens in extracellular matrix and the unique ability of the CLP to target native collagens, our work offers significant opportunities for the design of collagen-like peptides and their bioconjugates for targeted application in the biomedical arena.
GATA simple sequence repeats function as enhancer blocker boundaries.
Kumar, Ram P; Krishnan, Jaya; Pratap Singh, Narendra; Singh, Lalji; Mishra, Rakesh K
2013-01-01
Simple sequence repeats (SSRs) account for ~3% of the human genome, but their functional significance still remains unclear. One of the prominent SSRs the GATA tetranucleotide repeat has preferentially accumulated in complex organisms. GATA repeats are particularly enriched on the human Y chromosome, and their non-random distribution and exclusive association with genes expressed during early development indicate their role in coordinated gene regulation. Here we show that GATA repeats have enhancer blocker activity in Drosophila and human cells. This enhancer blocker activity is seen in transgenic as well as native context of the enhancers at various developmental stages. These findings ascribe functional significance to SSRs and offer an explanation as to why SSRs, especially GATA, may have accumulated in complex organisms.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Greenspan, D.S.; Papenberg, K.A.; Marchuk, D.A.
1994-09-01
Type V collagen is the only fibrillar collagen which has yet to be implicated in the pathogenesis of genetic diseases in humans or mice. To begin examining the possible role of type V collagen in genetic disease, we have previously mapped COL5A1, the gene for the {alpha}1 chain of type V collagen, to 9q23.2{r_arrow}q34.3 and described two restriction site polymorphisms which allowed us to exclude COL5A1 as candidate gene for nail-patella syndrome. We have now used these polymorphisms to exclude COL5A1 as candidate gene for tuberous sclerosis complex 1 and Ehlers-Danlos syndrome type II. In addition, we describe a CAmore » repeat, with observed heterozygosity of about 0.5, in a COL5A1 intron, which has allowed us to exclude COL5A1 as a candidate gene in hereditary hemorrhagic telangiectasia and to place COL5A1 on the CEPH family genetic map between markers D9S66 and D9S67. We have also determined the entire intron/exon organization of COL5A1, which will facilitate characterization of mutations in genetic diseases with which COL5A1 may be linked in future studies.« less
CRISPRDetect: A flexible algorithm to define CRISPR arrays.
Biswas, Ambarish; Staals, Raymond H J; Morales, Sergio E; Fineran, Peter C; Brown, Chris M
2016-05-17
CRISPR (clustered regularly interspaced short palindromic repeats) RNAs provide the specificity for noncoding RNA-guided adaptive immune defence systems in prokaryotes. CRISPR arrays consist of repeat sequences separated by specific spacer sequences. CRISPR arrays have previously been identified in a large proportion of prokaryotic genomes. However, currently available detection algorithms do not utilise recently discovered features regarding CRISPR loci. We have developed a new approach to automatically detect, predict and interactively refine CRISPR arrays. It is available as a web program and command line from bioanalysis.otago.ac.nz/CRISPRDetect. CRISPRDetect discovers putative arrays, extends the array by detecting additional variant repeats, corrects the direction of arrays, refines the repeat/spacer boundaries, and annotates different types of sequence variations (e.g. insertion/deletion) in near identical repeats. Due to these features, CRISPRDetect has significant advantages when compared to existing identification tools. As well as further support for small medium and large repeats, CRISPRDetect identified a class of arrays with 'extra-large' repeats in bacteria (repeats 44-50 nt). The CRISPRDetect output is integrated with other analysis tools. Notably, the predicted spacers can be directly utilised by CRISPRTarget to predict targets. CRISPRDetect enables more accurate detection of arrays and spacers and its gff output is suitable for inclusion in genome annotation pipelines and visualisation. It has been used to analyse all complete bacterial and archaeal reference genomes.
Han, Yonghua; Wang, Guixiang; Liu, Zhao; Liu, Jinhua; Yue, Wei; Song, Rentao; Zhang, Xueyong; Jin, Weiwei
2010-02-01
Knowledge about the composition and structure of centromeres is critical for understanding how centromeres perform their functional roles. Here, we report the sequences of one centromere-associated bacterial artificial chromosome clone from a Coix lacryma-jobi library. Two Ty3/gypsy-class retrotransposons, centromeric retrotransposon of C. lacryma-jobi (CRC) and peri-centromeric retrotransposon of C. lacryma-jobi, and a (peri)centromere-specific tandem repeat with a unit length of 153 bp were identified. The CRC is highly homologous to centromere-specific retrotransposons reported in grass species. An 80-bp DNA region in the 153-bp satellite repeat was found to be conserved to centromeric satellite repeats from maize, rice, and pearl millet. Fluorescence in situ hybridization showed that the three repetitive sequences were located in (peri-)centromeric regions of both C. lacryma-jobi and Coix aquatica. However, the 153-bp satellite repeat was only detected on 20 out of the 30 chromosomes in C. aquatica. Immunostaining with an antibody against rice CENH3 indicates that the 153-bp satellite repeat and CRC might be both the major components for functional centromeres, but not all the 153-bp satellite repeats or CRC sequences are associated with CENH3. The evolution of centromeric repeats of C. lacryma-jobi during the polyploidization was discussed.
CRISPR Detection From Short Reads Using Partial Overlap Graphs.
Ben-Bassat, Ilan; Chor, Benny
2016-06-01
Clustered regularly interspaced short palindromic repeats (CRISPR) are structured regions in bacterial and archaeal genomes, which are part of an adaptive immune system against phages. CRISPRs are important for many microbial studies and are playing an essential role in current gene editing techniques. As such, they attract substantial research interest. The exponential growth in the amount of bacterial sequence data in recent years enables the exploration of CRISPR loci in more and more species. Most of the automated tools that detect CRISPR loci rely on fully assembled genomes. However, many assemblers do not handle repetitive regions successfully. The first tool to work directly on raw sequence data is Crass, which requires reads that are long enough to contain two copies of the same repeat. We present a method to identify CRISPR repeats from raw sequence data of short reads. The algorithm is based on an observation differentiating CRISPR repeats from other types of repeats, and it involves a series of partial constructions of the overlap graph. This enables us to avoid many of the difficulties that assemblers face, as we merely aim to identify the repeats that belong to CRISPR loci. A preliminary implementation of the algorithm shows good results and detects CRISPR repeats in cases where other existing tools fail to do so.
Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh
2014-01-01
Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1–6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of genome. A total of 534 complete chloroplast genome sequences (as on 18 September 2014) of Viridiplantae are available at NCBI organelle genome resource. It provides opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information of all the three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with majority lying in non-coding region. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) being the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ PMID:25380781
Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh
2014-01-01
Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1-6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of genome. A total of 534 complete chloroplast genome sequences (as on 18 September 2014) of Viridiplantae are available at NCBI organelle genome resource. It provides opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information of all the three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with majority lying in non-coding region. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) being the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ © The Author(s) 2014. Published by Oxford University Press.
Zhu, H; Senalik, D; McCown, B H; Zeldin, E L; Speers, J; Hyman, J; Bassil, N; Hummer, K; Simon, P W; Zalapa, J E
2012-01-01
The American cranberry (Vaccinium macrocarpon Ait.) is a major commercial fruit crop in North America, but limited genetic resources have been developed for the species. Furthermore, the paucity of codominant DNA markers has hampered the advance of genetic research in cranberry and the Ericaceae family in general. Therefore, we used Roche 454 sequencing technology to perform low-coverage whole genome shotgun sequencing of the cranberry cultivar 'HyRed'. After de novo assembly, the obtained sequence covered 266.3 Mb of the estimated 540-590 Mb in cranberry genome. A total of 107,244 SSR loci were detected with an overall density across the genome of 403 SSR/Mb. The AG repeat was the most frequent motif in cranberry accounting for 35% of all SSRs and together with AAG and AAAT accounted for 46% of all loci discovered. To validate the SSR loci, we designed 96 primer-pairs using contig sequence data containing perfect SSR repeats, and studied the genetic diversity of 25 cranberry genotypes. We identified 48 polymorphic SSR loci with 2-15 alleles per locus for a total of 323 alleles in the 25 cranberry genotypes. Genetic clustering by principal coordinates and genetic structure analyzes confirmed the heterogeneous nature of cranberries. The parentage composition of several hybrid cultivars was evident from the structure analyzes. Whole genome shotgun 454 sequencing was a cost-effective and efficient way to identify numerous SSR repeats in the cranberry sequence for marker development.
Target Site Recognition by a Diversity-Generating Retroelement
Guo, Huatao; Tse, Longping V.; Nieh, Angela W.; Czornyj, Elizabeth; Williams, Steven; Oukil, Sabrina; Liu, Vincent B.; Miller, Jeff F.
2011-01-01
Diversity-generating retroelements (DGRs) are in vivo sequence diversification machines that are widely distributed in bacterial, phage, and plasmid genomes. They function to introduce vast amounts of targeted diversity into protein-encoding DNA sequences via mutagenic homing. Adenine residues are converted to random nucleotides in a retrotransposition process from a donor template repeat (TR) to a recipient variable repeat (VR). Using the Bordetella bacteriophage BPP-1 element as a prototype, we have characterized requirements for DGR target site function. Although sequences upstream of VR are dispensable, a 24 bp sequence immediately downstream of VR, which contains short inverted repeats, is required for efficient retrohoming. The inverted repeats form a hairpin or cruciform structure and mutational analysis demonstrated that, while the structure of the stem is important, its sequence can vary. In contrast, the loop has a sequence-dependent function. Structure-specific nuclease digestion confirmed the existence of a DNA hairpin/cruciform, and marker coconversion assays demonstrated that it influences the efficiency, but not the site of cDNA integration. Comparisons with other phage DGRs suggested that similar structures are a conserved feature of target sequences. Using a kanamycin resistance determinant as a reporter, we found that transplantation of the IMH and hairpin/cruciform-forming region was sufficient to target the DGR diversification machinery to a heterologous gene. In addition to furthering our understanding of DGR retrohoming, our results suggest that DGRs may provide unique tools for directed protein evolution via in vivo DNA diversification. PMID:22194701
Tran, Trung D; Cao, Hieu X; Jovtchev, Gabriele; Neumann, Pavel; Novák, Petr; Fojtová, Miloslava; Vu, Giang T H; Macas, Jiří; Fajkus, Jiří; Schubert, Ingo; Fuchs, Joerg
2015-12-01
Linear chromosomes of eukaryotic organisms invariably possess centromeres and telomeres to ensure proper chromosome segregation during nuclear divisions and to protect the chromosome ends from deterioration and fusion, respectively. While centromeric sequences may differ between species, with arrays of tandemly repeated sequences and retrotransposons being the most abundant sequence types in plant centromeres, telomeric sequences are usually highly conserved among plants and other organisms. The genome size of the carnivorous genus Genlisea (Lentibulariaceae) is highly variable. Here we study evolutionary sequence plasticity of these chromosomal domains at an intrageneric level. We show that Genlisea nigrocaulis (1C = 86 Mbp; 2n = 40) and G. hispidula (1C = 1550 Mbp; 2n = 40) differ as to their DNA composition at centromeres and telomeres. G. nigrocaulis and its close relative G. pygmaea revealed mainly 161 bp tandem repeats, while G. hispidula and its close relative G. subglabra displayed a combination of four retroelements at centromeric positions. G. nigrocaulis and G. pygmaea chromosome ends are characterized by the Arabidopsis-type telomeric repeats (TTTAGGG); G. hispidula and G. subglabra instead revealed two intermingled sequence variants (TTCAGG and TTTCAGG). These differences in centromeric and, surprisingly, also in telomeric DNA sequences, uncovered between groups with on average a > 9-fold genome size difference, emphasize the fast genome evolution within this genus. Such intrageneric evolutionary alteration of telomeric repeats with cytosine in the guanine-rich strand, not yet known for plants, might impact the epigenetic telomere chromatin modification. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
Saski, Christopher; Lee, Seung-Bum; Fjellheim, Siri; Guda, Chittibabu; Jansen, Robert K.; Luo, Hong; Tomkins, Jeffrey; Rognli, Odd Arne; Clarke, Jihong Liu
2009-01-01
Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5′ end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Erhartoideae and Pooideae. Repeat analysis identified 19–37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16–21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C–U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Erhartoideae and Pooideae. PMID:17534593
Oggioni, M R; Claverys, J P
1999-10-01
A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.
Analysis of SINE and LINE repeat content of Y chromosomes in the platypus, Ornithorhynchus anatinus.
Kortschak, R Daniel; Tsend-Ayush, Enkhjargal; Grützner, Frank
2009-01-01
Monotremes feature an extraordinary sex-chromosome system that consists of five X and five Y chromosomes in males. These sex chromosomes share homology with bird sex chromosomes but no homology with the therian X. The genome of a female platypus was recently completed, providing unique insights into sequence and gene content of autosomes and X chromosomes, but no Y-specific sequence has so far been analysed. Here we report the isolation, sequencing and analysis of approximately 700 kb of sequence of the non-recombining regions of Y2, Y3 and Y5, which revealed differences in base composition and repeat content between autosomes and sex chromosomes, and within the sex chromosomes themselves. This provides the first insights into repeat content of Y chromosomes in platypus, which overall show similar patterns of repeat composition to Y chromosomes in other species. Interestingly, we also observed differences between the various Y chromosomes, and in combination with timing and activity patterns we provide an approach that can be used to examine the evolutionary history of the platypus sex-chromosome chain.
Handedness in shearing auxetics creates rigid and compliant structures
NASA Astrophysics Data System (ADS)
Lipton, Jeffrey Ian; MacCurdy, Robert; Manchester, Zachary; Chin, Lillian; Cellucci, Daniel; Rus, Daniela
2018-05-01
In nature, repeated base units produce handed structures that selectively bond to make rigid or compliant materials. Auxetic tilings are scale-independent frameworks made from repeated unit cells that expand under tension. We discovered how to produce handedness in auxetic unit cells that shear as they expand by changing the symmetries and alignments of auxetic tilings. Using the symmetry and alignment rules that we developed, we made handed shearing auxetics that tile planes, cylinders, and spheres. By compositing the handed shearing auxetics in a manner inspired by keratin and collagen, we produce both compliant structures that expand while twisting and deployable structures that can rigidly lock. This work opens up new possibilities in designing chemical frameworks, medical devices like stents, robotic systems, and deployable engineering structures.
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-06-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Characterization of genetic sequence variation of 58 STR loci in four major population groups.
Novroski, Nicole M M; King, Jonathan L; Churchill, Jennifer D; Seah, Lay Hong; Budowle, Bruce
2016-11-01
Massively parallel sequencing (MPS) can identify sequence variation within short tandem repeat (STR) alleles as well as their nominal allele lengths that traditionally have been obtained by capillary electrophoresis. Using the MiSeq FGx Forensic Genomics System (Illumina), STRait Razor, and in-house excel workbooks, genetic variation was characterized within STR repeat and flanking regions of 27 autosomal, 7 X-chromosome and 24 Y-chromosome STR markers in 777 unrelated individuals from four population groups. Seven hundred and forty six autosomal, 227 X-chromosome, and 324 Y-chromosome STR alleles were identified by sequence compared with 357 autosomal, 107 X-chromosome, and 189 Y-chromosome STR alleles that were identified by length. Within the observed sequence variation, 227 autosomal, 156 X-chromosome, and 112 Y-chromosome novel alleles were identified and described. One hundred and seventy six autosomal, 123 X-chromosome, and 93 Y-chromosome sequence variants resided within STR repeat regions, and 86 autosomal, 39 X-chromosome, and 20 Y-chromosome variants were located in STR flanking regions. Three markers, D18S51, DXS10135, and DYS385a-b had 1, 4, and 1 alleles, respectively, which contained both a novel repeat region variant and a flanking sequence variant in the same nucleotide sequence. There were 50 markers that demonstrated a relative increase in diversity with the variant sequence alleles compared with those of traditional nominal length alleles. These population data illustrate the genetic variation that exists in the commonly used STR markers in the selected population samples and provide allele frequencies for statistical calculations related to STR profiling with MPS data. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Pietras, D F; Bennett, K L; Siracusa, L D; Woodworth-Gutai, M; Chapman, V M; Gross, K W; Kane-Haas, C; Hastie, N D
1983-01-01
We report the construction of a small library of recombinant plasmids containing Mus musculus repetitive DNA inserts. The repetitive cloned fraction was derived from denatured genomic DNA by reassociation to a Cot value at which repetitive, but not unique, sequences have reannealed followed by exhaustive S1 nuclease treatment to degrade single stranded DNA. Initial characterizations of this library by colony filter hybridizations have led to the identification of a previously undetected M. musculus minor satellite as well as to clones containing M. musculus major satellite sequences. This new satellite is repeated 10-20 times less than the major satellite in the M. musculus genome. It has a repeat length of 130 nucleotides compared with the M. musculus major satellite with a repeat length of 234 nucleotides. Sequence analysis of the minor satellite has shown that it has a 29 base pair region with extensive homology to one of the major satellite repeating subunits. We also show by in situ hybridization that this minor satellite sequence is located at the centromeres and possibly the arms of at least half the M musculus chromosomes. Sequences related to the minor satellite have been found in the DNA of a related Mus species, Mus spretus, and may represent the major satellite of that species. Images PMID:6314268
Cho, Kwang-Soo; Yun, Bong-Kyoung; Yoon, Young-Ho; Hong, Su-Young; Mekapogu, Manjulatha; Kim, Kyung-Hee; Yang, Tae-Jin
2015-01-01
We report the chloroplast (cp) genome sequence of tartary buckwheat (Fagopyrum tataricum) obtained by next-generation sequencing technology and compared this with the previously reported common buckwheat (F. esculentum ssp. ancestrale) cp genome. The cp genome of F. tataricum has a total sequence length of 159,272 bp, which is 327 bp shorter than the common buckwheat cp genome. The cp gene content, order, and orientation are similar to those of common buckwheat, but with some structural variation at tandem and palindromic repeat frequencies and junction areas. A total of seven InDels (around 100 bp) were found within the intergenic sequences and the ycf1 gene. Copy number variation of the 21-bp tandem repeat varied in F. tataricum (four repeats) and F. esculentum (one repeat), and the InDel of the ycf1 gene was 63 bp long. Nucleotide and amino acid have highly conserved coding sequence with about 98% homology and four genes—rpoC2, ycf3, accD, and clpP—have high synonymous (Ks) value. PCR based InDel markers were applied to diverse genetic resources of F. tataricum and F. esculentum, and the amplicon size was identical to that expected in silico. Therefore, these InDel markers are informative biomarkers to practically distinguish raw or processed buckwheat products derived from F. tataricum and F. esculentum. PMID:25966355
Two new miniature inverted-repeat transposable elements in the genome of the clam Donax trunculus.
Šatović, Eva; Plohl, Miroslav
2017-10-01
Repetitive sequences are important components of eukaryotic genomes that drive their evolution. Among them are different types of mobile elements that share the ability to spread throughout the genome and form interspersed repeats. To broaden the generally scarce knowledge on bivalves at the genome level, in the clam Donax trunculus we described two new non-autonomous DNA transposons, miniature inverted-repeat transposable elements (MITEs), named DTC M1 and DTC M2. Like other MITEs, they are characterized by their small size, their A + T richness, and the presence of terminal inverted repeats (TIRs). DTC M1 and DTC M2 are 261 and 286 bp long, respectively, and in addition to TIRs, both of them contain a long imperfect palindrome sequence in their central parts. These elements are present in complete and truncated versions within the genome of the clam D. trunculus. The two new MITEs share only structural similarity, but lack any nucleotide sequence similarity to each other. In a search for related elements in databases, blast search revealed within the Crassostrea gigas genome a larger element sharing sequence similarity only to DTC M1 in its TIR sequences. The lack of sequence similarity with any previously published mobile elements indicates that DTC M1 and DTC M2 elements may be unique to D. trunculus.
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miller, Matthew T.; Higgin, Joshua J.; Hall, Traci M.Tanaka
2008-06-06
Pumilio/FBF (PUF) family proteins are found in eukaryotic organisms and regulate gene expression post-transcriptionally by binding to sequences in the 3' untranslated region of target transcripts. PUF proteins contain an RNA binding domain that typically comprises eight {alpha}-helical repeats, each of which recognizes one RNA base. Some PUF proteins, including yeast Puf4p, have altered RNA binding specificity and use their eight repeats to bind to RNA sequences with nine or ten bases. Here we report the crystal structures of Puf4p alone and in complex with a 9-nucleotide (nt) target RNA sequence, revealing that Puf4p accommodates an 'extra' nucleotide by modestmore » adaptations allowing one base to be turned away from the RNA binding surface. Using structural information and sequence comparisons, we created a mutant Puf4p protein that preferentially binds to an 8-nt target RNA sequence over a 9-nt sequence and restores binding of each protein repeat to one RNA base.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bhattacharya, Monolekha; Das, Amit Kumar, E-mail: amitk@hijli.iitkgp.ernet.in
Highlights: Black-Right-Pointing-Pointer The regulatory sequences recognized by TcrX have been identified. Black-Right-Pointing-Pointer The regulatory region comprises of inverted repeats segregated by 30 bp region. Black-Right-Pointing-Pointer The mode of binding of TcrX with regulatory sequence is unique. Black-Right-Pointing-Pointer In silico TcrX-DNA docked model binds one of the inverted repeats. Black-Right-Pointing-Pointer Both phosphorylated and unphosphorylated TcrX binds regulatory sequence in vitro. -- Abstract: TcrY, a histidine kinase, and TcrX, a response regulator, constitute a two-component system in Mycobacterium tuberculosis. tcrX, which is expressed during iron scarcity, is instrumental in the survival of iron-dependent M. tuberculosis. However, the regulator of tcrX/Y has notmore » been fully characterized. Crosslinking studies of TcrX reveal that it can form oligomers in vitro. Electrophoretic mobility shift assays (EMSAs) show that TcrX recognizes two regions in the promoter that are comprised of inverted repeats separated by {approx}30 bp. The dimeric in silico model of TcrX predicts binding to one of these inverted repeat regions. Site-directed mutagenesis and radioactive phosphorylation indicate that D54 of TcrX is phosphorylated by H256 of TcrY. However, phosphorylated and unphosphorylated TcrX bind the regulatory sequence with equal efficiency, which was shown with an EMSA using the D54A TcrX mutant.« less
Structural features of the rice chromosome 4 centromere.
Zhang, Yu; Huang, Yuchen; Zhang, Lei; Li, Ying; Lu, Tingting; Lu, Yiqi; Feng, Qi; Zhao, Qiang; Cheng, Zhukuan; Xue, Yongbiao; Wing, Rod A; Han, Bin
2004-01-01
A complete sequence of a chromosome centromere is necessary for fully understanding centromere function. We reported the sequence structures of the first complete rice chromosome centromere through sequencing a large insert bacterial artificial chromosome clone-based contig, which covered the rice chromosome 4 centromere. Complete sequencing of the 124-kb rice chromosome 4 centromere revealed that it consisted of 18 tracts of 379 tandemly arrayed repeats known as CentO and a total of 19 centromeric retroelements (CRs) but no unique sequences were detected. Four tracts, composed of 65 CentO repeats, were located in the opposite orientation, and 18 CentO tracts were flanked by 19 retroelements. The CRs were classified into four types, and the type I retroelements appeared to be more specific to rice centromeres. The preferential insert of the CRs among CentO repeats indicated that the centromere-specific retroelements may contribute to centromere expansion during evolution. The presence of three intact retrotransposons in the centromere suggests that they may be responsible for functional centromere initiation through a transcription-mediated mechanism.
Chromosome rearrangements via template switching between diverged repeated sequences
Anand, Ranjith P.; Tsaponina, Olga; Greenwell, Patricia W.; Lee, Cheng-Sheng; Du, Wei; Petes, Thomas D.
2014-01-01
Recent high-resolution genome analyses of cancer and other diseases have revealed the occurrence of microhomology-mediated chromosome rearrangements and copy number changes. Although some of these rearrangements appear to involve nonhomologous end-joining, many must have involved mechanisms requiring new DNA synthesis. Models such as microhomology-mediated break-induced replication (MM-BIR) have been invoked to explain these rearrangements. We examined BIR and template switching between highly diverged sequences in Saccharomyces cerevisiae, induced during repair of a site-specific double-strand break (DSB). Our data show that such template switches are robust mechanisms that give rise to complex rearrangements. Template switches between highly divergent sequences appear to be mechanistically distinct from the initial strand invasions that establish BIR. In particular, such jumps are less constrained by sequence divergence and exhibit a different pattern of microhomology junctions. BIR traversing repeated DNA sequences frequently results in complex translocations analogous to those seen in mammalian cells. These results suggest that template switching among repeated genes is a potent driver of genome instability and evolution. PMID:25367035
Simple sequence repeat marker loci discovery using SSR primer.
Robinson, Andrew J; Love, Christopher G; Batley, Jacqueline; Barker, Gary; Edwards, David
2004-06-12
Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. With the increase in the availability of DNA sequence information, an automated process to identify and design PCR primers for amplification of SSR loci would be a useful tool in plant breeding programs. We report an application that integrates SPUTNIK, an SSR repeat finder, with Primer3, a PCR primer design program, into one pipeline tool, SSR Primer. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. The results are parsed to Primer3 for locus-specific primer design. The script makes use of a Web-based interface, enabling remote use. This program has been written in PERL and is freely available for non-commercial users by request from the authors. The Web-based version may be accessed at http://hornbill.cspp.latrobe.edu.au/
Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining
2014-01-01
Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.
Jiang, W; Gupta, D; Gallagher, D; Davis, S; Bhavanandan, V P
2000-04-01
We previously elucidated five distinct protein domains (I-V) for bovine submaxillary mucin, which is encoded by two genes, BSM1 and BSM2. Using Southern blot analysis, genomic cloning and sequencing of the BSM1 gene, we now show that the central domain (V) consists of approximately 55 tandem repeats of 329 amino acids and that domains III-V are encoded by a 58.4-kb exon, the largest exon known for all genes to date. The BSM1 gene was mapped by fluorescence in situ hybridization to the proximal half of chromosome 5 at bands q2. 2-q2.3. The amino-acid sequence of six tandem repeats (two full and four partial) were found to have only 92-94% identities. We propose that the variability in the amino-acid sequences of the mucin tandem repeat is important for generating the combinatorial library of saccharides that are necessary for the protective function of mucins. The deduced peptide sequences of the central domain match those determined from the purified bovine submaxillary mucin and also show 68-94% identity to published peptide sequences of ovine submaxillary mucin. This indicates that the core protein of ovine submaxillary mucin is closely related to that of bovine submaxillary mucin and contains similar tandem repeats in the central domain. In contrast, the central domain of porcine submaxillary mucin is reported to consist of 81-amino-acid tandem repeats. However, both bovine submaxillary mucin and porcine submaxillary mucin contain similar N-terminal and C-terminal domains and the corresponding genes are in the conserved linkage regions of the respective genomes.
Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining
2014-01-01
Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551
In Vitro Engineering of Vascularized Tissue Surrogates
Sakaguchi, Katsuhisa; Shimizu, Tatsuya; Horaguchi, Shigeto; Sekine, Hidekazu; Yamato, Masayuki; Umezu, Mitsuo; Okano, Teruo
2013-01-01
In vitro scaling up of bioengineered tissues is known to be limited by diffusion issues, specifically a lack of vasculature. Here, we report a new strategy for preserving cell viability in three-dimensional tissues using cell sheet technology and a perfusion bioreactor having collagen-based microchannels. When triple-layer cardiac cell sheets are incubated within this bioreactor, endothelial cells in the cell sheets migrate to vascularize in the collagen gel, and finally connect with the microchannels. Medium readily flows into the cell sheets through the microchannels and the newly developed capillaries, while the cardiac construct shows simultaneous beating. When additional triple-layer cell sheets are repeatedly layered, new multi-layer construct spontaneously integrates and the resulting construct becomes a vascularized thick tissue. These results confirmed our method to fabricate in vitro vascularized tissue surrogates that overcomes engineered-tissue thickness limitations. The surrogates promise new therapies for damaged organs as well as new in vitro tissue models. PMID:23419835
Sheikh, Faruk G; Mukhopadhyay, Sudit S; Gupta, Prabhakar
2002-02-01
The PstI family of elements are short, highly repetitive DNA sequences interspersed throughout the genome of the Bovidae. We have cloned and sequenced some members of the PstI family from cattle, goat, and buffalo. These elements are approximately 500 bp, have a copy number of 2 x 10(5) - 4 x 10(5), and comprise about 4% of the haploid genome. Studies of nucleotide sequence homology indicate that the buffalo and goat PstI repeats (type II) are similar types of short interspersed nucleotide element (SINE) sequences, but the cattle PstI repeat (type I) is considerably more divergent. Additionally, the goat PstI sequence showed significant sequence homology with bovine serine tRNA, and is therefore likely derived from serine tRNA. Interestingly, Southern hybridization suggests that both types of SINEs (I and II) are present in all the species of Bovidae. Dendrogram analysis indicates that cattle PstI SINE is similar to bovine Alu-like SINEs. Goat and buffalo SINEs formed a separate cluster, suggesting that these two types of SINEs evolved separately in the genome of the Bovidae.
Repeat-aware modeling and correction of short read errors.
Yang, Xiao; Aluru, Srinivas; Dorman, Karin S
2011-02-15
High-throughput short read sequencing is revolutionizing genomics and systems biology research by enabling cost-effective deep coverage sequencing of genomes and transcriptomes. Error detection and correction are crucial to many short read sequencing applications including de novo genome sequencing, genome resequencing, and digital gene expression analysis. Short read error detection is typically carried out by counting the observed frequencies of kmers in reads and validating those with frequencies exceeding a threshold. In case of genomes with high repeat content, an erroneous kmer may be frequently observed if it has few nucleotide differences with valid kmers with multiple occurrences in the genome. Error detection and correction were mostly applied to genomes with low repeat content and this remains a challenging problem for genomes with high repeat content. We develop a statistical model and a computational method for error detection and correction in the presence of genomic repeats. We propose a method to infer genomic frequencies of kmers from their observed frequencies by analyzing the misread relationships among observed kmers. We also propose a method to estimate the threshold useful for validating kmers whose estimated genomic frequency exceeds the threshold. We demonstrate that superior error detection is achieved using these methods. Furthermore, we break away from the common assumption of uniformly distributed errors within a read, and provide a framework to model position-dependent error occurrence frequencies common to many short read platforms. Lastly, we achieve better error correction in genomes with high repeat content. The software is implemented in C++ and is freely available under GNU GPL3 license and Boost Software V1.0 license at "http://aluru-sun.ece.iastate.edu/doku.php?id = redeem". We introduce a statistical framework to model sequencing errors in next-generation reads, which led to promising results in detecting and correcting errors for genomes with high repeat content.
Mangericao, Tatiana C; Peng, Zhanhao; Zhang, Xuegong
2016-01-11
CRISPR has been becoming a hot topic as a powerful technique for genome editing for human and other higher organisms. The original CRISPR-Cas (Clustered Regularly Interspaced Short Palindromic Repeats coupled with CRISPR-associated proteins) is an important adaptive defence system for prokaryotes that provides resistance against invading elements such as viruses and plasmids. A CRISPR cassette contains short nucleotide sequences called spacers. These unique regions retain a history of the interactions between prokaryotes and their invaders in individual strains and ecosystems. One important ecosystem in the human body is the human gut, a rich habitat populated by a great diversity of microorganisms. Gut microbiomes are important for human physiology and health. Metagenome sequencing has been widely applied for studying the gut microbiomes. Most efforts in metagenome study has been focused on profiling taxa compositions and gene catalogues and identifying their associations with human health. Less attention has been paid to the analysis of the ecosystems of microbiomes themselves especially their CRISPR composition. We conducted a preliminary analysis of CRISPR sequences in a human gut metagenomic data set of Chinese individuals of type-2 diabetes patients and healthy controls. Applying an available CRISPR-identification algorithm, PILER-CR, we identified 3169 CRISPR cassettes in the data, from which we constructed a set of 1302 unique repeat sequences and 36,709 spacers. A more extensive analysis was made for the CRISPR repeats: these repeats were submitted to a more comprehensive clustering and classification using the web server tool CRISPRmap. All repeats were compared with known CRISPRs in the database CRISPRdb. A total of 784 repeats had matches in the database, and the remaining 518 repeats from our set are potentially novel ones. The computational analysis of CRISPR composition based contigs of metagenome sequencing data is feasible. It provides an efficient approach for finding potential novel CRISPR arrays and for analysing the ecosystem and history of human microbiomes.
Deutzmann, R; Fowler, S; Zhang, X; Boone, K; Dexter, S; Boot-Handford, R P; Rachel, R; Sarras, M P
2000-11-01
The body wall of hydra (a member of the phylum Cnidaria) is structurally reduced to an epithelial bilayer with an intervening extracellular matrix (ECM). Previous studies have established that cell-ECM interactions are important for morphogenesis and cell differentiation in this simple metazoan. The ECM of hydra is particularly interesting because it represents a primordial form of matrix. Despite progress in our understanding of hydra ECM, we still know little about the nature of hydra collagens. In the current study we provide a molecular, biochemical and functional analysis of a hydra fibrillar collagen that has similarity to vertebrate type I and type II collagens. This fibrillar collagen has been named hydra collagen-I (Hcol-I) because of its structure and because it is the first ECM collagen to be identified in hydra. It represents a novel member of the collagen family. Similar to vertebrate type I and II collagens, Hcol-I contains an N-terminal propeptide-like domain, a triple helical domain containing typical Gly-X-Y repeats and a C-terminal propeptide domain. The overall identity to vertebrate fibrillar collagens is about 30%, while the identity of the C-terminal propeptide domain is 50%. Because the N-terminal propeptide domain is retained after post-translational processing, Hcol-I does not form thick fibers as seen in vertebrates. This was confirmed using transmission electron microscopy to study rotary shadow images of purified Hcol-I. In addition, absence of crucial lysine residues and an overall reduction in proline content, results in reduced crosslinking of fibrils and increased flexibility of the molecule, respectively. These structural changes in Hcol-I help to explain the flexible properties of hydra ECM. Immunocytochemical studies indicate that Hcol-I forms the 10 nm fibrils that comprise the majority of molecules in the central fibrous zone of hydra ECM. The central fibrous zone resides between the two subepithelial zones where hydra laminin is localized. While previous studies have shown that basal lamina components like laminin are expressed by the endoderm, in situ hybridisation studies show that Hcol-I mRNA expression is restricted to the ectoderm. Hcol-I expression is upregulated during head regeneration, and antisense studies using thio-oligonucleotides demonstrated that blocking the translation of Hcol-I leads to a reversible inhibition of head morphogenesis during this regenerative process. Taken in total, the data presented in this study indicate that Hcol-I is required for morphogensis in hydra and represents a novel fibrillar collagen whose structural characteristics help to explain the unique biophysical properties of hydra ECM. Interestingly, the structure of Hcol-I mimics what is seen in Ehlers-Danlos syndrome type VII in humans; an inherited pathological condition that leads to joint and skin abnormalities. Hcol-I therefore illustrates an adaptive trait in which the normal physiological situation in hydra translates into a pathological condition in humans.
The three-dimensional structure of anosteocytic lamellated bone of fish.
Atkins, Ayelet; Reznikov, Natalie; Ofer, Lior; Masic, Admir; Weiner, Steve; Shahar, Ron
2015-02-01
Fish represent the most diverse and numerous of the vertebrate clades. In contrast to the bones of all tetrapods and evolutionarily primitive fish, many of the evolutionarily more advanced fish have bones that do not contain osteocytes. Here we use a variety of imaging techniques to show that anosteocytic fish bone is composed of a sequence of planar layers containing mainly aligned collagen fibrils, in which the prevailing principal orientation progressively spirals. When the sequence of fibril orientations completes a rotation of around 180°, a thin layer of poorly oriented fibrils is present between it and the next layer. The thick layer of aligned fibrils and the thin layer of non-aligned fibrils constitute a lamella. Although both basic components of mammalian lamellar bone are found here as well, the arrangement is unique, and we therefore call this structure lamellated bone. We further show that the lamellae of anosteocytic fish bone contain an array of dense, small-diameter (1-4 μm) bundles of hypomineralized collagen fibrils that are oriented mostly orthogonal to the lamellar plane. Results of mechanical tests conducted on beams from anosteocytic fish bone and human cortical bone show that the fish bones are less stiff but much tougher than the human bones. We propose that the unique lamellar structure and the orthogonal hypomineralized collagen bundles are responsible for the unusual mechanical properties and mineral distribution in anosteocytic fish bone. Copyright © 2014 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.
Processing of an anglerfish somatostatin precursor to a hydroxylysine-containing somatostatin 28.
Spiess, J; Noe, B D
1985-01-01
A novel 28-residue somatostatin (SS) has been isolated from anglerfish pancreatic islets and characterized by complete Edman degradation, peptide mapping, and amino acid analysis. The primary structure of this anglerfish SS-28 (aSS-28) containing hydroxylysine (Hyl) was established to be H-Ser-Val-Asp-Ser-Thr-Asn-Asn-Leu-Pro-Pro-Arg-Glu-Arg-Lys-Ala-Gly-Cys- Lys-Asn-Phe-Tyr-Trp-Hyl-Gly-Phe-Thr-Ser-Cys-OH. This sequence (with the exception of hydroxylysine-23, which is replaced by lysine) is identical to the sequence of the COOH-terminal 28 residues of prepro-SS II predicted on the basis of cDNA analysis [Hobart, P., Crawford, R., Shen, L., Pictet, R. & Rutter, W. J. (1980) Nature (London) 288, 137-141]. This is the first instance in which hydroxylysine (to date characteristically observed in collagen or collagen-like structures) has been found in a potential regulatory peptide. Chromatographic characterization of peptides, radiolabeled in islet culture, revealed that aSS-28 contained 10-12% of the radioactivity incorporated into the 8000- to 1000-dalton SS-like polypeptides, whereas 88-90% of this radioactivity was detected in anglerfish SS-14. It appears probable that aSS-28 represents the predominant primary cleavage product derived from prepro-SS II by cleavage at the COOH-terminal side of a single arginine. Based on knowledge of the collagen biosynthesis, it is speculated that hydroxylation may take place as an early post-translational event. Images PMID:2857489
Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis
Zheng, Jin-shuang; Sun, Cheng-zhen; Zhang, Shu-ning; Hou, Xi-lin; Bonnema, Guusje
2016-01-01
A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis. PMID:27507974
Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis.
Zheng, Jin-Shuang; Sun, Cheng-Zhen; Zhang, Shu-Ning; Hou, Xi-Lin; Bonnema, Guusje
2016-01-01
A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis.
Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada
2002-07-01
Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not pre-sent on any known plasmids but are conserved in Salmonella and other enterobacterial species.
Evolutional dynamics of 45S and 5S ribosomal DNA in ancient allohexaploid Atropa belladonna.
Volkov, Roman A; Panchuk, Irina I; Borisjuk, Nikolai V; Hosiawa-Baranska, Marta; Maluszynska, Jolanta; Hemleben, Vera
2017-01-23
Polyploid hybrids represent a rich natural resource to study molecular evolution of plant genes and genomes. Here, we applied a combination of karyological and molecular methods to investigate chromosomal structure, molecular organization and evolution of ribosomal DNA (rDNA) in nightshade, Atropa belladonna (fam. Solanaceae), one of the oldest known allohexaploids among flowering plants. Because of their abundance and specific molecular organization (evolutionarily conserved coding regions linked to variable intergenic spacers, IGS), 45S and 5S rDNA are widely used in plant taxonomic and evolutionary studies. Molecular cloning and nucleotide sequencing of A. belladonna 45S rDNA repeats revealed a general structure characteristic of other Solanaceae species, and a very high sequence similarity of two length variants, with the only difference in number of short IGS subrepeats. These results combined with the detection of three pairs of 45S rDNA loci on separate chromosomes, presumably inherited from both tetraploid and diploid ancestor species, example intensive sequence homogenization that led to substitution/elimination of rDNA repeats of one parent. Chromosome silver-staining revealed that only four out of six 45S rDNA sites are frequently transcriptionally active, demonstrating nucleolar dominance. For 5S rDNA, three size variants of repeats were detected, with the major class represented by repeats containing all functional IGS elements required for transcription, the intermediate size repeats containing partially deleted IGS sequences, and the short 5S repeats containing severe defects both in the IGS and coding sequences. While shorter variants demonstrate increased rate of based substitution, probably in their transition into pseudogenes, the functional 5S rDNA variants are nearly identical at the sequence level, pointing to their origin from a single parental species. Localization of the 5S rDNA genes on two chromosome pairs further supports uniparental inheritance from the tetraploid progenitor. The obtained molecular, cytogenetic and phylogenetic data demonstrate complex evolutionary dynamics of rDNA loci in allohexaploid species of Atropa belladonna. The high level of sequence unification revealed in 45S and 5S rDNA loci of this ancient hybrid species have been seemingly achieved by different molecular mechanisms.
Identification of presumed ancestral DNA sequences of phaseolin in Phaseolus vulgaris.
Kami, J; Velásquez, V B; Debouck, D G; Gepts, P
1995-01-01
Common bean (Phaseolus vulgaris) consists of two major geographic gene pools, one distributed in Mexico, Central America, and Colombia and the other in the southern Andes (southern Peru, Bolivia, and Argentina). Amplification and sequencing of members of the multigene family coding for phaseolin, the major seed storage protein of the common bean, provide evidence for accumulation of tandem direct repeats in both introns and exons during evolution of the multigene family in this species. The presumed ancestral phaseolin sequences, without tandem repeats, were found in recently discovered but nearly extinct wild common bean populations of Ecuador and northern Peru that are intermediate between the two major gene pools of the species based on geographical and molecular arguments. Our results illustrate the usefulness of tandem direct repeats in establishing the polarity of DNA sequence divergence and therefore in proposing phylogenies. Images Fig. 1 Fig. 3 PMID:7862642
Tek, Ahmet L; Kashihara, Kazunari; Murata, Minoru; Nagaki, Kiyotaka
2011-11-01
The centromere plays an essential role for proper chromosome segregation during cell division and usually harbors long arrays of tandem repeated satellite DNA sequences. Although this function is conserved among eukaryotes, the sequences of centromeric DNA repeats are variable. Most of our understanding of functional centromeres, which are defined by localization of a centromere-specific histone H3 (CENH3) protein, comes from model organisms. The components of the functional centromere in legumes are poorly known. The genus Astragalus is a member of the legumes and bears the largest numbers of species among angiosperms. Therefore, we studied the components of centromeres in Astragalus sinicus. We identified the CenH3 homolog of A. sinicus, AsCenH3 that is the most compact in size among higher eukaryotes. A CENH3-based assay revealed the functional centromeric DNA sequences from A. sinicus, called CentAs. The CentAs repeat is localized in A. sinicus centromeres, and comprises an AT-rich tandem repeat with a monomer size of 20 nucleotides.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ostrander, E.A.; Sprague, G.F. Jr.; Rine, J.
1993-04-01
A large block of simple sequence repeat (SSR) polymorphisms for the dog genome has been isolated and characterized. Screening of primary libraries by conventional hybridization methods as well as by screening of enriched marker-selected libraries led to the isolation of a large number of genomic clones that contained (CA)[sub n] repeats. The sequences of 101 clones showed that the size and complexity of (CA)[sub n] repeats in the dog genome were similar to those reported for these markers in the human genome. Detailed analysis of a representative subset of these markers revealed that most markers were moderately to highly polymorphic,more » with PIC values exceeding 0.70 for 33% of the markers tested. An association between higher PIC values and markers containing longer (CA)[sub n] repeats was observed in these studies, as previously noted for similar markers in the human genome. A list of primer sequences that tag each characterized marker is provided, and a comprehensive system of nomenclature for the dog genome is suggested. 28 refs., 4 figs., 2 tabs.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ghosh, Asish K; Wei, Jun; Wu, Minghua
2008-09-19
Transforming growth factor-{beta} (TGF-{beta}), a potent inducer of collagen synthesis, is implicated in pathological fibrosis. Peroxisome proliferator-activated receptor-{gamma} (PPAR-{gamma}) is a nuclear hormone receptor that regulates adipogenesis and numerous other biological processes. Here, we demonstrate that collagen gene expression was markedly elevated in mouse embryonic fibroblasts (MEFs) lacking PPAR-{gamma} compared to heterozygous control MEFs. Treatment with the PPAR-{gamma} ligand 15d-PGJ{sub 2} failed to down-regulate collagen gene expression in PPAR-{gamma} null MEFs, whereas reconstitution of these cells with ectopic PPAR-{gamma} resulted in their normalization. Compared to control MEFs, PPAR-{gamma} null MEFs displayed elevated levels of the Type I TGF-{beta} receptor (T{beta}RI),more » and secreted more TGF-{beta}1 into the media. Furthermore, PPAR-{gamma} null MEFs showed constitutive phosphorylation of cellular Smad2 and Smad3, even in the absence of exogenous TGF-{beta}, which was abrogated by the ALK5 inhibitor SB431542. Constitutive Smad2/3 phosphorylation in PPAR-{gamma} null MEFs was associated with Smad3 binding to its cognate DNA recognition sequences, and interaction with coactivator p300 previously implicated in TGF-{beta} responses. Taken together, these results indicate that loss of PPAR-{gamma} in MEFs is associated with upregulation of collagen synthesis, and activation of intracellular Smad signal transduction, due, at least in part, to autocrine TGF-{beta} stimulation.« less
van Gijlswijk, R P; Wiegant, J; Vervenne, R; Lasan, R; Tanke, H J; Raap, A K
1996-01-01
We present a sensitive and rapid fluorescence in situ hybridization (FISH) strategy for detecting chromosome-specific repeat sequences. It uses horseradish peroxidase (HRP)-labeled oligonucleotide sequences in combination with fluorescent tyramide-based detection. After in situ hybridization, the HRP conjugated to the oligonucleotide probe is used to deposit fluorescently labeled tyramide molecules at the site of hybridization. The method features full chemical synthesis of probes, strong FISH signals, and short processing periods, as well as multicolor capabilities.
USDA-ARS?s Scientific Manuscript database
The genetic relationships and pedigree inferences among peach (Prunus persica (L.) Batsch) accessions and breeding lines used in genetic improvement were evaluated using 15 simple sequence repeat (SSR) markers. A total of 80 alleles were detected among the 37 peach accessions with an average of 5.53...
We are attempting to identify specific root fragments from soil cores with individual trees. We successfully used Inter Simple Sequence Repeats (ISSR) to distinguish neighboring old-growth Douglas-fir trees from one another, while maintaining identity among each tree's parts. W...
Cross-species transferability and mapping of genomic and cDNA SSRs in pines
D. Chagne; P. Chaumeil; A. Ramboer; C. Collada; A. Guevara; M. T. Cervera; G. G. Vendramin; V. Garcia; J-M. Frigerio; Craig Echt; T. Richardson; Christophe Plomion
2004-01-01
Two unigene datasets of Pinus taeda and Pinus pinaster were screened to detect di-, tri and tetranucleotide repeated motifs using the SSRIT script. A total of 419 simple sequence repeats (SSRs) were identified, from which only 12.8% overlapped between the two sets. The position of the SSRs within the coding sequence were predicted...
USDA-ARS?s Scientific Manuscript database
Watermelon (Citrullus lanatus var. lanatus) is an important vegetable fruit throughout the world. A high number of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers should provide large coverage of the watermelon genome and high phylogenetic resolution of germplasm acces...
A Repeat Look at Repeating Patterns
ERIC Educational Resources Information Center
Markworth, Kimberly A.
2016-01-01
A "repeating pattern" is a cyclical repetition of an identifiable core. Children in the primary grades usually begin pattern work with fairly simple patterns, such as AB, ABC, or ABB patterns. The unique letters represent unique elements, whereas the sequence of letters represents the core that is repeated. Based on color, shape,…
Rapid and accurate synthesis of TALE genes from synthetic oligonucleotides.
Wang, Fenghua; Zhang, Hefei; Gao, Jingxia; Chen, Fengjiao; Chen, Sijie; Zhang, Cuizhen; Peng, Gang
2016-01-01
Custom synthesis of transcription activator-like effector (TALE) genes has relied upon plasmid libraries of pre-fabricated TALE-repeat monomers or oligomers. Here we describe a novel synthesis method that directly incorporates annealed synthetic oligonucleotides into the TALE-repeat units. Our approach utilizes iterative sets of oligonucleotides and a translational frame check strategy to ensure the high efficiency and accuracy of TALE-gene synthesis. TALE arrays of more than 20 repeats can be constructed, and the majority of the synthesized constructs have perfect sequences. In addition, this novel oligonucleotide-based method can readily accommodate design changes to the TALE repeats. We demonstrated an increased gene targeting efficiency against a genomic site containing a potentially methylated cytosine by incorporating non-conventional repeat variable di-residue (RVD) sequences.
[Polymorphic loci and polymorphism analysis of short tandem repeats within XNP gene].
Liu, Qi-Ji; Gong, Yao-Qin; Guo, Chen-Hong; Chen, Bing-Xi; Li, Jiang-Xia; Guo, Yi-Shou
2002-01-01
To select polymorphic short tandem repeat markers within X-linked nuclear protein (XNP) gene, genomic clones which contain XNP gene were recognized by homologous analysis with XNP cDNA. By comparing the cDNA with genomic DNA, non-exonic sequences were identified, and short tandem repeats were selected from non-exonic sequences by using BCM search Launcher. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five short tandem repeats were identified from XNP gene, two of which were polymorphic. Four and 11 alleles were observed in Chinese population for XNPSTR1 and XNPSTR4, respectively. Heterozygosities were 47% for XNPSTR1 and 70% for XNPSTR4. XNPSTR1 and XNPSTR4 localized within 3' end and intron 10, respectively. Two polymorphic short tandem repeats have been identified within XNP gene and will be useful for linkage analysis and gene diagnosis of XNP gene.
Just, Rebecca S; Irwin, Jodi A
2018-05-01
Some of the expected advantages of next generation sequencing (NGS) for short tandem repeat (STR) typing include enhanced mixture detection and genotype resolution via sequence variation among non-homologous alleles of the same length. However, at the same time that NGS methods for forensic DNA typing have advanced in recent years, many caseworking laboratories have implemented or are transitioning to probabilistic genotyping to assist the interpretation of complex autosomal STR typing results. Current probabilistic software programs are designed for length-based data, and were not intended to accommodate sequence strings as the product input. Yet to leverage the benefits of NGS for enhanced genotyping and mixture deconvolution, the sequence variation among same-length products must be utilized in some form. Here, we propose use of the longest uninterrupted stretch (LUS) in allele designations as a simple method to represent sequence variation within the STR repeat regions and facilitate - in the nearterm - probabilistic interpretation of NGS-based typing results. An examination of published population data indicated that a reference LUS region is straightforward to define for most autosomal STR loci, and that using repeat unit plus LUS length as the allele designator can represent greater than 80% of the alleles detected by sequencing. A proof of concept study performed using a freely available probabilistic software demonstrated that the LUS length can be used in allele designations when a program does not require alleles to be integers, and that utilizing sequence information improves interpretation of both single-source and mixed contributor STR typing results as compared to using repeat unit information alone. The LUS concept for allele designation maintains the repeat-based allele nomenclature that will permit backward compatibility to extant STR databases, and the LUS lengths themselves will be concordant regardless of the NGS assay or analysis tools employed. Further, these biologically based, easy-to-derive designations uphold clear relationships between parent alleles and their stutter products, enabling analysis in fully continuous probabilistic programs that model stutter while avoiding the algorithmic complexities that come with string based searches. Though using repeat unit plus LUS length as the allele designator does not capture variation that occurs outside of the core repeat regions, this straightforward approach would permit the large majority of known STR sequence variation to be used for mixture deconvolution and, in turn, result in more informative mixture statistics in the near term. Ultimately, the method could bridge the gap from current length-based probabilistic systems to facilitate broader adoption of NGS by forensic DNA testing laboratories. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Sun, Lidan; Yang, Weiru; Zhang, Qixiang; Cheng, Tangren; Pan, Huitang; Xu, Zongda; Zhang, Jie; Chen, Chuguang
2013-01-01
Because of its popularity as an ornamental plant in East Asia, mei (Prunus mume Sieb. et Zucc.) has received increasing attention in genetic and genomic research with the recent shotgun sequencing of its genome. Here, we performed the genome-wide characterization of simple sequence repeats (SSRs) in the mei genome and detected a total of 188,149 SSRs occurring at a frequency of 794 SSR/Mb. Mononucleotide repeats were the most common type of SSR in genomic regions, followed by di- and tetranucleotide repeats. Most of the SSRs in coding sequences (CDS) were composed of tri- or hexanucleotide repeat motifs, but mononucleotide repeats were always the most common in intergenic regions. Genome-wide comparison of SSR patterns among the mei, strawberry (Fragaria vesca), and apple (Malus×domestica) genomes showed mei to have the highest density of SSRs, slightly higher than that of strawberry (608 SSR/Mb) and almost twice as high as that of apple (398 SSR/Mb). Mononucleotide repeats were the dominant SSR motifs in the three Rosaceae species. Using 144 SSR markers, we constructed a 670 cM-long linkage map of mei delimited into eight linkage groups (LGs), with an average marker distance of 5 cM. Seventy one scaffolds covering about 27.9% of the assembled mei genome were anchored to the genetic map, depending on which the macro-colinearity between the mei genome and Prunus T×E reference map was identified. The framework map of mei constructed provides a first step into subsequent high-resolution genetic mapping and marker-assisted selection for this ornamental species. PMID:23555708
Sequence of retrovirus provirus resembles that of bacterial transposable elements
NASA Astrophysics Data System (ADS)
Shimotohno, Kunitada; Mizutani, Satoshi; Temin, Howard M.
1980-06-01
The nucleotide sequences of the terminal regions of an infectious integrated retrovirus cloned in the modified λ phage cloning vector Charon 4A have been elucidated. There is a 569-base pair direct repeat at both ends of the viral DNA. The cell-virus junctions at each end consist of a 5-base pair direct repeat of cell DNA next to a 3-base pair inverted repeat of viral DNA. This structure resembles that of a transposable element and is consistent with the protovirus hypothesis that retroviruses evolved from the cell genome.
Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji
2012-12-01
In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
Nallapareddy, Sreedhar R; Weinstock, George M; Murray, Barbara E
2003-03-01
A collagen-binding adhesin of Enterococcus faecium, Acm, was identified. Acm shows 62% similarity to the Staphylococcus aureus collagen adhesin Cna over the entire protein and is more similar to Cna (60% and 75% similarity with Cna A and B domains respectively) than to the Enterococcus faecalis collagen-binding adhesin, Ace, which shares homology with Acm only in the A domain. Despite the detection of acm in 32 out of 32 E. faecium isolates, only 11 of these (all clinical isolates, including four vancomycin-resistant endocarditis isolates and seven other isolates) exhibited binding to collagen type I (CI). Although acm from three CI-binding vancomycin-resistant E. faecium clinical isolates showed 100% identity, analysis of acm genes and their promoter regions from six non-CI-binding strains identified deletions or mutations that introduced stop codons and/or IS elements within the gene or the promoter region in five out of six strains, suggesting that the presence of an intact functional acm gene is necessary for binding of E. faecium strains to CI. Recombinant Acm A domain showed specific and concentration-dependent binding to collagen, and this protein competed with E. faecium binding to immobilized CI. Consistent with the adherence phenotype and sequence data, probing with Acm-specific IgGs purified from anti-recombinant Acm A polyclonal rabbit serum confirmed the surface expression of Acm in three out of three collagen-binding clinical isolates of E. faecium tested, but in none of the strains with a non-functional pseudo acm gene. Introduction of a functional acm gene into two non-CI-binding natural acm mutant strains conferred a CI-binding phenotype, further confirming that native Acm is sufficient for the binding of E. faecium to CI. These results demonstrate that acm, which encodes a potential virulence factor, is functional only in certain infection-derived clinical isolates of E. faecium, and suggest that Acm is the primary adhesin responsible for the ability of E. faecium to bind collagen.
Mendez-Bermudez, Aaron; Hills, Mark; Pickett, Hilda A.; Phan, Anh Tuân; Mergny, Jean-Louis; Riou, Jean-François; Royle, Nicola J.
2009-01-01
A number of different processes that impact on telomere length dynamics have been identified but factors that affect the turnover of repeats located proximally within the telomeric DNA are poorly defined. We have identified a particular repeat type (CTAGGG) that is associated with an extraordinarily high mutation rate (20% per gamete) in the male germline. The mutation rate is affected by the length and sequence homogeneity of the (CTAGGG)n array. This level of instability was not seen with other sequence-variant repeats, including the TCAGGG repeat type that has the same composition. Telomeres carrying a (CTAGGG)n array are also highly unstable in somatic cells with the mutation process resulting in small gains or losses of repeats that also occasionally result in the deletion of the whole (CTAGGG)n array. These sequences are prone to quadruplex formation in vitro but adopt a different topology from (TTAGGG)n (see accompanying article). Interestingly, short (CTAGGG)2 oligonucleotides induce a DNA damage response (γH2AX foci) as efficiently as (TTAGGG)2 oligos in normal fibroblast cells, suggesting they recruit POT1 from the telomere. Moreover, in vitro assays show that (CTAGGG)n repeats bind POT1 more efficiently than (TTAGGG)n or (TCAGGG)n. We estimate that 7% of human telomeres contain (CTAGGG)n repeats and when present, they create additional problems that probably arise during telomere replication. PMID:19656953
Richard, François D; Kajava, Andrey V
2014-06-01
The dramatic growth of sequencing data evokes an urgent need to improve bioinformatics tools for large-scale proteome analysis. Over the last two decades, the foremost efforts of computer scientists were devoted to proteins with aperiodic sequences having globular 3D structures. However, a large portion of proteins contain periodic sequences representing arrays of repeats that are directly adjacent to each other (so called tandem repeats or TRs). These proteins frequently fold into elongated fibrous structures carrying different fundamental functions. Algorithms specific to the analysis of these regions are urgently required since the conventional approaches developed for globular domains have had limited success when applied to the TR regions. The protein TRs are frequently not perfect, containing a number of mutations, and some of them cannot be easily identified. To detect such "hidden" repeats several algorithms have been developed. However, the most sensitive among them are time-consuming and, therefore, inappropriate for large scale proteome analysis. To speed up the TR detection we developed a rapid filter that is based on the comparison of composition and order of short strings in the adjacent sequence motifs. Tests show that our filter discards up to 22.5% of proteins which are known to be without TRs while keeping almost all (99.2%) TR-containing sequences. Thus, we are able to decrease the size of the initial sequence dataset enriching it with TR-containing proteins which allows a faster subsequent TR detection by other methods. The program is available upon request. Copyright © 2014 Elsevier Inc. All rights reserved.
Ohno, S
1984-01-01
Three outstanding properties uniquely qualify repeats of base oligomers as the primordial coding sequences of all polypeptide chains. First, when compared with randomly generated base sequences in general, they are more likely to have long open reading frames. Second, periodical polypeptide chains specified by such repeats are more likely to assume either alpha-helical or beta-sheet secondary structures than are polypeptide chains of random sequence. Third, provided that the number of bases in the oligomeric unit is not a multiple of 3, these internally repetitious coding sequences are impervious to randomly sustained base substitutions, deletions, and insertions. This is because the recurring periodicity of their polypeptide chains is given by three consecutive copies of the oligomeric unit translated in three different reading frames. Accordingly, when one reading frame is open, the other two are automatically open as well, all three being capable of coding for polypeptide chains of identical periodicity. Under this circumstance, a frame shift due to the deletion or insertion of a number of bases that is not a multiple of 3 fails to alter the down-stream amino acid sequence, and even a base change causing premature chain-termination can silence only one of the three potential coding units. Newly arisen coding sequences in modern organisms are oligomeric repeats, and most of the older genes retain various vestiges of their original internal repetitions. Some of the genes (e.g., oncogenes) have even inherited the property of being impervious to randomly sustained base changes.
van Eyk, Clare L; O'Keefe, Louise V; Lawlor, Kynan T; Samaraweera, Saumya E; McLeod, Catherine J; Price, Gareth R; Venter, Deon J; Richards, Robert I
2011-07-15
Recent evidence supports a role for RNA as a common pathogenic agent in both the 'polyglutamine' and 'untranslated' dominant expanded repeat disorders. One feature of all repeat sequences currently associated with disease is their predicted ability to form a hairpin secondary structure at the RNA level. In order to investigate mechanisms by which hairpin-forming repeat RNAs could induce neurodegeneration, we have looked for alterations in gene transcript levels as hallmarks of the cellular response to toxic hairpin repeat RNAs. Three disease-associated repeat sequences--CAG, CUG and AUUCU--were specifically expressed in the neurons of Drosophila and resultant common transcriptional changes assessed by microarray analyses. Transcripts that encode several components of the Akt/Gsk3-β signalling pathway were altered as a consequence of expression of these repeat RNAs, indicating that this pathway is a component of the neuronal response to these pathogenic RNAs and may represent an important common therapeutic target in this class of diseases.
2011-01-01
Background Many plants have large and complex genomes with an abundance of repeated sequences. Many plants are also polyploid. Both of these attributes typify the genome architecture in the tribe Triticeae, whose members include economically important wheat, rye and barley. Large genome sizes, an abundance of repeated sequences, and polyploidy present challenges to genome-wide SNP discovery using next-generation sequencing (NGS) of total genomic DNA by making alignment and clustering of short reads generated by the NGS platforms difficult, particularly in the absence of a reference genome sequence. Results An annotation-based, genome-wide SNP discovery pipeline is reported using NGS data for large and complex genomes without a reference genome sequence. Roche 454 shotgun reads with low genome coverage of one genotype are annotated in order to distinguish single-copy sequences and repeat junctions from repetitive sequences and sequences shared by paralogous genes. Multiple genome equivalents of shotgun reads of another genotype generated with SOLiD or Solexa are then mapped to the annotated Roche 454 reads to identify putative SNPs. A pipeline program package, AGSNP, was developed and used for genome-wide SNP discovery in Aegilops tauschii-the diploid source of the wheat D genome, and with a genome size of 4.02 Gb, of which 90% is repetitive sequences. Genomic DNA of Ae. tauschii accession AL8/78 was sequenced with the Roche 454 NGS platform. Genomic DNA and cDNA of Ae. tauschii accession AS75 was sequenced primarily with SOLiD, although some Solexa and Roche 454 genomic sequences were also generated. A total of 195,631 putative SNPs were discovered in gene sequences, 155,580 putative SNPs were discovered in uncharacterized single-copy regions, and another 145,907 putative SNPs were discovered in repeat junctions. These SNPs were dispersed across the entire Ae. tauschii genome. To assess the false positive SNP discovery rate, DNA containing putative SNPs was amplified by PCR from AL8/78 and AS75 and resequenced with the ABI 3730 xl. In a sample of 302 randomly selected putative SNPs, 84.0% in gene regions, 88.0% in repeat junctions, and 81.3% in uncharacterized regions were validated. Conclusion An annotation-based genome-wide SNP discovery pipeline for NGS platforms was developed. The pipeline is suitable for SNP discovery in genomic libraries of complex genomes and does not require a reference genome sequence. The pipeline is applicable to all current NGS platforms, provided that at least one such platform generates relatively long reads. The pipeline package, AGSNP, and the discovered 497,118 Ae. tauschii SNPs can be accessed at (http://avena.pw.usda.gov/wheatD/agsnp.shtml). PMID:21266061
REPPER—repeats and their periodicities in fibrous proteins
Gruber, Markus; Söding, Johannes; Lupas, Andrei N.
2005-01-01
REPPER (REPeats and their PERiodicities) is an integrated server that detects and analyzes regions with short gapless repeats in protein sequences or alignments. It finds periodicities by Fourier Transform (FTwin) and internal similarity analysis (REPwin). FTwin assigns numerical values to amino acids that reflect certain properties, for instance hydrophobicity, and gives information on corresponding periodicities. REPwin uses self-alignments and displays repeats that reveal significant internal similarities. Both programs use a sliding window to ensure that different periodic regions within the same protein are detected independently. FTwin and REPwin are complemented by secondary structure prediction (PSIPRED) and coiled coil prediction (COILS), making the server a versatile analysis tool for sequences of fibrous proteins. REPPER is available at . PMID:15980460
Ba, Hengxing; Wu, Lang; Liu, Zongyue; Li, Chunyi
2016-01-01
Tandem repeat units are only detected in the left domain of the mitochondrial DNA control region in sika deer. Previous studies showed that Japanese sika deer have more tandem repeat units than its cousins from the Asian continent and Taiwan, which often have only three repeat units. To determine the origin and evolution of these additional repeat units in Japanese sika deer, we obtained the sequence of repeat units from an expanded dataset of the control region from all sika deer lineages. The functional constraint is inferred to act on the first repeat unit because this repeat has the least sequence divergence in comparison to the other units. Based on slipped-strand mispairing mechanisms, the illegitimate elongation model could account for the addition or deletion of these additional repeat units in the Japanese sika deer population. We also report that these additional repeat units could be occurring in the internal positions of tandem repeat regions, possibly via coupling with a homogenization mechanism within and among these lineages. Moreover, the increased number of repeat units in the Japanese sika deer population could reflect a balance between mutation and selection, as well as genetic drift.
Myotonin protein-kinase [AGC]n trinucleotide repeat in seven nonhuman primates
DOE Office of Scientific and Technical Information (OSTI.GOV)
Novelli, G.; Sineo, L.; Pontieri, E.
Myotonic dystrophy (DM) is due to a genomic instability of a trinucleotide [AGC]n motif, located at the 3{prime} UTR region of a protein-kinase gene (myotonin protein kinase, MT-PK). The [AGC] repeat is meiotically and mitotically unstable, and it is directly related to the manifestations of the disorder. Although a gene dosage effect of the MT-PK has been demonstrated n DM muscle, the mechanism(s) by which the intragenic repeat expansion leads to disease is largely unknown. This non-standard mutational event could reflect an evolutionary mechanism widespread among animal genomes. We have isolated and sequenced the complete 3{prime}UTR region of the MT-PKmore » gene in seven primates (macaque, orangutan, gorilla, chimpanzee, gibbon, owl monkey, saimiri), and examined by comparative sequence nucleotide analysis the [AGC]n intragenic repeat and the surrounding nucleotides. The genomic organization, including the [AGC]n repeat structure, was conserved in all examined species, excluding the gibbon (Hylobates agilis), in which the [AGC]n upstream sequence (GGAA) is replaced by a GA dinucleotide. The number of [AGC]n in the examined species ranged between 7 (gorilla) and 13 repeats (owl monkeys), with a polymorphism informative content (PIC) similar to that observed in humans. These results indicate that the 3{prime}UTR [AGC] repeat within the MT-PK gene is evolutionarily conserved, supporting that this region has important regulatory functions.« less
Yi, Xuan; Gao, Lei; Wang, Bo; Su, Ying-Juan; Wang, Ting
2013-01-01
We have determined the complete chloroplast (cp) genome sequence of Cephalotaxus oliveri. The genome is 134,337 bp in length, encodes 113 genes, and lacks inverted repeat (IR) regions. Genome-wide mutational dynamics have been investigated through comparative analysis of the cp genomes of C. oliveri and C. wilsoniana. Gene order transformation analyses indicate that when distinct isomers are considered as alternative structures for the ancestral cp genome of cupressophyte and Pinaceae lineages, it is not possible to distinguish between hypotheses favoring retention of the same IR region in cupressophyte and Pinaceae cp genomes from a hypothesis proposing independent loss of IRA and IRB. Furthermore, in cupressophyte cp genomes, the highly reduced IRs are replaced by short repeats that have the potential to mediate homologous recombination, analogous to the situation in Pinaceae. The importance of repeats in the mutational dynamics of cupressophyte cp genomes is also illustrated by the accD reading frame, which has undergone extreme length expansion in cupressophytes. This has been caused by a large insertion comprising multiple repeat sequences. Overall, we find that the distribution of repeats, indels, and substitutions is significantly correlated in Cephalotaxus cp genomes, consistent with a hypothesis that repeats play a role in inducing substitutions and indels in conifer cp genomes.
Jobke, Bjoern; Bolbos, Radu; Saadat, Ehsan; Cheng, Jonathan; Li, Xiaojuan; Majumdar, Sharmila
2013-01-01
The application of biomolecular magnetic resonance imaging becomes increasingly important in the context of early cartilage changes in degenerative and inflammatory joint disease before gross morphological changes become apparent. In this limited technical report, we investigate the correlation of MRI T1, T2 and T1ρ relaxation times with quantitative biochemical measurements of proteoglycan and collagen contents of cartilage in close synopsis with histologic morphology. A recently developed MRI sequence, T1ρ, was able to detect early intracartilaginous degeneration quantitatively and also qualitatively by color mapping demonstrating a higher sensitivity than standard T2-weighted sequences. The results correlated highly with reduced proteoglycan content and disrupted collagen architecture as measured by biochemistry and histology. The findings lend support to a clinical implementation that allows rapid visual capturing of pathology on a local, millimeter level. Further information about articular cartilage quality otherwise not detectable in vivo, via normal inspection, is needed for orthopedic treatment decisions in the present and future. Copyright © 2013 Elsevier Inc. All rights reserved.
Barzideh, Zoha; Latiff, Aishah Abd; Gan, Chee-Yuen; Abedin, Md Zainul; Alias, Abd Karim
2014-12-01
Collagen isolated from the ribbon jellyfish ( Chrysaora sp.) was hydrolysed using three different proteases ( i.e. trypsin, alcalase and Protamex) to obtain bioactive peptides. Angiotensin-I-converting enzyme (ACE) inhibitory activity and antioxidant activities ( i.e. ferric reducing antioxidant power (FRAP) and 2,2-diphenyl-1-picrylhydrazyl (DPPH) radical scavenging activity) of the peptides were measured and compared, and the effect of the duration of hydrolysis on the bioactivity (ACE inhibitory and antioxidant activities) of peptides was also evaluated. FRAP activity was the highest in Protamex-induced (25-27 mM) and trypsin-induced hydrolysates (24-26 mM) at 7 and 9 h, respectively. Conversely, hydrolysates produced by trypsin for 1 and 3 h showed the highest DPPH radical scavenging activities (94 and 92%, respectively). Trypsin-induced hydrolysates (at 3 h) also showed the highest ACE inhibitory activity (89%). The peptide sequences with the highest activities were identified using tandem mass spectrometry, and the results show that the hydrolysates had a high content of hydrophobic amino acids as well as unique amino acid sequences, which likely contribute to their biological activities.
Identification, variation and transcription of pneumococcal repeat sequences
2011-01-01
Background Small interspersed repeats are commonly found in many bacterial chromosomes. Two families of repeats (BOX and RUP) have previously been identified in the genome of Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen of humans. However, little is known about the role they play in pneumococcal genetics. Results Analysis of the genome of S. pneumoniae ATCC 700669 revealed the presence of a third repeat family, which we have named SPRITE. All three repeats are present at a reduced density in the genome of the closely related species S. mitis. However, they are almost entirely absent from all other streptococci, although a set of elements related to the pneumococcal BOX repeat was identified in the zoonotic pathogen S. suis. In conjunction with information regarding their distribution within the pneumococcal chromosome, this suggests that it is unlikely that these repeats are specialised sequences performing a particular role for the host, but rather that they constitute parasitic elements. However, comparing insertion sites between pneumococcal sequences indicates that they appear to transpose at a much lower rate than IS elements. Some large BOX elements in S. pneumoniae were found to encode open reading frames on both strands of the genome, whilst another was found to form a composite RNA structure with two T box riboswitches. In multiple cases, such BOX elements were demonstrated as being expressed using directional RNA-seq and RT-PCR. Conclusions BOX, RUP and SPRITE repeats appear to have proliferated extensively throughout the pneumococcal chromosome during the species' past, but novel insertions are currently occurring at a relatively slow rate. Through their extensive secondary structures, they seem likely to affect the expression of genes with which they are co-transcribed. Software for annotation of these repeats is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/strep_repeats/. PMID:21333003
Evidence of birth-and-death evolution of 5S rRNA gene in Channa species (Teleostei, Perciformes).
Barman, Anindya Sundar; Singh, Mamta; Singh, Rajeev Kumar; Lal, Kuldeep Kumar
2016-12-01
In higher eukaryotes, minor rDNA family codes for 5S rRNA that is arranged in tandem arrays and comprises of a highly conserved 120 bp long coding sequence with a variable non-transcribed spacer (NTS). Initially the 5S rDNA repeats are considered to be evolved by the process of concerted evolution. But some recent reports, including teleost fishes suggested that evolution of 5S rDNA repeat does not fit into the concerted evolution model and evolution of 5S rDNA family may be explained by a birth-and-death evolution model. In order to study the mode of evolution of 5S rDNA repeats in Perciformes fish species, nucleotide sequence and molecular organization of five species of genus Channa were analyzed in the present study. Molecular analyses revealed several variants of 5S rDNA repeats (four types of NTS) and networks created by a neighbor net algorithm for each type of sequences (I, II, III and IV) did not show a clear clustering in species specific manner. The stable secondary structure is predicted and upstream and downstream conserved regulatory elements were characterized. Sequence analyses also shown the presence of two putative pseudogenes in Channa marulius. Present study supported that 5S rDNA repeats in genus Channa were evolved under the process of birth-and-death.
Teng, Ye; Pramanik, Smritimoy; Tateishi-Karimata, Hisae; Ohyama, Tatsuya; Sugimoto, Naoki
2018-02-05
The trinucleotide repeat d(CXG) (X = A, C, G or T) is the most common sequence causing repeat expansion disorders. The formation of non-canonical structures, such as hairpin structures with X-X mismatches, has been proposed to affect gene expression and regulation, which are important in pathological studies of these devastating neurological diseases. However, little information is available regarding the thermodynamics of the repeat sequence under crowded cellular conditions where many non-canonical structures such as G-quadruplexes are highly stabilized, while duplexes are destabilised. In this study, we investigated the different stabilities of X-X mismatches in the context of internal d(CXG) self-complementary sequences in an environment with a high concentration of cosolutes to mimic the crowding conditions in cells. The stabilities of full-matched duplexes and duplexes with A-A, G-G, and T-T mismatched base pairs under molecular crowding conditions were notably decreased compared to under dilute conditions. However, the stability of the DNA duplex with a C-C mismatch base pair was only slightly destabilised. Investigating different stabilities of X-X mismatches in d(CXG) sequences is important for improving our understanding of the formation and transition of multiple non-canonical structures in trinucleotide repeat diseases, and may provide insights for pathological studies and drug development. Copyright © 2018 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Campbell, Una C.; Winsauer, Peter J.; Stevenson, Michael W.; Moerschbaecher, Joseph M.
2004-01-01
The present study investigated the effects of positive and negative GABA[subscript A] modulators under three different baselines of repeated acquisition in squirrel monkeys in which the monkeys acquired a three-response sequence on three keys under a second-order fixed-ratio (FR) schedule of food reinforcement. In two of these baselines, the…
Short intronic repeat sequences facilitate circular RNA production.
Liang, Dongming; Wilusz, Jeremy E
2014-10-15
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
Typing of artiodactyl MHC-DRB genes with the help of intronic simple repeated DNA sequences.
Schwaiger, F W; Buitkamp, J; Weyers, E; Epplen, J T
1993-02-01
An efficient oligonucleotide typing method for the highly polymorphic MHC-DRB genes is described for artiodactyls like cattle, sheep and goat. By means of the polymerase chain reaction, the second exon of MHC-DRB is amplified as well as part of the adjacent intron containing a mixed simple repeat sequence. Using this primer combination we were able to amplify the MHC-DRB exons 2 and adjacent introns from all of the investigated 10 species of the family of Bovidae and giraffes. Therefore, the DRB genes of novel artiodactyl species can also be readily studied. Oligonucleotide probes specific for the polymorphisms of ungulate DRB genes are used with which sequences differing in at least one single base can be distinguished. Exonic polymorphism was found to be correlated with the allele lengths and the patterns of the repeat structures. Hence oligonucleotide probes specific for different simple repeats and polymorphic positions serve also for typing across species barriers. The strict correlation of sequence length and exonic polymorphism permits a preselection of specific oligonucleotides for hybridization. Thus more than 20 alleles can already be differentiated from each of the three species.
Cell density signal protein suitable for treatment of connective tissue injuries and defects
Schwarz, Richard I.
2002-08-13
Identification, isolation and partial sequencing of a cell density protein produced by fibroblastic cells. The cell density signal protein comprising a 14 amino acid peptide or a fragment, variant, mutant or analog thereof, the deduced cDNA sequence from the 14 amino acid peptide, a recombinant protein, protein and peptide-specific antibodies, and the use of the peptide and peptide-specific antibodies as therapeutic agents for regulation of cell differentiation and proliferation. A method for treatment and repair of connective tissue and tendon injuries, collagen deficiency, and connective tissue defects.
Sekiguchi, Hiroyuki; Uchida, Kentaro; Matsushita, Osamu; Inoue, Gen; Nishi, Nozomu; Masuda, Ryo; Hamamoto, Nana; Koide, Takaki; Shoji, Shintaro; Takaso, Masashi
2018-01-01
Basic fibroblast growth factor 2 (bFGF) accelerates bone formation during fracture healing. Because the efficacy of bFGF decreases rapidly following its diffusion from fracture sites, however, repeated dosing is required to ensure a sustained therapeutic effect. We previously developed a fusion protein comprising bFGF, a polycystic kidney disease domain (PKD; s2b), and collagen-binding domain (CBD; s3) sourced from the Clostridium histolyticum class II collagenase, ColH, and reported that the combination of this fusion protein with a collagen-like peptide, poly(Pro-Hyp-Gly) 10 , induced mesenchymal cell proliferation and callus formation at fracture sites. In addition, C. histolyticum produces class I collagenase (ColG) with tandem CBDs (s3a and s3b) at the C-terminus. We therefore hypothesized that a bFGF fusion protein containing ColG-derived tandem CBDs (s3a and s3b) would show enhanced collagen-binding activity, leading to improved bone formation. Here, we examined the binding affinity of four collagen anchors derived from the two clostridial collagenases to H-Gly-Pro-Arg-Gly-(Pro-Hyp-Gly) 12 -NH 2 , a collagenous peptide, by surface plasmon resonance and found that tandem CBDs (s3a-s3b) have the highest affinity for the collagenous peptide. We also constructed four fusion proteins consisting of bFGF and s3 (bFGF-s3), s2b-s3b (bFGF-s2b-s3), s3b (bFGF-s3b), and s3a-s3b (bFGF-s3a-s3b) and compared their biological activities to those of a previous fusion construct (bFGF-s2b-s3) using a cell proliferation assay in vitro and a mouse femoral fracture model in vivo. Among these CB-bFGFs, bFGF-s3a-s3b showed the highest capacity to induce mesenchymal cell proliferation and callus formation in the mice fracture model. The poly(Pro-Hyp-Gly) 10 /bFGF-s3a-s3b construct may therefore have the potential to promote bone formation in clinical settings.
2014-01-01
Background The Drosophila heart (dorsal vessel) is a relatively simple tubular organ that serves as a model for several aspects of cardiogenesis. Cardiac morphogenesis, proper heart function and stability require structural components whose identity and ways of assembly are only partially understood. Structural components are also needed to connect the myocardial tube with neighboring cells such as pericardial cells and specialized muscle fibers, the so-called alary muscles. Results Using an EMS mutagenesis screen for cardiac and muscular abnormalities in Drosophila embryos we obtained multiple mutants for two genetically interacting complementation groups that showed similar alary muscle and pericardial cell detachment phenotypes. The molecular lesions underlying these defects were identified as domain-specific point mutations in LamininB1 and Cg25C, encoding the extracellular matrix (ECM) components laminin β and collagen IV α1, respectively. Of particular interest within the LamininB1 group are certain hypomorphic mutants that feature prominent defects in cardiac morphogenesis and cardiac ECM layer formation, but in contrast to amorphic mutants, only mild defects in other tissues. All of these alleles carry clustered missense mutations in the laminin LN domain. The identified Cg25C mutants display weaker and largely temperature-sensitive phenotypes that result from glycine substitutions in different Gly-X-Y repeats of the triple helix-forming domain. While initial basement membrane assembly is not abolished in Cg25C mutants, incorporation of perlecan is impaired and intracellular accumulation of perlecan as well as the collagen IV α2 chain is detected during late embryogenesis. Conclusions Assembly of the cardiac ECM depends primarily on laminin, whereas collagen IV is needed for stabilization. Our data underscore the importance of a correctly assembled ECM particularly for the development of cardiac tissues and their lateral connections. The mutational analysis suggests that the β6/β3/β8 interface of the laminin β LN domain is highly critical for formation of contiguous cardiac ECM layers. Certain mutations in the collagen IV triple helix-forming domain may exert a semi-dominant effect leading to an overall weakening of ECM structures as well as intracellular accumulation of collagen and other molecules, thus paralleling observations made in other organisms and in connection with collagen-related diseases. PMID:24935095
High Quality Maize Centromere 10 Sequence Reveals Evidence of Frequent Recombination Events
Wolfgruber, Thomas K.; Nakashima, Megan M.; Schneider, Kevin L.; Sharma, Anupma; Xie, Zidian; Albert, Patrice S.; Xu, Ronghui; Bilinski, Paul; Dawe, R. Kelly; Ross-Ibarra, Jeffrey; Birchler, James A.; Presting, Gernot G.
2016-01-01
The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR) has presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here, we report a nearly complete assembly of the 1.85 Mb maize centromere 10 from inbred B73 using PacBio technology and BACs from the reference genome project. The error rates estimated from overlapping BAC sequences are 7 × 10−6 and 5 × 10−5 for mismatches and indels, respectively. The number of gaps in the region covered by the reassembly was reduced from 140 in the reference genome to three. Three expressed genes are located between 92 and 477 kb from the inferred ancestral CentC cluster, which lies within the region of highest centromeric repeat density. The improved assembly increased the count of full-length CR from 5 to 55 and revealed a 22.7 kb segmental duplication that occurred approximately 121,000 years ago. Our analysis provides evidence of frequent recombination events in the form of partial retrotransposons, deletions within retrotransposons, chimeric retrotransposons, segmental duplications including higher order CentC repeats, a deleted CentC monomer, centromere-proximal inversions, and insertion of mitochondrial sequences. Double-strand DNA break (DSB) repair is the most plausible mechanism for these events and may be the major driver of centromere repeat evolution and diversity. In many cases examined here, DSB repair appears to be mediated by microhomology, suggesting that tandem repeats may have evolved to efficiently repair frequent DSBs in centromeres. PMID:27047500
Li, Lixin; Piatek, Marek J; Atef, Ahmed; Piatek, Agnieszka; Wibowo, Anjar; Fang, Xiaoyun; Sabir, J S M; Zhu, Jian-Kang; Mahfouz, Magdy M
2012-03-01
Transcription activator-like effectors (TALEs) can be used as DNA-targeting modules by engineering their repeat domains to dictate user-selected sequence specificity. TALEs have been shown to function as site-specific transcriptional activators in a variety of cell types and organisms. TALE nucleases (TALENs), generated by fusing the FokI cleavage domain to TALE, have been used to create genomic double-strand breaks. The identity of the TALE repeat variable di-residues, their number, and their order dictate the DNA sequence specificity. Because TALE repeats are nearly identical, their assembly by cloning or even by synthesis is challenging and time consuming. Here, we report the development and use of a rapid and straightforward approach for the construction of designer TALE (dTALE) activators and nucleases with user-selected DNA target specificity. Using our plasmid set of 100 repeat modules, researchers can assemble repeat domains for any 14-nucleotide target sequence in one sequential restriction-ligation cloning step and in only 24 h. We generated several custom dTALEs and dTALENs with new target sequence specificities and validated their function by transient expression in tobacco leaves and in vitro DNA cleavage assays, respectively. Moreover, we developed a web tool, called idTALE, to facilitate the design of dTALENs and the identification of their genomic targets and potential off-targets in the genomes of several model species. Our dTALE repeat assembly approach along with the web tool idTALE will expedite genome-engineering applications in a variety of cell types and organisms including plants.
Inter-plate aseismic slip on the subducting plate boundaries estimated from repeating earthquakes
NASA Astrophysics Data System (ADS)
Igarashi, T.
2015-12-01
Sequences of repeating earthquakes are caused by repeating slips of small patches surrounded by aseismic slip areas at plate boundary zones. Recently, they have been detected in many regions. In this study, I detected repeating earthquakes which occurred in Japan and the world by using seismograms observed in the Japanese seismic network, and investigated the space-time characteristics of inter-plate aseismic slip on the subducting plate boundaries. To extract repeating earthquakes, I calculate cross-correlation coefficients of band-pass filtering seismograms at each station following Igarashi [2010]. I used two data-set based on USGS catalog for about 25 years from May 1990 and JMA catalog for about 13 years from January 2002. As a result, I found many sequences of repeating earthquakes in the subducting plate boundaries of the Andaman-Sumatra-Java and Japan-Kuril-Kamchatka-Aleutian subduction zones. By applying the scaling relations among a seismic moment, recurrence interval and slip proposed by Nadeau and Johnson [1998], they indicate the space-time changes of inter-plate aseismic slips. Pairs of repeating earthquakes with the longest time interval occurred in the Solomon Islands area and the recurrence interval was about 18.5 years. The estimated slip-rate is about 46 mm/year, which correspond to about half of the relative plate motion in this area. Several sequences with fast slip-rates correspond to the post-seismic slips after the 2004 Sumatra-Andaman earthquake (M9.0), the 2006 Kuril earthquake (M8.3), the 2007 southern Sumatra earthquake (M8.5), and the 2011 Tohoku-oki earthquake (M9.0). The database of global repeating earthquakes enables the comparison of the inter-plate aseismic slips of various plate boundary zones of the world. I believe that I am likely to detect more sequences by extending analysis periods in the area where they were not found in this analysis.
Chien, Maw-Sheng; Gilbert , Teresa L.; Huang, Chienjin; Landolt, Marsha L.; O'Hara, Patrick J.; Winton, James R.
1992-01-01
The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an opening reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated Mr value of 57190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27–61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein in synthesized as a 557-amino acid precursor and processed to produce a mature protein of Mr 54505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.
Franco, Bernardo; González-Cerón, Gabriela; Servín-González, Luis
2003-11-01
The functionality of direct and inverted repeat sequences inside the cis acting locus of transfer (clt) of the Streptomyces plasmid pJV1 was determined by testing the effect of different deletions on plasmid transfer. The results show that the single most important element for pJV1 clt function is a series of evenly spaced 9 bp long direct repeats which match the consensus CCGCACA(C/G)(C/G), since their deletion caused a dramatic reduction in plasmid transfer. The presence of these repeats in the absence of any other clt sequences allowed plasmid transfer to occur at a frequency that was at least two orders of magnitude higher than that obtained in the complete absence of clt. A database search revealed regions with a similar organization, and in the same position, in Streptomyces plasmids pSN22 and pSLS, which have transfer proteins homologous to those of pJV1.
Ali, A F; Taha, M M Reda; Thornton, G M; Shrive, N G; Frank, C B
2005-06-01
In normal daily activities, ligaments are subjected to repeated loads, and respond to this environment with creep and fatigue. While progressive recruitment of the collagen fibers is responsible for the toe region of the ligament stress-strain curve, recruitment also represents an elegant feature to help ligaments resist creep. The use of artificial intelligence techniques in computational modeling allows a large number of parameters and their interactions to be incorporated beyond the capacity of classical mathematical models. The objective of the work described here is to demonstrate a tool for modeling creep of the rabbit medial collateral ligament that can incorporate the different parameters while quantifying the effect of collagen fiber recruitment during creep. An intelligent algorithm was developed to predict ligament creep. The modeling is performed in two steps: first, the ill-defined fiber recruitment is quantified using the fuzzy logic. Second, this fiber recruitment is incorporated along with creep stress and creep time to model creep using an adaptive neurofuzzy inference system. The model was trained and tested using an experimental database including creep tests and crimp image analysis. The model confirms that quantification of fiber recruitment is important for accurate prediction of ligament creep behavior at physiological loads.
Becker, Jutta; Semler, Oliver; Gilissen, Christian; Li, Yun; Bolz, Hanno Jörn; Giunta, Cecilia; Bergmann, Carsten; Rohrbach, Marianne; Koerber, Friederike; Zimmermann, Katharina; de Vries, Petra; Wirth, Brunhilde; Schoenau, Eckhard; Wollnik, Bernd; Veltman, Joris A.; Hoischen, Alexander; Netzer, Christian
2011-01-01
Osteogenesis imperfecta (OI) is a heterogeneous genetic disorder characterized by bone fragility and susceptibility to fractures after minimal trauma. After mutations in all known OI genes had been excluded by Sanger sequencing, we applied next-generation sequencing to analyze the exome of a single individual who has a severe form of the disease and whose parents are second cousins. A total of 26,922 variations from the human reference genome sequence were subjected to several filtering steps. In addition, we extracted the genotypes of all dbSNP130-annotated SNPs from the exome sequencing data and used these 299,494 genotypes as markers for the genome-wide identification of homozygous regions. A single homozygous truncating mutation, affecting SERPINF1 on chromosome 17p13.3, that was embedded into a homozygous stretch of 2.99 Mb remained. The mutation was also homozygous in the affected brother of the index patient. Subsequently, we identified homozygosity for two different truncating SERPINF1 mutations in two unrelated patients with OI and parental consanguinity. All four individuals with SERPINF1 mutations have severe OI. Fractures of long bones and severe vertebral compression fractures with resulting deformities were observed as early as the first year of life in these individuals. Collagen analyses with cultured dermal fibroblasts displayed no evidence for impaired collagen folding, posttranslational modification, or secretion. SERPINF1 encodes pigment epithelium-derived factor (PEDF), a secreted glycoprotein of the serpin superfamily. PEDF is a multifunctional protein and one of the strongest inhibitors of angiogenesis currently known in humans. Our data provide genetic evidence for PEDF involvement in human bone homeostasis. PMID:21353196
NASA Astrophysics Data System (ADS)
Zhao, Cui; Zhang, Xiaojun; Liu, Chengzhang; Huan, Pin; Li, Fuhua; Xiang, Jianhai; Huang, Chao
2012-05-01
Little is known about the genome of Pacific white shrimp ( Litopenaeus vannamei). To address this, we conducted BAC (bacterial artificial chromosome) end sequencing of L. vannamei. We selected and sequenced 7 812 BAC clones from the BAC library LvHE from the two ends of the inserts by Sanger sequencing. After trimming and quality filtering, 11 279 BAC end sequences (BESs) including 4 609 pairedends BESs were obtained. The total length of the BESs was 4 340 753 bp, representing 0.18% of the L. vannamei haploid genome. The lengths of the BESs ranged from 100 bp to 660 bp with an average length of 385 bp. Analysis of the BESs indicated that the L. vannamei genome is AT-rich and that the primary repeats patterns were simple sequence repeats (SSRs) and low complexity sequences. Dinucleotide and hexanucleotide repeats were the most common SSR types in the BESs. The most abundant transposable element was gypsy, which may contribute to the generation of the large genome size of L. vannamei. We successfully annotated 4 519 BESs by BLAST searching, including genes involved in immunity and sex determination. Our results provide an important resource for functional gene studies, map construction and integration, and complete genome assembly for this species.
Pearston, Douglas H.; Gordon, Mairi; Hardman, Norman
1985-01-01
A family of long, highly-repetitive sequences, referred to previously as `HpaII-repeats', dominates the genome of the eukaryotic slime mould Physarum polycephalum. These sequences are found exclusively in scrambled clusters. They account for about one-half of the total complement of repetitive DNA in Physarum, and represent the major sequence component found in hypermethylated, 20-50 kb segments of Physarum genomic DNA that fail to be cleaved using the restriction endonuclease HpaII. The structure of this abundant repetitive element was investigated by analysing cloned segments derived from the hypermethylated genomic DNA compartment. We show that the `HpaII-repeat' forms part of a larger repetitive DNA structure, ∼8.6 kb in length, with several structural features in common with recognised eukaryotic transposable genetic elements. Scrambled clusters of the sequence probably arise as a result of transposition-like events, during which the element preferentially recombines in either orientation with target sites located in other copies of the same repeated sequence. The target sites for transposition/recombination are not related in sequence but in all cases studied they are potentially capable of promoting the formation of small `cruciforms' or `Z-DNA' structures which might be recognised during the recombination process. ImagesFig. 3.Fig. 4. PMID:16453652
Zheng, Yang; Cai, Jing; Li, JianWen; Li, Bo; Lin, Runmao; Tian, Feng; Wang, XiaoLing; Wang, Jun
2010-01-01
A 10-fold BAC library for giant panda was constructed and nine BACs were selected to generate finish sequences. These BACs could be used as a validation resource for the de novo assembly accuracy of the whole genome shotgun sequencing reads of giant panda newly generated by the Illumina GA sequencing technology. Complete sanger sequencing, assembly, annotation and comparative analysis were carried out on the selected BACs of a joint length 878 kb. Homologue search and de novo prediction methods were used to annotate genes and repeats. Twelve protein coding genes were predicted, seven of which could be functionally annotated. The seven genes have an average gene size of about 41 kb, an average coding size of about 1.2 kb and an average exon number of 6 per gene. Besides, seven tRNA genes were found. About 27 percent of the BAC sequence is composed of repeats. A phylogenetic tree was constructed using neighbor-join algorithm across five species, including giant panda, human, dog, cat and mouse, which reconfirms dog as the most related species to giant panda. Our results provide detailed sequence and structure information for new genes and repeats of giant panda, which will be helpful for further studies on the giant panda.
Alrifai, Mohammed; Marsh, Leigh M; Dicke, Tanja; Kılıç, Ayse; Conrad, Melanie L; Renz, Harald; Garn, Holger
2014-01-01
Allergic asthma is associated with chronic airway inflammation and progressive airway remodelling. However, the dynamics of the development of these features and their spontaneous and pharmacological reversibility are still poorly understood. We have therefore investigated the dynamics of airway remodelling and repair in an experimental asthma model and studied how pharmacological intervention affects these processes. Using BALB/c mice, the kinetics of chronic asthma progression and resolution were characterised in absence and presence of inhaled corticosteroid (ICS) treatment. Airway inflammation and remodelling was assessed by the analysis of bronchoalveolar and peribronichal inflammatory cell infiltrate, goblet cell hyperplasia, collagen deposition and smooth muscle thickening. Chronic allergen exposure resulted in early (goblet cell hyperplasia) and late remodelling (collagen deposition and smooth muscle thickening). After four weeks of allergen cessation eosinophilic inflammation, goblet cell hyperplasia and collagen deposition were resolved, full resolution of lymphocyte inflammation and smooth muscle thickening was only observed after eight weeks. ICS therapy when started before the full establishment of chronic asthma reduced the development of lung inflammation, decreased goblet cell hyperplasia and collagen deposition, but did not affect smooth muscle thickening. These effects of ICS on airway remodelling were maintained for a further four weeks even when therapy was discontinued. Utilising a chronic model of experimental asthma we have shown that repeated allergen exposure induces reversible airway remodelling and inflammation in mice. Therapeutic intervention with ICS was partially effective in inhibiting the transition from acute to chronic asthma by reducing airway inflammation and remodelling but was ineffective in preventing smooth muscle hypertrophy.
Highly sensitive detection of individual HEAT and ARM repeats with HHpred and COACH.
Kippert, Fred; Gerloff, Dietlind L
2009-09-24
HEAT and ARM repeats occur in a large number of eukaryotic proteins. As these repeats are often highly diverged, the prediction of HEAT or ARM domains can be challenging. Except for the most clear-cut cases, identification at the individual repeat level is indispensable, in particular for determining domain boundaries. However, methods using single sequence queries do not have the sensitivity required to deal with more divergent repeats and, when applied to proteins with known structures, in some cases failed to detect a single repeat. Testing algorithms which use multiple sequence alignments as queries, we found two of them, HHpred and COACH, to detect HEAT and ARM repeats with greatly enhanced sensitivity. Calibration against experimentally determined structures suggests the use of three score classes with increasing confidence in the prediction, and prediction thresholds for each method. When we applied a new protocol using both HHpred and COACH to these structures, it detected 82% of HEAT repeats and 90% of ARM repeats, with the minimum for a given protein of 57% for HEAT repeats and 60% for ARM repeats. Application to bona fide HEAT and ARM proteins or domains indicated that similar numbers can be expected for the full complement of HEAT/ARM proteins. A systematic screen of the Protein Data Bank for false positive hits revealed their number to be low, in particular for ARM repeats. Double false positive hits for a given protein were rare for HEAT and not at all observed for ARM repeats. In combination with fold prediction and consistency checking (multiple sequence alignments, secondary structure prediction, and position analysis), repeat prediction with the new HHpred/COACH protocol dramatically improves prediction in the twilight zone of fold prediction methods, as well as the delineation of HEAT/ARM domain boundaries. A protocol is presented for the identification of individual HEAT or ARM repeats which is straightforward to implement. It provides high sensitivity at a low false positive rate and will therefore greatly enhance the accuracy of predictions of HEAT and ARM domains.
Highly Sensitive Detection of Individual HEAT and ARM Repeats with HHpred and COACH
Kippert, Fred; Gerloff, Dietlind L.
2009-01-01
Background HEAT and ARM repeats occur in a large number of eukaryotic proteins. As these repeats are often highly diverged, the prediction of HEAT or ARM domains can be challenging. Except for the most clear-cut cases, identification at the individual repeat level is indispensable, in particular for determining domain boundaries. However, methods using single sequence queries do not have the sensitivity required to deal with more divergent repeats and, when applied to proteins with known structures, in some cases failed to detect a single repeat. Methodology and Principal Findings Testing algorithms which use multiple sequence alignments as queries, we found two of them, HHpred and COACH, to detect HEAT and ARM repeats with greatly enhanced sensitivity. Calibration against experimentally determined structures suggests the use of three score classes with increasing confidence in the prediction, and prediction thresholds for each method. When we applied a new protocol using both HHpred and COACH to these structures, it detected 82% of HEAT repeats and 90% of ARM repeats, with the minimum for a given protein of 57% for HEAT repeats and 60% for ARM repeats. Application to bona fide HEAT and ARM proteins or domains indicated that similar numbers can be expected for the full complement of HEAT/ARM proteins. A systematic screen of the Protein Data Bank for false positive hits revealed their number to be low, in particular for ARM repeats. Double false positive hits for a given protein were rare for HEAT and not at all observed for ARM repeats. In combination with fold prediction and consistency checking (multiple sequence alignments, secondary structure prediction, and position analysis), repeat prediction with the new HHpred/COACH protocol dramatically improves prediction in the twilight zone of fold prediction methods, as well as the delineation of HEAT/ARM domain boundaries. Significance A protocol is presented for the identification of individual HEAT or ARM repeats which is straightforward to implement. It provides high sensitivity at a low false positive rate and will therefore greatly enhance the accuracy of predictions of HEAT and ARM domains. PMID:19777061
Genetic characterization of the UCS and Kex1 loci of Pneumocystis jirovecii.
Esteves, F; Tavares, A; Costa, M C; Gaspar, J; Antunes, F; Matos, O
2009-02-01
Nucleotide variation in the Pneumocystis jirovecii upstream conserved sequence (UCS) and kexin-like serine protease (Kex1) loci was studied in pulmonary specimens from Portuguese HIV-positive patients. DNA was extracted and used for specific molecular sequence analysis. The number of UCS tandem repeats detected in 13 successfully sequenced isolates ranged from three (9 isolates, 69%) to four (4 isolates, 31%). A novel tandem repeat pattern and two novel polymorphisms were detected in the UCS region. For the Kex1 gene, the wild-type (24 isolates, 86%) was the most frequent sequence detected among the 28 sequenced isolates. Nevertheless, a nonsynonymous (1 isolate, 3%) and three synonymous (3 isolates, 11%) polymorphisms were detected and are described here for the first time.
APE1 incision activity at abasic sites in tandem repeat sequences.
Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M
2014-05-29
Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.
Chen, Caihui; Zheng, Yongjie; Liu, Sian; Zhong, Yongda; Wu, Yanfang; Li, Jiang; Xu, Li-An; Xu, Meng
2017-01-01
Cinnamomum camphora , a member of the Lauraceae family, is a valuable aromatic and timber tree that is indigenous to the south of China and Japan. All parts of Cinnamomum camphora have secretory cells containing different volatile chemical compounds that are utilized as herbal medicines and essential oils. Here, we reported the complete sequencing of the chloroplast genome of Cinnamomum camphora using illumina technology. The chloroplast genome of Cinnamomum camphora is 152,570 bp in length and characterized by a relatively conserved quadripartite structure containing a large single copy region of 93,705 bp, a small single copy region of 19,093 bp and two inverted repeat (IR) regions of 19,886 bp. Overall, the genome contained 123 coding regions, of which 15 were repeated in the IR regions. An analysis of chloroplast sequence divergence revealed that the small single copy region was highly variable among the different genera in the Lauraceae family. A total of 40 repeat structures and 83 simple sequence repeats were detected in both the coding and non-coding regions. A phylogenetic analysis indicated that Calycanthus is most closely related to Lauraceae , both being members of Laurales , which forms a sister group to Magnoliids . The complete sequence of the chloroplast of Cinnamomum camphora will aid in in-depth taxonomical studies of the Lauraceae family in the future. The genetic sequence information will also have valuable applications for chloroplast genetic engineering.
[Detection of CRISPR and its relationship to drug resistance in Shigella].
Wang, Linlin; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Guo, Xiangjiao; Wang, Pengfei; Xi, Yuanlin; Yang, Haiyan
2015-04-04
To detect clustered regularly interspaced short palindromic repeats (CRISPR) in Shigella, and to analyze its relationship to drug resistance. Four pairs of primers were used for the detection of convincing CRISPR structures CRISPR-S2 and CRISPR-S4, questionable CRISPR structures CRISPR-S1 and CRISPR-S3 in 60 Shigella strains. All primers were designed using sequences in CRISPR database. CRISPR Finder was used to analyze CRISPR and susceptibilities of Shigella strains were tested by agar diffusion method. Furthermore, we analyzed the relationship between drug resistance and CRISPR-S4. The positive rate of convincing CRISPR structures was 95%. The four CRISPR loci formed 12 spectral patterns (A-L), all of which contained convincing CRISPR structures except type K. We found one new repeat and 12 new spacers. The multi-drug resistance rate was 53. 33% . We found no significant difference between CRISPR-S4 and drug resistant. However, the repeat sequence of CRISPR-S4 in multi- or TE-resistance strains was mainly R4.1 with AC deletions in the 3' end, and the spacer sequences of CRISPR-S4 in multi-drug resistance strains were mainly Sp5.1, Sp6.1 and Sp7. CRISPR was common in Shigella. Variations df repeat sequences and diversities of spacer sequences might be related to drug resistance in Shigella.
Breast Cancer in African American Women: Molecular Analysis of Differences in Incidence and Outcomes
2005-10-01
supporting proteins derived from serum used in the culture me- bone cell growth than collagen dip coated PHBV and dium. In the absence of natural ...medium suspension. Earlier, Concentration (mM) Medium Stock solution we had done a 10:1 dilution with trypan blue dye , the dye that allows us to...cancer model has to the tube. Vortex. traditionally used CyQUANT because of its simplicity, but 6. Using a repeater, add 200 pL of your dye mixture to
Jeong, Jae-Hee; Kim, Yi-Seul; Rojviriya, Catleya; Cha, Hyung Jin; Ha, Sung-Chul; Kim, Yeon-Gil
2013-10-01
The members of the ARM/HEAT repeat-containing protein superfamily in eukaryotes have been known to mediate protein-protein interactions by using their concave surface. However, little is known about the ARM/HEAT repeat proteins in prokaryotes. Here we report the crystal structure of TON1937, a hypothetical protein from the hyperthermophilic archaeon Thermococcus onnurineus NA1. The structure reveals a crescent-shaped molecule composed of a double layer of α-helices with seven anti-parallel α-helical repeats. A structure-based sequence alignment of the α-helical repeats identified a conserved pattern of hydrophobic or aliphatic residues reminiscent of the consensus sequence of eukaryotic HEAT repeats. The individual repeats of TON1937 also share high structural similarity with the canonical eukaryotic HEAT repeats. In addition, the concave surface of TON1937 is proposed to be its potential binding interface based on this structural comparison and its surface properties. These observations lead us to speculate that the archaeal HEAT-like repeats of TON1937 have evolved to engage in protein-protein interactions in the same manner as eukaryotic HEAT repeats. Copyright © 2013 Elsevier B.V. All rights reserved.
Linkage analysis in a family with Stickler syndrome leads to the exclusion of the COL2A1 locus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mottes, M.; Zolezzi, F.; Pignatti, P.F.
1994-09-01
Hereditary arthro-ophtalmopathy (AO) or Stickler Syndrome (MIM No. 10830) is a dominantly inherited disorder characterized by vitro-retinal degeneration and other connective tissue disturbances. Mutations in the COL2A1 gene, coding for type II collagen chains, have been described in a few patients. The wide spectrum of clinical manifestations is presumably due to genetic heterogeneity, since only about 50% of the Stickler families so far studied show cosegregation of the disease with the COL2A1 locus. We have investigated a large pedigree (19 individuals of whom 9 are affected) in which severe myopia with vitro-retinal degeneration consegregated with joint laxity, recurrent inguinal hernias,more » and degenerative changes of the hip and the knee. The 3{prime} end COL2A1 VNTR polymorphism was utilized for linkage analysis. In order to get the maximum informativity, we have analyzed the allelic microheterogeneity of this VNTR, due to the repeat sequence variation, by means of a single strand polymorphism. Mendelian inheritance of the different single strands was observed as expected. Discordance of segregation between the disease and the COL2A1 locus was thus established inequivocally in this family.« less
Mutation at a distance caused by homopolymeric guanine repeats in Saccharomyces cerevisiae
McDonald, Michael J.; Yu, Yen-Hsin; Guo, Jheng-Fen; Chong, Shin Yen; Kao, Cheng-Fu; Leu, Jun-Yi
2016-01-01
Mutation provides the raw material from which natural selection shapes adaptations. The rate at which new mutations arise is therefore a key factor that determines the tempo and mode of evolution. However, an accurate assessment of the mutation rate of a given organism is difficult because mutation rate varies on a fine scale within a genome. A central challenge of evolutionary genetics is to determine the underlying causes of this variation. In earlier work, we had shown that repeat sequences not only are prone to a high rate of expansion and contraction but also can cause an increase in mutation rate (on the order of kilobases) of the sequence surrounding the repeat. We perform experiments that show that simple guanine repeats 13 bp (base pairs) in length or longer (G13+) increase the substitution rate 4- to 18-fold in the downstream DNA sequence, and this correlates with DNA replication timing (R = 0.89). We show that G13+ mutagenicity results from the interplay of both error-prone translesion synthesis and homologous recombination repair pathways. The mutagenic repeats that we study have the potential to be exploited for the artificial elevation of mutation rate in systems biology and synthetic biology applications. PMID:27386516
Misas, Elizabeth; Muñoz, José Fernando; Gallo, Juan Esteban; McEwen, Juan Guillermo; Clay, Oliver Keatinge
2016-04-01
The presence of repetitive or non-unique DNA persisting over sizable regions of a eukaryotic genome can hinder the genome's successful de novo assembly from short reads: ambiguities in assigning genome locations to the non-unique subsequences can result in premature termination of contigs and thus overfragmented assemblies. Fungal mitochondrial (mtDNA) genomes are compact (typically less than 100 kb), yet often contain short non-unique sequences that can be shown to impede their successful de novo assembly in silico. Such repeats can also confuse processes in the cell in vivo. A well-studied example is ectopic (out-of-register, illegitimate) recombination associated with repeat pairs, which can lead to deletion of functionally important genes that are located between the repeats. Repeats that remain conserved over micro- or macroevolutionary timescales despite such risks may indicate functionally or structurally (e.g., for replication) important regions. This principle could form the basis of a mining strategy for accelerating discovery of function in genome sequences. We present here our screening of a sample of 11 fully sequenced fungal mitochondrial genomes by observing where exact k-mer repeats occurred several times; initial analyses motivated us to focus on 17-mers occurring more than three times. Based on the diverse repeats we observe, we propose that such screening may serve as an efficient expedient for gaining a rapid but representative first insight into the repeat landscapes of sparsely characterized mitochondrial chromosomes. Our matching of the flagged repeats to previously reported regions of interest supports the idea that systems of persisting, non-trivial repeats in genomes can often highlight features meriting further attention. Copyright © 2016 Elsevier Ltd. All rights reserved.
Comparison of the carboxy-terminal DP-repeat region in the co-chaperones Hop and Hip
Nelson, Gregory M.; Huffman, Holly; Smith, David F.
2003-01-01
Functional steroid receptor complexes are assembled and maintained by an ordered pathway of interactions involving multiple components of the cellular chaperone machinery. Two of these components, Hop and Hip, serve as co-chaperones to the major heat shock proteins (Hsps), Hsp70 and Hsp90, and participate in intermediate stages of receptor assembly. In an effort to better understand the functions of Hop and Hip in the assembly process, we focused on a region of similarity located near the C-terminus of each co-chaperone. Contained within this region is a repeated sequence motif we have termed the DP repeat. Earlier mutagenesis studies implicated the DP repeat of either Hop or Hip in Hsp70 binding and in normal assembly of the co-chaperones with progesterone receptor (PR) complexes. We report here that the DP repeat lies within a protease-resistant domain that extends to or is near the C-terminus of both co-chaperones. Point mutations in the DP repeats render the C-terminal regions hypersensitive to proteolysis. In addition, a Hop DP mutant displays altered proteolytic digestion patterns, which suggest that the DP-repeat region influences the folding of other Hop domains. Although the respective DP regions of Hop and Hip share sequence and structural similarities, they are not functionally interchangeable. Moreover, a double-point mutation within the second DP-repeat unit of Hop that converts this to the sequence found in Hip disrupts Hop function; however, the corresponding mutation in Hip does not alter its function. We conclude that the DP repeats are important structural elements within a C-terminal domain, which is important for Hop and Hip function. PMID:14627198
Comparison of the carboxy-terminal DP-repeat region in the co-chaperones Hop and Hip.
Nelson, Gregory M; Huffman, Holly; Smith, David F
2003-01-01
Functional steroid receptor complexes are assembled and maintained by an ordered pathway of interactions involving multiple components of the cellular chaperone machinery. Two of these components, Hop and Hip, serve as co-chaperones to the major heat shock proteins (Hsps), Hsp70 and Hsp90, and participate in intermediate stages of receptor assembly. In an effort to better understand the functions of Hop and Hip in the assembly process, we focused on a region of similarity located near the C-terminus of each co-chaperone. Contained within this region is a repeated sequence motif we have termed the DP repeat. Earlier mutagenesis studies implicated the DP repeat of either Hop or Hip in Hsp70 binding and in normal assembly of the co-chaperones with progesterone receptor (PR) complexes. We report here that the DP repeat lies within a protease-resistant domain that extends to or is near the C-terminus of both co-chaperones. Point mutations in the DP repeats render the C-terminal regions hypersensitive to proteolysis. In addition, a Hop DP mutant displays altered proteolytic digestion patterns, which suggest that the DP-repeat region influences the folding of other Hop domains. Although the respective DP regions of Hop and Hip share sequence and structural similarities, they are not functionally interchangeable. Moreover, a double-point mutation within the second DP-repeat unit of Hop that converts this to the sequence found in Hip disrupts Hop function; however, the corresponding mutation in Hip does not alter its function. We conclude that the DP repeats are important structural elements within a C-terminal domain, which is important for Hop and Hip function.
Selfish DNA in protein-coding genes of Rickettsia.
Ogata, H; Audic, S; Barbe, V; Artiguenave, F; Fournier, P E; Raoult, D; Claverie, J M
2000-10-13
Rickettsia conorii, the aetiological agent of Mediterranean spotted fever, is an intracellular bacterium transmitted by ticks. Preliminary analyses of the nearly complete genome sequence of R. conorii have revealed 44 occurrences of a previously undescribed palindromic repeat (150 base pairs long) throughout the genome. Unexpectedly, this repeat was found inserted in-frame within 19 different R. conorii open reading frames likely to encode functional proteins. We found the same repeat in proteins of other Rickettsia species. The finding of a mobile element inserted in many unrelated genes suggests the potential role of selfish DNA in the creation of new protein sequences.
Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin.
Ananiev, E V; Phillips, R L; Rines, H W
1998-01-01
The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of specific regions, such as knobs, of individual maize chromosomes. A DNA hybridization blot panel of eight individual maize chromosome addition lines revealed that 180-bp repeats found in knobs are present in each of these maize chromosomes, but the copy number varies from approximately 100 to 25, 000. Cosmid clones with knob DNA segments were isolated from a genomic library of an oat-maize chromosome 9 addition line with the help of the 180-bp knob-associated repeated DNA sequence used as a probe. Cloned knob DNA segments revealed a complex organization in which blocks of tandemly arranged 180-bp repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. Sequence microheterogeneity including point mutations and duplications was found in copies of 180-bp repeats. The 180-bp repeats within an array all had the same polarity. Restriction maps constructed for 23 cloned knob DNA fragments revealed the positions of polymorphic sites and sites of integration of insertion elements. Discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes. PMID:9691055
Pavelitz, T; Rusché, L; Matera, A G; Scharf, J M; Weiner, A M
1995-01-01
In primates, the tandemly repeated genes encoding U2 small nuclear RNA evolve concertedly, i.e. the sequence of the U2 repeat unit is essentially homogeneous within each species but differs somewhat between species. Using chromosome painting and the NGFR gene as an outside marker, we show that the U2 tandem array (RNU2) has remained at the same chromosomal locus (equivalent to human 17q21) through multiple speciation events over > 35 million years leading to the Old World monkey and hominoid lineages. The data suggest that the U2 tandem repeat, once established in the primate lineage, contained sequence elements favoring perpetuation and concerted evolution of the array in situ, despite a pericentric inversion in chimpanzee, a reciprocal translocation in gorilla and a paracentric inversion in orang utan. Comparison of the 11 kb U2 repeat unit found in baboon and other Old World monkeys with the 6 kb U2 repeat unit in humans and other hominids revealed that an ancestral U2 repeat unit was expanded by insertion of a 5 kb retrovirus bearing 1 kb long terminal repeats (LTRs). Subsequent excision of the provirus by homologous recombination between the LTRs generated a 6 kb U2 repeat unit containing a solo LTR. Remarkably, both junctions between the human U2 tandem array and flanking chromosomal DNA at 17q21 fall within the solo LTR sequence, suggesting a role for the LTR in the origin or maintenance of the primate U2 array. Images PMID:7828589
NASA Astrophysics Data System (ADS)
Iacumin, P.; Bocherens, H.; Delgado Huertas, A.; Mariotti, A.; Longinelli, A.
1997-04-01
A set of 102 tooth and bone samples of Pleistocene age (32,600-13,300 yr BP) belonging to the species Cervus elaphus, Bos primigenius and Equus caballus and coming from the Paglicci cave (Southern Italy) was studied for the carbon (δ 13C) and nitrogen (δ 15N) isotopic composition of bone and dentine collage and for the carbon (δ 13C c) isotopic composition of tooth enamel carbonate. The amount of collagen extracted from bone and tooth samples (mg/g) was rather variable, representing approximately only 0.5-15% of the collagen present in a fresh bone. However, the loss of an important fraction of the original collagen during diagenesis did not change the in vivo isotopic composition. In general, when the δ 13C of both collagen and carbonate and the δ 15N of collagen obtained from each level for the three species are compared, wild ox shows the most increased values, deer the most decreased values and horse shows intermediate results. These differences are probably related to distinct diets or to differences in their physiological behaviour. However, the isotopic results suggest that the three species considered lived in an open environment with a diet based on C 3 plants. The stratigraphic sequence of light and heavy nitrogen isotope values between 19,000 and 15,000 may be related to shifts from arid to humid conditions, while the overall trend shown by δ 13C toward lighter values may be related to a progressive development of a forest habitat.
CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats.
Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine
2007-07-01
Clustered regularly interspaced short palindromic repeats (CRISPRs) constitute a particular family of tandem repeats found in a wide range of prokaryotic genomes (half of eubacteria and almost all archaea). They consist of a succession of highly conserved regions (DR) varying in size from 23 to 47 bp, separated by similarly sized unique sequences (spacer) of usually viral origin. A CRISPR cluster is flanked on one side by an AT-rich sequence called the leader and assumed to be a transcriptional promoter. Recent studies suggest that this structure represents a putative RNA-interference-based immune system. Here we describe CRISPRFinder, a web service offering tools to (i) detect CRISPRs including the shortest ones (one or two motifs); (ii) define DRs and extract spacers; (iii) get the flanking sequences to determine the leader; (iv) blast spacers against Genbank database and (v) check if the DR is found elsewhere in prokaryotic sequenced genomes. CRISPRFinder is freely accessible at http://crispr.u-psud.fr/Server/CRISPRfinder.php.
Roe, Daisy; Miles, Christopher; Johnson, Andrew J
2017-07-01
The present paper examines the effect of within-sequence item repetitions in tactile order memory. Employing an immediate serial recall procedure, participants reconstructed a six-item sequence tapped upon their fingers by moving those fingers in the order of original stimulation. In Experiment 1a, within-sequence repetition of an item separated by two-intervening items resulted in a significant reduction in recall accuracy for that repeated item (i.e., the Ranschburg effect). In Experiment 1b, within-sequence repetition of an adjacent item resulted in significant recall facilitation for that repeated item. These effects mirror those reported for verbal stimuli (e.g., Henson, 1998a . Item repetition in short-term memory: Ranschburg repeated. Journal of Experimental Psychology: Learning, Memory, and Cognition, 24(5), 1162-1181. doi:doi.org/10.1037/0278-7393.24.5.1162). These data are the first to demonstrate the Ranschburg effect with non-verbal stimuli and suggest further cross-modal similarities in order memory.
Siju, S; Dhanya, K; Syamkumar, S; Sasikumar, B; Sheeja, T E; Bhat, A I; Parthasarathy, V A
2010-02-01
Expressed sequence tags (ESTs) from turmeric (Curcuma longa L.) were used for the screening of type and frequency of Class I (hypervariable) simple sequence repeats (SSRs). A total of 231 microsatellite repeats were detected from 12,593 EST sequences of turmeric after redundancy elimination. The average density of Class I SSRs accounts to one SSR per 17.96 kb of EST. Mononucleotides were the most abundant class of microsatellite repeat in turmeric ESTs followed by trinucleotides. A robust set of 17 polymorphic EST-SSRs were developed and used for evaluating 20 turmeric accessions. The number of alleles detected ranged from 3 to 8 per loci. The developed markers were also evaluated in 13 related species of C. longa confirming high rate (100%) of cross species transferability. The polymorphic microsatellite markers generated from this study could be used for genetic diversity analysis and resolving the taxonomic confusion prevailing in the genus.
A Glance at Microsatellite Motifs from 454 Sequencing Reads of Watermelon Genomic DNA
USDA-ARS?s Scientific Manuscript database
A single 454 (Life Sciences Sequencing Technology) run of Charleston Gray watermelon (Citrullus lanatus var. lanatus) genomic DNA was performed and sequence data were assembled. A large scale identification of simple sequence repeat (SSR) was performed and SSR sequence data were used for the develo...
Yu, Jeong-Nam; Won, Changman; Jun, Jumin; Lim, YoungWoon; Kwak, Myounghai
2011-01-01
Background Microsatellites, a special class of repetitive DNA sequence, have become one of the most popular genetic markers for population/conservation genetic studies. However, its application to endangered species has been impeded by high development costs, a lack of available sequences, and technical difficulties. The water deer Hydropotes inermis is the sole existing endangered species of the subfamily Capreolinae. Although population genetics studies are urgently required for conservation management, no species-specific microsatellite marker has been reported. Methods We adopted next-generation sequencing (NGS) to elucidate the microsatellite markers of Korean water deer and overcome these impediments on marker developments. We performed genotyping to determine the efficiency of this method as applied to population genetics. Results We obtained 98 Mbp of nucleotide information from 260,467 sequence reads. A total of 20,101 di-/tri-nucleotide repeat motifs were identified; di-repeats were 5.9-fold more common than tri-repeats. [CA]n and [AAC]n/[AAT]n repeats were the most frequent di- and tri-repeats, respectively. Of the 17,206 di-repeats, 12,471 microsatellite primer pairs were derived. PCR amplification of 400 primer pairs yielded 106 amplicons and 79 polymorphic markers from 20 individual Korean water deer. Polymorphic rates of the 79 new microsatellites varied from 2 to 11 alleles per locus (He: 0.050–0.880; Ho: 0.000–1.000), while those of known microsatellite markers transferred from cattle to Chinese water deer ranged from 4 to 6 alleles per locus (He: 0.279–0.714; Ho: 0.300–0.400). Conclusions Polymorphic microsatellite markers from Korean water deer were successfully identified using NGS without any prior sequence information and deposited into the public database. Thus, the methods described herein represent a rapid and low-cost way to investigate the population genetics of endangered/non-model species. PMID:22069476
Sun, Cheng; Wyngaard, Grace; Walton, D Brian; Wichman, Holly A; Mueller, Rachel Lockridge
2014-03-11
Chromatin diminution is the programmed deletion of DNA from presomatic cell or nuclear lineages during development, producing single organisms that contain two different nuclear genomes. Phylogenetically diverse taxa undergo chromatin diminution--some ciliates, nematodes, copepods, and vertebrates. In cyclopoid copepods, chromatin diminution occurs in taxa with massively expanded germline genomes; depending on species, germline genome sizes range from 15 - 75 Gb, 12-74 Gb of which are lost from pre-somatic cell lineages at germline--soma differentiation. This is more than an order of magnitude more sequence than is lost from other taxa. To date, the sequences excised from copepods have not been analyzed using large-scale genomic datasets, and the processes underlying germline genomic gigantism in this clade, as well as the functional significance of chromatin diminution, have remained unknown. Here, we used high-throughput genomic sequencing and qPCR to characterize the germline and somatic genomes of Mesocyclops edax, a freshwater cyclopoid copepod with a germline genome of ~15 Gb and a somatic genome of ~3 Gb. We show that most of the excised DNA consists of repetitive sequences that are either 1) verifiable transposable elements (TEs), or 2) non-simple repeats of likely TE origin. Repeat elements in both genomes are skewed towards younger (i.e. less divergent) elements. Excised DNA is a non-random sample of the germline repeat element landscape; younger elements, and high frequency DNA transposons and LINEs, are disproportionately eliminated from the somatic genome. Our results suggest that germline genome expansion in M. edax reflects explosive repeat element proliferation, and that billions of base pairs of such repeats are deleted from the somatic genome every generation. Thus, we hypothesize that chromatin diminution is a mechanism that controls repeat element load, and that this load can evolve to be divergent between tissue types within single organisms.
2014-01-01
Background Chromatin diminution is the programmed deletion of DNA from presomatic cell or nuclear lineages during development, producing single organisms that contain two different nuclear genomes. Phylogenetically diverse taxa undergo chromatin diminution — some ciliates, nematodes, copepods, and vertebrates. In cyclopoid copepods, chromatin diminution occurs in taxa with massively expanded germline genomes; depending on species, germline genome sizes range from 15 – 75 Gb, 12–74 Gb of which are lost from pre-somatic cell lineages at germline – soma differentiation. This is more than an order of magnitude more sequence than is lost from other taxa. To date, the sequences excised from copepods have not been analyzed using large-scale genomic datasets, and the processes underlying germline genomic gigantism in this clade, as well as the functional significance of chromatin diminution, have remained unknown. Results Here, we used high-throughput genomic sequencing and qPCR to characterize the germline and somatic genomes of Mesocyclops edax, a freshwater cyclopoid copepod with a germline genome of ~15 Gb and a somatic genome of ~3 Gb. We show that most of the excised DNA consists of repetitive sequences that are either 1) verifiable transposable elements (TEs), or 2) non-simple repeats of likely TE origin. Repeat elements in both genomes are skewed towards younger (i.e. less divergent) elements. Excised DNA is a non-random sample of the germline repeat element landscape; younger elements, and high frequency DNA transposons and LINEs, are disproportionately eliminated from the somatic genome. Conclusions Our results suggest that germline genome expansion in M. edax reflects explosive repeat element proliferation, and that billions of base pairs of such repeats are deleted from the somatic genome every generation. Thus, we hypothesize that chromatin diminution is a mechanism that controls repeat element load, and that this load can evolve to be divergent between tissue types within single organisms. PMID:24618421
Trabulsi, Manal; Oh, Tae-Ju; Eber, Robert; Weber, Daniel; Wang, Hom-Lay
2004-11-01
Enamel matrix derivative (EMD) has been shown to promote periodontal wound healing and/or regeneration when applied to tooth root surfaces in soft tissue dehiscence models. In addition, guided tissue regeneration (GTR)-based root coverage using collagen membrane (GTRC) has shown promising results. However, limited information is available regarding how EMD may influence GTRC outcome. Twenty-six patients with Miller's Class I or II gingival recession defects of 2.5 mm were recruited for the study. Subjects were randomly assigned to receive either EMD + collagen (EMDC; test group) or collagen membrane (GTRC; control group). Clinical parameters, including plaque index (PI), gingival index (GI), relative clinical attachment levels (RCAL) to the stent, recession depth (RD), recession width (RW), probing depth (PD), gingival tissue thickness (GTT), and width of keratinized gingiva (KG) were assessed at baseline, and 3 and 6 months after surgery. A repeated measure of analysis of variance (ANOVA) was used to determine differences between treatment groups and time effect. Both treatments (GTRC and EMDC) resulted in a statistically significant decrease in RD and RW between baseline and 6 months (P <0.05). However, no difference was noted between treatment groups. The percent of root coverage after 6 months was 75% for GTRC and 63% for EMDC. Complete 100% root coverage was achieved in five patients in the GTRC group, compared to only one patient in the EMDC group. There was a statistically significant gain (P <0.05) in the clinical attachment level (CAL) between baseline and 6 months in both groups, as reflected on the RCAL data. No other significant differences were noted on other clinical parameters (PD, GTT, KG, GI, and PI). GTR-based root coverage utilizing collagen membrane, with or without enamel matrix derivative, can be successfully used in obtaining gingival recession coverage. The application of EMD during GTRC procedures did not add additional benefit to the final clinical outcome.
Binks, Andrew P; Beyer, Megyn; Miller, Ryan; LeClair, Renee J
2017-03-01
Idiopathic pulmonary fibrosis (IPF) involves collagen deposition that results in a progressive decline in lung function. This process involves activation of Smad2/3 by transforming growth factor (TGF)- β and Wnt signaling pathways. Collagen Triple Helix Repeat-Containing-1 (Cthrc1) protein inhibits Smad2/3 activation. To test the hypothesis that Cthrc1 limits collagen deposition and the decline of lung function, Cthrc1 knockout (Cthrc1 -/- ) and wild-type mice (WT) received intratracheal injections of 2.5 U/kg bleomycin or saline. Lungs were harvested after 14 days and Bronchoalveolar lavage (BAL) TGF- β , IL1- β , hydroxyproline and lung compliance were assessed. TGF- β was significantly higher in Cthrc1 -/- compared to WT (53.45 ± 6.15 ng/mL vs. 34.48 ± 11.05) after saline injection. Bleomycin injection increased TGF- β in both Cthrc1 -/- (66.37 ± 8.54 ng/mL) and WT (63.64 ± 8.09 ng/mL). Hydroxyproline was significantly higher in Cthrc1 -/- compared to WT after bleomycin-injection (2.676 ± 0.527 μ g/mg vs. 1.889 ± 0.520, P = 0.028). Immunohistochemistry of Cthrc1 -/- lung sections showed intracellular localization and activation of β -catenin Y654 in areas of tissue remodeling that was not evident in WT Lung compliance was significantly reduced by bleomycin in Cthrc1 -/- but there was no effect in WT animals. These data suggest Cthrc1 reduces fibrotic tissue formation in bleomycin-induced lung fibrosis and the effect is potent enough to limit the decline in lung function. We conclude that Cthrc1 plays a protective role, limiting collagen deposition and could form the basis of a novel therapy for pulmonary fibrosis. © 2017 The Authors. Physiological Reports published by Wiley Periodicals, Inc. on behalf of The Physiological Society and the American Physiological Society.
Whitehouse, Michael R; Howells, Nicholas R; Parry, Michael C; Austin, Eric; Kafienah, Wael; Brady, Kyla; Goodship, Allen E; Eldridge, Jonathan D; Blom, Ashley W; Hollander, Anthony P
2017-04-01
Meniscal cartilage tears are common and predispose to osteoarthritis (OA). Most occur in the avascular portion of the meniscus where current repair techniques usually fail. We described previously the use of undifferentiated autologous mesenchymal stem cells (MSCs) seeded onto a collagen scaffold (MSC/collagen-scaffold) to integrate meniscal tissues in vitro. Our objective was to translate this method into a cell therapy for patients with torn meniscus, with the long-term goal of delaying or preventing the onset of OA. After in vitro optimization, we tested an ovine-MSC/collagen-scaffold in a sheep meniscal cartilage tear model with promising results after 13 weeks, although repair was not sustained over 6 months. We then conducted a single center, prospective, open-label first-in-human safety study of patients with an avascular meniscal tear. Autologous MSCs were isolated from an iliac crest bone marrow biopsy, expanded and seeded into the collagen scaffold. The resulting human-MSC/collagen-scaffold implant was placed into the meniscal tear prior to repair with vertical mattress sutures and the patients were followed for 2 years. Five patients were treated and there was significant clinical improvement on repeated measures analysis. Three were asymptomatic at 24 months with no magnetic resonance imaging evidence of recurrent tear and clinical improvement in knee function scores. Two required subsequent meniscectomy due to retear or nonhealing of the meniscal tear at approximately 15 months after implantation. No other adverse events occurred. We conclude that undifferentiated MSCs could provide a safe way to augment avascular meniscal repair in some patients. Registration: EU Clinical Trials Register, 2010-024162-22. Stem Cells Translational Medicine 2017;6:1237-1248. © 2017 The Authors Stem Cells Translational Medicine published by Wiley Periodicals, Inc. on behalf of AlphaMed Press.
Diversity and evolution of centromere repeats in the maize genome.
Bilinski, Paul; Distor, Kevin; Gutierrez-Lopez, Jose; Mendoza, Gabriela Mendoza; Shi, Jinghua; Dawe, R Kelly; Ross-Ibarra, Jeffrey
2015-03-01
Centromere repeats are found in most eukaryotes and play a critical role in kinetochore formation. Though centromere repeats exhibit considerable diversity both within and among species, little is understood about the mechanisms that drive centromere repeat evolution. Here, we use maize as a model to investigate how a complex history involving polyploidy, fractionation, and recent domestication has impacted the diversity of the maize centromeric repeat CentC. We first validate the existence of long tandem arrays of repeats in maize and other taxa in the genus Zea. Although we find considerable sequence diversity among CentC copies genome-wide, genetic similarity among repeats is highest within these arrays, suggesting that tandem duplications are the primary mechanism for the generation of new copies. Nonetheless, clustering analyses identify similar sequences among distant repeats, and simulations suggest that this pattern may be due to homoplasious mutation. Although the two ancestral subgenomes of maize have contributed nearly equal numbers of centromeres, our analysis shows that the majority of all CentC repeats derive from one of the parental genomes, with an even stronger bias when examining the largest assembled contiguous clusters. Finally, by comparing maize with its wild progenitor teosinte, we find that the abundance of CentC likely decreased after domestication, while the pericentromeric repeat Cent4 has drastically increased.
Medium-sized tandem repeats represent an abundant component of the Drosophila virilis genome.
Abdurashitov, Murat A; Gonchar, Danila A; Chernukhin, Valery A; Tomilov, Victor N; Tomilova, Julia E; Schostak, Natalia G; Zatsepina, Olga G; Zelentsova, Elena S; Evgen'ev, Michael B; Degtyarev, Sergey K H
2013-11-09
Previously, we developed a simple method for carrying out a restriction enzyme analysis of eukaryotic DNA in silico, based on the known DNA sequences of the genomes. This method allows the user to calculate lengths of all DNA fragments that are formed after a whole genome is digested at the theoretical recognition sites of a given restriction enzyme. A comparison of the observed peaks in distribution diagrams with the results from DNA cleavage using several restriction enzymes performed in vitro have shown good correspondence between the theoretical and experimental data in several cases. Here, we applied this approach to the annotated genome of Drosophila virilis which is extremely rich in various repeats. Here we explored the combined approach to perform the restriction analysis of D. virilis DNA. This approach enabled to reveal three abundant medium-sized tandem repeats within the D. virilis genome. While the 225 bp repeats were revealed previously in intergenic non-transcribed spacers between ribosomal genes of D. virilis, two other families comprised of 154 bp and 172 bp repeats were not described. Tandem Repeats Finder search demonstrated that 154 bp and 172 bp units are organized in multiple clusters in the genome of D. virilis. Characteristically, only 154 bp repeats derived from Helitron transposon are transcribed. Using in silico digestion in combination with conventional restriction analysis and sequencing of repeated DNA fragments enabled us to isolate and characterize three highly abundant families of medium-sized repeats present in the D. virilis genome. These repeats comprise a significant portion of the genome and may have important roles in genome function and structural integrity. Therefore, we demonstrated an approach which makes possible to investigate in detail the gross arrangement and expression of medium-sized repeats basing on sequencing data even in the case of incompletely assembled and/or annotated genomes.
Vickers, Timothy A.; Freier, Susan M.; Bui, Huynh-Hoa; Watt, Andrew; Crooke, Stanley T.
2014-01-01
A new strategy for identifying potent RNase H-dependent antisense oligonucleotides (ASOs) is presented. Our analysis of the human transcriptome revealed that a significant proportion of genes contain unique repeated sequences of 16 or more nucleotides in length. Activities of ASOs targeting these repeated sites in several representative genes were compared to those of ASOs targeting unique single sites in the same transcript. Antisense activity at repeated sites was also evaluated in a highly controlled minigene system. Targeting both native and minigene repeat sites resulted in significant increases in potency as compared to targeting of non-repeated sites. The increased potency at these sites is a result of increased frequency of ASO/RNA interactions which, in turn, increases the probability of a productive interaction between the ASO/RNA heteroduplex and human RNase H1 in the cell. These results suggest a new, highly efficient strategy for rapid identification of highly potent ASOs. PMID:25334092
Begum, Rabeya; Zakrzewski, Falk; Menzel, Gerhard; Weber, Beatrice; Alam, Sheikh Shamimul; Schmidt, Thomas
2013-07-01
The cultivated jute species Corchorus olitorius and Corchorus capsularis are important fibre crops. The analysis of repetitive DNA sequences, comprising a major part of plant genomes, has not been carried out in jute but is useful to investigate the long-range organization of chromosomes. The aim of this study was the identification of repetitive DNA sequences to facilitate comparative molecular and cytogenetic studies of two jute cultivars and to develop a fluorescent in situ hybridization (FISH) karyotype for chromosome identification. A plasmid library was generated from C. olitorius and C. capsularis with genomic restriction fragments of 100-500 bp, which was complemented by targeted cloning of satellite DNA by PCR. The diversity of the repetitive DNA families was analysed comparatively. The genomic abundance and chromosomal localization of different repeat classes were investigated by Southern analysis and FISH, respectively. The cytosine methylation of satellite arrays was studied by immunolabelling. Major satellite repeats and retrotransposons have been identified from C. olitorius and C. capsularis. The satellite family CoSat I forms two undermethylated species-specific subfamilies, while the long terminal repeat (LTR) retrotransposons CoRetro I and CoRetro II show similarity to the Metaviridea of plant retroelements. FISH karyotypes were developed by multicolour FISH using these repetitive DNA sequences in combination with 5S and 18S-5·8S-25S rRNA genes which enable the unequivocal chromosome discrimination in both jute species. The analysis of the structure and diversity of the repeated DNA is crucial for genome sequence annotation. The reference karyotypes will be useful for breeding of jute and provide the basis for karyotyping homeologous chromosomes of wild jute species to reveal the genetic and evolutionary relationship between cultivated and wild Corchorus species.
A Method for WD40 Repeat Detection and Secondary Structure Prediction
Wang, Yang; Jiang, Fan; Zhuo, Zhu; Wu, Xian-Hui; Wu, Yun-Dong
2013-01-01
WD40-repeat proteins (WD40s), as one of the largest protein families in eukaryotes, play vital roles in assembling protein-protein/DNA/RNA complexes. WD40s fold into similar β-propeller structures despite diversified sequences. A program WDSP (WD40 repeat protein Structure Predictor) has been developed to accurately identify WD40 repeats and predict their secondary structures. The method is designed specifically for WD40 proteins by incorporating both local residue information and non-local family-specific structural features. It overcomes the problem of highly diversified protein sequences and variable loops. In addition, WDSP achieves a better prediction in identifying multiple WD40-domain proteins by taking the global combination of repeats into consideration. In secondary structure prediction, the average Q3 accuracy of WDSP in jack-knife test reaches 93.7%. A disease related protein LRRK2 was used as a representive example to demonstrate the structure prediction. PMID:23776530
Gebhard, Harry; Bowles, Robby; Dyke, Jonathan; Saleh, Tatianna; Doty, Stephen; Bonassar, Lawrence; Härtl, Roger
2010-01-01
Study type: Basic science Introduction: Chronic back pain due to degenerative disc disease (DDD) is among the most important medical conditions causing morbidity and significant health care costs. Surgical treatment options include disc replacement or fusion surgery, but are associated with significant short- and long-term risks.1 Biological tissue-engineering of human intervertebral discs (IVD) could offer an important alternative.2 Recent in vitro data from our group have shown successful engineering and growth of ovine intervertebral disc composites with circumferentially aligned collagen fibrils in the annulus fibrosus (AF) (Figure 1).3 Figure 1 Tissue-engineered composite disc a Experimental steps to generate composite tissue-engineered IVDs3 b Example of different AF formulations on collagen alignment in the AF. Second harmonic generation and two-photon excited fluorescence images of seeded collagen gels (for AF) of 1 and 2.5 mg/ml over time. At seeding, cells and collagen were homogenously distributed in the gels. Over time, AF cells elongated and collagen aligned parallel to cells. Less contraction and less alignment is noted after 3 days in the 2.5 mg/mL gel. c Imaging-based creation of a virtual disc model that will serve as template for the engineered disc. Total disc dimensions (AF and NP) were retrieved from micro-computer tomography (CT) (left images), and nucleus pulposus (NP) dimensions alone were retrieved from T2-weighted MRI images (right images). Merging of MRI and micro-CT models revealed a composite disc model (middle image)—Software: Microview, GE Healthcare Inc., Princeton, NJ; and slicOmatic v4.3, TomoVision, Montreal, Canada. d Flow chart describing the process for generating multi-lamellar tissue engineered IVDs. IVDs are produced by allowing cell-seeded collagen layers to contract around a cell-seeded alginate core (NP) over time Objective: The next step is to investigate if biological disc implants survive, integrate, and restore function to the spine in vivo. A model will be developed that allows efficient in vivo testing of tissue-engineered discs of various compositions and characteristics. Methods: Athymic rats were anesthetized and a dorsal approach was chosen to perform a microsurgical discectomy in the rat caudal spine (Fig. 2,Fig. 3). Control group I (n = 6) underwent discectomy only, Control group II (n = 6) underwent discectomy, followed by reimplantation of the autologous disc. Two treatment groups (group III, n = 6, 1 month survival; group IV, n = 6, 6 months survival) received a tissue-engineered composite disc implant. The rodents were followed clinically for signs of infection, pain level and wound healing. X-rays and magnetic resonance imaging (MRI) were assessed postoperatively and up to 6 months after surgery (Fig. 6,Fig. 7). A 7 Tesla MRI (Bruker) was implemented for assessment of the operated level as well as the adjacent disc (hydration). T2-weighted sequences were interpreted by a semiquantitative score (0 = no signal, 1 = weak signal, 2 = strong signal and anatomical features of a normal disc). Histology was performed with staining for proteoglycans (Alcian blue) and collagen (Picrosirius red) (Fig. 4,Fig. 5). Figure 2 Disc replacement surgery a Operative situs with native disc that has been disassociated from both adjacent vertebrae b Native disc (left) and tissue-engineered implant (right) c Implant in situ before wound closureAF: Annulus fi brosus, nP: nucleus pulposus, eP: endplate, M: Muscle, T: Tendon, s: skin, art: artery, GP: Growth plate, B: Bone Figure 3 Disc replacement surgery. Anatomy of the rat caudal disc space a Pircrosirius red stained axial cut of native disc space b Saffranin-O stained sagittal cut of native disc space Figure 4 Histologies of three separate motion segments from three different rats. Animal one = native IVD, Animal two = status after discectomy, Animal three = tissue-engineered implant (1 month) a–c H&E (overall tissue staining for light micrsocopy) d–f Alcian blue (proteoglycans) g–i Picrosirius red (collagen I and II) Figure 5 Histology from one motion segment four months after implantation of a bio-engineered disc construct a Picrosirius red staining (collagen) b Polarized light microscopy showing collagen staining and collagen organization in AF region c Increased Safranin-O staining (proteoglycans) in NP region of the disc implant d Higher magnification of figure 5c: Integration between implanted tissue-engineered total disc replacement and vertebral body bone Figure 6 MRI a Disc space height measurements in flash/T1 sequence (top: implant (714.0 micrometer), bottom: native disc (823.5 micrometer) b T2 sequence, red circle surrounding the implant NP Figure 7 7 Tesla MRI imaging of rat tail IVDs showing axial images (preliminary pilot data) a Diffusion tensor imaging (DTI) on two explanted rat tail discs in Formalin b Higher magnification of a, showing directional alignment of collagen fibers (red and green) when compared to the color ball on top which maps fibers' directional alignment (eg, fibers directing from left to right: red, from top to bottom: blue) c Native IVD in vivo (successful imaging of top and bottom of the IVD (red) d Gradient echo sequence (GE) showing differentiation between NP (light grey) and AF (dark margin) e GE of reimplanted tail IVD at the explantation level f T1Rho sequence demonstrating the NP (grey) within the AF (dark margin), containing the yellow marked region of interest for value acquisition (preliminary data are consistent with values reported in the literature). g T2 image of native IVD in vivo for monitoring of hydration (white: NP) Results: The model allowed reproducible and complete discectomies as well as disc implantation in the rat tail spine without any surgical or postoperative complications. Discectomy resulted in immediate collapse of the disc space. Preliminary results indicate that disc space height was maintained after disc implantation in groups II, III and IV over time. MRI revealed high resolution images of normal intervertebral discs in vivo. Eight out of twelve animals (groups III and IV) showed a positive signal in T2-weighted images after 1 month (grade 0 = 4, grade 1 = 4, grade 2 = 4). Positive staining was seen for collagen as well as proteoglycans at the site of disc implantation after 1 month in each of the six animals with engineered implants (group III). Analysis of group IV showed positive T2 signal in five out of six animals and disc-height preservation in all animals after 6 months. Conclusions: This study demonstrates for the first time that tissue-engineered composite IVDs with circumferentially aligned collagen fibrils survive and integrate with surrounding vertebral bodies when placed in the rat spine for up to 6 months. Tissue-engineered composite IVDs restored function to the rat spine as indicated by maintenance of disc height and vertebral alignment. A significant finding was that maintenance of the composite structure in group III was observed, with increased proteoglycan staining in the nucleus pulposus region (Figure 4d–f). Proteoglycan and collagen matrix as well as disc height preservation and positive T2 signals in MRI are promising parameters and indicate functionality of the implants. PMID:23637671
Gebhard, Harry; Bowles, Robby; Dyke, Jonathan; Saleh, Tatianna; Doty, Stephen; Bonassar, Lawrence; Härtl, Roger
2010-08-01
Basic science Introduction: Chronic back pain due to degenerative disc disease (DDD) is among the most important medical conditions causing morbidity and significant health care costs. Surgical treatment options include disc replacement or fusion surgery, but are associated with significant short- and long-term risks.1 Biological tissue-engineering of human intervertebral discs (IVD) could offer an important alternative.2 Recent in vitro data from our group have shown successful engineering and growth of ovine intervertebral disc composites with circumferentially aligned collagen fibrils in the annulus fibrosus (AF) (Figure 1).3 Figure 1 Tissue-engineered composite disc a Experimental steps to generate composite tissue-engineered IVDs3b Example of different AF formulations on collagen alignment in the AF. Second harmonic generation and two-photon excited fluorescence images of seeded collagen gels (for AF) of 1 and 2.5 mg/ml over time. At seeding, cells and collagen were homogenously distributed in the gels. Over time, AF cells elongated and collagen aligned parallel to cells. Less contraction and less alignment is noted after 3 days in the 2.5 mg/mL gel. c Imaging-based creation of a virtual disc model that will serve as template for the engineered disc. Total disc dimensions (AF and NP) were retrieved from micro-computer tomography (CT) (left images), and nucleus pulposus (NP) dimensions alone were retrieved from T2-weighted MRI images (right images). Merging of MRI and micro-CT models revealed a composite disc model (middle image)-Software: Microview, GE Healthcare Inc., Princeton, NJ; and slicOmatic v4.3, TomoVision, Montreal, Canada. d Flow chart describing the process for generating multi-lamellar tissue engineered IVDs. IVDs are produced by allowing cell-seeded collagen layers to contract around a cell-seeded alginate core (NP) over time Objective: The next step is to investigate if biological disc implants survive, integrate, and restore function to the spine in vivo. A model will be developed that allows efficient in vivo testing of tissue-engineered discs of various compositions and characteristics. Athymic rats were anesthetized and a dorsal approach was chosen to perform a microsurgical discectomy in the rat caudal spine (Fig. 2,Fig. 3). Control group I (n = 6) underwent discectomy only, Control group II (n = 6) underwent discectomy, followed by reimplantation of the autologous disc. Two treatment groups (group III, n = 6, 1 month survival; group IV, n = 6, 6 months survival) received a tissue-engineered composite disc implant. The rodents were followed clinically for signs of infection, pain level and wound healing. X-rays and magnetic resonance imaging (MRI) were assessed postoperatively and up to 6 months after surgery (Fig. 6,Fig. 7). A 7 Tesla MRI (Bruker) was implemented for assessment of the operated level as well as the adjacent disc (hydration). T2-weighted sequences were interpreted by a semiquantitative score (0 = no signal, 1 = weak signal, 2 = strong signal and anatomical features of a normal disc). Histology was performed with staining for proteoglycans (Alcian blue) and collagen (Picrosirius red) (Fig. 4,Fig. 5). Figure 2 Disc replacement surgery a Operative situs with native disc that has been disassociated from both adjacent vertebrae b Native disc (left) and tissue-engineered implant (right) c Implant in situ before wound closureAF: Annulus fi brosus, nP: nucleus pulposus, eP: endplate, M: Muscle, T: Tendon, s: skin, art: artery, GP: Growth plate, B: BoneFigure 3 Disc replacement surgery. Anatomy of the rat caudal disc space a Pircrosirius red stained axial cut of native disc space b Saffranin-O stained sagittal cut of native disc spaceFigure 4 Histologies of three separate motion segments from three different rats. Animal one = native IVD, Animal two = status after discectomy, Animal three = tissue-engineered implant (1 month) a-c H&E (overall tissue staining for light micrsocopy) d-f Alcian blue (proteoglycans) g-i Picrosirius red (collagen I and II)Figure 5 Histology from one motion segment four months after implantation of a bio-engineered disc construct a Picrosirius red staining (collagen) b Polarized light microscopy showing collagen staining and collagen organization in AF region c Increased Safranin-O staining (proteoglycans) in NP region of the disc implant d Higher magnification of figure 5c: Integration between implanted tissue-engineered total disc replacement and vertebral body boneFigure 6 MRI a Disc space height measurements in flash/T1 sequence (top: implant (714.0 micrometer), bottom: native disc (823.5 micrometer) b T2 sequence, red circle surrounding the implant NPFigure 7 7 Tesla MRI imaging of rat tail IVDs showing axial images (preliminary pilot data) a Diffusion tensor imaging (DTI) on two explanted rat tail discs in Formalin b Higher magnification of a, showing directional alignment of collagen fibers (red and green) when compared to the color ball on top which maps fibers' directional alignment (eg, fibers directing from left to right: red, from top to bottom: blue) c Native IVD in vivo (successful imaging of top and bottom of the IVD (red) d Gradient echo sequence (GE) showing differentiation between NP (light grey) and AF (dark margin) e GE of reimplanted tail IVD at the explantation level f T1Rho sequence demonstrating the NP (grey) within the AF (dark margin), containing the yellow marked region of interest for value acquisition (preliminary data are consistent with values reported in the literature). g T2 image of native IVD in vivo for monitoring of hydration (white: NP) Results: The model allowed reproducible and complete discectomies as well as disc implantation in the rat tail spine without any surgical or postoperative complications. Discectomy resulted in immediate collapse of the disc space. Preliminary results indicate that disc space height was maintained after disc implantation in groups II, III and IV over time. MRI revealed high resolution images of normal intervertebral discs in vivo. Eight out of twelve animals (groups III and IV) showed a positive signal in T2-weighted images after 1 month (grade 0 = 4, grade 1 = 4, grade 2 = 4). Positive staining was seen for collagen as well as proteoglycans at the site of disc implantation after 1 month in each of the six animals with engineered implants (group III). Analysis of group IV showed positive T2 signal in five out of six animals and disc-height preservation in all animals after 6 months. This study demonstrates for the first time that tissue-engineered composite IVDs with circumferentially aligned collagen fibrils survive and integrate with surrounding vertebral bodies when placed in the rat spine for up to 6 months. Tissue-engineered composite IVDs restored function to the rat spine as indicated by maintenance of disc height and vertebral alignment. A significant finding was that maintenance of the composite structure in group III was observed, with increased proteoglycan staining in the nucleus pulposus region (Figure 4d-f). Proteoglycan and collagen matrix as well as disc height preservation and positive T2 signals in MRI are promising parameters and indicate functionality of the implants.
Memory for sequences of events impaired in typical aging.
Allen, Timothy A; Morris, Andrea M; Stark, Shauna M; Fortin, Norbert J; Stark, Craig E L
2015-03-01
Typical aging is associated with diminished episodic memory performance. To improve our understanding of the fundamental mechanisms underlying this age-related memory deficit, we previously developed an integrated, cross-species approach to link converging evidence from human and animal research. This novel approach focuses on the ability to remember sequences of events, an important feature of episodic memory. Unlike existing paradigms, this task is nonspatial, nonverbal, and can be used to isolate different cognitive processes that may be differentially affected in aging. Here, we used this task to make a comprehensive comparison of sequence memory performance between younger (18-22 yr) and older adults (62-86 yr). Specifically, participants viewed repeated sequences of six colored, fractal images and indicated whether each item was presented "in sequence" or "out of sequence." Several out of sequence probe trials were used to provide a detailed assessment of sequence memory, including: (i) repeating an item from earlier in the sequence ("Repeats"; e.g., AB A: DEF), (ii) skipping ahead in the sequence ("Skips"; e.g., AB D: DEF), and (iii) inserting an item from a different sequence into the same ordinal position ("Ordinal Transfers"; e.g., AB 3: DEF). We found that older adults performed as well as younger controls when tested on well-known and predictable sequences, but were severely impaired when tested using novel sequences. Importantly, overall sequence memory performance in older adults steadily declined with age, a decline not detected with other measures (RAVLT or BPS-O). We further characterized this deficit by showing that performance of older adults was severely impaired on specific probe trials that required detailed knowledge of the sequence (Skips and Ordinal Transfers), and was associated with a shift in their underlying mnemonic representation of the sequences. Collectively, these findings provide unambiguous evidence that the capacity to remember sequences of events is fundamentally affected by typical aging. © 2015 Allen et al.; Published by Cold Spring Harbor Laboratory Press.
Hirosawa, I; Aritomi, K; Hoshida, H; Kashiwagi, S; Nishizawa, Y; Akada, R
2004-07-01
The commercial application of genetically modified industrial microorganisms has been problematic due to public concerns. We constructed a "self-cloning" sake yeast strain that overexpresses the ATF1 gene encoding alcohol acetyltransferase, to improve the flavor profile of Japanese sake. A constitutive yeast overexpression promoter, TDH3p, derived from the glyceraldehyde-3-phosphate dehydrogenase gene from sake yeast was fused to ATF1; and the 5' upstream non-coding sequence of ATF1 was further fused to TDH3p-ATF1. The fragment was placed on a binary vector, pGG119, containing a drug-resistance marker for transformation and a counter-selection marker for excision of unwanted DNA. The plasmid was integrated into the ATF1 locus of a sake yeast strain. This integration constructed tandem repeats of ATF1 and TDH3p-ATF1 sequences, between which the plasmid was inserted. Loss of the plasmid, which occurs through homologous recombination between either the TDH3p downstream ATF1 repeats or the TDH3p upstream repeat sequences, was selected by growing transformants on counter-selective medium. Recombination between the downstream repeats led to reversion to a wild type strain, but that between the upstream repeats resulted in a strain that possessed TDH3p-ATF1 without the extraneous DNA sequences. The self-cloning TDH3p-ATF1 yeast strain produced a higher amount of isoamyl acetate. This is the first expression-controlled self-cloning industrial yeast.
Zheng, Renhua; Xu, Haibin; Zhou, Yanwei; Li, Meiping; Lu, Fengjuan; Dong, Yini; Liu, Xin; Chen, Jinhui; Shi, Jisen
2016-01-01
Glyptostrobus pensilis, belonging to the monotypic genus Glyptostrobus (Family: Cupressaceae), is an ancient conifer that is naturally distributed in low-lying wet areas. Here, we report the complete chloroplast (cp) genome sequence (132,239 bp) of G. pensilis. The G. pensilis cp genome is similar in gene content, organization and genome structure to the sequenced cp genomes from other cupressophytes, especially with respect to the loss of the inverted repeat region A (IRA). Through phylogenetic analysis, we demonstrated that the genus Glyptostrobus is closely related to the genus Cryptomeria, supporting previous findings based on physiological characteristics. Since IRs play an important role in stabilize cp genome and conifer cp genomes lost different IR regions after splitting in two clades (cupressophytes and Pinaceae), we performed cp genome rearrangement analysis and found more extensive cp genome rearrangements among the species of cupressophytes relative to Pinaceae. Additional repeat analysis indicated that cupressophytes cp genomes contained less potential functional repeats, especially in Cupressaceae, compared with Pinaceae. These results suggested that dynamics of cp genome rearrangement in conifers differed since the two clades, Pinaceae and cupressophytes, lost IR copies independently and developed different repeats to complement the residual IRs. In addition, we identified 170 perfect simple sequence repeats that will be useful in future research focusing on the evolution of genetic diversity and conservation of genetic variation for this endangered species in the wild. PMID:27560965
Haider, Nadia
2017-01-01
Investigation of genetic variation and phylogenetic relationships among date palm (Phoenix dactylifera L.) cultivars is useful for their conservation and genetic improvement. Various molecular markers such as restriction fragment length polymorphisms (RFLPs), simple sequence repeat (SSR), representational difference analysis (RDA), and amplified fragment length polymorphism (AFLP) have been developed to molecularly characterize date palm cultivars. PCR-based markers random amplified polymorphic DNA (RAPD) and inter-simple sequence repeat (ISSR) are powerful tools to determine the relatedness of date palm cultivars that are difficult to distinguish morphologically. In this chapter, the principles, materials, and methods of RAPD and ISSR techniques are presented. Analysis of data generated from these two techniques and the use of these data to reveal phylogenetic relationships among date palm cultivars are also discussed.
Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.
Nakano, Kazuma; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Ohki, Shun; Shinzato, Misuzu; Minami, Maiko; Nakanishi, Tetsuhiro; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi
2017-07-01
PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.
Length and sequence variability in mitochondrial control region of the milkfish, Chanos chanos.
Ravago, Rachel G; Monje, Virginia D; Juinio-Meñez, Marie Antonette
2002-01-01
Extensive length variability was observed in the mitochondrial control region of the milkfish, Chanos chanos. The nucleotide sequence of the control region and flanking regions was determined. Length variability and heteroplasmy was due to the presence of varying numbers of a 41-bp tandemly repeated sequence and a 48-bp insertion/deletion (indel). The structure and organization of the milkfish control region is similar to that of other teleost fish and vertebrates. However, extensive variation in the copy number of tandem repeats (4-20 copies) and the presence of a relatively large (48-bp) indel, are apparently uncommon in teleost fish control region sequences reported to date. High sequence variability of control region peripheral domains indicates the potential utility of selected regions as markers for population-level studies.
Length and sequence heterogeneity in 5S rDNA of Populus deltoides.
Negi, Madan S; Rajagopal, Jyothi; Chauhan, Neeti; Cronn, Richard; Lakshmikumaran, Malathi
2002-12-01
The 5S rRNA genes and their associated non-transcribed spacer (NTS) regions are present as repeat units arranged in tandem arrays in plant genomes. Length heterogeneity in 5S rDNA repeats was previously identified in Populus deltoides and was also observed in the present study. Primers were designed to amplify the 5S rDNA NTS variants from the P. deltoides genome. The PCR-amplified products from the two accessions of P. deltoides (G3 and G48) suggested the presence of length heterogeneity of 5S rDNA units within and among accessions, and the size of the spacers ranged from 385 to 434 bp. Sequence analysis of the non-transcribed spacer (NTS) revealed two distinct classes of 5S rDNA within both accessions: class 1, which contained GAA trinucleotide microsatellite repeats, and class 2, which lacked the repeats. The class 1 spacer shows length variation owing to the microsatellite, with two clones exhibiting 10 GAA repeat units and one clone exhibiting 16 such repeat units. However, distance analysis shows that class 1 spacer sequences are highly similar inter se, yielding nucleotide diversity (pi) estimates that are less than 0.15% of those obtained for class 2 spacers (pi = 0.0183 vs. 0.1433, respectively). The presence of microsatellite in the NTS region leading to variation in spacer length is reported and discussed for the first time in P. deltoides.
M.N. lslam-Faridi; C.D. Nelson; S.P. DiFazio; L.E. Gunter; G.A. Tuskan
2009-01-01
The 185-285 rDNA and 55 rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 185-285 rDNA sites and one 55 rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis-type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones...
Tochio, Naoya; Umehara, Kohei; Uewaki, Jun-ichi; Flechsig, Holger; Kondo, Masaharu; Dewa, Takehisa; Sakuma, Tetsushi; Yamamoto, Takashi; Saitoh, Takashi; Togashi, Yuichi; Tate, Shin-ichi
2016-01-01
Transcription activator-like effector (TALE) nuclease (TALEN) is widely used as a tool in genome editing. The DNA binding part of TALEN consists of a tandem array of TAL-repeats that form a right-handed superhelix. Each TAL-repeat recognises a specific base by the repeat variable diresidue (RVD) at positions 12 and 13. TALEN comprising the TAL-repeats with periodic mutations to residues at positions 4 and 32 (non-RVD sites) in each repeat (VT-TALE) exhibits increased efficacy in genome editing compared with a counterpart without the mutations (CT-TALE). The molecular basis for the elevated efficacy is unknown. In this report, comparison of the physicochemical properties between CT- and VT-TALEs revealed that VT-TALE has a larger amplitude motion along the superhelical axis (superhelical motion) compared with CT-TALE. The greater superhelical motion in VT-TALE enabled more TAL-repeats to engage in the target sequence recognition compared with CT-TALE. The extended sequence recognition by the TAL-repeats improves site specificity with limiting the spatial distribution of FokI domains to facilitate their dimerization at the desired site. Molecular dynamics simulations revealed that the non-RVD mutations alter inter-repeat hydrogen bonding to amplify the superhelical motion of VT-TALE. The TALEN activity is associated with the inter-repeat hydrogen bonding among the TAL repeats. PMID:27883072
NASA Astrophysics Data System (ADS)
Villiger, Martin; Karanasos, Antonios; Ren, Jian; Lippok, Norman; Shishkov, Milen; Daemen, Joost; Van Mieghem, Nicolas; Diletti, Roberto; Valgimigli, Marco; van Geuns, Robert-Jan; de Jaegere, Peter; Zijlstra, Felix; van Soest, Gijs; Nadkarni, Seemantini; Regar, Evelyn; Bouma, Brett E.
2016-02-01
Polarization sensitive (PS) OCT measures the polarization states of the light backscattered by tissue and provides measures of tissue birefringence and depolarization in addition to the structural OCT signal. Ex vivo studies have demonstrated that birefringence is increased in tissue rich in collagen and with elevated smooth muscle cell content. Preliminary data further suggests that depolarization can identify regions of macrophage infiltration, lipid, and irregularly arranged collagen fibers. These are important aspects of the mechanical integrity and vulnerability of atherosclerotic plaques. To evaluate the potential of PS-OCT in the clinical setting, we combined our custom PS-OCT system with commercially available OCT catheters (Fastview, Terumo Corporation) and performed a pilot study in 30 patients, scheduled to undergo percutaneous coronary intervention (PCI) on the grounds of stable or unstable angina. A total of 82 pullbacks in 39 vessels were performed, either in the native coronary arteries or post procedure. Comparing consecutive pullbacks of the same coronary artery, we found excellent agreement between the polarization features in the repeat pullbacks, validating the repeatability and robustness of PS-OCT in the clinical in vivo setting. In addition we observed that the birefringence and depolarization features vary significantly across lesions with identical structural OCT appearance, suggesting morphological subtypes. This first human pilot study proved the feasibility and robustness of intravascular PS-OCT. PS-OCT achieves improved tissue characterization and may help in identifying high-risk plaques, with the potential to ultimately improve risk stratification and help guiding PCI.
USDA-ARS?s Scientific Manuscript database
Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
Development and characterization of a eukaryotic expression system for human type II procollagen.
Wieczorek, Andrew; Rezaei, Naghmeh; Chan, Clara K; Xu, Chuan; Panwar, Preety; Brömme, Dieter; Merschrod S, Erika F; Forde, Nancy R
2015-12-15
Triple helical collagens are the most abundant structural protein in vertebrates and are widely used as biomaterials for a variety of applications including drug delivery and cellular and tissue engineering. In these applications, the mechanics of this hierarchically structured protein play a key role, as does its chemical composition. To facilitate investigation into how gene mutations of collagen lead to disease as well as the rational development of tunable mechanical and chemical properties of this full-length protein, production of recombinant expressed protein is required. Here, we present a human type II procollagen expression system that produces full-length procollagen utilizing a previously characterized human fibrosarcoma cell line for production. The system exploits a non-covalently linked fluorescence readout for gene expression to facilitate screening of cell lines. Biochemical and biophysical characterization of the secreted, purified protein are used to demonstrate the proper formation and function of the protein. Assays to demonstrate fidelity include proteolytic digestion, mass spectrometric sequence and posttranslational composition analysis, circular dichroism spectroscopy, single-molecule stretching with optical tweezers, atomic-force microscopy imaging of fibril assembly, and transmission electron microscopy imaging of self-assembled fibrils. Using a mammalian expression system, we produced full-length recombinant human type II procollagen. The integrity of the collagen preparation was verified by various structural and degradation assays. This system provides a platform from which to explore new directions in collagen manipulation.
Günaltay, Sezin; Rademacher, Lech; Hultgren Hörnquist, Elisabeth; Bohr, Johan
2017-01-01
One to six percent of patients with microscopic colitis are refractory to medical treatment. The effect of faecal microbiota transplantation (FMT) in active collagenous colitis (CC) has, to the best of our knowledge, never been reported before. Here, we report the effect of repeated FMT in a patient with CC. The patient presented with severe symptoms including profuse diarrhea and profound weight loss. Although she responded to budesonide in the beginning, she became gradually refractory to medical treatment, and was therefore treated with FMT. The patient remained in remission for 11 mo after the third faecal transplantation. The immunomodulatory effect of the therapy was evaluated using flow cytometry, which showed alterations in the profile of intraepithelial and lamina propria lymphocyte subsets after the second transplantation. Our observations indicate that FMT can have an effect in CC, which support the hypothesis that luminal factors, influencing the intestinal microbiota, are involved in the pathogenesis of CC. PMID:28275312
A switch in disulfide linkage during minicollagen assembly in Hydra nematocysts.
Engel, U; Pertz, O; Fauser, C; Engel, J; David, C N; Holstein, T W
2001-06-15
The smallest known collagens with only 14 Gly-X-Y repeats referred to as minicollagens are the main constituents of the capsule wall of nematocysts. These are explosive organelles found in Hydra, jellyfish, corals and other Cnidaria. Minicollagen-1 of Hydra recombinantly expressed in mammalian 293 cells contains disulfide bonds within its N- and C-terminal Cys-rich domains but no interchain cross-links. It is soluble and self-associates through non-covalent interactions to form 25-nm-long trimeric helical rod-like molecules. We have used a polyclonal antibody prepared against the recombinant protein to follow the maturation of minicollagens from soluble precursors present in the endoplasmic reticulum and post-Golgi vacuoles to the disulfide-linked insoluble assembly form of the wall. The switch from intra- to intermolecular disulfide bonds is associated with 'hardening' of the capsule wall and provides an explanation for its high tensile strength and elasticity. The process is comparable to disulfide reshuffling between the NC1 domains of collagen IV in mammalian basement membranes.
Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram P; Gupta, Deepak K; Singh, Sangeeta; Dogra, Vivek; Gaikwad, Kishor; Sharma, Tilak R; Raje, Ranjeet S; Bandhopadhya, Tapas K; Datta, Subhojit; Singh, Mahendra N; Bashasab, Fakrudin; Kulwal, Pawan; Wanjari, K B; K Varshney, Rajeev; Cook, Douglas R; Singh, Nagendra K
2011-01-20
Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥ 18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea.
2011-01-01
Background Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. Results In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. Conclusion We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea. PMID:21251263
Contrasting Patterns of rDNA Homogenization within the Zygosaccharomyces rouxii Species Complex
Chand Dakal, Tikam; Giudici, Paolo; Solieri, Lisa
2016-01-01
Arrays of repetitive ribosomal DNA (rDNA) sequences are generally expected to evolve as a coherent family, where repeats within such a family are more similar to each other than to orthologs in related species. The continuous homogenization of repeats within individual genomes is a recombination process termed concerted evolution. Here, we investigated the extent and the direction of concerted evolution in 43 yeast strains of the Zygosaccharomyces rouxii species complex (Z. rouxii, Z. sapae, Z. mellis), by analyzing two portions of the 35S rDNA cistron, namely the D1/D2 domains at the 5’ end of the 26S rRNA gene and the segment including the internal transcribed spacers (ITS) 1 and 2 (ITS regions). We demonstrate that intra-genomic rDNA sequence variation is unusually frequent in this clade and that rDNA arrays in single genomes consist of an intermixing of Z. rouxii, Z. sapae and Z. mellis-like sequences, putatively evolved by reticulate evolutionary events that involved repeated hybridization between lineages. The levels and distribution of sequence polymorphisms vary across rDNA repeats in different individuals, reflecting four patterns of rDNA evolution: I) rDNA repeats that are homogeneous within a genome but are chimeras derived from two parental lineages via recombination: Z. rouxii in the ITS region and Z. sapae in the D1/D2 region; II) intra-genomic rDNA repeats that retain polymorphisms only in ITS regions; III) rDNA repeats that vary only in their D1/D2 domains; IV) heterogeneous rDNA arrays that have both polymorphic ITS and D1/D2 regions. We argue that an ongoing process of homogenization following allodiplodization or incomplete lineage sorting gave rise to divergent evolutionary trajectories in different strains, depending upon temporal, structural and functional constraints. We discuss the consequences of these findings for Zygosaccharomyces species delineation and, more in general, for yeast barcoding. PMID:27501051
Larracuente, Amanda M
2014-11-25
Satellite DNA can make up a substantial fraction of eukaryotic genomes and has roles in genome structure and chromosome segregation. The rapid evolution of satellite DNA can contribute to genomic instability and genetic incompatibilities between species. Despite its ubiquity and its contribution to genome evolution, we currently know little about the dynamics of satellite DNA evolution. The Responder (Rsp) satellite DNA family is found in the pericentric heterochromatin of chromosome 2 of Drosophila melanogaster. Rsp is well-known for being the target of Segregation Distorter (SD)- an autosomal meiotic drive system in D. melanogaster. I present an evolutionary genetic analysis of the Rsp family of repeats in D. melanogaster and its closely-related species in the melanogaster group (D. simulans, D. sechellia, D. mauritiana, D. erecta, and D. yakuba) using a combination of available BAC sequences, whole genome shotgun Sanger reads, Illumina short read deep sequencing, and fluorescence in situ hybridization. I show that Rsp repeats have euchromatic locations throughout the D. melanogaster genome, that Rsp arrays show evidence for concerted evolution, and that Rsp repeats exist outside of D. melanogaster, in the melanogaster group. The repeats in these species are considerably diverged at the sequence level compared to D. melanogaster, and have a strikingly different genomic distribution, even between closely-related sister taxa. The genomic organization of the Rsp repeat in the D. melanogaster genome is complex-it exists of large blocks of tandem repeats in the heterochromatin and small blocks of tandem repeats in the euchromatin. My discovery of heterochromatic Rsp-like sequences outside of D. melanogaster suggests that SD evolved after its target satellite and that the evolution of the Rsp satellite family is highly dynamic over a short evolutionary time scale (<240,000 years).
1988-01-01
The primary amino acid sequence of contactin, a neuronal cell surface glycoprotein of 130 kD that is isolated in association with components of the cytoskeleton (Ranscht, B., D. J. Moss, and C. Thomas. 1984. J. Cell Biol. 99:1803-1813), was deduced from the nucleotide sequence of cDNA clones and is reported here. The cDNA sequence contains an open reading frame for a 1,071-amino acid transmembrane protein with 962 extracellular and 89 cytoplasmic amino acids. In its extracellular portion, the polypeptide features six type 1 and two type 2 repeats. The six amino-terminal type 1 repeats (I-VI) each consist of 81-99 amino acids and contain two cysteine residues that are in the right context to form globular domains as described for molecules with immunoglobulin structure. Within the proposed globular region, contactin shares 31% identical amino acids with the neural cell adhesion molecule NCAM. The two type 2 repeats (I-II) are each composed of 100 amino acids and lack cysteine residues. They are 20-31% identical to fibronectin type III repeats. Both the structural similarity of contactin to molecules of the immunoglobulin supergene family, in particular the amino acid sequence resemblance to NCAM, and its relationship to fibronectin indicate that contactin could be involved in some aspect of cellular adhesion. This suggestion is further strengthened by its localization in neuropil containing axon fascicles and synapses. PMID:3049624
The Peculiar Landscape of Repetitive Sequences in the Olive (Olea europaea L.) Genome
Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea
2014-01-01
Analyzing genome structure in different species allows to gain an insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome. PMID:24671744
The peculiar landscape of repetitive sequences in the olive (Olea europaea L.) genome.
Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea
2014-04-01
Analyzing genome structure in different species allows to gain an insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome.
Ni, Lianghong; Zhao, Zhili; Xu, Hongxi; Chen, Shilin; Dorje, Gaawe
2016-02-15
Endemic to the Sino-Himalayan subregion, the medicinal alpine plant Gentiana straminea is a threatened species. The genetic and molecular data about it is deficient. Here we report the complete chloroplast (cp) genome sequence of G. straminea, as the first sequenced member of the family Gentianaceae. The cp genome is 148,991bp in length, including a large single copy (LSC) region of 81,240bp, a small single copy (SSC) region of 17,085bp and a pair of inverted repeats (IRs) of 25,333bp. It contains 112 unique genes, including 78 protein-coding genes, 30 tRNAs and 4 rRNAs. The rps16 gene lacks exon2 between trnK-UUU and trnQ-UUG, which is the first rps16 pseudogene found in the nonparasitic plants of Asterids clade. Sequence analysis revealed the presence of 13 forward repeats, 13 palindrome repeats and 39 simple sequence repeats (SSRs). An entire cp genome comparison study of G. straminea and four other species in Gentianales was carried out. Phylogenetic analyses using maximum likelihood (ML) and maximum parsimony (MP) were performed based on 69 protein-coding genes from 36 species of Asterids. The results strongly supported the position of Gentianaceae as one member of the order Gentianales. The complete chloroplast genome sequence will provide intragenic information for its conservation and contribute to research on the genetic and phylogenetic analyses of Gentianales and Asterids. Copyright © 2015 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jackson, P.J.; Walthers, E.A.; Richmond, K.L.
1997-04-01
PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats aremore » generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.« less